# content-policy

4 articles tagged "content-policy"

Fine-Tuning API Abuse

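How fine-tuning APIs are abused to build uncensored models, circumvent content policies, and attempt training data exfiltration, and the gap between acceptable use policies and technical enforcement.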

api-abuse, uncensored-models, content-policy, data-exfiltration, fine-tuning, acceptable-use

Intermediate

Lab: Content Policy Boundary Mapping

Map the content policy boundaries of LLM applications to identify permitted and restricted topic areas systematically.

labs, content-policy, mapping, beginner

Beginner

Setting Up AI Guardrails

Step-by-step walkthrough for implementing AI guardrails: input validation with NVIDIA NeMo Guardrails, prompt injection detection with Rebuff, output filtering for PII and sensitive data, and content policy enforcement.

guardrails, nemo, input-validation, output-filtering, pii-detection, content-policy, walkthrough

Intermediate

Response Boundary Enforcement

Step-by-step walkthrough for keeping LLM responses within defined topic, format, and content boundaries, covering boundary definition, violation detection, response rewriting, and boundary drift monitoring.

response-boundaries, output-filtering, content-policy, guardrails, defense, walkthrough

Intermediate