# content-policy
標記為「content-policy」的 4 篇文章
微調 API 濫用
微調 API 如何被濫用以建立去審查模型、規避內容政策並嘗試訓練資料外洩——可接受使用政策與技術執行之間的落差。
api-abuseuncensored-modelscontent-policydata-exfiltrationfine-tuningacceptable-use
Lab: Content Policy Boundary Mapping
Map the content policy boundaries of LLM applications to identify permitted and restricted topic areas systematically.
labscontent-policymappingbeginner
Setting Up AI Guardrails
Step-by-step walkthrough for implementing AI guardrails: input validation with NVIDIA NeMo Guardrails, prompt injection detection with rebuff, output filtering for PII and sensitive data, and content policy enforcement.
guardrailsnemoinput-validationoutput-filteringpii-detectioncontent-policywalkthrough
Response Boundary Enforcement
Step-by-step walkthrough for keeping LLM responses within defined topic, format, and content boundaries, covering boundary definition, violation detection, response rewriting, and boundary drift monitoring.
response-boundariesoutput-filteringcontent-policyguardrailsdefensewalkthrough