# content-policy

4 articlestagged with “content-policy”

Fine-Tuning API Abuse

How fine-tuning APIs are abused to create uncensored models, circumvent content policies, and attempt training data exfiltration -- the gap between acceptable use policies and technical enforcement.

api-abuseuncensored-modelscontent-policydata-exfiltrationfine-tuningacceptable-use

Intermediate

Lab: Content Policy Boundary Mapping

Map the content policy boundaries of LLM applications to identify permitted and restricted topic areas systematically.

labscontent-policymappingbeginner

Beginner

Setting Up AI Guardrails

Step-by-step walkthrough for implementing AI guardrails: input validation with NVIDIA NeMo Guardrails, prompt injection detection with rebuff, output filtering for PII and sensitive data, and content policy enforcement.

guardrailsnemoinput-validationoutput-filteringpii-detectioncontent-policywalkthrough

Intermediate

Response Boundary Enforcement

Step-by-step walkthrough for keeping LLM responses within defined topic, format, and content boundaries, covering boundary definition, violation detection, response rewriting, and boundary drift monitoring.

response-boundariesoutput-filteringcontent-policyguardrailsdefensewalkthrough

Intermediate