# content-policy

4 artikelengetagd met “content-policy”

Misbruik van de fine-tuning-API

How fine-tuning APIs are abused to create uncensored models, circumvent content policies, and attempt training data exfiltration -- the gap between acceptable use policies and technical enforcement.

api-abuseuncensored-modelscontent-policydata-exfiltrationfine-tuningacceptable-use

Gemiddeld

Lab: grenzen van het contentbeleid in kaart brengen

Map the content policy boundaries of LLM applications to identify permitted and restricted topic areas systematically.

labscontent-policymappingbeginner

Beginner

AI-guardrails opzetten

Step-by-step walkthrough for implementing AI guardrails: input validation with NVIDIA NeMo Guardrails, prompt injection detection with rebuff, output filtering for PII and sensitive data, and content policy enforcement.

guardrailsnemoinput-validationoutput-filteringpii-detectioncontent-policywalkthrough

Gemiddeld

Afdwingen van responsgrenzen

Stapsgewijze walkthrough om LLM-responses binnen gedefinieerde onderwerp-, formaat- en contentgrenzen te houden, met grensdefinitie, detectie van overtredingen, het herschrijven van responses en het monitoren van grensafwijking.

response-boundariesoutput-filteringcontent-policyguardrailsdefensewalkthrough

Gemiddeld