# safety-testing
6 articles tagged with “safety-testing”
Evaluation Evasion in Fine-Tuning
Crafting fine-tuned models that pass standard safety evaluations while containing hidden unsafe behaviors that activate under specific conditions.
Lab: Introduction to Safety Testing
Learn the fundamentals of LLM safety testing, including test case design, baseline measurement, and result documentation.
Lab: Testing the Safety Impact of Fine-Tuning
Measure how fine-tuning affects model safety by comparing safety benchmark scores before and after fine-tuning.
Security Gates in ML Deployment
Implementing security checkpoints in ML deployment pipelines: automated safety testing, performance regression detection, bias evaluation, approval workflows, and designing gates that balance security with deployment velocity.
Lab: Exploiting Quantized Models
Hands-on lab comparing attack success rates across quantization levels: testing jailbreaks on FP16 vs. INT8 vs. INT4, measuring safety degradation, and designing quantization-aware exploits.
Analyzing and Testing Local Models with Ollama
Walkthrough for using Ollama to run, analyze, and security-test local LLMs, covering model configuration, safety boundary testing, system prompt extraction, fine-tuning vulnerability assessment, and building a local red team lab.