# safety-testing

6 artikelengetagd met “safety-testing”

Evaluatie-ontwijking bij fine-tuning

Crafting fine-tuned models that pass standard safety evaluations while containing hidden unsafe behaviors that activate under specific conditions.

fine-tuningevaluationevasionsafety-testing

Gevorderd

Lab: inleiding tot veiligheidstesten

Learn the fundamentals of LLM safety testing including test case design, baseline measurement, and result documentation.

labssafety-testingfundamentalsbeginner

Beginner

Lab: testen van de veiligheidsimpact van fine-tuning

Measure how fine-tuning affects model safety by comparing pre and post fine-tuning safety benchmark scores.

labsfine-tuningsafety-testingintermediate

Gemiddeld

Beveiligingsgates in ML-deployment

Implementing security checkpoints in ML deployment pipelines: automated safety testing, performance regression detection, bias evaluation, approval workflows, and designing gates that balance security with deployment velocity.

deployment-gatessafety-testingbias-detectionapproval-workflowsregression-testingml-deployment

Gevorderd

Lab: Gequantiseerde modellen exploiteren

Praktijklab dat slagingskansen van aanvallen vergelijkt over quantisatieniveaus: jailbreaks testen op FP16 vs INT8 vs INT4, veiligheidsdegradatie meten, en quantisatiebewuste exploits ontwerpen.

labquantizationhands-onjailbreaksafety-testingtransformers

Gevorderd

Lokale modellen analyseren en testen met Ollama

Walkthrough for using Ollama to run, analyze, and security-test local LLMs, covering model configuration, safety boundary testing, system prompt extraction, fine-tuning vulnerability assessment, and building a local red team lab.

ollamalocal-modelsmodel-analysissafety-testingred-team-labwalkthrough

Beginner