# llm-judge

5 artikelengetagd met “llm-judge”

LLM-as-Judge verdedigingssystemen

Hoe LLM-as-judge-architecturen de outputs van andere LLM's evalueren op veiligheid, inclusief sequentiële en parallelle ontwerpen, prompt-engineering voor judges, en technieken om judge-modellen aan te vallen.

llm-judgesafety-evaluationdefense-architectureadversarialjudge-bypass

Expert

Manipulatie van een LLM-judge

Craft responses that exploit LLM-as-judge evaluation patterns to achieve high safety scores while embedding harmful content.

labsllm-judgemanipulationintermediate

Gemiddeld

Lab: een LLM-judge-evaluator bouwen

Hands-on lab for building an LLM-based evaluator to score red team attack outputs, compare model vulnerability, and lay the foundation for automated attack campaigns.

labllm-judgeevaluationautomation

Gemiddeld

Contentfiltering opzetten

Step-by-step walkthrough for implementing multi-layer content filtering for AI applications: keyword filtering, classifier-based detection, LLM-as-judge evaluation, testing effectiveness, and tuning for production.

content-filteringdefenseclassifiersmoderationllm-judgeimplementationwalkthrough

Gemiddeld

Implementatie van een LLM-judge

Step-by-step walkthrough for using an LLM to judge another LLM's outputs for safety and quality, covering judge prompt design, scoring rubrics, calibration, cost optimization, and deployment patterns.

llm-judgeoutput-validationsafetyevaluationdefensewalkthrough

Gevorderd