# llm-judge
5 articles tagged with “llm-judge”
LLM-as-Judge Defense Systems
How LLM-as-judge architectures evaluate other LLM outputs for safety, including sequential and parallel designs, judge prompt engineering, and techniques for attacking judge models.
LLM Judge Manipulation
Crafting responses that exploit LLM-as-judge evaluation patterns to achieve high safety scores while embedding harmful content.
Lab: Building an LLM Judge Evaluator
Hands-on lab for building an LLM-based evaluator to score red team attack outputs, compare model vulnerability, and lay the foundation for automated attack campaigns.
Setting Up Content Filtering
Step-by-step walkthrough for implementing multi-layer content filtering for AI applications: keyword filtering, classifier-based detection, LLM-as-judge evaluation, testing effectiveness, and tuning for production.
LLM Judge Implementation
Step-by-step walkthrough for using an LLM to judge another LLM's outputs for safety and quality, covering judge prompt design, scoring rubrics, calibration, cost optimization, and deployment patterns.