# classifier

16 artikelengetagd met “classifier”

Prompt Shields & injectiedetectie

Hoe Azure Prompt Shield en speciale injectiedetectiemodellen werken, hun detectiepatronen op basis van fijngestelde classifiers, en systematische benaderingen om ze te omzeilen.

prompt-shieldinjection-detectionazureclassifierbypassfine-tuning

Expert

Trainen van prompt-injectieclassifiers

Methodologieën voor het trainen en evalueren van ML-classifiers die prompt-injectiepogingen met hoge nauwkeurigheid detecteren.

defenseclassifiertraining

Gevorderd

ML-gebaseerde detectie van prompt-injectie

Het bouwen en trainen van ML-classificatoren voor realtime detectie van prompt-injectie.

defenseinjection-detectionml-basedclassifier

Gevorderd

Training van een custom safety-classifier

Train a custom input safety classifier and then develop payloads that reliably evade it to understand classifier limitations.

classifiercustomadvancedlabtraininglabs

Gevorderd

Inversie van veiligheidsclassifiers

Reverse-engineer safety classifier decision boundaries through systematic probing.

labsclassifierinversionadvanced

Gevorderd

Basis classifier-ontwijking

Evade basic input/output classifiers using paraphrasing, synonym substitution, and formatting tricks.

labsclassifierevasionbeginner

Beginner

Classifier Gauntlet: ontwijking in 10 fases

Bypass 10 progressively harder input classifiers using different evasion techniques at each stage.

labsctfclassifierevasion

Gevorderd

Lab: geavanceerde classifier-ontwijking

Evade advanced input classifiers using adversarial perturbation, semantic paraphrasing, and gradient-free optimization.

labsclassifierevasion-advancedintermediate

Gemiddeld

Lab: ontwijking van ML-classifiers

Develop payloads that evade machine learning-based input classifiers through adversarial text perturbation.

classifierintermediateevasionlablabs

Gemiddeld

Grenzen van de input-classifier in kaart brengen

Systematically probe input classifiers to map their decision boundaries and find bypass regions.

labsclassifierboundary-mappingintermediate

Gemiddeld

Een input safety-classifier bouwen

Build a production-quality input classifier that detects prompt injection attempts.

walkthroughsdefenseclassifierinput-safety

Gemiddeld

Opzetten van een Constitutional classifier

Step-by-step walkthrough for implementing constitutional AI-style classifiers that evaluate LLM outputs against a set of principles, covering principle definition, classifier training, chain-of-thought evaluation, and deployment.

constitutional-aiclassifierprinciplessafetydefensewalkthrough

Gevorderd

Classifier voor uitvoercontent

Step-by-step walkthrough for building a classifier to filter harmful LLM outputs, covering taxonomy definition, multi-label classification, threshold calibration, and deployment as a real-time output gate.

output-filteringclassifiercontent-moderationsafetydefensewalkthrough

Gemiddeld

Training van een prompt-classifier

Step-by-step walkthrough for training a machine learning classifier to detect malicious prompts, covering dataset curation, feature engineering, model selection, training pipeline, evaluation, and deployment as a real-time detection service.

classifiermachine-learningprompt-injectiondetectiontrainingdefensewalkthrough

Gevorderd

ML-gebaseerde systemen voor detectie van prompt injection

Walkthrough for building and deploying ML-based prompt injection detection systems, covering training data collection, feature engineering, model architecture selection, threshold tuning, production deployment, and continuous improvement.

prompt-injectionmachine-learningdetectionclassifierdefensewalkthrough

Gevorderd

Een prompt injection-classifier trainen

Train a custom prompt injection detection classifier using labeled datasets and modern NLP techniques.

walkthroughsdefenseclassifiertraining

Gemiddeld