# model-security

11 artikelengetagd met “model-security”

Incident Response voor de AI-toeleveringsketen

Incident response-procedures voor compromittering in de AI-toeleveringsketen, waaronder modelrepository's, trainingspijplijnen en dependency-ketens.

ai-forensics-irsupply-chainincident-responsemodel-security

Gevorderd

Forensics voor detectie van modelbackdoors

Forensische technieken voor het detecteren, analyseren en toeschrijven van backdoors die via aanvallen tijdens of na de training in machine learning-modellen zijn geïmplanteerd.

ai-forensics-irbackdoor-detectionmodel-securitytrojan-detection

Gevorderd

Backdoordetectie in fijn-afgestemde modellen

Backdoors detecteren in fijn-afgestemde AI-modellen: activatieanalyse, technieken voor triggerscanning, strategieën voor gedragsmatige probing en statistische methoden om verborgen kwaadaardige functionaliteit te identificeren.

backdoordetectionfine-tuningmodel-security

Gevorderd

Capstone: Beveiligingsaudit van een open-source LLM

Voer een uitgebreide beveiligingsaudit uit van een open-source groot taalmodel, met aandacht voor de integriteit van modelgewichten, evaluatie van safety-alignment, verificatie van de toeleveringsketen en het testen van adversarial robuustheid.

capstonesecurity-auditopen-sourcemodel-securitysupply-chain

Expert

Beveiligingsrisico's van cloud AI-modelmarktplaatsen

Assessing security risks in cloud AI model marketplaces including AWS Bedrock Model Garden, Azure AI Model Catalog, GCP Vertex AI Model Garden, and Hugging Face Hub, covering supply chain attacks, trojan models, and verification gaps.

cloud-ai-securitymodel-marketplacesupply-chainmodel-securitytrojan-detection

Gevorderd

Beveiliging van fine-tuning

Comprehensive overview of how fine-tuning can compromise model safety -- attack taxonomy covering dataset poisoning, safety degradation, backdoor insertion, and reward hacking in the era of widely available fine-tuning APIs.

fine-tuningsafetydataset-poisoningbackdoorreward-hackingrlhfloramodel-security

Gemiddeld

Aanvalsoppervlak van LoRA en adapters

Overview of security vulnerabilities in parameter-efficient fine-tuning methods including LoRA, QLoRA, and adapter-based approaches -- how the efficiency and shareability of adapters create novel attack vectors.

loraqloraadapterpeftfine-tuningattack-surfacemodel-security

Gemiddeld

Model Repository Security

Defense-focused guide to securing model downloads from public repositories like Hugging Face, covering backdoored model detection, namespace attacks, signature verification, and safe download procedures.

supply-chainhugging-facemodel-securitybackdoorsignaturesdefense

Gemiddeld

Diepe duiken per model

Why model-specific knowledge matters for AI red teaming, how different architectures create different attack surfaces, and a systematic methodology for profiling any new model.

model-securityred-teamingattack-surfacemethodologyarchitecture

Gemiddeld

Beveiliging van open-weight-modellen

Security analysis of open-weight models including Llama, Mistral, Qwen, and DeepSeek, covering unique risks from full weight access, fine-tuning attacks, and deployment security challenges.

open-weightllamamistralqwendeepseekmodel-securityfine-tuning

Gemiddeld

Beveiliging van trainingscheckpoints

Dreigingsanalyse van de opslag, serialisatie en herstel van modelcheckpoints, waaronder checkpointvergiftiging, deserialisatie-aanvallen en integriteitsverificatie.

training-pipelinecheckpointsmodel-securitysupply-chain

Gemiddeld