# scoring

標記為「scoring」的 8 篇文章

AI 特定嚴重性評分框架

為 AI 安全事件設計之嚴重性評分框架：模型完整性影響、資料暴露範圍、爆炸半徑分析、可逆性評估與複合評分方法論。

severityscoringrisk-assessmentincident-response

中級

社群挑戰概觀

如何參與月度 AI 紅隊挑戰、賺取分數、分享結果，並與社群一同成長你之技能。

communitychallengesoverviewparticipationscoring

入門

攻擊結果評分框架

開發框架以多重成功標準自動評分攻擊結果。

frameworkresultdevscoringexploit

中級

結果評分系統

設計自動化評分系統評估攻擊成功,包括語意分類器、規則型偵測器與 LLM-as-judge 方法。

exploit-devscoringevaluationmetrics

中級

AI Risk 評量 Methodology

Structured approaches to evaluating AI system risks including identification, scoring frameworks, treatment planning, and templates for conducting comprehensive AI risk assessments.

risk-assessmentmethodologyscoringtemplatesrisk-management

中級

Lab: Vulnerability Scoring Fundamentals

學習漏洞 scoring frameworks adapted for LLM systems including severity, exploitability, and impact assessment.

labsscoring漏洞-ratingbeginner

入門

毒性評分管線

建置 LLM 輸出過濾毒性評分管線的逐步詳解,涵蓋模型選擇、多維評分、閾值校準與即時評分的生產部署。

toxicityscoringoutput-filteringcontent-moderationsafetydefensewalkthrough

中級

PyRIT 自訂評分整合

將自訂評分指標整合至 PyRIT,用於組織特定的紅隊評估標準。

integrationtoolspyritscoringwalkthroughs

中級