# classification

建構 a basic 提示詞注入 detection tool using pattern matching, heuristics, and LLM-based classification to identify malicious inputs before they reach the target model.

labinjection-detectiondefenseclassificationbeginnerhands-on

入門

發現嚴重度分類

標準化 AI 安全發現嚴重度分類框架,包含風險評分方法與業務衝擊評估。

professionalseverityclassificationreporting

中級

提示詞注入分類

提示詞注入攻擊的完整分類框架，涵蓋直接與間接向量、遞送機制、目標層級與嚴重度評估，用於系統化紅隊測試。

prompt-injectiontaxonomyclassificationred-teamingframework

入門

攻擊技術分類法參考

AI 安全攻擊技術的完整分類法,交叉對應 MITRE ATLAS、OWASP LLM Top 10 與自訂分類方案。

referencetaxonomyMITRE-ATLASclassification

中級

AI 漏洞分類系統

依類型、影響與可利用性為 AI 特有漏洞分類的結構化系統。

vulnerabilityclassificationmethodologywalkthroughs

中級

Classifying AI 漏洞 Severity

Framework for consistently classifying the severity of AI and LLM vulnerabilities, with scoring criteria, impact assessment, and examples across common finding categories.

severityclassificationvulnerabilityrisk-assessmentmethodologywalkthrough

中級

將發現對應至 OWASP LLM Top 10

將 AI 紅隊發現對應至 OWASP LLM 應用程式 Top 10 的實作詳解,涵蓋分類指引、報告範本與緩解對應。

owaspllm-top-10classificationstandardsmethodologywalkthrough

中級