# failure-modes
標記為「failure-modes」的 2 篇文章
Safety Comparison Across Models
Comparing safety across GPT-4, Claude, Gemini, and open-weight models using standardized test suites, failure mode analysis, and defense coverage gap identification.
safety-comparisonbenchmarkingfailure-modescoverage-gapscross-model
跨模型安全比較
以標準化測試套件、失敗模式分析與防禦覆蓋缺口辨識,比較 GPT-4、Claude、Gemini 與開源權重模型之安全。
safety-comparisonbenchmarkingfailure-modescoverage-gapscross-model