Output Format Manipulation (攻擊 導覽)
Forcing specific output formats to bypass LLM safety checks by exploiting the tension between format compliance and content restriction.
jailbreakingoutput-formatstructured-outputformat-manipulationsafety-bypassred-teaming