# chain-of-thought
18 articles tagged with “chain-of-thought”
## Manipulating Reasoning Chains
Techniques for influencing an AI agent's chain-of-thought reasoning to steer its planning, decision-making, and tool selection toward attacker-desired outcomes.
## Chain-of-Thought Exploitation Techniques
Deep analysis of how reasoning traces in CoT models can be manipulated to produce adversarial outputs while maintaining coherent reasoning.
## Reasoning Model Attacks
An overview of security risks in reasoning-enabled LLMs: the new attack surfaces, exploit primitives, and defensive challenges that chain-of-thought models introduce.
## Reasoning Model Jailbreaks
How reasoning capabilities create novel jailbreak surfaces: chain-of-thought exploitation, scratchpad attacks, and why higher reasoning effort increases attack success.
## Steganographic Reasoning
Hidden communication channels within AI reasoning traces, where models encode information or coordinate behavior through patterns invisible to human overseers, including detection methods and implications for AI safety.
## Unfaithful Chain-of-Thought Reasoning
Analysis of unfaithful chain-of-thought reasoning in language models, where the visible reasoning trace does not accurately reflect the model's actual computational process, including detection methods, implications for oversight, and exploitation techniques.
## Reasoning Model Exploitation
Exploiting extended thinking and chain-of-thought reasoning in o1, Claude, and DeepSeek-R1 models.
## Red Teaming Reasoning Traces
Techniques for analyzing and exploiting visible reasoning traces in chain-of-thought models.
## Injection in Reasoning Models
Research into injection attacks specific to reasoning-augmented models that exploit chain-of-thought processes and self-reflection mechanisms.
## Lab: Reasoning Model Exploitation
Attack reasoning models like o1, o3, and DeepSeek-R1 by exploiting chain-of-thought manipulation, reasoning budget exhaustion, and thought-injection techniques.
## Reasoning Trace Exploitation in CoT Models
Exploit visible chain-of-thought reasoning traces in models like o1 and DeepSeek-R1 to manipulate outputs.
## Reasoning Model Chain-of-Thought Exploitation
Exploit extended thinking and chain-of-thought reasoning in models like o1 and DeepSeek-R1.
## Lab: Chain-of-Thought Exploitation
Exploit chain-of-thought reasoning to leak internal model reasoning, bypass safety filters, and manipulate decision processes.
## Reasoning Model Security Analysis
Security analysis of reasoning-augmented models (o1, DeepSeek-R1) focusing on chain-of-thought manipulation and reasoning-specific attack vectors.
## Reasoning Model Exploitation Walkthrough
Exploit extended thinking in reasoning models to inject false premises and manipulate conclusion generation.
## Thought Injection for Reasoning Models
Techniques for injecting malicious content into chain-of-thought reasoning traces of thinking models, exploiting the gap between reasoning and safety enforcement.
## Thought Injection in Reasoning Models Walkthrough
Inject adversarial thoughts into the reasoning chain of thinking models to manipulate final outputs.
## Chain-of-Thought Hijacking Walkthrough
A walkthrough of hijacking visible reasoning traces in CoT models to redirect conclusions and bypass safety checks.