# tokenization
8 articles tagged "tokenization"
Tokenization & Its Security Implications
How BPE and SentencePiece tokenizers work, and how tokenizer behavior creates exploitable attack surfaces including boundary attacks, homoglyphs, and encoding tricks.
Tokenization and Its Security Implications
How tokenization works and why it creates security-relevant behaviors in language models.
Tokenization-Based Attacks
How tokenizer behavior creates exploitable gaps between human-readable text and model-internal representations, enabling filter bypass and payload obfuscation.
Token Smuggling
Exploiting LLM tokenization quirks to smuggle harmful content past safety filters by manipulating how text is split into tokens at the subword level.
Tokenization & Its Security Implications
How BPE and SentencePiece tokenizers work, and how tokenizer behavior creates exploitable attack surfaces including boundary attacks, homoglyphs, and encoding tricks.
Tokenization and Its Security Implications
How tokenization works and why it creates security-relevant behaviors in language models.
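Tokenization-Based Attacks
How tokenizer behavior creates exploitable gaps between human-readable text and model-internal representations, enabling filter bypass and payload obfuscation.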
Token Smuggling
Exploiting LLM tokenization quirks to smuggle harmful content past safety filters by manipulating how text is split into tokens at the subword level.