# tokenization
4 articles tagged with “tokenization”
Tokenization & Its Security Implications
How BPE and SentencePiece tokenizers work, and how tokenizer behavior creates exploitable attack surfaces including boundary attacks, homoglyphs, and encoding tricks.
tokenization, bpe, security, encoding, intermediate
Tokenization and Its Security Implications
How tokenization works and why it creates security-relevant behaviors in language models.
foundations, tokenization, security, encoding
Tokenization-Based Attacks
How tokenizer behavior creates exploitable gaps between human-readable text and model-internal representations, enabling filter bypass and payload obfuscation.
tokenization, bpe, sentencepiece, filter-bypass, exploit-primitives
Token Smuggling
Exploiting LLM tokenization quirks to smuggle harmful content past safety filters by manipulating how text is split into tokens at the subword level.
jailbreaking, tokenization, token-smuggling, BPE, subword, content-filter-bypass, red-teaming