AI Fundamentals

BPE (Byte Pair Encoding)

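A tokenization algorithm that starts from individual characters (or bytes) and iteratively merges the most frequent adjacent pair of symbols into a single new token, repeating until the vocabulary reaches a target size. BPE is widely used in code models because frequently occurring strings such as programming keywords, operators, and common identifiers end up as single tokens, while rare identifiers still split into known subword pieces.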
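
The merge loop can be sketched in a few lines of Python. This is a minimal illustration, not how any particular tokenizer library implements it: the function names `merge_pair`, `learn_bpe`, and `encode` are hypothetical, the corpus is a toy list of words, and ties between equally frequent pairs are broken by first occurrence, which is an assumption of this sketch.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge rules from a list of words (hypothetical helper)."""
    # Each distinct word is stored as a tuple of symbols together with its frequency.
    vocab = Counter(tuple(word) for word in words)
    merges = []
    for _ in range(num_merges):
        # Count every adjacent symbol pair, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Pick the most frequent pair; ties go to the pair seen first (an assumption).
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Rewrite the vocabulary with the chosen pair merged.
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[merge_pair(symbols, best)] += freq
        vocab = new_vocab
    return merges


def encode(word, merges):
    """Tokenize one word by replaying the learned merges in order."""
    symbols = tuple(word)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)
```

With the toy corpus `["def", "def", "def", "del"]` and two merges, `learn_bpe` returns `[("d", "e"), ("de", "f")]`. After that, `encode("def", merges)` yields `["def"]`, while the rarer `encode("del", merges)` yields `["de", "l"]`: the frequent keyword becomes one token and the less frequent string falls back to smaller pieces.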