π©
SentencePiece is a foundational NLP library implementing BPE and unigram language model tokenization. It's language-independent, purely data-driven, and widely used as a core component in LLM training pipelines and neural machine translation systems.