SentencePiece whitespace-escape character: ▁ (U+2581).
▁
Build from GGUF metadata fields (see GgufModelMetadata.rawFields).
GgufModelMetadata.rawFields
Build from a parsed HuggingFace tokenizer.json root object where model.type == "Unigram".
tokenizer.json
model.type == "Unigram"