Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
antalvdb
/
lib-tokenizer
like
0
MLZoo/edu-fineweb-10B
English
tokenizer
lib
less-is-better
supra-word
cognitively-inspired
License:
gpl-3.0
Model card
Files
Files and versions
xet
Community
main
lib-tokenizer
Commit History
Retrain with prepend-only space convention
13d9e39
verified
antalvdb
commited on
2 days ago
Upload tokenizer.json with huggingface_hub
ee771eb
verified
antalvdb
commited on
Feb 18
Update README.md
53cc5cc
verified
antalvdb
commited on
Feb 18
Update README.md
048c2df
verified
antalvdb
commited on
Feb 18
Upload README.md with huggingface_hub
811acf2
verified
antalvdb
commited on
Feb 18
Upload tokenizer.json with huggingface_hub
1489718
verified
antalvdb
commited on
Feb 16
initial commit
d2b3ac2
verified
antalvdb
commited on
Feb 16