Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

alea-institute
/
kl3m-multi-word-001-64k

Fill-Mask
Transformers
English
tokenizer
legal
bpe
byte-pair-encoding
multi-word
kl3m
legal-domain
hierarchical
Model card Files Files and versions
xet
Community
kl3m-multi-word-001-64k
4.66 MB
  • 1 contributor
History: 5 commits
alea-institute's picture
alea-institute
Upload KL3M multi-word tokenizer (64K) - Update README
6022da7 verified 4 days ago
  • .gitattributes
    1.52 kB
    initial commit 4 days ago
  • README.md
    12.7 kB
    Upload KL3M multi-word tokenizer (64K) - Update README 4 days ago
  • special_tokens_map.json
    189 Bytes
    Upload KL3M multi-word tokenizer (64K) 4 days ago
  • tokenizer.json
    4.64 MB
    Upload KL3M multi-word tokenizer (64K) 4 days ago
  • tokenizer_config.json
    9.36 kB
    Upload KL3M multi-word tokenizer (64K) 4 days ago