Instructions to use vocab-transformers/distilbert-tokenizer_256k-MLM_500k with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use vocab-transformers/distilbert-tokenizer_256k-MLM_500k with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="vocab-transformers/distilbert-tokenizer_256k-MLM_500k")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("vocab-transformers/distilbert-tokenizer_256k-MLM_500k") model = AutoModelForMaskedLM.from_pretrained("vocab-transformers/distilbert-tokenizer_256k-MLM_500k") - Notebooks
- Google Colab
- Kaggle
- Xet hash:
- d86074461acab8e7517a6609e2b13952cf413da6274941b70f1324a62d844d2b
- Size of remote file:
- 962 MB
- SHA256:
- 40cef46146629bf41cb331c7d9d10e361bf3b2d925f0a8782439bd443a9b46c1
·
Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.