Instructions to use FacebookAI/xlm-mlm-enro-1024 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use FacebookAI/xlm-mlm-enro-1024 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="FacebookAI/xlm-mlm-enro-1024")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("FacebookAI/xlm-mlm-enro-1024") model = AutoModelForMaskedLM.from_pretrained("FacebookAI/xlm-mlm-enro-1024") - Notebooks
- Google Colab
- Kaggle
Updates the tokenizer configuration file
#2
by lysandre HF Staff - opened
The tokenizer configuration file is missing/incorrect and therefore leading to unforeseen errors after the migration of the canonical models.
Refer to the following issue for more information: transformers#29050
The current failing code is the following:
from transformers import AutoTokenizer
>>> previous_tokenizer = AutoTokenizer.from_pretrained("xlm-mlm-enro-1024")
>>> current_tokenizer = AutoTokenizer.from_pretrained("FacebookAI/xlm-mlm-enro-1024")
>>> print(previous_tokenizer.model_max_length, current_tokenizer.model_max_length)
512, 512
This is the result after the fix:
from transformers import AutoTokenizer
>>> previous_tokenizer = AutoTokenizer.from_pretrained("xlm-mlm-enro-1024")
>>> current_tokenizer = AutoTokenizer.from_pretrained("FacebookAI/xlm-mlm-enro-1024")
>>> print(previous_tokenizer.model_max_length, current_tokenizer.model_max_length)
512, 512