pinned
Running
1
GlotWeb
🕸
Indexing the presence of low-resource languages on the web.
NLP, Representation Learning, Machine Translation
GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts
Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners
Indexing the presence of low-resource languages on the web.
Identify the language of a sentence with confidence scores
Identify languages in code-switched text
Multilingual LLM Leaderboard via Cross-Lingual Alignment