view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 2 days ago • 36
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 5 days ago • 223
AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models Paper • 2511.13029 • Published 19 days ago • 1
view article Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases about 1 month ago • 52
view article Article Classement compar:IA : des votes des utilisateurs au classement participatif des modèles Nov 3 • 6
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 34
view article Article BigCodeArena: Judging code generations end to end with code executions Oct 7 • 18
view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 118