deepseek-ai/DeepSeek-V3.2-Speciale Text Generation β’ 685B β’ Updated Dec 1, 2025 β’ 29.8k β’ 628
view article Article Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained β Whatβs Really Changing in Transformers? Apr 4, 2025 β’ 16
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) Jan 19, 2025 β’ 38
Running on CPU Upgrade 184 LLM Hallucination Leaderboard π 184 View and filter LLM hallucination leaderboard
intfloat/multilingual-e5-large-instruct Feature Extraction β’ 0.6B β’ Updated Jul 10, 2025 β’ 1.36M β’ β’ 593