- Can Large Language Models Understand Context?
  Paper • 2402.00858 • Published • 23
- OLMo: Accelerating the Science of Language Models
  Paper • 2402.00838 • Published • 85
- Self-Rewarding Language Models
  Paper • 2401.10020 • Published • 151
- SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
  Paper • 2401.17072 • Published • 25

Collections
Discover the best community collections!
Collections including paper arxiv:2502.13923
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 429
- Qwen2.5-VL Technical Report
  Paper • 2502.13923 • Published • 211
- Qwen3 Technical Report
  Paper • 2505.09388 • Published • 317
- Qwen-Image Technical Report
  Paper • 2508.02324 • Published • 263

- LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update
  Paper • 2106.13914 • Published • 1
- HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges
  Paper • 2506.15196 • Published • 3
- Ascend HiFloat8 Format for Deep Learning
  Paper • 2409.16626 • Published • 1
- Recipes for Pre-training LLMs with MXFP8
  Paper • 2506.08027 • Published • 1

- Qwen2.5 VL 32B Instruct Demo
  🏃 158 • Interact with Qwen2.5-VL-32B-Instruct for text and image/video responses
- Qwen2.5-VL Technical Report
  Paper • 2502.13923 • Published • 211
- Qwen/Qwen2.5-VL-32B-Instruct
  Image-Text-to-Text • 33B • Updated • 258k • 469
- Qwen/Qwen2.5-VL-72B-Instruct
  Image-Text-to-Text • 73B • Updated • 135k • 569

- R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization
  Paper • 2503.10615 • Published • 17
- UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
  Paper • 2503.10630 • Published • 6
- Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
  Paper • 2503.09516 • Published • 36
- LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
  Paper • 2503.07536 • Published • 88