Collections

Collections including paper arxiv:2504.05520

digital-entities-evolution

nex-agi/agent-sft
Preview • Updated Dec 9, 2025 • 325 • 106

nick007x/arxiv-papers
Viewer • Updated Oct 14, 2025 • 2.55M • 4.63k • 179

LLMDH/other
Viewer • Updated Sep 8, 2025 • 422k • 926

SustcZhangYX/ChatEnv
Viewer • Updated Jul 31, 2025 • 113k • 147 • 1

Difficulty Estimation Math Datasets

We perform difficulty estimation on popular math datasets.

lime-nlp/DeepScaleR_Difficulty
Viewer • Updated Apr 10, 2025 • 5.06M • 367 • 10

lime-nlp/GSM8K_Difficulty
Viewer • Updated Apr 9, 2025 • 1.13M • 44 • 1

lime-nlp/orz_math_difficulty
Viewer • Updated Apr 10, 2025 • 6.18M • 24

lime-nlp/MATH_Difficulty
Viewer • Updated Apr 10, 2025 • 1.61M • 31

Reasoning, Thinking, RL and Test-Time Scaling

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Paper • 2412.18319 • Published Dec 24, 2024 • 39

Token-Budget-Aware LLM Reasoning
Paper • 2412.18547 • Published Dec 24, 2024 • 46

Efficiently Serving LLM Reasoning Programs with Certaindex
Paper • 2412.20993 • Published Dec 30, 2024 • 36

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Paper • 2412.17256 • Published Dec 23, 2024 • 47

Papers from LIME Lab

Safer-Instruct: Aligning Language Models with Automated Preference Data
Paper • 2311.08685 • Published Nov 15, 2023 • 1

CLIMB: A Benchmark of Clinical Bias in Large Language Models
Paper • 2407.05250 • Published Jul 7, 2024 • 2

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
Paper • 2502.14296 • Published Feb 20, 2025 • 45

WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Paper • 2408.15549 • Published Aug 28, 2024 • 2

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Paper • 2504.05118 • Published Apr 7, 2025 • 26

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models
Paper • 2504.04718 • Published Apr 7, 2025 • 43

SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement
Paper • 2504.03561 • Published Apr 4, 2025 • 18

Concept Lancet: Image Editing with Compositional Representation Transplant
Paper • 2504.02828 • Published Apr 3, 2025 • 16

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization
Paper • 2503.10615 • Published Mar 13, 2025 • 17

UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
Paper • 2503.10630 • Published Mar 13, 2025 • 6

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published Mar 12, 2025 • 38

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
Paper • 2503.07536 • Published Mar 10, 2025 • 88

Rho-1: Not All Tokens Are What You Need
Paper • 2404.07965 • Published Apr 11, 2024 • 94

VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper • 2404.10667 • Published Apr 16, 2024 • 24

Instruction-tuned Language Models are Better Knowledge Learners
Paper • 2402.12847 • Published Feb 20, 2024 • 26

DoRA: Weight-Decomposed Low-Rank Adaptation
Paper • 2402.09353 • Published Feb 14, 2024 • 32
