liu

Harold-lkk

AI & ML interests

None yet

Recent Activity

authored a paper 29 days ago

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

authored a paper 29 days ago

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

upvoted a paper 29 days ago

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

View all activity

Organizations

None yet

authored 2 papers 29 days ago

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published 30 days ago • 46

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

Paper • 2512.10534 • Published 30 days ago • 31

upvoted 3 papers 29 days ago

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

Paper • 2512.10534 • Published 30 days ago • 31

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Paper • 2512.10756 • Published 30 days ago • 34

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published 30 days ago • 46

authored a paper 5 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

upvoted a paper 5 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

authored 4 papers 6 months ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

Paper • 2507.16814 • Published Jul 22, 2025 • 21

upvoted 2 papers 6 months ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

Paper • 2507.16814 • Published Jul 22, 2025 • 21

The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner

Paper • 2507.13332 • Published Jul 17, 2025 • 48

liked a dataset 8 months ago

openbmb/Ultra-FineWeb

Viewer • Updated about 1 month ago • 1.29B • 53.8k • 291

upvoted a paper 9 months ago

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published Mar 31, 2025 • 29

upvoted a paper 10 months ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4, 2025 • 19

upvoted a paper 11 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10, 2025 • 58

upvoted a paper 12 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20, 2025 • 109

liked a dataset about 1 year ago

allenai/qasper

Viewer • Updated Oct 7, 2022 • 1.59k • 2.38k • 91

authored a paper about 1 year ago

CIBench: Evaluating Your LLMs with a Code Interpreter Plugin

Paper • 2407.10499 • Published Jul 15, 2024

liu

AI & ML interests

Recent Activity

Organizations

Harold-lkk's activity