Mingyang Song's picture

In a Training Loop 🔄

9 9 19

Mingyang Song

Nickyang

·

nick7nlp

AI & ML interests

LRMs, Long-Context LLMs, LLM Judges, Many-Shot ICL

Recent Activity

upvoted a collection about 1 month ago

updated a model 3 months ago

Nickyang/ConciseR-Zero-7B-Preview

liked a Space 3 months ago

tencent/Hunyuan-MT-7B

View all activity

Organizations

None yet

upvoted a collection about 1 month ago

DeepSeek-R1

10 items • Updated 30 days ago • 825

updated a model 3 months ago

Nickyang/ConciseR-Zero-7B-Preview

Text Generation • Updated Sep 25 • 36 • 1

liked a Space 3 months ago

Hunyuan MT 7B

New activity in tencent/Hunyuan-MT-7B 3 months ago

Update README.md

#16 opened 3 months ago by

New activity in tencent/Hunyuan-MT-Chimera-7B 3 months ago

Update README.md

#10 opened 3 months ago by

New activity in tencent/Hunyuan-MT-7B 3 months ago

Update README.md

#15 opened 3 months ago by

upvoted a paper 4 months ago

Hunyuan-MT Technical Report

Paper • 2509.05209 • Published Sep 5 • 14

commented a paper 4 months ago

Hunyuan-MT Technical Report

Paper • 2509.05209 • Published Sep 5 • 14 •

New activity in tencent/Hunyuan-MT-Chimera-7B 4 months ago

Update README.md

#8 opened 4 months ago by

New activity in tencent/Hunyuan-MT-7B 4 months ago

Update README.md

#13 opened 4 months ago by

upvoted a collection 4 months ago

Hunyuan-MT

4 items • Updated 2 days ago • 38

liked 4 models 4 months ago

tencent/Hunyuan-MT-Chimera-7B-fp8

Translation • 8B • Updated Sep 2 • 11.7k • 21

tencent/Hunyuan-MT-7B-fp8

Translation • 8B • Updated Sep 2 • 2.02k • 30

tencent/Hunyuan-MT-Chimera-7B

Translation • 8B • Updated Sep 9 • 2.06k • 86

tencent/Hunyuan-MT-7B

Translation • 8B • Updated Sep 18 • 13.4k • 711

updated a model 7 months ago

Nickyang/ConciseR-Zero-7B

Text Generation • Updated Jun 6 • 14 • 1

upvoted a paper 7 months ago

Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!

Paper • 2406.11629 • Published Jun 17, 2024 • 1

liked a model 7 months ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Text Generation • 8B • Updated May 29 • 486k • • 1k

updated 2 collections 7 months ago

FastCuRL

The collection for the Paper "Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient Training R1-like Reasoning Models" • 6 items • Updated May 29 • 3

ConciseR

The collection for the Paper "Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning" • 5 items • Updated Jun 4 • 2