Chujie Zheng's picture

Chujie Zheng

chujiezheng

·

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

upvoted a paper 12 days ago

Soft Adaptive Policy Optimization

authored a paper 12 days ago

Soft Adaptive Policy Optimization

authored a paper 12 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

upvoted a paper 12 days ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 19 days ago • 39

authored 2 papers 12 days ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 19 days ago • 39

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 13 days ago • 89

upvoted a paper 12 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 13 days ago • 89

liked 16 models about 2 months ago

Qwen/Qwen3-VL-32B-Thinking-FP8

Image-Text-to-Text • 33B • Updated 18 days ago • 9.52k • 18

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20 • 35.3k • 90

Qwen/Qwen3-VL-2B-Instruct-FP8

Image-Text-to-Text • 2B • Updated Oct 20 • 45.2k • 29

Qwen/Qwen3-VL-2B-Thinking-FP8

Image-Text-to-Text • 2B • Updated 18 days ago • 1.78k • 20

Qwen/Qwen3-VL-32B-Thinking

Image-Text-to-Text • 33B • Updated Oct 21 • 444k • 69

Qwen/Qwen3-VL-32B-Instruct-FP8

Image-Text-to-Text • 33B • Updated Oct 22 • 163k • 28

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Oct 23 • 532k • 230

Qwen/Qwen3-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Oct 21 • 719k • • 136

Qwen/Qwen3-4B-Thinking-2507-FP8

Text Generation • 4B • Updated Aug 6 • 180k • 42

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6 • 608k • • 488

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • 31B • Updated 18 days ago • 1.29M • • 440

Qwen/Qwen3-VL-235B-A22B-Thinking-FP8

Image-Text-to-Text • 236B • Updated 18 days ago • 9.19k • 24

Qwen/Qwen3-VL-30B-A3B-Thinking-FP8

Image-Text-to-Text • 31B • Updated 18 days ago • 124k • 45

Qwen/Qwen3-VL-235B-A22B-Instruct-FP8

Image-Text-to-Text • 236B • Updated 18 days ago • 316k • 32

Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

Image-Text-to-Text • 31B • Updated 18 days ago • 170k • 91

Qwen/Qwen3-VL-30B-A3B-Thinking

Image-Text-to-Text • 31B • Updated 18 days ago • 56.7k • • 163