Yijie Chen

pppa

pppa2019

AI & ML interests

None yet

Organizations

Recent Activity

upvoted a paper 9 days ago

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published 10 days ago • 114

upvoted a collection 10 days ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 6 items • Updated 3 days ago • 103

liked a dataset 24 days ago

liwu/MNBVC

Updated 23 days ago • 99.8k • 568

upvoted a paper about 2 months ago

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31 • 70

upvoted 2 papers 6 months ago

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5 • 133

Magistral

Paper • 2506.10910 • Published Jun 12 • 66

upvoted a paper 7 months ago

BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

Paper • 2506.07530 • Published Jun 9 • 20

liked a model 9 months ago

zhibinlan/LLaVE-2B

Image-Text-to-Text • 2B • Updated Mar 14 • 45 • 45

upvoted a paper 10 months ago

New Trends for Modern Machine Translation with Large Reasoning Models

Paper • 2503.10351 • Published Mar 13 • 25

liked a model 10 months ago

google/metricx-23-xxl-v2p0

Updated Feb 7, 2024 • 50 • 9

upvoted a paper 10 months ago

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

Paper • 2503.00865 • Published Mar 2 • 64

liked a dataset 10 months ago

HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27 • 4.48B • 59.1k • 707

upvoted a paper 10 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 166

upvoted 3 papers 11 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 429

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 48

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 300

upvoted a paper 12 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 286

updated a collection 12 months ago

Code&Math&Reasoning

Collection

5 items • Updated Jan 3 • 1
