Ge Yi

GY2233

ggeeyyi

AI & ML interests

Efficient ML & Reasoning

Recent Activity

upvoted a paper 18 days ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published 27 days ago • 104

updated a model about 2 months ago

nics-efc/R2R_router_collections

Text Classification • Updated Oct 30 • 1

updated a collection about 2 months ago

R2R

Collection

Collections for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing" • 5 items • Updated Oct 16 • 2

published a model about 2 months ago

nics-efc/R2R_router_collections

Text Classification • Updated Oct 30 • 1

upvoted a paper about 2 months ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3 • 97

authored 2 papers about 2 months ago

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Paper • 2505.21600 • Published May 27 • 71

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13 • 176

upvoted a paper about 2 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13 • 176

updated a dataset 2 months ago

GY2233/ChartQA

Viewer • Updated Oct 2 • 32.7k • 50

published a dataset 2 months ago

GY2233/ChartQA

Viewer • Updated Oct 2 • 32.7k • 50

upvoted a paper 2 months ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

updated a model 3 months ago

GY2233/Qwen2.5-32B-Instruct-NVFP4A16

Text Generation • 19B • Updated Sep 16 • 66

published a model 3 months ago

GY2233/Qwen2.5-32B-Instruct-NVFP4A16

Text Generation • 19B • Updated Sep 16 • 66

updated a model 3 months ago

GY2233/Qwen2.5-14B-Instruct-NVFP4A16

Text Generation • 9B • Updated Sep 16 • 12

published a model 3 months ago

GY2233/Qwen2.5-14B-Instruct-NVFP4A16

Text Generation • 9B • Updated Sep 16 • 12

updated a model 3 months ago

GY2233/Qwen2.5-7B-Instruct-NVFP4A16

Text Generation • 5B • Updated Sep 16 • 14

published a model 3 months ago

GY2233/Qwen2.5-7B-Instruct-NVFP4A16

Text Generation • 5B • Updated Sep 16 • 14

updated a model 3 months ago

GY2233/Qwen2.5-3B-Instruct-NVFP4A16

Text Generation • 2B • Updated Sep 16 • 57

published a model 3 months ago

GY2233/Qwen2.5-3B-Instruct-NVFP4A16

Text Generation • 2B • Updated Sep 16 • 57

updated a model 3 months ago

GY2233/Qwen2.5-32B-NVFP4A16

Text Generation • 19B • Updated Sep 16 • 6
