14 21 19

yiyexy

yiyexy

AI & ML interests

None yet

Recent Activity

updated a dataset 1 day ago

lmms-lab-encoder/JumpScore

published a dataset 1 day ago

lmms-lab-encoder/JumpScore

liked a model 18 days ago

deepseek-ai/DeepSeek-V4-Pro

View all activity

Organizations

updated a dataset 1 day ago

lmms-lab-encoder/JumpScore

Viewer • Updated 1 day ago • 189 • 40

published a dataset 1 day ago

lmms-lab-encoder/JumpScore

Viewer • Updated 1 day ago • 189 • 40

liked a model 18 days ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 6 days ago • 2.02M • • 3.88k

upvoted a paper about 1 month ago

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Paper • 2604.04901 • Published Apr 6 • 40

upvoted a paper about 2 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

upvoted an article 2 months ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

sensenova

•

Mar 5

• 159

upvoted a paper 2 months ago

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Paper • 2603.03241 • Published Mar 3 • 87

upvoted a paper 3 months ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published Feb 9 • 52

commented a paper 3 months ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published Feb 9 • 52 •

submitted a paper to Daily Papers 3 months ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published Feb 9 • 52

updated 2 models 3 months ago

lmms-lab/LLaVA-OneVision-1.5-4B-Instruct

Image-Text-to-Text • 5B • Updated Feb 6 • 6.36k • 18

lmms-lab-encoder/onevision-encoder-large

0.3B • Updated Feb 5 • 1.86k • 14

upvoted a paper 4 months ago

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

Paper • 2601.10305 • Published Jan 15 • 36

liked a model 4 months ago

lmms-lab-encoder/onevision-encoder-large

0.3B • Updated Feb 5 • 1.86k • 14

liked a model 5 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • 2B • Updated Sep 25, 2024 • 12.2M • • 692

upvoted a paper 5 months ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25, 2025 • 188

upvoted a paper 6 months ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 96

updated a dataset 6 months ago

mvp-lab/LLaVA-OneVision-1.5-Instruct-Data

Viewer • Updated Nov 21, 2025 • 21.9M • 93.6k • 71

upvoted a paper 6 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 132

New activity in mvp-lab/LLaVA-OneVision-1.5-Instruct-Data 7 months ago

SFT is fully uploaded and available. If you encounter missing images or broken links, please report them — we will fix issues on a rolling basis.

👍 3

#10 opened 7 months ago by

xiangan

yiyexy

AI & ML interests

Recent Activity

Organizations

yiyexy's activity

NEO-unify: Building Native Multimodal Unified Models End to End

SFT is fully uploaded and available. If you encounter missing images or broken links, please report them — we will fix issues on a rolling basis.