Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark Paper • 2510.13759 • Published Oct 15 • 9
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark Paper • 2509.24897 • Published Sep 29 • 46
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding Paper • 2507.15028 • Published Jul 20 • 21
Talk2Event: Grounded Understanding of Dynamic Scenes from Event Cameras Paper • 2507.17664 • Published Jul 23 • 1
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models Paper • 2506.21356 • Published Jun 26 • 22
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning Paper • 2507.05920 • Published Jul 8 • 11
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning Paper • 2506.13654 • Published Jun 16 • 43
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Paper • 2407.12772 • Published Jul 17, 2024 • 35
🔥🔥 Introducing Ola! A state-of-the-art omni-modal understanding model with an advanced progressive modality alignment strategy! Ola ranks #1 on the OpenCompass Leaderboard (<10B).
📜 Paper: https://arxiv.org/abs/2502.04328
🛠️ Code: https://github.com/Ola-Omni/Ola
🚀 We have fully released our video & audio training data and intermediate image & video models at THUdyh/ola-67b8220eb93406ec87aeec37. Try building your own powerful omni-modal model with our data and models!
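If you want to grab the released checkpoints locally, a minimal sketch with huggingface_hub looks like the following. Note that the repo id below is an assumption for illustration — the link above points to a collection, so browse it for the exact model name — and inference itself goes through the scripts in the Ola GitHub repo.

```python
from huggingface_hub import snapshot_download

# Hypothetical repo id — check the THUdyh collection linked above
# for the exact model name before running this.
repo_id = "THUdyh/Ola-7b"

# Download the full checkpoint to the local HF cache; the returned path
# can then be passed to the inference scripts in the Ola GitHub repo.
local_dir = snapshot_download(repo_id=repo_id)
print(f"Model downloaded to: {local_dir}")
```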
Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment Paper • 2502.04328 • Published Feb 6 • 30
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper • 2501.04003 • Published Jan 7 • 27
🚀🚀🚀 Introducing Insight-V! An early attempt towards o1-like multi-modal reasoning. We offer a structured long-chain visual reasoning data generation pipeline and a multi-agent system to unleash the reasoning potential of MLLMs.
📜 Paper: https://arxiv.org/abs/2411.14432
🛠️ GitHub: https://github.com/dongyh20/Insight-V
💼 Model weights: THUdyh/insight-v-673f5e1dd8ab5f2d8d332035
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Paper • 2411.14432 • Published Nov 21, 2024 • 25
🔥🔥🔥 Introducing Oryx-1.5! A series of unified MLLMs with much stronger performance across image, video, and 3D benchmarks 😍
🛠️ GitHub: https://github.com/Oryx-mllm/Oryx
🚀 Models: THUdyh/oryx-15-6718c60763845525c2bba71d
🎨 Demo: THUdyh/Oryx
👋 Try the top-tier MLLM yourself!
👀 Stay tuned for more explorations on MLLMs!
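The hosted demo can also be queried programmatically with the gradio_client library. Since each Space exposes its own endpoint names, this sketch only connects and lists the callable API rather than assuming a specific endpoint.

```python
from gradio_client import Client

# Connect to the hosted Oryx demo Space on Hugging Face.
client = Client("THUdyh/Oryx")

# Print the Space's exposed endpoints and their expected inputs/outputs;
# use these names to construct an actual client.predict(...) call.
client.view_api()
```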