Ougrid Dumdang

Ougrid-D

ougrid

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models

upvoted a paper 3 days ago

LongVideoAgent: Multi-Agent Reasoning with Long Videos

upvoted an article 4 days ago

The Optimal Architecture for Small Language Models

View all activity

Organizations

upvoted 2 papers 3 days ago

Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models

Paper • 2512.21337 • Published 7 days ago • 26

LongVideoAgent: Multi-Agent Reasoning with Long Videos

Paper • 2512.20618 • Published 8 days ago • 52

upvoted an article 4 days ago

Article

The Optimal Architecture for Small Language Models

6 days ago

•

upvoted an article 20 days ago

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

23 days ago

•

upvoted an article 27 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

28 days ago

•

550

upvoted an article about 2 months ago

Article

Generative AI for Recommendation Systems: A Guide to Tokenizing User Interaction Data

Mar 26, 2025

•

upvoted 2 papers 2 months ago

ARGenSeg: Image Segmentation with Autoregressive Image Generation Model

Paper • 2510.20803 • Published Oct 23, 2025 • 9

Unified Reinforcement and Imitation Learning for Vision-Language Models

Paper • 2510.19307 • Published Oct 22, 2025 • 30

upvoted a paper 3 months ago

RAG-Anything: All-in-One RAG Framework

Paper • 2510.12323 • Published Oct 14, 2025 • 53

upvoted a paper 4 months ago

LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence

Paper • 2509.12203 • Published Sep 15, 2025 • 19

liked a model 4 months ago

loolootech/no-name-ner-th

Token Classification • Updated Aug 20, 2025 • 10 • 5

upvoted 2 papers 4 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 248

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

liked a Space 4 months ago

Chat with Kimi-VL-A3B-Thinking-2506

🤔

180

Chat with images, videos, or PDFs to generate text

upvoted a paper 4 months ago

A Survey on Diffusion Language Models

Paper • 2508.10875 • Published Aug 14, 2025 • 34

upvoted a paper 5 months ago

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

Paper • 2508.05954 • Published Aug 8, 2025 • 6

liked 2 models 5 months ago

kpsss34/Stable-Diffusion-3.5-Small-Preview1

Text-to-Image • Updated Aug 13, 2025 • 178 • 40

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6, 2025 • 480k • • 505

upvoted an article 5 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5, 2025

•

508

liked a model 5 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 250k • • 2.31k

Ougrid Dumdang

AI & ML interests

Recent Activity

Organizations

Ougrid-D's activity

The Optimal Architecture for Small Language Models

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

We Got Claude to Fine-Tune an Open Source LLM

Generative AI for Recommendation Systems: A Guide to Tokenizing User Interaction Data

Chat with Kimi-VL-A3B-Thinking-2506

Welcome GPT OSS, the new open-source model family from OpenAI!