Melih Özcan

staycoolish

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

SOP: A Scalable Online Post-Training System for Vision-Language-Action Models

upvoted a paper 23 days ago

DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer

upvoted a paper 23 days ago

VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation

View all activity

Organizations

None yet

upvoted a paper 22 days ago

SOP: A Scalable Online Post-Training System for Vision-Language-Action Models

Paper • 2601.03044 • Published 23 days ago • 28

upvoted 3 papers 23 days ago

DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer

Paper • 2601.01425 • Published 25 days ago • 51

VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation

Paper • 2601.02256 • Published 24 days ago • 33

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Paper • 2601.02204 • Published 24 days ago • 61

upvoted 7 papers about 2 months ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 77

Video Generation Models Are Good Latent Reward Models

Paper • 2511.21541 • Published Nov 26, 2025 • 45

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 234

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 90

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 294

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 254

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 151

upvoted 9 papers 4 months ago

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16, 2025 • 91

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Paper • 2509.13312 • Published Sep 16, 2025 • 105

Towards General Agentic Intelligence via Environment Scaling

Paper • 2509.13311 • Published Sep 16, 2025 • 71

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Paper • 2509.13309 • Published Sep 16, 2025 • 67

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Paper • 2509.13313 • Published Sep 16, 2025 • 80

Melih Özcan

AI & ML interests

Recent Activity

Organizations

staycoolish's activity