Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation Paper • 2508.20470 • Published Aug 28, 2025 • 75
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis Paper • 2503.11509 • Published Mar 14, 2025 • 3
Where do Large Vision-Language Models Look at when Answering Questions? Paper • 2503.13891 • Published Mar 18, 2025 • 8
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait Paper • 2503.12963 • Published Mar 17, 2025 • 7
Why Personalizing Deep Learning-Based Code Completion Tools Matters Paper • 2503.14201 • Published Mar 18, 2025 • 4