JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery Paper • 2307.16377 • Published Jul 31, 2023
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Paper • 2405.08748 • Published May 14, 2024 • 23
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model Paper • 2503.19839 • Published Mar 25, 2025