Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model Paper • 2403.07764 • Published Mar 12, 2024 • 1
Stable-Hair v2: Real-World Hair Transfer via Multiple-View Diffusion Model Paper • 2507.07591 • Published Jul 10
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 3 days ago • 135
PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design Paper • 2512.04082 • Published 3 days ago • 10
InstantIR: Blind Image Restoration with Instant Generative Reference Paper • 2410.06551 • Published Oct 9, 2024 • 6
3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Paper • 2502.05761 • Published Feb 9 • 7
Dynamic Pyramid Network for Efficient Multimodal Large Language Model Paper • 2503.20322 • Published Mar 26 • 1
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation Paper • 2506.07977 • Published Jun 9 • 41
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published Aug 14 • 144
WithAnyone: Towards Controllable and ID Consistent Image Generation Paper • 2510.14975 • Published Oct 16 • 84
view post Post 5976 Want to iterate on a Hugging Face Space with an LLM? Now you can easily convert any HF entire repo (Model, Dataset or Space) to a text file and feed it to a language model! multimodalart/repo2txt See translation 🤗 3 3 🚀 1 1 👍 1 1 + Reply
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework Paper • 2504.12395 • Published Apr 16 • 16