XiaNanWang98

XiaNanWang

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 3 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 3 days ago • 139

upvoted 2 papers about 2 months ago

PICABench: How Far Are We from Physically Realistic Image Editing?

Paper • 2510.17681 • Published Oct 20 • 62

TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model

Paper • 2510.16449 • Published Oct 18 • 34

upvoted a paper 6 months ago

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Paper • 2505.22651 • Published May 28 • 49

upvoted a paper 9 months ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published Mar 10 • 68

liked 2 models 9 months ago

Wan-AI/Wan2.1-T2V-14B

Text-to-Video • Updated Mar 12 • 29.9k • 1.43k

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11 • 56.6k • 2.87k

upvoted 13 papers 10 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 191

RadEdit: stress-testing biomedical vision models via diffusion image editing

Paper • 2312.12865 • Published Dec 20, 2023 • 5

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

Paper • 2312.13271 • Published Dec 20, 2023 • 6

Mini-GPTs: Efficient Large Language Models through Contextual Pruning

Paper • 2312.12682 • Published Dec 20, 2023 • 10

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

Paper • 2312.12468 • Published Dec 19, 2023 • 11

Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models

Paper • 2312.13763 • Published Dec 21, 2023 • 11

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Paper • 2312.12742 • Published Dec 20, 2023 • 14

TinySAM: Pushing the Envelope for Efficient Segment Anything Model

Paper • 2312.13789 • Published Dec 21, 2023 • 15

Splatter Image: Ultra-Fast Single-View 3D Reconstruction

Paper • 2312.13150 • Published Dec 20, 2023 • 16

InstructVideo: Instructing Video Diffusion Models with Human Feedback

Paper • 2312.12490 • Published Dec 19, 2023 • 18

Generative Multimodal Models are In-Context Learners

Paper • 2312.13286 • Published Dec 20, 2023 • 37

DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation

Paper • 2312.13578 • Published Dec 21, 2023 • 29

DreamTuner: Single Image is Enough for Subject-Driven Generation

Paper • 2312.13691 • Published Dec 21, 2023 • 28