Shizun Wang's picture

Shizun Wang

littlepure2333

·

https://littlepure2333.github.io/home

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

Lightricks/LTX-2

upvoted a paper 9 days ago

SpotEdit: Selective Region Editing in Diffusion Transformers

upvoted a paper about 1 month ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

View all activity

Organizations

None yet

upvoted a paper 9 days ago

SpotEdit: Selective Region Editing in Diffusion Transformers

Paper • 2512.22323 • Published 14 days ago • 37

upvoted 2 papers about 1 month ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25, 2025 • 182

Vision Bridge Transformer at Scale

Paper • 2511.23199 • Published Nov 28, 2025 • 45

upvoted a paper about 2 months ago

Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark

Paper • 2511.13853 • Published Nov 17, 2025 • 34

upvoted 2 papers 3 months ago

First Try Matters: Revisiting the Role of Reflection in Reasoning Models

Paper • 2510.08308 • Published Oct 9, 2025 • 24

WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation

Paper • 2510.07313 • Published Oct 8, 2025 • 6

upvoted 2 papers 7 months ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19, 2025 • 130

Test3R: Learning to Reconstruct 3D at Test Time

Paper • 2506.13750 • Published Jun 16, 2025 • 27

upvoted a paper 8 months ago

Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding

Paper • 2505.16990 • Published May 22, 2025 • 22

upvoted a collection 11 months ago

SigLIP2

36 items • Updated Jul 10, 2025 • 104

upvoted a collection about 1 year ago

Cosmos

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 31 items • Updated 4 days ago • 299

upvoted 6 papers over 1 year ago

Heavy Labels Out! Dataset Distillation with Label Space Lightening

Paper • 2408.08201 • Published Aug 15, 2024 • 21

FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention

Paper • 2407.19918 • Published Jul 29, 2024 • 51

Video-Infinity: Distributed Long Video Generation

Paper • 2406.16260 • Published Jun 24, 2024 • 29

GFlow: Recovering 4D World from Monocular Video

Paper • 2405.18426 • Published May 28, 2024 • 17

SwiftAvatar: Efficient Auto-Creation of Parameterized Stylized Character on Arbitrary Avatar Engines

Paper • 2301.08153 • Published Jan 19, 2023 • 1

MindBridge: A Cross-Subject Brain Decoding Framework

Paper • 2404.07850 • Published Apr 11, 2024 • 1