robot
updated
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Paper
• 2503.15558
• Published • 50
Humanoid Policy ~ Human Policy
Paper
• 2503.13441
• Published
RoboFactory: Exploring Embodied Agent Collaboration with Compositional
Constraints
Paper
• 2503.16408
• Published • 42
Dita: Scaling Diffusion Transformer for Generalist
Vision-Language-Action Policy
Paper
• 2503.19757
• Published • 51
Gemini Robotics: Bringing AI into the Physical World
Paper
• 2503.20020
• Published • 31
PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable
Objects from Videos
Paper
• 2503.17973
• Published • 8
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for
Open-Vocabulary Robotic Manipulation
Paper
• 2503.10546
• Published • 3
Being-0: A Humanoid Robotic Agent with Vision-Language Models and
Modular Skills
Paper
• 2503.12533
• Published • 68
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range
Movements and Scenes
Paper
• 2503.13435
• Published • 18
ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via
Residual Learning
Paper
• 2503.21860
• Published • 4
TesserAct: Learning 4D Embodied World Models
Paper
• 2504.20995
• Published • 22
CaRL: Learning Scalable Planning Policies with Simple Rewards
Paper
• 2504.17838
• Published • 4
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement
Learning
Paper
• 2505.02835
• Published • 28
Interactive Post-Training for Vision-Language-Action Models
Paper
• 2505.17016
• Published • 6
ScanBot: Towards Intelligent Surface Scanning in Embodied Robotic
Systems
Paper
• 2505.17295
• Published • 9
BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning
with Vision-Language Models
Paper
• 2506.07961
• Published • 11
EmbodiedGen: Towards a Generative 3D World Engine for Embodied
Intelligence
Paper
• 2506.10600
• Published • 8
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action
Models
Paper
• 2507.23682
• Published • 24
AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of
Visuomotor Policies
Paper
• 2508.08113
• Published • 11
Genie Envisioner: A Unified World Foundation Platform for Robotic
Manipulation
Paper
• 2508.05635
• Published • 73