Intern Robotics

Team

non-profit

https://internrobotics.shlab.org.cn

Recent Activity

Axi404

updated a dataset less than a minute ago

InternRobotics/InternData-M1

Updated less than a minute ago • 687k • 10.4k • 25

aliaia-a

updated a dataset about 23 hours ago

InternRobotics/InternData-N1

Updated 17 days ago • 56.5k • 37

gordonhu

authored a paper 10 days ago

G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Paper • 2511.21688 • Published 10 days ago • 8

tenstep

authored a paper about 2 months ago

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

Paper • 2510.13778 • Published Oct 15 • 16

YangZhou24

authored a paper 2 months ago

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Paper • 2509.12201 • Published Sep 15 • 104

wuzhi-hao

authored 2 papers 2 months ago

ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians

Paper • 2406.16815 • Published Jun 24, 2024 • 7

Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image

Paper • 2406.16710 • Published Jun 24, 2024

wpengz

authored a paper 2 months ago

InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts

Paper • 2509.10813 • Published Sep 13 • 30

wuzhi-hao

authored a paper 2 months ago

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Paper • 2509.22281 • Published Sep 26 • 31

Jiangmiao

authored a paper 3 months ago

A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning

Paper • 2509.15937 • Published Sep 19 • 20

Jiangmiao

authored a paper 4 months ago

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Paper • 2508.14879 • Published Aug 20 • 68

ZhaoyangLyu

authored a paper 4 months ago

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Paper • 2508.14879 • Published Aug 20 • 68

mengwei0427

authored a paper 4 months ago

StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling

Paper • 2507.05240 • Published Jul 7 • 47

Jiangmiao

authored 2 papers 5 months ago

Yume: An Interactive World Generation Model

Paper • 2507.17744 • Published Jul 23 • 87

ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting

Paper • 2507.15454 • Published Jul 21 • 7

hanqing94

authored 5 papers 5 months ago

DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation

Paper • 2308.07498 • Published Aug 14, 2023

GRUtopia: Dream General Robots in a City at Scale

Paper • 2407.10943 • Published Jul 15, 2024 • 25

Evolving Symbolic 3D Visual Grounder with Weakly Supervised Reflection

Paper • 2502.01401 • Published Feb 3 • 1

NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance

Paper • 2505.08712 • Published May 13 • 6

StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling

Paper • 2507.05240 • Published Jul 7 • 47