Submitted by Chao Xu 20 An Anatomy of Vision-Language-Action Models: From Modules to Milestones and Challenges IROOTECH TECHNOLOGY 2