Teahose.

arXiv Physical AI

PAPERS 127 · LATEST JUN 17, 2026 · HOST BHAWNA PALIWAL, JITENDRA MALIK, ET AL. (ARXIV PHYSICAL AI)
01
JUN 17, 2026
Do as I Do: Dexterous Manipulation Data from Everyday Human Videos
BHAWNA PALIWAL, JITENDRA MALIK, ET AL. (ARXIV PHYSICAL AI)
02
JUN 17, 2026
TactSpace: Learning a Physics-enriched Shared Latent Space for Tactile Sim-to-Real Transfer
ARUNIM JOARDER, MARCO HUTTER, ET AL. (ARXIV PHYSICAL AI)
03
JUN 15, 2026
Kairos: A Native World Model Stack for Physical AI
KAIROS TEAM, XIAOGANG WANG, ET AL. (ARXIV PHYSICAL AI)
04
JUN 15, 2026
T-Rex: Tactile-Reactive Dexterous Manipulation
DANTONG NIU, TREVOR DARRELL, ET AL. (ARXIV PHYSICAL AI)
05
JUN 15, 2026
What Matters in Orchestrating Robot Policies: A Systematic Study of Hierarchical VLA Agents
JIAHENG HU, ANNIE XIE, ET AL. (ARXIV PHYSICAL AI)
06
JUN 12, 2026
Hy-Embodied-0.5-VLA: From Vision-Language-Action Models to a Real-World Robot Learning Stack
HE ZHANG, ZHENGYOU ZHANG, ET AL. (ARXIV PHYSICAL AI)
07
JUN 11, 2026
Improving Robotic Generalist Policies via Flow Reversal Steering
ANDY TANG, SERGEY LEVINE, ET AL. (ARXIV PHYSICAL AI)
08
JUN 11, 2026
Mana: Dexterous Manipulation of Articulated Tools
ZHAO-HENG YIN, C. KAREN LIU, ET AL. (ARXIV PHYSICAL AI)
09
JUN 11, 2026
WEAVER, Better, Faster, Longer: An Effective World Model for Robotic Manipulation
ARNAV KUMAR JAIN, ANDREA BAJCSY, ET AL. (ARXIV PHYSICAL AI)
10
JUN 11, 2026
WT-UMI: Tactile-based Whole-Body Manipulation via Force-Supervised Contact-Aware Planning
JAEHWI JANG, YE ZHAO, ET AL. (ARXIV PHYSICAL AI)
11
JUN 10, 2026
CHORUS: Decentralized Multi-Embodiment Collaboration with One VLA Policy
RIA DOSHI, JEANNETTE BOHG, ET AL. (ARXIV PHYSICAL AI)
12
JUN 10, 2026
FACTR 2: Learning External Force Sensing for Commodity Robot Arms Improves Policy Learning
STEVEN OH, DEEPAK PATHAK, ET AL. (ARXIV PHYSICAL AI)
13
JUN 9, 2026
IMPACT: Learning Internal-Model Predictive Control for Forceful Robotic Manipulation
JIAWEI GAO, YILUN DU, ET AL. (ARXIV PHYSICAL AI)
14
JUN 9, 2026
TacForeSight: Force-Guided Tactile World Model for Contact-Rich Manipulation
YUJIE ZANG, WENCHAO DING, ET AL. (ARXIV PHYSICAL AI)
15
JUN 8, 2026
MotionWAM: Towards Foundation World Action Models for Real-Time Humanoid Loco-Manipulation
JIA ZHENG, JUNWEI LIANG, ET AL. (ARXIV PHYSICAL AI)
16
JUN 7, 2026
OASIS: From Simulation Data Collection to Real-World Humanoid Loco-Manipulation
ZEHAO YU, XUELONG LI, ET AL. (ARXIV PHYSICAL AI)
17
JUN 5, 2026
LARA: Latent Action Representation Alignment for Vision-Language-Action Models
MENGYA LIU, SIYUAN HUANG, ET AL. (ARXIV PHYSICAL AI)
18
JUN 5, 2026
Shield-Loco: Shielding Locomotion Policies with Predictive Safety Filtering
ADITYA SHIRWATKAR, MAJID KHADIV, ET AL. (ARXIV PHYSICAL AI)
19
JUN 4, 2026
OneVLA: A Unified Framework for Embodied Tasks
LINGFENG ZHANG, WENBO DING, ET AL. (ARXIV PHYSICAL AI)
20
JUN 4, 2026
ProGAL-VLA: Grounded Alignment through Prospective Reasoning in Vision-Language-Action Models
NASTARAN DARABI, A. TRIVEDI
21
JUN 3, 2026
FlowPRO: Reward-Free Reinforced Fine-Tuning of Flow-Matching VLAs via Proximalized Preference Optimization
YIHAO WU, ZHENGYOU ZHANG, ET AL. (ARXIV PHYSICAL AI)
22
JUN 3, 2026
GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors
TIANYI XIE, YE YUAN, ET AL. (ARXIV PHYSICAL AI)
23
JUN 2, 2026
ElegantVLA: Learning When to Think for Efficient Vision-Language-Action Models
YE LI, ZHI WANG, ET AL. (ARXIV PHYSICAL AI)
24
JUN 2, 2026
Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking
ZEKUN QI, LI YI, ET AL. (ARXIV PHYSICAL AI)
25
JUN 2, 2026
NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation
NVIDIA, ZIAN WANG, ET AL. (ARXIV PHYSICAL AI)
26
JUN 2, 2026
RDGen: Demonstration Generation for High-Quality Robot Learning via Reinforcement Learning
ZIJIA ZHU, XINHAI SUN, ET AL. (ARXIV PHYSICAL AI)
27
JUN 1, 2026
Colosseum V2: Benchmarking Generalization for Vision Language Action Models
JEREMY MORGAN, ISHIKA SINGH, ET AL. (ARXIV PHYSICAL AI)
28
JUN 1, 2026
VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models
SHENGYUN SI, YU-GANG JIANG, ET AL. (ARXIV PHYSICAL AI)
29
MAY 29, 2026
How to Instruct Your Robot: Dense Language Annotations Power Robot Policy Learning
BOSUNG KIM, PRITHVIRAJ AMMANABROLU, ET AL. (ARXIV PHYSICAL AI)
30
MAY 28, 2026
BORA: Bridging Offline Reinforcement Learning and Online Residual Adaptation for Real-World Dexterous VLA Models
ZHONGXI CHEN, WENZHAO LIAN, ET AL. (ARXIV PHYSICAL AI)
31
MAY 28, 2026
EXPO-FT: Sample-Efficient Reinforcement Learning Finetuning for Vision-Language-Action Models
PERRY DONG, CHELSEA FINN, ET AL. (ARXIV PHYSICAL AI)
32
MAY 28, 2026
FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies
XINTONG HU, TAO YU, ET AL. (ARXIV PHYSICAL AI)
33
MAY 28, 2026
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments
QIUYUE WANG, XIONGHUI CHEN, ET AL. (ARXIV PHYSICAL AI)
34
MAY 28, 2026
VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies
MINGJIAN GAO, YUETING ZHUANG, ET AL. (ARXIV PHYSICAL AI)
35
MAY 27, 2026
Beyond Binary: Sim-to-Real Dexterous Manipulation with Physics-Grounded Contact Representation
JIAHE PAN, TORU LIN, ET AL. (ARXIV PHYSICAL AI)
36
MAY 27, 2026
Cybo-Waiter: A Physical Agentic Framework for Humanoid Whole-Body Locomotion-Manipulation
PENGHUA REN, KAIYANG CHEN, ET AL. (ARXIV PHYSICAL AI)
37
MAY 27, 2026
Humanoid Everyday: A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation
ZHENYU ZHAO, YUE WANG, ET AL. (ARXIV PHYSICAL AI)
38
MAY 26, 2026
RIO: Flexible Real-Time Robot I/O for Cross-Embodiment Robot Learning
PABLO ORTEGA-KRAL, JEAN OH, ET AL. (ARXIV PHYSICAL AI)
39
MAY 22, 2026
GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization
XIAOSONG JIA, YU-GANG JIANG, ET AL. (ARXIV PHYSICAL AI)
40
MAY 22, 2026
Towards Long-horizon Embodied Agents with Tool-Aligned Vision-Language-Action Models
ZIXING LEI, SIHENG CHEN, ET AL. (ARXIV PHYSICAL AI)
41
MAY 21, 2026
Factored Diffusion Policies: Compositionally Generalized Robot Control with a Single Score Network
SAYAN MITRA, ABHISHEK PAI, ET AL. (ARXIV PHYSICAL AI)
42
MAY 21, 2026
Imagine2Real: Towards Zero-shot Humanoid-Object Interaction via Video Generative Priors
JIAHE CHEN, JINGBO WANG, ET AL. (ARXIV PHYSICAL AI)
43
MAY 21, 2026
Judge, Then Drive: A Critic-Centric Vision Language Action Framework for Autonomous Driving
LIJIN YANG, HAO YANG, ET AL. (ARXIV PHYSICAL AI)
44
MAY 21, 2026
PokeVLA: Empowering Pocket-Sized Vision-Language-Action Model with Comprehensive World Knowledge Guidance
YUPENG ZHENG, WENCHAO DING, ET AL. (ARXIV PHYSICAL AI)
45
MAY 21, 2026
Superhuman Safe and Agile Racing through Multi-Agent Reinforcement Learning
ISMAIL GELES, DAVIDE SCARAMUZZA, ET AL. (ARXIV PHYSICAL AI)
46
MAY 20, 2026
PointACT: Vision-Language-Action Models with Multi-Scale Point-Action Interaction
SHIZHE CHEN, PAUL PACAUD, CORDELIA SCHMID
47
MAY 18, 2026
MolmoAct2: Action Reasoning Models for Real-world Deployment
HAOQUAN FANG, RANJAY KRISHNA, ET AL. (ARXIV PHYSICAL AI)
48
MAY 18, 2026
StableVLA: Towards Robust Vision-Language-Action Models without Extra Data
YIYANG FU, DAQUAN ZHOU, ET AL. (ARXIV PHYSICAL AI)
49
MAY 18, 2026
TacSE3: Equivariant SE(3) Motion Estimation from Low-Texture Visuotactile Images for In-Gripper Tracking and Compensation
ZHONGYUAN LIAO, MICHAEL YU WANG, ET AL. (ARXIV PHYSICAL AI)
50
MAY 17, 2026
Do World Action Models Generalize Better than VLAs? A Robustness Study
ZHANGUANG ZHANG, YINGXUE ZHANG, ET AL. (ARXIV PHYSICAL AI)
51
MAY 17, 2026
VLA-ATTC: Adaptive Test-Time Compute for VLA Models with Relative Action Critic Model
WENHAO LI, CHANG XU, ET AL. (ARXIV PHYSICAL AI)
52
MAY 15, 2026
Offline Semantic Guidance for Efficient Vision-Language-Action Policy Distillation
JIN SHI, BRADY ZHANG, YISHUN LU
53
MAY 15, 2026
RLDX-1 Technical Report
DONGYOUNG KIM, JINWOO SHIN, ET AL. (ARXIV PHYSICAL AI)
54
MAY 14, 2026
CoCo-InEKF: State Estimation with Learned Contact Covariances in Dynamic, Contact-Rich Scenarios
MICHAEL BAUMGARTNER, MORITZ BÄCHER, ET AL. (ARXIV PHYSICAL AI)
55
MAY 13, 2026
Realtime-VLA FLASH: Speculative Inference Framework for Diffusion-based VLAs
JIAHUI NIU, HUAWEI LI, ET AL. (ARXIV PHYSICAL AI)
56
MAY 13, 2026
What to Ignore, What to React: Visually Robust RL Fine-Tuning of VLA Models
YUANFANG PENG, RUI WANG, ET AL. (ARXIV PHYSICAL AI)
57
MAY 5, 2026
TiPToP: A Modular Open-Vocabulary Planning System for Robotic Manipulation
WILLIAM SHEN, TOMÁS LOZANO-PÉREZ, ET AL. (ARXIV PHYSICAL AI)
58
MAY 4, 2026
ΔVLA: Prior-Guided Vision-Language-Action Models via World Knowledge Variation
YIJIE ZHU, ZITONG YU, ET AL. (ARXIV PHYSICAL AI)
59
MAY 1, 2026
Learning while Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies
YI WANG, JIANLAN LUO, ET AL. (ARXIV PHYSICAL AI)
60
APR 30, 2026
DOT-Sim: Differentiable Optical Tactile Simulation with Precise Real-to-Sim Physical Calibration
YANG YOU, LEONIDAS GUIBAS, ET AL. (ARXIV PHYSICAL AI)
61
APR 30, 2026
FlexiTac: A Low-Cost, Open-Source, Scalable Tactile Sensing Solution for Robotic Systems
BINGHAO HUANG, YUNZHU LI
62
APR 30, 2026
LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models
HAO CHEN, PHENG-ANN HENG, ET AL. (ARXIV PHYSICAL AI)
63
APR 30, 2026
MotuBrain: An Advanced World Action Model for Robot Control
MOTUBRAIN TEAM, JUN ZHU, ET AL. (ARXIV PHYSICAL AI)
64
APR 30, 2026
RopeDreamer: A Kinematic Recurrent State Space Model for Dynamics of Flexible Deformable Linear Objects
TIM MISSAL, PAULA DORNHOFER PARO COSTA, ET AL. (ARXIV PHYSICAL AI)
65
APR 29, 2026
Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising
JUN GUO, HUAPING LIU, ET AL. (ARXIV PHYSICAL AI)
66
APR 24, 2026
RedVLA: Physical Red Teaming for Vision-Language-Action Models
YUHAO ZHANG, JIAMING JI, ET AL. (ARXIV PHYSICAL AI)
67
APR 24, 2026
dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model
YAXUAN LI, YICHEN ZHU, ET AL. (ARXIV PHYSICAL AI)
68
APR 22, 2026
Cortex 2.0: Grounding World Models in Real-World Industrial Deployment
ADRIANA AIDA, PAVAN UPPUTURI, ET AL. (ARXIV PHYSICAL AI)
69
APR 21, 2026
EmbodiedMidtrain: Bridging the Gap between Vision-Language Models and Vision-Language-Action Models via Mid-training
YIYANG DU, CHENYAN XIONG, ET AL. (ARXIV PHYSICAL AI)
70
APR 21, 2026
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling
BOYU CHEN, YIXIAO GE, ET AL. (ARXIV PHYSICAL AI)
71
APR 20, 2026
AnoleVLA: Lightweight Vision-Language-Action Model with Deep State Space Models for Mobile Manipulation
YUSUKE TAKAGI, KOMEI SUGIURA, ET AL. (ARXIV PHYSICAL AI)
72
APR 19, 2026
FLASH: Fast Learning via GPU-Accelerated Simulation for High-Fidelity Deformable Manipulation in Minutes
SIYUAN LUO, FAN SHI, ET AL. (ARXIV PHYSICAL AI)
73
APR 19, 2026
Novel Algorithms for Smoothly Differentiable and Efficiently Vectorizable Contact Manifold Construction
ONUR BEKER, GEORG MARTIUS, ET AL. (ARXIV PHYSICAL AI)
74
APR 17, 2026
Observing and Controlling Features in Vision-Language-Action Models
HUGO BUURMEIJER, MARCO PAVONE, ET AL. (ARXIV PHYSICAL AI)
75
APR 17, 2026
VADF: Vision-Adaptive Diffusion Policy Framework for Efficient Robotic Manipulation
XINGLEI YU, YANWEI FU, ET AL. (ARXIV PHYSICAL AI)
76
APR 17, 2026
VP-VLA: Visual Prompting as an Interface for Vision-Language-Action Models
ZIXUAN WANG, JIAYA JIA, ET AL. (ARXIV PHYSICAL AI)
77
APR 16, 2026
DEX-Mouse: A Low-cost Portable and Universal Interface with Force Feedback for Data Collection of Dexterous Robotic Hands
JOONHO KOH, CHANGJOO NAM, ET AL. (ARXIV PHYSICAL AI)
78
APR 16, 2026
R3D: Revisiting 3D Policy Learning
ZHENGDONG HONG, JIAYUAN GU, ET AL. (ARXIV PHYSICAL AI)
79
APR 16, 2026
Switch: Learning Agile Skills Switching for Humanoid Robots
YUEN-FUI LAU, PING TAN, ET AL. (ARXIV PHYSICAL AI)
80
APR 16, 2026
Vision-Based Safe Human-Robot Collaboration with Uncertainty Guarantees
JAKOB THUMM, MARCO PAVONE, ET AL. (ARXIV PHYSICAL AI)
81
APR 15, 2026
A Mechanistic Analysis of Sim-and-Real Co-Training in Generative Robot Policies
YU LEI, YUKE ZHU, ET AL. (ARXIV PHYSICAL AI)
82
APR 14, 2026
Learning Versatile Humanoid Manipulation with Touch Dreaming
YARU NIU, DING ZHAO, ET AL. (ARXIV PHYSICAL AI)
83
APR 13, 2026
Simulator Adaptation for Sim-to-Real Learning of Legged Locomotion via Proprioceptive Distribution Matching
JEREMY DAO, ALAN FERN
84
APR 13, 2026
Towards Practical World Model-based Reinforcement Learning for Vision-Language-Action Models
ZHILONG ZHANG, YANG YU, ET AL. (ARXIV PHYSICAL AI)
85
APR 13, 2026
ViserDex: Visual Sim-to-Real for Robust Dexterous In-hand Reorientation
ARJUN BHARDWAJ, MARCO HUTTER, ET AL. (ARXIV PHYSICAL AI)
86
APR 10, 2026
Sim-to-Real Transfer for Muscle-Actuated Robots via Generalized Actuator Networks
JAN SCHNEIDER, DIETER BÜCHLER, ET AL. (ARXIV PHYSICAL AI)
87
APR 9, 2026
A-SLIP: Acoustic Sensing for Continuous In-hand Slip Estimation
UKSANG YOO, JEFFREY ICHNOWSKI, ET AL. (ARXIV PHYSICAL AI)
88
APR 9, 2026
HEX: Humanoid-Aligned Experts for Cross-Embodiment Whole-Body Manipulation
SHUANGHAO BAI, BADONG CHEN, ET AL. (ARXIV PHYSICAL AI)
89
APR 9, 2026
LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation
JINGJING WANG, GUOFENG ZHANG, ET AL. (ARXIV PHYSICAL AI)
90
APR 9, 2026
Learning Humanoid Standing-up Control across Diverse Postures
TAO HUANG, JIANGMIAO PANG, ET AL. (ARXIV PHYSICAL AI)
91
APR 9, 2026
RPL: Learning Robust Humanoid Perceptive Locomotion on Challenging Terrains
YUANHANG ZHANG, GUANYA SHI, ET AL. (ARXIV PHYSICAL AI)
92
APR 9, 2026
SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds
YUNSONG ZHOU, JIANGMIAO PANG, ET AL. (ARXIV PHYSICAL AI)
93
APR 9, 2026
Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation
JOHN Z. ZHANG, SIMON LE CLÉAC'H, ET AL. (ARXIV PHYSICAL AI)
94
APR 8, 2026
CMP: Robust Whole-Body Tracking for Loco-Manipulation via Competence Manifold Projection
ZIYANG CHENG, JIWEN LU, ET AL. (ARXIV PHYSICAL AI)
95
APR 7, 2026
Action Images: End-to-End Policy Learning via Multiview Video Generation
HAOYU ZHEN, CHUANG GAN, ET AL. (ARXIV PHYSICAL AI)
96
APR 7, 2026
SnapFlow: One-Step Action Generation for Flow-Matching VLAs via Progressive Self-Distillation
WUYANG LUAN, RUI MA, ET AL. (ARXIV PHYSICAL AI)
97
APR 6, 2026
DySL-VLA: Efficient Vision-Language-Action Model Inference via Dynamic-Static Layer-Skipping for Robot Manipulation
ZEBIN YANG, MENG LI, ET AL. (ARXIV PHYSICAL AI)
98
APR 6, 2026
Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models
YANRU WU, YUE WANG, ET AL. (ARXIV PHYSICAL AI)
99
APR 5, 2026
Adaptive Action Chunking at Inference-time for Vision-Language-Action Models
YUANCHANG LIANG, PRAHLAD VADAKKEPAT, ET AL. (ARXIV PHYSICAL AI)
100
APR 5, 2026
MobileManiBench: Simplifying Model Verification for Mobile Manipulation
WENBO WANG, BAINING GUO, ET AL. (ARXIV PHYSICAL AI)
101
APR 5, 2026
Not All Features Are Created Equal: A Mechanistic Study of Vision-Language-Action Models
BRYCE GRANT, XIJIA ZHAO, PENG WANG
102
APR 5, 2026
frax: Fast Robot Kinematics and Dynamics in JAX
DANIEL MORTON, MARCO PAVONE
103
APR 4, 2026
ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning
YANDAN YANG, MU XU, ET AL. (ARXIV PHYSICAL AI)
104
APR 2, 2026
ForeAct: Steering Your VLA with Efficient Visual Foresight Planning
ZHUOYANG ZHANG, SONG HAN, ET AL. (ARXIV PHYSICAL AI)
105
APR 2, 2026
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution
RUISI CAI, QUAN ZHOU, ET AL. (ARXIV PHYSICAL AI)
106
APR 1, 2026
Functional Force-Aware Retargeting from Virtual Human Demos to Soft Robot Policies
UKSANG YOO, HARSHA PRAHLAD, ET AL. (ARXIV PHYSICAL AI)
107
APR 1, 2026
Learning Humanoid Navigation from Human Data
WEIZHUO WANG, MONROE KENNEDY, ET AL. (ARXIV PHYSICAL AI)
108
APR 1, 2026
SMASH: Mastering Scalable Whole-Body Skills for Humanoid Ping-Pong with Egocentric Vision
JUNLI REN, PING LUO, ET AL. (ARXIV PHYSICAL AI)
109
MAR 31, 2026
Coordinated Humanoid Manipulation with Choice Policies
HAOZHI QI, JITENDRA MALIK, ET AL. (ARXIV PHYSICAL AI)
110
MAR 31, 2026
DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA
YI CHEN, XIHUI LIU, ET AL. (ARXIV PHYSICAL AI)
111
MAR 30, 2026
FocusVLA: Focused Visual Utilization for Vision-Language-Action Models
YICHI ZHANG, JIA WAN, ET AL. (ARXIV PHYSICAL AI)
112
MAR 30, 2026
OmniGuide: Universal Guidance Fields for Enhancing Generalist Robot Policies
YUNZHOU SONG, KOSTAS DANIILIDIS, ET AL. (ARXIV PHYSICAL AI)
113
MAR 30, 2026
SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning
PHILIP SCHROEDER, ONDREJ BIZA, ET AL. (ARXIV PHYSICAL AI)
114
MAR 30, 2026
Scaling World Model for Hierarchical Manipulation Policies
QIAN LONG, XINGHANG LI, ET AL. (ARXIV PHYSICAL AI)
115
MAR 29, 2026
Rethinking Visual-Language-Action Model Scaling: Alignment, Mixture, and Regularization
YE WANG, QIN JIN, ET AL. (ARXIV PHYSICAL AI)
116
MAR 29, 2026
ST4VLA: Spatially Guided Training for Vision-Language-Action Models
JI-LU YE, JIANGMIAO PANG, ET AL. (ARXIV PHYSICAL AI)
117
MAR 28, 2026
VLAW: Iterative Co-Improvement of Vision-Language-Action Policy and World Model
YANJIANG GUO, CHELSEA FINN, ET AL. (ARXIV PHYSICAL AI)
118
MAR 27, 2026
Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining
WENYAO ZHANG, LI ZHANG, ET AL. (ARXIV PHYSICAL AI)
119
MAR 26, 2026
A Unified and General Humanoid Whole-Body Controller for Versatile Locomotion
YUFEI XUE, JIANGMIAO PANG, ET AL. (ARXIV PHYSICAL AI)
120
MAR 26, 2026
OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction
LUJIE YANG, GUANYA SHI, ET AL. (ARXIV PHYSICAL AI)
121
MAR 26, 2026
SoftMimicGen: A Data Generation System for Scalable Robot Learning in Deformable Object Manipulation
MASOUD MOGHANI, AJAY MANDLEKAR, ET AL. (ARXIV PHYSICAL AI)
122
MAR 25, 2026
Steerable Vision-Language-Action Policies for Embodied Reasoning and Hierarchical Control
WILLIAM CHEN, SERGEY LEVINE, ET AL. (ARXIV PHYSICAL AI)
123
MAR 23, 2026
DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models
ZHIDE ZHONG, HAOANG LI, ET AL. (ARXIV PHYSICAL AI)
124
MAR 23, 2026
UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos
GU ZHANG, HUAZHE XU, ET AL. (ARXIV PHYSICAL AI)
125
MAR 23, 2026
WholeBodyVLA: Towards Unified Latent VLA for Whole-Body Loco-Manipulation Control
HAORAN JIANG, HONGYANG LI, ET AL. (ARXIV PHYSICAL AI)
126
MAR 22, 2026
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos
SHENYUAN GAO, LINXI "JIM" FAN, ET AL. (ARXIV PHYSICAL AI)
127
MAR 19, 2026
OmniVTA: Visuo-Tactile World Modeling for Contact-Rich Robotic Manipulation
YUHANG ZHENG, WENCHAO DING, ET AL. (ARXIV PHYSICAL AI)