An Thai Le

News

Jul 2026 Several new preprints on vision-language-action models (efficient finetuning, equivariance, diffusion-noise selection, and a unified inference runtime) and on payload-robust humanoid locomotion and whole-body compliance, now under review. Most of this work comes from the many bright VinUniversity students and VinRobotics residents I am grateful to work with. See the publications below.
Jun 2026 Happy to serve as an Area Chair for CoRL 2026. Looking forward to reading your submissions!
Jun 2026 Invited talks at the ICRA 2026 Workshop on Robot Architecture and at the 2nd Next-Gen Robot Learning Symposium, on Designing Planning Algorithms for the Era of Parallelism. The slides are available here.
May 2026 Rarity of rocket-driven Penrose extraction in Kerr spacetime accepted at Physical Review D.
Oct 2025 Started as Assistant Professor at VinUniversity and Visiting Professor at TU Darmstadt.
Aug 2025 Joined VinRobotics as Director of Foundation AI, leading RL-for-locomotion, humanoid VLA architecture, and edge-deployment efforts.
Aug 2025 Successfully defended my Ph.D. thesis on Tensor Search Methods for Vectorizing Motion Planning at TU Darmstadt! You can find the full thesis here. Endless thanks to Prof. Jan Peters for tolerating my mischief and supporting me throughout. I could not have asked for a better advisor. Huge gratitude to Prof. Siddhartha Srinivasa for immediately agreeing to serve as external examiner, and to the rest of the committee for their time and feedback.
Jul 2025 Model Tensor Planning accepted at TMLR 2025 and ICLR 2026 (J2C track). Thanks to the co-authors and reviewers for getting it there.
Jul 2025 Global Tensor Motion Planning published in IEEE RA-L 2025 and accepted at ICRA 2026.
Jun 2025 Motion Planning Diffusion published in IEEE T-RO 2025.
May 2025 Invited talks at RMIT University, Rice University, and HUST on tensor search methods for motion planning.

Research

I try to scale planning to settings classical methods struggle with: long horizons, high-dimensional state spaces, large plan sets, many agents. I do this by treating search as a batched tensor operation and by leaning on generative models where structure runs out. Most current work targets humanoid loco-manipulation and vision-language-action models that stay efficient and reliable enough to run on real robots.

Tensor Search & Batched Planning

Casting search and trajectory optimization as batched tensor operations on the GPU, the spine of my thesis and most of my recent planners.
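As a minimal sketch of what "search as a batched tensor operation" can look like, the toy example below finds the shortest path through a layered waypoint graph, relaxing all edges between two consecutive layers in a single array operation. It is an illustration under assumptions, not the planners from the publications listed on this page: the function name `batched_layer_search`, the layered-graph layout, and the Euclidean edge cost are all hypothetical choices, and NumPy stands in for a GPU array library.

```python
import numpy as np

def batched_layer_search(start, goal, waypoints):
    """Shortest path through a layered waypoint graph using batched array ops.

    waypoints: array of shape (M, N, d), i.e. M layers of N candidate points.
    Returns (path, cost), where path has shape (M + 2, d).
    """
    M, N, _ = waypoints.shape
    # Cost-to-come of every node in the first layer, computed in one shot.
    cost = np.linalg.norm(waypoints[0] - start, axis=-1)      # (N,)
    parents = np.zeros((M, N), dtype=int)
    for m in range(1, M):
        # All pairwise edge lengths between consecutive layers: (N, N).
        edges = np.linalg.norm(
            waypoints[m - 1][:, None, :] - waypoints[m][None, :, :], axis=-1
        )
        total = cost[:, None] + edges                          # (N_prev, N_next)
        parents[m] = np.argmin(total, axis=0)                  # best predecessor
        cost = np.min(total, axis=0)
    # Close the path at the goal, then backtrack the argmin pointers.
    cost = cost + np.linalg.norm(waypoints[-1] - goal, axis=-1)
    idx = int(np.argmin(cost))
    best = float(cost[idx])
    path = [goal]
    for m in range(M - 1, -1, -1):
        path.append(waypoints[m, idx])
        idx = parents[m, idx]
    path.append(start)
    return np.stack(path[::-1]), best
```

The only Python loop runs over layers; everything inside a layer is a broadcasted reduction, which is the part that maps naturally onto a GPU.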

Diffusion & Flow Matching for Motion

Using diffusion and flow matching as priors over trajectories and policies, especially when the solution landscape is multimodal and gradients alone are not enough. Lately, this includes choosing the initial noise itself to make action chunking smoother and more robust.
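A toy sketch of the initial-noise idea, under loudly stated assumptions: `sampler` is a hypothetical deterministic map from a noise tensor to an action chunk (standing in for a diffusion or flow-matching policy), and the selection criterion used here, continuity with the previously executed action, is an illustrative choice rather than the criterion from any of the papers on this page.

```python
import numpy as np

def select_initial_noise(sampler, prev_action, horizon, dim,
                         num_candidates=16, seed=0):
    """Pick the initial noise whose generated chunk starts closest to prev_action.

    sampler: hypothetical deterministic function mapping noise of shape
             (horizon, dim) to an action chunk of the same shape.
    """
    rng = np.random.default_rng(seed)
    # Draw a batch of candidate initial noises.
    noises = rng.standard_normal((num_candidates, horizon, dim))
    # Generate one action chunk per candidate noise.
    chunks = np.stack([sampler(z) for z in noises])            # (K, H, dim)
    # Score each chunk by the jump between its first action and the last one executed.
    jump = np.linalg.norm(chunks[:, 0] - prev_action, axis=-1)
    best = int(np.argmin(jump))
    return noises[best], chunks[best]
```

For instance, with the stand-in sampler `lambda z: np.cumsum(0.1 * z, axis=0)` and `prev_action = np.zeros(2)`, the function returns the candidate whose first action has the smallest norm.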

Humanoid Loco-manipulation

Whole-body RL and model-based control for humanoids in contact-rich tasks: perceptive locomotion that holds up over rough terrain and under payload, and compliant control that yields safely during contact and cooperative carrying. Ongoing, and many things still fall over.

Vision-Language-Action Models

Making VLA policies cheaper to run and steadier on real robots: equivariant architectures, lighter finetuning, better grounding from fewer demos, and inference runtimes for the edge.

Optimal Transport & Gradient Flows

Borrowing entropic OT and gradient-flow machinery to design planners, blend policies, and train networks where standard gradients break down.
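For readers unfamiliar with entropic OT, the standard Sinkhorn iteration is short enough to sketch. This is the textbook algorithm, not code from the repositories listed on this page; the function name and default parameters are illustrative assumptions.

```python
import numpy as np

def sinkhorn(a, b, C, eps=0.5, n_iters=200):
    """Entropic OT plan between histograms a (n,) and b (m,) with cost C (n, m)."""
    K = np.exp(-C / eps)          # Gibbs kernel of the cost matrix
    u = np.ones_like(a)
    v = np.ones_like(b)
    for _ in range(n_iters):
        u = a / (K @ v)           # rescale rows to match marginal a
        v = b / (K.T @ u)         # rescale columns to match marginal b
    # Transport plan: diag(u) K diag(v).
    return u[:, None] * K * v[None, :]
```

Smaller `eps` gives a plan closer to unregularized OT but converges more slowly and can underflow in `K`; a log-domain variant is the usual remedy in that regime.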

Numerical General Relativity

A weekend hobby: JAX/PyTorch CUDA simulators for processes in curved spacetime, including Kerr orbits, Penrose extraction, and warp-drive energy conditions.
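As a small taste of the geometry involved, the sketch below evaluates the standard Boyer-Lindquist expressions for the outer horizon and the ergosphere of a Kerr black hole in units G = c = 1. Penrose extraction takes place in the region between these two surfaces. The function name is a hypothetical choice, and this is not code from the simulators mentioned above.

```python
import math

def kerr_radii(a, theta, M=1.0):
    """Outer horizon and ergosphere radii of a Kerr black hole (G = c = 1).

    a: spin parameter with |a| <= M; theta: polar angle in radians.
    """
    if abs(a) > M:
        raise ValueError("no horizon for |a| > M")
    # Outer event horizon: r_+ = M + sqrt(M^2 - a^2).
    r_horizon = M + math.sqrt(M**2 - a**2)
    # Ergosphere (static limit): r_E = M + sqrt(M^2 - a^2 cos^2(theta)).
    r_ergo = M + math.sqrt(M**2 - (a * math.cos(theta))**2)
    return r_horizon, r_ergo
```

The ergosphere touches the horizon at the poles and reaches r = 2M on the equator for any spin, so the ergoregion is widest for near-extremal spin.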

Humanoid Demos

Selected humanoid locomotion and manipulation demonstrations from VinRobotics. These videos show sim-to-real RL policies, perceptive stair climbing, and whole-body control running on our own hardware stack.

Compliant Whole-Body Control

Instead of fighting back when pushed, the ~70 kg VR-M3 follows the human body, bends its knees, drops the pelvis, and yields to the force. It is compliant enough that we held its hand to co-draw a picture, with a passivity guarantee not to damage it.

Read article →

Perceptive Stair Locomotion

VR-M3 (~60 kg) climbs unfamiliar staircases at 0.6 m/s with a 5 kg payload using onboard terrain sensing and learned locomotion, without LiDAR, mocap, or teleoperation.

Read article →

Human-Level Walking Speed

VR-H3 (178 cm, 85 kg) reaches human-level walking speed via RL-based locomotion with gait reward design, domain randomization, and curriculum learning.

Read article →

Built from the Motor Up

Custom high-torque-density actuators with real-time EtherCAT communication enable 1.5–1.8 m/s dynamic walking. Full native stack, fast iteration.

Read article →

Real-World Football

A humanoid robot takes part in a real game: passing, running alongside people, and celebrating goals in an unscripted outdoor environment.

Read article →

Global Debut

Platform preview for Computex and ICRA 2026: whole-body teleoperation, dynamic payload handling, MPC + RL locomotion, and perception-action learning.

Read article →

Selected Publications

* indicates co-first or co-last authors. See also my Google Scholar profile.

★ Featured

RoboGaze: Evaluating Robot World Models via Structured Vision-Language Analysis

M.L. Nguyen, N.T. Diep, H.K. Nguyen, M. Le, D.L. Thien, H.H. Tran, D.D. Le, V. Duong, D. Sonntag, An Thai Le, D.M.H. Nguyen, N.A. Vien, T.V. Nhiem

Submitted
arXiv Website
★ Featured

Whole-Body Compliance for Heavy Humanoids via Force Latent Estimation and Residual Impedance Targets

T.D. Do*, C.T. Trinh*, P.T. Dat, T.D. Dang, C. Le, T. Ly, V.A. Ngo, An Thai Le

Submitted
Website PDF
Self-Improving VLA Policies: Selected Diffusion Noise for Spurious-Robust Action Smoothing

D.M. Nguyen, B.N. Dao, T.M. Luu, B.G. Nguyen, V. Tong, A. Liu, V.N. Duong, D.D. Le, D. Sonntag, T. Le, N. Le, J. Peters, An Thai Le, M.N. Vu, M. Niepert, K.D. Doan, D.M.H. Nguyen, V.A. Ngo

Submitted
arXiv
★ Featured

TACT-ful: Multi-Channel Terrain Affordance and Compliance Training for Payload-Robust Perceptive Humanoid Locomotion

T. Ly*, T.D. Dang*, C. Le, T.D. Do, P.T. Dat, C.T. Trinh, V.A. Ngo, An Thai Le

Submitted
arXiv Website
Start Right, Arrive Right: Asynchronous Execution via Initial Noise Selection

T.B. Ho*, Q.T. Nguyen*, T.L. Ha*, G.B. Nguyen, V.T. Nguyen, L. Dinh, M.N. Vu, D.M.H. Nguyen, An Thai Le, V.A. Ngo

Submitted
arXiv Website
★ Featured

EquiVLA: A General Framework for Rotationally Equivariant Vision-Language-Action Models

T.L. Ha, Q.T. Nguyen, T.B. Ho, L. Dinh, M.D. Nguyen, G.B. Nguyen, T.Q. Pham, M.N. Vu, D.M.H. Nguyen, An Thai Le, V.A. Ngo

Submitted
arXiv Website
★ Featured

vla.cpp: A Unified Inference Runtime for Vision-Language-Action Models

K.D. Nguyen, H.T. Ho, C.T. Nguyen, T.Q. Duong, L.D. Le, D.M.H. Nguyen, V.A. Ngo, An Thai Le

Submitted
arXiv Website
Finetuning Vision-Language-Action Models Requires Fewer Layers Than You Think

G.B. Nguyen, T.B. Ho, T.L. Ha, K. Vo, P.L. Møller, Q.T. Nguyen, L. Dinh, T.M. Luu, T. Dam, V. Duong, T. Le, N.D.Q. Bui, M. Vu, T.N. Le, An Thai Le, N. Le, D. Sonntag, J. Zou, J. Peters, D.M.H. Nguyen, V.A. Ngo

Submitted
arXiv Website
StructSAM: Structure- and Spectrum-Preserving Token Merging for Segment Anything Models

D.M.H. Nguyen, T.A. Tran, D. Nguyen, S. Xie, T.Q. Nguyen, M.T.N. Truong, D. Palenicek, An Thai Le, M. Barz, T. Nguyen, T. Dam, N. Le, M. Vu, K. Doan, V. Ngo, P. Xie, J. Zou, D. Sonntag, J. Peters, M. Niepert

Submitted
arXiv OpenReview
★ Featured

Training Non-differentiable Networks via Optimal Transport

An Thai Le

Submitted
arXiv Code
★ Featured

AAC: Admissible-by-Architecture Differentiable Landmark Compression for ALT

An Thai Le, V.A. Ngo

Submitted
arXiv Code
CLOT: Multi-Robot Motion Planning Via Collaborative Optimal Transport under Signal Temporal Logic Tasks

Y. Zhang, Y. Zhang, An Thai Le, M. Guo

ICRA 2026
PDF
FOCA: Future-Oriented Conditioning for Data-Efficient Vision-Language-Action Adaptation

M.D. Nguyen, T.D. Nghiem, G.B. Nguyen, T.B. Ho, D.T. Le, Q.T. Nguyen, T.L. Ha, V.N. Tran, B. Thach, X.N. Tran, T.A. Tran, A. Habuda, P.L. Møller, N.L. Tran, D. Sonntag, M. Niepert, K.D. Doan, V.N. Duong, H. Ngo, M.N. Vu, D.M.H. Nguyen, An Thai Le*, V.A. Ngo*

ICML 2026
arXiv Website Code
★ Featured

Observer-robust energy condition verification for warp drive spacetimes

An Thai Le

CQG 2026 Submitted
arXiv Code
Rarity of rocket-driven Penrose extraction in Kerr spacetime

An Thai Le

PRD 2026
DOI arXiv Code
★ Featured

Model Tensor Planning

An Thai Le, Khai Nguyen, Minh Nhat Vu, João Carvalho, Jan Peters

ICLR 2026 TMLR 2025
OpenReview Code
★ Featured

Global Tensor Motion Planning

An Thai Le, Kay Pompetzki, João Carvalho, Joe Watson, Julen Urain, Armin Biess, Georgia Chalvatzaki, Jan Peters

IEEE RA-L 2025 ICRA 2026
arXiv Code
Motion Planning Diffusion: Learning and Adapting Robot Motion Planning with Diffusion Models

João Carvalho, An Thai Le, Philipp Jahr, Qiao Sun, Julen Urain, Dorothea Koert, Jan Peters

IEEE T-RO 2025 AAAI 2026
arXiv Code
DoublyAware: Dual Planning and Policy Awareness for Temporal Difference Learning in Humanoid Locomotion

Khang Nguyen, An Thai Le, Jan Peters, Minh Nhat Vu

IEEE RA-L 2025
arXiv
Machine Learning with Physics Knowledge for Prediction: A Survey

Joe Watson, Chen Song, Oliver Weeger, Theo Gruner, An Thai Le, Kay Hansel, Ahmed Hendawy, Alexander Arenz, William Trojak, Kyle Cranmer, C Alberto D'Eramo, Felix Buelow, Tanmay Goyal, Jan Peters, Marc W. Hoffmann

TMLR 2025
arXiv
FlowMP: Learning Motion Fields for Robot Planning with Conditional Flow Matching

Khang Nguyen, An Thai Le, T. Pham, M. Huber, Jan Peters, Minh Nhat Vu

IROS 2025
arXiv Code
Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)×R³

João Carvalho, An Thai Le, Philipp Jahr, Qiao Sun, Julen Urain, Dorothea Koert, Jan Peters

2025
arXiv
Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks

D.M.H. Nguyen*, N. Lukashina*, T. Nguyen, An Thai Le, T. Nguyen, N. Ho, Jan Peters, D. Sonntag, V. Zaverkin, M. Niepert

ICML 2024
arXiv Code
Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model

An Thai Le*, D.M.H. Nguyen*, T.Q. Nguyen, N.T. Diep, T. Nguyen, D. Duong-Tran, Jan Peters, L. Shen, M. Niepert, D. Sonntag

ACML 2024
arXiv
Accelerating Motion Planning via Optimal Transport

An Thai Le, Georgia Chalvatzaki, Armin Biess, Jan Peters

NeurIPS 2023
arXiv Code
Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models

João Carvalho, An Thai Le, Mark Baierl, Dorothea Koert, Jan Peters

IROS 2023
Hierarchical Policy Blending As Optimal Transport

An Thai Le, Kay Hansel, Jan Peters, Georgia Chalvatzaki

L4DC 2023
Proceedings
Learning to reason over scene graphs: a case study of finetuning GPT-2 into a robot language model for grounded task planning

Georgia Chalvatzaki, Ali Younes, Daljeet Nandha, An Thai Le, L.F.R. Ribeiro, I. Gurevych

Frontiers in Robotics and AI 2023
arXiv
Learning Implicit Priors for Motion Optimization

An Thai Le*, Julen Urain*, Alexander Lambert*, Georgia Chalvatzaki, Byron Boots, Jan Peters

IROS 2022
arXiv Code
Learning forceful manipulation skills from multi-modal human demonstrations

An Thai Le, Meng Guo, Niels Van Duijkeren, L. Rozo, R. Krug, A.G. Kupcsik, M. Buerger

IROS 2021
arXiv
Hierarchical Human-Motion Prediction and Logic-Geometric Programming for Minimal Interference Human-Robot Tasks

An Thai Le, P. Kratzer, S. Hagenmayer, M. Toussaint, Jim Mainprice

IEEE RO-MAN 2021
arXiv

Open Source

anindex/note_model_opt

A collection of notes and examples on model deployment.

26 2

anindex/aac

Differentiable, architecturally admissible compressor (AAC) for A* search.

Python 1 0

anindex/polystep

Training non-differentiable networks via optimal transport.

Python 16 0

anindex/warpax

Observer-robust energy condition verification for warp drive spacetimes.

Python 1 1

anindex/mtp

Model Tensor Planning in JAX. TMLR 2025 & ICLR 2026.

Python 38 2

anindex/penrose_process

Penrose energy extraction simulation in Kerr spacetime.

Python 2 0

anindex/mpot

Motion Planning via Optimal Transport (MPOT) in PyTorch. NeurIPS 2023.

Python 66 11

anindex/gtmp

Global Tensor Motion Planning (GTMP) in JAX. RA-L 2025 & ICRA 2026.

Python 34 5

VinRobotics/vla.cpp

A unified inference runtime for VLA models.

C++ 30 3

VinRobotics/vinrobotics_mjlab

RL training pipeline for high-payload humanoid locomotion, built on MuJoCo-warp.

Python 30 3

VinRobotics/model-quantization-recipes

Practical quantization recipes for LLMs and speech models, from preparation through deployment-oriented validation.

Python 17 2

vincekurtz/hydrax

Sampling-based model predictive control on GPU with JAX / MJX.

Python 277 48

Experience

Visiting Professor

TU Darmstadt

Oct 2025–Present · Darmstadt, Germany

Co-advising MSc and PhD students at IAS on robot learning research.

Assistant Professor

VinUniversity

Oct 2025–Present · Hanoi, Vietnam

Building a research group on efficient learning and planning for robotic loco-manipulation, with a focus on designing fundamental algorithms and methods.

Director of Foundation AI

VinRobotics

Aug 2025–Present · Hanoi, Vietnam

RL stack for high-payload humanoid locomotion
Humanoid VLA architecture and training recipe
Model optimization and edge-deployment toolchain

Ph.D. in Computer Science

Technische Universität Darmstadt - Intelligent Autonomous Systems (IAS)

2022–2025 · Darmstadt, Germany

Thesis: Tensor Search Methods for Vectorizing Motion Planning, supervised by Prof. Jan Peters.

Research Intern

Bosch AI

May 2020–Dec 2020 · Renningen, Germany

Worked on forceful imitation learning applied to E-bike assembly tasks, hosted by Dr. Meng Guo in the robotics team.

M.Sc. Information Technology

Universität Stuttgart

2019–2021 · Stuttgart, Germany

Thesis: Learning task-parameterized Riemannian motion policies, supervised by Dr. Jim Mainprice and Dr. Meng Guo. Graduated First class. Info-Preis for Best Diploma Award. Sony Research Award. Deutschlandstipendium.

Research Assistant

HLRS (High-Performance Computing Center Stuttgart)

Nov 2019–Apr 2020 · Stuttgart, Germany

Implemented back-end functionalities in the DASH project; maintained and configured HPC systems.

B.Eng. Electrical Engineering and Information Technology

Frankfurt University of Applied Sciences

2015–2019 · Frankfurt, Germany

Thesis: Approaches to solve kidnapped robot problem. Graduated First class. DAAD Scholarship. AmCham Scholarship. eSilicon Scholarship.

Engineer Intern

Intel Corporation

Jan 2017–May 2017 · Ho Chi Minh City, Vietnam

Designed data analysis systems for high-volume manufacturing unit-test data; validated and reported on the quality of the Intel Thunderbolt product manufacturing line.

Teaching

Reinforcement Learning TU Darmstadt · SS 2022
Statistical Machine Learning TU Darmstadt · SS 2023, WS 2023/24, SS 2024, WS 2024/25
Probabilistic Methods for Computer Science TU Darmstadt · WS 2024/25
Robot Learning Integrated Project / Expert Lab / Mechatronics TU Darmstadt · WS 2024/25