Google DeepMind @ ICRA 2024

Welcome to Yokohama! Here is where you can meet the Google DeepMind team at ICRA 2024.

Vincent Vanhoucke
May 12, 2024
A dreamy rendering of Yokohama, Japan, according to Gemini

Workshops

Agile Robotics: From Perception to Dynamic Action, organized by Huang (Raven) Huang, Shouren Huang, Jeffrey Ichnowski, Atil Iscen, Yuntao Ma, Gabriel Margolis, Pannag Sanketi, Daniel Seita, Guanya Shi, Joanne Truong, Yuji Yamakawa, Yuxiang Yang and Tingnan Zhang.

NEW! Check out our fully open-source Barkour quadruped design.

Mobile Manipulation and Embodied Intelligence (MOMA.v2) — speaker: Keerthana Gopalakrishnan

Vision-Language Models for Navigation and Manipulation, organized by Chris Paxton, Fei Xia, Karmesh Yadav, Nur Muhammad Mahi Shafiullah, Naoki Wake, Weiyu Liu, Yujin Tang and Zhutian Yang.
- Debater: Ted Xiao
- Paper: AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents, Michael Ahn, Debidatta Dwibedi, Chelsea Finn, Montserrat Gonzalez Arenas, Keerthana Gopalakrishnan, Karol Hausman, Brian Ichter, Alex Irpan, Nikhil J Joshi, Ryan Julian, Sean Kirmani, Isabel Leal, Tsang-Wei Edward Lee, Sergey Levine, Yao Lu, Sharath Maddineni, Kanishka Rao, Dorsa Sadigh, Pannag R Sanketi, Pierre Sermanet, Quan Vuong, Stefan Welker, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Zhuo Xu

Advancements in Trajectory Optimization and Model Predictive Control for Legged Systems — speaker: Yuval Tassa

Events

Open X-Embodiment mixer: Wednesday May 15th, 5:45 PM — 6:45 PM

Session Chairs

AI-Enabled Robotics I: Montserrat Gonzalez Arenas

Papers

RoboVQA: Multimodal Long-Horizon Reasoning for Robotics, Pierre Sermanet, Tianli Ding, Jeffrey Zhao, Fei Xia, Debidatta Dwibedi, Keerthana Gopalakrishnan, Christine Chan, Gabriel Dulac-Arnold, Sharath Maddineni, Nikhil J Joshi, Pete Florence, Wei Han, Robert Baruch, Yao Lu, Suvir Mirchandani, Peng Xu, Pannag Sanketi, Karol Hausman, Izhak Shafran, Brian Ichter, Yuan Cao

Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation, Annie Xie, Lisa Lee, Ted Xiao, Chelsea Finn

How to Prompt Your Robot: A Prompt Book for Manipulation Skills with Code As Policies, Montserrat Gonzalez Arenas, Ted Xiao, Sumeet Singh, Vidhi Jain, Allen Z Ren, Quan Vuong, Jake Varley, Alexander Herzog, Isabel Leal, Sean Kirmani, Dorsa Sadigh, Vikas Sindhwani, Kanishka Rao, Jacky Liang, Andy Zeng

RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation, Mel Vecerik, Carl Doersch, Yi Yang, Todor Davchev, Yusuf Aytar, Guangyao Zhou, Raia Hadsell, Lourdes Agapito, Jon Scholz

Best Paper Award! Open X-Embodiment: Robotic Learning Datasets and RT-X Models, The Open X-Embodiment Collaboration

Best Paper Award in Robot Manipulation! SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention, Isabel Leal, Krzysztof Choromanski, Deepali Jain, Avinava Dubey, Jake Varley, Michael Ryoo, Yao Lu, Frederick Liu, Vikas Sindhwani, Quan Vuong, Tamas Sarlos, Ken Oslund, Karol Hausman, Kanishka Rao

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots, Thomas Lampe, Abbas Abdolmaleki, Sarah Bechtle, Sandy H. Huang, Jost Tobias Springenberg, Michael Bloesch, Oliver Groth, Roland Hafner, Tim Hertweck, Michael Neunert, Markus Wulfmeier, Jingwei Zhang, Francesco Nori, Nicolas Heess, Martin Riedmiller

Physically Grounded Vision-Language Models for Robotic Manipulation, Jensen Gao, Bidipta Sarkar, Fei Xia, Ted Xiao, Jiajun Wu, Brian Ichter, Anirudha Majumdar, Dorsa Sadigh

Conditionally Combining Robot Skills Using Large Language Models, K.R. Zentner, Ryan Julian, Brian Ichter, Gaurav S. Sukhatme

Distilling and Retrieving Generalizable Knowledge for Robot Manipulation Via Language Corrections, Lihan Zha, Yuchen Cui, Li-Heng Lin, Minae Kwon, Montserrat Gonzalez Arenas, Andy Zeng, Fei Xia, Dorsa Sadigh

Robotic Offline RL from Internet Videos Via Value-Function Learning, Chethan Bhateja, Derek Guo, Dibya Ghosh, Anikait Singh, Manan Tomar, Quan Vuong, Yevgen Chebotar, Sergey Levine, Aviral Kumar

Learning Manipulation of Steep Granular Slopes for Fast Mini Rover Turning, Deniz Kerimoglu, Daniel Soto, Malone Lincoln Hemsley, Joseph Brunner, Sehoon Ha, Tingnan Zhang, Daniel I. Goldman

Robots That Can See: Leveraging Human Pose for Trajectory Prediction, Tim Salzmann, Lewis Chiang, Markus Ryll, Dorsa Sadigh, Carolina Parada, Alex Bewley — Top entry on the JackRabbot Trajectory Forecasting leaderboard

Vincent Vanhoucke

I am a Distinguished Engineer at Waymo, working on Machine Learning and Robotics. Previously head of robotics research at Google DeepMind.