RT-X and the Dawn of Large Multimodal Models: Google Breakthrough and 160-page Report Highlights

Microsoft has released a huge new insider report on GPT Vision, and just in the last few hours Google has dropped the RT-X series in robotics. I will not only break down the 160-page report on what GPT-4V can and can’t do, including new use cases, prompting techniques and failure modes, I’ll also go through the full RT-2-X and RT-1-X demo, which I am calling the GPT-2 moment for robotics. I’ll also cover the huge new open-source Open X-Embodiment dataset released with it.