Google DeepMind Enhances Robots with Advanced Gemini AI
New capabilities enable robots to navigate and perform tasks using extended context from videos and human instructions.
- DeepMind's robots use Gemini 1.5 Pro to understand and navigate real-world environments.
- The AI model's long context window allows for more complex task execution and multi-step instructions.
- Robots are trained by watching video tours and absorbing extensive environmental details.
- Tests show a 90% success rate in following over 50 different user instructions in a controlled environment.
- Potential applications include home assistance, healthcare, and various service industries.