Particle.news

OpenAI Launches o3 and o4-mini Models with Visual Reasoning and Autonomous Tool Use

The AI leader also debuts Codex CLI, an open-source coding agent, as part of its latest push to dominate the evolving AI landscape.

OpenAI cofounder and CEO Sam Altman. Today the company released two new AI "reasoning" models, o3 and o4-mini, as it seeks to show it can remain at the front of the AI pack.

Overview

  • OpenAI has released o3, its most advanced reasoning model yet, and o4-mini, a cost-efficient alternative, both available to ChatGPT Plus, Pro, and Team users as well as developers via API.
  • The new models introduce a 'think with images' capability, enabling them to integrate and analyze visual inputs like sketches, diagrams, and blurry images directly within their reasoning processes.
  • o3 and o4-mini can autonomously use all ChatGPT tools, including web browsing, Python code execution, image processing, and image generation, to solve complex, multi-step problems.
  • OpenAI also launched Codex CLI, an open-source coding agent designed to seamlessly connect AI models with local coding environments, supported by a $1 million initiative to fund early projects.
  • This release marks a significant step in OpenAI's strategy to maintain its competitive edge in the global AI race, addressing both user and enterprise demands for advanced, multimodal AI tools.