Particle.news
Download on the App Store

Google Rolls Out Gemini 3 Flash as Its Default High‑Speed Model

Google is positioning the model for real-time deployment with pricing set for high‑volume adoption.

Overview

  • Gemini 3 Flash is now the default across the Gemini API in AI Studio, the Gemini CLI, Google Antigravity, the Gemini app, AI Mode in Search, Vertex AI and Gemini Enterprise.
  • Google lists pricing at $0.50 per million input tokens and $3 per million output tokens, with audio input at $1 per million, and claims roughly 30% fewer tokens than 2.5 Pro and about 3x faster performance.
  • Developer guides detail native multimodality with file and video processing plus very large context windows reported up to 2 million tokens.
  • APIs and SDKs support function calling, tool use and context caching to cut latency and cost for repeated queries over large documents.
  • Google describes Gemini 3 as its most thoroughly safety‑evaluated model to date, and developers report API usage exceeding 1 trillion tokens per day since the broader Gemini 3 rollout.