Overview
- Gemini 3 Flash is now the default across the Gemini API in AI Studio, the Gemini CLI, Google Antigravity, the Gemini app, AI Mode in Search, Vertex AI and Gemini Enterprise.
- Google lists pricing at $0.50 per million input tokens and $3 per million output tokens, with audio input at $1 per million, and claims roughly 30% fewer tokens than 2.5 Pro and about 3x faster performance.
- Developer guides detail native multimodality with file and video processing plus very large context windows reported up to 2 million tokens.
- APIs and SDKs support function calling, tool use and context caching to cut latency and cost for repeated queries over large documents.
- Google describes Gemini 3 as its most thoroughly safety‑evaluated model to date, and developers report API usage exceeding 1 trillion tokens per day since the broader Gemini 3 rollout.