14:00 UTC·
Google[GOOGLE]

Google releases Gemini 3.1 Flash-Lite at $0.25/M input — 2.5x faster than prior generation

Source
What Happened

Google released Gemini 3.1 Flash-Lite, an efficiency-focused model delivering 2.5x faster response times and 45% faster output generation compared to earlier Gemini versions. Input pricing is $0.25 per million tokens — among the lowest of any frontier-class model currently available. The model is accessible via Google AI Studio and Vertex AI.

Why It Matters

At $0.25/M input, Gemini 3.1 Flash-Lite prices below DeepSeek's API tier and well below GPT-4o Mini's comparable pricing, intensifying a cost race at the high-volume developer tier. Speed and cost together target the workloads where developers currently use smaller, cheaper models — classification, summarization, extraction, routing — but want frontier-quality outputs. Google is subsidizing market share in the developer tier where Anthropic and OpenAI currently have stronger mind share.

More from Google
GET THE DAILY DIGEST IN YOUR INBOX →

Every story from each day, delivered at midnight UTC.

← back to 2026-03-10
NWSRM · AI FEED
Built by [COMPANY] · Powered by nwsrm.ai