HeadlinesBriefing.com

Google's Gemini 3.1 Flash-Lite: Faster AI for Developers

Android Central

Google has unveiled Gemini 3.1 Flash-Lite, its latest AI model, designed for developers handling complex data workloads. The company positions it as its fastest and most affordable option yet, priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens. The new model is meant to replace the previous 2.5 Flash while offering significant performance improvements.
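To put the quoted pricing in concrete terms, here is a minimal sketch of a cost estimate for a single workload. Only the two per-million-token prices come from the article; the function name `estimate_cost` and the token counts in the usage lines are hypothetical, and real bills may differ if Google applies other pricing rules not covered here.

```python
from decimal import Decimal

# Prices quoted in the article, in US dollars per 1M tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.25")
OUTPUT_PRICE_PER_MILLION = Decimal("1.50")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for one workload.

    Hypothetical helper: it applies only the two flat rates above.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * INPUT_PRICE_PER_MILLION
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * OUTPUT_PRICE_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative workload: 2M input tokens and 500K output tokens.
    print(f"${estimate_cost(2_000_000, 500_000):.2f}")
```

Because output tokens cost six times as much as input tokens at these rates, workloads that generate long responses will be dominated by the output side of the bill.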

According to Google, 3.1 Flash-Lite delivers a 2.5x faster Time to First Answer Token and a 45% boost in output speed compared with 2.5 Flash. The model scored 1,432 on the Arena.ai Leaderboard and, Google says, performs strongly on reasoning and multimodal understanding benchmarks. Google claims it outperforms both competing models and its own 2.5 Flash in these areas.
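As a rough illustration of what those two speed claims could mean for a full response, the sketch below combines them into one end-to-end estimate. It assumes "2.5x faster" means the time to first token is divided by 2.5 and that the 45% boost multiplies tokens per second by 1.45. The baseline figures in the usage lines are hypothetical placeholders, not measurements of 2.5 Flash.

```python
# Claimed improvements from the article, under the interpretation stated above.
TTFT_SPEEDUP = 2.5        # time to first token divided by this factor (assumption)
OUTPUT_SPEED_GAIN = 0.45  # output tokens per second increased by 45%


def response_time(ttft_s: float, tokens_per_s: float, output_tokens: int) -> float:
    """Total seconds for a response: wait for the first token, then stream the rest."""
    return ttft_s + output_tokens / tokens_per_s


def projected_response_time(
    baseline_ttft_s: float, baseline_tokens_per_s: float, output_tokens: int
) -> float:
    """Apply the claimed gains to hypothetical baseline numbers."""
    new_ttft_s = baseline_ttft_s / TTFT_SPEEDUP
    new_tokens_per_s = baseline_tokens_per_s * (1 + OUTPUT_SPEED_GAIN)
    return response_time(new_ttft_s, new_tokens_per_s, output_tokens)


if __name__ == "__main__":
    # Hypothetical baseline: 1.0 s to first token, 100 tokens/s, 290-token reply.
    before = response_time(1.0, 100.0, 290)
    after = projected_response_time(1.0, 100.0, 290)
    print(f"baseline: {before:.2f} s, projected: {after:.2f} s")
```

The first-token gain matters most for short replies, while the output-speed gain dominates as responses get longer.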

Developers can access Gemini 3.1 Flash-Lite in preview through the Gemini API, available in AI Studio and Vertex AI starting March 3. The model offers more customization, letting developers adjust how the AI "thinks" for tasks that require in-depth reasoning, UI generation, and simulation creation. Early testers from companies including Latitude, Cartwheel, and Whering have reported positive experiences with the new model.