Google Unveils Gemini 2.5 Model Series with 30% Boost in AI Inference Performance
1 day ago / Read about 0 minute
Author:小编   

Google has recently introduced the Gemini 2.5 Hybrid Inference Model Series, comprising three distinct versions: 2.5 Pro, Flash, and Flash-Lite. This series is designed to significantly enhance the efficiency and performance of AI inference models. Notably, the preview version of Gemini 2.5 Flash-Lite stands apart for its impressive speed and cost-effectiveness, offering seamless access to tools such as Google Search and code execution. Boasting multimodal input capabilities and support for context lengths of up to 1 million tokens, this model is ideally suited for high-throughput tasks, including translation and classification. The Gemini 2.5 series possesses the ability to "think" before responding, leading to a deeper understanding of prompts, effective decomposition of complex tasks, and strategic planning of answers, ultimately resulting in improved inference accuracy.