#38
Gemini 2.0 Flash-Lite
gemini-2.0-flash-lite

Gemini 2.0 Flash-Lite is a cost-efficient model optimized for high-volume, latency-sensitive tasks. Outperforming Gemini 1.5 Flash on most benchmarks while maintaining similar speed and cost, it handles real-time translation, content classification, and lightweight assistant tasks at scale with minimal resource usage.

Performance Metrics

Helpfulness
Instruction Following
Comprehension
Empathy
Creative Writing
69.60
Helpfulness
74
Empathy
65
Instruction Following
76
Creative Writing
58
Comprehension
75
Speed
Avg 94 tok/s

Model Specifications

Release Date
February 2025
Lab
Google
Type
Proprietary
Context Size
1M
Max Output Tokens
8.2K
Cost per 1M tokens
$0.075 / $0.30
Model Inputs
Text, Audio, Images, Video
Model Outputs
Text
Tool Calling
Enabled

Compare With Similar Models