Pricing models
Priced to help you bring your app to the world
Gemini 2.0 Flash Available now
Our production-ready model with higher rate limits, enhanced performance, and simplified pricing.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
15 RPM (requests per minute)
1 million TPM (tokens per minute)
1.5K RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Free of charge, up to 1 million tokens of storage per hour
Available February 24, 2025
Tuning price
Not available
Grounding with Google Search
500 QPD (queries per day)
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate limits
Tier 1: 2,000 RPM (requests per minute) / 4 million TPM (tokens per minute)
Tier 2: 10,000 RPM (requests per minute) / 10 million TPM (tokens per minute)
Input Pricing
$0.10 / 1 million tokens (text / image / video)
$0.70 / 1 million tokens (audio)
Output Pricing
$0.40 / 1 million tokens (text)
Context caching
$0.025 / 1 million tokens (text / image / video)
$0.175 / 1 million tokens (audio)
Available February 24, 2025
Context caching (storage)
Tuning price
Not available
Grounding with Google Search
Tier 1: for up to 5K requests per day
Tier 2: for up to 10K requests per day
First 1,500 grounding requests per day are free of charge; additional requests are billed at $35 / 1K
Used to improve our products
Gemini 2.0 Flash-Lite In preview, pricing effective at GA
Our cost-optimized model for large scale text output use cases. Now in preview.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
30 RPM (requests per minute)
1 million TPM (tokens per minute)
1.5K RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Free of charge, up to 1 million tokens of storage per hour
Tuning price
Not available
Grounding with Google Search
500 QPD (queries per day)
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate limits
Tier 1: 4,000 RPM (requests per minute) / 4 million TPM (tokens per minute)
Tier 2: 60,000 RPM (requests per minute) / 10 million TPM (tokens per minute)
Input Pricing
$0.075 / 1 million tokens (text / image / video / audio)
Output Pricing
$0.30 / 1 million tokens (text)
Context caching
$0.01875 / 1 million tokens
Context caching (storage)
$1.00 / 1 million tokens per hour
Tuning price
Not available
Grounding with Google Search
Up to 1,500 grounding requests per day free of charge; additional requests are billed at $35 / 1K
Used to improve our products
Gemini 1.5 Flash Available now
Our fastest multimodal model with great performance for diverse, repetitive tasks and a 1 million token context window. Now generally available for production use.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
15 RPM (requests per minute)
1 million TPM (tokens per minute)
1,500 RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Free of charge, up to 1 million tokens of storage per hour
Tuning price
Input/output prices are the same for tuned models. Tuning service is free of charge.
Grounding with Google Search
Not available
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate limits
2,000 RPM (requests per minute)
4 million TPM (tokens per minute)
Prompts up to 128k tokens
Input Pricing
$0.075 / 1 million tokens
output Pricing
$0.30 / 1 million tokens
Context Caching
$0.01875 / 1 million tokens
Prompts longer than 128k
Input Pricing
$0.15 / 1 million tokens
output Pricing
$0.60 / 1 million tokens
Context Caching
$0.0375 / 1 million tokens
Context caching (storage)
$1.00 / 1 million tokens per hour
Tuning price
Input/output prices are the same for tuned models. Tuning service is free of charge.
Grounding with Google Search
$35 / 1K grounding requests (for up to 5K requests per day).
Used to improve our products
Gemini 1.5 Flash-8B Available now
Our smallest model for lower intelligence use cases with a 1 million token context window. Now generally available for production use.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
15 RPM (requests per minute)
1 million TPM (tokens per minute)
1,500 RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Free of charge, up to 1 million tokens of storage per hour
Tuning price
Input/output prices are the same for tuned models. Tuning service is free of charge.
Grounding with Google Search
Not available
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate limits
4,000 RPM (requests per minute)
4 million TPM (tokens per minute)
Prompts up to 128k tokens
Input Pricing
$0.0375 / 1 million tokens
output Pricing
$0.15 / 1 million tokens
Context Caching
$0.01 / 1 million tokens
Prompts longer than 128k
Input Pricing
$0.075 / 1 million tokens
output Pricing
$0.30 / 1 million tokens
Context Caching
$0.02 / 1 million tokens
Context caching (storage)
$0.25 / 1 million tokens per hour
Tuning price
Input/output prices are the same for tuned models. Tuning service is free of charge.
Grounding with Google Search
$35 / 1K grounding requests (for up to 5K requests per day).
Used to improve our products
Gemini 1.5 Pro Available now
Our next-generation model with a breakthrough 2 million token context window. Now generally available for production use.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
2 RPM (requests per minute)
32,000 TPM (tokens per minute)
50 RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Not applicable
Tuning price
Not available
Grounding with Google Search
Not available
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate Limits
1,000 RPM (requests per minute)
4 million TPM (tokens per minute)
Prompts up to 128k tokens
Input Pricing
$1.25 / 1 million tokens
output Pricing
$5.00 / 1 million tokens
Context Caching
$0.3125 / 1 million tokens
Prompts longer than 128k
Input Pricing
$2.50 / 1 million tokens
output Pricing
$10.00 / 1 million tokens
Context Caching
$0.625 / 1 million tokens
Context caching (storage)
$4.50 / 1 million tokens per hour
Tuning price
Not available
Grounding with Google Search
$35 / 1K grounding requests (for up to 5K requests per day).
Used to improve our products
Gemini 1.0 Pro Available now
Our first-generation model offering only text and image reasoning. Generally available for production use.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
15 RPM (requests per minute)
32,000 TPM (tokens per minute)
1,500 RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Not applicable
Tuning price
Not available
Grounding with Google Search
Not available
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate Limits
360 RPM (requests per minute)
120,000 TPM (tokens per minute)
30,000 RPD (requests per day)
Input Pricing
$0.50 / 1 million tokens
Output Pricing
$1.50 / 1 million tokens
Context caching
Not available
Tuning price
Not available
Grounding with Google Search
Not available
Used to improve our products
Imagen 3 Available now
Our highest quality text-to-image model available in the Gemini API.
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate limits
20 RPM (requests per minute)
Pricing
$0.03 / image
Tuning price
Not available
Used to improve our products
Text Embedding 004 Available now
Our state-of-the-art text embedding model.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
1,500 RPM (requests per minute)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Not applicable
Tuning price
Not applicable
Used to improve our products