Question 1

How much does the Gemini API cost?

Accepted Answer

Gemini 3.1 Pro is $1.25 per million input tokens and $10 per million output. Gemini 3.5 Flash is $0.30 / $2.50 per million — one of the best price/intelligence ratios available. Gemma 4 31B is the cheapest Google-hosted option at $0.05 / $0.15.

Question 2

Is Gemini cheaper than GPT-5?

Accepted Answer

Gemini 3.1 Pro matches GPT-5.5 exactly on input price ($1.25/1M) and output price ($10/1M). Gemini 3.5 Flash is significantly cheaper than GPT-5.4 mini for most workloads — especially long-context and vision.

Question 3

Does Gemini have a free tier?

Accepted Answer

Yes. Google AI Studio offers a free tier with rate limits suitable for prototyping. Production traffic moves to paid pricing once you exceed the free quota. The free tier uses your data for training; the paid tier does not.

Question 4

When should I use Gemini Flash vs Pro?

Accepted Answer

Use Gemini 3.5 Flash for long-context retrieval (1M context for $0.30/1M), vision-heavy workloads, and high-volume chat. Use Gemini 3.1 Pro when you need top-tier reasoning — it ranks near GPT-5.5 and Claude Sonnet on hard benchmarks.

Question 5

How is Gemini priced for long context?

Accepted Answer

Unlike older Gemini 1.5, Gemini 3.x is flat-priced regardless of context length — you pay the same per-token rate whether you send 1K or 1M tokens. This makes Gemini 3.5 Flash dramatically cheaper than alternatives for long-document RAG.

Model	Input / 1M	Output / 1M	Context
Gemini 3.1 Pro Google	$1.25	$10.00	1000K
Gemini 3.5 Flash Google	$0.30	$2.50	1000K
Gemini 2.5 Pro Google	$1.25	$10.00	2000K
Gemini 2.5 Flash Google	$0.30	$2.50	1000K
Gemma 4 31B Google	$0.05	$0.15	256K

Gemini Pricing Calculator (3.1 Pro, 3.5 Flash, Gemma)

Which Gemini model should you use?

Gemini vs GPT-5 vs Claude pricing

How to lower your Gemini bill

Frequently asked questions