Description

️ 🖼Tool Name:
Google Gemini API

🔖 Tool Category:
Google AI API - Multipurpose models (text, images, audio, video) from Google AI.

️ ✏What does this tool offer?

  • Access to advanced Gemini models (Pro, Flash, Flash-Lite) to process text, images, video, and audio.

  • Support chain-of-thought capabilities and complex analysis.

  • Generate images (Imagen 3, 4) and videos (Veo 2, Veo 3).

  • High-quality text-to-speech (TTS).

  • Grounding capabilities with Google searches for increased accuracy.

What does the tool actually deliver based on user experience?

  • Provides very high performance in typing and reasoning.

  • Facilitates the integration of AI capabilities into applications (chat, websites, analytical services).

  • It has great flexibility in pricing and plans (from Free Tier to Enterprise).

  • Supports millions of tokens for long context, making it ideal for complex research or large projects.

🤖 Does it include automation?
Yes - it offers extensive automation:

  • Process text, images, and sounds directly through the API.

  • Automatically optimize responses with Batch Mode to reduce cost by 50%.

  • Automatic Grounding via Google Search to return results backed by sources.

💰 Pricing Model:

  • Free Tier: Available to all users, but with limited rate limits.

  • Paid Tier: Different rates depending on the model and type of input (text/audio/video/image).


🆓 Free Plan Details (Free Tier):

  • Google AI Studio is completely free to use (in all available countries).

  • Access to all models with low usage limits (useful for testing).

  • Grounding support is free up to 500 requests/day for some models.


💳 Paid Plan Details (top models):

1. Gemini 2.5 Pro

  • Input: $1.25 per million tokens (≤ 200k), or $2.50 (> 200k).

  • Output: $10 (≤ 200k), or $15 (> 200k).

  • Context Caching: From $0.31 up to $0.625.

2. Gemini 2.5 Flash

  • Input: $0.30 (text/image/video) - $1.00 (audio).

  • Output: $2.50.

  • Context Caching: $0.075 (text/image/video), $0.25 (audio).

3. Gemini 2.5 Flash-Lite

  • Input: $0.10 (text/image/video) - $0.30 (audio).

  • Output: $0.40.

  • Context Caching: From $0.025 up to $0.125.

4. Gemini 2.0 Flash

  • Input: $0.10 (text/image/video) - $0.70 (audio).

  • Output: $0.40.

  • Image Generation: $0.039 per image (Imagen 4).

  • Live API: Text Input $0.35, Text Output $1.50.

5. Gemini 1.5 Pro

  • Input: $1.25 (≤128k) or $2.50 (>128k).

  • Output: $5 (≤128k) or $10 (>128k).

  • Huge context window (2 million Token).

6. Imagen 4 (Image Generation)

  • Fast: $0.02 per image.

  • Standard: $0.04.

  • Ultra: $0.06.

7. Veo 3 (Video Generation)

  • 0.75 per second (default, video with audio).

  • Veo 3 Fast: $0.40 per second.

8. Gemini Embedding

  • Input: $0.15 per million Token.

9. TTS Plans (Text-to-Speech Preview)

  • Gemini 2.5 Flash TTS: $0.50 text input, $10 voice output.

  • Gemini 2.5 Pro TTS: $1 text input, $20 voice output.


🧭 Access Method:

  • Via Google AI Studio (free for test use).

  • Via Gemini Developer API (paid by consumption).

  • Also available through Vertex AI at slightly different prices.

🔗 Experience Link:
ai.google.dev

Pricing Details

Prices for Gemini's models and services vary depending on usage. Gemini 2.5 Pro costs $1.25 per million tokens when inputting up to 200k, or $2.50 when exceeding it, while output is $10 (≤200k) or $15 (>200k), with Context Caching costing between $0.31 and $0.625. Gemini 2.5 Flash has an input price of $0.30 for text/image/video and $1 for audio, and an output price of $2.50, with Context Caching at $0.075 for text/image/video and $0.25 for audio. The lighter Flash-Lite version offers cheaper input ($0.10-0.30) with $0.40 output and Context Caching between $0.025 and $0.125. For Gemini 2.0 Flash, input pricing starts at $0.10 for text/images/video and goes up to $0.70 for audio and $0.40 for output, with image generation via Imagen 4 ($0.039 per image) and Live API at $0.35 input and $1.50 output. The Gemini 1.5 Pro model offers $1.25 (≤128k) or $2.50 (>128k) input, with $5-10 output and a huge context window of up to 2 million tokens. Image generation services via Imagen 4 start at $0.02 (Fast) and go up to $0.06 (Ultra), while video via Veo 3 costs $0.75/second or $0.40 in Fast mode. Finally, Gemini Embedding costs $0.15 per million tokens, and TTS plans range from $0.50-$1 text input and $10-$20 audio output depending on the plan (Flash TTS or Pro TTS).