截止20250826 Google Gemini的modellists 模型列表

Google AI模型库概览 Google提供70多个AI模型，涵盖多模态、文本生成、图像处理和问答系统等。主要包括： Gemini系列：多模态大语言模型 1.5/2.0/2.5版本迭代演进 Pro/Flash/Flash-Lite不同规模支持百万级tokens上下文 Gemma系列：轻量级开源模型 1B到27B参数规格适用于资源受限场景专业模型 Imagen：文本生成图像 AQA：溯源问答

伊玛目的门徒

530人浏览 · 2025-08-26 09:50:08

伊玛目的门徒 · 2025-08-26 09:50:08 发布

核心摘要

这份列表包含了 70个模型，涵盖了文本生成、多模态理解、文本嵌入、图像生成和专业问答等多种类型。模型主要来自以下几个系列：

Gemini 系列：核心的多模态大语言模型（如 Gemini 1.5 Pro, Gemini 1.5 Flash, Gemini 2.0, Gemini 2.5）。
Gemma 系列：Google 推出的开源轻量级语言模型。
Embedding 系列：用于生成文本向量表示的模型。
Imagen 系列：Google 的文本生成图像模型。
专业模型：如用于溯源问答的 AQA 模型和用于教育的 LearnLM。

按模型类型和系列分类解读

1. 多模态大语言模型 (Gemini 系列)

这是列表中最主要的部分，型号繁多，可分为几个子类：

Gemini 1.5 系列：
- Pro 型号 (gemini-1.5-pro-*)：均衡的性能，主打超长上下文（最高达 2M tokens）。包含稳定版（如 -002, -001）和指向最新版本的别名（-latest）。
- Flash 型号 (gemini-1.5-flash-*)：更轻量、更快速的版本，上下文长度稍短（1M tokens）。同样包含稳定版和别名。
- Flash-8B 型号 (gemini-1.5-flash-8b-*)：最小、成本最低的 Flash 模型，参数量为 80 亿。
Gemini 2.0 系列：
- 主要集中在 Flash 和 Flash-Lite 型号上，上下文长度统一为 1M tokens。包含稳定版（-001）、实验版（-exp）和预览版（-preview）。值得注意的是，有一些 2.0 的实验版实际上描述的是 2.5 的功能，这可能意味着内部版本的迭代。
Gemini 2.5 系列（最新一代）：
- 重大升级：上下文长度标准提升至 1,048,576 tokens (~1M)，输出长度大幅增加至 65,536 tokens。
- 核心特性：普遍支持 thinking（思考） 功能（允许模型进行更深度的推理）和 batchGenerateContent（批量处理）方法。
- Pro 型号 (gemini-2.5-pro-*)：高性能版本，有多个预览版和一个稳定的最终版（gemini-2.5-pro，发布于 2025年6月17日）。
- Flash 型号 (gemini-2.5-flash-*)：快速版本，同样支持长上下文和思考功能。
- Flash-Lite 型号 (gemini-2.5-flash-lite-*)：更轻量的版本。
- TTS 预览版 (*-preview-tts)：专门用于文本转语音任务的特殊版本，输入输出限制与其他版本不同。

2. 轻量级语言模型 (Gemma 系列)

这是 Google 的开源模型系列，参数规模较小，适合资源受限的场景。

型号：包含 1B, 4B, 12B, 27B 四种参数规模，上下文长度一般为 32K tokens（27B-it 为 131K）。
特点：名称中的 -it 代表 "Instruction-Tuned"，即经过指令微调，更适合对话交互。此外还有超小规模的 3n-e4b-it 和 3n-e2b-it（40亿和20亿参数）。

3. 文本嵌入模型 (Embedding 系列)

用于将文本转换为数值向量，适用于搜索、聚类、分类等任务。

老型号：embedding-gecko-001, embedding-001。
新型号：text-embedding-004（稳定版），gemini-embedding-001 和实验版 gemini-embedding-exp-*（支持更长的 8K tokens 输入）。

4. 图像生成模型 (Imagen 系列)

文生图模型。

Imagen 3.0：稳定版本 (imagen-3.0-generate-002)。
Imagen 4.0：预览版本 (imagen-4.0-generate-preview-06-06)，代表更先进的图像生成能力。

5. 专业任务模型

AQA (models/aqa)：Attributed Question Answering（归因问答） 模型。其核心功能是根据提供的资料生成答案，并确保答案可溯源，同时会估算“可回答的概率”。使用 generateAnswer 方法而非通用的 generateContent。
LearnLM (learnlm-2.0-flash-experimental)：专为教育场景优化的实验性模型。

{
"models":
[
{
"name": "models/embedding-gecko-001",
"version": "001",
"displayName": "Embedding Gecko",
"description": "Obtain a distributed representation of a text.",
"inputTokenLimit": 1024,
"outputTokenLimit": 1,
"supportedGenerationMethods":
[
"embedText",
"countTextTokens"
]
},
{
"name": "models/gemini-1.5-pro-latest",
"version": "001",
"displayName": "Gemini 1.5 Pro Latest",
"description": "Alias that points to the most recent production (non-experimental) release of Gemini 1.5 Pro, our mid-size multimodal model that supports up to 2 million tokens.",
"inputTokenLimit": 2000000,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-1.5-pro-002",
"version": "002",
"displayName": "Gemini 1.5 Pro 002",
"description": "Stable version of Gemini 1.5 Pro, our mid-size multimodal model that supports up to 2 million tokens, released in September of 2024.",
"inputTokenLimit": 2000000,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-1.5-pro",
"version": "001",
"displayName": "Gemini 1.5 Pro",
"description": "Stable version of Gemini 1.5 Pro, our mid-size multimodal model that supports up to 2 million tokens, released in May of 2024.",
"inputTokenLimit": 2000000,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-1.5-flash-latest",
"version": "001",
"displayName": "Gemini 1.5 Flash Latest",
"description": "Alias that points to the most recent production (non-experimental) release of Gemini 1.5 Flash, our fast and versatile multimodal model for scaling across diverse tasks.",
"inputTokenLimit": 1000000,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-1.5-flash",
"version": "001",
"displayName": "Gemini 1.5 Flash",
"description": "Alias that points to the most recent stable version of Gemini 1.5 Flash, our fast and versatile multimodal model for scaling across diverse tasks.",
"inputTokenLimit": 1000000,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-1.5-flash-002",
"version": "002",
"displayName": "Gemini 1.5 Flash 002",
"description": "Stable version of Gemini 1.5 Flash, our fast and versatile multimodal model for scaling across diverse tasks, released in September of 2024.",
"inputTokenLimit": 1000000,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-1.5-flash-8b",
"version": "001",
"displayName": "Gemini 1.5 Flash-8B",
"description": "Stable version of Gemini 1.5 Flash-8B, our smallest and most cost effective Flash model, released in October of 2024.",
"inputTokenLimit": 1000000,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"createCachedContent",
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-1.5-flash-8b-001",
"version": "001",
"displayName": "Gemini 1.5 Flash-8B 001",
"description": "Stable version of Gemini 1.5 Flash-8B, our smallest and most cost effective Flash model, released in October of 2024.",
"inputTokenLimit": 1000000,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"createCachedContent",
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-1.5-flash-8b-latest",
"version": "001",
"displayName": "Gemini 1.5 Flash-8B Latest",
"description": "Alias that points to the most recent production (non-experimental) release of Gemini 1.5 Flash-8B, our smallest and most cost effective Flash model, released in October of 2024.",
"inputTokenLimit": 1000000,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"createCachedContent",
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-2.5-pro-preview-03-25",
"version": "2.5-preview-03-25",
"displayName": "Gemini 2.5 Pro Preview 03-25",
"description": "Gemini 2.5 Pro Preview 03-25",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.5-flash-preview-05-20",
"version": "2.5-preview-05-20",
"displayName": "Gemini 2.5 Flash Preview 05-20",
"description": "Preview release (April 17th, 2025) of Gemini 2.5 Flash",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.5-flash",
"version": "001",
"displayName": "Gemini 2.5 Flash",
"description": "Stable version of Gemini 2.5 Flash, our mid-size multimodal model that supports up to 1 million tokens, released in June of 2025.",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.5-flash-lite-preview-06-17",
"version": "2.5-preview-06-17",
"displayName": "Gemini 2.5 Flash-Lite Preview 06-17",
"description": "Preview release (June 11th, 2025) of Gemini 2.5 Flash-Lite",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.5-pro-preview-05-06",
"version": "2.5-preview-05-06",
"displayName": "Gemini 2.5 Pro Preview 05-06",
"description": "Preview release (May 6th, 2025) of Gemini 2.5 Pro",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.5-pro-preview-06-05",
"version": "2.5-preview-06-05",
"displayName": "Gemini 2.5 Pro Preview",
"description": "Preview release (June 5th, 2025) of Gemini 2.5 Pro",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.5-pro",
"version": "2.5",
"displayName": "Gemini 2.5 Pro",
"description": "Stable release (June 17th, 2025) of Gemini 2.5 Pro",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.0-flash-exp",
"version": "2.0",
"displayName": "Gemini 2.0 Flash Experimental",
"description": "Gemini 2.0 Flash Experimental",
"inputTokenLimit": 1048576,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"bidiGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-2.0-flash",
"version": "2.0",
"displayName": "Gemini 2.0 Flash",
"description": "Gemini 2.0 Flash",
"inputTokenLimit": 1048576,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-2.0-flash-001",
"version": "2.0",
"displayName": "Gemini 2.0 Flash 001",
"description": "Stable version of Gemini 2.0 Flash, our fast and versatile multimodal model for scaling across diverse tasks, released in January of 2025.",
"inputTokenLimit": 1048576,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-2.0-flash-exp-image-generation",
"version": "2.0",
"displayName": "Gemini 2.0 Flash (Image Generation) Experimental",
"description": "Gemini 2.0 Flash (Image Generation) Experimental",
"inputTokenLimit": 1048576,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"bidiGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-2.0-flash-lite-001",
"version": "2.0",
"displayName": "Gemini 2.0 Flash-Lite 001",
"description": "Stable version of Gemini 2.0 Flash-Lite",
"inputTokenLimit": 1048576,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-2.0-flash-lite",
"version": "2.0",
"displayName": "Gemini 2.0 Flash-Lite",
"description": "Gemini 2.0 Flash-Lite",
"inputTokenLimit": 1048576,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-2.0-flash-preview-image-generation",
"version": "2.0",
"displayName": "Gemini 2.0 Flash Preview Image Generation",
"description": "Gemini 2.0 Flash Preview Image Generation",
"inputTokenLimit": 32768,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2
},
{
"name": "models/gemini-2.0-flash-lite-preview-02-05",
"version": "preview-02-05",
"displayName": "Gemini 2.0 Flash-Lite Preview 02-05",
"description": "Preview release (February 5th, 2025) of Gemini 2.0 Flash-Lite",
"inputTokenLimit": 1048576,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-2.0-flash-lite-preview",
"version": "preview-02-05",
"displayName": "Gemini 2.0 Flash-Lite Preview",
"description": "Preview release (February 5th, 2025) of Gemini 2.0 Flash-Lite",
"inputTokenLimit": 1048576,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 40,
"maxTemperature": 2
},
{
"name": "models/gemini-2.0-pro-exp",
"version": "2.5-exp-03-25",
"displayName": "Gemini 2.0 Pro Experimental",
"description": "Experimental release (March 25th, 2025) of Gemini 2.5 Pro",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.0-pro-exp-02-05",
"version": "2.5-exp-03-25",
"displayName": "Gemini 2.0 Pro Experimental 02-05",
"description": "Experimental release (March 25th, 2025) of Gemini 2.5 Pro",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-exp-1206",
"version": "2.5-exp-03-25",
"displayName": "Gemini Experimental 1206",
"description": "Experimental release (March 25th, 2025) of Gemini 2.5 Pro",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.0-flash-thinking-exp-01-21",
"version": "2.5-preview-05-20",
"displayName": "Gemini 2.5 Flash Preview 05-20",
"description": "Preview release (April 17th, 2025) of Gemini 2.5 Flash",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.0-flash-thinking-exp",
"version": "2.5-preview-05-20",
"displayName": "Gemini 2.5 Flash Preview 05-20",
"description": "Preview release (April 17th, 2025) of Gemini 2.5 Flash",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.0-flash-thinking-exp-1219",
"version": "2.5-preview-05-20",
"displayName": "Gemini 2.5 Flash Preview 05-20",
"description": "Preview release (April 17th, 2025) of Gemini 2.5 Flash",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/gemini-2.5-flash-preview-tts",
"version": "gemini-2.5-flash-exp-tts-2025-05-19",
"displayName": "Gemini 2.5 Flash Preview TTS",
"description": "Gemini 2.5 Flash Preview TTS",
"inputTokenLimit": 8192,
"outputTokenLimit": 16384,
"supportedGenerationMethods":
[
"countTokens",
"generateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2
},
{
"name": "models/gemini-2.5-pro-preview-tts",
"version": "gemini-2.5-pro-preview-tts-2025-05-19",
"displayName": "Gemini 2.5 Pro Preview TTS",
"description": "Gemini 2.5 Pro Preview TTS",
"inputTokenLimit": 8192,
"outputTokenLimit": 16384,
"supportedGenerationMethods":
[
"countTokens",
"generateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2
},
{
"name": "models/learnlm-2.0-flash-experimental",
"version": "2.0",
"displayName": "LearnLM 2.0 Flash Experimental",
"description": "LearnLM 2.0 Flash Experimental",
"inputTokenLimit": 1048576,
"outputTokenLimit": 32768,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2
},
{
"name": "models/gemma-3-1b-it",
"version": "001",
"displayName": "Gemma 3 1B",
"inputTokenLimit": 32768,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 64
},
{
"name": "models/gemma-3-4b-it",
"version": "001",
"displayName": "Gemma 3 4B",
"inputTokenLimit": 32768,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 64
},
{
"name": "models/gemma-3-12b-it",
"version": "001",
"displayName": "Gemma 3 12B",
"inputTokenLimit": 32768,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 64
},
{
"name": "models/gemma-3-27b-it",
"version": "001",
"displayName": "Gemma 3 27B",
"inputTokenLimit": 131072,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 64
},
{
"name": "models/gemma-3n-e4b-it",
"version": "001",
"displayName": "Gemma 3n E4B",
"inputTokenLimit": 8192,
"outputTokenLimit": 2048,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 64
},
{
"name": "models/gemma-3n-e2b-it",
"version": "001",
"displayName": "Gemma 3n E2B",
"inputTokenLimit": 8192,
"outputTokenLimit": 2048,
"supportedGenerationMethods":
[
"generateContent",
"countTokens"
],
"temperature": 1,
"topP": 0.95,
"topK": 64
},
{
"name": "models/gemini-2.5-flash-lite",
"version": "001",
"displayName": "Gemini 2.5 Flash-Lite",
"description": "Stable verion of Gemini 2.5 Flash-Lite, released in July of 2025",
"inputTokenLimit": 1048576,
"outputTokenLimit": 65536,
"supportedGenerationMethods":
[
"generateContent",
"countTokens",
"createCachedContent",
"batchGenerateContent"
],
"temperature": 1,
"topP": 0.95,
"topK": 64,
"maxTemperature": 2,
"thinking": true
},
{
"name": "models/embedding-001",
"version": "001",
"displayName": "Embedding 001",
"description": "Obtain a distributed representation of a text.",
"inputTokenLimit": 2048,
"outputTokenLimit": 1,
"supportedGenerationMethods":
[
"embedContent"
]
},
{
"name": "models/text-embedding-004",
"version": "004",
"displayName": "Text Embedding 004",
"description": "Obtain a distributed representation of a text.",
"inputTokenLimit": 2048,
"outputTokenLimit": 1,
"supportedGenerationMethods":
[
"embedContent"
]
},
{
"name": "models/gemini-embedding-exp-03-07",
"version": "exp-03-07",
"displayName": "Gemini Embedding Experimental 03-07",
"description": "Obtain a distributed representation of a text.",
"inputTokenLimit": 8192,
"outputTokenLimit": 1,
"supportedGenerationMethods":
[
"embedContent",
"countTextTokens",
"countTokens"
]
},
{
"name": "models/gemini-embedding-exp",
"version": "exp-03-07",
"displayName": "Gemini Embedding Experimental",
"description": "Obtain a distributed representation of a text.",
"inputTokenLimit": 8192,
"outputTokenLimit": 1,
"supportedGenerationMethods":
[
"embedContent",
"countTextTokens",
"countTokens"
]
},
{
"name": "models/gemini-embedding-001",
"version": "001",
"displayName": "Gemini Embedding 001",
"description": "Obtain a distributed representation of a text.",
"inputTokenLimit": 2048,
"outputTokenLimit": 1,
"supportedGenerationMethods":
[
"embedContent",
"countTextTokens",
"countTokens"
]
},
{
"name": "models/aqa",
"version": "001",
"displayName": "Model that performs Attributed Question Answering.",
"description": "Model trained to return answers to questions that are grounded in provided sources, along with estimating answerable probability.",
"inputTokenLimit": 7168,
"outputTokenLimit": 1024,
"supportedGenerationMethods":
[
"generateAnswer"
],
"temperature": 0.2,
"topP": 1,
"topK": 40
},
{
"name": "models/imagen-3.0-generate-002",
"version": "002",
"displayName": "Imagen 3.0",
"description": "Vertex served Imagen 3.0 002 model",
"inputTokenLimit": 480,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"predict"
]
},
{
"name": "models/imagen-4.0-generate-preview-06-06",
"version": "01",
"displayName": "Imagen 4 (Preview)",
"description": "Vertex served Imagen 4.0 model",
"inputTokenLimit": 480,
"outputTokenLimit": 8192,
"supportedGenerationMethods":
[
"predict"
]
}
],

}