Documentation Index
Fetch the complete documentation index at: https://docs.parallel.ai/llms.txt
Use this file to discover all available pages before exploring further.
For AI agents: a documentation index is available at https://docs.parallel.ai/llms.txt. The full text of all docs is at https://docs.parallel.ai/llms-full.txt. You may also fetch any page as Markdown by appending
The Parallel Search API is available in the Google Gemini Enterprise Agent Platform (formerly Vertex AI) as an external grounding provider. Use it to ground Gemini model responses with up-to-date context from the public web.
.md to its URL or sending Accept: text/markdown.Grounding with Parallel on the Gemini Enterprise Agent Platform is currently in Preview (Pre-GA) per Google’s Service Specific Terms.
| Google Cloud Marketplace | Bring Your Own Key (BYOK) | |
|---|---|---|
| Setup | Subscribe via Google Cloud Marketplace | Get an API key from Parallel Platform |
| Authentication | Automatic — no API key needed | API key passed in each request |
| Billing | Consolidated through Google Cloud | Billed through Parallel |
| Quota | 200 prompts per minute | 200 prompts per minute |
Use cases
- Using web data for information completion or enrichment.
- Multi-hop agents that require deeper web searches for complex questions.
- Building APIs that integrate web search data.
- Employee-facing assistants for up-to-date analysis and reporting.
- Consumer apps (retail, travel) supporting informed purchase decisions.
- Automated agents (e.g., news analysis, KYC checks).
- Vertical agents (sales, coding, finance) fetching the latest context from the web.
Example
Who won the 2025 Las Vegas F1 Grand Prix?| Without Grounding | With Grounding |
|---|---|
| The 2025 Las Vegas Grand Prix has not happened yet. The race is scheduled to take place on the weekend of November 20-22, 2025. Therefore, the winner is currently unknown. | The winner of the 2025 Las Vegas F1 Grand Prix was Max Verstappen of Red Bull Racing. The race took place on November 22, 2025. Sources: domain1.com, domain2.com, … |
Supported models
The following models support Grounding with Parallel web search:- Gemini 2.5 Flash (
gemini-2.5-flash) - Gemini 2.5 Flash-Lite (
gemini-2.5-flash-lite) - Gemini 2.5 Pro (
gemini-2.5-pro) - Gemini 3.1 Pro (
gemini-3.1-pro-preview) - Gemini 3.1 Flash-Lite (
gemini-3.1-flash-lite)
Setup
- Google Cloud Marketplace (Recommended)
- Bring Your Own Key
The fastest way to get started is through the Google Cloud Marketplace. This approach requires no API key — authentication is handled automatically through your Google Cloud project.
- Go to the Parallel Web Search listing on Google Cloud Marketplace.
- Click Subscribe.
- Review the pricing, accept the terms of service, and confirm your subscription.
- Ensure the subscription is active in the Google Cloud project you plan to use with Gemini Enterprise.
Vertex AI Studio
You can also use Parallel as a grounding source directly in the Vertex AI Studio UI — no code required. This requires an active Google Cloud Marketplace subscription.- Open Vertex AI Studio in the Google Cloud Console.
- Select a supported Gemini model.
- In the grounding configuration, select Parallel Web Search as the grounding source.
- Enter your prompt and send — the model response will be grounded with web results from Parallel.
Make a grounded request
Use the Gemini REST API to request grounded responses from Gemini:PROJECT_ID: Your Google Cloud project ID.LOCATION: The region to process the request (e.g.,us-central1). Omit from the endpoint to use the global endpoint.MODEL_ID: The Gemini model to use (e.g.,gemini-2.5-flash).
- Google Cloud Marketplace
- Bring Your Own Key
No
api_key field is needed when using the Marketplace subscription:If both a Marketplace subscription and an API key are present in a request, the API key takes precedence.
Configuration options
AllcustomConfigs fields are optional. For best performance, use defaults unless you have specific requirements.
| Parameter | Default | Range | Description |
|---|---|---|---|
max_results | 10 | 1–20 | Number of search results used for grounding |
excerpts.max_chars_per_result | 30,000 | 1,000–100,000 | Maximum characters per excerpt |
excerpts.max_chars_total | 100,000 | 1,000–1,000,000 | Maximum total excerpt characters |
source_policy.include_domains | — | Up to 10 | Only return results from these domains |
source_policy.exclude_domains | — | Up to 10 | Exclude results from these domains |
mode | one-shot | one-shot | fast | Search mode. Use fast to optimize for latency. |
Quota
The default quota is 200 prompts per minute. If you need higher rate limits, contact your Google account team (Marketplace) orsupport@parallel.ai (BYOK) with your use case and requirements.
Billing
Using Gemini with Parallel incurs charges from both Gemini token consumption and use of Parallel’s Search API.- Google Cloud Marketplace: Search API charges are consolidated into your Google Cloud billing.
- Bring Your Own Key: Search API charges are billed through Parallel’s pricing.