Google Gemini Enterprise

The Parallel Search API is available in the Google Gemini Enterprise Agent Platform (formerly Vertex AI) as an external grounding provider. Use it to ground Gemini model responses with up-to-date context from the public web.

Grounding with Parallel on the Gemini Enterprise Agent Platform is currently in Preview (Pre-GA) per Google’s Service Specific Terms.

There are two ways to get started:

	Google Cloud Marketplace	Bring Your Own Key (BYOK)
Setup	Subscribe via Google Cloud Marketplace	Get an API key from Parallel Platform
Authentication	Automatic — no API key needed	API key passed in each request
Billing	Consolidated through Google Cloud	Billed through Parallel
Quota	200 prompts per minute	200 prompts per minute

Read Google’s official documentation here.

Use cases

Using web data for information completion or enrichment.
Multi-hop agents that require deeper web searches for complex questions.
Building APIs that integrate web search data.
Employee-facing assistants for up-to-date analysis and reporting.
Consumer apps (retail, travel) supporting informed purchase decisions.
Automated agents (e.g., news analysis, KYC checks).
Vertical agents (sales, coding, finance) fetching the latest context from the web.

Example

Who won the 2025 Las Vegas F1 Grand Prix?

Without Grounding	With Grounding
The 2025 Las Vegas Grand Prix has not happened yet. The race is scheduled to take place on the weekend of November 20-22, 2025. Therefore, the winner is currently unknown.	The winner of the 2025 Las Vegas F1 Grand Prix was Max Verstappen of Red Bull Racing. The race took place on November 22, 2025. Sources: domain1.com, domain2.com, …

Supported models

The following models support Grounding with Parallel web search:

Gemini 2.5 Flash (gemini-2.5-flash)
Gemini 2.5 Flash-Lite (gemini-2.5-flash-lite)
Gemini 2.5 Pro (gemini-2.5-pro)
Gemini 3.1 Pro (gemini-3.1-pro-preview)
Gemini 3.1 Flash-Lite (gemini-3.1-flash-lite)

Setup

Google Cloud Marketplace (Recommended)
Bring Your Own Key

The fastest way to get started is through the Google Cloud Marketplace. This approach requires no API key — authentication is handled automatically through your Google Cloud project.

Go to the Parallel Web Search listing on Google Cloud Marketplace.
Click Subscribe.
Review the pricing, accept the terms of service, and confirm your subscription.
Ensure the subscription is active in the Google Cloud project you plan to use with Gemini Enterprise.

Once subscribed, you can start making grounded requests immediately — no API key is needed in your request body.

Vertex AI Studio

You can also use Parallel as a grounding source directly in the Vertex AI Studio UI — no code required. This requires an active Google Cloud Marketplace subscription.

Open Vertex AI Studio in the Google Cloud Console.
Select a supported Gemini model.
In the grounding configuration, select Parallel Web Search as the grounding source.
Enter your prompt and send — the model response will be grounded with web results from Parallel.

Vertex AI Studio is a great way to experiment with grounded responses before integrating via the API.

Make a grounded request

Use the Gemini REST API to request grounded responses from Gemini:

POST https://LOCATION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID:generateContent

PROJECT_ID: Your Google Cloud project ID.
LOCATION: The region to process the request (e.g., us-central1). Omit from the endpoint to use the global endpoint.
MODEL_ID: The Gemini model to use (e.g., gemini-2.5-flash).

Google Cloud Marketplace
Bring Your Own Key

No api_key field is needed when using the Marketplace subscription:

{
  "contents": [{
    "role": "user",
    "parts": [{
      "text": "MODEL_PROMPT_TEXT"
    }]
  }],
  "tools": [{
    "parallelAiSearch": {
      "customConfigs": {
        "source_policy": {
          "exclude_domains": ["EXCLUDE_DOMAINS"],
          "include_domains": ["INCLUDE_DOMAINS"]
        },
        "excerpts": {
          "max_chars_per_result": MAX_CHARS_PER_RESULT,
          "max_chars_total": MAX_CHARS_TOTAL
        },
        "max_results": MAX_RESULTS,
        "mode": "MODE"
      }
    }
  }],
  "model": "projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID"
}

Include your API key in the parallelAiSearch object:

{
  "contents": [{
    "role": "user",
    "parts": [{
      "text": "MODEL_PROMPT_TEXT"
    }]
  }],
  "tools": [{
    "parallelAiSearch": {
      "api_key": "PARALLEL_API_KEY",
      "customConfigs": {
        "source_policy": {
          "exclude_domains": ["EXCLUDE_DOMAINS"],
          "include_domains": ["INCLUDE_DOMAINS"]
        },
        "excerpts": {
          "max_chars_per_result": MAX_CHARS_PER_RESULT,
          "max_chars_total": MAX_CHARS_TOTAL
        },
        "max_results": MAX_RESULTS,
        "mode": "MODE"
      }
    }
  }],
  "model": "projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID"
}

Execute the request:

curl -X POST \
     -H "Authorization: Bearer $(gcloud auth print-access-token)" \
     -H "Content-Type: application/json; charset=utf-8" \
     -d @request.json \
     "https://LOCATION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID:generateContent"

If both a Marketplace subscription and an API key are present in a request, the API key takes precedence.

Configuration options

All customConfigs fields are optional. For best performance, use defaults unless you have specific requirements.

Parameter	Default	Range	Description
`max_results`	10	1–20	Number of search results used for grounding
`excerpts.max_chars_per_result`	30,000	1,000–100,000	Maximum characters per excerpt
`excerpts.max_chars_total`	100,000	1,000–1,000,000	Maximum total excerpt characters
`source_policy.include_domains`	—	Up to 10	Only return results from these domains
`source_policy.exclude_domains`	—	Up to 10	Exclude results from these domains
`mode`	`one-shot`	`one-shot` \| `fast`	Search mode. Use `fast` to optimize for latency.

For guidance on search queries and configuration, see Search API Best Practices.

For a complete working example, see the Vertex AI demo in the Parallel Cookbook.

Quota

The default quota is 200 prompts per minute. If you need higher rate limits, contact your Google account team (Marketplace) or support@parallel.ai (BYOK) with your use case and requirements.

Billing

Using Gemini with Parallel incurs charges from both Gemini token consumption and use of Parallel’s Search API.

Google Cloud Marketplace: Search API charges are consolidated into your Google Cloud billing.
Bring Your Own Key: Search API charges are billed through Parallel’s pricing.

Getting started

Web Tools

Web Agents

Developer Tools

Platform Integrations

Data Integrations

Additional Resources

Google Gemini Enterprise

Use cases

Example

Supported models

Setup

Vertex AI Studio

Make a grounded request

Configuration options

Quota

Billing

Getting started

Web Tools

Web Agents

Developer Tools

Platform Integrations

Data Integrations

Additional Resources

Documentation Index

​Use cases

​Example

​Supported models

​Setup

​Vertex AI Studio

​Make a grounded request

​Configuration options

​Quota

​Billing

Use cases

Example

Supported models

Setup

Vertex AI Studio

Make a grounded request

Configuration options

Quota

Billing