Parallel is available as a web search engine on OpenRouter, enabling any model to use Parallel-powered web search results. Select Parallel as the engine in the OpenRouterDocumentation Index
Fetch the complete documentation index at: https://docs.parallel.ai/llms.txt
Use this file to discover all available pages before exploring further.
web_search server tool to ground AI responses with real-time web data.
Read OpenRouter’s official documentation here.
How it works
OpenRouter’sweb_search server tool lets the model decide when and how often to search the web. When you select Parallel as the engine, OpenRouter routes the search request through Parallel’s Search API and returns the results as annotations on the model response.
This works with any model on OpenRouter, regardless of the provider.
OpenRouter’s older
web plugin is deprecated. New integrations should use the openrouter:web_search server tool shown below.Quick start
Set theengine to "parallel" in the openrouter:web_search server tool:
Configuration options
Customize the server tool with additional parameters:| Parameter | Type | Default | Description |
|---|---|---|---|
engine | string | — | Set to "parallel" to use Parallel search |
max_results | integer | 5 | Results per search call (range 1–25) |
max_total_results | integer | — | Cap on cumulative results across multiple searches in a single response |
search_context_size | string | medium | low, medium, or high. For Parallel, controls the total characters returned across all results |
allowed_domains | string[] | — | Only include results from these domains |
excluded_domains | string[] | — | Exclude results from these domains |
allowed_domains and excluded_domains are mutually exclusive when using Parallel as the engine — you cannot use both in the same request.Domain filtering
Restrict which domains appear in search results:parallel.ai) or bare domain extensions starting with a period (e.g., .edu, .gov).
Parsing search results
Web search results are returned as annotations in the OpenAI Chat Completion format:Pricing
When using Parallel as the engine on OpenRouter, pricing has two components:- Parallel Search API costs — see Parallel pricing for current rates
- LLM usage costs — charged by OpenRouter for the additional prompt tokens from search results