Control which domains, datasets, and sources are included, excluded, or soft-ranked in search results
Control exactly which sources your search uses. Focus on trusted domains, access specific datasets, exclude unreliable sources, or soft-rank results toward preferred domains.
Source filters accept domains, URLs, dataset names, specific paths, or collections.
Save time with Collections! If you use the same source combinations frequently, create a Collection to bundle them and reference by name.
Type: Array of stringsOnly search within these sources. Can include domains, URLs, or dataset names.Example: ["arxiv.org", "valyu/valyu-pubmed"]
excluded_sources
Type: Array of stringsExclude these sources from results. Same formats as included_sources.Example: ["example.com", "example.org"]
source_biases
Type: Object (domain → integer)Soft-rank sources from -5 (strong demotion) to +5 (strong boost) without hard filtering.Example: {"nasa.gov": 5, "example.com": -3}
If both included_sources and excluded_sources are provided, included_sources takes priority.
When to use source_biases vs hard filters: Use included_sources/excluded_sources when you need strict control over which sources appear. Use source_biases when you want to influence ranking — boosting authoritative domains or demoting low-quality ones — while still allowing all sources to appear if they’re highly relevant.
Source biases let you influence ranking without hard filtering. Unlike included_sources/excluded_sources, biased sources can still appear (or be absent) based on relevance — biases just nudge the ranking.Values range from -5 (strong demotion) to +5 (strong boost). A value of 0 has no effect.