Trusted Sources Library

Where the answers come from.

Pilot5 doesn't generate from training data alone. Every deliberation can pull from 400+ data sources and integrations across 21 knowledge domains 250+ verified institutional sources (government registers, central banks, peer-reviewed databases, official statistics, regulatory codes) plus 140+ specialist API adapters for real-time queries against the same institutions. Allowlist model: no unverified blogs, no scraped forums, no Wikipedia. If a source isn't on this list, the deliberation can't cite it. Every claim it produces is tagged [SOURCED] or [INFERRED] so you always know what's grounded.

400+

data sources and integrations across the platform

250+

verified institutional sources (curated, allowlist)

140+

specialist API adapters for real-time retrieval

5

frontier models in deliberation

21

knowledge domains routed by classifier

0

blocklisted sources — allowlist only

Hybrid retrieval: BM25 + pgvector + Reciprocal Rank Fusion + FlashRank cross-encoder rerank. Top-k = 5, similarity threshold = 0.45 cosine, max context = 4000 chars per source.

How sources are used

Tier 1Primary institutions

Government registers, central banks, peer-reviewed databases, official statistics, regulatory codes. Cited directly. Tagged [SOURCED].

Tier 2Curated specialist databases

Domain adapters: Cochrane, EUR-Lex case law, NICE guidelines, OECD benchmarks, EPO patents. Same provenance discipline as Tier 1.

Tier 3Live web (entity-enriched)

Real-time queries against the open web for fast-moving facts. Treated with stricter scrutiny: claims that don't trace to a verified source are downgraded to [INFERRED].

Retrieval anatomy

When a deliberation needs grounding, the orchestrator runs a four-stage retrieval pipeline against the trusted-sources corpus. Each stage has a measurable parameter; none of them are magic.

  1. 01
    BM25 lexical search. Term-frequency match against full-text indexes — catches exact keyword + phrase matches a vector model would miss. Returns a ranked candidate list.
  2. 02
    pgvector dense search. Cosine similarity between the question embedding and source embeddings stored in Postgres via the vector extension. Catches semantic overlap a keyword search would miss. Threshold: 0.45.
  3. 03
    Reciprocal Rank Fusion. Combines the two ranked lists into a single ordering weighted by reciprocal-of-rank, parameter k = 50. Avoids the “BM25 wins because the question used the source’s exact terminology” bias.
  4. 04
    FlashRank cross-encoder rerank. A local ONNX cross-encoder model re-scores the fused list against the question semantically. Final top_k = 5 sources are passed to the deliberation, capped at 4000 characters of context per source.

How a claim becomes [SOURCED]

Provenance isn’t a self-reported tag. The orchestrator checks each claim against the retrieved evidence and downgrades anything that doesn’t actually trace.

  1. Persona writes a claim. In Round 1, each persona’s analysis annotates every claim with [SOURCED] when it cites a retrieved source, or [INFERRED] when it’s analytical reasoning.
  2. Audit verifier runs. For every [SOURCED] claim, the verifier looks for the cited reference in the actual retrieved corpus for that round. If the cited source is in the corpus and supports the claim, the tag stays.
  3. Fabricated citations are downgraded. Anything tagged [SOURCED] that can’t be traced to retrieved evidence is rewritten to [INFERRED] and logged as a SOURCED_TAGS_DOWNGRADED event. Synthesis only ever sees the post-audit version of the round.
  4. You see the count. Every deliberation’s audit summary reports verified [SOURCED] tags vs. [INFERRED] tags. A high downgrade count is a signal that the panel tried to over-claim grounding — visible in the audit trail, not buried.

Coverage by domain

327 unique institutions

Law & Regulation45

AFAAGCMAgIDARCEPAutorité de la concurrenceBundesministerium der JustizCamera dei DeputatiCaselaw Access Project / Harvard LawCirculaires / DILACMACNILCompetition Bureau CanadaCongress.govCorte CostituzionaleCorte di CassazioneCourt of Justice of the European UnionDG COMPDOJ AntitrustECHR / Council of EuropeEUR-LexEUR-Lex / EUEUR-Lex / European UnionEuropean CommissionEuropean Commission (DG TAXUD)European CouncilFTCGazzetta Ufficiale / IPZSGiustizia AmministrativaHASHUDOCJournal OfficielJournal Officiel / DILALegifranceLegifrance / DILAlegislation.gov.au (Australian Government)Légifrance / DILANormattivaNormattiva / IPZSOHCHR / United NationsOLRCSenato della RepubblicaService-Public.frSupreme Court of the United StatesUS CourtsWIPO

Medical & Life Sciences26

AIFAbioRxivCDCClinicalTrials.gov/NLMCMSCochraneCochrane CollaborationEFSAEMAEMBL-EBIFDAGBIFHASHAS (France)HHSLancet / NEJM / BMJ / JAMANCBINCBI/NLMNICENICE (UK)NIHNIH/NLMPubChem / NCBIU.S. National Library of MedicineUniProt ConsortiumWHO

Tax23

ADGMAEATAgenzia delle EntrateANAFBelastingdienstBMFBOFIPBOFiP / DGFiPCGI / LégifranceCRADGFiPDIFCFTAHMRCIRSMEF — Ministero dell'Economia e delle FinanzeMinistero dell'Economia e delle FinanzeMISAQFCSPF FinancesSwiss Federal Tax AdministrationUrssafZATCA

Finance42

ACPRACPR / Banque de FranceAFMAMFBanca d'ItaliaBank of JapanBanque de FranceBCBSCBBCBUAECFTCCNMV (Comisión Nacional del Mercado de Valores)CONSOBCSACSSF (Commission de Surveillance du Secteur Financier)Deutsche BundesbankEBAECBEIOPAESMAEuropean Central BankFDICFederal ReserveFederal Reserve Bank of St. LouisFinCENFINMAFINRAHKMAInternational Monetary FundIVASSLloyd's of LondonMASNAICOCCOECDOFROSFISECSEC / EDGARSwiss National Bank (SNB)US TreasuryWorld Bank

Privacy & AI Regulation9

AGCOMANSSICalifornia AGCNILEDPBENISAFDPICGarante PrivacyICO

Sanctions & Trade9

BISFATFITCOFACOFSIUS CBPUS Treasury (CFIUS)USTRWTO

Accounting Standards8

ANCFASBIAASBIASBIFRS FoundationISSBOICPCAOB

Science & Engineering28

ACMAllen Institute for AI (AI2)arXiv (Cornell University)arXiv / CornellCORE / Open UniversityCrossrefERICEuropean Patent OfficeGitHubHugging FaceIEEEIETFISONASANature / SpringerNewsAPINISTOEIS FoundationOpenAlexPapers With Code (Meta AI)Schloss Dagstuhl / Leibniz Center for InformaticsScience / AAASSemantic ScholarSemantic Scholar / Allen AISpringer Nature / AAASSSRN / ElsevierStack ExchangeW3C

Economics & Climate8

EEAEPAIMFIPCCNBEROECDSSRN (Elsevier)World Bank

Statistics6

DARESEurostatILOINSEEISTATUN / UNCTAD

Patents & IP6

EPOGoogle / Lens.orgINPIUS Copyright OfficeUSPTOWIPO

Logistics9

COMCECDG MOVEEurostatOECD / ITFOpenSky NetworkTMLUN ComtradeUNECEWorld Bank

Startup & Private Markets18

Bessemer Venture PartnersBpifranceCB InsightsDelaware CourtsDelaware Division of CorporationsEwing Marion Kauffman FoundationInfoCamereKVKOpenCorporatesOpenView PartnersPitchBookPreqinSCORE (Service Corps of Retired Executives)SSRN / ElsevierU.S. Bureau of Labor StatisticsU.S. Securities and Exchange CommissionU.S. Small Business AdministrationY Combinator

Industry Analysts (citation only)14

Bain & CompanyDeloitteEYForrester ResearchGallupGartnerGoogleHarvard Business ReviewHubSpotMcKinsey & CompanySalesforceSHRMTOPO (Gartner)World Economic Forum

Country-specific Business Law67

ABS (Australian Bureau of Statistics)AGCMAGCOMAIFA — Agenzia Italiana del FarmacoANCANPALBAILIIBelgielex/Justel (Belgian Federal Government)BOE (Boletín Oficial del Estado)CanLII (Canadian Legal Information Institute)CBS (Statistics Netherlands)CNDCECCode de commerce / LégifranceCompanies HouseCompanies Registration Office (Ireland)Congress.gov / Library of CongressConseil constitutionnelConseil d'ÉtatCorte CostituzionaleCorte di CassazioneCour de cassationDARES / Ministère du Travaildata.gouv.fr / Etalabdati.gov.it / AgIDDestatisDOLDREES / Ministère de la SantéeCFR / GPOECHR / HUDOCEEOCFCAFederal Register / GPOFederal Trade CommissionFedlex (Swiss Federal Chancellery)France TravailGarante per la protezione dei dati personaligesetze-im-internet.de (BMJ)Giustizia AmministrativaGovInfo / GPOH3CINAILINE (Instituto Nacional de Estadística)InfoCamere / Registro ImpreseINPSINSEEIrish Statute BookISS — Istituto Superiore di SanitàISTATJustice Laws (Government of Canada)Legilux (Luxembourg Government)legislation.gov.uk / TNAMHRSDNational Bank of Belgium (NBB/BNB)NeuRIS / DigitalServiceNew York State SenateNLRBNY LegislatureOIC — Organismo Italiano di ContabilitàOSHARechtspraak.nl (Dutch Judiciary)Regulations.gov / GSARiigi Teataja (Estonian State Gazette)Statistics CanadaUK GovernmentUK ParliamentURSSAFwetten.overheid.nl (Dutch Government)

Cultural & Archival9

BnF / GallicaDPLAEuropeanaGallica (BnF)HathiTrustInternet ArchiveLibrary of CongressNASA ADS / SAOPerseus / Tufts University

How we add a source

  • Manual review — licensing, jurisdiction, freshness, cite-ability.
  • Allowlist only — sources are added, never blocked.
  • Request a source: legal@pilot5.ai