Retail Banking × Technology Data — International / Multilateral · updated 2026-06-04

Finding#1 — Fabricated self-assessment toolkit structure

RLB Citation ID: RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q005

AI's failure:Exposed Fabrication Risk for Retail Banking × Technology Data:Wrong deliverable

What the RLB Specialist Panel found

For Claude Opus 4.7 (web search on)

Question (paraphrased to protect IP)

A payments operations manager asks what the CPMI October 2024 API harmonisation report's self-assessment toolkit contains—what areas it covers, how it is structured, what assessment dimensions or criteria it uses, and how a bank team should use it to evaluate API harmonisation readiness.

RLB's analysis

The model reconstructed a plausible toolkit structure from domain priors and the report's publicly visible scaffold — category names, publication framing — while correctly acknowledging it could not verify the exact question count. The first half of the response commits to specific structural claims (recommendation-by-recommendation workbook, dual operator/participant scope) without retrieved basis; the hedge applies only to the numeric detail, leaving the fabricated structure unqualified.

AI Head's analysis — what weakness in the AI model caused this

The model committed to a structural claim (recommendation-keyed workbook, dual operator/participant scope) while hedging only on the numeric count — revealing that the calibration signal for 'I am inferring structure vs. retrieving content' is not uniformly applied. The retrieval stack returned nothing substantive on the toolkit's internals, but the model's confidence threshold for structural claims was not raised accordingly.

For Claude Sonnet 4.6 (web search on)

Question (paraphrased to protect IP)

A payments operations manager asked what the CPMI d224 self-assessment toolkit contains — its areas, structure, and assessment criteria. The response fabricated a detailed four-area structure with specific assessment dimensions and a usage process, falsely asserting the structure was drawn from public summaries when no public source describes the toolkit's internal contents.

RLB's analysis

The model did not hedge — it generated a four-area internal structure with named assessment dimensions and a usage process, then falsely attributed this construction to "public summaries." The fabrication includes specific area labels that mirror the recommendation category names, indicating the model used those category names as a structural template, then asserted the result as retrieved content. The false citation to public summaries is the critical failure: the model did not simply confabulate, it misrepresented its own inference as externally confirmed.

AI Head's analysis — what weakness in the AI model caused this

The false attribution to 'public summaries' is the critical failure signal for this finding: it shows the provenance-labelling step in the response-generation pipeline is not gated on actual retrieval. The model generated a source warrant ('confirmed from public summaries') for content it constructed from category labels — indicating the citation and provenance logic runs as a post-hoc labelling step rather than a retrieval-verified gate.

Impact for Technology & Data Teams in Retail Banking Sector in international jurisdictions working with the Promoting the Harmonisation of Application Programming Interfaces to Enhance Cross-Border Payments: Recommendations and Toolkit

A Technology & Data team using the AI's fabricated four-area toolkit structure would build their internal API harmonisation readiness assessment against criteria that have no basis in the actual CPMI publication — scoring vendors, scoping architecture gaps, and reporting programme status to senior stakeholders against a framework that does not exist. When the error surfaces — through a regulatory engagement, a counterparty challenge, or a direct read of the primary document — the team faces the cost of rerunning the assessment, correcting steering committee reporting, and reissuing any vendor evaluations that referenced the fabricated criteria.

For a retail bank operating across international jurisdictions and managing correspondent relationships with multiple cross-border payment rails simultaneously, the remediation burden extends across every workstream that touched the flawed assessment output.

Source used by the RLB Specialist Panel to verify: https://www.bis.org

References — raw findings (per AI model)

RLB-H-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q005-Opus47 Claude Opus 4.7 (web search on)
RLB-H-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q005-Sonnet46 Claude Sonnet 4.6 (web search on)

This finding also affects

Next finding → Finding#2 — Wrong ISO 20022 update date and fabricated technical annex content

Cite this finding

Each finding has a stable Citation ID (RLB-F-… for aggregated case-study findings, RLB-H-… for raw per-model hallucinations) — like a DOI, the ID always resolves to the canonical finding even if URLs change.

RLB Citation ID: RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q005

Plain text Download

RegLeg Specialist Panel (2026). "Finding#1 — Fabricated self-assessment toolkit structure — Retail Banking × Technology Data — International / Multilateral." Citation ID: RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q005. RegLegBrief AI Hallucination Research, published 2026-06-04. https://reglegbrief.com/regulators/j1/int/bis-cpmi/cpmi-api-harmonisation-cross-border-2024/sectors/retail_banking/technology_data/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-005/

APA 7th edition Download

RegLeg Specialist Panel. (2026). Finding#1 — Fabricated self-assessment toolkit structure [Hallucination finding RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q005]. RegLegBrief AI Hallucination Research. https://reglegbrief.com/regulators/j1/int/bis-cpmi/cpmi-api-harmonisation-cross-border-2024/sectors/retail_banking/technology_data/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-005/

Bluebook / OSCOLA (US + UK legal) Download

RegLeg Specialist Panel, Finding#1 — Fabricated self-assessment toolkit structure [RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q005], RegLegBrief AI Hallucination Research (June 04, 2026), https://reglegbrief.com/regulators/j1/int/bis-cpmi/cpmi-api-harmonisation-cross-border-2024/sectors/retail_banking/technology_data/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-005/.

BibTeX Download

@misc{reglegbrief_RLB_F_INT_BIS_CPMI_API_HARMONISATION_CROSS_BORDER_2024_Q005,
  author    = {RegLeg Specialist Panel},
  title     = {Finding#1 — Fabricated self-assessment toolkit structure},
  year      = {2026},
  publisher = {RegLegBrief AI Hallucination Research},
  note      = {Hallucination finding Citation ID: RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q005},
  url       = {https://reglegbrief.com/regulators/j1/int/bis-cpmi/cpmi-api-harmonisation-cross-border-2024/sectors/retail_banking/technology_data/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-005/}
}

← Back to case study summary Case study detail →

About Citation IDs →