One published white paper per regulation, filtered to the AI Labs audience. Each is grounded in the same substrate of authenticated primary sources used across the rest of this research.
When both Claude Opus 4.7 with web search and Claude Sonnet 4.6 with web search encounter content locked inside an inaccessible PDF — the CPMI October 2024 report on harmonising APIs for cross-border payments — they...
Fabrication and scope-conflation are the dominant failure shapes on the CFTC's December 2025 swap dealer business conduct and documentation rulemaking, with Claude Opus 4.7 with web search producing an invented...
Condition-sunset misclassification and fabricated amendment provenance are the dominant failure surfaces across both Claude Opus 4.7 and Claude Sonnet 4.6 on the CFTC's Digital Asset Collateral No-Action Relief and...
Both Claude Opus 4.7 with web search and Claude Sonnet 4.6 with web search produced failures on CPMI-IOSCO's Implementation Monitoring of the PFMI: Level 3 Assessment on General Business Risks (Bank for International...
The dominant failure observed in Claude Sonnet 4.6 on the CPMI-IOSCO Consultation on Updated Guidance and Public Disclosures to Implement Initial Margin Proposals is deontic register substitution — the model hardened...
Numeric conflation across disaggregated adoption-rate subcategories — collapsing distinct faster-payment-system and RTGS figures into a single blended claim — is the primary failure surface for Claude Opus 4.7 with...
This paper presents findings from RegLeg's hallucination research on the Agreement under the United Nations Convention on the Law of the Sea on the Conservation and Sustainable Use of Marine Biological Diversity of...
RegLeg tested two frontier AI models against the Principles for Financial Market Infrastructures (PFMI), the global standard for payment systems, central counterparties, and securities settlement systems published...
This paper presents findings from RegLeg's evaluation of AI model responses to questions about MAS Notice 637 — the Monetary Authority of Singapore's risk-based capital adequacy framework for banks — covering both...
This report documents hallucinations produced by frontier AI models when asked questions about the Guidance on Cyber Resilience for Financial Market Infrastructures, published in June 2016 by CPMI and IOSCO under the...
This paper presents findings from a structured evaluation of two frontier AI models — Claude Opus 4.7 with web search and Claude Sonnet 4.6 with web search — against the Financial Conduct Authority's Consumer Duty...