R-BED™  ·  Risk-Based Evaluability Design

Governance for AI systems that actually deploy.

When regulators audit, when boards ask, when something goes wrong — your institution must answer one question: can you prove your AI is doing what it was meant to? R-BED is the operational framework that makes that answer possible.

EDITION5.0JUNE 2026
1,368
Documented AI failures analysed across healthcare, finance, law enforcement, and consumer technology.
13
Sub-dimensions across three vertices — covering technical, deployment, and consequence risk.
9
Regulatory frameworks mapped: SR 26-2, EU AI Act, NIST AI RMF, ISO 42001, NAIC, OSFI E-23, and more.
SR 26-2
The April 2026 supervisory update that left GenAI & agentic AI outside existing model risk vocabulary.

Most AI governance fails before deployment begins — not because frameworks are absent, but because they are unenforceable.

Across 1,368 documented failures, the breakdown was rarely the absence of policy. It was the inability to determine, in operational terms, whether deployed systems remained within acceptable behavioural boundaries — and to credibly link what was monitored to the business outcomes the deployment was meant to influence. We call this the Evaluability Gap. It manifests in three dimensions.

GAP 01

The Definition Gap

Organisations cannot articulate, in measurable operational terms, what acceptable AI system behaviour means in their specific deployment context. Without a definition, monitoring is theatre.

GAP 02

The Operationalisation Gap

Even where definitions exist, institutions lack the monitoring infrastructure, intervention thresholds, and enforcement authority required to act on them in production. Governance becomes documentation, not control.

GAP 03

The Business-Linkage Gap

The dimension most often missed: governance signals are rarely connected to the revenue, customer outcomes, or institutional risk the deployment was meant to influence. Well-monitored systems remain disconnected from why they were deployed.

A risk decomposition that traces every failure to three measurable vertices.

R-BED grounds AI governance in the same disciplined risk decomposition that the Federal Reserve's SR 11-7 brought to banking models after the global financial crisis — extended into the probabilistic, adaptive, and operationally fluid systems that now define the AI frontier.

I

Technical Failure

The probability that the system itself misbehaves: system design, data foundations, failure modes, fairness, stability, safety, explainability.

P(F)  ·  7 sub-dimensions
II

Deployment Harm

Given a failure, the probability that harm reaches a stakeholder: human control, exposure architecture, monitoring & guardrails.

P(H | F, A)  ·  3 sub-dimensions
III

Business Impact

Given harm, the magnitude and reversibility of the consequence: propagation, severity, recovery.

E[S | H, F, A]  ·  3 sub-dimensions
Expected harm, decomposed
E[H] = P(F) · P(H | F, A) · E[S | H, F, A]
P(F)probability of technical failure
P(H | F, A)conditional harm given failure & architecture
E[S | H, F, A]expected severity given realised harm

The complete operational treatise — 659 pages, empirically validated, regulator-mapped.

2026 · Enhanced Edition
Evaluable AI
Building Governable AI Systems — A Unified Operating Model for Traditional ML, Generative AI, RAG, and Agentic AI
Srivastava · Sah

A unified governance operating model spanning traditional ML, generative AI, RAG, and agentic systems. Grounded in the analysis of 1,368 documented AI failures and synthesised with the Stanford Digital Economy Lab's 51 Enterprise AI Success Patterns.

Develops R-BED from first principles, maps it to nine regulatory frameworks, presents six worked examples drawn from financial services and insurance, and operationalises the framework through the Indicator Catalogue, the Indicator Companion, and the R-BED Workbook.

Built for institutions under supervisory accountability — where operational governance is not optional.

BANKS  ·  BHCs  ·  SIFIs

Model Risk Management

Extends the SR 11-7 lineage into the SR 26-2 era — including the GenAI and agentic systems the supervisory letter explicitly carved out. Aligned with OSFI Guideline E-23 for Canadian institutions.

INSURANCE  ·  NAIC STATES

Insurance & Underwriting

Mapped to the NAIC Model Bulletin on AI Systems, with worked examples for accelerated underwriting, conversion scoring, and external lead allocation deployments.

EU AI ACT  ·  HIGH-RISK

Regulated Industries

R-BED tier assignment maps directly to EU AI Act high-risk categorisation, NIST AI RMF profiles, and ISO/IEC 42001 management-system requirements.

PUBLIC  ·  CRITICAL SECTORS

Public & Critical Sectors

For deployments where reversibility, recourse, and consequence severity demand the discipline of operational evaluability — not aspirational principle.

Two practitioners formed in the discipline of model risk management.

Vishal Srivastava, PhD

Originator  ·  R-BED Framework

Originator of the R-BED framework and trademark holder. Practitioner experience deploying AI systems in regulated industries, formed within the model risk discipline shaped by SR 11-7 and its successor SR 26-2. New York–based.

Tanmay Sah, PhD

Co-Author  ·  Evaluable AI

Co-author with research focus on AI evaluation, responsibility, and the operational gap between governance as designed and governance as it functions in production deployments.

Three ways to bring R-BED into your institution.

Start where the urgency is. Most institutions begin with an executive briefing, move to a portfolio assessment of their current AI deployments, then operationalise R-BED through their model risk function.

01  ·  READ

The Book

The complete framework, mapped to regulatory expectations, with six worked examples and the full indicator catalogue. The entry point for governance, MRM, and risk teams.

Get the book →
02  ·  BRIEF

Executive Briefing

A 90-minute working session for boards, executive committees, or MRM leadership. Tailored to your portfolio, your regulators, and your current AI risk posture.

Request a briefing →
03  ·  DEPLOY

R-BED Engagement

A structured implementation: portfolio scoring, gap analysis, governance operating model design, and training for your three lines of defence. Typical engagement: 8–16 weeks.

Start the conversation →