R-BED™ · Risk-Based Evaluability Design

Governance for AI systems that actually deploy.

When regulators audit, when boards ask, when something goes wrong — your institution must answer one question: can you prove your AI is doing what it was meant to? R-BED is the operational framework that makes that answer possible.

Read the Book → Brief Your Team

EDITION5.0JUNE 2026

1,368

Documented AI failures analysed across healthcare, finance, law enforcement, and consumer technology.

Sub-dimensions across three vertices — covering technical, deployment, and consequence risk.

Regulatory frameworks mapped: SR 26-2, EU AI Act, NIST AI RMF, ISO 42001, NAIC, OSFI E-23, and more.

SR 26-2

The April 2026 supervisory update that left GenAI & agentic AI outside existing model risk vocabulary.

01The Evaluability Gap

Most AI governance fails before deployment begins — not because frameworks are absent, but because they are unenforceable.

Across 1,368 documented failures, the breakdown was rarely the absence of policy. It was the inability to determine, in operational terms, whether deployed systems remained within acceptable behavioural boundaries — and to credibly link what was monitored to the business outcomes the deployment was meant to influence. We call this the Evaluability Gap. It manifests in three dimensions.

GAP 01

The Definition Gap

Organisations cannot articulate, in measurable operational terms, what acceptable AI system behaviour means in their specific deployment context. Without a definition, monitoring is theatre.

GAP 02

The Operationalisation Gap

Even where definitions exist, institutions lack the monitoring infrastructure, intervention thresholds, and enforcement authority required to act on them in production. Governance becomes documentation, not control.

GAP 03

The Business-Linkage Gap

The dimension most often missed: governance signals are rarely connected to the revenue, customer outcomes, or institutional risk the deployment was meant to influence. Well-monitored systems remain disconnected from why they were deployed.

02The R-BED Framework

A risk decomposition that traces every failure to three measurable vertices.

R-BED grounds AI governance in the same disciplined risk decomposition that the Federal Reserve's SR 11-7 brought to banking models after the global financial crisis — extended into the probabilistic, adaptive, and operationally fluid systems that now define the AI frontier.

Technical Failure

The probability that the system itself misbehaves: system design, data foundations, failure modes, fairness, stability, safety, explainability.

P(F) · 7 sub-dimensions

Deployment Harm

Given a failure, the probability that harm reaches a stakeholder: human control, exposure architecture, monitoring & guardrails.

P(H | F, A) · 3 sub-dimensions

III

Business Impact

Given harm, the magnitude and reversibility of the consequence: propagation, severity, recovery.

E[S | H, F, A] · 3 sub-dimensions

Expected harm, decomposed

E[H] = P(F) · P(H | F, A) · E[S | H, F, A]

P(F)probability of technical failure

P(H | F, A)conditional harm given failure & architecture

E[S | H, F, A]expected severity given realised harm

03The Book

The complete operational treatise — 659 pages, empirically validated, regulator-mapped.

2026 · Enhanced Edition

Evaluable AI

Building Governable AI Systems — A Unified Operating Model for Traditional ML, Generative AI, RAG, and Agentic AI

Srivastava · Sah

A unified governance operating model spanning traditional ML, generative AI, RAG, and agentic systems. Grounded in the analysis of 1,368 documented AI failures and synthesised with the Stanford Digital Economy Lab's 51 Enterprise AI Success Patterns.

Develops R-BED from first principles, maps it to nine regulatory frameworks, presents six worked examples drawn from financial services and insurance, and operationalises the framework through the Indicator Catalogue, the Indicator Companion, and the R-BED Workbook.

Get the Book → Sample Chapter

04Who R-BED Is For

Built for institutions under supervisory accountability — where operational governance is not optional.

BANKS · BHCs · SIFIs

Model Risk Management

Extends the SR 11-7 lineage into the SR 26-2 era — including the GenAI and agentic systems the supervisory letter explicitly carved out. Aligned with OSFI Guideline E-23 for Canadian institutions.

INSURANCE · NAIC STATES

Insurance & Underwriting

Mapped to the NAIC Model Bulletin on AI Systems, with worked examples for accelerated underwriting, conversion scoring, and external lead allocation deployments.

EU AI ACT · HIGH-RISK

Regulated Industries

R-BED tier assignment maps directly to EU AI Act high-risk categorisation, NIST AI RMF profiles, and ISO/IEC 42001 management-system requirements.

PUBLIC · CRITICAL SECTORS

Public & Critical Sectors

For deployments where reversibility, recourse, and consequence severity demand the discipline of operational evaluability — not aspirational principle.

05The Authors

Two practitioners formed in the discipline of model risk management.

Vishal Srivastava, PhD

Originator · R-BED Framework

Originator of the R-BED framework and trademark holder. Practitioner experience deploying AI systems in regulated industries, formed within the model risk discipline shaped by SR 11-7 and its successor SR 26-2. New York–based.

Tanmay Sah, PhD

Co-Author · Evaluable AI

Co-author with research focus on AI evaluation, responsibility, and the operational gap between governance as designed and governance as it functions in production deployments.

06Engage

Three ways to bring R-BED into your institution.

Start where the urgency is. Most institutions begin with an executive briefing, move to a portfolio assessment of their current AI deployments, then operationalise R-BED through their model risk function.

01 · READ

The Book

The complete framework, mapped to regulatory expectations, with six worked examples and the full indicator catalogue. The entry point for governance, MRM, and risk teams.

Get the book →

02 · BRIEF

Executive Briefing

A 90-minute working session for boards, executive committees, or MRM leadership. Tailored to your portfolio, your regulators, and your current AI risk posture.

Request a briefing →

03 · DEPLOY

R-BED Engagement

A structured implementation: portfolio scoring, gap analysis, governance operating model design, and training for your three lines of defence. Typical engagement: 8–16 weeks.

Start the conversation →