AI Risk Management: A Framework for the Non-Paranoid
Enterprise AI risk management that enables velocity. Learn how to classify AI risk systematically, build risk registers that actually get used, map control requirements to actual risk levels, govern third-party AI, and respond to incidents without killing innovation.
The Two Failure Modes of AI Risk Management
AI risk management fails in two opposite ways. The first failure mode is paranoia. Every AI system gets treated as a potential doomsday scenario. Every model requires 47 approval gates, 90-day pilot periods, and risk committee reviews that take longer than building the model. The result: enterprise velocity drops to zero. AI initiatives die in approval hell. Competitors deploy AI in weeks while you are still in risk review.
The second failure mode is naivety. Risk frameworks do not exist. Models deploy with zero governance. Bias, data drift, adversarial attacks, and security vulnerabilities go undetected until something breaks in production and the enterprise faces a crisis. By one industry estimate, the average cost of an AI incident is $4.2 million. Uncontrolled deployments are how you get expensive surprises.
The goal is calibration. Controls should match actual risk. High-risk systems get rigorous oversight. Low-risk systems deploy fast. The framework in this guide provides a way to classify risk systematically and map controls to actual risk levels. It works because it accepts risk explicitly rather than trying to eliminate it.
Four-Tier Risk Classification
Every enterprise AI system falls into one of four risk tiers. This classification determines what controls are required before deployment. Most enterprises fail to classify systematically, treating every system the same way (usually the paranoid way). Systematic classification fixes this.
- **Prohibited.** Systems too risky to deploy under any circumstances. Harm is too large, regulatory ban is explicit, or control is impossible.
- **High Risk.** Systems where AI failure creates direct harm to individuals. Decisions cannot easily be reversed. Disproportionate impact on vulnerable populations.
- **Limited Risk.** Systems where AI failure creates inconvenience or modest economic loss. Impact is reversible. Mostly affects business operations rather than individuals.
- **Minimal Risk.** Systems where AI failure has no material business impact. Humans never rely on model output alone. Failure is detected immediately and easily reversed.
The vast majority of enterprise AI systems are Limited or Minimal Risk. The paranoia failure mode treats them as High Risk. The naivety failure mode treats them as Minimal Risk with zero controls. Accurate classification prevents both.
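The tier descriptions above reduce to a handful of yes/no questions. A minimal sketch of that decision logic follows; the predicate names (`regulatory_ban`, `harms_individuals`, and so on) are illustrative shorthand for the tier criteria, not a formal standard.

```python
from enum import Enum

class RiskTier(Enum):
    PROHIBITED = "Prohibited"
    HIGH = "High Risk"
    LIMITED = "Limited Risk"
    MINIMAL = "Minimal Risk"

def classify(harms_individuals: bool, reversible: bool,
             regulatory_ban: bool, material_impact: bool) -> RiskTier:
    """Map the four tier descriptions onto a tier, most severe first."""
    if regulatory_ban:
        return RiskTier.PROHIBITED          # explicit ban or uncontrollable harm
    if harms_individuals and not reversible:
        return RiskTier.HIGH                # direct, hard-to-reverse harm to people
    if material_impact:
        return RiskTier.LIMITED             # reversible business impact
    return RiskTier.MINIMAL                 # no material impact, humans in the loop

# A spam filter: no individual harm, reversible, no ban, no material impact.
tier = classify(harms_individuals=False, reversible=True,
                regulatory_ban=False, material_impact=False)
```

The ordering matters: checks run from most to least severe, so a system that trips multiple criteria lands in its worst applicable tier.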
Building Your AI Risk Register
A risk register is a living document. For every AI system in production, you track five core fields. This is not theoretical. It is the document you reference when decisions need to be made.
| System Name | Risk Tier | Use Case & Harm | Current Controls | Review Cadence |
|---|---|---|---|---|
| Credit Risk Model | High Risk | Loan approval decisions. Harm: systematic credit denial to protected groups. | Quarterly bias audit, human review all denials above threshold, explanations provided, annual external audit | Quarterly |
| Demand Forecasting | Limited Risk | Inventory planning. Harm: stockouts or overstock. Reversible and detected immediately. | Automated accuracy monitoring, alert if MAPE exceeds 15%, monthly manual validation | Monthly |
| Email Spam Filter | Minimal Risk | Classifies incoming mail. Harm: legitimate mail misclassified; users spot and recover false positives immediately. | Standard software testing, feedback loop for user corrections | Quarterly |
| Hiring Recommendation Engine | High Risk | Resume screening and ranking. Harm: systematic bias against protected groups in hiring pipeline. | Monthly bias audit by protected attribute, human review all top 10, annual external fairness audit, adverse impact testing | Monthly |
| Customer Churn Prediction | Limited Risk | Identifies at-risk customers for retention campaigns. Harm: wasted marketing spend or missed retention. | Calibration validation monthly, performance tracking by segment, alert if calibration drifts | Monthly |
The risk register is not a compliance document. It is a working tool. Update it whenever you deploy a new system, when control changes occur, and when incidents happen. Review it monthly in your governance committee. Use it to triage resources toward the highest-risk systems.
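A register with the five fields above is easy to keep machine-readable, which makes the monthly triage trivial to automate. A sketch, assuming a simple in-memory structure (a real register would live in your governance system of record):

```python
from dataclasses import dataclass
from typing import List

@dataclass
class RegisterEntry:
    """One row of the risk register: the five core fields."""
    system_name: str
    risk_tier: str            # "High Risk", "Limited Risk", "Minimal Risk"
    use_case_and_harm: str
    current_controls: List[str]
    review_cadence: str       # "Monthly", "Quarterly", "Annually"

register = [
    RegisterEntry(
        system_name="Credit Risk Model",
        risk_tier="High Risk",
        use_case_and_harm="Loan approvals; harm: systematic denial to protected groups.",
        current_controls=["Quarterly bias audit", "Human review of denials above threshold"],
        review_cadence="Quarterly",
    ),
    RegisterEntry(
        system_name="Email Spam Filter",
        risk_tier="Minimal Risk",
        use_case_and_harm="Mail classification; harm: recoverable false positives.",
        current_controls=["Standard software testing"],
        review_cadence="Quarterly",
    ),
]

def triage(entries: List[RegisterEntry], tier: str = "High Risk") -> List[str]:
    """Pull the systems in a given tier so resources go to the riskiest first."""
    return [e.system_name for e in entries if e.risk_tier == tier]
```

Calling `triage(register)` surfaces the High-Risk systems for the monthly committee agenda.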
Risk-Appropriate Controls
The fundamental principle: controls should be proportional to risk. The bigger the potential harm, the more rigorous the control. This prevents both failure modes.
| Risk Tier | Model Governance | Testing Requirements | Monitoring | Decision Process |
|---|---|---|---|---|
| PROHIBITED | No deployment. Board-level escalation. Audit trail of decision. | Risk assessment documenting why system cannot be deployed. | No production deployment. | Executive and legal review required. Documented approval to not proceed. |
| HIGH RISK | Human-in-loop review for all decisions. Model documentation mandatory. Fairness assessment required. | Bias testing by protected attribute. Performance validation on holdout sets. Stress testing on out-of-sample populations. Adversarial testing. | Real-time accuracy and fairness monitoring. Automated alerts. Weekly performance reviews. Disparate impact testing quarterly. | Risk committee review required. Executive sign-off. Adverse action documentation. Right-to-appeal process. |
| LIMITED RISK | Standard model documentation. Basic performance baseline. | Validation on test set. Automated accuracy checks. Basic performance testing. | Monthly performance metrics. Automated alerts for metric degradation. Drift detection. | Standard approval process. Model owner sign-off. 5-day review window. |
| MINIMAL RISK | Standard SDLC practices apply. No special AI governance required. | Standard software testing. Code review. Unit and integration tests. | Standard application monitoring. Alerting for system failures. | Standard deployment process. No special approval gates required. |
The key insight: High-Risk systems require continuous fairness monitoring and human review. Limited-Risk systems need basic logging. Minimal-Risk systems deploy like normal software. This is how you scale AI without losing control.
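The controls table is essentially a lookup from tier to required controls, and the useful operation is a gap check before deployment. A sketch follows; the control names are abbreviated versions of the table rows, chosen for illustration:

```python
# Abbreviated control sets per tier, condensed from the controls table above.
REQUIRED_CONTROLS = {
    "High Risk": {
        "bias testing", "human-in-loop review",
        "real-time monitoring", "executive sign-off",
    },
    "Limited Risk": {
        "test-set validation", "automated accuracy checks", "drift detection",
    },
    "Minimal Risk": {
        "standard software testing", "standard monitoring",
    },
}

def control_gaps(tier: str, implemented: set) -> set:
    """Return the required controls a system is still missing for its tier."""
    return REQUIRED_CONTROLS[tier] - implemented

# A Limited-Risk system missing its automated accuracy checks:
gaps = control_gaps("Limited Risk", {"test-set validation", "drift detection"})
```

An empty result means the system meets its tier's bar; anything else is a named, actionable deficiency rather than a vague "needs more governance."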
Managing Third-Party AI Risk
Third-party AI is where enterprises fail most often. You deploy a vendor model or open-source model and assume it is safe. It is not. Vendor models can be biased. Open-source models can have security vulnerabilities. You own the risk even though you do not own the model.
Third-party AI falls into three categories, each with different assessment requirements:
**Embedded AI in SaaS products.** AI capabilities bundled into SaaS applications. You do not have visibility into the model but must manage the risk.
- Assess SaaS vendor AI governance practices
- Request fairness documentation
- Validate performance on your data
- Monitor for unexpected behavior changes
- Document vendor responsibilities in contract
**Dedicated AI vendor models.** Purpose-built models from specialized AI vendors. More transparency than SaaS, but still an external dependency.
- Request model card and fairness audit
- Test for bias on your population
- Validate update protocols and SLAs
- Understand retraining cadence
- Establish monitoring and incident response
**Open-source models.** Public models with source code available. Maximum transparency, but maximum responsibility falls on you.
- Review training data and potential biases
- Security audit for vulnerabilities
- License compliance review
- Fine-tuning and safety testing required
- Maintenance plan for security patches
For all three categories, follow the same risk tier classification framework. If a vendor model is being used for High-Risk decisions, apply High-Risk controls even though it is not your model. The vendor is your agent, but you are the responsible party.
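One way to operationalize this is to combine the per-category checklist with the tier-based controls, so a High-Risk vendor model automatically picks up the extra obligations. A sketch, with the checklists condensed from the bullets above (the category keys are illustrative names):

```python
from typing import List

# Condensed assessment checklists for the three third-party categories.
THIRD_PARTY_CHECKLISTS = {
    "embedded_saas": [
        "Assess vendor AI governance practices",
        "Request fairness documentation",
        "Validate performance on your data",
        "Monitor for unexpected behavior changes",
        "Document vendor responsibilities in contract",
    ],
    "vendor_model": [
        "Request model card and fairness audit",
        "Test for bias on your population",
        "Validate update protocols and SLAs",
        "Understand retraining cadence",
        "Establish monitoring and incident response",
    ],
    "open_source": [
        "Review training data and potential biases",
        "Security audit for vulnerabilities",
        "License compliance review",
        "Fine-tuning and safety testing",
        "Maintenance plan for security patches",
    ],
}

def assessment_plan(category: str, risk_tier: str) -> List[str]:
    """Category checklist plus the tier controls: you own the risk either way."""
    plan = list(THIRD_PARTY_CHECKLISTS[category])
    if risk_tier == "High Risk":
        plan.append("Apply full High-Risk controls (bias testing, human review, monitoring)")
    return plan

plan = assessment_plan("vendor_model", "High Risk")
```

The point the code makes explicit: the tier, not the ownership of the model, decides the depth of the controls.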
AI Risk Governance Structure
Who actually owns AI risk? Most enterprises do not answer this question clearly. The result: accountability falls through the cracks.
The proven structure is the three-lines-of-defense model adapted for AI:
**First line of defense.** Model owners and data science teams own day-to-day risk management.
- Model documentation
- Fairness testing and validation
- Performance monitoring
- Incident detection
- Control implementation
**Second line of defense.** The risk team oversees controls and escalates issues.
- Risk assessment and classification
- Control framework design
- Testing validation and verification
- Regulatory liaison
- Incident escalation decision
**Third line of defense.** Independent audit and governance committee oversight.
- Control effectiveness audit
- Risk appetite definition
- Major incident reviews
- Policy approval
- Public reporting and disclosure
Committee Structure: Establish an AI Risk Committee meeting monthly. Members: Chief Data Officer or equivalent, Chief Risk Officer, General Counsel, business unit heads responsible for High-Risk systems, external expert advisor. Agenda: review new systems, discuss incidents, adjust controls, track risk metrics. This is where decisions actually get made.
Escalation Thresholds: Define when issues bubble up. Example: Fairness metric degradation of 5 or more percentage points triggers immediate review. Suspected data breach or adversarial attack triggers 24-hour crisis response. Regulatory inquiry requires 72-hour briefing.
Review Cadence: Prohibited systems have no production review (they are never deployed). High-Risk systems, monthly. Limited-Risk systems, quarterly. Minimal-Risk systems, annually.
Six-Stage Incident Response Playbook
Most enterprises lack a playbook for AI incidents. When something goes wrong, they improvise. Improvisation is expensive. Define your playbook in advance.
Stage 1: Detection (0-6 hours)
Someone detects a problem. Automated monitoring catches performance degradation. A customer complains. An audit finds a control failure. Document what triggered detection. Move to triage immediately.
Stage 2: Triage (6-24 hours)
Is this actually a problem? Confirm the issue. Assess severity: Is the system down? Is accuracy materially degraded? Is there potential unfair outcome? Is data compromised? Assign severity level: Critical (affects decisions), High (performance impact), Medium (potential but no current impact), Low (documentation or process issue). Communicate status to Risk Committee and affected business units.
Stage 3: Containment (24-48 hours)
Prevent further harm. Options: Take system offline completely. Move to manual review only (no automated decisions). Narrow scope to lower-risk population. Slow down deployment if real-time system. Set deployment freeze on related systems. Document containment decision.
Stage 4: Investigation (48-72 hours)
What caused it? Review model performance by population. Check for data drift. Audit recent training data changes. Review deployment changes. Interview model owners and data engineers. Document root cause. Assess duration: How long was system degraded before detection?
Stage 5: Remediation (72 hours to 10 days)
Fix it. Retrain on updated data. Adjust model threshold. Apply fairness constraints. Remove model from production if necessary. Implement control that should have caught this. Test thoroughly before redeployment.
Stage 6: Post-Incident Review (10-30 days)
Learn from it. Determine what controls failed. Update your risk register. Improve monitoring. Communicate findings to Risk Committee and audit. Update incident response playbook if needed.
Timeline requirements: Critical incidents require CEO notification within 24 hours. High-risk incidents require Risk Committee briefing within 72 hours. All incidents require post-incident review within 30 days.
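The timeline requirements translate directly into deadlines you can compute at detection time, so nobody has to remember them mid-crisis. A sketch under the stated rules (24 hours, 72 hours, 30 days):

```python
from datetime import datetime, timedelta
from typing import Dict

# Notification rules from the playbook: (who to notify, hours from detection).
NOTIFICATION_RULES = {
    "Critical": ("CEO notification", 24),
    "High": ("Risk Committee briefing", 72),
}
POST_INCIDENT_REVIEW_DAYS = 30  # all incidents, regardless of severity

def incident_deadlines(severity: str, detected_at: datetime) -> Dict[str, datetime]:
    """Compute every deadline the playbook sets for an incident of this severity."""
    deadlines = {
        "post-incident review": detected_at + timedelta(days=POST_INCIDENT_REVIEW_DAYS),
    }
    if severity in NOTIFICATION_RULES:
        action, hours = NOTIFICATION_RULES[severity]
        deadlines[action] = detected_at + timedelta(hours=hours)
    return deadlines

deadlines = incident_deadlines("Critical", datetime(2024, 1, 1, 9, 0))
```

For a Critical incident detected at 09:00 on January 1, the CEO notification is due by 09:00 on January 2 and the post-incident review by January 31.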
Putting It Together: A Working Example
Your organization has five AI systems in production. You use this framework:
- Credit risk model: Classified as High Risk. You implement human-in-loop review for all denials, quarterly fairness audits, real-time monitoring. Monthly Risk Committee review required. Investment: significant, but necessary.
- Demand forecasting: Classified as Limited Risk. You implement automated accuracy monitoring and monthly validation. Minimal special governance. Standard deployment process.
- Hiring recommendation engine: Classified as High Risk. Same rigorous approach as credit model. Monthly bias testing required. Adverse action documentation.
- Customer churn prediction: Classified as Limited Risk. Basic monitoring. Standard approval.
- Email spam filtering: Classified as Minimal Risk. Deploys like regular software. No special AI governance gates.
You maintain a risk register. It lives in your governance system. It is reviewed monthly by the Risk Committee. When performance degrades, you consult it to understand what controls apply and what the escalation path is. You do not paralyze every system. You focus rigor where it matters. Innovation is not killed. But risk is not ignored either.
Key Takeaways
AI risk management succeeds when controls match actual risk:
- Classify systematically: Use four-tier framework. Not everything is equally risky.
- Build risk register: Living document updated monthly. Reference it in decisions.
- Apply proportional controls: High-Risk systems get rigorous governance. Limited and Minimal-Risk systems deploy faster.
- Manage third-party AI: You own the risk even if you do not own the model.
- Establish governance structure: Three lines of defense. AI Risk Committee. Clear escalation thresholds.
- Prepare incident response: Do not improvise. Use the six-stage playbook.
By one industry survey, seventy-eight percent of enterprises lack AI risk frameworks. The ones that do have them enjoy faster deployment, fewer surprises, and lower incident costs. Calibrated risk management is not opposed to innovation. It enables it.