Controls & Evidence

NIST AI RMF readiness evaluates your controls across multiple domains. For each domain, reviewers look for evidence that controls are designed properly and operating effectively. Below are the core control domains with minimum requirements and example evidence artifacts.

What reviewers look for: Reviewers don't just check that policies exist. They verify that controls are operating as described, that evidence is produced on schedule, and that gaps are tracked and remediated. The evidence examples below show what "operating effectiveness" looks like in practice.

AI System Inventory

Governance

Comprehensive catalog of all AI systems with risk classification, purpose documentation, and stakeholder identification.

Requirements

  • Catalog all AI systems with unique identifiers
  • Classify each system by risk tier (minimal, limited, high, unacceptable)
  • Document purpose and intended use for each system
  • Identify stakeholders and affected populations
  • Maintain regular inventory update cadence

Evidence Examples

Artifact Owner Frequency
AI system registry with unique identifiers and descriptions AI Governance Lead Quarterly
Risk classification records with tier justification Risk Management Annually or on system change
Stakeholder analysis and affected population mapping AI Governance Lead Annually

Risk Categorization & Tiering

Governance

Structured methodology for categorizing AI system risk levels and applying tiered governance requirements.

Requirements

  • Risk categorization methodology documented and approved
  • Factors include potential harm severity, reversibility, scale of impact, and automation level
  • Tiered governance requirements established (higher risk = more oversight)
  • Alignment with organizational risk tolerance documented

Evidence Examples

Artifact Owner Frequency
Risk categorization framework document Risk Management Annually
Tier assignment records with justification for each AI system AI Governance Lead Quarterly or on system change
Risk tolerance documentation and board-level approval Executive Leadership Annually

Impact Assessment & Context Analysis

Governance

Systematic analysis of AI system impacts on stakeholders, including harm identification and deployment context evaluation.

Requirements

  • Stakeholder impact analysis completed for each AI system
  • Context-of-use documentation maintained
  • Identification of potential harms at individual, group, societal, and environmental levels
  • Assessment of deployment context distinguishing high-stakes from low-stakes use
  • Comparative analysis of AI versus non-AI alternatives

Evidence Examples

Artifact Owner Frequency
Impact assessment reports per AI system AI Governance Lead Prior to deployment and annually
Stakeholder engagement records and feedback Product Management Prior to deployment
Context analysis documents with deployment environment details AI Governance Lead Annually or on context change

Bias & Fairness Testing

Governance

Pre-deployment and ongoing testing for bias across demographic groups with defined fairness metrics and remediation procedures.

Requirements

  • Bias testing across demographic groups completed before deployment
  • Fairness metrics defined and documented (demographic parity, equalized odds, etc.)
  • Disparate impact analysis performed
  • Ongoing monitoring for bias drift in production
  • Remediation procedures established for detected bias

Evidence Examples

Artifact Owner Frequency
Bias test reports with demographic group analysis Data Science / ML Engineering Prior to deployment and quarterly
Fairness metric dashboards with threshold definitions AI Governance Lead Continuous monitoring
Demographic impact analysis documentation Data Science / ML Engineering Prior to deployment and on model retrain
Bias remediation records and outcome tracking AI Governance Lead As needed

Performance & Reliability Monitoring

Governance

Continuous monitoring of AI system accuracy, drift detection, robustness testing, and reliability SLAs.

Requirements

  • Accuracy and performance benchmarks established for each AI system
  • Production monitoring for model drift and degradation
  • Robustness testing against adversarial inputs and edge cases
  • Reliability metrics and SLAs defined
  • Automated alerting for performance degradation

Evidence Examples

Artifact Owner Frequency
Performance benchmark records with baseline metrics ML Engineering Prior to deployment and on retrain
Drift detection dashboards and alert logs ML Engineering / SRE Continuous
Robustness test results (adversarial, edge case) ML Engineering Prior to deployment and quarterly
SLA documentation and uptime reports SRE / Operations Monthly

Transparency & Explainability

Governance

Model cards, user-facing disclosures, explanation methods, and documentation of AI system limitations.

Requirements

  • Model cards or system cards maintained for each AI system
  • Explanation methods appropriate to audience (technical, user, affected person)
  • Disclosure of AI use to end users
  • Documentation of known limitations and failure modes
  • Interpretability requirements scaled by risk tier

Evidence Examples

Artifact Owner Frequency
Model cards or system cards for each AI system ML Engineering / AI Governance Lead On deployment and on material change
User-facing AI disclosure statements Product / Legal On deployment
Explainability documentation with audience-appropriate methods ML Engineering On deployment and annually
Limitation and failure mode disclosures ML Engineering On deployment and on material change

Human Oversight & Accountability

Governance

Human-in-the-loop mechanisms, override capabilities, accountability chains, and appeal processes for AI decisions.

Requirements

  • Human-in-the-loop required for high-risk decisions
  • Override mechanisms documented and tested
  • Clear accountability chain defining responsibility for AI decisions
  • Escalation paths for AI system failures
  • Appeal mechanisms available for affected individuals

Evidence Examples

Artifact Owner Frequency
Human oversight procedures and decision authority matrix AI Governance Lead Annually
Override mechanism logs and testing records Operations / ML Engineering Quarterly
RACI matrix for AI system accountability AI Governance Lead Annually
Appeal process documentation and resolution records Compliance / Legal Ongoing

Data Governance for AI

Governance

Training data quality, provenance tracking, consent management, labeling QA, and synthetic data governance.

Requirements

  • Training data quality assessed for accuracy, completeness, and representativeness
  • Data provenance and lineage tracking from source to model
  • Consent and authorization verified for data use in AI training and inference
  • Data labeling quality assurance processes established
  • Synthetic data generation and use governed

Evidence Examples

Artifact Owner Frequency
Data quality reports with representativeness analysis Data Engineering / Data Science Per training cycle
Data provenance and lineage records Data Engineering Per training cycle
Consent and authorization documentation for training data Legal / Privacy Prior to data acquisition
Labeling QA audit records and inter-annotator agreement scores Data Science Per labeling campaign

Model Lifecycle Management

Governance

Version control, validation gates, deployment approvals, production maintenance, and retirement procedures for AI models.

Requirements

  • Version control for models and training data
  • Validation and testing gates before deployment
  • Deployment approval process with sign-off authority
  • Production monitoring and maintenance procedures
  • Retirement and decommissioning procedures

Evidence Examples

Artifact Owner Frequency
Model version registry with training data references ML Engineering Per model version
Validation test results and gate approval records ML Engineering / QA Per deployment
Deployment approval sign-offs AI Governance Lead / Engineering Manager Per deployment
Decommission records with rollback and data retention details ML Engineering As needed

Third-Party AI Risk

Governance

Assessment, contractual controls, and ongoing monitoring for AI systems provided by vendors and third parties.

Requirements

  • Vendor AI system assessment completed before adoption
  • Contractual requirements for AI transparency and data handling
  • Ongoing monitoring of vendor AI system performance
  • Supply chain risk assessment for AI components (models, data, infrastructure)
  • Incident reporting requirements established for vendor AI systems

Evidence Examples

Artifact Owner Frequency
Vendor AI assessment questionnaires and evaluation results Procurement / AI Governance Lead Prior to adoption and annually
Contract provisions for AI transparency and incident reporting Legal / Procurement Per contract cycle
Vendor AI performance monitoring reports AI Governance Lead / Operations Quarterly
AI supply chain risk register Risk Management Annually

Evidence Naming Conventions

Organized, traceable evidence is critical for a smooth review. Adopting a consistent convention makes evidence retrieval faster and reduces friction.

Recommended format:

ControlID_System_ArtifactType_YYYY-MM-DD_Period_Owner_v#

Key principles for evidence management:

  • Centralized repository with access control and version history
  • Consistent naming across all control domains and artifact types
  • Defined cadence for each evidence type: event-driven, monthly, quarterly, or annual
  • Immutable exports where possible to demonstrate evidence integrity

AI and data companies: Standard controls are the baseline. See the AI-specific advisory modules for additional controls addressing data governance, prompt logging, RAG security, and model vendor risk.