MAP-3.5: Processes for human oversight are defined, assessed, and documented in accordance with organizational policies from the govern function.

To meet the MAP-3.5 requirement, you must formally define where humans supervise AI decisions, test whether those oversight steps work in practice, and keep durable documentation that ties oversight back to your organization’s AI governance policies. Treat this as a control: clear roles, triggers, escalation paths, and recurring evidence. 1

Key takeaways:

  • Define “human oversight” as specific workflow steps (approve, review, override, escalate), not a general principle. 1
  • Assess oversight design and operation, then document results and remediation actions on a schedule aligned to governance policy. 1
  • Retain audit-ready artifacts that prove oversight happens for each in-scope AI use case, including exceptions and overrides. 1

MAP-3.5 sits in the “Map” function of the NIST AI Risk Management Framework and forces operational clarity: where exactly do humans remain accountable for AI-supported outcomes, and how do you prove it? The requirement is not satisfied by a statement like “a human is in the loop.” You need defined processes, an assessment method to validate those processes, and documentation that aligns to governance policies established under the “Govern” function. 1

For a Compliance Officer, CCO, or GRC lead, MAP-3.5 is best implemented as a repeatable control that attaches to each AI system (or each high-impact use case) and produces recurring evidence. You will coordinate across product owners, model risk owners, security, operations, and legal, but your deliverable is straightforward: documented oversight procedures, documented assessments of those procedures, and a governance linkage that explains why the oversight is adequate for the risk. 1

This page gives requirement-level guidance you can execute quickly: scope, roles, workflow design patterns, an assessment approach, evidence lists, and the exam questions you will get if you claim MAP-3.5 coverage.

Regulatory text

Regulatory excerpt: “Processes for human oversight are defined, assessed, and documented in accordance with organizational policies from the govern function.” 1

Operator interpretation (what you must do):

  1. Defined: You must write down specific oversight processes (who does what, when, with what authority) for AI development and/or AI deployment. 1
  2. Assessed: You must evaluate whether those oversight processes are properly designed and working as intended in real operations (not just on paper). 1
  3. Documented + aligned to governance: Your oversight processes and assessments must be consistent with your organization’s AI governance policies (the “Govern” function), and you must keep documentation that shows that linkage. 1

Plain-English interpretation (what MAP-3.5 really demands)

Human oversight under MAP-3.5 means you can answer, with evidence:

  • Where the AI can act automatically and where it cannot.
  • Who can approve, pause, override, or roll back AI outputs.
  • What triggers human review (confidence thresholds, policy flags, complaint signals, safety events, drift indicators, or regulatory-risk events).
  • How oversight is checked (testing, QA sampling, control testing, incident reviews).
  • How you keep records that make your oversight defensible after an incident or during an audit. 1

Who it applies to (entity + operational context)

MAP-3.5 applies to organizations developing or deploying AI systems, including:

  • Product teams shipping AI features (recommendations, personalization, automated decisions).
  • Enterprises deploying third-party AI tools (HR screening, fraud tools, AML monitoring, customer support, security analytics).
  • Regulated businesses where AI influences customer outcomes (credit, employment, healthcare, housing, education), even if the AI is provided by a third party. 1

Operationally, you should treat these as in scope:

  • Customer-impacting decisions: approvals/denials, eligibility, pricing, ranking, content moderation.
  • Safety/security decisions: threat detection, access decisions, anomaly flags.
  • Material internal decisions: workforce analytics, performance scoring, investigation prioritization.

If an AI output can create legal, operational, financial, or reputational harm, MAP-3.5 oversight controls should be documented and tested for that workflow. 1

What you actually need to do (step-by-step)

Step 1: Tie MAP-3.5 to governance policy (don’t start in the weeds)

  • Identify the authoritative governance artifacts that set expectations for AI accountability (AI risk policy, model governance standard, risk acceptance policy, incident response policy).
  • Write a short MAP-3.5 control statement that references those policies and defines how oversight must work across the AI lifecycle.
    Output: “Human oversight standard” (or procedure) that is explicitly governed by your policy framework. 1

Step 2: Define oversight patterns per AI use case

Create a simple matrix per system/use case:

| Oversight decision point | Human role | Authority | Trigger | Required action | Record |
| --- | --- | --- | --- | --- | --- |
| Pre-release approval | Model owner / risk owner | Approve or block release | New model/version | Review test results + risk sign-off | Release checklist + sign-off |
| Output review sampling | Ops QA | Escalate or correct | Sampling plan or flagged cases | Review outputs; correct labels/workflows | QA log |
| User complaint handling | Support lead | Pause feature if needed | Complaint patterns | Triage; escalate to risk committee | Ticket + escalation notes |
| Incident response | Incident commander | Kill switch | Safety/security incident | Disable model route; notify stakeholders | Incident report |

Pick patterns that fit your environment:

  • Human-in-the-loop: a human must approve each output before action.
  • Human-on-the-loop: automated actions occur, but humans monitor, sample, and can intervene quickly.
  • Human-in-command: humans retain authority to pause/override and define boundaries even if the system is automated.

Your documentation should state which pattern applies where, and why. 1
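If you track the oversight matrix in code or a structured inventory rather than a spreadsheet, a record per decision point keeps the matrix testable. The sketch below is illustrative only; the field names and values are assumptions, not prescribed by MAP-3.5.

```python
from dataclasses import dataclass

@dataclass
class OversightPoint:
    """One row of the oversight matrix for a single AI use case.

    Field names are illustrative assumptions, not MAP-3.5 terminology.
    """
    decision_point: str  # e.g. "pre-release approval"
    pattern: str         # "in-the-loop" | "on-the-loop" | "in-command"
    human_role: str      # who performs the review
    authority: str       # what the human may do (approve, block, pause)
    trigger: str         # what invokes the review
    record: str          # durable evidence the step produces

pre_release = OversightPoint(
    decision_point="pre-release approval",
    pattern="in-the-loop",
    human_role="model risk owner",
    authority="approve or block release",
    trigger="new model/version",
    record="release checklist + sign-off",
)

print(pre_release.pattern)  # in-the-loop
```

Storing the matrix this way makes Step 4’s assessments easier: a reviewer can diff the documented pattern against what the workflow actually does.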

Step 3: Assign accountable owners and escalation paths

For each oversight process, document:

  • Process owner (accountable for design and operation).
  • Performers (operators executing review/approval).
  • Escalation chain (who decides risk acceptance, who can trigger a rollback, who notifies compliance/legal).
  • Competency/training expectations for reviewers (what they must understand to review well).

One common audit failure: “everyone is responsible,” which means no one is accountable.

Step 4: Define the assessment method (design + operating effectiveness)

You need a repeatable assessment approach that tests:

  • Design effectiveness: Does the process, as written, address the risks and align to governance policy?
  • Operating effectiveness: Did the process actually occur in practice, with evidence, for a sample of events (releases, overrides, escalations, complaints, monitoring alerts)? 1

Practical assessment methods you can run without heavy tooling:

  • Walkthroughs of the workflow with artifact inspection (tickets, approvals, logs).
  • Targeted sampling of recent AI-driven decisions to verify reviewer action and documentation.
  • Tabletop exercises for incident escalation and kill switch invocation (document outcomes and gaps).
  • Post-incident reviews that explicitly evaluate oversight breakdowns.
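The targeted-sampling method above can be run with very light tooling. The sketch below assumes decision records exported from a ticketing system; the record fields and the pass/fail rule (reviewer named plus evidence link present) are illustrative assumptions you would replace with your own evidence criteria.

```python
import random

# Hypothetical decision records exported from a ticketing system.
# Field names are assumptions for illustration only.
decisions = [
    {"id": 1, "reviewer": "qa-lead", "evidence_link": "TICKET-101"},
    {"id": 2, "reviewer": None, "evidence_link": None},  # oversight missing
    {"id": 3, "reviewer": "qa-lead", "evidence_link": "TICKET-103"},
]

def sample_and_test(records, sample_size, seed=0):
    """Draw a reproducible sample and flag records lacking reviewer evidence.

    A fixed seed documents the sample-selection rationale for the
    assessment file (see "Assessment records" below).
    """
    rng = random.Random(seed)
    sample = rng.sample(records, min(sample_size, len(records)))
    exceptions = [r["id"] for r in sample
                  if not (r["reviewer"] and r["evidence_link"])]
    return sample, exceptions

sample, exceptions = sample_and_test(decisions, sample_size=3)
print(sorted(exceptions))  # [2] -> record IDs needing remediation
```

Each flagged ID becomes an exception in Step 5, with a root cause and a time-bound fix.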

Step 5: Document results, exceptions, and remediation

For each assessment cycle:

  • Record findings, severity, root cause, and remediation plan.
  • Track exceptions (where oversight did not happen, or happened late) with an approval and a time-bound fix.
  • Feed material gaps back into governance reporting (risk committee, model governance forum). 1

Step 6: Operationalize recurring evidence collection

Turn MAP-3.5 into a control with recurring evidence. A lightweight way:

  • Add MAP-3.5 fields to your AI system inventory: oversight pattern, owners, triggers, kill switch location, assessment date, evidence link.
  • Create an “oversight evidence folder” per system/use case with standardized naming.
  • Calendar the assessment cadence that your governance policies require (document your chosen schedule). 1
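A minimal sketch of those inventory fields, with a check for overdue assessments, is shown below. The keys and the 90-day cadence are assumptions for illustration; use whatever cadence your governance policies require.

```python
from datetime import date, timedelta

# Example MAP-3.5 fields attached to one AI system inventory entry.
# Keys and values are illustrative assumptions.
inventory = [
    {"system": "fraud-scoring",
     "oversight_pattern": "on-the-loop",
     "owner": "risk-ops",
     "kill_switch": "runbook RB-12",
     "last_assessed": date(2024, 1, 15),
     "evidence_link": "folder://fraud-scoring/oversight/"},
]

def overdue(entries, cadence_days=90, today=date(2024, 6, 1)):
    """Return systems whose oversight assessment is past the cadence."""
    cutoff = today - timedelta(days=cadence_days)
    return [e["system"] for e in entries if e["last_assessed"] < cutoff]

print(overdue(inventory))  # ['fraud-scoring']
```

A scheduled job (or GRC tool) running this check is one way to turn the cadence from a calendar reminder into recurring evidence.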

Where Daydream fits naturally: If you struggle to keep oversight evidence consistent across many AI use cases and third parties, Daydream can track control ownership, map MAP-3.5 to policies and procedures, and prompt recurring evidence collection so you are not rebuilding audit binders each cycle. 1

Required evidence and artifacts to retain (audit-ready)

Keep artifacts that prove definition, assessment, and documentation:

Governance alignment

  • AI governance policy/standard that sets oversight expectations (or cross-reference to enterprise governance policy). 1
  • MAP-3.5 control narrative mapped to policy sections. 1

Defined processes

  • Human oversight procedure(s) per system/use case.
  • RACI or responsibility matrix for oversight roles.
  • Escalation procedure (including criteria to pause/rollback).
  • Training/competency materials for reviewers (or role requirements).

Operating records (proof it happened)

  • Release approvals and checklists for model/version changes.
  • QA sampling logs and reviewer notes.
  • Override logs (who overrode the system, why, what happened next).
  • Incident reports and post-incident reviews tied to oversight performance.
  • Tickets for user complaints and escalation decisions.

Assessment records

  • Assessment plan, sample selection rationale, test steps.
  • Findings log and remediation tracking.
  • Exception approvals and closure evidence. 1

Common exam/audit questions and hangups

Auditors and internal reviewers tend to ask:

  • “Show me the exact step in the workflow where a human reviews or can override the AI output.”
  • “Who has authority to stop the model in production? Demonstrate it.”
  • “How do you know oversight is working? Show recent tests, samples, or incidents.”
  • “How do your oversight processes align to governance policy from the Govern function?”
  • “What happens when the AI is provided by a third party? Where is your oversight boundary?” 1

Hangups that delay closure:

  • Oversight exists informally in Slack/email with no durable record.
  • Multiple teams claim ownership; no single accountable process owner.
  • “Human review” is documented, but there is no trigger definition, so it rarely occurs.

Frequent implementation mistakes (and how to avoid them)

  1. Mistake: Defining oversight as a principle, not a process.
    Fix: Write workflow steps with triggers, roles, and records. If you cannot test it, it is not a process. 1

  2. Mistake: No assessment method.
    Fix: Add a simple test plan: walkthrough + sampling + incident/tabletop review, and document results. 1

  3. Mistake: Oversight is documented once, then forgotten.
    Fix: Put evidence on a recurring collection cycle tied to governance reporting.

  4. Mistake: Third-party AI treated as “outside your control.”
    Fix: Define your oversight boundary: input constraints, output monitoring, human review for high-impact decisions, and escalation to the third party with documented SLAs.

Enforcement context and risk implications

NIST AI RMF is a framework, not a regulator, so MAP-3.5 is not “enforced” by NIST as a penalty regime. The risk is practical: weak or undocumented human oversight becomes a liability in internal audits, regulator exams that evaluate model governance, customer disputes, incident response, and litigation discovery where you must show who reviewed what and when. MAP-3.5 reduces the chance that “the model did it” becomes an acceptable explanation. 2

Practical 30/60/90-day execution plan

Day 30: Define and scope

  • Inventory in-scope AI systems/use cases (focus on customer-impacting and safety/security-impacting first).
  • Publish a human oversight procedure template and a MAP-3.5 control narrative mapped to governance policy. 1
  • Assign owners and escalation paths for top-risk use cases.

Day 60: Implement oversight workflows + evidence capture

  • Roll out oversight matrices for each in-scope use case (pattern, triggers, roles, records).
  • Implement recordkeeping: approvals, override logs, QA sampling logs, ticket tagging.
  • Train reviewers and incident responders on oversight responsibilities.

Day 90: Assess and remediate

  • Run the first assessment cycle: design review + operating effectiveness sampling.
  • Document findings, exceptions, and remediation plans; report material gaps through governance forums.
  • Standardize recurring evidence collection (and automate reminders/workflows where possible). 1

Frequently Asked Questions

Does MAP-3.5 require a human to approve every AI decision?

No. MAP-3.5 requires defined, assessed, and documented oversight processes aligned to governance policy. For many use cases, “human-on-the-loop” monitoring with clear escalation and override authority can meet the intent if it is risk-appropriate and evidenced. 1

What counts as an “assessment” of human oversight?

An assessment tests whether the oversight process is well designed and actually operating. Common options include workflow walkthroughs, sampling of recent decisions or escalations, and reviews of incident/complaint handling records tied to oversight steps. 1

We use third-party AI tools. How do we show human oversight if we can’t see the model?

Document your oversight boundary: input controls, output monitoring, human review triggers, override/pause procedures in your operations, and escalation paths to the third party. Retain tickets, monitoring results, and decisions made by your staff. 1

What is the minimum documentation auditors expect for MAP-3.5?

A written oversight procedure tied to governance policy, named owners and escalation paths, and operating records that show oversight events occurred (approvals, samples, overrides, incidents) plus an assessment report with findings and remediation tracking. 1

How do we handle disagreements between product and compliance on oversight triggers?

Use governance policy as the tie-breaker and document a risk decision: define triggers based on impact severity, document rationale, and require a formal exception if a team wants less oversight than policy implies. Keep the exception approval and review it during assessments. 1

Can we satisfy MAP-3.5 with a single enterprise policy?

A policy helps, but MAP-3.5 also requires defined processes and evidence that they operate. You usually need system- or use-case-level procedures (or an attached control matrix) so the oversight is concrete and testable. 1

Footnotes

  1. NIST AI RMF Core

  2. NIST AI RMF program page


Operationalize this requirement

Map requirement text to controls, owners, evidence, and review workflows inside Daydream.

See Daydream