Shadow Mode
Shadow Mode implements a progressive autonomy model for AI agents. Instead of granting full autonomy on day one, agents earn independence by demonstrating alignment with human decisions over time. The model has three levels:

| Level | Behavior | Human Role |
|---|---|---|
| Shadow | Agent observes and records what it would recommend, but takes no action | Human decides independently |
| Supervised | Agent recommends an action, human confirms or overrides before execution | Human approves or rejects |
| Autonomous | Agent acts independently, human reviews after the fact | Human monitors |
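The three levels can be modelled as an ordered enumeration. The sketch below is illustrative only: `AutonomyLevel`, `next_level`, and `agent_may_execute` are hypothetical names chosen for this example, not part of any documented API.

```python
from enum import Enum
from typing import Optional


class AutonomyLevel(Enum):
    """The three levels from the table above, in promotion order."""
    SHADOW = "shadow"
    SUPERVISED = "supervised"
    AUTONOMOUS = "autonomous"


# Promotion order (hypothetical helper data, not a documented structure).
_ORDER = [AutonomyLevel.SHADOW, AutonomyLevel.SUPERVISED, AutonomyLevel.AUTONOMOUS]


def next_level(level: AutonomyLevel) -> Optional[AutonomyLevel]:
    """Return the level one step up, or None if already Autonomous."""
    idx = _ORDER.index(level)
    return _ORDER[idx + 1] if idx + 1 < len(_ORDER) else None


def agent_may_execute(level: AutonomyLevel, human_approved: bool = False) -> bool:
    """Shadow never acts; Supervised acts only after approval; Autonomous acts."""
    if level is AutonomyLevel.SHADOW:
        return False
    if level is AutonomyLevel.SUPERVISED:
        return human_approved
    return True
```

Keeping the order in one list means promotion and demotion logic never has to hard-code level names.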
Creating Shadow Sessions
A shadow session tracks one specific capability being evaluated. You define the skill (what the agent can do) and the decision class (what type of decision it involves).

| Field | Description |
|---|---|
| skill_name | The agent capability being evaluated (e.g., incident_response, code_review, deployment) |
| decision_class | The type of decision (e.g., escalation, approval, rollback, triage) |
| min_observations | Number of decisions required before graduation is possible |
| agreement_threshold | Minimum agreement rate (0.0 - 1.0) required to graduate |
| critical_miss_window_hours | Time window in which no critical misses can occur |
Each session is scoped to one skill and decision class pair, so the same agent can hold incident_response/escalation at Supervised while deployment/approval is still in Shadow.
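As a rough illustration of the session fields, the sketch below models them as a Python dataclass. The field names come from the table; the class name, the validation rules, and the `key` property are assumptions made for this example, and the numeric values are illustrative only.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class ShadowSessionConfig:
    """Hypothetical container for the session fields listed in the table."""
    skill_name: str
    decision_class: str
    min_observations: int
    agreement_threshold: float
    critical_miss_window_hours: int

    def __post_init__(self) -> None:
        # Validation rules are assumptions, not documented server behavior.
        if self.min_observations < 1:
            raise ValueError("min_observations must be at least 1")
        if not 0.0 <= self.agreement_threshold <= 1.0:
            raise ValueError("agreement_threshold must be between 0.0 and 1.0")
        if self.critical_miss_window_hours < 0:
            raise ValueError("critical_miss_window_hours must not be negative")

    @property
    def key(self) -> Tuple[str, str]:
        """Sessions are tracked per (skill, decision class) pair."""
        return (self.skill_name, self.decision_class)


# Two sessions for one agent, each progressing on its own.
escalation = ShadowSessionConfig("incident_response", "escalation", 25, 0.9, 168)
approval = ShadowSessionConfig("deployment", "approval", 50, 0.95, 168)
levels = {escalation.key: "supervised", approval.key: "shadow"}
```

Keying by the pair makes it explicit that progress in one decision class says nothing about another.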
Agreement Tracking
During a shadow session, every decision point follows the same flow:
Agent recommends
The agent analyzes the situation and records what it would recommend. In Shadow mode, this recommendation is silent — the human never sees it. In Supervised mode, the recommendation is shown to the human.
Human decides
The human makes the actual decision independently (Shadow) or in response to the agent’s recommendation (Supervised).
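Once both sides are recorded, the system compares them and updates the agreement rate. The sketch below shows one plausible way to compute that rate. It assumes agreement means an exact match between recommendation and decision, and that the rate is agreements divided by total observations; the source does not define either, and the class and field names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Observation:
    agent_recommendation: str
    human_decision: str
    critical: bool = False  # hypothetical flag for a high-stakes decision point

    @property
    def agreed(self) -> bool:
        # Assumption: agreement is an exact match of the two choices.
        return self.agent_recommendation == self.human_decision


@dataclass
class AgreementTracker:
    observations: List[Observation] = field(default_factory=list)

    def record(self, agent_recommendation: str, human_decision: str,
               critical: bool = False) -> Observation:
        """Store one decision point and return the stored observation."""
        obs = Observation(agent_recommendation, human_decision, critical)
        self.observations.append(obs)
        return obs

    @property
    def agreement_rate(self) -> float:
        """Fraction of observations where agent and human matched."""
        if not self.observations:
            return 0.0
        return sum(o.agreed for o in self.observations) / len(self.observations)


tracker = AgreementTracker()
tracker.record("escalate", "escalate")
tracker.record("hold", "escalate")      # disagreement
tracker.record("escalate", "escalate")
tracker.record("rollback", "rollback")
print(tracker.agreement_rate)  # 3 agreements out of 4 -> 0.75
```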
Graduation Gates
Graduation from one level to the next requires meeting three quantitative requirements simultaneously:

| Gate | Requirement | Example |
|---|---|---|
| Minimum observations | Enough decisions have been recorded to be statistically meaningful | 25 observations |
| Agreement threshold | Agreement rate meets or exceeds the configured threshold | 90% agreement |
| No recent critical misses | No critical disagreements within the lookback window | Zero critical misses in last 7 days |
Checking Eligibility
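The three gates can be evaluated together, as in the local sketch below. This is not the server's implementation: `check_graduation` and `GateResult` are hypothetical names, and the real check is exposed through the graduation-check endpoint listed in the API reference. The example values (25 observations, 90% agreement, a 7-day window expressed as 168 hours) mirror the table.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import List, Optional


@dataclass
class GateResult:
    enough_observations: bool
    agreement_met: bool
    no_recent_critical_miss: bool

    @property
    def eligible(self) -> bool:
        """All three gates must pass at the same time."""
        return (self.enough_observations
                and self.agreement_met
                and self.no_recent_critical_miss)


def check_graduation(total: int, agreements: int,
                     critical_miss_times: List[datetime],
                     min_observations: int, agreement_threshold: float,
                     critical_miss_window_hours: int,
                     now: Optional[datetime] = None) -> GateResult:
    """Evaluate each gate independently so callers can see which one failed."""
    now = now or datetime.now(timezone.utc)
    window_start = now - timedelta(hours=critical_miss_window_hours)
    rate = agreements / total if total else 0.0
    return GateResult(
        enough_observations=total >= min_observations,
        agreement_met=rate >= agreement_threshold,
        # Passes only if every critical miss predates the lookback window.
        no_recent_critical_miss=all(t < window_start for t in critical_miss_times),
    )


now = datetime(2024, 1, 15, tzinfo=timezone.utc)
result = check_graduation(30, 28, [now - timedelta(days=10)], 25, 0.9, 168, now=now)
blocked = check_graduation(30, 28, [now - timedelta(hours=12)], 25, 0.9, 168, now=now)
```

Here `result.eligible` is true because the only critical miss is ten days old, while `blocked` fails the third gate despite the same agreement rate.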
Promoting
Graduation is an explicit action; it never happens automatically. When all gates are met, the owner promotes the session by calling the graduate endpoint (POST /api/v1/shadow/sessions/{id}/graduate).
When to Use Shadow Mode
Shadow Mode is most valuable in four situations:
New Skills
When an agent gains a new capability (e.g., you add a deployment tool), start that skill in Shadow. Let the agent build a track record before it acts independently.
High-Risk Decision Classes
For decisions with significant consequences (production deployments, access grants, financial transactions), Shadow Mode provides a safety net even for well-established agents.
Post-Incident Recovery
After an agent makes a mistake, demote the relevant skill back to Shadow. Rebuild confidence through measured agreement before restoring autonomy.
New Agent Versions
When you update the underlying model or significantly change the agent’s prompts, re-enter Shadow Mode for critical skills. Model updates can change behavior in unexpected ways.
Integration
The agent plugin reports recommendations and receives human decisions through the shadow session API.
Plugin-Side Flow
- Agent encounters a decision point during normal operation
- Agent computes its recommendation and submits it to the session
- If in Shadow mode: recommendation is recorded silently, human decides independently
- If in Supervised mode: recommendation is shown to the human in the AgentVault dashboard
- Human decision is recorded via the API
- System compares and updates the agreement rate
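The flow above can be sketched as a single handler. Everything here is hypothetical: the function name, the callback signatures, and the `stats` dictionary are stand-ins for whatever the plugin and dashboard actually provide. The point is only that the recommendation is computed in both modes but shown to the human in Supervised mode alone.

```python
from typing import Callable, Dict, Optional


def handle_decision_point(level: str,
                          recommend: Callable[[], str],
                          ask_human: Callable[[Optional[str]], str],
                          stats: Dict[str, int]) -> Dict[str, object]:
    """Run one decision point through the plugin-side flow."""
    # The agent always computes and submits its recommendation.
    recommendation = recommend()
    # Shadow: the human sees nothing. Supervised: the recommendation is shown.
    shown = recommendation if level == "supervised" else None
    # The human decision is recorded either way.
    human_decision = ask_human(shown)
    # Compare and update the running agreement counts.
    agreed = recommendation == human_decision
    stats["total"] += 1
    stats["agreements"] += int(agreed)
    return {"recommendation": recommendation, "shown_to_human": shown,
            "human_decision": human_decision, "agreed": agreed}


stats = {"total": 0, "agreements": 0}
shadow_run = handle_decision_point(
    "shadow", lambda: "rollback", lambda shown: "rollback", stats)
supervised_run = handle_decision_point(
    "supervised", lambda: "rollback", lambda shown: "hold", stats)
```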
Recording from the Plugin
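A minimal sketch of building the recording request with Python's standard library follows. Only the endpoint path is taken from the API reference; the host, the JSON field names, and the bearer-token header are assumptions for illustration, so check them against the actual API before use.

```python
import json
from urllib.request import Request

BASE_URL = "https://agentvault.example.com"  # hypothetical host


def build_record_request(session_id: str, agent_recommendation: str,
                         human_decision: str, api_token: str) -> Request:
    """Build (but do not send) a POST to the record-human-decision endpoint."""
    payload = {
        # Field names are assumptions; the source does not document the body.
        "agent_recommendation": agent_recommendation,
        "human_decision": human_decision,
    }
    return Request(
        url=f"{BASE_URL}/api/v1/shadow/sessions/{session_id}/record-human-decision",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_token}",  # auth scheme is assumed
        },
        method="POST",
    )


req = build_record_request("sess_123", "escalate", "escalate", "TOKEN")
# Passing req to urllib.request.urlopen would perform the call; omitted here.
```

Separating request construction from sending keeps the payload easy to inspect and test without network access.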
API Reference
| Method | Endpoint | Description |
|---|---|---|
| POST | /api/v1/shadow/sessions | Create a new shadow session |
| POST | /api/v1/shadow/sessions/{id}/record-human-decision | Record an agent recommendation and human decision |
| GET | /api/v1/shadow/sessions/{id}/graduation-check | Check graduation eligibility against all gates |
| POST | /api/v1/shadow/sessions/{id}/graduate | Promote session to the next autonomy level |