Settings
Configuration and calibration
| Parameter | Value |
|---|---|
beta_u
|
1.000 |
beta_n
|
1.000 |
beta_w
|
1.505 |
beta_pi
|
1.000 |
beta_e
|
0.800 |
gamma_un
|
0.200 |
gamma_uw
|
0.300 |
alpha_ai_precision
|
2.000 |
alpha_hybrid_uncertainty
|
0.800 |
alpha_human_conflict
|
1.510 |
| Threshold | Value |
|---|---|
u_critical
|
0.90 |
n_critical
|
0.90 |
w_critical
|
0.85 |
e_critical
|
0.90 |
w_prohibit_ai
|
0.70 |
n_prohibit_ai
|
0.80 |
u_prohibit_ai
|
0.80 |
governance_tier_high
|
0.70 |
governance_tier_low
|
0.30 |
min_confidence
|
0.30 |
surprise_escalation_threshold
|
0.70 |
surprise_tier_boost
|
1.00 |
surprise_threshold_adaptive
|
1.00 |
surprise_threshold_percentile
|
0.90 |
surprise_threshold_min
|
0.50 |
surprise_threshold_max
|
0.95 |
surprise_history_size
|
100.00 |
Calibrate allocation parameters based on recorded outcomes. This uses precision-weighted gradient descent to optimize parameters.
Loading...
Surprise-Triggered Auditing
When prediction error exceeds the threshold (default: 0.7), the system automatically:
- Escalates governance tier for more thorough documentation
- Adds
SURPRISE_ESCALATIONflag to the decision - Increases scrutiny for unexpected situations
Theoretical basis: In Active Inference, high "surprise" indicates model inadequacy, warranting increased human oversight.
Learning from Overrides (κ Multiplier)
When humans override AI recommendations, the system learns with boosted precision:
- Override precision: ωoverride = κ · ωbase, where κ = 1.5 (default)
- Escalations (AI→Human): Higher weight on safety corrections
- Delegations (Human→AI): System learns efficiency patterns
Effect: Unified score weights (wi) are adjusted via gradient descent: Δwi = η · κ · ωbase · εr · ∂r/∂wi
What the Metrics Mean
| Total Updates | Number of batch gradient descent updates performed. Each update adjusts parameters based on ~50-100 recorded outcomes. |
| Pending Outcomes | Outcomes waiting to be processed. A batch update triggers when this reaches the batch size threshold. |
| Mean Recent Error | Average prediction error over recent decisions. Lower is better. Target: < 0.15 |
| Std Recent Error | Variability in prediction errors. Lower values indicate more consistent performance. |
| Cumulative Error | Total accumulated error across all decisions. Used internally for gradient computation. |
| Override Learnings | Count of parameter updates from human overrides. When humans correct AI recommendations, the system learns from these corrections. |
| Escalation Corrections | Overrides where AI was corrected to Human mode. These use κ-boosted precision (κ=1.5) for safety learning. |
| Surprise Escalations | Times when high prediction error automatically escalated governance tier for more thorough documentation. |
How to Observe Improvement
- Run simulations with "Use Real API" enabled on the Simulation page to generate decisions
- Record outcomes - The system learns when actual outcomes are recorded via the API
- Watch Mean Error decrease - After multiple batch updates, the mean error should trend downward
- Click "Run Calibration" to trigger immediate parameter adjustment
Improvement Indicators
Unified Score Function & Learning
r(x;θ) = σ(Σi wi·φi(xi) + Σi<j wij·φij(xi,xj) + bias)
T(r) = AI if r < b₁, Hybrid if b₁ ≤ r < b₂, Human if r ≥ b₂
Δwi = η · ωobs · εr · σ'(z) · φi(xi)
ωobs = observation precision, εr = y - r̂ (score error), σ' = sigmoid derivative,
(b₁, b₂) = (0.3, 0.7) default thresholds
No error history available yet. Run simulations with outcomes to see the trend.
| Debug Mode | OFF |
| Log Level | info |
| Demo UI | ENABLED |
| Storage Backend | memory |
| Allocation Temperature | 1.0 |
| Auto-load Plugins | ON |
Run the automated test suite to verify system functionality. This executes all pytest tests and displays the results.
MCAT can be configured via environment variables:
# General
MCAT_DEBUG=true
MCAT_LOG_LEVEL=DEBUG
MCAT_ENABLE_DEMO_UI=true
# Unified Score weights (sign-constrained)
MCAT_UNIFIED_W_U=1.0 # Uncertainty (≥0)
MCAT_UNIFIED_W_N=1.0 # Novelty (≥0)
MCAT_UNIFIED_W_W=1.5 # Value conflict (≥0)
MCAT_UNIFIED_W_PI=-1.0 # Precision (≤0)
MCAT_UNIFIED_W_E=0.8 # Prediction error (≥0)
MCAT_UNIFIED_B1=0.3 # AI/Hybrid threshold
MCAT_UNIFIED_B2=0.7 # Hybrid/Human threshold
# Learning parameters
MCAT_LEARNING_RATE=0.01 # η - gradient learning rate
MCAT_LEARNING_KAPPA=1.5 # κ - override precision multiplier
# Thresholds
MCAT_THRESHOLD_U_CRITICAL=0.9
MCAT_THRESHOLD_W_CRITICAL=0.85
# Storage
MCAT_STORAGE_BACKEND=memory
MCAT_DATABASE_URL=sqlite+aiosqlite:///mcat.db