8 Settings / 8

Settings

Configuration and calibration

Allocation Parameters

Parameter	Value
`beta_u`	1.000
`beta_n`	1.000
`beta_w`	1.500
`beta_pi`	1.000
`beta_e`	0.800
`gamma_un`	0.200
`gamma_uw`	0.300
`alpha_ai_precision`	2.000
`alpha_hybrid_uncertainty`	0.800
`alpha_human_conflict`	1.500

Safety Thresholds

Threshold	Value
`u_critical`	0.90
`n_critical`	0.90
`w_critical`	0.85
`e_critical`	0.90
`precision_floor`	0.20
`w_prohibit_ai`	0.70
`n_prohibit_ai`	0.80
`u_prohibit_ai`	0.80
`governance_tier_high`	0.70
`governance_tier_low`	0.30
`min_confidence`	0.30
`surprise_escalation_threshold`	0.70
`surprise_tier_boost`	1.00
`surprise_threshold_adaptive`	1.00
`surprise_threshold_percentile`	0.90
`surprise_threshold_min`	0.50
`surprise_threshold_max`	0.95
`surprise_history_size`	100.00

Parameter Calibration

Calibrate allocation parameters based on recorded outcomes. This uses precision-weighted gradient descent to optimize parameters.

Calibration Results

Learning Statistics

Surprise-Triggered Auditing

When prediction error exceeds the threshold (default: 0.7), the system automatically:

Escalates governance tier for more thorough documentation
Adds SURPRISE_ESCALATION flag to the decision
Increases scrutiny for unexpected situations

Theoretical basis: In Active Inference, high "surprise" indicates model inadequacy, warranting increased human oversight.

Learning from Overrides (κ Multiplier)

When humans override AI recommendations, the system learns with boosted precision:

Override precision: ω_override = κ · ω_base, where κ = 1.5 (default)
Escalations (AI→Human): Higher weight on safety corrections
Delegations (Human→AI): System learns efficiency patterns

Effect: Unified score weights (w_i) are adjusted via gradient descent: Δw_i = η · κ · ω_base · ε_r · ∂r/∂w_i

How System Learning Works

What the Metrics Mean

Total Updates	Number of batch gradient descent updates performed. Each update adjusts parameters based on ~50-100 recorded outcomes.
Pending Outcomes	Outcomes waiting to be processed. A batch update triggers when this reaches the batch size threshold.
Mean Recent Error	Average prediction error over recent decisions. Lower is better. Target: < 0.15
Std Recent Error	Variability in prediction errors. Lower values indicate more consistent performance.
Cumulative Error	Total accumulated error across all decisions. Used internally for gradient computation.
Override Learnings	Count of parameter updates from human overrides. When humans correct AI recommendations, the system learns from these corrections.
Escalation Corrections	Overrides where AI was corrected to Human mode. These use κ-boosted precision (κ=1.5) for safety learning.
Surprise Escalations	Times when high prediction error automatically escalated governance tier for more thorough documentation.

How to Observe Improvement

Run simulations with "Use Real API" enabled on the Simulation page to generate decisions
Record outcomes - The system learns when actual outcomes are recorded via the API
Watch Mean Error decrease - After multiple batch updates, the mean error should trend downward
Click "Run Calibration" to trigger immediate parameter adjustment

Improvement Indicators

Optimized Mean Error < 0.15 — System is well-calibrated

Learning Mean Error 0.15 - 0.30 — System is adapting

Needs Work Mean Error > 0.30 — Run calibration

Unified Score Function & Learning

Unified score function (sigmoid of weighted features):

                                r(x;θ) = σ(Σi wi·φi(xi) + Σi<j wij·φij(xi,xj) + bias)
                            Mode thresholding:

                                T(r) = AI if r < b₁, Hybrid if b₁ ≤ r < b₂, Human if r ≥ b₂
                            Gradient learning rule:

                                Δwi = η · ωobs · εr · σ'(z) · φi(xi)
                            
                                Where: x = (U, N, W, π̃, ε̃) ∈ [0,1]⁵ dimensional vector, η = learning rate (default 0.01),

                                ωobs = observation precision, εr = y - r̂ (score error), σ' = sigmoid derivative,

                                (b₁, b₂) = (0.3, 0.7) default thresholds

Error Trend Over Time

No error history available yet. Run simulations with outcomes to see the trend.

Current Status: -

Trend: -

General Settings

Debug Mode	OFF
Log Level	`info`
Demo UI	ENABLED

Storage Backend	`memory`
Allocation Temperature	1.0
Auto-load Plugins	ON

System Self-Test

Run the automated test suite to verify system functionality. This executes all pytest tests and displays the results.

Configuration via Environment

MCAT can be configured via environment variables:

# General
MCAT_DEBUG=true
MCAT_LOG_LEVEL=DEBUG
MCAT_ENABLE_DEMO_UI=true

# Unified Score weights (sign-constrained)
MCAT_UNIFIED_W_U=1.0      # Uncertainty (≥0)
MCAT_UNIFIED_W_N=1.0      # Novelty (≥0)
MCAT_UNIFIED_W_W=1.5      # Value conflict (≥0)
MCAT_UNIFIED_W_PI=-1.0    # Precision (≤0)
MCAT_UNIFIED_W_E=0.8      # Prediction error (≥0)
MCAT_UNIFIED_B1=0.3       # AI/Hybrid threshold
MCAT_UNIFIED_B2=0.7       # Hybrid/Human threshold

# Learning parameters
MCAT_LEARNING_RATE=0.01   # η - gradient learning rate
MCAT_LEARNING_KAPPA=1.5   # κ - override precision multiplier

# Thresholds
MCAT_THRESHOLD_U_CRITICAL=0.9
MCAT_THRESHOLD_W_CRITICAL=0.85

# Storage
MCAT_STORAGE_BACKEND=memory
MCAT_DATABASE_URL=sqlite+aiosqlite:///mcat.db

Settings

Calibration Results

Surprise-Triggered Auditing

Learning from Overrides (κ Multiplier)

What the Metrics Mean

How to Observe Improvement

Improvement Indicators

Unified Score Function & Learning

Summary

Result

Test Output

Individual Tests