Paper 5 · Sensitivity study

Optimising audit friction while holding unsafe approvals accountable

Monorepo root commit: Not recorded in the public portfolio system_snapshot.json (v1.2, 2026-04-11T07:37:21Z) used for this binding. Not invented on this page.
Tier-0 shared-core commit (portfolio snapshot): cd9ad79fe16f34ad861bd6527670dcfbef8fe864
Paper 5 repository commit (released): 75e90ccf676374dc6fcfa79c92b55b13116fed06
Zenodo DOI: https://doi.org/10.5281/zenodo.19499806
Release version: v2.0.0 (portfolio release designation); CITATION.cff may list package version 1.0.0 — treat commit + DOI as authoritative if they diverge.
Page generated (UTC): 2026-04-12

Executive overview

Problem: Stricter gates reduce unsafe approvals but increase organisational friction; looser rules improve throughput but can explode rare harm. Policy makers rarely see the joint surface of those trade-offs under explicit uncertainty.

Why it matters: Designing thresholds without sensitivity and multi-objective reasoning invites accidental corner solutions—acceptable on one metric and unacceptable on another.

Core insight

Multi-objective search exposes a structured frontier: detection, safe throughput, false-negative harm, and friction can be traded off transparently. Global sensitivity analysis shows which gates dominate which outcomes—information committees can act on without reading simulation code.

What was done

The pipeline generates 10,000 synthetic systems per evaluation, executes scenario families across threshold profiles, runs NSGA-II to approximate Pareto surfaces, applies Sobol’ indices on seven parameters, models lifecycle drift with re-audit, performs bootstrap and Bayesian intervals, validates against seven internal tests, and enforces a replication tolerance protocol. Decision-curve style analysis supports preference-weighted policy comparison.

What was found

Under reported configurations, non-compensatory moderation achieves very high unsafe-case detection relative to a compensatory comparator while accepting lower throughput—consistent with the programme’s conjunctive design thesis. Pareto sets contain a minority of “sweet-spot” solutions under declared detection/throughput floors. Sobol’ results attribute dominant variance for key outcomes to identifiable gates. Lifecycle summaries quantify decay and harm accumulation with modest re-audit rates.

Why this matters for regulation, safety, and deployment

Resource planning: friction metrics make operational cost of governance visible alongside safety metrics.
Threshold setting: sensitivity indices show where small parameter changes swing outcomes—critical for oversight.
Procurement: supports evidence requests that go beyond single-number accuracy claims.

Limitations and ethics

Synthetic structural causal model—parameters are research artefacts, not national statistics. Environmental noise (e.g. messaging library warnings) may appear in logs; pipeline completion remains the authority per QA notes. Compensatory anchoring includes narrative-only regulatory references not reimplemented as numeric claims in code.

QA posture: Second independent run (2026-04-12) completed full reproduce_all.py with validation passed; traceability retains Traced row labels as in the engineering matrix (row-level VERIFIED tags were not promoted in-source).

View technical detail — notebook walkthrough (conceptual)

01_simulation_pipeline Documents the phased simulation driver (may mirror scripts/run_all.py when embedded runs are enabled).

02_results_and_tables Builds manuscript-aligned tables including per-100-deployment comparisons (P5-C14, P5-C18).

03_figures Frontier and diagnostic figures (P5-C19–C20).

04_sensitivity_and_inference Sobol exports, inference summaries, DCA outputs (P5-C21–C24, P5-C26–C27, P5-C30).

Closing this panel does not remove the executive conclusions already stated above.

Full claim traceability (P5-C01–P5-C32)

One-to-one with docs/claim_traceability.md in the pinned repository.

Claim ID	Claim (paraphrase / anchor)	Code / pipeline	Primary outputs	Status
P5-C01	Monte Carlo governance simulation with SCM linking latent traits to harms, signals, lifecycle	`scripts/run_all.py`; `src/scm/causal_model.py`; `scm_functions.yaml`	`outputs/processed/simulation_summary.json`	Traced
P5-C02	10,000 synthetic systems per evaluation	`config/parameters.yaml` → `simulation.n_systems`; `src/generators/system_generator.py`	`outputs/raw/scenario_results.csv`	Traced
P5-C03	Five scenario families × four threshold profiles × five replicates (100 evaluations)	`scripts/run_all.py` Phase 2; `config/scenarios/scenarios.yaml`	`outputs/raw/scenario_results.csv`	Traced
P5-C04	SCM DAG: 28 nodes, 19 edges (Figure 6)	`config/scm_graph.dot`; `src/scm/causal_model.py`	Console + `outputs/figures/fig6_scm.png`	Traced
P5-C05	Structural equations from versioned `scm_functions.yaml` v1.0	`scm_functions.yaml`; `apply_scm`	Scenario + summary JSON	Traced
P5-C06	Preregistered analysis plan hash 1fba2e24	`analysis_plan.yaml`; `src/preregistration/plan.py`	`outputs/processed/simulation_summary.json` (`manifest.plan_hash`)	Traced
P5-C07	NSGA-II: 30 generations, pop 60, 5D thresholds [0.1, 0.9], four objectives	`src/optimisation/nsga2_search.py`; Phase 5	`outputs/processed/pareto_solutions.csv`	Traced
P5-C08	Sweet-spot: detection ≥70% and safe throughput ≥50%	`scripts/run_all.py`; `identify_sweet_spot` in `nsga2_search.py`	`pareto_solutions.csv` (`in_sweet_spot`)	Traced
P5-C09	Sobol: 7 parameters (5 gates + noise + UBR), 64 base samples	`src/analysis/sensitivity.py`; Phase 7	`outputs/tables/sobol_*.csv`	Traced
P5-C10	Bootstrap n=500 + Bayesian rate intervals for key metrics	`src/analysis/inference.py`; Phase 9	`outputs/processed/inference_results.json`	Traced
P5-C11	Lifecycle: 12 periods, drift, re-audit	`src/analysis/lifecycle.py`; Phase 6	`simulation_summary.json` → `lifecycle_summary`	Traced
P5-C12	Seven internal validation tests (named suite)	`src/validation/validation_suite.py`; Phase 1	`outputs/logs/validation_report.json`	Traced
P5-C13	Replication protocol within ≤0.01 absolute tolerance	`src/replication/replication.py`; Phase 10	`reproducibility/replication_report.json`	Traced
P5-C14	Table 1 — four profiles at 20% UBR, moderate audit (detection, throughput, FN harm, friction)	Phase 12 table build; `notebooks/02_results_and_tables.ipynb`	`outputs/tables/table1_thresholds.csv`	Traced
P5-C15	Moderate NC detection 99.8% vs compensatory 77.2%	`GovernancePolicyEngine` vs `CompensatoryPolicyEngine`; Phase 3	`outputs/raw/compensatory_comparison.csv`; summary	Traced
P5-C16	False-negative harm 152× lower NC vs compensatory (2.1 vs 323.1)	Phase 3	`compensatory_comparison.csv`; manuscript Table 2 narrative	Traced
P5-C17	Throughput: NC moderate 21.3% vs compensatory 81.2%	Phase 2–3	`scenario_results.csv`; `compensatory_comparison.csv`	Traced
P5-C18	Expected unsafe approvals per 100 deployments: NC moderate 0.04 vs compensatory 4.56 (20% base)	Derived in `notebooks/02_results_and_tables.ipynb`	`outputs/tables/table2_per100_deployments.csv`	Traced
P5-C19	60 Pareto solutions; 20 in sweet-spot (~33%)	Phase 5	`pareto_solutions.csv`; `fig1_frontier.png`	Traced
P5-C20	Reported Pareto ranges (detection, throughput, FN harm, friction)	Phase 5 + figures	`fig1_frontier.png`; Table 3 / `table2_pareto.csv`	Traced
P5-C21	Sobol: safety gate ST = 1.00 for unsafe detection rate	`format_sensitivity_table`; Phase 7	`sobol_unsafe_detection_rate.csv`	Traced
P5-C22	Sobol: safety gate ST = 1.00 for false-negative harm	Phase 7	`sobol_false_negative_harm.csv`	Traced
P5-C23	Sobol: safe throughput — calibration gate dominant (ST ≈ 0.32)	Phase 7	`sobol_safe_throughput.csv`	Traced
P5-C24	Sobol: mean friction — safety gate ST ≈ 0.23	Phase 7	`sobol_mean_total_friction.csv`	Traced
P5-C25	Lifecycle: mean final decay ~0.033, mean cumulative harm ~5.05, re-audit ~10.1%	Phase 6	`simulation_summary.json`	Traced
P5-C26	Bootstrap: detection mean 0.996, CI [0.994, 0.998]	Phase 9	`inference_results.json`	Traced
P5-C27	Bootstrap: safe throughput 0.210 [0.203, 0.217]	Phase 9	`inference_results.json`	Traced
P5-C28	All seven validation tests passed	Phase 1	`validation_report.json`	Traced
P5-C29	Deterministic replication: zero absolute difference on key metrics	Phase 10	`replication_report.json`	Traced
P5-C30	Decision Curve Analysis supports policy comparison under preferences	`src/analysis/dca.py`; Phase 8	`outputs/processed/dca_results.json`; `fig5_dca.png`	Traced
P5-C31	Grid search over safety × evidence gates	`run_grid_search`; Phase 4	`outputs/processed/grid_results.json`	Traced
P5-C32	Companion historical replay / Core-12 statistics (91.7% sensitivity, specificity 1.000, etc.)	Cross-repo (P4) — manuscript cites unified engine; not recomputed in this repo	Escalation: verify against `ethical-alpha-audit-paper-4-historical-replay` frozen artefact	External