Why an AI Assurance Layer Becomes Structurally Necessary for AI Governance— A Lean 4 Machine Verification of ADIC’s Mathematical CoreIn One Sentence
- kanna qed
- 6 日前
- 読了時間: 3分
This Lean 4 formalization machine-checks that if AI governance requires judgment grounds to remain distinguishable, then a faithful evidence layer is structurally necessary.
— -
The Intuition
A medical AI recommends “prioritize this drug.” Only the result is recorded. But was the decision based on blood test values, prior conditions, or a physician’s override? If that information is gone, safety verification after the fact is impossible.
This Lean formalization says: the problem of that information “disappearing” is not just a best-practice concern. It can be demonstrated with a concrete mathematical counterexample that when the evidence layer is weak, structurally important distinctions collapse.

— -
What Assumptions Were Made
The core framing is this:
”Rather than observing AI operations alone, we consider a world where judgment grounds, evidence, and responsibility records are tracked alongside those operations.”
Concretely, the following structures are placed:
- O: The operational category — the world of AI actions (“route this shipment”, “prioritize this drug”)
- F: Evidence fiber families — audit records, sensor readings, responsible parties, attached to each operational state
- E = ∫_O F: The total category combining operations and evidence (Grothendieck construction)
- push / pull: Evidence moves forward as operations proceed (push) and can be traced backward during audits (pull)
- push ⊣ pull (adjunction): Forward update and backward audit are two sides of the same structure
- standard section: Each operational state has a corresponding standard evidence state
- beck_chevalley: Evidence updates and audit traces remain consistent across organizational viewpoint changes (this condition is held as a structural field; it is not derived in general)
All of these assumptions are stated explicitly in the README and the Lean file. There are no hidden premises.
— -
What Was Proved
1. When the evidence layer exists: operations and evidence remain jointly traceable after composition
When operation A is followed by operation B, the evidence traces compose accordingly and remain traceable (`opcart_factor`, `cart_factor`). This is a machine-checked confirmation that the “path of judgment” does not break under composition.
2. When the evidence layer is absent: distinct judgment grounds can appear identical
This is the core result.
A concrete finite category was constructed and the following was machine-verified:
```
traceA ≠ traceB — judgment ground A and judgment ground B are distinct
U(traceA) = U(traceB) — but without evidence, both map to the same visible operation
```
Theorem name: `forgetting_trace_layer_can_collapse_distinctions`
This is not a claim about intuition or industry practice. It is a counterexample accepted by Lean. Anyone with Lean can verify it independently.
— -
Why This Matters for ADIC
ADIC’s claim — “AI judgment grounds should be preserved in a form verifiable by third parties after the fact” — has until now been a policy position.
With this Lean formalization, it becomes a claim backed by machine-verified evidence: when the evidence layer is weak, the distinction between judgment grounds can structurally collapse — and this is machine-verified, not asserted.
It is not a marketing claim. It exists as a mathematical skeleton accepted by Lean.
— -
Honest Limitations
What this Lean does not prove:
- That any real-world AI system is safe
- EU AI Act compliance
- That ADIC’s implementation is correct
- beck_chevalley (cross-organizational viewpoint consistency) is held as a structural assumption, not a general derivation
What was proved is the mathematical skeleton. Application to real systems is a separate matter.
— -
How to Verify
The repository is publicly available. Anyone with Lean 4 and Mathlib can run the type-checker independently.
```bash
lake update
lake exe cache get
lake env lean AIAssurance.lean
```
If the command returns without output, all theorems have passed type-checking.



コメント