Lean Mechanization Research Plan

Research plan for mechanizing the fixed-topology staged-reduction core in Lean 4.

On this page

Summary
Core Definitions
Theorem Stack
Tier 1: Core algebraic lemmas
Tier 2: Closure operator laws
Tier 3: Recovery normalization
Tier 4: Structural recovery safety
Tier 5: Classification exhaustiveness
Explicit Non-Goals
Suggested Lean Module Structure
Expected Challenges
Relationship to the Paper Set
Internal Value
Related

Lean Mechanization Research Plan

Status: proposed Scope: machine-checked support for the fixed-topology staged-reduction core used by Papers 1 and 2

Summary

This plan turns the fixed-topology staged-reduction core into a Lean 4 mechanization target. The goal is not end-to-end workflow correctness or replayed I/O equivalence. The goal is to mechanize the structural safety contract the runtime actually relies on: commutative accumulation of node-local facts, idempotent failure closure, deterministic classification, and recovery safety after persisted-prefix crashes.

The central artifact is an explicit WellFormedGraphState structure, exposed under the published wellFormedGraphState predicate name, for normalized, closed states. Without that predicate, Paper 1’s recovery claim and Paper 2’s algebraic presentation remain too rhetorical.

Core Definitions

Mechanization starts by fixing the objects the paper set depends on:

finite DAGs with reachability and topological order
NodeStatus
GraphState
applyNodeFact
resetRunningToPending
propagateFailure
readyNodes
classifyGraphState

The hub predicate should be:

structure WellFormedGraphState (G : DAG) (s : GraphState) : Prop where
  topologyDomain : topologyDomain G s
  outputsRespectStatuses : outputsRespectStatuses s
  outputsCompleteForStatuses : outputsCompleteForStatuses s
  noRunningNodes : noRunningNodes s
  noInterruptedNodes : noInterruptedNodes s
  failureClosureComplete : failureClosureComplete G s
  causalHistoryClosed : CausalHistoryClosed G s
  frontierBridge : frontierBridge G s

Here frontierBridge means proof-level readyNodes G s coincides with the runtime-style direct-predecessor frontier directReadyNodes G s.

Theorem Stack

Tier 1: Core algebraic lemmas

theorem frontier_antichain (G : DAG) (s : GraphState) :
  PairwiseReachabilityIncomparable (readyNodes G s)

theorem applyNodeFact_comm (r1 r2 : NodeResult) (s : GraphState)
  (h : r1.nid ≠ r2.nid) :
  applyNodeFact r1 (applyNodeFact r2 s) =
  applyNodeFact r2 (applyNodeFact r1 s)

theorem NodeResult.applyNodeFacts_perm_invariant
  (results1 results2 : List NodeResult) (s : GraphState)
  (h_perm : results1 ~ results2)
  (h_distinct : results1.Pairwise (fun a b => a.nid ≠ b.nid)) :
  results1.foldl (flip applyNodeFact) s =
  results2.foldl (flip applyNodeFact) s

Tier 2: Closure operator laws

theorem propagateFailure_extensive (G : DAG) (s : GraphState) :
  s ≤ propagateFailure G s

theorem propagateFailure_monotone (G : DAG) {s1 s2 : GraphState}
  (h : GraphState.failureLe s1 s2) :
  GraphState.failureLe (propagateFailure G s1) (propagateFailure G s2)

theorem propagateFailure_idempotent (G : DAG) (s : GraphState) :
  propagateFailure G (propagateFailure G s) = propagateFailure G s

Tier 3: Recovery normalization

theorem resetRunning_preserves_domains (G : DAG) (s : GraphState) :
  domainsPreserved G s (resetRunningToPending s)

theorem resetRunning_no_running (s : GraphState) :
  noRunningNodes (resetRunningToPending s)

theorem closure_failure_complete (G : DAG) (s : GraphState) :
  failureClosureComplete G (propagateFailure G s)

theorem frontierBridge_of_closed_causal (G : DAG) (s : GraphState)
  (h_closed : failureClosureComplete G s)
  (h_causal : CausalHistoryClosed G s) :
  frontierBridge G s

Tier 4: Structural recovery safety

theorem persistence_safety
  (G : DAG) (S' : List NodeResult) (s0 : GraphState) :
  let sPersisted := S'.foldl (flip applyNodeFact) s0
  let sRecovered := propagateFailure G (resetRunningToPending sPersisted)
  wellFormedGraphState G sRecovered

This is the load-bearing theorem for the crash-recovery model. It should match Paper 1’s structural persistence-safety claim and Paper 2’s recovery theorem in intent.

Tier 5: Classification exhaustiveness

theorem classifyClosedGraphState_exhaustive_of_wellFormed
  (G : DAG) (s : GraphState)
  (h : wellFormedGraphState G s) :
  -- failed, progressing, completed, or suspended; never stuck
  ...

The classifier sits above recovery: well-formed recovered states can resume, settle, or suspend, but they cannot enter the stuck diagnostic branch.

Explicit Non-Goals

Do not expand this plan into:

end-to-end workflow correctness
replayed I/O equivalence
dynamic graph rewriting correctness
operator-facing persistence semantics

Those belong either to assumptions outside the fixed-topology kernel or to the separate rewrite-materialization track.

Suggested Lean Module Structure

DurableTask/
  Graph.lean
  State.lean
  Fact.lean
  Closure.lean
  Classify.lean
  Validity.lean
  Properties/
    Antichain.lean
    Commutativity.lean
    Closure.lean
    Recovery.lean
    Safety.lean

Validity.lean should appear early, not as an afterthought. The point of this plan is to make valid graph state a first-class theorem object.

Expected Challenges

Finite DAG representation may need an adapter from the runtime’s relation-style graph model.
Closure idempotence is the hardest proof burden in the current stack.
Waiting(signal) complicates any local status order because waiting carries data.
Output ownership is two-sided in the fixed-topology kernel: outputs must appear only where statuses own them, and completed/rewritten statuses must carry outputs.

Relationship to the Paper Set

Paper 1 uses the theorems operationally and reports implementation evidence.
Paper 2 states the sharpened algebraic claims and separates proof from assumption.
This plan is where closure laws and structural recovery safety become machine-checked.

The paper set should share the same names for:

wellFormedGraphState
disjoint-key accumulation commutativity
structural persistence safety
piecewise determinism assumption

If the names drift, the proof program becomes incoherent again.

Internal Value

This plan provides three concrete benefits:

it turns the recovery contract into an executable specification
it forces later extensions to name the invariant they depend on
it prevents implementation tests from being rhetorically upgraded into proofs

../../Publications/Paper-1-staged-reduction/ — runtime-grounded fixed-topology paper.
../../Publications/Paper-2-algebraic-foundations/ — algebraic companion paper.
rewrite-materialization-and-recovery.md — separate plan for dynamic topology.

Lean Mechanization Research Plan

Summary

Core Definitions

Theorem Stack

Tier 1: Core algebraic lemmas

Tier 2: Closure operator laws

Tier 3: Recovery normalization

Tier 4: Structural recovery safety

Tier 5: Classification exhaustiveness

Explicit Non-Goals

Suggested Lean Module Structure

Expected Challenges

Relationship to the Paper Set

Internal Value

Related