Rewrite Materialization and Recovery Research Plan

Research plan for dynamic graph rewriting, materialization boundaries, and recovery semantics in durable execution.

On this page

Summary
Why This Is Separate
Rewrite-Capable Machine
Durable Identity Model
Persistence and Materialization
Rewrite Admission Contract
Proof Obligations
Boundary to Process Migration
Relationship to the Paper Set
Deferred Work
Related

Rewrite Materialization and Recovery Research Plan

Status: proposed Scope: runtime-grounded rewrite admission, materialization, and recovery semantics beyond the fixed-topology core

Summary

This plan is the runtime-grounded companion to Paper 3’s substitution theory. It studies topology-changing durable execution in which a running stage may propose a structured rewrite of the future execution graph. Papers 1 and 2 cover the fixed-topology kernel; they do not cover the case where accumulation, closure, and recovery are computed over a graph that may change mid-run.

The plan therefore centers on a stronger execution machine with an explicit materialization boundary and iterative control loop over materialized graph state. The focus is recovery semantics, not just rewrite syntax.

Why This Is Separate

Topology-changing execution is not a minor extension of the fixed-topology theorem.

Once a rewrite is admitted:

the topology changes
the valid-state predicate changes
downstream readiness changes
recovery must distinguish persisted lineage from durably materialized topology

The theory also needs to distinguish:

the planned or compiled graph artifact describing what obligations may arise
the materialized execution state describing what this run has actually reached

This is why the runtime rewrite story belongs in a plan of its own rather than as a subsection of the fixed-topology papers.

Rewrite-Capable Machine

The normative machine is an iterative control loop:

flowchart TD
    A[Compute Ready Frontier] --> B[Concurrent Node Execution]
    B --> C[Accumulate Node Facts]
    C --> D[Close and Classify]
    D --> E{Rewrite pending?}
    E -- Yes --> F[Admit and Materialize Rewrite]
    F --> A
    E -- No --> G{Ready frontier empty?}
    G -- No --> A
    G -- Yes --> H[Terminal]

Key rule:

an admitted rewrite must be materialized before the runtime may conclude completion, suspension, or stuckness from the old topology

The back-edge above is a runtime control loop, not a graph cycle. Each materialized topology epoch is still a DAG.

Durable Identity Model

The rewrite-capable runtime needs distinct identities for:

NodeId — durable execution identity
StageTemplateId — durable serialized template identity
StageActionId — executable identity used for hydration and retry validation
stageId — semantic stage label only

The plan should explain why these identities must stay distinct and what fails when stageId is overloaded as both semantic and executable identity.

Persistence and Materialization

The persistence model has three key pieces:

lineage — append-only rewrite history
materialization watermark — monotone marker such as applied_rewrite_id
remaining budget — durable rewrite-budget state consumed at materialization

Recovery should:

rebuild the graph only through the watermark
deterministically finish any admitted but not yet materialized rewrites
only then resume scheduling

This is closer to projection-checkpoint recovery than to raw log replay.

For selected latent branches, the admitted selected rewrite is the recovery fact. Recovery must replay that same selected branch and must not materialize unselected branch alternatives. The current model has no durable “discard branch” event; unselected arms remain sealed possibilities in the compiled artifact rather than becoming runtime state for the run.

Rewrite Admission Contract

Admitted rewrites must satisfy explicit structural conditions:

inserted nodes and stage definitions match exactly
entry and exit nodes are subsets of the inserted graph
required entry and exit sets are non-empty for the relevant constructors
resulting topology remains acyclic
anchor nodes exist in the current materialized plan
output-preserving rewrite modes keep the rewriting node’s output available for downstream dataflow

The current shipped rejection policy is intentionally narrow:

invalid or over-budget rewrites fail fast with a structured diagnostic

Configurable fallback or planner-visible rejection handling is future work, not part of the current contract.

Conditional branch actualization is a restricted case of the same admission path in the current runtime. Conceptually it consumes an owner-bound actualization capability for one sealed branch, but v1 materialization persists an ordinary admitted rewrite row for the selected branch. The selected branch consumes ordinary rewrite budget at actualization time; unselected branches do not prepay or consume budget.

Proof Obligations

The plan centers on the new proof burdens:

materialized-state validity for rewritten graphs
crash safety across lineage and materialization gaps
hydration safety from persisted template identity back to executable semantics
dataflow preservation for output-preserving rewrite modes
admitted-rewrite-before-termination correctness
selected-branch replay determinism: recovery materializes the same admitted selected branch without making unselected alternatives live
selected-cost budget preservation: only the admitted selected branch consumes runtime rewrite budget, and no sum-prepayment for the latent family is implied

The fixed-topology theorem can be reused only inside a single materialized topology epoch.

Boundary to Process Migration

Local graph substitution and non-local workflow migration should remain distinct:

local rewrites substitute a bounded node region and continue
migration reinterprets already executed structure, history, or state mapping

That boundary matters because future planner- or operator-driven graph surgery may stop being “just another rewrite constructor” and become a process-migration problem instead.

Relationship to the Paper Set

Paper 1 — fixed-topology experience report
Lean mechanization plan — machine-checked support for the fixed-topology core
Paper 2 — algebraic theory of the fixed-topology core and extension boundary
This plan — rewrite-capable execution, materialization, and recovery semantics
Paper 3 — graph substitution semantics, compiled artifacts, and the implementation-independent layering above this runtime track

This split keeps the fixed-topology theorem honest while giving rewriting enough room to be treated seriously.

Deferred Work

The following are explicitly outside the current shipped and theory contract:

true concurrent rewrite admission within one frontier wave
configurable rewrite rejection fallback instead of fail-fast only
richer operator-facing graph visualization and rewrite provenance
hierarchical child workflows with explicit sub-budget assignment
reserved latent-branch capacity, including max-branch reservation or escrow rules for every compiled branch family
integrity witnesses stronger than the current watermark model, such as materialized-graph checksums

../../Publications/Paper-2-algebraic-foundations/ — fixed-topology algebra and extension boundary.
../../Publications/Paper-3-graph-substitution-semantics/ — implementation-independent substitution theory.
../../Architecture/07-rewrites-and-materialization.md — canonical runtime architecture chapter.
../../Reference/Wire/conditionality.md — guarded-affine select(...) and selected-cost branch semantics.
../../ADRs/0036-wire-latent-branch-budget-recovery.md — current latent-branch budget and recovery policy.

Rewrite Materialization and Recovery Research Plan

Summary

Why This Is Separate

Rewrite-Capable Machine

Durable Identity Model

Persistence and Materialization

Rewrite Admission Contract

Proof Obligations

Boundary to Process Migration

Relationship to the Paper Set

Deferred Work

Related