Outcome Digital Twin (Outcome Space + Repair Mixtures)
Helix is distribution-first: it models the outcome space (a discrete set of named terminal edits) and predicts a probability distribution over that space, rather than implicitly assuming a single “winning” edit.
That choice is the foundation for auditability, reproducibility, and downstream optimization.
What’s Unusual (Best-in-Class Signals)
1) Explicit outcome enumeration (named outcomes)
Outcomes are referential objects (e.g. del3@-2, del5@0), not anonymous histogram bins. That makes them:
- Discrete (finite outcome set)
- Indexable (stable keys for caches / exports)
- Diffable (regression tests can compare distributions by outcome name)
- Auditable (exports are deterministic and attributable)
2) Escape states are first-class
Helix represents “no cut / no edit” as probability mass, rather than treating repair as guaranteed. This is required for any credible digital-twin narrative (failure modes must be explicit).
3) Distribution-level reasoning
Helix computes and exposes distribution-level properties (dominance, entropy, tail risk), because the right question is not “what wins?” but “what is the full posterior over outcomes?”
The 4.5 → 5.0 Upgrade: Representation, Not New Math
Reviewers want to see mechanistic structure emerge from probabilities, not sit next to them as labels.
The minimal representation lift is a repair-mixture view:
Mixture identity:
P(outcome) = Σ_pathway P(pathway) × P(outcome | pathway)
This does not change the underlying simulation math; it makes the mechanistic story inspectable.
Repair pathway objects (derived)
Helix derives an explicit pathway decomposition from the outcome distribution and stable heuristics, exposed as objects in:
src/helix/studio/repair_pathways.pyRepairPathway(pathway, probability_mass, outcome_distribution)build_repair_pathway_mixture(...)build_stochastic_narrative_v1(...)
This is intentionally conservative: unless the simulator explicitly emits pathway labels, Helix treats the pathway assignment as a representation layer derived from outcome labels/categories.
Stochastic narrative (without sampling)
Even if the implementation is deterministic, the conceptual model is stochastic and is now surfaced explicitly:
- Cleavage occurs with probability
Pc(vsP(no_cut)). - Repair pathway choice is a categorical distribution conditional on cleavage.
- Outcome selection is the conditional distribution
P(outcome | pathway).
This is what reviewers mean by a “digital twin”: a visible branching model with explicit mixture weights.
Where You See It
Studio UI (Outcome Explorer)
- Outcome rows/bars carry a
pathwaytag. - A “Repair pathway mixture (derived)” readout is shown in Analysis/Debug detail levels.
- The Outcome Explorer filter includes a pathway selector to view distributions per pathway.
Exports
- Outcome CSV/HTML exports include a
pathwaycolumn. - Session report HTML renders a “Repair pathway mixture (derived)” section, including the mixture identity and a compact stochastic narrative when decision-grade probabilities are available.
Future-Proofing (No Rewrites Required)
When the simulator begins emitting explicit pathway objects, Helix can replace the heuristic classifier with ground-truth pathway provenance while keeping:
- the same mixture interface,
- the same export schema shape,
- the same UI controls,
- and the same deterministic audit story.