Polymodal Computation¶

This research track extends the post-textual direction of Fixed-Semantic Spatial Computation into a modality-agnostic computational substrate.

The earlier prototype proved that 2D space can be a native computational surface. Polymodal computation asks the next question:

If 2D space can host a non-textual program, can the same program be rendered, perceived, and edited in any modality — visual, sonic, haptic, kinetic, spatial-3D — without losing its identity?

Why "polymodal" and not "multimodal"¶

The repository already uses multimodal for image/audio/video payloads flowing through AI calls. That meaning is about data types. Polymodal computation is about program identity: one program, many peer perceptual realizations.

The distinction matters because every modality in this architecture is a peer. No modality is primary; the 2D spatial demo, the 1D linear strip, the 3D volumetric scene, the sonic WebAudio composition, and the MIDI piano roll are all realizations of the same semantic core.

Layered model¶

Layer	Question it answers	Concrete artifact
Semantic core	What primitives, with what intensities and relations?	`semantic-core-v0` manifest. Modality-free. No coordinates. No frequencies. Now carries derived containment relations.
Modality projection	How is this realized in modality M?	`linear-seed-v0`, `spatial-seed-v0`, `volumetric-seed-v0`, `sonic-seed-v0`, `midi-seed-v0` — generated per-modality
Modality runtime	How does a perceiver experience it?	Canvas (linear, spatial, volumetric, MIDI), WebAudio (sonic)
Modality capture	How can authoring happen in this modality?	Inverse of projection — sonic captures landed first (Python + browser microphone), other modalities follow

The semantic core is the canonical program. Every modality is a peer. Text is one of those authoring surfaces for polymodal programs, but it is not the only one and it is not required to be the interchange format between modalities.

Compatibility principle¶

Polymodal computation must not break the existing textual Multilingual language. The safe boundary is:

text .multi
spatial edits
sonic capture
MIDI capture
3D edits
haptic / kinetic input
        |
        v
semantic-core-vN
        |
        v
text projection / WAT / Python / modality projections

The textual language keeps its parser, AST, language packs, Python backend, WAT/WASM backend, and compatibility tests. Polymodal computation is an additional semantic profile layered beside the current language, not a replacement for source syntax.

This distinction is load-bearing:

Compatibility belongs to the textual language. Existing .multi and .ml programs should continue to parse, transpile, execute, and compile as before.
Equivalence belongs to the semantic core. A polymodal program is equivalent across surfaces only to the extent that each surface preserves the declared semantic facts of semantic-core-vN.
Text regeneration is optional. A spatial or sonic capture should reconstruct the semantic core directly; generated text may be useful for inspection, teaching, or archival purposes, but it should not be the authority through which every modality must pass.

Not every Multilingual program needs to be polymodal. The project can support a broad language and a stricter polymodal profile:

Multilingual language
  includes ordinary textual programs
  includes Python/WAT-compatible programs
  includes polymodal seed programs

Only programs in the polymodal profile are required to satisfy cross-modal equivalence. This protects existing source compatibility while letting the research track evolve quickly.

Semantic versioning of program identity¶

The semantic core must be versioned aggressively. A semantic-core-v0 manifest should always mean what it meant when it was emitted. If a future semantic-core-v1 adds explicit coupling, temporal ordering, stable entity IDs, richer relation kinds, or perceptual capability metadata, old manifests should be handled by explicit migration:

semantic-core-v0 -> semantic-core-v1

The migration may enrich or annotate old programs, but it must not silently reinterpret old fields. This gives the project two kinds of stability at once: textual source compatibility for existing Multilingual programs, and manifest compatibility for polymodal programs.

As editing becomes bidirectional, entity identity should move from index-only ordering toward stable IDs:

{
  "id": "ent_7f3a",
  "index": 0,
  "opcode": 6,
  "intensity": 0.8,
  "signal": 0.4,
  "phase": 0.25,
  "channel": 2
}

Indexes remain useful for deterministic ordering and compact manifests. Stable IDs preserve identity when a user drags, deletes, records, inserts, groups, or reorders entities in a non-textual surface.

Perceptual capability contracts¶

Every modality should declare what it preserves, what it loses, and what it cannot disambiguate. Equivalence should be checked against that contract rather than assumed.

{
  "projection": "sonic-seed-v0",
  "preserves": ["opcode", "intensity", "phase", "channel"],
  "lossy": ["signal"],
  "ambiguous": ["diffuse/stabilize/resonate", "attract/repel"]
}

This makes room for several equivalence levels:

Level	Meaning
Exact-equivalent	The modality can recover the declared semantic facts exactly.
Behavior-equivalent	The modality preserves behavior even if presentation details differ.
Lossy-equivalent	The modality preserves a documented subset and marks the rest as lost.
Ambiguous-by-design	The modality cannot distinguish some semantic identities and says so.
View-only	The modality renders the program but does not claim inverse recovery.
Non-invertible	The modality is useful as output, but not as an authoring surface.

The sonic round-trip already follows this discipline: opcodes whose (role, waveform, envelope) tuple is unique are invertible, while shared signatures are excluded from the invertible subset instead of pretending to be lossless. Future MIDI, spatial, volumetric, haptic, and kinetic capture paths should make the same kind of declaration.

From structure to process: `semantic-core-v1` (proposed)¶

semantic-core-v0 describes a static structure: a flat set of entities with fixed scalar fields, plus derived containment. It is faithfully projectable into every modality, and that is a real result. But it cannot express the systems that make computation feel alive — cellular automata, Game of Life, fractals, reaction-diffusion, predator-prey ecosystems, flocking, growth. Those are not structures. They are rules unfolding over time, and v0 has nowhere to put a rule: the actual dynamics live, hand-written, inside each modality's runtime (e.g. spatial_runtime.js::step(dt)), which also means cross-modal equivalence is only proven for frame 0, not for the computation itself.

The answer is not more opcodes. A bag of behaviors is never universal; you chase the next phenomenon forever. Universality comes from finding the single abstraction that every one of these systems already is, and making it the core. That abstraction is the dynamical / rewriting system, and it decomposes into four orthogonal axes.

The four axes of any computational universe¶

Axis	Question	v0 today	Universal form
State	what does each locus carry?	five fixed scalars	a typed, possibly heterogeneous record (`{alive}`, `{z: ℂ}`, `{species, energy, age}`)
Topology	what interacts with what?	re-derived from coordinates per runtime	a declared object answering one query: `neighbors(locus) -> set`
Rule	how does a locus change?	hard-coded in `step()` per modality	a rewrite rule: `match local pattern -> produce replacement`
Schedule	when do rules fire?	implicit animation frame	declared: synchronous / asynchronous / continuous-`dt` / generative-depth

A semantic-core-v1 polymodal program graduates from a bag of entities into the tuple ⟨State, Topology, Rule, Schedule⟩. Every component is still declarative, modality-free data, so it projects and round-trips through modalities exactly as entities do today.

Rewriting is the one meta-primitive¶

The load-bearing move is making the rewrite rule the single universal primitive. match a local pattern -> produce a replacement is Turing-complete (it is what term rewriting, graph grammars, Rule 110, and Wolfram-style physics all run on). Three capabilities v0 lacks fall out of this one primitive for free:

Generativity / recursion — when the replacement is larger than the match, you get production rules: L-systems, fractals, growth.
Open population — when a match creates or destroys loci, you get birth, death, and reproduction: ecosystems. (This is exactly why v0's "preserves entity count" invariant must relax to "preserves the rule's trajectory.")
Iteration in value-space — when topology is "neighbors = self" and the space is the value, you get Mandelbrot/Julia and strange attractors.

The twelve v0 opcodes do not disappear: they become a standard library of rules expressed in this calculus, not built-in magic. That is the difference between a language and a feature list.

The same shape, four times¶

The proof of universality is that systems we call completely different are one form with four fillings:

Game of Life    State {alive}    Topology lattice/Moore-8    Rule n==3 ∨ (alive∧n==2)        Schedule synchronous
Mandelbrot      State {z:ℂ}      Topology self (plane ℂ)     Rule z' = z² + c               Schedule iterate-N
L-system plant  State {symbol}   Topology sequence -> tree   Rule F -> F[+F]F[-F]F (grows)   Schedule depth-N
Predator-prey   State {sp,e,age} Topology continuous/radius  Rules eat -> e += k, kill prey; Schedule async dt
                                                                   e>θ -> spawn; e<0 -> die

None of these needs a new opcode. The broader landscape lands the same way: iterated maps and IFS fractals, elementary CA (Rule 30/110), Langton's ant, reaction-diffusion / Turing patterns (Gray-Scott), Ising/Monte-Carlo, DLA, percolation, boids, Lotka-Volterra, Schelling segregation, ant stigmergy, n-body and SPH physics, neural and Boolean networks, SIR epidemics, von Neumann self-replication, and hypergraph rewriting — each a filling of ⟨State, Topology, Rule, Schedule⟩.

Dynamics become polymodal too¶

Because rules are data, rules project to every modality just like entities do — and that is where the time-based modalities finally get first-class content:

Video is the modality v0 had no home for, because v0 had no time. Here, video = spatial × the Schedule axis: the rule's firing is the frame clock. Video is to the rule axis what linear was to the dimensionality axis — the modality that forces the new dimension to be real.
Sonic: a rule is an effect/transformation node; reaction-diffusion becomes evolving timbre; a CA becomes generative rhythm. MIDI: rules as generative sequencers.
Topology projects too: lattice -> grid, graph -> node-link, continuous -> canvas positions. The neighborhood query renders.

A non-negotiable consequence: the stepper that advances ⟨State, Topology, Rule, Schedule⟩ must be specified once in the modality-free core and compiled into each runtime — never hand-written per modality. Otherwise two runtimes evolve the same program differently and cross-modal equivalence dies the instant the program animates.

The honesty frontier: expressiveness tiers¶

Universality has a cost worth naming: a Turing-complete rule can express things with no natural sound or shape, so not every program can be authored (inverted) in every modality. The capability-contract discipline above generalizes from "this opcode is lossy in sonic" to "this rule-class is view-only in modality M." Expressiveness should be stratified into tiers, each adding power and possibly shrinking the set of modalities that can invert it:

Tier	Expressiveness	Examples	Invertibility
0	static structure	the current entity set	invertible everywhere (this is v0)
1	fixed-population continuous dynamics	particles, boids, oscillator nets, iterated maps	fully projectable; per-modality inverse
2	synchronous lattice / field rules	Game of Life, reaction-diffusion, Turing patterns	projectable; inverse per contract
3	generative / open-population rewriting	fractals, L-systems, ecosystems	projects as output; authoring-invertible only in some modalities
4	full graph / hypergraph rewriting	universal computation	mostly view-only / non-invertible

This ladder is the precise answer to "is polymodal computation universal?": yes at the representation and output level (Tier 4), with a clearly marked frontier beyond which perceptual authoring degrades. That is not a retreat — it is the honest statement of what polymodal universality can claim.

A subtlety the implementation makes concrete: the tier is the class ceiling, not a per-program verdict. A program may sit at Tier 4 because its topology is a graph (graphs can host structure-changing rewriting, which is view-only) and yet, if that particular program only rewrites node states on a fixed graph, project and invert exactly. The network-contagion example does exactly this — it is Tier 4 but its projection round-trips every node. So the tier (read from the axes by expressiveness_tier / tierOf) and the concrete inverse level (read from coverage by each projection's capability contract) are orthogonal: the tier says how expressive the form is, the contract says what this projection of this run recovers.

Cheapest falsifiable first step¶

Mirror how linear cheaply proved the dimensionality axis. Implement only ⟨State, Topology=lattice, Rule, Schedule=synchronous⟩, run Game of Life through it, and project that same rule-bearing manifest to two modalities — spatial canvas and sonic (live cells -> rhythmic voices) — driven by one shared, modality-free stepper. If one rewrite-rule manifest animates Life identically in two surfaces, the dynamics layer is proven, and the stepper-in-the-core requirement is satisfied by construction.

semantic-core-v1 lands beside v0 under the same aggressive versioning rule: v0 manifests keep meaning exactly what they meant, and a v0 program is simply a v1 program with an empty rule set and a single-step schedule.

The opcode ontology¶

A single ontology (opcode_ontology.py) is the source of truth for every primitive. Each opcode entry carries a stable integer code, a canonical name, and per-modality realization hints:

Layer	Field
Semantic	`code`, `name`, `description`
Linear (1D)	`linear.glyph`, `linear.color`
Spatial (2D)	`spatial.shape`, `spatial.color`
Volumetric (3D)	`volumetric.primitive`, `volumetric.color`
Sonic (WebAudio)	`sonic.role`, `sonic.waveform`, `sonic.envelope`
MIDI (events)	`midi.role`, `midi.pitch`, `midi.velocity`

Adding a new primitive is one row; adding a new modality is one new hint group. Cross-modal coherence is enforced by tests, not convention.

The vocabularies are kept disjoint by design: linear glyphs do not include 2D shape names ("ring", "diamond", "membrane"), volumetric primitives do not reuse 1D or 2D names, and MIDI roles (note, drum, cc, program, bus) form their own taxonomy. Tests in polymodal_equivalence_test.py enforce these separations so dimensionality and modality cannot be silently conflated.

What lives where¶

multilingualprogramming/codegen/
  opcode_ontology.py      # the only source of opcode truth
  semantic_core.py        # build semantic-core-v0 manifests (modality-free);
                          # derives containment relations
  linear_manifest.py      # 1D linear projection   -> linear-seed-v0
  spatial_manifest.py     # 2D spatial projection  -> spatial-seed-v0
  volumetric_manifest.py  # 3D volumetric projection -> volumetric-seed-v0
  sonic_projection.py     # sonic projection        -> sonic-seed-v0
  midi_manifest.py        # MIDI projection         -> midi-seed-v0
  sonic_capture.py        # inverse sonic projection (Python)
  midi_capture.py         # inverse MIDI projection (Python)
  spatial_capture.py      # inverse spatial projection (Python)

docs/browser/spatial-dynamics/
  program.multi           # authored polymodal seed (canonical source)
  program.spatial.json    # generated 2D projection
  ontology.json           # generated ontology sidecar
  spatial_runtime.js      # Canvas 2D runtime + capture wiring
  spatial_capture.js      # inverse projection (JS)
  index.html

docs/browser/linear-dynamics/    # 1D peer
  program.linear.json
  ontology.json
  linear_runtime.js              # Canvas 1D timeline
  index.html

docs/browser/volumetric-dynamics/  # 3D peer
  program.volumetric.json
  ontology.json
  volumetric_runtime.js            # Canvas with hand-rolled 3D projection
  index.html

docs/browser/sonic-dynamics/       # WebAudio peer with capture
  program.sonic.json
  ontology.json
  sonic_runtime.js                 # WebAudio forward + capture wiring
  sonic_capture.js                 # inverse projection (JS)
  microphone_capture.js            # mic -> ObservedVoice pipeline
  index.html

docs/browser/midi-dynamics/        # discrete-event peer
  program.midi.json
  ontology.json
  midi_runtime.js                  # piano-roll viz + Web MIDI Out
  midi_capture.js                  # inverse projection (JS)
  index.html

Building¶

multilingual polymodal-build  docs/browser/spatial-dynamics/program.multi \
  --out /tmp/program.semantic.json

multilingual linear-build     docs/browser/spatial-dynamics/program.multi \
  --out docs/browser/linear-dynamics/program.linear.json

multilingual spatial-build    docs/browser/spatial-dynamics/program.multi \
  --out docs/browser/spatial-dynamics/program.spatial.json

multilingual volumetric-build docs/browser/spatial-dynamics/program.multi \
  --out docs/browser/volumetric-dynamics/program.volumetric.json

multilingual sonic-build      docs/browser/spatial-dynamics/program.multi \
  --out docs/browser/sonic-dynamics/program.sonic.json

multilingual midi-build       docs/browser/spatial-dynamics/program.multi \
  --out docs/browser/midi-dynamics/program.midi.json

multilingual ontology-export  --out docs/browser/spatial-dynamics/ontology.json

The same program.multi flows into every projection. That is the architectural claim.

The load-bearing test of this architecture lives in polymodal_equivalence_test.py. It loads one semantic core, projects to all five peer manifests, and asserts that:

entity counts agree across every projection,
opcode ordering agrees,
intensity, phase, and channel are preserved by every projection (translated as needed: position for linear, z for volumetric, start_offset for MIDI, etc.),
every projection's modality fields match the ontology's hints,
structural relations derived at the semantic core flow through to every projection that exposes them.

If a future modality drops, renames, or reorders entities relative to the core, this test fails. That is what prevents the modalities from silently drifting apart as the project grows.

Relations¶

The semantic core records structural relations between entities so the peer projections share a single source of truth for structure (not just for entity fields). Currently inferred:

Containment. Each contain-opcode entity on channel C contains every non-contain entity on channel C. Multiple containers on the same channel produce overlapping containments. Lonely contain entities (no other entities on their channel) emit no relation — empty membranes are not load-bearing structure.

Coupling and temporal ordering are deferred until the source language gains explicit syntactic surface for them: inferring those from raw structure risks codifying arbitrary heuristics that become hard to change.

Future relation kinds should become first-class semantic facts rather than projection heuristics:

Coupling. Entity A modulates or depends on entity B.
Inhibition. Entity A suppresses entity B when a threshold is met.
Temporal ordering. Entity A precedes, triggers, or schedules entity B.
Synchronization. Entities share phase, clock, pulse, or cadence.
Routing. Signals move through explicit paths or channels.
Causality. A relation records that one event or state transition produces another.

These relations should enter the semantic core through explicit authoring surfaces or language syntax, not through hidden inference. Once present in semantic-core-vN, every peer modality can decide how to expose them: as spatial force, sonic modulation, MIDI routing, volumetric structure, haptic resistance, or textual declarations.

Sonic round-trip (current bidirectional path)¶

Sonic was the first modality to land a bidirectional path:

Python inverse in sonic_capture.py takes label-stripped ObservedVoice records (the fields a real audio analyzer could recover: role, waveform, envelope, frequency, amplitude, start offset, channel) and reconstructs a semantic-core-v0 manifest.
JS inverse in sonic_capture.js mirrors the Python module and fetches the shared ontology.json sidecar so it cannot drift.
Microphone pipeline in microphone_capture.js produces ObservedVoice records from real audio using vanilla DSP: spectral-flux onset detection, FFT peak picking for pitch, harmonic-ratio waveform classification, RMS-history envelope classification, snapping to the same pentatonic scale used by the forward projection.

The round-trip test in SonicRoundTripTestSuite asserts that the invertible subset of opcodes (those whose (role, waveform, envelope) tuple is unique in the ontology) recovers its semantic identity under core -> sonic -> observed -> captured-core. Opcodes that share a sonic signature (split/merge, attract/repel, diffuse/stabilize/resonate) are explicitly excluded from the invertible set — ambiguous-by-design rather than silently lossy.

MIDI round-trip (current bidirectional path)¶

MIDI is the second modality with an inverse path. Its observation surface is deliberately discrete: role, pitch, velocity, channel, start offset, and index. The Python inverse in midi_capture.py and the JS inverse in midi_capture.js recover semantic identity from (role, pitch) using the shared ontology.

MIDI capture is intentionally partial:

bus events are silent routing markers and are not recoverable as authoring input.
program events currently have zero base velocity, so intensity cannot be recovered.
events clipped to MIDI velocity 127 are rejected because the original intensity may have exceeded the representable range.

The MIDI round-trip tests assert that the unclipped, invertible subset recovers opcode, name, phase, channel, and approximate intensity under core -> MIDI -> observed -> captured-core.

Spatial round-trip (current bidirectional path)¶

Spatial is the third modality with an inverse path -- and the first that claims exact equivalence across every opcode. Identity is recovered from the ontology's (shape, color) tuple, which is unique for all twelve opcodes (including contain). The 2D editor surface exposes intensity, signal, phase, and channel as direct widgets rather than as perceptual features, so the inverse does not have to infer them from a measurement:

Python inverse in spatial_capture.py takes ObservedSpatialMark records and reconstructs a semantic-core-v0 manifest. Loaded entities pass their stable ent_<8hex> IDs through unchanged; freshly authored entities receive a fresh derived ID at capture time so the recovered manifest is still well-formed.
JS inverse in spatial_capture.js mirrors the Python module and fetches the shared ontology.json sidecar.
Editor pipeline in the spatial runtime preserves manifest entity IDs through drag edits, assigns ent_<8hex> IDs to palette-added entities, and exposes a Capture button that serializes the live world directly into a semantic core -- never routing through generated text.

This directly satisfies the compatibility principle's clause that "text regeneration is optional" and the design-test rule that "a capture path must not route through generated text as the only way to recover semantic identity." The spatial manifest's capability contract reports inverse: "exact" accordingly.

Long-term authoring model¶

The long-term environment should be a live polymodal workspace rather than a source editor with previews. A user should be able to edit the same program through several peer surfaces:

draw or move entities in 2D space,
hear and modify the sonic projection,
record or edit MIDI events,
manipulate a volumetric scene,
use touch, gesture, or haptic input,
inspect the semantic core as a diagnostic view,
optionally project the core back into textual Multilingual.

The shared state should be the semantic core:

semantic-core-vN
  <-> spatial runtime
  <-> sonic runtime
  <-> MIDI runtime
  <-> volumetric runtime
  <-> haptic / kinetic runtimes
  <-> textual projection

This is different from a conventional IDE. The text surface is a peer interface for programs that choose to expose one, not the sovereign representation that every other modality must imitate.

Research milestones¶

The next milestones should preserve compatibility while making the polymodal claim stronger:

Browser MIDI input wiring. The Python/JS MIDI inverse exists; the runtime still needs Web MIDI In capture that feeds observed events into it.
Stable identity. Add optional entity IDs to the semantic core and preserve them through every projection.
Richer capability enforcement. Projection manifests now declare capability metadata; future tests should use those contracts to drive exact, lossy, and view-only round-trip expectations.
Golden manifests. Store versioned fixture manifests for the semantic core and every projection, then require old fixtures to load identically or migrate explicitly.
Spatial authoring. Landed: the 2D runtime now preserves manifest entity IDs through drag edits, assigns stable IDs to palette-added entities, and ships a Capture button that serializes the live world directly into a semantic-core-v0 manifest. spatial-seed-v0 now claims inverse: "exact" -- all twelve opcodes are recoverable from the ontology's (shape, color) tuple.
Live synchronized runtime. Keep spatial, sonic, MIDI, and semantic-core views synchronized as one program state changes.
Explicit relation syntax. Add coupling and temporal-ordering relations only when the source language or authoring surfaces can express them deliberately.
Pattern recognition. Recognize stable dynamic structures such as gates, memory loops, oscillator networks, membranes, routers, attractor basins, and feedback circuits as higher-level semantic patterns.
semantic-core-v1 process calculus. Promote dynamics from hand-written per-runtime step() code into a modality-free ⟨State, Topology, Rule, Schedule⟩ tuple with rewriting as the single meta-primitive (see From structure to process above). First falsifiable target: Game of Life expressed once and projected to the spatial and sonic modalities through one shared core stepper. This is the path to universality — fractals, cellular automata, and ecosystems as fillings of the same form rather than new opcodes. Authoring landed: a v1 process program is now written in .multi, not Python — the process-core constructors (lattice_topology, rewrite_rule, synchronous_schedule, build_process_core, ...) are exposed as multilingual runtime builtins, a program assigns its ⟨State, Topology, Rule, Schedule⟩ core to a process variable (the v1 analogue of v0's seed), and process-build emits the manifest. examples/game_of_life.multi (and its French twin game_of_life.fr.multi) build a core byte-identical to, and stepping identically to, the Python reference; the two languages lower to the same core. Polymodal dynamics are no longer Python-only. Generative rewriting landed: the rewrite primitive now also expresses productions whose replacement is longer than their match, via a sequence topology and a generative schedule. examples/lindenmayer.multi is the canonical algae L-system (axiom A, A->AB, B->A) authored in .multi; it yields A, AB, ABA, ABAAB, ... (Fibonacci lengths) through the same shared stepper and classifies as Tier 3 (generative). This is the second .multi process program and the first that is not a cellular automaton — evidence the calculus generalises past lattices. L-systems are now end-to-end: the generative stepper is ported to JS, a sequence projection lays the word out as a piano-roll over the alphabet (spatial == sonic == word, exactly invertible), and a browser page animates it through the one shared stepper. Asynchronous / heterogeneous-state axis landed: an asynchronous schedule (sequential in-place update) over a multi-valued field expresses a cyclic-dominance ecology (examples/ecosystem.multi, en + fr) — Tier 1, with a value-aware field projection and a browser page, all on the project rhythm (engine Python+JS parity → .multi example → projection → browser → tests). Graph topology landed: the engine now answers adjacency from an explicit edge set (graph_topology), so the same rewrite rule that flips a lattice cell drives a node on an arbitrary network — only the topology decides who is a neighbour. examples/network_epidemic.multi (en + fr) is a discrete SIR contagion on a two-community contact graph; it classifies as Tier 4 (graph rewriting) yet its process_graph_projection round-trips exactly over the node set — a concrete demonstration that the tier (the expressive class ceiling) and the per-instance invertibility are orthogonal. A browser page renders it as a node-link diagram + chord and shows the tier in the status line via the shared classifier (tierOf) — as do the Game-of-Life, L-system and ecosystem pages, so every browser surface now names its program's expressiveness class. v0 closes the loop: a migrated (Tier-0) v0 core now projects through the v1 path too — process_static_projection places each v0 entity by the two integer fields it already carries (its index and channel) and carries the rest of the record verbatim, so spatial == sonic == the migrated core's loci and the round-trip recovers the original v0 entities exactly (Tier 0, "invertible everywhere"). So even a static v0 structure travels the one shared stepper and the one projection discipline as every dynamic program does — no special engine path. Continuous-time dynamics landed: the schedule axis is now complete. Beside the discrete schedules (synchronous, asynchronous, generative, static) sits continuous-dt, which integrates a new general rule kind — a rate rule (rate_rule), each field's time-derivative as a linear combination of the locus's own fields and the mean over its neighbours — by one explicit-Euler step next = current + dt·rate. It is the second general rule kind beside rewrite, and it names no system: diffusion is a particular set of coefficients, not an engine branch. examples/diffusion.multi (en + fr) authors the heat equation du/dt = D·(mean_nbr(u) − u) on a torus; on the mean-stencil total mass is conserved exactly while the bump relaxes, and the value-aware field projection renders it (heat field + sampled chord) exactly invertibly at Tier 1 (fixed-population continuous dynamics) — the first program to reach that tier through a non-rewrite rule, classified from the axes, not a label. A browser page shows it with the tier in the status line. (A load-bearing detail the JS/Python byte-identical guard surfaced: CPython's sum uses compensated summation, so the rate must use a naive left-fold to match the JS port bit-for-bit.) Still open on the rule/topology axes: positional / multiset match clauses, higher-order integrators (RK4) and continuous-metric (particle/boids) topologies, and structure-changing graph rewriting (creating and destroying nodes/edges), which is the genuinely view-only end of Tier 4.

The strongest near-term proof is:

Two independent authoring modalities can modify the same semantic core, and equivalence tests prove that the declared semantic identity survives.

Potential applications¶

Polymodal computation has applications wherever computation needs to be understood, authored, or controlled through more than one human sensory or cognitive channel.

Area	Why polymodal computation matters
Accessibility	Blind, deaf, motor-impaired, and neurodiverse programmers can use peer authoring surfaces instead of accessibility layers added after the fact.
Education	Learners can encounter computation as rhythm, movement, containment, propagation, resonance, and transformation before syntax.
Creative coding	One semantic program can become animation, sound, MIDI, spatial sculpture, or live performance state.
Scientific simulation	Diffusion, attraction, repulsion, stabilization, oscillation, propagation, and containment map naturally onto dynamic systems.
Robotics	Spatial, kinetic, and haptic modalities can express target seeking, obstacle avoidance, damping, gait, routing, and boundary constraints.
Music as computation	MIDI and sonic capture let executable structure be played, heard, recovered, and transformed.
Collaborative tools	Different collaborators can use different modalities while sharing one semantic program.
AI agent debugging	Goals, constraints, uncertainty, conflicts, tool calls, and plan propagation can become perceptible structures.
Digital twins	Industrial, building, transport, and distributed systems can be inspected as synchronized spatial, sonic, and semantic state.

The project should avoid framing these as visualizations of ordinary programs. The more interesting claim is that computation can have multiple native perceptual forms while keeping one semantic identity.

Open work¶

Browser MIDI input wiring. The Python and JS inverse projections now exist; the browser runtime still needs a live Web MIDI In capture button and recovered-core panel.
Authoring surfaces inside the runtimes. Sonic captures from the microphone, MIDI captures from a Web MIDI input port, and the 2D spatial runtime now edits the manifest in place and emits a captured semantic core. The linear and volumetric runtimes are still view-only.
Coupling and temporal-ordering relations. Waiting for explicit source syntax.
Glyph-only authoring. .multi still uses English-spelled primitive names. Once the polymodal split is real, replacing them with stable glyphs / numeric opcodes is incremental.
Compatibility profiles. The textual language and the polymodal profile should remain distinct so ordinary .multi programs do not inherit cross-modal obligations.
Perceptual contract enforcement. Projection metadata now marks lossy, ambiguous, and partial-inverse surfaces; tests should increasingly derive their expectations from those declarations.

Design test¶

A prototype has drifted out of polymodal computation if any of the following hold:

A "primary" modality emerges that the others must be defined relative to.
A new primitive is added in one modality without an ontology entry.
A projection adds entities that have no counterpart in the semantic core.
Cross-modal equivalence tests are weakened or skipped to ship a feature.
A polymodal feature changes the behavior of ordinary textual programs outside the polymodal profile.
A capture path routes through generated text as the only way to recover semantic identity.
A lossy or ambiguous projection claims exact equivalence.
A modality's hint vocabulary silently reuses names from another dimensionality (e.g., 1D glyphs reaching for 2D shape names), collapsing the distinction the peer projections are meant to preserve.

The standard is stricter than "multimedia support": every modality must be a peer expression of the same program.