Restructure three-pronged → knowledge-layers: collapse 11 files to 3, integrate into main architecture

- Rename 'three-pronged' folder to 'knowledge-layers' — prong metaphor was misleading (implied parallel tines), replaced with epistemic layers (deductive base, empirical middle, probabilistic oracle — vertical stack) - Collapse 11 overlapping files into 3 coherent documents: - knowledge-layers/_index.org: core framework (two engines + one store, World Model formula, 0-14 layer table, provenance store design, conflict resolution, cold-start, stage mapping) - knowledge-layers/practical-implications.org: design-world-aware-of- physics, 10 powers, Schafmeister existence proof, epistemic transparency - knowledge-layers/neurological-empirical.org: neural networks in provenance framework (kept intact) - Relocate wolfram/mathematica and Schafmeister docs to ideas/viability/ - Integrate into main architecture _index.org: - Gate: expanded from two vectors (ACL2+LLM) to three (deductive, provenance/empirical, LLM oracle) - Autodidactic loop: split into Track 1 (deductive hardening, fast) and Track 2 (empirical validation, slow, experimental-feedback-driven) - See also: added Knowledge Layers cross-reference - Add all-lisp geometry engine note (ideas/lisp-geometry-engine.org) as concrete illustration of the empirical layer's effect on design work - Rebuild site: 148 files, 0 errors
2026-06-04 19:09:44 +00:00
parent 2e8cf19f9e
commit 6e992cc0c5
92 changed files with 921 additions and 2628 deletions
--- a/projects/passepartout/architecture/knowledge-layers/_index.org
+++ b/projects/passepartout/architecture/knowledge-layers/_index.org
@@ -0,0 +1,163 @@
+:PROPERTIES:
+:CREATED:  [2026-05-24 Sun]
+:ID:       329bd4fb-702a-4a2b-9c63-69281aacb83a
+:END:
+#+title: Knowledge Layers
+#+filetags: :architecture:knowledge-layers:verification:epistemology:
+
+Passepartout's architecture for how the system knows what it knows: deductive proofs (mathematical certainty), provenance-tracked empirical models (statistical validity), and probabilistic oracle (LLM-aided guidance) — all governed by the gate.
+
+These three epistemic layers form a vertical stack: the deductive base provides formal guarantees, the empirical middle bridges mathematics to physical reality through curated data, and the probabilistic oracle generates hypotheses and interprets results within the boundaries set by the lower layers.
+
+---
+
+* Two Engines, One Store
+
+The three layers are not three parallel engines. They are two reasoning engines and one curated data store:
+
+- **The symbolic engine** handles everything that can be formalized: deductive proofs, empirical equations, validity predicates, pipeline composition, uncertainty propagation. This is one engine — it reasons about symbols using rules that are either proven (ACL2) or well-defined (force field equations). It is authoritative where it applies.
+
+- **The probabilistic oracle** (the LLM) handles everything that cannot be formalized: parameter selection, model choice, interpretation of results in natural language, failure diagnosis, creative hypothesis generation. It proposes; the symbolic engine checks. It is bounded — it cannot execute actions, only recommend them.
+
+- **The provenance store** is not an engine. It is a structured database that stores empirical parameter sets, validity envelopes, experimental benchmarks, and comparison histories. Neither engine reasons about it as a whole. The symbolic engine queries it for parameters and validity predicates. The LLM queries it for context and updates it with new data.
+
+The gate is the integration point. Every action is checked against three vectors:
+1. **Security policy** — is this action safe? (ACL2-verified gate rules)
+2. **Scientific validity** — is this model valid in this context? (provenance store query + validity envelope check)
+3. **Consistency** — do the symbolic check and the oracle's assessment agree? If the LLM's recommendation violates a validity envelope, the gate rejects.
+
+* The Knowledge Tree
+
+The layers of human knowledge, from formal foundations to empirical design, mapped to their epistemic status:
+
+| Layer | Domain | Formal status | Verification model |
+|---|---|---|---|
+| 0. Logic / Foundations | Proof theory, set theory, type theory | Deductive | Complete — provable from axioms |
+| 1. Algebra | Groups, rings, fields, vector spaces | Deductive | Complete |
+| 2. Analysis | Calculus, limits, real numbers, measure theory | Deductive | Complete (in principle; deep results are hard) |
+| 3. Geometry / Topology | Manifolds, differential forms, curvature | Deductive | Complete |
+| 4. Classical Mechanics | Lagrangian/Hamiltonian mechanics | Deductive | Complete |
+| 5. Quantum Mechanics | Hilbert spaces, operators, Schrödinger equation | Deductive | Complete |
+| 6. Statistical Mechanics | Ensembles, partition functions, entropy | Deductive | Complete |
+| 7. Electrodynamics | Maxwell's equations, potentials, radiation | Deductive | Complete |
+| 8. Quantum Chemistry | Born-Oppenheimer, DFT, coupled cluster | Partially deductive — equations formal, approximations necessary | The implementation is verifiable; the model choice is not |
+| 9. Molecular Mechanics | Force fields, potential functions | Empirical parameterization | Simulation is deterministic; parameters fitted to experiment |
+| 10. Molecular Dynamics | Integration schemes, thermostats | Deductive mechanics + empirical inputs | Integrator provable; force field parameters are not |
+| 11. Chemical Thermodynamics | Binding constants, free energy surfaces | Mixed — statistical mechanics deductive, solvation models empirical | Provenance tracked for empirical components |
+| 12. Structural Biochemistry | Protein folding, docking, enzyme kinetics | Largely empirical | Validation against experiment, not deductive proof |
+| 13. Organic Chemistry | Reaction mechanisms, functional group transformations | Empirical with formal structure | Mechanism hypotheses falsified by experiment, not proved |
+| 14. Molecular Design | Shape-programmable molecules, therapeutic targeting | Empirical design space | Design rules validated by experiment, not derived from QM |
+
+The critical transition is between layers 7 and 8. Everything below is fully formalizable — ACL2 can verify correctness against first principles. Layer 8 introduces the first irreducible approximation (Born-Oppenheimer, DFT exchange-correlation functionals). From layer 9 onward, models are empirical through and through: mathematically rigorous in their execution, but their parameters are fitted to experiment and their validity is provisional.
+
+Passepartout can verify the *computation* at every layer — that the Schrödinger equation is correctly solved, that the MD integrator preserves phase space, that the docking algorithm correctly explores conformational space. It cannot verify that the *model* matches reality. That is the domain of the empirical layer.
+
+* World Model = Verified Equations ⊗ Parameters ⊗ Validity Envelope
+
+Every computation that bridges formal mathematics and physical reality decomposes into a world model triple:
+
+**World Model = Verified Equations ⊗ Provenance-Tracked Parameters ⊗ Validity Envelope**
+
+| Component | What it is | Who handles it |
+|---|---|---|
+| Verified Equations | The formal skeleton: differential equations, integration schemes, force field functional forms | Symbolic engine — ACL2 verifies the implementation against the mathematical theory |
+| Provenance-Tracked Parameters | The numbers that make the model match reality: force constants, partial charges, solvation parameters, scoring weights | Provenance store — each carries a source (paper, dataset, calculation), confidence interval, validity regime (temperature, molecular class, solvent), and last-validation date |
+| Validity Envelope | The region of input space where the model has been experimentally validated | Gate — checked as a predicate before execution: is the current input within the model's validated range? |
+
+A force field, for example, is:
+- Bond stretching follows Hooke's law (verified equation — proven by the symbolic engine)
+- The specific spring constant for a C-C bond is 600 kcal/mol/Å² (provenance-tracked parameter — from Cornell et al. 1995, validated against 50+ small molecules)
+- The model is valid for proteins and nucleic acids in aqueous solution at 273-373K (validity envelope — checked by the gate before each simulation)
+
+The three components are inseparable. Without verified equations, the computation is untrustworthy. Without provenance-tracked parameters, the numbers are arbitrary. Without a validity envelope, the user cannot know whether the model applies to their problem.
+
+* The Provenance Store
+
+The provenance store is the infrastructure that makes the empirical layer operational. It is not a single database — it is a structured knowledge base that holds:
+
+**For traditional empirical models** (force fields, solvation equations, scoring functions):
+- The functional form of the model (e.g., AMBER ff14SB: harmonic bond + harmonic angle + Fourier torsion + LJ + Coulomb)
+- Every parameter with its source (paper, dataset, QM calculation), confidence interval, and validity regime
+- Validation history: which experimental measurements have been compared to this model, with what outcome
+- Revision history: when parameters were updated, by whom, and what changed
+
+**For neural network models** (ANI-2x, AlphaFold, learned potentials):
+- Architecture description and training hyperparameters
+- Training dataset provenance (level of theory, molecule coverage, element coverage)
+- Validation benchmarks with per-benchmark error metrics
+- Distribution summary statistics (needed for the distribution match check)
+- Domain of applicability (elements, charge ranges, molecule classes)
+
+**The gate checks:**
+
+For traditional models:
+1. Does the model support the elements/atom types in the current input? (parameter availability check)
+2. Are the conditions (temperature, pressure, solvent) within the model's validated range? (validity envelope check)
+3. Is the input within the model's training distribution? (distribution match check — primarily for neural network models)
+
+For neural network models, check 3 requires new machinery: a distribution match function that computes how similar the current input is to the model's training distribution in latent space. This is a standard technique in reliable ML (distance to training data, density estimation, conformal prediction). It integrates into the gate as a predicate: input within distribution = pass; outside distribution = flag with confidence reduction.
+
+Every check outputs pass, flag with reduced confidence, or block. The gate never silently permits a computation outside a model's validated range.
+
+* Conflict Resolution
+
+The three layers can disagree. The arbitration rules:
+
+1. **Deductive overrides both.** If the symbolic engine (ACL2) proves that a computation is formally incorrect, it is blocked regardless of what the LLM recommends or what the provenance store reports. Formal correctness is the non-negotiable base layer.
+
+2. **Empirical overrides probabilistic.** If the provenance store reports that a model's validity envelope excludes the current conditions, the LLM cannot override that judgment. The LLM may recommend a different model, or the gate may flag for human review — but it cannot proceed with the invalid model.
+
+3. **Probabilistic proposes, never executes.** The LLM recommends model selections, parameter choices, and design alternatives. Every proposal is checked against the deductive layer (formal correctness) and the empirical layer (validity envelope) before execution. The LLM cannot write a file, run a command, or send a message — it can only propose.
+
+4. **Human override is always recorded.** A user can override any layer's judgment. The override is logged to the provenance chain with the user's signature and reason. The result of an overridden computation is tagged as "human override — bypassed [layer] check" with reduced default confidence.
+
+5. **Uncertainty propagates upward.** If two empirical models disagree, the system reports both results with their respective confidence intervals and a flag: "Models disagree by 2.3 kcal/mol. Model A's uncertainty: ±0.8 kcal/mol. Model B's uncertainty: ±1.1 kcal/mol. Recommend experimental validation." The gate does not force agreement; it reports the conflict transparently.
+
+* Cold Start
+
+The provenance store must be populated with validated data before it can enforce validity envelopes. The bootstrap sequence:
+
+1. **Seed from curated sources.** Initial parameter sets from established force fields (AMBER, CHARMM, OPLS), benchmark datasets (PDBbind, COMP6), and published experimental reference data are loaded with explicit provenance tagging. Each entry is marked "unverified by this instance" but carries its original source citation.
+
+2. **LLM provides provisional parameters.** For domains where no curated data exists, the LLM proposes parameter values based on training data knowledge. These are tagged as "unvalidated — LLM-sourced" with reduced confidence and clearly marked validity envelopes.
+
+3. **Validation through use.** Every time the system runs a computation and receives experimental feedback (or the user provides a measurement), the comparison is recorded. Disagreements between prediction and measurement trigger parameter updates. Over hundreds of comparisons, the provenance store's confidence intervals tighten.
+
+4. **Community amplification (Stage 1+).** Through the social protocol, instances share validated parameter sets with provenance chains. A force field validated by one instance for ethanol and another for DMSO accumulates a broader validity envelope than either alone. The network effect compounds the cold-start investment.
+
+The cold start never reaches the same confidence as a mature instance with years of experimental feedback. But even a seeded provenance store with provisional parameters is strictly better than a system with no provenance — because the provisional parameters are explicitly tagged as provisional, and the user can see the confidence for every result rather than trusting a single unmarked number.
+
+* Mapping to Stages
+
+The knowledge-layers infrastructure is staged, not all-at-once:
+
+- **Stage 0 (current).** The probabilistic oracle exists (the LLM). The provenance store does not. The deductive engine partially exists through Hermes skills (symbolic gate rules as Python, not ACL2). The empirical layer is invisible — the LLM reasons about chemistry, biology, and engineering using training data alone, without systematic provenance.
+
+- **Stage 1 (social protocol).** The provenance store prototype can be introduced as a side effect of signed messages and data exchange. When instances share a validated parameter set, the message carries a signature and source. The receiving instance stores it with provenance. Natural crawl before full infrastructure.
+
+- **Stage 2 (gate as software).** The provenance store becomes operational infrastructure. The gate checks scientific validity alongside security policy. The provenance store integrates with the Knowledge subsystem as a structured data store — the symbolic index holds formal facts; the provenance store holds empirical parameters. Same storage mechanism, different data type.
+
+- **Stage 3 (Lisp machine).** The symbolic engine is native in one address space. ACL2 runs at hardware level. The provenance store becomes a native Lisp hash table with persistence. The gate checks validity predicates in the evaluation loop itself. The LLM proposes model selections; every proposal is verified against the provenance store before execution. All three layers in one address space.
+
+- **Stage 4+ (in-process inference).** The LLM moves in-process. All three components share one address space. No IPC between them. The query cycle is: LLM proposes → symbolic engine checks against provenance store → if valid, execute → if invalid, return to LLM with diagnostic. This loop runs at native speed.
+
+* What This Changes in the Architecture
+
+The knowledge-layers model adds a dimension to the existing architecture that was only implicit before:
+
+1. **The gate gets a third vector.** Previously the gate checked security (is this action safe?) and, through its ACL2 verification, mathematical correctness. Now it also checks scientific validity (is this model valid in this context?). The mechanism is the same — a policy evaluated before the computation proceeds — but the policy now includes predicates over empirical model applicability, not just safety and formal correctness.
+
+2. **The autodidactic loop gets two speeds.** The fast loop (deductive — generate code, prove it, hot-reload) runs autonomously at LLM speed. The slow loop (empirical — make prediction, get experimental data, update parameters) requires real-world feedback and cannot run without it. Both are essential. The fast loop makes the system mathematically powerful; the slow loop makes it useful for real-world science and engineering.
+
+3. **The provenance store is a new data type in the Knowledge subsystem.** It is neither the symbolic index (formal facts) nor the neural index (embedding vectors). It is a third index: parametric, uncertain, provisional — but no less essential for its lack of deductive certainty.
+
+4. **The gate becomes a configurable integrity layer.** Security, scientific validity, ethical constraints, legal constraints, economic constraints — all expressed as predicates over the computation's inputs, models, and parameters. Users, institutions, or jurisdictions can configure different policies without changing anything else in the system. Compliance becomes configuration.
+
+---
+
+See also:
+- [[id:971cd9e7-2cc5-4743-8042-2469dbe4078f][Lisp Foundation]] — the prover bootstrapping path that enables the deductive layer
+- [[id:1c3ec48b-446c-50d2-b53e-126a81f5143f][Architecture]] — the gate, subsystems, and staged progression
+- [[id:4b5c6d7e-8f9a-0b1c-2d3e-4f5a6b7c8d9e][Neural Networks in the Empirical Layer]] — how trained models fit into the provenance framework
+- knowledge-layers/practical-implications.org — concrete consequences for design, safety, regulation, and trust
+- [[id:f4e5d6c7-b8a9-0c1d-2e3f-4a5b6c7d8e9f][Schafmeister and Clasp]] — existence proof: Lisp in computational nanotechnology