memex: update AGENTS.md, add passepartout design-decisions notes, SWOT + agora notes, bump submodules → v0.8.1

2026-05-10 07:11:08 -04:00
parent 04944a62e2
commit e719443ce7
6 changed files with 1566 additions and 3 deletions
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -12,7 +12,7 @@
   a. Write the test first → tangle → run → prove it FAILS (RED)
   b. Write the implementation → tangle → run → prove it PASSES (GREEN)
   c. Record both failure and success output
-5. **Reflect in org** — once tests pass, ensure the implementation is in the .org source
+5. **Reflect in org** — once tests pass, ensure the implementation is in the .org source, put each function in a separate code block.
 6. **Update literate prose** — write/update the explanatory text around the code:
   what it does, why it exists, how it connects to the rest of the system
 7. **Mark the origin TODO DONE** — in `docs/ROADMAP.org`, change the
--- a/notes/passepartout-SWOT.org
+++ b/notes/passepartout-SWOT.org
@@ -0,0 +1,868 @@
 #+TITLE: Passepartout Neurosymbolic + Agora Integration — SWOT Analysis
 #+AUTHOR: Agent
 #+FILETAGS: :notes:analysis:swot:passepartout:agora:neurosymbolic:
 #+CREATED: [2026-05-09 Sat]
 * Premise and Scope
 This analysis assumes the engineering is possible — Screamer can be wrapped,
 VivaceGraph can persist facts, ACL2 can verify structural properties, the
 archivist can extract triples from prose with Screamer verification, and the
 note-publishing bridge to Agora can be implemented. The question is not "can it
 be built?" but "does the architecture cohere? What does it enable? What does it
 miss?"
 * Will It Work Conceptually?
 The short answer: yes, within a specific domain. The long answer: the boundary of
 that domain is the most important thing to get right.
 ** The architecture's core insight is correct and load-bearing
 The central design decision — "the LLM proposes; the symbolic engine decides
 whether to accept" — is sound. It is the inverse of every existing agent
 architecture. Claude Code, OpenCode, Hermes — all of them put the LLM in the
 driver's seat and add safety as an afterthought (prompt-based guardrails that
 consume tokens and can be evaded). Passepartout inverts this: the LLM proposes
 actions and facts, but a deterministic layer of gates, constraint solvers, and
 formal verifiers decides what to admit and what to execute. This inversion is the
 correct response to the hallucination problem. You cannot eliminate hallucination
 by making the LLM better. You eliminate it by not asking the LLM to do things
 that require certainty.
 The bootstrap mechanism — extracting 50-70 entity classes mechanically from the
 existing Dispatcher gate stack with zero new code — is genuinely elegant. It
 proves the pattern at minimal cost: code becomes facts, facts enable reasoning.
 Every new gate pattern adds to the ontology organically. This is the right way to
 start a knowledge base: not by designing a schema upfront, but by formalizing what
 the system already knows implicitly.
 ** The "one memex, two indices" architecture survives contact with reality
 Option 4 (one memex with neural and symbolic indices over the same Org files) is
 the correct long-term architecture. The prose is the ground truth — always. The
 symbolic index is a derived view that can be thrown away and rebuilt. The neural
 index handles semantic search, associative leaps, and fuzzy matching. This
 division of labor is permanent, not transitional, because the domains they serve
 are fundamentally different kinds of knowledge.
 The practical path — starting with Option 5 (ephemeral facts, no persistence)
 through Phases 1-4, then graduating to Option 4 with VivaceGraph persistence in
 Phase 5 — is the right sequence. It punts the serialization format problem until
 the fact language has been battle-tested. It keeps the cost of mistakes low. It
 treats the ontology as something discovered through use rather than designed
 upfront.
 ** Wikipedia's ontology WOULD give it a running start — with caveats
 Wikidata contains approximately 100 million entities with a decade of human
 curation: type hierarchies, relations, dates, citations, disambiguation. For a
 personal memex that mentions Nabokov, /Pale Fire/, Kafka, postmodernism, and
 butterfly migration, the gate stack's 50-70 entity classes is starvation.
 Organic growth through prose extraction would take years to cover the entities in
 one person's engagement with a single novel.
 Loading Wikidata's entity graph into the symbolic index transforms the
 archivist's job from "discover that Nabokov wrote /Pale Fire/" to "connect your
 heading to Wikidata entity Q36591." The second task is reference resolution, not
 knowledge extraction — simpler, more reliable, and in many cases doable without
 an LLM at all (string match against loaded entities). The notes claim this
 collapses the LLM's role to three thin boundaries: input translation, prose-to-
 candidate-triple for personal content, and result-to-prose formatting.
 The caveats are real:
 - Entity resolution (matching prose mentions to Wikidata entities) is genuinely
  hard. "Nabokov" in a diary might refer to Vladimir Nabokov (Q36591), his son
  Dmitri (Q566744), or someone else entirely. Disambiguation requires context
  that the symbolic engine doesn't have without LLM assistance.
 - Wikidata is biased toward English Wikipedia's coverage. A memex in Arabic,
  Farsi, or Amharic will find far fewer resolved entities. The "universal" in
  Wikidata is aspirational, not actual.
 - Wikidata's property graph is not a ontology in the formal sense — it's a
  collaboratively edited dataset with contradictions, gaps, and editorial wars
  frozen in time. Loading it directly into a symbolic index that expects
  consistency (Screamer checks, cardinality policies) will surface thousands of
  contradictions on ingest, many of which are Wikidata artifacts, not meaningful
  tensions.
 - N-hop expansion is unbounded. One hop from Nabokov hits hundreds of entities
  (his works, his family, his influences, his translators). Two hops hits
  thousands. Three hops hits tens of thousands. The notes say "3-4 hops" for a
  literary memex but don't estimate the entity count this implies. The claim that
  5 million entities = ~400MB is the best-case hash-table figure; a graph with
  query indices will be larger, and Prolog-like queries over millions of nodes
  are not free.
 Still: even a partial Wikidata load with conservative hop limits would provide
 more ontology than the system could accumulate through years of organic growth.
 It is the right accelerator, and the architecture handles it correctly — Wikidata
 facts are admitted with =:provenance :wikidata= and =:policy :plural=, meaning
 they sit alongside personal facts without overriding them. Disagreements are
 surfaced, not resolved. The architecture treats Wikidata as evidence from an
 external source, not as ground truth. That's the correct posture.
 ** Cardinality policies are the right abstraction for contradiction
 The =:singular= / =:dual= / =:plural= cardinality model is one of the most
 important ideas in these notes. Classical logic requires consistency — a
 contradiction implies everything (ex contradictione quodlibet). A constraint
 solver like Screamer also requires consistency — a contradictory constraint set
 has no solutions. But a personal memex operates across domains where the meaning
 of contradiction is fundamentally different:
 - "rm -rf / is catastrophic" is =:singular= — there is one truth that evolves
  over time.
 - "I loved this person AND I resented them" is =:dual= — the tension IS the
  fact.
 - "Wikidata says Everest is 8848m; DBpedia says 8849m; my 2023 diary says
  8848m" is =:plural= — multiple sources disagree, and surfacing the disagreement
  with provenance is the product.
 This is a genuinely novel contribution to knowledge representation. Most
 knowledge graphs (Wikidata, Freebase, DBpedia) don't model contradiction at all —
 they pick one value and discard the rest. Most constraint solvers reject
 contradiction as error. Passepartout's cardinality model makes contradiction a
 first-class citizen: you can query the fact that "I used to believe X until
 Tuesday, then Y," or "these three sources disagree on height," or "I hold these
 two positions in tension." The symbolic engine's job is not to decide which is
 right. It is to surface the tension with provenance.
 This alone, if implemented correctly, would be a category-level advance over
 every existing personal knowledge management tool.
 ** Ontology versioning is the right approach to the migration problem
 Every knowledge base eventually faces schema migration — you split =:secret-file=
 into =:crypto-secret= and =:plaintext-secret=, and now every deduction that
 crossed the old category boundary is suspect. The standard approach is batch
 UPDATE operations that overwrite the past. Passepartout's approach — the category
 hierarchy itself is a Merkle tree, every fact stores the =:ontology-version= at
 assertion time, category changes trigger re-verification rather than remapping —
 preserves all worldviews. You can query "what did I believe about secrets before
 I refined my security model?" This is not querying a fact. It is querying the
 history of your own thinking.
 This is the kind of capability that no existing tool provides, and it flows
 directly from the architecture. If the Merkle DAG infrastructure exists (it does,
 from v0.2.0), ontology versioning is ~40 lines on top of it. The conceptual
 design is sound. The engineering appears tractable.
 * SWOT Analysis
 ** Strengths
 *** Architectural inversion — proposer vs decider
 The LLM proposes. The symbolic engine decides. This is the inverse of every
 existing agent architecture, and it solves the hallucination problem at the
 architectural level rather than the prompt-engineering level. No amount of
 prompt refinement can make a probabilistic system deterministic. But a
 deterministic admission gate can make a probabilistic proposer safe.
 *** Unified container format (Org files)
 Org files serve as the container for human prose, Lisp source code, symbolic
 facts, and Agora Notes. One format, one toolchain, one Merkle tree, one version
 control system. If Passepartout stops existing, the data survives in plain text.
 This is the hardest commitment in the design and the most undervalued. Most agent
 architectures store memory in JSONL transcripts, vector databases, or proprietary
 formats — opaque to the human and dependent on the tool. Passepartout's memory
 IS the human's memory, in the human's format.
 *** Provenance as product
 Every fact carries =:grounding= (the specific Org heading), =:provenance= (who
 or what produced it), =:timestamp=, =:referenced-by=, =:contradicted-by=,
 =:superseded-by=. The =/audit= command renders the full provenance chain. In the
 broader memex, the value is not the verified fact ("this command is safe"). It
 is the provenance itself: "this claim originated in that diary entry, has been
 referenced 7 times across 4 projects, was contradicted 6 months later, and was
 revised 3 weeks after that." This is a memory prosthesis that makes your own mind
 legible to you.
 *** Gate-to-fact bootstrap — ontology from existing code
 The existing Dispatcher gate stack encodes an implicit ontology (categories of
 secrets, destructive commands, trusted domains, core files). The bootstrap
 extracts this mechanically — zero LLM tokens, zero human authoring, ~30 lines of
 Lisp. This proves the pattern and provides the seed ontology without any new
 infrastructure. Every new gate pattern added by the human (HITL approvals that
 become rules) extends the ontology automatically.
 *** Self-preservation architecture
 The Third Law implementation — quarantine on skill failure, degraded-mode
 signaling, resource monitoring, external watchdog, refusal to self-terminate —
 is individually small (~20-50 lines each) and collectively transforms
 self-preservation from a passive architectural property into an active behavior.
 The key insight: the biggest gap is not that these mechanisms are hard. It is
 that degradation is currently silent. Making it visible is cheap and high-impact.
 *** Cardinality policies as a solution to contradiction
 The =:singular= / =:dual= / =:plural= model is novel in knowledge representation
 and directly addresses the hardest problem in a personal memex: that
 contradiction is the product, not the error. Bayesian knowledge bases, graph
 databases, and triple stores all struggle with contradiction. Passepartout's
 model makes it a feature.
 *** Organic ontology growth
 Categories emerge from the system's own operation: gate patterns → gate outcomes
 → Screamer generalizations → archivist proposals → cross-domain overlap
 detection. The ontology is a garden, not a building. This avoids the Principia
 Mathematica problem — the need to define everything upfront — by replacing
 axiomatic design with evolutionary growth. Categories that aren't used fade.
 Categories that are contradictory are pruned. Categories that emerge from
 overlapping domains are promoted. The system converges on useful granularity
 through use.
 *** Agora as provenance layer for networked knowledge
 A BFT-timestamped triple store is one approach, but the Merkle DAG + DID
 signatures provide a lighter-weight alternative: every fact's provenance is
 content-addressed, every author's identity is cryptographically verifiable, and
 the DAG structure enables partial replication without consensus. This is more
 tractable than full BFT and sufficient for a personal memex that needs to share
 facts across a network.
 *** Decoupling of compute cost from knowledge base size
 LLM tokens are minimized by design — deterministic gates cost 0 tokens, sparse-
 tree rendering keeps context at 2,000-4,000 tokens, Screamer deductions cost 0
 tokens. Adding 5 million Wikidata entities does not add a single token to any LLM
 call. The variables that actually degrade performance — context window size, LLM
 call frequency, Screamer deduction budget — are all bounded independently of
 knowledge base size. This is a structural property: the education is local, only
 the brain costs.
 ** Weaknesses
 *** The fact language is unproven and may be insufficient
 Triples — =(:entity :relation :value)= with provenance and grounding — is the
 current hypothesis. It is simple enough to be parseable, expressive enough to
 capture the gate stack's implicit claims, and extensible enough that Screamer can
 operate on it. But:
 - Triples cannot naturally express temporal relations. "Was X before Y?" requires
  reification (making the relation itself an entity), which makes queries
  exponentially more complex.
 - Triples cannot express modal claims. "Should not do X unless Y" has no natural
  triple representation. Neither does "could have done X but chose Y."
 - Triples cannot express counterfactuals. "If X had happened, Y would have
  followed." These are essential for the "what if" reasoning that a personal
  memex should support.
 - Triples struggle with n-ary relations. "Nabokov wrote Pale Fire in 1962 while
  living in Montreux" is a 4-ary relation (author, work, date, location), not a
  set of independent binary relations. Breaking it into triples loses the
  connection that binds them.
 - Triples cannot express negation cleanly. "Nabokov did NOT write Doctor Zhivago"
  requires a negative fact, which in a triple store with an open-world assumption
  means "not known" and "known not" are conflated.
 The notes acknowledge this limitation but defer it. The right granularity
 "depends on what queries the planner actually needs to make, and that cannot be
 known in advance." This is honest but unsatisfying. If triples prove insufficient,
 the entire fact store, the Screamer integration, the VivaceGraph persistence, and
 the archivist's extraction format must be redesigned. The architecture has no
 intermediate fallback between "triples" and "something more expressive."
 *** Screamer as admission gate is untested at this scale
 Screamer is a constraint solver with non-deterministic backtracking. Using it
 to check a candidate triple against an existing fact store is conceptually
 elegant: express the fact store as constraint variables, assert the candidate,
 check solvability. But:
 - Screamer was designed for constraint satisfaction problems with tens to
  hundreds of variables. A fact store with millions of triples (after Wikidata
  loading) is a constraint space orders of magnitude larger than Screamer's
  design envelope.
 - The consistency check is domain-scoped (only rules from the candidate's
  =:domain= apply), but cross-domain contradictions are the most valuable kind.
  "Nabokov was born in 1899" (literature domain) should be consistent with
  "Nabokov died in 1977" (history domain). If these are separate domains, the
  check misses contradictions; if they are unified, the constraint space
  explodes.
 - Screamer's non-deterministic backtracking is worst-case exponential. The notes
  bound this via deduction budget (=SCREAMER_DEDUCTION_BUDGET_MS=) but don't
  address the admission check itself, which runs on every assertion.
 There is a risk that Screamer works beautifully for the gate-bootstrapped seed
 (50-70 entity classes, ~200 facts) and becomes unusably slow after Wikidata
 loading (millions of facts). The transition from "works" to "doesn't" may be
 gradual and hard to detect — the system gets slower but doesn't crash,
 degrading user experience without a clear diagnostic.
 *** The "flip" from lossy to deterministic is underspecified
 The architecture's central narrative arc is the "flip": at some point, the non-
 lossy facts constitute a sufficient foundation that the symbolic engine can
 reverse the flow — instead of LLM extraction, the symbolic engine reads prose
 through its own lens and deduces facts directly. The sufficiency metric
 (non-lossy / total > 0.7) makes this "computable and visible to the user."
 But:
 - The threshold (0.7) is arbitrary. It is not derived from empirical measurement,
  information theory, or constraint satisfaction theory. It is a guess.
 - Sufficiency is domain-specific, not global. The gate stack may have 0.95
  coverage of security classifications but 0.05 coverage of literary analysis.
  A global threshold of 0.7 hides the domains where the symbolic engine is still
  effectively blind.
 - The "flip" operation itself is not defined. "Screamer reads prose through its
  own lens" — Screamer does not read prose. It operates on structured facts.
  Either the archivist still extracts triples (which is LLM work), or some new
  mechanism parses prose into triples deterministically (which is NLP at a level
  that does not exist in open-source Lisp).
 - Even after the flip, facts from the pre-flip period carry =:provenance
  :llm-proposed= and are therefore suspect. The pre-flip facts were admitted
  against fewer non-lossy facts, meaning Screamer's consistency checks were
  weaker. A fact admitted during the seed phase may be wrong but undetected
  because there were no contradicting facts at the time. Re-verifying all pre-
  flip facts against the current fact store is described as a heartbeat task but
  the cost (millions of Screamer checks) is not estimated.
 The flip is a beautiful narrative. It may also be a mirage — the system may
 achieve high sufficiency in narrow domains (security, filesystem, coding) and
 never approach it in the broader memex (literature, personal reflection, daily
 life). If the broader memex is the use case, the flip may never happen.
 *** The archivist's extraction cost is unaccounted
 The archivist calls the LLM to extract triples from prose, with "a minimal prompt
 (~200 tokens)." Over a personal memex with thousands of entries — a decade of
 diary entries, hundreds of literature notes, dozens of project logs — the
 extraction cost is substantial.
 Assume 5,000 headings, 200 tokens per heading prompt, and an LLM that returns
 ~100 tokens of structured triples per heading. That's 1.5 million tokens for the
 initial extraction, plus verification tokens (Screamer checks cost 0 LLM tokens,
 but incorrect proposals generate feedback that may trigger re-extraction). At
 current API prices (~$0.15 per million input tokens for GPT-4o-mini), the cost
 is modest (~$0.25). But at scale — re-extraction after ontology changes,
 continuous extraction as new content is added, extraction for all incoming Agora
 Notes — the cost accumulates.
 More importantly, the extraction latency is human-noticeable. 5,000 headings at
 1 second per LLM call is ~1.4 hours of extraction time. The system needs to
 either batch-extract on startup (making cold starts slow) or extract lazily on
 first query (making first queries slow). Neither is ideal.
 The notes trumpet the token savings from deterministic gates and Screamer
 deductions (valid — those cost 0 tokens) but the archivist's extraction cost is
 the system's single largest recurring LLM expense, and it is mentioned only in
 passing.
 *** The Agora integration is clean in theory, undefined in practice
 The "Passepartout IS the PDS" claim is elegant: the =memory-object= struct IS
 the Note format, the Merkle DAG IS the Key Event Log, the fact store IS the
 reputation system. But:
 - An Agora PDS needs to serve HTTP APIs for thin clients. The daemon speaks a
  framed TCP protocol over a local port. Extending it to serve HTTPS with
  DIDComm endpoints, subscription management, and Relay push/pull is a
  substantial engineering effort.
 - The PDS needs to manage encrypted storage — client-side encrypted content that
  the PDS itself cannot read. Passepartout's vault stores credentials with
  integrity hashes but does not currently manage per-Note encryption with
  audience-specific keys.
 - The Relay Network is described as an intelligent communication backbone with
  pub/sub routing. Passepartout has no Relay implementation, no Relay-facing API,
  and no subscription management beyond its own event orchestrator.
 - Agora's contract system (SCAL contracts, HODL invoices, arbitration tiers)
  requires state machines and Lightning Network integration that Passepartout
  has no primitives for.
 - The "Passepartout IS the PDS" vision conflates two things: the data model
  (Org files = Notes) and the infrastructure (a process that serves a network
  protocol). The data model unification is clean and right. The infrastructure
  unification implies Passepartout grows from a local agent to a network server
  — a significant architectural expansion that the notes treat as a ~40-line
  utility.
 *** No adversarial model
 The notes describe layered authentication (crypto, sensory, deterministic,
 probabilistic) and type-level gates as structural safety. They do not describe
 an adversarial model:
 - What stops a malicious Agora Note from containing 100,000 triples that flood
  the fact store?
 - What stops a DID from publishing Notes that deliberately inject contradictions
  to force Screamer into exponential backtracking?
 - What stops a compromised sensor key from signing valid sensor data that is
  adversarially crafted (e.g., video frames designed to trigger specific vision
  model false positives)?
 - What stops a spam DID from creating millions of Personas and flooding the
  user's incoming Notes directory?
 The resource monitor (Phase 1a) handles storage pressure generically. The
 quarantine system handles individual DIDs flagged for spam. But none of these
 are adversary-aware — they react to symptoms (disk full, error rate high) rather
 than anticipating attack patterns. An adversarial model would identify these
 vectors and design mitigations specifically. The notes describe a system that
 works in a cooperative environment, not an adversarial one.
 *** The self-repair criterion creates a two-tier architecture
 The AGENTS.md rule — "default: everything is a skill" — means the symbolic
 engine (Screamer, VivaceGraph, fact store, archivist, ACL2, planner) is all
 skills, not core. This is correct for the self-repair criterion: a corrupted
 skill degrades the agent but doesn't kill it. A corrupted core file kills the
 brainstem.
 But it creates a tension: the symbolic engine IS the reasoning layer that would
 diagnose and repair a corrupted skill. If the fact store itself is corrupted
 (impossible facts, inconsistent cardinality, broken Merkle chains), the engine
 that detects corruption is the engine that is corrupted. The system needs a
 "repair from below" path — a minimal core that can purge and rebuild the symbolic
 index without depending on the symbolic index. This path exists (the fact store
 is ephemeral in Phase 1-4 and rebuildable from prose in Phase 5+) but is not
 exercised automatically. A corruption in the symbolic engine requires human
 detection and manual rebuild — the exact problem the self-repair criterion was
 designed to avoid.
 ** Opportunities
 *** A memory prosthesis that makes your own mind legible
 The symbolic index, when populated and queried, answers questions that no
 existing tool can:
 - "What did I believe about monorepos in 2023, and how has that changed?"
 - "Which of my diary entries contradict each other?"
 - "What entities in my memex have no connection to any other entity?"
 - "Show me everything I've written about Nabokov, organized by when I wrote it,
  what I was reading at the time, and what I concluded."
 - "Which of my project plans reference security assumptions that I later changed?"
 - "What did I think about this topic, and why did I change my mind?"
 These are not information retrieval queries. They are self-knowledge queries.
 They require provenance chains, temporal versioning, contradiction surfacing, and
 cross-domain linkage — all of which the architecture provides as first-class
 capabilities. If this works, it transforms the memex from a searchable archive
 into a thinking partner that knows the history of your thoughts.
 *** Deterministic reasoning as a moat
 Every competitor agent system (Claude Code, OpenCode, OpenClaw, Hermes, Cognee,
 Mem0) uses neural-only reasoning. They are all vulnerable to the same failure
 mode: the LLM hallucinates a fact or an action, and there is no second system to
 catch it. Their safety is heuristic. Their memory is flat. Their reasoning is
 unprovable.
 Passepartout's architectural bet — a symbolic engine that verifies, deduces, and
 audits — creates a category difference, not a performance difference. If the bet
 pays off, Passepartout is not "a better AI agent." It is a different kind of
 system — one whose reasoning is provable, whose memory is content-addressed, and
 whose knowledge accumulates through deduction rather than re-prompting.
 This is a genuine moat. It cannot be replicated by adding a better system prompt
 or a larger context window. It requires building the ontology, the constraint
 solver, the fact store, and the provenance tracker — work that takes years and
 cannot be shortcut by spending more on inference.
 *** Agora as the first sovereign agent network
 If Passepartout serves as the PDS and an Agora Persona, then AI agents can:
 - Publish verified outputs as signed Notes with cryptographic provenance.
  Readers know the agent produced the output, not a human impersonating the
  agent.
 - Accept invocation Notes from other persona owners. "Please analyze this
  contract and publish your findings." The agent receives the request as an
  Agora Note, processes it, signs the response, and publishes it.
 - Build reputation through auditable chains of signed work products, not through
  self-reported claims.
 - Participate in the compute marketplace as both consumer and provider.
 - Maintain sovereign identity — the agent's DID is independent of any platform,
  any provider, any human account.
 This is not a chatbot on a messaging platform. It is an autonomous entity on a
 decentralized network, with cryptographic identity, verifiable provenance, and
 economic agency. If Agora reaches even Order 1 (the first 1,000 users),
 Passepartout agents become some of the most capable participants on the network.
 *** The 10-80-10 ratio for coding is genuinely achievable
 For a coding agent — the domain that Passepartout currently operates in — the
 10-80-10 ratio is plausible. The existing Dispatcher already verifies every
 action deterministically. Adding Screamer for consistency checking, VivaceGraph
 for dependency queries, and ACL2 for structural verification would shift the
 ratio from the current ~95-5-0 (neural-gate-symbolic) toward 50-40-10 in the
 near term and potentially 10-80-10 in the long term.
 The bootstrapped gate facts already cover file classifications, command safety,
 path protections, and tool permissions — the core categories for a coding agent.
 The archivist's extraction from project files would add dependency information,
 test coverage, and code structure facts. The planner could reason about
 refactoring order, dependency chains, and safety constraints deterministically.
 This is the domain where the symbolic engine provides the most immediate value,
 and it is the domain Passepartout already operates in.
 *** Wikidata as an entity backbone unlocks cross-domain reasoning
 Without Wikidata, the symbolic index for a general-knowledge memex is a sparse
 set of personal facts with no connecting structure. With Wikidata, the entity
 graph is pre-structured. The system can answer:
 - "What does my memex say about Nabokov that Wikidata doesn't?"
 - "Where does my memex disagree with Wikidata?"
 - "What entities in my memex have no Wikidata counterpart?" (These are the
  personal, novel, or subjective entities that are the most valuable.)
 - "Show me the intersection of my literary interests (from diary) with Wikidata's
  influence graph — which authors I read influenced each other in ways I haven't
  written about?"
 These are cross-domain queries that require both the personal memex (for what
 the user knows) and Wikidata (for what the world knows). Neither alone can
 answer them. Together, they enable a kind of knowledge synthesis that no existing
 tool provides.
 *** Ontology versioning enables "what-if" reasoning about one's own thinking
 The ability to query across worldviews — "what did I believe before I changed my
 security model?" — is a capability that has no analog in any existing tool. It
 transforms the memex from a static archive into a dynamic record of intellectual
 evolution. Combined with the temporal awareness system (Phase 0c), the system
 could surface correlations: "You changed your mind about monorepos two weeks
 after reading this article, which you bookmarked on this date, and one week
 before starting this project that uses a monorepo structure." The provenance
 chain IS the narrative of your thinking.
 *** Contract-level pre-arbitration reduces the cost of decentralized commerce
 Agora's Tier 0 Arbitrator — a local AI that provides evidence summaries before
 human arbitration — is a genuinely useful role for a neurosymbolic system.
 - "Contract CID X references arbitrator DID Y. DID Y is active. Verified."
 - "All parties have signed. The HODL invoice is locked. Verified."
 - "The buyer's claim of non-delivery is supported by 3 signed messages with
  timestamps after the delivery deadline."
 - "The seller's proof-of-delivery field is empty. No QR scan recorded."
 Each check is a Screamer query against the contract-lifecycle domain. The results
 are a plist, not a ruling. Both parties see the same evidence summary before
 escalating. This makes Level 1 arbitration faster (arbitrators receive
 pre-processed evidence bundles), cheaper (no human time spent on trivial
 verification), and more transparent (both parties see the same machine-generated
 summary).
 This is not AI judging. This is AI preparing the docket. The distinction is
 important and defensible.
 *** Self-auditing agents could transform AI safety discourse
 If Passepartout can answer =/audit= for any action or fact — showing the full
 provenance chain, every gate that approved it, every fact that supported it,
 every alternative that was considered — then AI safety moves from "trust us, we
 tested it" to "here is the audit trail, verify it yourself."
 This is the transparency that every AI safety framework calls for and none
 delivers. It is possible because the architecture records provenance as a
 first-class operation, not as an after-the-fact log. The provenance is the
 operating system, not a logging layer.
 *** The memex + Agora combination could be a new kind of social network
 Current social networks (Twitter, Facebook, Reddit) separate the person from
 their knowledge. You are a profile with posts. Your posts are isolated units
 without connection to your broader intellectual life.
 A Passepartout-powered Agora Persona would publish Notes that are grounded in
 the memex: "Here is my analysis of /Pale Fire/, drawn from diary entries across
 three years, annotated with Wikidata context, and verified against my existing
 literary framework." The Note is cryptographically signed, carrying provenance
 back to the specific Org headings that informed it. Readers see not just the
 conclusion but the intellectual scaffolding that produced it.
 This is not a "post." It is a publication — a knowledge artifact with verifiable
 provenance, auditable reasoning, and cryptographic identity. If this becomes the
 norm, it raises the standard for public discourse from "this is my opinion" to
 "this is my opinion, here is the evidence, here is how it evolved, here is who
 verified it."
 ** Threats
 *** The ontology problem may be harder than anticipated
 The notes are honest about this: "Whitehead's Principia Mathematica took over
 300 pages to define the logical foundations before it could prove that 1+1=2."
 Passepartout's domain is narrower (coding + personal knowledge) but the
 ontology problem is the same category of problem. Every entity class must be
 defined. Every relation must have clear semantics. Every inference rule must be
 justified.
 The gate-to-fact bootstrap provides 50-70 entity classes — enough for a coding
 agent. But the broader memex contains orders of magnitude more entity types:
 people, places, works, concepts, events, emotions, aesthetic judgments,
 professional skills, personal projects, temporal patterns. Defining these as
 triples with clear semantics is genuine intellectual work that no amount of
 engineering can shortcut.
 The risk is not that it's impossible. It's that it's slow — slow enough that
 the system never achieves the density of facts needed for the "flip" in the
 broader memex. The coding domain may reach sufficiency in months. The literary
 domain may take years. The daily-reflection domain may never cross the
 threshold because the facts involved (mood, insight, aesthetic experience) are
 not formalizable as triples.
 *** Screamer may not scale to the fact store size
 The constraint satisfaction approach to consistency checking is elegant for a
 seed fact set of hundreds of triples. It is unproven for millions of triples
 (after Wikidata loading + years of personal extraction). The domain-scoping
 strategy (Screamer only checks facts from the candidate's =:domain=) bounds the
 constraint space, but the most valuable consistency checks are cross-domain:
 - "You classified this file as public in your project notes but the gate stack
  classifies it as secret." (project domain vs security domain)
 - "You wrote that Nabokov influenced Kafka, but Wikidata says Kafka died before
  Nabokov published his first novel." (literature domain vs Wikidata domain)
 - "You planned to use this dependency, but the dependency's license changed in
  a way that conflicts with your project's license." (project domain vs legal
  domain)
 If cross-domain checks are disabled for performance, the most valuable
 contradictions are never detected. If they are enabled, the constraint space
 explodes. There is no obvious sweet spot.
 *** Wikidata quality may undermine trust in the symbolic index
 If Wikidata facts are admitted with =:policy :plural= and the user sees
 thousands of contradictions between Wikidata and their personal memex, the
 symbolic index may feel less trustworthy, not more. "Wikidata says Mount Everest
 is 8848m. DBpedia says 8849m. Your 2023 diary says 8848m. These three sources
 disagree on height." This is correct behavior — surfacing disagreement with
 provenance — but it may be overwhelming. The user wanted a knowledge base, not
 a disagreement engine.
 The trust problem is compounded by Wikidata's editorial biases. Wikidata
 reflects the biases of Wikipedia editors: English-language dominance, Western
 epistemological frameworks, systemic underrepresentation of non-Western
 knowledge. A memex in Arabic that references Islamic philosophy, Egyptian
 history, or African literature will find Wikidata's coverage thin, biased, or
 absent. The symbolic index would dutifully surface these gaps — "your memex
 mentions 47 entities with no Wikidata counterpart" — but it cannot fill them.
 *** LLM cost and latency may prevent the archivist from keeping up
 If the user writes a diary entry every day, the archivist must extract triples
 from each new heading. If the extraction takes 1-3 seconds per heading, it's
 background noise. But if the user imports 500 old diary entries, or the
 archivist needs to re-extract after an ontology change, or Agora Notes arrive in
 bulk from multiple follows, the extraction queue grows faster than it drains.
 The notes describe extraction as a background task triggered by heartbeat, but
 they don't specify the extraction rate limit. An unbounded queue with no rate
 limit would consume the LLM budget. A bounded queue would fall behind. A lazy
 extraction strategy (extract on first query) would make first queries slow.
 A batch extraction on startup would make cold starts slow.
 The archivist's throughput is gated by LLM API rate limits, token costs, and
 inference latency. These are external constraints that the architecture cannot
 eliminate. The symbolic engine can reduce LLM calls for reasoning; it cannot
 reduce LLM calls for extraction from prose.
 *** Agora may never reach network effects
 Agora faces the cold start problem that every decentralized social protocol
 faces: users won't join without content, creators won't post without users. The
 bootstrapping strategy (managed service → hybrid → full decentralization,
 targeting niche communities first) is well-articulated but its success depends
 on execution in a market where Mastodon, Bluesky, Nostr, and Farcaster are
 already competing for the same users.
 If Agora doesn't reach even Order 1 (1,000 users), the PDS integration is
 academic. Passepartout's DID identity, DIDComm gateway, Note signing, and
 contract verification are all infrastructure for a network that doesn't exist.
 The symbolic engine still works locally — provenance tracking, contradiction
 surfacing, and deduction are all valuable without Agora. But the network effects
 that make Agora a transformative platform — reputation, contracts, marketplaces,
 collective governance — require a living network.
 The risk is asymmetric: Passepartout invests significant engineering in Agora
 integration that provides zero value if Agora fails to launch.
 *** Complexity may prevent adoption
 Passepartout is already a complex system: a Lisp daemon, a terminal UI, a skill
 engine, a gate stack, multiple LLM backends, a Merkle memory system, and an
 event orchestrator. Adding a fact store, a constraint solver, a graph database,
 a theorem prover, an archivist, a planner, and an Agora PDS makes it more
 complex, not less.
 The target user — someone who wants a personal AI assistant that works offline —
 may not want or need any of this. They want the TUI to work, the LLM to be fast,
 and the files to stay safe. The neurosymbolic engine is infrastructure for a use
 case (lifelong personal knowledge management with verifiable provenance) that
 most users do not yet know they have.
 The risk is that Passepartout builds a cathedral for a congregation of one — a
 system that is architecturally brilliant and practically unused because the
 complexity-to-value ratio is too high for anyone except the author.
 *** The self-repair criterion may not hold under adversarial conditions
 The architecture assumes that skills can fail gracefully (fboundp guards, hash
 table fallbacks, degraded mode). It does not assume that a skill can be
 adversarially corrupted to behave correctly while producing wrong results. A
 compromised archivist that extracts plausible but false triples, a compromised
 Screamer that passes all consistency checks, a compromised VivaceGraph that
 returns query results from a parallel graph — these are "living" skills that
 would pass integrity checks and still poison the symbolic index.
 The type-level gates prevent the LLM from modifying gate code. They do not
 prevent a compromised skill (loaded by a trusted human, or corrupted on disk by
 a separate process) from operating normally while subtly wrong. The integrity
 monitoring (Phase 0) catches disk-level corruption through hash checks. It does
 not catch semantic corruption — a skill that is byte-for-byte identical to the
 known-good version but loaded with a malicious input that triggers a latent bug.
 This is not a vulnerability unique to Passepartout. It is a vulnerability in
 every system where components trust each other. But Passepartout's architecture
 amplifies the risk because the symbolic engine is supposed to be the trustworthy
 layer — the component that verifies the LLM's output. If the symbolic engine
 itself is compromised, the system has no higher court of appeal.
 *** The 10-80-10 ratio may create false confidence
 If the sufficiency metric shows "71% non-lossy, threshold 70%, mode: AUTO-
 EXTRACTION," the user may assume the system is trustworthy. But sufficiency is
 global — it aggregates across all domains. The system may have 95% sufficiency
 in the security domain and 5% sufficiency in the literary domain, averaging to
 71%. The auto-extraction switch would bypass the LLM for all categories with
 sufficient coverage, but the threshold is global, not per-domain. A literary
 query would hit the symbolic index that has "sufficient" coverage globally but
 insufficient coverage for literature.
 The notes describe domain-scoped Screamer checks but not domain-scoped
 sufficiency. A global sufficiency metric that triggers a global extraction mode
 change is the wrong granularity. Per-domain sufficiency, with per-domain
 extraction mode, would be more complex but more honest. The architecture as
 described has the simpler, more dangerous version.
 ** Summary Matrix
 |           | Positive                                                                              | Negative                                                                              |
 |-----------+---------------------------------------------------------------------------------------+----------------------------------------------------------------------------------------|
 | INTERNAL  | S: Architectural inversion, unified Org format, provenance as product,                | W: Unproven fact language, Screamer scale unverified, extraction cost hidden,         |
 |           | cardinality model, gate-to-fact bootstrap, self-preservation, organic ontology,       | flip underspecified, adversarial model absent, self-repair tension,                   |
 |           | Wikidata as accelerator, decoupled compute cost                                       | Agora integration scope undefined, per-domain sufficiency missing                     |
 |-----------+---------------------------------------------------------------------------------------+----------------------------------------------------------------------------------------|
 | EXTERNAL  | O: Memory prosthesis, deterministic moat, sovereign agent network,                    | T: Ontology may be harder than expected, Screamer may not scale,                      |
 |           | 10-80-10 for coding achievable, Wikidata cross-domain queries,                        | Wikidata quality/trust, LLM extraction bottleneck, Agora network effects,             |
 |           | ontology versioning, contract pre-arbitration, self-auditing safety,                  | complexity-to-adoption ratio, adversarial semantic corruption,                        |
 |           | knowledge-based social network                                                        | false confidence from global sufficiency metric                                       |
 * What This Unlocks
 ** Technologically
 The neurosymbolic engine, if built, would be the first AI system where:
 1. *Reasoning is auditable.* Every conclusion carries a provenance chain back to
   its premises. The =/audit= command renders the full inference tree — every
   fact, every deduction, every gate outcome — in human-readable form.
 2. *Knowledge accumulates deterministically.* Screamer deductions and gate
   outcomes generate new facts without any LLM involvement. The knowledge base
   grows from the system's own operation, not from re-prompting the LLM.
 3. *Memory is content-addressed.* Every fact is a Merkle node. Every version
   chain is tamper-proof. Rollback is atomic. The storage format is proven
   correct before it is committed to disk.
 4. *Safety is provable, not empirical.* Type-level gates make self-modification
   structurally impossible. ACL2 proves that the rule set has no contradictions.
   The dispatcher doesn't "try" to be safe — it is safe by construction.
 5. *The human and the machine share the same format.* Org files for both. No
   hidden database. No import/export step. The agent's memory IS the human's
   memory.
 These five properties, together, define a new category of AI system: the
 *sovereign reasoning agent*. Not sovereign in the blockchain sense (decentralized
 by consensus), but sovereign in the personal sense: the agent runs on your
 hardware, reasons with your knowledge, and proves its reasoning to you.
 ** Socially
 If the technical vision succeeds and Agora reaches network effects, the
 combination unlocks:
 1. *Verifiable public discourse.* Every published claim carries provenance back
   to source material. "I read this, I thought this, I changed my mind on this
   date, here is the evidence." Public discourse shifts from "competing opinions"
   to "competing evidence chains." The quality floor rises because claims without
   provenance are visibly weaker than claims with provenance.
 2. *Sovereign AI agents with legal and economic personhood.* A Passepartout
   agent with an Agora Persona can own assets, enter contracts, earn reputation,
   and face consequences for failure. This is not a chatbot. It is an autonomous
   entity with cryptographic identity, verified provenance, and economic agency
   — more like a corporation than a tool.
 3. *Self-auditing AI safety.* Every action the agent takes is traceable. Every
   gate decision is recorded. Every fact that informed a decision is queryable.
   AI safety moves from "trust us" to "here is the audit trail." This is the
   transparency that every AI ethics framework calls for.
 4. *A personal knowledge economy.* If your memex can publish Notes as Agora
   content, your intellectual work — your analyses, your syntheses, your
   discoveries — becomes a publishable, attributable, monetizable asset. Not
   through advertising or subscriptions, but through direct value exchange:
   Lightning payments for content access, contract work for your verified
   expertise, reputation that follows your Persona across platforms.
 5. *Collective intelligence without centralized control.* If multiple
   Passepartout agents share facts through Agora Notes, the collective symbolic
   index represents the verified, provenanced knowledge of a community — not the
   averaged opinion of a crowd, but the auditable intersection of independently
   verified claims. This is Wikipedia without the editorial board, science
   without the journal gatekeepers, journalism without the corporate owners.
 6. *A memory prosthesis that outlives the individual.* A memex with a decade of
   diary entries, linked to Wikidata's entity graph, with Screamer deductions
   surfacing patterns and contradictions, with ontology versioning preserving
   intellectual evolution — this is not a knowledge management tool. It is an
   externalized, queryable, auditable record of a life's thinking. It is what
   Vannevar Bush imagined in 1945: "an enlarged intimate supplement to one's
   memory."
 * Conclusion
 The architecture described in these notes is genuinely novel. Not incrementally
 novel — most agent architectures are variations on "LLM + tools + prompt-based
 safety." Passepartout's neurosymbolic vision is categorically different: an
 inversion where the deterministic layer judges the probabilistic layer, where
 facts carry provenance chains, where contradiction is a feature rather than an
 error, and where the user's Org files are the single source of truth for both
 human and machine.
 The largest risk is not that the architecture is wrong. It is that the ontology
 problem — the genuine difficulty of defining what a "fact" is, what relations
 are, what categories are useful, and how they evolve — is harder than the notes
 anticipate, and that the system spends years in a partially-working state where
 the symbolic index is too sparse to be useful but too entangled to be discarded.
 The second-largest risk is that Agora never reaches the network effects needed
 to make the PDS integration valuable beyond a local experiment, and that the
 engineering investment in DIDComm gateways, Note signing, contract verification,
 and Relay integration produces infrastructure for a network that doesn't exist.
 The opportunity is equally large: a system that makes your own mind legible to
 you, that proves its reasoning rather than asserting it, that accumulates
 knowledge across sessions through deduction rather than re-prompting, and that
 publishes verified, provenanced knowledge to a decentralized network. If this
 works — even partially, even slowly — it is a category-level advance over every
 existing agent architecture and every existing personal knowledge management
 tool.
 The notes are a map of territory that no one has walked. The territory is real.
 The map is detailed enough to navigate by. Whether the journey completes depends
 on whether the ontology problem yields to engineering, and whether the user —
 the one human whose memex this serves — finds value in the partial system well
 before the full vision materializes.
--- a/notes/passepartout-agora.org
+++ b/notes/passepartout-agora.org
@@ -0,0 +1,314 @@
 #+TITLE: Passepartout-Agora Integration — Unified Container Format
 #+AUTHOR: Agent
 #+FILETAGS: :notes:integration:agora:passepartout:design:
 #+CREATED: [2026-05-08 Fri]
 * Summary
 Org files and Agora Notes are the same container. Both are text with headers,
 tags, properties, and prose body. Both contain zero or more symbolic facts
 extractable by Passepartout's archivist. The only difference is that an Agora
 Note carries a DID signature and a CID for cryptographic provenance on the
 network. An Org file without a signature is a local Note. A signed Org file
 pushed to the PDS is an Agora Note.
 Passepartout's =memory-object= struct serves as the storage format for both.
 The archivist extracts facts from one unified store. Authorship is distinguished
 by provenance, not location.
 * The Unification
 ** Org files and Notes are the same container
 | Property         | Org file (local)             | Agora Note (network)               |
 |------------------+------------------------------+-------------------------------------|
 | Format           | Org-mode text                | Org-mode text                       |
 | Identity         | Merkle hash (=memory-object=) | CIDv1 (same hash)                   |
 | Contains facts   | Yes (archivist extracts)     | Yes (archivist extracts)            |
 | Author identity  | Implicit (file in =~/memex/=) | Explicit (DID signature in =proof=) |
 | Access control   | Filesystem permissions       | =access_control= flags              |
 | Routing          | N/A (local disk)             | =notify= + =references= + Relay     |
 | Ephemeral        | No                           | =ephemeral_duration=                |
 | Behavioral flag  | Implicit (convention)        | =is_feed= field                     |
 The structure converges in a single plist:
 #+begin_src lisp
 (:cid <merkle-hash>             ;; Identity across local and network
 :title <string>                ;; Org headline title
 :content <org-text>            ;; Full Org body (headings, prose, source blocks)
 :owner <did-or-nil>            ;; For Agora Notes: the signing Persona DID. nil for local
 :proof <plist-or-nil>          ;; ( :editor <did> :signature <bytes> )
 ;; Agora behavioral flags (nil for local files)
 :is-feed <boolean-or-nil>
 :access-control <did-list-or-nil>
 :notify <did-list-or-nil>
 :references <cid-list-or-nil>
 :reply-to <cid-or-nil>
 :thread-root <cid-or-nil>
 :ephemeral-duration <integer-or-nil>
 ;; Passepartout metadata
 :created-at <timestamp>
 :tags <string-list>            ;; Org tags
 :properties <plist>            ;; Org property drawer
 :extracted-facts <fact-list>)  ;; Populated by archivist after extraction
 #+end_src
 ** Facts are extracted from both, identically
 An Org file in =~/memex/literature/pale-fire.org= and an Agora Note from
 =did:agora:heather= with =:references <post-CID>= both contain prose. The
 archivist scans both, proposes triples via the LLM, verifies via Screamer,
 and admits facts to the symbolic index. The facts carry different provenance:
 #+begin_src lisp
 ;; Extracted from local Org file
 (:entity :pale-fire :relation :theme :value :unreliable-narration
 :provenance :local-prose :grounding "heading-42")
 ;; Extracted from Agora Note
 (:entity :kafka :relation :influence :value :nabokov
 :provenance :agora-note :grounding <incoming-note-cid> :author "did:agora:heather")
 #+end_src
 No new extraction path. The archivist already walks containers and extracts
 facts. The container type determines the provenance tag and the grounding
 identifier (local heading ID vs. Note CID).
 ** The memex distinguishes provenance by location, not format
 Incoming Agora Notes arrive at =~/memex/social/notes/<did>/<cid>.org=.
 The directory structure encodes authorship:
 | Path                                              | Meaning                            |
 |---------------------------------------------------+------------------------------------|
 | ~/memex/daily/                                    | Local diary entries                |
 | ~/memex/projects/                                 | Local project files                |
 | ~/memex/literature/                               | Local reading notes                |
 | ~/memex/notes/                                    | Local design and thinking notes    |
 | ~/memex/social/notes/<did>/<cid>.org              | Incoming Notes from other DIDs     |
 | ~/memex/social/outbox/<cid>.org                   | Outgoing Notes signed by the user  |
 The archivist scans all directories. Local files produce facts with
 =:provenance :local-prose=. Agora files produce facts with =:provenance
 :agora-note= + =:author <did>=. The symbolic index maps the provenance
 to the cardinality policy: local prose is =:plural= (the human's own notes —
 multiple interpretations coexist). Agora Notes are =:plural= by default (the
 author's claim, not authoritative over local facts). Agora Notes can be promoted
 to =:singular= or =:dual= if they carry cryptographic proofs of specific claims.
 ** Publishing Org content as Agora Notes
 When the user wants to publish a diary entry, project log, or literary note as
 an Agora Note, the operation is:
 1. Select the Org heading or file.
 2. Compute the Merkle hash (=memory-object= hash → CIDv1).
 3. Sign with the user's Persona DID key (Phase 0b key registry).
 4. Set Agora flags: =:is-feed= t/nil, =:access-control= [], =:references= [previous-note-cid].
 5. Push to the PDS. The Note is an Org plist with a DID signature.
 6. The PDS stores and relays it. The Note remains in =~/memex/social/outbox/= with its CID.
 All of this is a single function: =(note-publish heading-id &key is-feed access-control references)=.
 ~40 lines, extending the vault (key signing), the fact store (CID generation),
 and the memex (output directory).
 * Implications for Passepartout's Architecture
 ** The symbolic index now has a second ingestion path
 Facts enter through three gates:
 1. Gate outcomes (bootstrap + runtime, =:provenance :gate-outcome=)
 2. Screamer deductions (=:provenance :deduced=)
 3. Archivist extraction (=:provenance :local-prose= or =:provenance :agora-note=)
 The third path now covers both local Org files and incoming Agora Notes. No new
 path needed. The archivist gains no new code — only a new directory to walk
 (=~/memex/social/notes/=) and a new provenance tag to assign.
 ** Authentication Layer 1 now has Agora-native verification
 Phase 0b's cryptographic gate (vector 0) verifies DID signatures. An incoming
 Agora Note carries =:owner <did>= and =:proof.signature <bytes>=. Gate vector 0
 verifies the signature against the DID's public key (from the key registry, which
 is now also an Agora DID registry). Verification is identical for local signals
 and Agora signals — the same gate, the same key lookup.
 ** Self-preservation gains an Agora dimension
 The resource monitor (Phase 1a) tracks =~/memex/social/= as a source of storage
 growth. Incoming Notes from network sources are lower preservation priority than
 local prose — if disk pressure hits, incoming Agora Notes are evicted first
 (their source is the remote PDS; they can be re-fetched). Quarantine (Phase 1a)
 extends to Agora channels: if a DID is sending spam or malformed Notes, their
 incoming directory is quarantined and the DID is flagged for human review.
 ** Sufficiency tracks Agora as a provenance source
 The sufficiency score (Phase 4) gains a new provenance category:
 #+begin_example
 Symbolic Index
  Facts:    3,847
    Gate outcomes:       847  (22%)
    Deduced:             921  (24%)
    Human-authored:       72   (2%)
    Local prose:        1,247  (32%)
    Agora Notes:          760  (20%)
  ─────────────────────────
  Non-lossy:            1,840  (48%)
  LLM-proposed:         2,007  (52%)
 #+end_example
 Agora Notes are a provenance source, not a lossiness category. Facts from Agora
 Notes carry =:provenance :agora-note= — they are LLM-extracted (the archivist
 proposes them) but the source is cryptographically signed by a known DID. They
 are neither =:gate-outcome= (mechanical) nor =:llm-proposed= from local prose
 (uncertain source). They occupy a middle ground: verified source, uncertain
 extraction.
 * Implications for Agora
 ** Passepartout IS the PDS
 The TODO.org in =projects/agora/= already captures this: "Passepartout IS the
 PDS — the agent runs a personal data store in-process." With Org files as the
 Note format, this is literal. The PDS stores Org files. The agent reads them.
 The network accesses them via the PDS API. There is no separate PDS process.
 ** Level 0 pre-arbitration via Screamer
 Section 07 of the Agora requirements describes a "Tier 0 Arbitrator" — a local
 AI that provides a sanity check before human arbitration. Passepartout's
 Screamer + fact store provides this at zero LLM tokens when working from
 existing facts:
 - "Contract CID X references arbitrator DID Y. DID Y is active. Verified."
 - "All parties have signed. The HODL invoice is locked. Verified."
 - "The buyer's claim of non-delivery is supported by 3 signed messages with
  timestamps after the delivery deadline."
 - "The seller's proof-of-delivery field is empty. No QR scan recorded."
 Each check is a Screamer query against the contract-lifecycle domain. Results
 are a plist, not a ruling. Both parties see the same evidence summary before
 escalating to Level 1.
 ** Reputation as deduced facts
 Screamer deduces reputation from signed contract chains, not asserted claims:
 #+begin_src lisp
 (:entity "did:agora:heather" :relation :contract-reputation
 :value (:completed 47 :defaulted 0 :disputes 3 :won 3 :escalated 0)
 :provenance :deduced :derived-from (<list of 47 contract CIDs>))
 #+end_src
 This is the strong version of Agora's Trust Score. It's a fact deduced from
 cryptographic evidence, not a claim by the persona (self-reporting could be
 false) and not a claim by a centralized reputation service (could be bought).
 The deduction is auditable — `/audit did:agora:heather` shows every contract,
 every outcome, every ruling.
 ** Agent Behavioral Contracts — formal enforcement for the ABC of Agora
 Bhardwaj (2026) introduces a formal framework that brings Design-by-Contract
 principles to autonomous AI agents. An ABC contract =C = (P, I, G, R)=
 specifies /Preconditions/, /Invariants/ (hard and soft), /Governance/ policies
 (hard and soft), and /Recovery/ mechanisms as first-class runtime-enforceable
 components.
 This maps directly onto Agora's contract lifecycle:
 | ABC component          | Agora mapping                                                |
 |------------------------+--------------------------------------------------------------|
 | =P= (Preconditions)    | Contract Note validity checks: all signers' DIDs active,     |
 |                        | contract CID correctly referenced, HODL invoice locked        |
 | =I= (Invariants)       | Hard: payment amount unchanged, arbitrator DID unchanged.     |
 |                        | Soft: delivery within estimated window                        |
 | =G= (Governance)       | Hard: no party modifies contract terms unilaterally.          |
 |                        | Soft: parties communicate through designated channels         |
 | =R= (Recovery)         | Arbitration escalation, HODL invoice release, reputation      |
 |                        | deduction                                                    |
 The framework's key mathematical results have direct implications for Agora:
 - /Drift Bounds Theorem/: contracts with recovery rate γ > α (natural drift rate
  from LLM non-determinism in agent behavior) bound behavioral drift to D* = α/γ.
  For Agora, this means contract enforcement can be /predictive/ — detecting drift
  before violation — rather than just /corrective/ after breach.
 - /Compositionality Theorem/: sufficient conditions (interface compatibility,
  assumption discharge, governance consistency, recovery independence) under
  which individual contract guarantees compose end-to-end for multi-agent chains.
  This is essential for Agora's multi-party contracts, where a buyer, seller,
  arbitrator, and escrow agent form a chain of interdependent behavioral
  expectations.
 - /(p, δ, k)-satisfaction/: probabilistic compliance accounting for LLM
  non-determinism — contracts hold with probability p, deviations stay within
  tolerance δ, recovery within k steps. This formalizes what Screamer's
  contract-lifecycle domain queries verify: whether the current state of a
  contract satisfies its agreed-upon conditions, given the inherent uncertainty
  in any agent's behavior.
 The empirical results are significant: across 1,980 sessions on 7 models,
 contracted agents (with ABC enforcement) detected 5.2-6.8 soft violations per
 session that uncontracted agents missed entirely, with <10ms per-action overhead.
 Overhead is critical for Passepartout as the PDS — contract enforcement must not
 add latency to Note processing.
 ABC does not replace Screamer. ABC specifies /what/ must hold; Screamer verifies
 /whether/ it holds against the fact store. The contract-lifecycle domain already
 planned for Phase 0b (signal chain) can be implemented as an ABC-like structure:
 a tuple of preconditions, invariants, governance rules, and recovery mechanisms,
 each expressed as Screamer-verifiable facts with Merkle provenance.
 See also:
 - Bhardwaj, V.P. (2026). Agent Behavioral Contracts: Formal Specification and
  Runtime Enforcement for Reliable Autonomous AI Agents. arXiv:2602.22302.
 ** The merkle DAG IS the Key Event Log
 Agora's KEL specification (Section 02) describes an append-only log of key
 events — inception, rotation, revocation, follow events. Passepartout's Merkle
 DAG (Phase 5, built on v0.2.0 memory-object infrastructure) is this log. Each
 key event is a fact in the =:key-lifecycle= domain. Each event has a
 =:parent-id= chaining to the previous event. The DAG is content-addressed —
 every event is a CID. The full KEL is queryable: `/audit did:agora:heather`
 renders every key event, every follow event, every contract signature, with
 provenance chains.
 * Relation to the Neurosymbolic Roadmap
 The Agora integration is not a new phase. It is a consequence of decisions
 already made:
 | Roadmap item            | Agora consequence                                              |
 |-------------------------+----------------------------------------------------------------|
 | Phase 0b (key registry) | Key registry uses Agora DIDs. DID store is =:key-lifecycle= domain |
 | Phase 1 (fact store)    | Fact store is also Note store. Same API, same hash table       |
 | Phase 1a (self-pres.)   | Incoming Notes tracked. Spam DIDs quarantined. Disk eviction    |
 | Phase 3 (archivist)     | Archivist walks =~/memex/social/notes/= alongside local dirs    |
 | Phase 4 (sufficiency)   | Agora Notes are a provenance category in the sufficiency score  |
 | Phase 5 (Merkle DAG)    | DAG = KEL. DAG = contract audit trail                          |
 | Phase 0b (signal chain) | Signal chain = contract lifecycle chain. Same Merkle linking    |
 No new lines in the roadmap. The Note publishing function (~40 lines) is a
 utility, not a phase.
 * What Is NOT Built
 1. *A separate Note parser.* Agora Notes ARE Org files. The existing Org parser
   reads both.
 2. *A separate Note store.* The =memory-object= struct stores both. The
   =*memory-store*= hash table holds both.
 3. *A separate extraction path for Agora content.* The archivist extracts facts
   from prose regardless of origin. The provenance tag distinguishes source.
 4. *A new authentication mechanism for Agora signals.* Gate vector 0 verifies
   DID signatures. The key registry is the DID registry.
 See also:
 - =projects/agora/docs/= — Agora requirements (overview, identity, primitive, social, contracts, governance)
 - =projects/agora/TODO.org= — Passepartout integration track
 - =passepartout-neurosymbolic-design-decisions-and-options.org= — the full design rationale
 - =passepartout-neurosymbolic-roadmap.org= — the phased implementation plan
--- a/notes/passepartout-neurosymbolic-design-decisions-and-options.org
+++ b/notes/passepartout-neurosymbolic-design-decisions-and-options.org
@@ -442,6 +442,371 @@ design. The gate stack provides the seed. Gate outcomes, prose extraction,
 deduction, and human authoring grow the shoots. Screamer prunes contradictions.
 The ontology is a garden, not a building.
 * Empirical Validation — Modular Ontology Engineering with LLMs
 Shimizu and Hitzler (2025, /Journal of Web Semantics/) argue that LLMs can
 significantly accelerate knowledge graph and ontology engineering — modeling,
 extension, population, alignment, and entity disambiguation — but /only/ if
 ontologies are modular. Their paper provides empirical evidence that validates
 the modular architecture described in this document and exposes concrete patterns
 the archivist should adopt.
 ** The central finding: modularity is the key variable
 In a complex ontology alignment task (mapping between two oceanography ontologies
 with hundreds of classes and properties), an LLM without module information
 detected correct mappings for 5 of 109 alignment rules — effectively useless. When
 the same LLM was given the module structure of the target ontology (20 named
 conceptual modules such as "Organization," "Cruise," "Physical Sample"), it
 detected correct mappings for 104 of 109 rules — 95% accuracy. The variable was
 modularity.
 For ontology population (extracting triples from text), their best results came
 from prompts that included a schematic representation of a /single module/ plus
 one extraction example. Against ground truth, this achieved approximately 90%
 extraction accuracy. Without module-scoped prompting, quality degraded
 substantially.
 The mechanism: conceptual modules scope the LLM's attention to something
 human-sized. The paper's central claim — "by somehow limiting the scope, we
 achieve a more human-like approach — and one more capable of being expressed
 succinctly in language, and thus more appropriate for LLM-based assistance" — is
 an independent discovery of the same principle underlying Passepartout's
 domain-scoped Screamer checks and per-domain cardinality policies.
 ** MOMo: a mature modular ontology methodology
 The authors' approach, MOMo (Modular Ontology Modeling), has been developed over a
 decade and includes:
 - A /step-by-step methodology/ that breaks ontology design into clearly delineated
  pieces, each "easier to automate than going one-shot from base data to an
  ontology."
 - A /pattern description language/ (OPLa, expressed in OWL) for annotating modules
  so they can be identified programmatically.
 - A /design library/ (MODL) containing hundreds of commonsense micropatterns
  organized for programmatic access, including via RAG.
 - A /Protégé plugin/ (CoModIDE) for graphical modular ontology development.
 Critically, their modules are not formal sub-ontologies with logical boundaries.
 They are /conceptual/ partitions — groupings of classes, properties, and axioms
 around "key notions" identified by domain experts. Modules can overlap and nest.
 There are "no precise rules" for what belongs in a module. The modules provide
 "conceptual bridges between human expert conceptualization and data reality."
 ** What Passepartout should adopt
 *** The modular prompt pattern for the archivist
 The extraction prompt structure that achieved 90% accuracy is concrete and
 replicable: a schematic representation of a domain module plus a single extraction
 example. The archivist should use this pattern when extracting facts from prose.
 Instead of a generic "extract triples from this text" prompt (200 tokens), the
 prompt should reference the relevant module(s) and include an example triple for
 each relation in that module. The module provides /context/; the example provides
 /format/. Both improve LLM extraction quality without increasing Screamer's
 verification burden.
 *** MOMo modules as ontology scaffold
 The Passepartout notes describe an organic growth model: gate-bootstrapped facts
 seed the ontology; gate outcomes, Screamer deductions, and archivist proposals
 grow the shoots. This is correct for the /security and filesystem/ domains where
 the gate stack already encodes expertise. For the broader memex — literature,
 daily reflection, project planning — the 50-70 gate-bootstrapped entity classes
 are starvation.
 MOMo's micropattern library provides a ready-made scaffold for these domains.
 Hundreds of commonsense patterns already exist for temporal relations, spatial
 relations, agent-action, organizational structure, provenance, and event
 participation. Loading these as initial modules — with :policy :plural and
 =:provenance :external-ontology= — would give the symbolic index a structured
 vocabulary for domains where the gate stack has nothing to offer. The organic
 growth model then /extends and refines/ these modules rather than inventing them
 from scratch. This is the Wikidata strategy applied at the schema level: adopt
 existing structured knowledge, connect personal facts to it, and surface
 disagreements rather than resolve them.
 *** OPLa annotation for module identification
 MOMo modules annotated in OPLa can "easily be identified programmatically." If
 Passepartout annotates its ontology modules in a compatible format (even a
 simplified plist-based equivalent), the archivist can automatically select the
 right module(s) when extracting facts from prose. A heading in =literature/=
 triggers the literature module; a heading in =projects/= triggers the software
 engineering module; a heading tagged =:personal:= triggers the diary module. The
 module scopes the prompt. The prompt improves extraction. Screamer gates the
 result. This is the full pipeline, validated at each step.
 ** What this means for the Passepartout architecture
 The paper validates three design decisions already made:
 1. /Modularity is non-negotiable./ The paper found that modularity is the
   difference between 5% and 95% accuracy on alignment. Passepartout's per-domain
   cardinality policies and domain-scoped Screamer checks are the same insight
   implemented in a different context. The paper proves the approach works;
   Passepartout applies it to verification rather than extraction.
 2. /The extraction pipeline is feasible./ 90% population accuracy with module-
   scoped prompts means the archivist /can/ extract useful facts from prose. The
   remaining 10% — the hallucination rate — is what Screamer catches. The paper
   validates the LLM-as-proposer role; Passepartout adds the Screamer-as-verifier
   role.
 3. /KGs are positioned as anti-hallucination infrastructure./ The paper explicitly
   frames knowledge graphs as "ground truth to escape from LLM hallucinations" and
   as "components of other neurosymbolic approaches." This is the Passepartout
   thesis — the symbolic index as ground truth against which LLM proposals are
   checked — stated in the academic literature by the editors of the neurosymbolic
   AI handbooks.
 And it exposes one gap in the current design:
 1. /Emergent modularity may be slower than designed modularity./ Passepartout's
   modules are supposed to emerge organically from gate patterns, Screamer
   generalizations, and cross-domain overlap detection. MOMo's modules are
   designed by domain experts who identify key notions upfront. The emergent
   approach is philosophically cleaner — the system learns its own categories —
   but practically slower. The paper's results suggest that adopting designed
   modules as a scaffold, and letting emergent growth /refine/ rather than
   /invent/ them, would compress the timeline for sufficiency by years.
 ** Relation to Wikidata loading
 The MOMo micropattern approach and the Wikidata loading strategy are complementary:
 | Layer          | MOMo provides                  | Wikidata provides        |
 |----------------+--------------------------------+--------------------------|
 | Schema         | Modular ontology of relations  | — (Wikidata's schema is  |
 |                | and entity classes             |   implicit in its data)  |
 | Instances      | — (patterns, not entities)     | 100M+ entities with      |
 |                |                                | property-value pairs     |
 MOMo gives Passepartout the /relations/ (wrote, lectured-on, influenced,
 published-in). Wikidata gives Passepartout the /instances/ (Nabokov, Pale Fire,
 Kafka). Both are needed. Neither alone is sufficient. The MOMo scaffold tells the
 archivist /what kinds of facts to look for/. The Wikidata graph tells the
 archivist /which entities those facts are about/. Together they transform the
 extraction task from "discover entities and their relations from prose" to
 "connect this prose heading to known entities using known relations" — a
 dramatically simpler prompt with dramatically higher expected accuracy.
 ** Reference
 - Shimizu, C., & Hitzler, P. (2025). Accelerating knowledge graph and ontology
  engineering with large language models. /Journal of Web Semantics, 85/,
  100862. https://doi.org/10.1016/j.websem.2025.100862
 ** See also
 - =passepartout-neurosymbolic-roadmap.org=: Phase 3 (Archivist) — the modular
  prompt pattern should be incorporated into the extraction pipeline.
 - =passepartout-agora.org=: the KEL / contract audit trail as instances of
  MOMo-style key-lifecycle and contract-lifecycle modules.
 - =notes/passepartout-SWOT.org=: the SWOT analysis which identifies the ontology
  problem as the key bottleneck — MOMo partially addresses this.
 ** Supporting References
 *** MOMo: the canonical methodology
 Shimizu, Hammar & Hitzler (2023, /Semantic Web Journal/) present the full MOMo
 methodology — 31 pages covering the step-by-step design process, schema diagrams
 as knowledge elicitation tools, ODP libraries, OPLa annotation language, and
 CoModIDE, a Protégé plugin for graphical modular ontology development. The paper
 was evaluated with usability studies and demonstrates that modular development
 significantly improves approachability for domain experts who are not ontology
 engineers.
 Key architectural commitments from MOMo that Passepartout should adopt:
 - /Schema diagrams/ as the primary communication format between ontologist and
  domain expert. Passepartout's equivalent: the archivist's module-scoped prompt
  includes a simplified schema diagram of the module being populated.
 - /Template-based instantiation/ of ontology design patterns into concrete
  modules. Passepartout's equivalent: micropatterns loaded from MODL are
  instantiated with entities from the user's memex, producing concrete facts.
 - /Systematic axiomatization/ — 17 frequently used axiom patterns for each
  node-edge-node construction in a schema diagram. Passepartout's equivalent:
  Screamer constraint rules derived from module structure.
 Reference:
 - Shimizu, C., Hammar, K., & Hitzler, P. (2023). Modular ontology modeling.
  /Semantic Web, 14/(3), 459–489. https://doi.org/10.3233/SW-222886
 *** Ontology Population — the empirical methodology
 Norouzi et al. (2024) provide the full experimental methodology behind the ~90%
 extraction accuracy claim. Using the Enslaved.org Hub Ontology as ground truth
 and Wikipedia articles as source text, they tested five LLMs across a three-stage
 pipeline: preprocessing, text retrieval, and KG population. The critical finding:
 prompts that included a /schema diagram/ of the target ontology module (using
 MOMo's visual conventions with colored boxes for classes, arrows for relations)
 plus a single extraction example achieved the highest accuracy. Without
 module-scoped prompts, quality degraded substantially.
 Three findings are directly applicable to the archivist:
 1. /Role chain simplification./ The Enslaved Ontology has complex role chains
   (e.g., Person → hasRole → Role → inEvent → Event). These were collapsed into
   shortcut relations (e.g., Person → participatedIn → Event) for LLM extraction.
   The archivist should maintain two layers: the /logical/ schema with full role
   chains for Screamer verification, and the /extraction/ schema with simplified
   relations for LLM prompting.
 2. /Variance across models./ Five LLMs were tested. Performance varied
   significantly. The archivist should benchmark extraction accuracy per provider
   and per module, and route extraction tasks to the best-performing model for
   each module — extending the existing model-tier routing (v0.3.0) from
   complexity-based to accuracy-based routing.
 3. /Cross-source validation./ The paper used both Wikipedia text and Wikidata
   as overlapping sources for the same entities, enabling cross-verification.
   The archivist can do the same: extract facts from the user's prose, extract
   facts from Wikidata for the same entities, and present disagreements with
   provenance. This is the =:plural= cardinality policy applied at extraction time.
 Reference:
 - Norouzi, S.S., Barua, A., Christou, A., Gautam, N., Eells, A., Hitzler, P.,
  & Shimizu, C. (2024). Ontology Population using LLMs. arXiv:2411.01612.
 * Historical Lineage — McCarthy's Advice Taker
 McCarthy's "Programs with Common Sense" (1959) is the direct intellectual ancestor
 of the Passepartout architecture. The paper proposed an "advice taker" — a program
 that "will draw immediate conclusions from a list of premises" expressed in
 "a suitable formal language (most likely a part of the predicate calculus)." The
 program would:
 1. Accept declarative statements about the world as input.
 2. Store them as logical formulas.
 3. Reason from them to produce new conclusions.
 4. Accept new facts and revise its conclusions.
 This is precisely the Passepartout pipeline: the archivist extracts declarative
 facts from prose → Screamer checks them for consistency → VivaceGraph stores them
 → the planner reasons from them → new facts from gate outcomes and deductions
 revise the store. McCarthy proposed it in 1959. Passepartout is building it in
 2026.
 The gap between McCarthy's proposal and Passepartout's implementation is the
 /hallucination problem/. McCarthy assumed facts would be entered by a human
 programmer in formal logic. Passepartout's facts are extracted from natural
 language prose by an LLM — a probabilistic process that requires deterministic
 verification. Screamer is the component McCarthy didn't need: a constraint solver
 that gates LLM-proposed facts against the existing fact store.
 The connection is not metaphorical. McCarthy cited Principia Mathematica as an
 influence on Lisp. Passepartout's Whitehead note traces the same PM → Lisp
 lineage. The advice taker → Passepartout lineage completes the arc: PM's formal
 logic → Lisp → McCarthy's advice taker → Passepartout's neurosymbolic engine.
 Reference:
 - McCarthy, J. (1959). Programs with Common Sense. /Proceedings of the
  Teddington Conference on the Mechanization of Thought Processes./
 * Philosophical Validation — The Neurosymbolic Consensus
 Three papers from the neurosymbolic AI research community validate the
 architectural thesis from complementary angles.
 ** Marcus (2020): The Case Against Pure Deep Learning
 Gary Marcus's "The Next Decade in AI" argues that deep learning alone is "data
 hungry, shallow, brittle, and limited in its ability to generalize." The paper
 demonstrates GPT-2 failing at basic commonsense reasoning:
 - "Yesterday I dropped my clothes off at the dry cleaners and have yet to pick
  them up. Where are my clothes?" → GPT-2: "at my mom's house."
 - "There are six frogs on a log. Two leave, but three join. The number of frogs
  on the log is now" → GPT-2: "seventeen."
 Marcus proposes four steps toward robust AI: hybrid architecture (combining
 neural and symbolic), large-scale knowledge (abstract and causal, not just
 statistical), reasoning (formal inference over structured representations), and
 cognitive models (frameworks for how entities relate). Passepartout implements all
 four: the perceive-reason-act pipeline is hybrid, the symbolic index is causal
 knowledge, Screamer + ACL2 provide reasoning, and the gate-bootstrapped ontology
 plus MOMo modules provide cognitive models.
 Marcus's core claim — "we have no hope of achieving robust intelligence without
 first developing systems with deep understanding" — is the justification for
 Passepartout's entire neurosymbolic investment. The alternative is a system that
 works "on a good day" and fails unpredictably. The deterministic gate stack and
 Screamer admission gate are the engineering realization of Marcus's call for
 robustness.
 Reference:
 - Marcus, G. (2020). The Next Decade in AI: Four Steps Towards Robust
  Artificial Intelligence. arXiv:2002.06177.
 ** Gaur & Sheth (2023): CREST — Trustworthy Neurosymbolic AI
 Gaur and Sheth present the CREST framework: Consistency, Reliability, user-level
 Explainability, and Safety build Trust — and they argue these require
 neurosymbolic methods. Their empirical finding: GPT-3.5 breached safety
 constraints 30% of the time when asked identical questions repeatedly. Claude's
 16 safety rules and Sparrow's 23 rules provide no /inherent/ safety — they are
 heuristic guardrails that can be breached through prompt variation.
 These findings validate three Passepartout design commitments:
 1. /Prompt-level safety is insufficient./ Claude and Sparrow use rules that
   consume LLM tokens and can be evaded. Passepartout's deterministic gates run
   in pure Lisp, cost 0 tokens, and cannot be evaded by prompt engineering.
 2. /Inconsistency is the norm, not the exception./ Gaur & Sheth show that even
   identical queries produce inconsistent responses ~30% of the time. This
   validates the cardinality model: a system that expects contradiction and
   surfaces it with provenance is architecturally more honest than one that
   assumes consistency and silently resolves it.
 3. /Knowledge infusion is required for trust./ The CREST framework embeds
   domain knowledge (clinical guidelines, procedural knowledge) into LLM
   pipelines. Passepartout's symbolic index IS the knowledge infusion layer —
   facts extracted from prose, verified by Screamer, and available for any LLM
   call through the context assembly pipeline.
 Reference:
 - Gaur, M., & Sheth, A. (2023). Building Trustworthy NeuroSymbolic AI Systems:
  Consistency, Reliability, Explainability, and Safety. arXiv:2312.06798.
 ** Sheth et al. (2022): Knowledge-Infused Learning
 Sheth, Gunaratna, Bhatt, and Gaur define Knowledge-infused Learning (KiL) as
 "combining various types of explicit knowledge with data-driven deep learning
 techniques." They identify three infusion levels (shallow, semi-deep, deep) and
 position KiL as "a sweet spot in neuro-symbolic AI."
 The paper makes two observations relevant to Passepartout:
 1. /Data alone is not enough./ The opening cites Pedro Domingos ("Data Alone is
   Not Enough"), Andrew Ng ("the importance of Big Data is overhyped"), and
   Gary Marcus ("AI that captures how humans think"). These are the intellectual
   warrant for the symbolic index: a knowledge layer that is independent of any
   specific LLM call, accumulated across sessions, and verified against existing
   facts.
 2. /Expert knowledge is external to the model./ Domain experts use "their past
   experience, web or domain-specific knowledge sources, and annotation
   guidelines" to create ground truth — resources the LLM cannot access during
   training. The symbolic index makes these resources queryable: facts from the
   gate stack (security expertise), from the human (declarative authoring), from
   Wikidata (world knowledge), and from Screamer deductions (derived expertise).
 Passepartout's architecture is a specific implementation of KiL at the deepest
 infusion level: knowledge is not appended to prompts (shallow) or embedded in
 fine-tuning (semi-deep). It is a first-class data structure — the symbolic index
 — that the LLM queries through the archivist and the planner. The knowledge is
 living: it accumulates, is verified, carries provenance, and evolves through
 ontology versioning.
 Reference:
 - Gaur, M., Gunaratna, K., Bhatt, S., & Sheth, A. (2022). Knowledge-Infused
  Learning: A Sweet Spot in Neuro-Symbolic AI. /IEEE Internet Computing, 26/(4),
  5–11. https://doi.org/10.1109/MIC.2022.3179759
 * Semantic Wikipedia as Entity Backbone
 The gate stack provides 50-70 entity classes — adequate for a coding agent where
@@ -1412,3 +1777,19 @@ See also:
 - =passepartout/docs/DESIGN_DECISIONS.org= — the existing design decisions
 - =passepartout/docs/ARCHITECTURE.org= — the current pipeline architecture
 - =passepartout/docs/ROADMAP.org= — the feature roadmap through v0.13.0
 - =notes/passepartout-SWOT.org= — SWOT analysis of the neurosymbolic architecture
 - =passepartout-agora.org= — Passepartout-Agora integration design
 - Shimizu, C. & Hitzler, P. (2025). Accelerating knowledge graph and ontology
  engineering with large language models. /Journal of Web Semantics, 85/, 100862.
  https://doi.org/10.1016/j.websem.2025.100862
 - Shimizu, C., Hammar, K., & Hitzler, P. (2023). Modular ontology modeling.
  /Semantic Web, 14/(3), 459–489. https://doi.org/10.3233/SW-222886
 - Norouzi, S.S. et al. (2024). Ontology Population using LLMs. arXiv:2411.01612.
 - McCarthy, J. (1959). Programs with Common Sense. /Proc. Teddington Conf. on
  the Mechanization of Thought Processes./
 - Marcus, G. (2020). The Next Decade in AI. arXiv:2002.06177.
 - Gaur, M. & Sheth, A. (2023). Building Trustworthy NeuroSymbolic AI Systems.
  arXiv:2312.06798.
 - Gaur, M., Gunaratna, K., Bhatt, S., & Sheth, A. (2022). Knowledge-Infused
  Learning. /IEEE Internet Computing, 26/(4), 5–11.
 - Bhardwaj, V.P. (2026). Agent Behavioral Contracts. arXiv:2602.22302.
--- a/projects/passepartout
+++ b/projects/passepartout
--- a/projects/passepartout-contrib
+++ b/projects/passepartout-contrib