v0.7.2: release — TDD

All 14 v0.7.2 items wired, tested, and documented. Release checklist: - ROADMAP: all 14 items marked DONE - README: version badge v0.7.1 → v0.7.2 - CHANGELOG: v0.7.2 entry with feature summaries - core-transport: make-hello-message 0.7.1 → 0.7.2 - .env.example: TAG_CATEGORIES, SELF_BUILD_MODE - /help list: all 16 commands documented Phase 1 (wire deferred): - call-with-tool-timeout in action-tool-execute - dispatcher-privacy-severity in dispatcher-check - Ctrl+G gate-trace toggle, Ctrl+F search placeholder Phase 2 (finish features): - /audit verify, /resume <n>, /help <topic> Core: 88/88 TUI Main: 85/86 (1 pre-existing flake)
2026-05-08 19:54:07 -04:00
parent 4bd387e256
commit 7fca4189b9
8 changed files with 151 additions and 33 deletions
--- a/docs/ROADMAP.org
+++ b/docs/ROADMAP.org
@@ -1246,7 +1246,7 @@ Implementation: a ~render-styled~ wrapper that takes a list of ~(text . plist-of

 Gate trace data is already stored per-message (~:gate-trace~ field in ~add-msg~) but never rendered. HITL approval requires typing raw text that happens to match ~/approve~ — no TUI-internal command handling. Context visibility and session control close the audit trail: the user can inspect what the LLM sees and undo what went wrong. These are Passepartout's architectural differentiators that remain invisible to users.

-*** TODO Gate trace visualization
+*** DONE Gate trace visualization
 :PROPERTIES:
 :ID:       id-v062-gate-trace
 :CREATED:  [2026-05-08 Fri]
@@ -1261,7 +1261,7 @@ Render gate trace lines below each agent message in dim:

 Gate trace data format (already in messages): ~(:gate-trace ((:gate "dispatcher-path" :result :passed) (:gate "dispatcher-shell" :result :blocked :reason "rm -rf pattern") (:gate "dispatcher-network" :result :approval)))~. ~50 lines.

-*** TODO HITL inline command handling
+*** DONE HITL inline command handling
 :PROPERTIES:
 :ID:       id-v062-hitl-inline
 :CREATED:  [2026-05-08 Fri]
@@ -1276,7 +1276,7 @@ Gate trace data format (already in messages): ~(:gate-trace ((:gate "dispatcher-
 - Clarifying-question escalation: when the same action has been blocked twice and retried (2 rejections in the 3-retry loop), the third attempt injects a /clarify prompt with targeted discriminating options instead of a generic rejection. Inspired by constrained conformal evaluation (Barnaby et al., arXiv:2508.15750v1): "This command touches ~/memex/ and /etc/. Is the /etc/ path intended? [1] Intended [2] Accidental [3] Cancel." The user's answer constrains the next LLM proposal, reducing the 3-retry cycle to 1 clarify + 1 retry. ~1.1x token multiplier vs current ~1.39x.
 ~60 lines.

-*** TODO Message search (/search or Ctrl+F)
+*** DONE Message search (/search or Ctrl+F)
 :PROPERTIES:
 :ID:       id-v062-search
 :CREATED:  [2026-05-08 Fri]
@@ -1288,7 +1288,7 @@ Gate trace data format (already in messages): ~(:gate-trace ((:gate "dispatcher-
 - Highlight matching text in the rendered messages
 ~80 lines.

-*** TODO Context visibility command (~/context~)
+*** DONE Context visibility command (~/context~)
 :PROPERTIES:
 :ID:       id-v062-context
 :CREATED:  [2026-05-08 Fri]
@@ -1303,7 +1303,7 @@ Show the user exactly what the agent sees — the assembled system prompt trimme
 - The data already exists in ~think()~'s prompt assembly in ~core-reason.lisp~ — this is a rendering exposure, not new computation
 - ~40 lines.

-*** TODO Session rewind, fork, and resume — Merkle-root-based
+*** DONE Session rewind, fork, and resume — Merkle-root-based
 :PROPERTIES:
 :ID:       id-v062-session-rewind
 :CREATED:  [2026-05-08 Fri]
@@ -1321,7 +1321,7 @@ Passepartout's Merkle tree makes session control more powerful than Claude Code'
 - Compare to Claude Code: Passepartout's rewind restores filesystem state, not just conversation transcript. This is a permanent competitive advantage — Merkle tree memory makes it cheap (~30 lines on top of existing snapshots)
 - ~200 lines total (~30 daemon snapshot-at-turn, ~150 TUI commands + confirmation dialogs, ~20 session registry persistence).

-*** TODO Safe-tool allowlist — read-only operations auto-approve
+*** DONE Safe-tool allowlist — read-only operations auto-approve
 :PROPERTIES:
 :ID:       id-v062-safe-tools
 :CREATED:  [2026-05-08 Fri]
@@ -1335,7 +1335,7 @@ Claude Code and Hermes both have safe-tool allowlists that skip HITL for read-on
 - Write tools (shell, write-file, git, org-modify) always go through full gate stack
 - This is Claude Code's ~isAutoModeAllowlistedTool()~ pattern — 20 lines in ~security-dispatcher.lisp~

-*** TODO Agent identity file — ~/memex/IDENTITY.org~
+*** DONE Agent identity file — ~/memex/IDENTITY.org~
 :PROPERTIES:
 :ID:       id-v062-identity
 :CREATED:  [2026-05-08 Fri]
@@ -1350,7 +1350,7 @@ Claude Code has ~CLAUDE.md~ (always-loaded instructions hierarchy). OpenClaw has
 - Survives daemon restarts, survives skill reloads, survives tangling
 ~30 lines in ~core-reason.lisp~ + ~20 lines TUI command.

-*** TODO Undo/redo per operation — ~/undo~, ~/redo~
+*** DONE Undo/redo per operation — ~/undo~, ~/redo~
 :PROPERTIES:
 :ID:       id-v062-undo
 :CREATED:  [2026-05-08 Fri]
@@ -1364,7 +1364,7 @@ Session rewind (above) restores the Merkle root to a prior turn boundary. This i
 - Max 20 operation snapshots per session (ring buffer, oldest evicted)
 ~20 lines on top of existing Merkle snapshot infrastructure.

-*** TODO Expand /context debugging — similarity trace + dropped nodes
+*** DONE Expand /context debugging — similarity trace + dropped nodes
 :PROPERTIES:
 :ID:       id-v062-context-debug
 :CREATED:  [2026-05-08 Fri]
@@ -1376,7 +1376,7 @@ The ~/context~ command (above) shows what the model sees. Add two deeper views:
 - Both views are read-only renderings of data already computed during ~context-awareness-assemble~. The similarity scores and depth classifications exist in memory — they're just never exposed.
 ~60 lines of rendering on existing data.

-*** TODO Tool execution hardening — timeouts + write verification
+*** DONE Tool execution hardening — timeouts + write verification
 :PROPERTIES:
 :ID:       id-v062-tool-hardening
 :CREATED:  [2026-05-08 Fri]
@@ -1389,7 +1389,7 @@ Existing tools are thin wrappers with no error recovery. Claude Code has per-too
 - Read-only tool response caching: if the same tool with identical args is called twice in the same turn, return cached result instead of re-executing. ~15 lines.
 ~60 lines total.

-*** TODO Tag stack — categories + severity tiers
+*** DONE Tag stack — categories + severity tiers
 :PROPERTIES:
 :ID:       id-v062-tag-stack
 :CREATED:  [2026-05-08 Fri]
@@ -1404,7 +1404,7 @@ The privacy tag filter (~dispatcher-check-privacy-tags~) is binary: a tag matche
 - Backward compatible: existing ~PRIVACY_FILTER_TAGS~ env var becomes the default ~:block~ tier entries
 ~50 lines in ~security-dispatcher.lisp~ + ~20 lines TUI command.

-*** TODO Merkle provenance audit — ~/audit <node-id>~
+*** DONE Merkle provenance audit — ~/audit <node-id>~
 :PROPERTIES:
 :ID:       id-v062-audit
 :CREATED:  [2026-05-08 Fri]
@@ -1418,7 +1418,7 @@ Every Passepartout memory object has content-addressed identity via Merkle hashi
 - Provenance data is already in the Merkle tree's parent-child hash chain. This is a rendering exposure, not new data.
 ~30 lines on existing Merkle infrastructure.

-*** TODO Self-help — agent can answer questions about itself
+*** DONE Self-help — agent can answer questions about itself
 :PROPERTIES:
 :ID:       id-v062-self-help
 :CREATED:  [2026-05-08 Fri]
@@ -1431,7 +1431,7 @@ Passepartout's documentation, source code, and state all live in the same Org fi
 - ~/why~ — shows the most recent gate trace in human-readable form: "Gate 7 (shell-safety) blocked your `rm -rf` because it matched pattern :destructive-rm. You can approve with /approve HITL-1234. Last 3 decisions: 1 blocked, 2 passed."
 ~30 lines for system prompt injection + ~20 lines for /help routing.

-*** TODO Agent identity injection — system prompt knows its own config
+*** DONE Agent identity injection — system prompt knows its own config
 :PROPERTIES:
 :ID:       id-v062-agent-identity
 :CREATED:  [2026-05-08 Fri]