chore: checkpoint broken state before fixing macro conflict

2026-04-28 10:33:51 -04:00
parent a717ab1d3a
commit 3dddfe3e3d
7 changed files with 428 additions and 618 deletions
--- a/docs/ROADMAP.org
+++ b/docs/ROADMAP.org
@@ -5,46 +5,164 @@

 The roadmap is designed working backwards from SOTA parity (V 1.0.0), guiding each version toward a fully autonomous, self-editing agent. Each version builds on the previous, with features designed to be implemented in pure Common Lisp + Org-mode.

+** Non-Negotiable Identity
+- Pure Common Lisp + Org-mode. No JSON. No YAML. No external databases.
+- Single-address-space memory (Lisp hash tables in RAM — the agent IS the memory).
+- "Thin harness, fat skills" — complexity lives at the edges, not the kernel.
+- One agent composed of many skills. Concurrency via bordeaux-threads (shared memory).
+- Plists everywhere — homoiconic communication between all components.
+
 ** Version Roadmap

-*** v0.1.0: The Autonomous Foundation ✅
+*** v0.1.0: The Autonomous Foundation — CURRENT RELEASE ✅
+
 The secure, auditable Lisp kernel. All core infrastructure in place.

+| Component                         | Status | Notes                                                                 |
+|-----------------------------------+--------+-----------------------------------------------------------------------|
+| Perceive-Reason-Act pipeline      | ✅     | 3-stage metabolic loop                                                |
+| Skills engine with jailed loading | ✅     | defskill, topological sort, hot-reload                                |
+| Policy skill (6 invariants)       | ✅     | Transparency, Autonomy, Bloat, Modularity, Mentorship, Sustainability |
+| Bouncer skill                     | ✅     | Command whitelist guard functions                                     |
+| Memory (org-object + Merkle)      | ✅     | Hash tables, snapshots, rollback                                      |
+| Lisp validator skill              | ✅     | Syntax validation before eval                                         |
+| Scribe + Gardener skills          | ✅     | Heartbeat-driven distillation + audit                                 |
+| LLM gateway (OpenRouter + Ollama) | ✅     | Provider cascade                                                      |
+| Shell actuator                    | ✅     | Safe command execution                                                |
+| Emacs bridge via Swank            | ✅     | Point/buffer updates                                                  |
+| FiveAM test suite                 | ✅     | Memory, boot, pipeline, act, communication                            |
+| Credentials vault                 | ✅     | Encrypted storage                                                     |
+
 *** v0.2.0: Interactive Refinement ✅
+
 The "Brain" meets the "Machine." Standardization and professionalization of the user interface and environment.

-| Feature | Status | Notes |
-| :--- | :---: | :--- |
-| Minimalist Kernel | ✅ | Purified harness targeting I/O & Memory only. |
-| Sovereign Skills | ✅ | Diagnostics and Configuration extracted to Userland. |
-| POSIX/XDG Compliance| ✅ | Standardized paths (~/.config, ~/.local). |
-| Professional TUI | ✅ | Styled, scrollable, and verified Lisp interface. |
-| Onboarding Wizard | ✅ | Modular Lisp setup for multiple LLM providers. |
-| Linkage Command | ✅ | Real-time verification of external gateways (Telegram). |
+| Feature              | Status | Notes                                                   |
+| :---                 | :---:  | :---                                                    |
+| Minimalist Kernel    | ✅     | Purified harness targeting I/O & Memory only.           |
+| Sovereign Skills     | ✅     | Diagnostics and Configuration extracted to Userland.    |
+| POSIX/XDG Compliance | ✅     | Standardized paths (~/.config, ~/.local).               |
+| Professional TUI     | ✅     | Styled, scrollable, and verified Lisp interface.        |
+| Onboarding Wizard    | ✅     | Modular Lisp setup for multiple LLM providers.          |
+| Linkage Command      | ✅     | Real-time verification of external gateways (Telegram). |
+| Self-Editing         | ✅     | Detects errors, applies fixes, learns from outcomes.    |
+| Memory Rollback      | ✅     | Snap back to known-good state on critical errors.       |

 *** v0.3.0: Event Orchestration + HITL
+
 Unified control plane and Human-in-the-Loop (HITL) state management.

-| Feature | Description |
-| :--- | :--- |
-| org-skill-event-orchestrator | Unified hooks + cron + routing. Three tiers: =:REFLEX=, =:COGNITION=, =:REASONING=. |
-| Human-in-the-Loop (HITL) | Continuation-based interaction. The agent can "suspend" to ask clarification and resume state. |
-| Asynchronous Embedding Gateway | Provider-agnostic vector generation (Ollama, local llama.cpp) via background worker. |
-| Telegram Gateway Skill | Full implementation of the message receiver for linked Telegram bots. |
-| Command Palette | Support for internal `/` commands defined as Lisp-native capabilities. |
+| Feature                        | Description                                                                                                                                               |
+|--------------------------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------|
+| org-skill-event-orchestrator   | Unified hooks + cron + routing. Three tiers: =:REFLEX= (no LLM), =:COGNITION= (light LLM), =:REASONING= (full LLM).                                       |
+| Human-in-the-Loop (HITL)       | Continuation-based interaction. The agent can "suspend" its cognitive loop to ask for permission or clarification and resume precisely where it left off. |
+| org-skill-context-manager      | Stack-based project scoping. =push-context= / =pop-context=. Path resolution relative to context.                                                         |
+| Memory scope segmentation      | =:scope= property on org-objects: memex/session/project. Scope-aware retrieval.                                                                           |
+| Model-tier routing             | Complexity-based model selection: heartbeat → tiny, user → medium, reasoning → large.                                                                     |
+| Slash commands                 | =M-x= style command palette in TUI. Commands defined in Org-mode.                                                                                         |
+| Asynchronous Embedding Gateway | Provider-agnostic vector generation (Ollama, local llama.cpp) via background worker.                                                                      |
+| Telegram Gateway Skill         | Full implementation of the message receiver for linked Telegram bots.                                                                                     |

 *** v0.4.0: Long-Horizon Planning + Git Workflows
+
 Structured tracking, failure handling, and course correction for multi-step engineering work.

-| Feature | Description |
-| :--- | :--- |
-| org-skill-long-horizon | Decompose tasks into Org-mode headline trees. |
-| org-skill-git-steward | Policy-enforced git workflows (Commit-Before-Modify). |
-| TDD runner | Auto-verify changes on save and trigger self-fix loops. |
+| Feature                | Description                                                                                                                                 |
+|------------------------+---------------------------------------------------------------------------------------------------------------------------------------------|
+| org-skill-long-horizon | Decompose tasks into Org-mode headline trees. Terminal states: =:done= / =:blocked= / =:stuck=. Parent summarises children. Branch pruning. |
+| org-skill-git-steward  | Status, diff, commit, push, branch. Policy enforces commit-before-modify.                                                                   |
+| TDD runner             | FiveAM on file save. =:test-failure= events. Hook into self-fix for auto-repair.                                                            |
+| Deep Emacs integration | Full org-agenda awareness. Navigate, clock time, refile, archive.                                                                           |

-*** v0.5.0+ : True Agency and Lisp Machine Emergence
-World models, visual grounding, and total internalization of the AI stack.
- **v0.5.0**: Interactive PTY Actuators & Environment Stewardship.
- **v0.7.0**: Visual Grounding & MCP Bridge.
- **v1.0.0**: SOTA Parity (Competitive with commercial agents).
- **v2.0.0**: Lisp Machine Emergence (Agent IS the OS).
+*** v0.5.0: Interactive Actuation & Environment Stewardship
+
+Interactive terminal sessions and autonomous dependency management.
+
+| Feature                  | Description                                                                                                                         |
+|--------------------------+-------------------------------------------------------------------------------------------------------------------------------------|
+| Interactive PTY Actuator | Stream long-running process output to the context window (e.g., `npm run dev`, REPLs) with async interrupt control.                 |
+| The Environment Steward  | Autonomously detect missing dependencies (e.g., "Command not found"), propose an installation command, and retry the failed action. |
+
+*** v0.6.0: Concurrency + Creator + GTD
+
+The agent bootstraps itself and manages parallel workstreams.
+
+| Feature                     | Description                                                                                                                           |
+|-----------------------------+---------------------------------------------------------------------------------------------------------------------------------------|
+| org-skill-sub-agent-manager | Lightweight Lisp-native sub-agents (via bordeaux-threads) that share memory but have isolated execution contexts for background work. |
+| org-skill-creator           | LLM drafts complete skill org-file from natural language. Mandatory: syntax validation → jail-load → test → register.                 |
+| org-skill-architect         | Scan =:STATUS: FROZEN= PRDs. Generate Phase B PROTOCOL.                                                                               |
+| org-skill-gtd               | Full GTD cycle: capture, clarify, organize, reflect, engage. org-gtd v4.0 DAG (=:TRIGGER:=, =:BLOCKER:=).                             |
+| Consensus loop              | Run multiple providers for critical decisions. Compare results, detect disagreements.                                                 |
+| Web research                | Headless Chromium via Python bridge. Text extraction, screenshots, Gemini Web UI automation.                                          |
+
+*** v0.7.0: Visual Grounding & MCP Bridge
+
+Multimodal visual interaction and ecosystem-wide tool compatibility.
+
+| Feature               | Description                                                                                                                                                |
+|-----------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| Computer Use / Vision | Allow the agent to request host OS or browser screenshots, analyze the UI, and issue precise X/Y coordinate click/type commands via an X11/Wayland bridge. |
+| MCP Gateway Bridge    | Lisp-native client for the Model Context Protocol, allowing OpenCortex to connect to the entire ecosystem of external tools and data sources.              |
+
+*** v0.8.0: The Evaluation Harness
+
+Automated benchmarking to mathematically prove the agent's reasoning capabilities.
+
+| Feature           | Description                                                                                                                                    |
+|-------------------+------------------------------------------------------------------------------------------------------------------------------------------------|
+| SWE-Bench Harness | Automated pipeline that clones repositories, feeds GitHub issues, tracks the multi-step resolution trajectory, runs tests, and scores success. |
+
+*** v1.0.0: SOTA Parity
+
+Feature-complete agent competitive with commercial agents. All features reimplemented in pure Lisp.
+
+| Area              | Status    | Notes                                     |
+|-------------------+-----------+-------------------------------------------|
+| Self-improvement  | ✅ v0.2.0 | Self-edit + lisp-repair                   |
+| Planning          | ✅ v0.4.0 | Task tree DAGs with terminal states       |
+| Tool ecosystem    | 🟡 v0.4.0 | 10+ cognitive tools                       |
+| Context window    | ✅ v0.3.0 | Semantic search + scope segmentation      |
+| Safety            | ✅ v0.1.0 | 6 Policy invariants + formal verification |
+| Multi-step tasks  | ✅ v0.4.0 | Task trees with failure handling          |
+| Code editing      | ✅ v0.2.0 | Full org-mode file read/write             |
+| Memory            | ✅ v0.2.0 | Vector recall in org-object               |
+| Emacs integration | ✅ v0.2.0 | Full org-mode control                     |
+| Autonomy          | ✅ v0.1.0 | 100% local capable (Ollama)               |
+
+*** v2.0.0: Lisp Machine Emergence
+
+From Lisp-using agent to true Lisp machine. Agent IS the Emacs process.
+
+| Feature | Description |
+|---------|-------------|
+| Lish: Lisp editor | Org-mode as IDE. Org-babel for interactive evaluation. Full REPL in TUI. No bridge needed. |
+| Lish: Shell replacement | Lisp-based shell that speaks plists. Org-mode buffers as file system. |
+
+*** v3.0.0: Neurosymbolic Maturity
+
+Deterministic planner takes the wheel. LLM relegated to semantic translation.
+
+| Feature | Description |
+|---------|-------------|
+| Deterministic planner | Pure Lisp task scheduler. No LLM needed for planning. |
+| Self-correcting gates | Gates learn from false positives (user override patterns). |
+
+*** v4.0.0: AI Stack Internalized
+
+The agent understands its own weights. No external inference.
+
+| Feature | Description |
+|---------|-------------|
+| Llama.cpp in Lisp | FFI binding. No Python subprocess. Pure Common Lisp inference. |
+| Weights as sexps | Neural weights as Lisp data structures. Homoiconic model introspection. |
+
+*** v5.0.0: True Agency
+
+World models, temporal reasoning, goal persistence across restarts.
+
+| Feature | Description |
+|---------|-------------|
+| World models | Predictive models of user behavior, project dynamics, system state. |
+| Temporal reasoning | Scheduling, deadlines, elapsed duration awareness. |
+| Goal persistence | Goals survive restarts. Long-term projects in org-objects. |