passepartout v0.7.2 (Gate Trace + HITL + Search + 11 more features): - Gate trace visualization with Ctrl+G toggle - HITL inline panels with styled collapse on approve/deny - Agent identity file + /identity command - Safe-tool read-only allowlist - Message search mode with Up/Down nav and highlights - Context budget visibility with section breakdown - Session rewind /sessions /resume /rewind - Undo/redo per operation - Context debugging /context why /context dropped - Tool hardening (timeouts, write verify, read-only cache) - Tag stack severity tiers + trigger counts - Merkle provenance audit + audit-verify - Self-help /help <topic> reads USER_MANUAL.org - Live CONFIG section in system prompts - Pads: Page Up/Down scroll by 10 lines Core 92/92 TUI Main 104/104 TUI View 29/29 Neuro 13/13
29 KiB
Competitive Analysis — AI Coding Agents & Personal AI Agent Systems
- Overview
- 1. Aider (Paul Gauthier)
- 2. Cursor Agent Mode
- 3. GitHub Copilot Agent Mode
- 4. Amazon Q Developer
- 5. Devin (Cognition AI)
- 6. Factory AI / Factory Droid
- 7. Cline (formerly Claude Dev)
- 8. RooCode
- 9. AutoGPT
- 10. Microsoft AutoGen
- 11. CrewAI
- 12. Replit Agent (Ghostwriter)
- 13. Codex CLI (OpenAI)
- 14. Continue.dev
- 15. PearAI
- 16. Melty (now Conductor)
- 17. Windsurf / Codeium (now part of Cognition AI / Devin)
- 18. Cursor AI
- 19. Augment Code
- 20. Qoder
- 21. v0 by Vercel
- 22. Lovable
- 23. Void (formerly based on Codex)
- 24. Cosine Genie
- 25. Mentat
- 26. Ghostwriter by Replit (→ see Replit Agent #12)
- 27. Poolside
- 28. Tabnine
- 29. Factory Droid (→ see #6)
- 30. Devin (→ see #5)
- Emerging / Notable Others
- Cross-Cutting Analysis
- Key Takeaways for Passepartout
Overview
This document surveys 30+ AI coding agents and personal AI agent systems across the dimensions most relevant to Passepartout: safety architecture, memory persistence, TUI/CLI interface, extensibility model, neurosymbolic or deterministic-rule components.
1. Aider (Paul Gauthier)
What it does
Git-aware AI pair programming in the terminal. Reads/writes files in your repo, auto-commits changes. Designed for interactive chat-based coding.
Architecture
- Model: pluggable (OpenAI, Anthropic, Gemini, local via Ollama/LM Studio)
- Tools: file read/write, git, lint/test execution, repo map (tree-sitter AST analysis)
- Memory: git history + chat history file (.aider.chat.history.md). No persistent memory across sessions beyond git.
- Safety: git-backed undo per edit; user must approve file additions; linting/tests auto-run
Differentiators
- Repo map: compresses entire codebase into ~1024 tokens of structured context
- Edit formats: whole-file, search/replace diff, universal diff — fallback chain
- Open source (Apache 2.0), Python, highly scriptable
- Benchmark leaderboard (SWE-bench, own editing/refactoring benchmarks)
Maturity
Production. 30k+ GitHub stars. Active development.
Relevance to Passepartout: HIGH
— Directly comparable: CLI-native, git-integrated, extensible via Python scripting — PP's .org-as-source-of-truth + tangle workflow is architecturally distinct — Aider has no neurosymbolic components or deterministic rule engine — Aider lacks memory persistence beyond git; PP's org-mode + contract-first TDD is richer
2. Cursor Agent Mode
What it does
IDE-integrated coding agent inside Cursor (VS Code fork). Agent mode can plan, read/write files, run terminal commands, and iterate autonomously.
Architecture
- Model: Claude, GPT-4o, etc. (cursor-small model for tab completion)
- Tools: file editing, terminal, @-symbols for context, MCP support, image input
- Memory: session-only; no persistent memory across sessions
- Safety: diff view for changes, user approval on terminal commands (configurable), lint monitoring
Differentiators
- .cursorrules for project-specific instructions
- Visual diff before applying changes
- Tab completion + agent mode + chat in one IDE
- MCP server integration for custom tools
Maturity
Production. Widely used.
Relevance to Passepartout: MEDIUM
— IDE-dependent (not TUI-first). PP's Emacs/terminal-native approach is different — Rule system (.cursorrules) is closest thing to deterministic rules — but plain-text prompts only — No neurosymbolic, no persistent memory, no contract-first workflow
3. GitHub Copilot Agent Mode
What it does
Microsoft/GitHub's coding agent across VS Code, GitHub.com, CLI. Agent mode (2025) can autonomously plan, edit, run commands.
Architecture
- Model: multi-model (OpenAI, Claude, Gemini, Haiku via Copilot)
- Tools: IDE edit, terminal, MCP, code review, code search, GitHub issues
- Memory: per-session; Enterprise can index org codebase for retrieval
- Safety: IP indemnity, code referencing filter, admin-managed MCP allowlists, audit logs for enterprise, opt-out for training data
Differentiators
- Deep GitHub integration (PR review, issues, Actions)
- Multi-model access from one subscription
- Enterprise governance (SSO, audit, VPC)
- Copilot CLI for terminal-only use
Maturity
Production. Largest userbase (millions).
Relevance to Passepartout: LOW
— Massive platform lock-in. No extensibility for custom workflows — No neurosymbolic. No persistent memory across sessions — Enterprise features irrelevant to PP's use case — The CLI component is closest competitor but lacks PP's TDD/contract cycle
4. Amazon Q Developer
What it does
AWS's coding assistant across IDE, CLI, and AWS console. Code completion, chat, security scanning, code transformation.
Architecture
- Model: Amazon Bedrock (multiple FMs), augmented with AWS content
- Tools: IDE extension, CLI, AWS console chat, automated code review, vulnerability scanning, code transformation (e.g., Java upgrades)
- Memory: session; no persistent cross-session memory
- Safety: AWS IAM permissions, Bedrock abuse detection, zero data retention for Business tier; no training on Enterprise data
Differentiators
- Deep AWS knowledge (VPC, EC2, Lambda, etc.)
- Automated code transformation (e.g., Java 8→17)
- Security vulnerability scanning built in
- Free tier generous
Maturity
Production.
Relevance to Passepartout: LOW
— AWS-ecosystem focused. No CLI/TUI philosophy. No extensibility. — Not a general-purpose agent; AWS-specific
5. Devin (Cognition AI)
What it does
Autonomous AI software engineer in a sandboxed environment. Plans, codes, tests, deploys end-to-end. Acquired Windsurf (Codeium).
Architecture
- Model: Claude Sonnet 4.5 (publicly), proprietary model claimed
- Tools: shell, code editor, browser, sandboxed compute environment
- Memory: per-session long-term reasoning; can recall context across steps within a task. Reports progress in real-time
- Safety: sandboxed environment, user approval on deployment, SSH key support
Differentiators
- Full autonomy (not pair programming) — can be assigned via Slack, Jira
- SWE-bench leader: 13.86% (initial), now higher
- Can learn unfamiliar technologies, train models, do Upwork tasks
- Windsurf acquisition: now owns IDE + cloud agent stack
Maturity
Production (GA Dec 2024). Backed by $21M+ from Founders Fund.
Relevance to Passepartout: MEDIUM
— Autonomous agent philosophy differs from PP's interactive TDD partner — WindSurf integration creates IDE dependency; PP is terminal-native — No neurosymbolic. No contract-first. No persistent memory (per-task only)
6. Factory AI / Factory Droid
What it does
Automated code review and bug-fixing. Runs as GitHub app on every PR. Droid bot auto-fixes issues found in review.
Architecture
- Model: Claude/GPT (likely)
- Tools: GitHub PR integration, code review, auto-fix generation
- Memory: PR-level context; no cross-PR memory
- Safety: review-before-apply; GitHub permissions
Differentiators
- PR-review focused (not general coding agent)
- Auto-fix generation as part of review workflow
- Enterprise-focused (code review automation)
Maturity
Beta/production.
Relevance to Passepartout: LOW
— Narrow scope (PR review). Not a general agent. — No TUI, no memory, no extensibility.
7. Cline (formerly Claude Dev)
What it does
Autonomous coding agent VSCode extension. Can create/edit files, run terminal commands, use browser, execute MCP tools. Human-in-the-loop for all actions.
Architecture
- Model: any (OpenRouter, Anthropic, OpenAI, Google, AWS Bedrock, local models)
- Tools: file R/W, terminal, browser (computer use), MCP servers, linter/compiler monitoring, checkpoint/restore
- Memory: session context; checkpoints as workspace snapshots; no persistent cross-session memory
- Safety: human-in-the-loop for every file change and terminal command (GUI approval); diff view; checkpoints for rollback; permission gates
Differentiators
- "Add a tool" — can ask Cline to create new MCP servers on the fly
- @url, @problems, @file, @folder context markers
- Browser computer use for interactive debugging
- Checkpoint system: compare/restore workspace snapshots
- Open source (Apache 2.0), 61k+ stars
- Enterprise: SSO, on-prem, audit trails
Maturity
Production. 61.5k GitHub stars. Rapid development.
Relevance to Passepartout: HIGH
— Closest architecture: extensible via MCP, CLI+editor integration, human-in-loop — MCP-based tool creation PP could adopt — No neurosymbolic rules engine; contracts are plain .clinerules text — Checkpoint workflow similar to PP's git-based snapshots but less structured — PP's .org source-of-truth + tangle is unique
8. RooCode
What it does
VSCode extension for multi-agent coding. Variant/fork of Cline with multiple agent "modes" (architect, ask, code, custom).
Architecture
- Model: any (same provider list as Cline)
- Tools: file editing, terminal, browser, MCP, image support
- Memory: per-session context
- Safety: human approval gates, diff view
Differentiators
- Multi-agent modes (architect plans, coder implements, ask answers)
- Custom modes with custom prompts
- Forked from Cline, similar architecture
Maturity
Production.
Relevance to Passepartout: MEDIUM
— Multi-agent orchestration is interesting but VSCode-dependent — No neurosymbolic. No persistent memory. No contract-first.
9. AutoGPT
What it does
Platform for building, deploying, and running continuous AI agents. Classic version was autonomous GPT-4 agent; now a platform with agent builder, marketplace, workflow management.
Architecture
- Model: any LLM (pluggable)
- Tools: web search, file operations, code execution, block-based workflow builder
- Memory: long-term memory via vector DB (Redis/Pinecone), persistent agent state
- Safety: Docker sandboxing, user approval gates
Differentiators
- Agent builder with visual block-based workflow
- Marketplace for pre-built agents
- Continuous/long-running agents (not session-only)
- Classic AutoGPT pioneered autonomous agent loop (think → act → observe)
Maturity
Production. 184k stars. Classic in maintenance; platform in beta/active.
Relevance to Passepartout: MEDIUM
— Long-running, persistent agents concept is relevant — Block-based workflow builder is anti-neurosymbolic (no rules engine) — Python-centric; PP is Common Lisp — No contract-first TDD workflow
10. Microsoft AutoGen
What it does
Multi-agent conversation framework from Microsoft. Agents can converse, collaborate, execute code, use tools. .NET and Python.
Architecture
- Model: any (OpenAI, etc.)
- Tools: MCP, Docker code execution, OpenAPI, web search, distributed runtimes
- Memory: conversation history; no built-in long-term memory; use extensions
- Safety: Docker sandbox for code execution; human-in-loop patterns
Differentiators
- Event-driven, distributed multi-agent architecture (gRPC runtime)
- AgentChat for conversational, Core for event-driven, Studio for GUI
- MCP tool support built-in
- .NET and Python support
- Research-grade multi-agent patterns
Maturity
Stable/Production. Backed by Microsoft.
Relevance to Passepartout: MEDIUM
— Multi-agent orchestration architecture is relevant — No TUI/CLI focus; Python/.NET — No neurosymbolic; no deterministic rules — PP could learn from AutoGen's event-driven agent patterns
11. CrewAI
What it does
Open-source framework for orchestrating autonomous AI agents as "crews" with role-based collaboration. Flows for workflow control.
Architecture
- Model: any LLM (pluggable)
- Tools: API, database, custom tools; agent roles with specific goals
- Memory: conversation-based; no built-in persistent memory across crews
- Safety: enterprise security claims; human-in-loop patterns
Differentiators
- Role-playing agents (researcher, writer, etc.)
- Flows (stateful, event-driven) + Crews (autonomous teams)
- 100k+ certified developers
- Enterprise-ready
Maturity
Production.
Relevance to Passepartout: LOW
— Python framework, not a standalone agent — No TUI/CLI; not a coding agent — Role-based agent pattern is interesting but not directly applicable
12. Replit Agent (Ghostwriter)
What it does
In-browser coding agent on Replit platform. Build, deploy apps from prompts. Full IDE in browser with AI agent.
Architecture
- Model: proprietary (likely fine-tuned LLM)
- Tools: in-browser IDE, file system, terminal, deployment, database
- Memory: project context within session
- Safety: sandboxed in-browser environment; Replit platform moderation
Differentiators
- Zero setup: browser-based, no install
- Full-stack: code + DB + deploy from one prompt
- Educational focus (used in classrooms)
- Collaborative editing
Maturity
Production.
Relevance to Passepartout: LOW
— Cloud-only, browser-based. Anti-TUI. — No extensibility. No memory persistence. — Educational/consumer focus, not power-user agent
13. Codex CLI (OpenAI)
What it does
Lightweight CLI coding agent from OpenAI. Runs locally, writes files, runs commands. Desktop app variant available.
Architecture
- Model: OpenAI models (GPT-5, o-series)
- Tools: file read/write, shell execution, sandboxed environment
- Memory: session context; conversation history per session
- Safety: user approval on file writes and commands; runs locally; sandboxed execution
Differentiators
- CLI-native (npm install -g @openai/codex)
- Desktop app (codex app) for richer UI
- Multi-platform (macOS, Linux, Windows)
- Open source (Apache 2.0), 81k stars, 6k+ commits
- "Sign in with ChatGPT" or API key
- Environment management for secrets
Maturity
Production. 81k GitHub stars. Very active.
Relevance to Passepartout: HIGH
— Direct competitor: CLI-native coding agent — Same philosophy: terminal-first, local execution — PP differentiators: .org source-of-truth, tangle workflow, neurosymbolic TDD, contract-first, deterministic rules engine — Codex has NO neurosymbolic component, NO contracts, NO persistent memory beyond git, NO rule engine
14. Continue.dev
What it does
Open-source AI code assistant for IDE. Chat, edit, tab-completion. Now pivoted to Continuous AI — AI checks on PRs (source-controlled checks).
Architecture
- Model: any (OpenAI, Anthropic, Ollama, etc.)
- Tools: IDE chat, file editing, @-references, PR checks
- Memory: session-based
- Safety: local models possible, diff-based editing
Differentiators
- Fully open-source IDE assistant
- "Checks" — source-controlled AI reviews as markdown files in repo
- Multiple model providers
- VS Code + JetBrains
Maturity
Production. Renamed to Continuous AI for PR-check product.
Relevance to Passepartout: LOW
— IDE-dependent. PR-check focus is different from PP's build-time agent — "Checks as markdown" concept is closest to PP's .org-based contracts — but far less structured. PP's contracts are machine-verifiable, not just prompts
15. PearAI
What it does
AI code editor (VS Code fork) with integrated coding agent + chat. Open-source, Bun-based performance.
Architecture
- Model: any (OpenAI, Anthropic, Ollama)
- Tools: IDE agent, chat, file editing, context management
- Memory: session-based
- Safety: open source, local model support
Differentiators
- VS Code fork (not extension)
- Bun for performance
- Free, open source
- "Context" management for prompt optimization
Maturity
Beta/Production.
Relevance to Passepartout: LOW
— IDE-dependent fork. PP's Emacs + TUI is philosophically opposite. — No unique architecture features.
16. Melty (now Conductor)
What it does
Originally Melty, now Conductor — orchestrator for running multiple coding agents (Claude Code, Codex) in parallel on your Mac. Each agent gets an isolated git worktree.
Architecture
- Model: uses Codex + Claude Code under the hood
- Tools: git worktree management, parallel agent execution, review UI
- Memory: per-task git worktree; no cross-session memory
- Safety: git isolation; user reviews changes before merging Differentiators
- Multi-agent parallelism (not multi-agent collaboration)
- Git worktree-based isolation
- Dashboard for monitoring agents
Maturity
Production (Beta/2025). Used at Linear, Vercel, Notion, Ramp.
Relevance to Passepartout: MEDIUM
— Parallel agent orchestration model is interesting — Doesn't replace PP's workflow; could complement — No neurosymbolic, no rules engine, no memory persistence
17. Windsurf / Codeium (now part of Cognition AI / Devin)
What it does
AI-native IDE. Cascade agent for autonomous coding. Tab completion, agent mode, MCP support. Acquired by Cognition (Devin).
Architecture
- Model: multi-model (GPT-5, Claude, custom)
- Tools: Cascade (agent), Tab (completions), MCP, JetBrains plugin, Devin integration, Spaces (bundled context)
- Memory: Cascade sessions within workspace; Spaces for grouped context
- Safety: admin-managed MCP servers; enterprise controls
Differentiators
- Cascade: local agent for real-time assistance
- Devin integration: cloud agent for long-running tasks
- Spaces: bundle agent sessions, PRs, files around a task
- Agent Command Center: Kanban dashboard for agents
- JetBrains plugin (targets non-VS Code users)
Maturity
Production. 1M+ users, 4k+ enterprise customers.
Relevance to Passepartout: MEDIUM
— IDE-dependent (VS Code fork + JetBrains plugin) — Cascade + Devin hybrid local/cloud model is architecturally interesting — No neurosymbolic. No deterministic rules. No contract-first. — Spaces concept (grouping context around a task) is close to PP's session management
18. Cursor AI
What it does
AI-first code editor (VS Code fork). Multi-model, agent mode, tab completion, MCP support. The most popular AI IDE.
Architecture
- Model: proprietary (cursor-small) + OpenAI, Anthropic, Gemini
- Tools: agent mode, tab completion, chat, @-symbols, MCP, terminal
- Memory: session; no cross-session persistence
- Safety: diff view, configurable permission levels, image input support
Differentiators
- First-mover in AI IDEs (fork vs extension approach)
- .cursorrules for project conventions
- Fast tab completion (custom small model)
- @-symbol context system (files, docs, web)
Maturity
Production. Most popular AI IDE.
Relevance to Passepartout: LOW
— IDE-dependent. PP is philosophy of terminal + .org + Emacs — No neurosymbolic, no persistent memory, no contract-first
19. Augment Code
What it does
AI coding platform with deep codebase understanding. Agent, chat, CLI (Auggie), context engine that indexes entire codebase.
Architecture
- Model: proprietary + multi-model
- Tools: agent, code completions (sunset soon), CLI (Auggie), IDE extensions
- Memory: codebase index (persistent), session context
- Safety: enterprise SSO, permissions
Differentiators
- Auggie CLI — terminal-first agent (closest to PP)
- Codebase-wide context engine (indexes entire repo, not just open files)
- Agent can tackle large tasks autonomously
Maturity
Production. Well-funded.
Relevance to Passepartout: HIGH
— Auggie CLI is directly comparable: terminal-native coding agent — Full codebase indexing is better than PP's current approach — No neurosymbolic. No contract-first. No org-mode source-of-truth. — PP's TDD + contract + tangle workflow is unique differentiator
20. Qoder
What it does
Coding agent platform. Details limited.
Architecture
Unknown — website unreachable.
Maturity
Unknown.
Relevance to Passepartout: UNKNOWN
— Insufficient data.
21. v0 by Vercel
What it does
UI generation agent. Generates React/Next.js components and pages from text prompts. Visual design oriented.
Architecture
- Model: proprietary (likely fine-tuned)
- Tools: code generation, visual design mode, templates, deployment to Vercel
- Memory: per-session; design context within chat
- Safety: Vercel platform controls
Differentiators
- Visual-first: generates UI, not general code
- Design mode: fine-tune with visual controls
- Template library
- iOS app for mobile building
- Deep Vercel/Next.js integration
Maturity
Production.
Relevance to Passepartout: LOW
— Narrow domain (UI generation). Not a general coding agent. — Cloud-only. No TUI. — No relevance to PP's workflow.
22. Lovable
What it does
Full-stack application generation from natural language prompts. Build and deploy apps/websites via AI chat.
Architecture
- Model: proprietary (likely fine-tuned)
- Tools: app generation, deployment, domain registration, mobile app
- Memory: per-project conversation
- Safety: platform-level moderation
Differentiators
- Full-stack: frontend + backend + DB + deploy
- No-code-friendly (describe app → get working app)
- Template library, mobile companion app
- Enterprise security claims
Maturity
Production.
Relevance to Passepartout: LOW
— Consumer/no-code focus. Not a developer coding agent. — No TUI, no extensibility, no memory persistence beyond project.
23. Void (formerly based on Codex)
What it does
CLI coding agent for terminal-based AI code generation. Originally built on OpenAI Codex.
Architecture
- Model: OpenAI API-compatible
- Tools: terminal code generation, file writing
- Memory: session context
- Safety: user approval on file changes
Differentiators
- CLI-native (similar to Codex CLI)
- Lightweight
Maturity
Beta/early.
Relevance to Passepartout: MEDIUM
— CLI-native coding agent, directly comparable to PP — Less mature than Codex CLI or Aider — No unique differentiators
24. Cosine Genie
What it does
Autonomous AI software engineer. Takes Jira tickets or PR descriptions, breaks them down, writes code, delivers PRs. Works asynchronously.
Architecture
- Model: Genie 2 (proprietary model); 72% on SWE-Lancer
- Tools: IDE/CLI, Slack, Jira, Linear, GitHub integration
- Memory: task-level context; works asynchronously in background
- Safety: sandboxed; desktop app runs locally; cloud service
Differentiators
- Proprietary model (Genie 2) — not just wrapping an API
- SWE-Lancer leader: 72% pass rate (highest)
- Asynchronous: works without active session
- Slack/Jira/Linear integration for task intake
- CLI + Desktop app + Cloud
Maturity
Production. Well-funded.
Relevance to Passepartout: HIGH
— Direct competitor: agent that takes tickets and delivers code — CLI-native operation — No contract-first, no neurosymbolic, no org-mode — PP's structured TDD cycle + deterministic rules are key differentiators — Cosine's Slack/Jira integration interesting for PP to consider
25. Mentat
What it does
CLI coding assistant that coordinates edits across multiple files. Project context understanding.
Architecture
- Model: any LLM (OpenAI, Anthropic)
- Tools: file editing across multiple files, project context gathering
- Memory: session context, git awareness
- Safety: user approval
Differentiators
- Multi-file editing focus
- Project-level understanding
Maturity
Currently inactive/archived. GitHub 404.
Relevance to Passepartout: LOW
— Inactive project. No meaningful differentiation from Aider/Codex CLI.
26. Ghostwriter by Replit (→ see Replit Agent #12)
27. Poolside
What it does
Foundation models for software engineering. Builds models + agents for enterprise. On-prem deployment, air-gapped. TUI, IDE extensions, agents.
Architecture
- Model: Laguna XS.2 M.1 (proprietary foundation model)
- Tools: agents, TUI, IDE extensions, multi-agent orchestration
- Memory: enterprise context (connectors to repos, DBs, private data)
- Safety: on-prem/VPC/air-gapped, RBAC for humans and agents, audit trails, executive governance, no data leaves customer boundary
Differentiators
- Full-stack: builds foundation models + agents + enterprise deployment
- Forward Deployed Research Engineers embedded with customers
- Outcome ownership (not just model handoff)
- AGI thesis: software engineering as path to AGI
- Military/defense-grade security (not just compliance)
Maturity
Production. Frontier lab status. $500M+ funding.
Relevance to Passepartout: LOW
— Enterprise/military focus, completely different market — Building own models (PP uses existing LLMs) — TUI is just one surface among many — PP's individual-developer, open-source, TDD-first philosophy is opposite
28. Tabnine
What it does
Enterprise AI code completion + agent platform. Code completions, chat, agents, CLI, context engine. Gartner Visionary 2025.
Architecture
- Model: multiple (code-specific small models + large model access)
- Tools: code completion, chat, CLI, agents (planning, coding, testing, docs), context engine, provenance/attribution
- Memory: Enterprise Context Engine (indexes org codebase, architecture, standards)
- Safety: on-prem/air-gapped/VPC, zero data retention, IP indemnity, provenance tracking, admin controls, audit logs
Differentiators
- Enterprise Context Engine: organizational intelligence layer for any agent
- Fine-tuning on private repos
- Provenance & attribution for IP compliance
- Gartner Visionary; Leader in Omdia Universe
- Zero Trust compliance (air-gapped)
Maturity
Production. 15+ years in market (originally Codota). Millions of developers.
Relevance to Passepartout: LOW
— Enterprise-focused (compliance, IP, governance). PP is individual-agent. — Context Engine concept is interesting but proprietary — No neurosymbolic. No contract-first. — CLI is secondary to IDE completions for Tabnine
29. Factory Droid (→ see #6)
30. Devin (→ see #5)
Emerging / Notable Others
Conductor (Melty) — see #16
Cline — see #7
RooCode — see #8
Augment Code CLI (Auggie) — see #19
Cosine Genie — see #24
Cross-Cutting Analysis
Safety / Security Architecture
| System | Sandbox | Human-in-Loop | Diff Review | Gov/Enterprise | Notes |
|---|---|---|---|---|---|
| Aider | Git undo | File approval | Git diff | No | Git as safety net |
| Cline | None | Every op | Diff view | SSO, on-prem | Best HIL in class |
| Codex CLI | Sandboxed exec | On write/command | Console output | No | Basic |
| Devin | Sandboxed env | Deployment gate | PR review | Enterprise plan | Cloud sandbox |
| Cursor | None | Configurable | Diff view | No | .cursorrules |
| Copilot | None | Configurable | Diff view | SSO, audit, MCP allowlist | Best enterprise |
| Cosine | None | PR review | PR review | Cloud + on-prem | Async operation |
| Tabnine | None | Configurable | Diff view | Air-gapped, on-prem, audit | Best air-gapped |
| Poolside | Deploy boundary | Enterprise governance | Platform | Air-gapped, defense | Most secure by design |
Memory Persistence
Nearly ALL systems have session-only memory. Exceptions:
- Tabnine: Enterprise Context Engine (persistent codebase index)
- Devin/Windsurf: Spaces (bundled context across sessions)
- Poolside: enterprise connectors to repos+DBs
- AutoGPT: vector DB persistent memory
Passepartout's approach: git + .org files as source of truth is unique and powerful — no other system uses literate programming as memory.
TUI / CLI Interface
CLI-native systems: Aider, Codex CLI, Cosine Genie, Void, Augment CLI IDE-first: Cursor, Windsurf, Copilot, Cline, Continue, PearAI Both: GitHub Copilot CLI, Poolside TUI
PP is CLI + Emacs. Closest in philosophy: Aider, Codex CLI, Cosine Genie.
Extensibility Model
- MCP (Model Context Protocol): Cline, Cursor, Windsurf, Copilot, AutoGen
- Custom prompts/rules: .cursorrules, .clinerules, AGENTS.md, CLAUDE.md
- Python scripting: Aider
- Agent creation: AutoGPT (block builder), CrewAI (role-based)
- MCP server creation: Cline ("add a tool" via LLM)
PP's extensibility: Common Lisp macros + skill system. Unique: hot-reloadable skills, self-repair capability. No other system has this.
Neurosymbolic / Deterministic Rule Components
NONE of the surveyed systems have a neurosymbolic architecture or deterministic rule engine. The closest approximations:
- .cursorrules / .clinerules / AGENTS.md / CLAUDE.md: plain-text instructions to the LLM (zero enforcement)
- Cline's MCP tools: deterministic tool execution but no rule reasoning
- Tabnine's Provenance: deterministic code matching but not rules
- AutoGen's event-driven core: deterministic workflow but not rule-based reasoning
Passepartout's contract-first TDD (machine-verifiable contracts + Fiveam tests) is architecturally unique. No competitor has anything like it.
Key Takeaways for Passepartout
- NO competitor has neurosymbolic architecture or deterministic rule enforcement. This is PP's strongest differentiator.
- NO competitor uses literate programming (.org as source of truth) or org-babel tangle workflow. This is PP's second strongest differentiator.
- NO competitor has hot-reloadable, self-repairable skills. PP's skill system (Lisp macros + fboundp guards) is unique.
- Memory persistence is universally weak. PP's git + .org approach is arguably more robust than any competitor's session-only model.
- CLI-native agent space is growing: Codex CLI, Aider, Cosine Genie, Auggie. PP must match or exceed their terminal UX quality.
- MCP is becoming the universal extensibility standard. PP should support MCP.
- Async/background operation (Cosine Genie, Devin) is a growing expectation. PP's REPL-based daemon architecture is well-positioned for this.
- Enterprise features (SSO, on-prem, audit) are table stakes for enterprise but irrelevant for PP's individual-agent use case.
- Multi-agent orchestration (AutoGen, CrewAI, Conductor) is a separate concern. PP should focus on single-agent excellence first.
- The "contract-first TDD" workflow from .org → write test → watch fail → implement → watch pass → tangle is UNIQUE in the entire competitive landscape.