amr/memex

Files

Amr Gharbeia 4e9431ec1d memex: update passepartout submodule → v0.7.2, add notes

passepartout v0.7.2 (Gate Trace + HITL + Search + 11 more features):
- Gate trace visualization with Ctrl+G toggle
- HITL inline panels with styled collapse on approve/deny
- Agent identity file + /identity command
- Safe-tool read-only allowlist
- Message search mode with Up/Down nav and highlights
- Context budget visibility with section breakdown
- Session rewind /sessions /resume /rewind
- Undo/redo per operation
- Context debugging /context why /context dropped
- Tool hardening (timeouts, write verify, read-only cache)
- Tag stack severity tiers + trigger counts
- Merkle provenance audit + audit-verify
- Self-help /help <topic> reads USER_MANUAL.org
- Live CONFIG section in system prompts
- Pads: Page Up/Down scroll by 10 lines

Core 92/92  TUI Main 104/104  TUI View 29/29  Neuro 13/13

2026-05-08 21:56:11 -04:00

29 KiB

Raw Blame History

Competitive Analysis — AI Coding Agents & Personal AI Agent Systems

Overview
1. Aider (Paul Gauthier)
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: HIGH
2. Cursor Agent Mode
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: MEDIUM
3. GitHub Copilot Agent Mode
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
4. Amazon Q Developer
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
5. Devin (Cognition AI)
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: MEDIUM
6. Factory AI / Factory Droid
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
7. Cline (formerly Claude Dev)
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: HIGH
8. RooCode
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: MEDIUM
9. AutoGPT
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: MEDIUM
10. Microsoft AutoGen
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: MEDIUM
11. CrewAI
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
12. Replit Agent (Ghostwriter)
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
13. Codex CLI (OpenAI)
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: HIGH
14. Continue.dev
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
15. PearAI
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
16. Melty (now Conductor)
- What it does
- Architecture
- Maturity
- Relevance to Passepartout: MEDIUM
17. Windsurf / Codeium (now part of Cognition AI / Devin)
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: MEDIUM
18. Cursor AI
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
19. Augment Code
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: HIGH
20. Qoder
- What it does
- Architecture
- Maturity
- Relevance to Passepartout: UNKNOWN
21. v0 by Vercel
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
22. Lovable
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
23. Void (formerly based on Codex)
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: MEDIUM
24. Cosine Genie
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: HIGH
25. Mentat
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
26. Ghostwriter by Replit (→ see Replit Agent #12)
27. Poolside
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
28. Tabnine
- What it does
- Architecture
- Differentiators
- Maturity
- Relevance to Passepartout: LOW
29. Factory Droid (→ see #6)
30. Devin (→ see #5)
Emerging / Notable Others
- Conductor (Melty) — see #16
- Cline — see #7
- RooCode — see #8
- Augment Code CLI (Auggie) — see #19
- Cosine Genie — see #24
Cross-Cutting Analysis
- Safety / Security Architecture
- Memory Persistence
- TUI / CLI Interface
- Extensibility Model
- Neurosymbolic / Deterministic Rule Components
Key Takeaways for Passepartout

Overview

This document surveys 30+ AI coding agents and personal AI agent systems across the dimensions most relevant to Passepartout: safety architecture, memory persistence, TUI/CLI interface, extensibility model, neurosymbolic or deterministic-rule components.

1. Aider (Paul Gauthier)

What it does

Git-aware AI pair programming in the terminal. Reads/writes files in your repo, auto-commits changes. Designed for interactive chat-based coding.

Architecture

Model: pluggable (OpenAI, Anthropic, Gemini, local via Ollama/LM Studio)
Tools: file read/write, git, lint/test execution, repo map (tree-sitter AST analysis)
Memory: git history + chat history file (.aider.chat.history.md). No persistent memory across sessions beyond git.
Safety: git-backed undo per edit; user must approve file additions; linting/tests auto-run

Differentiators

Repo map: compresses entire codebase into ~1024 tokens of structured context
Edit formats: whole-file, search/replace diff, universal diff — fallback chain
Open source (Apache 2.0), Python, highly scriptable
Benchmark leaderboard (SWE-bench, own editing/refactoring benchmarks)

Maturity

Production. 30k+ GitHub stars. Active development.

Relevance to Passepartout: HIGH

— Directly comparable: CLI-native, git-integrated, extensible via Python scripting — PP's .org-as-source-of-truth + tangle workflow is architecturally distinct — Aider has no neurosymbolic components or deterministic rule engine — Aider lacks memory persistence beyond git; PP's org-mode + contract-first TDD is richer

2. Cursor Agent Mode

What it does

IDE-integrated coding agent inside Cursor (VS Code fork). Agent mode can plan, read/write files, run terminal commands, and iterate autonomously.

Architecture

Model: Claude, GPT-4o, etc. (cursor-small model for tab completion)
Tools: file editing, terminal, @-symbols for context, MCP support, image input
Memory: session-only; no persistent memory across sessions
Safety: diff view for changes, user approval on terminal commands (configurable), lint monitoring

Differentiators

.cursorrules for project-specific instructions
Visual diff before applying changes
Tab completion + agent mode + chat in one IDE
MCP server integration for custom tools

Maturity

Production. Widely used.

Relevance to Passepartout: MEDIUM

— IDE-dependent (not TUI-first). PP's Emacs/terminal-native approach is different — Rule system (.cursorrules) is closest thing to deterministic rules — but plain-text prompts only — No neurosymbolic, no persistent memory, no contract-first workflow

3. GitHub Copilot Agent Mode

What it does

Microsoft/GitHub's coding agent across VS Code, GitHub.com, CLI. Agent mode (2025) can autonomously plan, edit, run commands.

Architecture

Model: multi-model (OpenAI, Claude, Gemini, Haiku via Copilot)
Tools: IDE edit, terminal, MCP, code review, code search, GitHub issues
Memory: per-session; Enterprise can index org codebase for retrieval
Safety: IP indemnity, code referencing filter, admin-managed MCP allowlists, audit logs for enterprise, opt-out for training data

Differentiators

Deep GitHub integration (PR review, issues, Actions)
Multi-model access from one subscription
Enterprise governance (SSO, audit, VPC)
Copilot CLI for terminal-only use

Maturity

Production. Largest userbase (millions).

Relevance to Passepartout: LOW

— Massive platform lock-in. No extensibility for custom workflows — No neurosymbolic. No persistent memory across sessions — Enterprise features irrelevant to PP's use case — The CLI component is closest competitor but lacks PP's TDD/contract cycle

4. Amazon Q Developer

What it does

AWS's coding assistant across IDE, CLI, and AWS console. Code completion, chat, security scanning, code transformation.

Architecture

Model: Amazon Bedrock (multiple FMs), augmented with AWS content
Tools: IDE extension, CLI, AWS console chat, automated code review, vulnerability scanning, code transformation (e.g., Java upgrades)
Memory: session; no persistent cross-session memory
Safety: AWS IAM permissions, Bedrock abuse detection, zero data retention for Business tier; no training on Enterprise data

Differentiators

Deep AWS knowledge (VPC, EC2, Lambda, etc.)
Automated code transformation (e.g., Java 8→17)
Security vulnerability scanning built in
Free tier generous

Maturity

Production.

Relevance to Passepartout: LOW

— AWS-ecosystem focused. No CLI/TUI philosophy. No extensibility. — Not a general-purpose agent; AWS-specific

5. Devin (Cognition AI)

What it does

Autonomous AI software engineer in a sandboxed environment. Plans, codes, tests, deploys end-to-end. Acquired Windsurf (Codeium).

Architecture

Model: Claude Sonnet 4.5 (publicly), proprietary model claimed
Tools: shell, code editor, browser, sandboxed compute environment
Memory: per-session long-term reasoning; can recall context across steps within a task. Reports progress in real-time
Safety: sandboxed environment, user approval on deployment, SSH key support

Differentiators

Full autonomy (not pair programming) — can be assigned via Slack, Jira
SWE-bench leader: 13.86% (initial), now higher
Can learn unfamiliar technologies, train models, do Upwork tasks
Windsurf acquisition: now owns IDE + cloud agent stack

Maturity

Production (GA Dec 2024). Backed by $21M+ from Founders Fund.

Relevance to Passepartout: MEDIUM

— Autonomous agent philosophy differs from PP's interactive TDD partner — WindSurf integration creates IDE dependency; PP is terminal-native — No neurosymbolic. No contract-first. No persistent memory (per-task only)

6. Factory AI / Factory Droid

What it does

Automated code review and bug-fixing. Runs as GitHub app on every PR. Droid bot auto-fixes issues found in review.

Architecture

Model: Claude/GPT (likely)
Tools: GitHub PR integration, code review, auto-fix generation
Memory: PR-level context; no cross-PR memory
Safety: review-before-apply; GitHub permissions

Differentiators

PR-review focused (not general coding agent)
Auto-fix generation as part of review workflow
Enterprise-focused (code review automation)

Maturity

Beta/production.

Relevance to Passepartout: LOW

— Narrow scope (PR review). Not a general agent. — No TUI, no memory, no extensibility.

7. Cline (formerly Claude Dev)

What it does

Autonomous coding agent VSCode extension. Can create/edit files, run terminal commands, use browser, execute MCP tools. Human-in-the-loop for all actions.

Architecture

Model: any (OpenRouter, Anthropic, OpenAI, Google, AWS Bedrock, local models)
Tools: file R/W, terminal, browser (computer use), MCP servers, linter/compiler monitoring, checkpoint/restore
Memory: session context; checkpoints as workspace snapshots; no persistent cross-session memory
Safety: human-in-the-loop for every file change and terminal command (GUI approval); diff view; checkpoints for rollback; permission gates

Differentiators

"Add a tool" — can ask Cline to create new MCP servers on the fly
@url, @problems, @file, @folder context markers
Browser computer use for interactive debugging
Checkpoint system: compare/restore workspace snapshots
Open source (Apache 2.0), 61k+ stars
Enterprise: SSO, on-prem, audit trails

Maturity

Production. 61.5k GitHub stars. Rapid development.

Relevance to Passepartout: HIGH

— Closest architecture: extensible via MCP, CLI+editor integration, human-in-loop — MCP-based tool creation PP could adopt — No neurosymbolic rules engine; contracts are plain .clinerules text — Checkpoint workflow similar to PP's git-based snapshots but less structured — PP's .org source-of-truth + tangle is unique

8. RooCode

What it does

VSCode extension for multi-agent coding. Variant/fork of Cline with multiple agent "modes" (architect, ask, code, custom).

Architecture

Model: any (same provider list as Cline)
Tools: file editing, terminal, browser, MCP, image support
Memory: per-session context
Safety: human approval gates, diff view

Differentiators

Multi-agent modes (architect plans, coder implements, ask answers)
Custom modes with custom prompts
Forked from Cline, similar architecture

Maturity

Production.

Relevance to Passepartout: MEDIUM

— Multi-agent orchestration is interesting but VSCode-dependent — No neurosymbolic. No persistent memory. No contract-first.

9. AutoGPT

What it does

Platform for building, deploying, and running continuous AI agents. Classic version was autonomous GPT-4 agent; now a platform with agent builder, marketplace, workflow management.

Architecture

Model: any LLM (pluggable)
Tools: web search, file operations, code execution, block-based workflow builder
Memory: long-term memory via vector DB (Redis/Pinecone), persistent agent state
Safety: Docker sandboxing, user approval gates

Differentiators

Agent builder with visual block-based workflow
Marketplace for pre-built agents
Continuous/long-running agents (not session-only)
Classic AutoGPT pioneered autonomous agent loop (think → act → observe)

Maturity

Production. 184k stars. Classic in maintenance; platform in beta/active.

Relevance to Passepartout: MEDIUM

— Long-running, persistent agents concept is relevant — Block-based workflow builder is anti-neurosymbolic (no rules engine) — Python-centric; PP is Common Lisp — No contract-first TDD workflow

10. Microsoft AutoGen

What it does

Multi-agent conversation framework from Microsoft. Agents can converse, collaborate, execute code, use tools. .NET and Python.

Architecture

Model: any (OpenAI, etc.)
Tools: MCP, Docker code execution, OpenAPI, web search, distributed runtimes
Memory: conversation history; no built-in long-term memory; use extensions
Safety: Docker sandbox for code execution; human-in-loop patterns

Differentiators

Event-driven, distributed multi-agent architecture (gRPC runtime)
AgentChat for conversational, Core for event-driven, Studio for GUI
MCP tool support built-in
.NET and Python support
Research-grade multi-agent patterns

Maturity

Stable/Production. Backed by Microsoft.

Relevance to Passepartout: MEDIUM

— Multi-agent orchestration architecture is relevant — No TUI/CLI focus; Python/.NET — No neurosymbolic; no deterministic rules — PP could learn from AutoGen's event-driven agent patterns

11. CrewAI

What it does

Open-source framework for orchestrating autonomous AI agents as "crews" with role-based collaboration. Flows for workflow control.

Architecture

Model: any LLM (pluggable)
Tools: API, database, custom tools; agent roles with specific goals
Memory: conversation-based; no built-in persistent memory across crews
Safety: enterprise security claims; human-in-loop patterns

Differentiators

Role-playing agents (researcher, writer, etc.)
Flows (stateful, event-driven) + Crews (autonomous teams)
100k+ certified developers
Enterprise-ready

Maturity

Production.

Relevance to Passepartout: LOW

— Python framework, not a standalone agent — No TUI/CLI; not a coding agent — Role-based agent pattern is interesting but not directly applicable

12. Replit Agent (Ghostwriter)

What it does

In-browser coding agent on Replit platform. Build, deploy apps from prompts. Full IDE in browser with AI agent.

Architecture

Model: proprietary (likely fine-tuned LLM)
Tools: in-browser IDE, file system, terminal, deployment, database
Memory: project context within session
Safety: sandboxed in-browser environment; Replit platform moderation

Differentiators

Zero setup: browser-based, no install
Full-stack: code + DB + deploy from one prompt
Educational focus (used in classrooms)
Collaborative editing

Maturity

Production.

Relevance to Passepartout: LOW

— Cloud-only, browser-based. Anti-TUI. — No extensibility. No memory persistence. — Educational/consumer focus, not power-user agent

13. Codex CLI (OpenAI)

What it does

Lightweight CLI coding agent from OpenAI. Runs locally, writes files, runs commands. Desktop app variant available.

Architecture

Model: OpenAI models (GPT-5, o-series)
Tools: file read/write, shell execution, sandboxed environment
Memory: session context; conversation history per session
Safety: user approval on file writes and commands; runs locally; sandboxed execution

Differentiators

CLI-native (npm install -g @openai/codex)
Desktop app (codex app) for richer UI
Multi-platform (macOS, Linux, Windows)
Open source (Apache 2.0), 81k stars, 6k+ commits
"Sign in with ChatGPT" or API key
Environment management for secrets

Maturity

Production. 81k GitHub stars. Very active.

Relevance to Passepartout: HIGH

— Direct competitor: CLI-native coding agent — Same philosophy: terminal-first, local execution — PP differentiators: .org source-of-truth, tangle workflow, neurosymbolic TDD, contract-first, deterministic rules engine — Codex has NO neurosymbolic component, NO contracts, NO persistent memory beyond git, NO rule engine

14. Continue.dev

What it does

Open-source AI code assistant for IDE. Chat, edit, tab-completion. Now pivoted to Continuous AI — AI checks on PRs (source-controlled checks).

Architecture

Model: any (OpenAI, Anthropic, Ollama, etc.)
Tools: IDE chat, file editing, @-references, PR checks
Memory: session-based
Safety: local models possible, diff-based editing

Differentiators

Fully open-source IDE assistant
"Checks" — source-controlled AI reviews as markdown files in repo
Multiple model providers
VS Code + JetBrains

Maturity

Production. Renamed to Continuous AI for PR-check product.

Relevance to Passepartout: LOW

— IDE-dependent. PR-check focus is different from PP's build-time agent — "Checks as markdown" concept is closest to PP's .org-based contracts — but far less structured. PP's contracts are machine-verifiable, not just prompts

15. PearAI

What it does

AI code editor (VS Code fork) with integrated coding agent + chat. Open-source, Bun-based performance.

Architecture

Model: any (OpenAI, Anthropic, Ollama)
Tools: IDE agent, chat, file editing, context management
Memory: session-based
Safety: open source, local model support

Differentiators

VS Code fork (not extension)
Bun for performance
Free, open source
"Context" management for prompt optimization

Maturity

Beta/Production.

Relevance to Passepartout: LOW

— IDE-dependent fork. PP's Emacs + TUI is philosophically opposite. — No unique architecture features.

16. Melty (now Conductor)

What it does

Originally Melty, now Conductor — orchestrator for running multiple coding agents (Claude Code, Codex) in parallel on your Mac. Each agent gets an isolated git worktree.

Architecture

Model: uses Codex + Claude Code under the hood
Tools: git worktree management, parallel agent execution, review UI
Memory: per-task git worktree; no cross-session memory
Safety: git isolation; user reviews changes before merging Differentiators
Multi-agent parallelism (not multi-agent collaboration)
Git worktree-based isolation
Dashboard for monitoring agents

Maturity

Production (Beta/2025). Used at Linear, Vercel, Notion, Ramp.

Relevance to Passepartout: MEDIUM

— Parallel agent orchestration model is interesting — Doesn't replace PP's workflow; could complement — No neurosymbolic, no rules engine, no memory persistence

17. Windsurf / Codeium (now part of Cognition AI / Devin)

What it does

AI-native IDE. Cascade agent for autonomous coding. Tab completion, agent mode, MCP support. Acquired by Cognition (Devin).

Architecture

Model: multi-model (GPT-5, Claude, custom)
Tools: Cascade (agent), Tab (completions), MCP, JetBrains plugin, Devin integration, Spaces (bundled context)
Memory: Cascade sessions within workspace; Spaces for grouped context
Safety: admin-managed MCP servers; enterprise controls

Differentiators

Cascade: local agent for real-time assistance
Devin integration: cloud agent for long-running tasks
Spaces: bundle agent sessions, PRs, files around a task
Agent Command Center: Kanban dashboard for agents
JetBrains plugin (targets non-VS Code users)

Maturity

Production. 1M+ users, 4k+ enterprise customers.

Relevance to Passepartout: MEDIUM

— IDE-dependent (VS Code fork + JetBrains plugin) — Cascade + Devin hybrid local/cloud model is architecturally interesting — No neurosymbolic. No deterministic rules. No contract-first. — Spaces concept (grouping context around a task) is close to PP's session management

18. Cursor AI

What it does

AI-first code editor (VS Code fork). Multi-model, agent mode, tab completion, MCP support. The most popular AI IDE.

Architecture

Model: proprietary (cursor-small) + OpenAI, Anthropic, Gemini
Tools: agent mode, tab completion, chat, @-symbols, MCP, terminal
Memory: session; no cross-session persistence
Safety: diff view, configurable permission levels, image input support

Differentiators

First-mover in AI IDEs (fork vs extension approach)
.cursorrules for project conventions
Fast tab completion (custom small model)
@-symbol context system (files, docs, web)

Maturity

Production. Most popular AI IDE.

Relevance to Passepartout: LOW

— IDE-dependent. PP is philosophy of terminal + .org + Emacs — No neurosymbolic, no persistent memory, no contract-first

19. Augment Code

What it does

AI coding platform with deep codebase understanding. Agent, chat, CLI (Auggie), context engine that indexes entire codebase.

Architecture

Model: proprietary + multi-model
Tools: agent, code completions (sunset soon), CLI (Auggie), IDE extensions
Memory: codebase index (persistent), session context
Safety: enterprise SSO, permissions

Differentiators

Auggie CLI — terminal-first agent (closest to PP)
Codebase-wide context engine (indexes entire repo, not just open files)
Agent can tackle large tasks autonomously

Maturity

Production. Well-funded.

Relevance to Passepartout: HIGH

— Auggie CLI is directly comparable: terminal-native coding agent — Full codebase indexing is better than PP's current approach — No neurosymbolic. No contract-first. No org-mode source-of-truth. — PP's TDD + contract + tangle workflow is unique differentiator

20. Qoder

What it does

Coding agent platform. Details limited.

Architecture

Unknown — website unreachable.

Maturity

Unknown.

Relevance to Passepartout: UNKNOWN

— Insufficient data.

21. v0 by Vercel

What it does

UI generation agent. Generates React/Next.js components and pages from text prompts. Visual design oriented.

Architecture

Model: proprietary (likely fine-tuned)
Tools: code generation, visual design mode, templates, deployment to Vercel
Memory: per-session; design context within chat
Safety: Vercel platform controls

Differentiators

Visual-first: generates UI, not general code
Design mode: fine-tune with visual controls
Template library
iOS app for mobile building
Deep Vercel/Next.js integration

Maturity

Production.

Relevance to Passepartout: LOW

— Narrow domain (UI generation). Not a general coding agent. — Cloud-only. No TUI. — No relevance to PP's workflow.

22. Lovable

What it does

Full-stack application generation from natural language prompts. Build and deploy apps/websites via AI chat.

Architecture

Model: proprietary (likely fine-tuned)
Tools: app generation, deployment, domain registration, mobile app
Memory: per-project conversation
Safety: platform-level moderation

Differentiators

Full-stack: frontend + backend + DB + deploy
No-code-friendly (describe app → get working app)
Template library, mobile companion app
Enterprise security claims

Maturity

Production.

Relevance to Passepartout: LOW

— Consumer/no-code focus. Not a developer coding agent. — No TUI, no extensibility, no memory persistence beyond project.

23. Void (formerly based on Codex)

What it does

CLI coding agent for terminal-based AI code generation. Originally built on OpenAI Codex.

Architecture

Model: OpenAI API-compatible
Tools: terminal code generation, file writing
Memory: session context
Safety: user approval on file changes

Differentiators

CLI-native (similar to Codex CLI)
Lightweight

Maturity

Beta/early.

Relevance to Passepartout: MEDIUM

— CLI-native coding agent, directly comparable to PP — Less mature than Codex CLI or Aider — No unique differentiators

24. Cosine Genie

What it does

Autonomous AI software engineer. Takes Jira tickets or PR descriptions, breaks them down, writes code, delivers PRs. Works asynchronously.

Architecture

Model: Genie 2 (proprietary model); 72% on SWE-Lancer
Tools: IDE/CLI, Slack, Jira, Linear, GitHub integration
Memory: task-level context; works asynchronously in background
Safety: sandboxed; desktop app runs locally; cloud service

Differentiators

Proprietary model (Genie 2) — not just wrapping an API
SWE-Lancer leader: 72% pass rate (highest)
Asynchronous: works without active session
Slack/Jira/Linear integration for task intake
CLI + Desktop app + Cloud

Maturity

Production. Well-funded.

Relevance to Passepartout: HIGH

— Direct competitor: agent that takes tickets and delivers code — CLI-native operation — No contract-first, no neurosymbolic, no org-mode — PP's structured TDD cycle + deterministic rules are key differentiators — Cosine's Slack/Jira integration interesting for PP to consider

25. Mentat

What it does

CLI coding assistant that coordinates edits across multiple files. Project context understanding.

Architecture

Model: any LLM (OpenAI, Anthropic)
Tools: file editing across multiple files, project context gathering
Memory: session context, git awareness
Safety: user approval

Differentiators

Multi-file editing focus
Project-level understanding

Maturity

Currently inactive/archived. GitHub 404.

Relevance to Passepartout: LOW

— Inactive project. No meaningful differentiation from Aider/Codex CLI.

26. Ghostwriter by Replit (→ see Replit Agent #12)

27. Poolside

What it does

Foundation models for software engineering. Builds models + agents for enterprise. On-prem deployment, air-gapped. TUI, IDE extensions, agents.

Architecture

Model: Laguna XS.2 M.1 (proprietary foundation model)
Tools: agents, TUI, IDE extensions, multi-agent orchestration
Memory: enterprise context (connectors to repos, DBs, private data)
Safety: on-prem/VPC/air-gapped, RBAC for humans and agents, audit trails, executive governance, no data leaves customer boundary

Differentiators

Full-stack: builds foundation models + agents + enterprise deployment
Forward Deployed Research Engineers embedded with customers
Outcome ownership (not just model handoff)
AGI thesis: software engineering as path to AGI
Military/defense-grade security (not just compliance)

Maturity

Production. Frontier lab status. $500M+ funding.

Relevance to Passepartout: LOW

— Enterprise/military focus, completely different market — Building own models (PP uses existing LLMs) — TUI is just one surface among many — PP's individual-developer, open-source, TDD-first philosophy is opposite

28. Tabnine

What it does

Enterprise AI code completion + agent platform. Code completions, chat, agents, CLI, context engine. Gartner Visionary 2025.

Architecture

Model: multiple (code-specific small models + large model access)
Tools: code completion, chat, CLI, agents (planning, coding, testing, docs), context engine, provenance/attribution
Memory: Enterprise Context Engine (indexes org codebase, architecture, standards)
Safety: on-prem/air-gapped/VPC, zero data retention, IP indemnity, provenance tracking, admin controls, audit logs

Differentiators

Enterprise Context Engine: organizational intelligence layer for any agent
Fine-tuning on private repos
Provenance & attribution for IP compliance
Gartner Visionary; Leader in Omdia Universe
Zero Trust compliance (air-gapped)

Maturity

Production. 15+ years in market (originally Codota). Millions of developers.

Relevance to Passepartout: LOW

— Enterprise-focused (compliance, IP, governance). PP is individual-agent. — Context Engine concept is interesting but proprietary — No neurosymbolic. No contract-first. — CLI is secondary to IDE completions for Tabnine

29. Factory Droid (→ see #6)

30. Devin (→ see #5)

Emerging / Notable Others

Conductor (Melty) — see #16

Cline — see #7

RooCode — see #8

Augment Code CLI (Auggie) — see #19

Cosine Genie — see #24

Cross-Cutting Analysis

Safety / Security Architecture

System	Sandbox	Human-in-Loop	Diff Review	Gov/Enterprise	Notes
Aider	Git undo	File approval	Git diff	No	Git as safety net
Cline	None	Every op	Diff view	SSO, on-prem	Best HIL in class
Codex CLI	Sandboxed exec	On write/command	Console output	No	Basic
Devin	Sandboxed env	Deployment gate	PR review	Enterprise plan	Cloud sandbox
Cursor	None	Configurable	Diff view	No	.cursorrules
Copilot	None	Configurable	Diff view	SSO, audit, MCP allowlist	Best enterprise
Cosine	None	PR review	PR review	Cloud + on-prem	Async operation
Tabnine	None	Configurable	Diff view	Air-gapped, on-prem, audit	Best air-gapped
Poolside	Deploy boundary	Enterprise governance	Platform	Air-gapped, defense	Most secure by design

Memory Persistence

Nearly ALL systems have session-only memory. Exceptions:

Tabnine: Enterprise Context Engine (persistent codebase index)
Devin/Windsurf: Spaces (bundled context across sessions)
Poolside: enterprise connectors to repos+DBs
AutoGPT: vector DB persistent memory

Passepartout's approach: git + .org files as source of truth is unique and powerful — no other system uses literate programming as memory.

TUI / CLI Interface

CLI-native systems: Aider, Codex CLI, Cosine Genie, Void, Augment CLI IDE-first: Cursor, Windsurf, Copilot, Cline, Continue, PearAI Both: GitHub Copilot CLI, Poolside TUI

PP is CLI + Emacs. Closest in philosophy: Aider, Codex CLI, Cosine Genie.

Extensibility Model

MCP (Model Context Protocol): Cline, Cursor, Windsurf, Copilot, AutoGen
Custom prompts/rules: .cursorrules, .clinerules, AGENTS.md, CLAUDE.md
Python scripting: Aider
Agent creation: AutoGPT (block builder), CrewAI (role-based)
MCP server creation: Cline ("add a tool" via LLM)

PP's extensibility: Common Lisp macros + skill system. Unique: hot-reloadable skills, self-repair capability. No other system has this.

Neurosymbolic / Deterministic Rule Components

NONE of the surveyed systems have a neurosymbolic architecture or deterministic rule engine. The closest approximations:

.cursorrules / .clinerules / AGENTS.md / CLAUDE.md: plain-text instructions to the LLM (zero enforcement)
Cline's MCP tools: deterministic tool execution but no rule reasoning
Tabnine's Provenance: deterministic code matching but not rules
AutoGen's event-driven core: deterministic workflow but not rule-based reasoning

Passepartout's contract-first TDD (machine-verifiable contracts + Fiveam tests) is architecturally unique. No competitor has anything like it.

Key Takeaways for Passepartout

NO competitor has neurosymbolic architecture or deterministic rule enforcement. This is PP's strongest differentiator.
NO competitor uses literate programming (.org as source of truth) or org-babel tangle workflow. This is PP's second strongest differentiator.
NO competitor has hot-reloadable, self-repairable skills. PP's skill system (Lisp macros + fboundp guards) is unique.
Memory persistence is universally weak. PP's git + .org approach is arguably more robust than any competitor's session-only model.
CLI-native agent space is growing: Codex CLI, Aider, Cosine Genie, Auggie. PP must match or exceed their terminal UX quality.
MCP is becoming the universal extensibility standard. PP should support MCP.
Async/background operation (Cosine Genie, Devin) is a growing expectation. PP's REPL-based daemon architecture is well-positioned for this.
Enterprise features (SSO, on-prem, audit) are table stakes for enterprise but irrelevant for PP's individual-agent use case.
Multi-agent orchestration (AutoGen, CrewAI, Conductor) is a separate concern. PP should focus on single-agent excellence first.
The "contract-first TDD" workflow from .org → write test → watch fail → implement → watch pass → tangle is UNIQUE in the entire competitive landscape.

29 KiB Raw Blame History

Competitive Analysis — AI Coding Agents & Personal AI Agent Systems

Overview

1. Aider (Paul Gauthier)

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: HIGH

2. Cursor Agent Mode

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: MEDIUM

3. GitHub Copilot Agent Mode

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: LOW

4. Amazon Q Developer

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: LOW

5. Devin (Cognition AI)

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: MEDIUM

6. Factory AI / Factory Droid

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: LOW

7. Cline (formerly Claude Dev)

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: HIGH

8. RooCode

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: MEDIUM

9. AutoGPT

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: MEDIUM

10. Microsoft AutoGen

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: MEDIUM

11. CrewAI

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: LOW

12. Replit Agent (Ghostwriter)

What it does

Architecture

Differentiators

Maturity

Relevance to Passepartout: LOW

13. Codex CLI (OpenAI)

What it does

Architecture

Differentiators

Maturity

29 KiB

Raw Blame History