passepartout v0.7.2 (Gate Trace + HITL + Search + 11 more features): - Gate trace visualization with Ctrl+G toggle - HITL inline panels with styled collapse on approve/deny - Agent identity file + /identity command - Safe-tool read-only allowlist - Message search mode with Up/Down nav and highlights - Context budget visibility with section breakdown - Session rewind /sessions /resume /rewind - Undo/redo per operation - Context debugging /context why /context dropped - Tool hardening (timeouts, write verify, read-only cache) - Tag stack severity tiers + trigger counts - Merkle provenance audit + audit-verify - Self-help /help <topic> reads USER_MANUAL.org - Live CONFIG section in system prompts - Pads: Page Up/Down scroll by 10 lines Core 92/92 TUI Main 104/104 TUI View 29/29 Neuro 13/13
830 lines
29 KiB
Org Mode
830 lines
29 KiB
Org Mode
#+TITLE: Competitive Analysis — AI Coding Agents & Personal AI Agent Systems
|
|
#+DATE: 2026-05-08
|
|
#+CONTEXT: Research for Passepartout — Common Lisp AI coding agent with TUI/CLI, REPL-driven, neurosymbolic TDD workflow
|
|
|
|
* Overview
|
|
|
|
This document surveys 30+ AI coding agents and personal AI agent systems across
|
|
the dimensions most relevant to Passepartout: safety architecture, memory
|
|
persistence, TUI/CLI interface, extensibility model, neurosymbolic or
|
|
deterministic-rule components.
|
|
|
|
* 1. Aider (Paul Gauthier)
|
|
|
|
** What it does
|
|
Git-aware AI pair programming in the terminal. Reads/writes files in your repo,
|
|
auto-commits changes. Designed for interactive chat-based coding.
|
|
|
|
** Architecture
|
|
- Model: pluggable (OpenAI, Anthropic, Gemini, local via Ollama/LM Studio)
|
|
- Tools: file read/write, git, lint/test execution, repo map (tree-sitter AST analysis)
|
|
- Memory: git history + chat history file (.aider.chat.history.md). No persistent memory across sessions beyond git.
|
|
- Safety: git-backed undo per edit; user must approve file additions; linting/tests auto-run
|
|
|
|
** Differentiators
|
|
- Repo map: compresses entire codebase into ~1024 tokens of structured context
|
|
- Edit formats: whole-file, search/replace diff, universal diff — fallback chain
|
|
- Open source (Apache 2.0), Python, highly scriptable
|
|
- Benchmark leaderboard (SWE-bench, own editing/refactoring benchmarks)
|
|
|
|
** Maturity
|
|
Production. 30k+ GitHub stars. Active development.
|
|
|
|
** Relevance to Passepartout: HIGH
|
|
--- Directly comparable: CLI-native, git-integrated, extensible via Python scripting
|
|
--- PP's .org-as-source-of-truth + tangle workflow is architecturally distinct
|
|
--- Aider has no neurosymbolic components or deterministic rule engine
|
|
--- Aider lacks memory persistence beyond git; PP's org-mode + contract-first TDD is richer
|
|
|
|
* 2. Cursor Agent Mode
|
|
|
|
** What it does
|
|
IDE-integrated coding agent inside Cursor (VS Code fork). Agent mode can plan,
|
|
read/write files, run terminal commands, and iterate autonomously.
|
|
|
|
** Architecture
|
|
- Model: Claude, GPT-4o, etc. (cursor-small model for tab completion)
|
|
- Tools: file editing, terminal, @-symbols for context, MCP support, image input
|
|
- Memory: session-only; no persistent memory across sessions
|
|
- Safety: diff view for changes, user approval on terminal commands (configurable), lint monitoring
|
|
|
|
** Differentiators
|
|
- .cursorrules for project-specific instructions
|
|
- Visual diff before applying changes
|
|
- Tab completion + agent mode + chat in one IDE
|
|
- MCP server integration for custom tools
|
|
|
|
** Maturity
|
|
Production. Widely used.
|
|
|
|
** Relevance to Passepartout: MEDIUM
|
|
--- IDE-dependent (not TUI-first). PP's Emacs/terminal-native approach is different
|
|
--- Rule system (.cursorrules) is closest thing to deterministic rules — but plain-text prompts only
|
|
--- No neurosymbolic, no persistent memory, no contract-first workflow
|
|
|
|
* 3. GitHub Copilot Agent Mode
|
|
|
|
** What it does
|
|
Microsoft/GitHub's coding agent across VS Code, GitHub.com, CLI. Agent mode
|
|
(2025) can autonomously plan, edit, run commands.
|
|
|
|
** Architecture
|
|
- Model: multi-model (OpenAI, Claude, Gemini, Haiku via Copilot)
|
|
- Tools: IDE edit, terminal, MCP, code review, code search, GitHub issues
|
|
- Memory: per-session; Enterprise can index org codebase for retrieval
|
|
- Safety: IP indemnity, code referencing filter, admin-managed MCP allowlists,
|
|
audit logs for enterprise, opt-out for training data
|
|
|
|
** Differentiators
|
|
- Deep GitHub integration (PR review, issues, Actions)
|
|
- Multi-model access from one subscription
|
|
- Enterprise governance (SSO, audit, VPC)
|
|
- Copilot CLI for terminal-only use
|
|
|
|
** Maturity
|
|
Production. Largest userbase (millions).
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- Massive platform lock-in. No extensibility for custom workflows
|
|
--- No neurosymbolic. No persistent memory across sessions
|
|
--- Enterprise features irrelevant to PP's use case
|
|
--- The CLI component is closest competitor but lacks PP's TDD/contract cycle
|
|
|
|
* 4. Amazon Q Developer
|
|
|
|
** What it does
|
|
AWS's coding assistant across IDE, CLI, and AWS console. Code completion, chat,
|
|
security scanning, code transformation.
|
|
|
|
** Architecture
|
|
- Model: Amazon Bedrock (multiple FMs), augmented with AWS content
|
|
- Tools: IDE extension, CLI, AWS console chat, automated code review,
|
|
vulnerability scanning, code transformation (e.g., Java upgrades)
|
|
- Memory: session; no persistent cross-session memory
|
|
- Safety: AWS IAM permissions, Bedrock abuse detection, zero data retention for
|
|
Business tier; no training on Enterprise data
|
|
|
|
** Differentiators
|
|
- Deep AWS knowledge (VPC, EC2, Lambda, etc.)
|
|
- Automated code transformation (e.g., Java 8→17)
|
|
- Security vulnerability scanning built in
|
|
- Free tier generous
|
|
|
|
** Maturity
|
|
Production.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- AWS-ecosystem focused. No CLI/TUI philosophy. No extensibility.
|
|
--- Not a general-purpose agent; AWS-specific
|
|
|
|
* 5. Devin (Cognition AI)
|
|
|
|
** What it does
|
|
Autonomous AI software engineer in a sandboxed environment. Plans, codes, tests,
|
|
deploys end-to-end. Acquired Windsurf (Codeium).
|
|
|
|
** Architecture
|
|
- Model: Claude Sonnet 4.5 (publicly), proprietary model claimed
|
|
- Tools: shell, code editor, browser, sandboxed compute environment
|
|
- Memory: per-session long-term reasoning; can recall context across steps
|
|
within a task. Reports progress in real-time
|
|
- Safety: sandboxed environment, user approval on deployment, SSH key support
|
|
|
|
** Differentiators
|
|
- Full autonomy (not pair programming) — can be assigned via Slack, Jira
|
|
- SWE-bench leader: 13.86% (initial), now higher
|
|
- Can learn unfamiliar technologies, train models, do Upwork tasks
|
|
- Windsurf acquisition: now owns IDE + cloud agent stack
|
|
|
|
** Maturity
|
|
Production (GA Dec 2024). Backed by $21M+ from Founders Fund.
|
|
|
|
** Relevance to Passepartout: MEDIUM
|
|
--- Autonomous agent philosophy differs from PP's interactive TDD partner
|
|
--- WindSurf integration creates IDE dependency; PP is terminal-native
|
|
--- No neurosymbolic. No contract-first. No persistent memory (per-task only)
|
|
|
|
* 6. Factory AI / Factory Droid
|
|
|
|
** What it does
|
|
Automated code review and bug-fixing. Runs as GitHub app on every PR.
|
|
Droid bot auto-fixes issues found in review.
|
|
|
|
** Architecture
|
|
- Model: Claude/GPT (likely)
|
|
- Tools: GitHub PR integration, code review, auto-fix generation
|
|
- Memory: PR-level context; no cross-PR memory
|
|
- Safety: review-before-apply; GitHub permissions
|
|
|
|
** Differentiators
|
|
- PR-review focused (not general coding agent)
|
|
- Auto-fix generation as part of review workflow
|
|
- Enterprise-focused (code review automation)
|
|
|
|
** Maturity
|
|
Beta/production.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- Narrow scope (PR review). Not a general agent.
|
|
--- No TUI, no memory, no extensibility.
|
|
|
|
* 7. Cline (formerly Claude Dev)
|
|
|
|
** What it does
|
|
Autonomous coding agent VSCode extension. Can create/edit files, run terminal
|
|
commands, use browser, execute MCP tools. Human-in-the-loop for all actions.
|
|
|
|
** Architecture
|
|
- Model: any (OpenRouter, Anthropic, OpenAI, Google, AWS Bedrock, local models)
|
|
- Tools: file R/W, terminal, browser (computer use), MCP servers, linter/compiler
|
|
monitoring, checkpoint/restore
|
|
- Memory: session context; checkpoints as workspace snapshots; no persistent
|
|
cross-session memory
|
|
- Safety: human-in-the-loop for every file change and terminal command (GUI
|
|
approval); diff view; checkpoints for rollback; permission gates
|
|
|
|
** Differentiators
|
|
- "Add a tool" — can ask Cline to create new MCP servers on the fly
|
|
- @url, @problems, @file, @folder context markers
|
|
- Browser computer use for interactive debugging
|
|
- Checkpoint system: compare/restore workspace snapshots
|
|
- Open source (Apache 2.0), 61k+ stars
|
|
- Enterprise: SSO, on-prem, audit trails
|
|
|
|
** Maturity
|
|
Production. 61.5k GitHub stars. Rapid development.
|
|
|
|
** Relevance to Passepartout: HIGH
|
|
--- Closest architecture: extensible via MCP, CLI+editor integration, human-in-loop
|
|
--- MCP-based tool creation PP could adopt
|
|
--- No neurosymbolic rules engine; contracts are plain .clinerules text
|
|
--- Checkpoint workflow similar to PP's git-based snapshots but less structured
|
|
--- PP's .org source-of-truth + tangle is unique
|
|
|
|
* 8. RooCode
|
|
|
|
** What it does
|
|
VSCode extension for multi-agent coding. Variant/fork of Cline with multiple
|
|
agent "modes" (architect, ask, code, custom).
|
|
|
|
** Architecture
|
|
- Model: any (same provider list as Cline)
|
|
- Tools: file editing, terminal, browser, MCP, image support
|
|
- Memory: per-session context
|
|
- Safety: human approval gates, diff view
|
|
|
|
** Differentiators
|
|
- Multi-agent modes (architect plans, coder implements, ask answers)
|
|
- Custom modes with custom prompts
|
|
- Forked from Cline, similar architecture
|
|
|
|
** Maturity
|
|
Production.
|
|
|
|
** Relevance to Passepartout: MEDIUM
|
|
--- Multi-agent orchestration is interesting but VSCode-dependent
|
|
--- No neurosymbolic. No persistent memory. No contract-first.
|
|
|
|
* 9. AutoGPT
|
|
|
|
** What it does
|
|
Platform for building, deploying, and running continuous AI agents.
|
|
Classic version was autonomous GPT-4 agent; now a platform with agent builder,
|
|
marketplace, workflow management.
|
|
|
|
** Architecture
|
|
- Model: any LLM (pluggable)
|
|
- Tools: web search, file operations, code execution, block-based workflow builder
|
|
- Memory: long-term memory via vector DB (Redis/Pinecone), persistent agent state
|
|
- Safety: Docker sandboxing, user approval gates
|
|
|
|
** Differentiators
|
|
- Agent builder with visual block-based workflow
|
|
- Marketplace for pre-built agents
|
|
- Continuous/long-running agents (not session-only)
|
|
- Classic AutoGPT pioneered autonomous agent loop (think → act → observe)
|
|
|
|
** Maturity
|
|
Production. 184k stars. Classic in maintenance; platform in beta/active.
|
|
|
|
** Relevance to Passepartout: MEDIUM
|
|
--- Long-running, persistent agents concept is relevant
|
|
--- Block-based workflow builder is anti-neurosymbolic (no rules engine)
|
|
--- Python-centric; PP is Common Lisp
|
|
--- No contract-first TDD workflow
|
|
|
|
* 10. Microsoft AutoGen
|
|
|
|
** What it does
|
|
Multi-agent conversation framework from Microsoft. Agents can converse,
|
|
collaborate, execute code, use tools. .NET and Python.
|
|
|
|
** Architecture
|
|
- Model: any (OpenAI, etc.)
|
|
- Tools: MCP, Docker code execution, OpenAPI, web search, distributed runtimes
|
|
- Memory: conversation history; no built-in long-term memory; use extensions
|
|
- Safety: Docker sandbox for code execution; human-in-loop patterns
|
|
|
|
** Differentiators
|
|
- Event-driven, distributed multi-agent architecture (gRPC runtime)
|
|
- AgentChat for conversational, Core for event-driven, Studio for GUI
|
|
- MCP tool support built-in
|
|
- .NET and Python support
|
|
- Research-grade multi-agent patterns
|
|
|
|
** Maturity
|
|
Stable/Production. Backed by Microsoft.
|
|
|
|
** Relevance to Passepartout: MEDIUM
|
|
--- Multi-agent orchestration architecture is relevant
|
|
--- No TUI/CLI focus; Python/.NET
|
|
--- No neurosymbolic; no deterministic rules
|
|
--- PP could learn from AutoGen's event-driven agent patterns
|
|
|
|
* 11. CrewAI
|
|
|
|
** What it does
|
|
Open-source framework for orchestrating autonomous AI agents as "crews" with
|
|
role-based collaboration. Flows for workflow control.
|
|
|
|
** Architecture
|
|
- Model: any LLM (pluggable)
|
|
- Tools: API, database, custom tools; agent roles with specific goals
|
|
- Memory: conversation-based; no built-in persistent memory across crews
|
|
- Safety: enterprise security claims; human-in-loop patterns
|
|
|
|
** Differentiators
|
|
- Role-playing agents (researcher, writer, etc.)
|
|
- Flows (stateful, event-driven) + Crews (autonomous teams)
|
|
- 100k+ certified developers
|
|
- Enterprise-ready
|
|
|
|
** Maturity
|
|
Production.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- Python framework, not a standalone agent
|
|
--- No TUI/CLI; not a coding agent
|
|
--- Role-based agent pattern is interesting but not directly applicable
|
|
|
|
* 12. Replit Agent (Ghostwriter)
|
|
|
|
** What it does
|
|
In-browser coding agent on Replit platform. Build, deploy apps from prompts.
|
|
Full IDE in browser with AI agent.
|
|
|
|
** Architecture
|
|
- Model: proprietary (likely fine-tuned LLM)
|
|
- Tools: in-browser IDE, file system, terminal, deployment, database
|
|
- Memory: project context within session
|
|
- Safety: sandboxed in-browser environment; Replit platform moderation
|
|
|
|
** Differentiators
|
|
- Zero setup: browser-based, no install
|
|
- Full-stack: code + DB + deploy from one prompt
|
|
- Educational focus (used in classrooms)
|
|
- Collaborative editing
|
|
|
|
** Maturity
|
|
Production.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- Cloud-only, browser-based. Anti-TUI.
|
|
--- No extensibility. No memory persistence.
|
|
--- Educational/consumer focus, not power-user agent
|
|
|
|
* 13. Codex CLI (OpenAI)
|
|
|
|
** What it does
|
|
Lightweight CLI coding agent from OpenAI. Runs locally, writes files, runs
|
|
commands. Desktop app variant available.
|
|
|
|
** Architecture
|
|
- Model: OpenAI models (GPT-5, o-series)
|
|
- Tools: file read/write, shell execution, sandboxed environment
|
|
- Memory: session context; conversation history per session
|
|
- Safety: user approval on file writes and commands; runs locally; sandboxed
|
|
execution
|
|
|
|
** Differentiators
|
|
- CLI-native (npm install -g @openai/codex)
|
|
- Desktop app (codex app) for richer UI
|
|
- Multi-platform (macOS, Linux, Windows)
|
|
- Open source (Apache 2.0), 81k stars, 6k+ commits
|
|
- "Sign in with ChatGPT" or API key
|
|
- Environment management for secrets
|
|
|
|
** Maturity
|
|
Production. 81k GitHub stars. Very active.
|
|
|
|
** Relevance to Passepartout: HIGH
|
|
--- Direct competitor: CLI-native coding agent
|
|
--- Same philosophy: terminal-first, local execution
|
|
--- PP differentiators: .org source-of-truth, tangle workflow, neurosymbolic
|
|
TDD, contract-first, deterministic rules engine
|
|
--- Codex has NO neurosymbolic component, NO contracts, NO persistent memory
|
|
beyond git, NO rule engine
|
|
|
|
* 14. Continue.dev
|
|
|
|
** What it does
|
|
Open-source AI code assistant for IDE. Chat, edit, tab-completion. Now pivoted
|
|
to Continuous AI — AI checks on PRs (source-controlled checks).
|
|
|
|
** Architecture
|
|
- Model: any (OpenAI, Anthropic, Ollama, etc.)
|
|
- Tools: IDE chat, file editing, @-references, PR checks
|
|
- Memory: session-based
|
|
- Safety: local models possible, diff-based editing
|
|
|
|
** Differentiators
|
|
- Fully open-source IDE assistant
|
|
- "Checks" — source-controlled AI reviews as markdown files in repo
|
|
- Multiple model providers
|
|
- VS Code + JetBrains
|
|
|
|
** Maturity
|
|
Production. Renamed to Continuous AI for PR-check product.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- IDE-dependent. PR-check focus is different from PP's build-time agent
|
|
--- "Checks as markdown" concept is closest to PP's .org-based contracts —
|
|
but far less structured. PP's contracts are machine-verifiable, not just
|
|
prompts
|
|
|
|
* 15. PearAI
|
|
|
|
** What it does
|
|
AI code editor (VS Code fork) with integrated coding agent + chat.
|
|
Open-source, Bun-based performance.
|
|
|
|
** Architecture
|
|
- Model: any (OpenAI, Anthropic, Ollama)
|
|
- Tools: IDE agent, chat, file editing, context management
|
|
- Memory: session-based
|
|
- Safety: open source, local model support
|
|
|
|
** Differentiators
|
|
- VS Code fork (not extension)
|
|
- Bun for performance
|
|
- Free, open source
|
|
- "Context" management for prompt optimization
|
|
|
|
** Maturity
|
|
Beta/Production.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- IDE-dependent fork. PP's Emacs + TUI is philosophically opposite.
|
|
--- No unique architecture features.
|
|
|
|
* 16. Melty (now Conductor)
|
|
|
|
** What it does
|
|
Originally Melty, now Conductor — orchestrator for running multiple coding
|
|
agents (Claude Code, Codex) in parallel on your Mac. Each agent gets an
|
|
isolated git worktree.
|
|
|
|
** Architecture
|
|
- Model: uses Codex + Claude Code under the hood
|
|
- Tools: git worktree management, parallel agent execution, review UI
|
|
- Memory: per-task git worktree; no cross-session memory
|
|
- Safety: git isolation; user reviews changes before merging
|
|
Differentiators
|
|
- Multi-agent parallelism (not multi-agent collaboration)
|
|
- Git worktree-based isolation
|
|
- Dashboard for monitoring agents
|
|
|
|
** Maturity
|
|
Production (Beta/2025). Used at Linear, Vercel, Notion, Ramp.
|
|
|
|
** Relevance to Passepartout: MEDIUM
|
|
--- Parallel agent orchestration model is interesting
|
|
--- Doesn't replace PP's workflow; could complement
|
|
--- No neurosymbolic, no rules engine, no memory persistence
|
|
|
|
* 17. Windsurf / Codeium (now part of Cognition AI / Devin)
|
|
|
|
** What it does
|
|
AI-native IDE. Cascade agent for autonomous coding. Tab completion, agent mode,
|
|
MCP support. Acquired by Cognition (Devin).
|
|
|
|
** Architecture
|
|
- Model: multi-model (GPT-5, Claude, custom)
|
|
- Tools: Cascade (agent), Tab (completions), MCP, JetBrains plugin, Devin
|
|
integration, Spaces (bundled context)
|
|
- Memory: Cascade sessions within workspace; Spaces for grouped context
|
|
- Safety: admin-managed MCP servers; enterprise controls
|
|
|
|
** Differentiators
|
|
- Cascade: local agent for real-time assistance
|
|
- Devin integration: cloud agent for long-running tasks
|
|
- Spaces: bundle agent sessions, PRs, files around a task
|
|
- Agent Command Center: Kanban dashboard for agents
|
|
- JetBrains plugin (targets non-VS Code users)
|
|
|
|
** Maturity
|
|
Production. 1M+ users, 4k+ enterprise customers.
|
|
|
|
** Relevance to Passepartout: MEDIUM
|
|
--- IDE-dependent (VS Code fork + JetBrains plugin)
|
|
--- Cascade + Devin hybrid local/cloud model is architecturally interesting
|
|
--- No neurosymbolic. No deterministic rules. No contract-first.
|
|
--- Spaces concept (grouping context around a task) is close to PP's session
|
|
management
|
|
|
|
* 18. Cursor AI
|
|
|
|
** What it does
|
|
AI-first code editor (VS Code fork). Multi-model, agent mode, tab completion,
|
|
MCP support. The most popular AI IDE.
|
|
|
|
** Architecture
|
|
- Model: proprietary (cursor-small) + OpenAI, Anthropic, Gemini
|
|
- Tools: agent mode, tab completion, chat, @-symbols, MCP, terminal
|
|
- Memory: session; no cross-session persistence
|
|
- Safety: diff view, configurable permission levels, image input support
|
|
|
|
** Differentiators
|
|
- First-mover in AI IDEs (fork vs extension approach)
|
|
- .cursorrules for project conventions
|
|
- Fast tab completion (custom small model)
|
|
- @-symbol context system (files, docs, web)
|
|
|
|
** Maturity
|
|
Production. Most popular AI IDE.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- IDE-dependent. PP is philosophy of terminal + .org + Emacs
|
|
--- No neurosymbolic, no persistent memory, no contract-first
|
|
|
|
* 19. Augment Code
|
|
|
|
** What it does
|
|
AI coding platform with deep codebase understanding. Agent, chat, CLI (Auggie),
|
|
context engine that indexes entire codebase.
|
|
|
|
** Architecture
|
|
- Model: proprietary + multi-model
|
|
- Tools: agent, code completions (sunset soon), CLI (Auggie), IDE extensions
|
|
- Memory: codebase index (persistent), session context
|
|
- Safety: enterprise SSO, permissions
|
|
|
|
** Differentiators
|
|
- Auggie CLI — terminal-first agent (closest to PP)
|
|
- Codebase-wide context engine (indexes entire repo, not just open files)
|
|
- Agent can tackle large tasks autonomously
|
|
|
|
** Maturity
|
|
Production. Well-funded.
|
|
|
|
** Relevance to Passepartout: HIGH
|
|
--- Auggie CLI is directly comparable: terminal-native coding agent
|
|
--- Full codebase indexing is better than PP's current approach
|
|
--- No neurosymbolic. No contract-first. No org-mode source-of-truth.
|
|
--- PP's TDD + contract + tangle workflow is unique differentiator
|
|
|
|
* 20. Qoder
|
|
|
|
** What it does
|
|
Coding agent platform. Details limited.
|
|
|
|
** Architecture
|
|
Unknown — website unreachable.
|
|
|
|
** Maturity
|
|
Unknown.
|
|
|
|
** Relevance to Passepartout: UNKNOWN
|
|
--- Insufficient data.
|
|
|
|
* 21. v0 by Vercel
|
|
|
|
** What it does
|
|
UI generation agent. Generates React/Next.js components and pages from text
|
|
prompts. Visual design oriented.
|
|
|
|
** Architecture
|
|
- Model: proprietary (likely fine-tuned)
|
|
- Tools: code generation, visual design mode, templates, deployment to Vercel
|
|
- Memory: per-session; design context within chat
|
|
- Safety: Vercel platform controls
|
|
|
|
** Differentiators
|
|
- Visual-first: generates UI, not general code
|
|
- Design mode: fine-tune with visual controls
|
|
- Template library
|
|
- iOS app for mobile building
|
|
- Deep Vercel/Next.js integration
|
|
|
|
** Maturity
|
|
Production.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- Narrow domain (UI generation). Not a general coding agent.
|
|
--- Cloud-only. No TUI.
|
|
--- No relevance to PP's workflow.
|
|
|
|
* 22. Lovable
|
|
|
|
** What it does
|
|
Full-stack application generation from natural language prompts. Build and
|
|
deploy apps/websites via AI chat.
|
|
|
|
** Architecture
|
|
- Model: proprietary (likely fine-tuned)
|
|
- Tools: app generation, deployment, domain registration, mobile app
|
|
- Memory: per-project conversation
|
|
- Safety: platform-level moderation
|
|
|
|
** Differentiators
|
|
- Full-stack: frontend + backend + DB + deploy
|
|
- No-code-friendly (describe app → get working app)
|
|
- Template library, mobile companion app
|
|
- Enterprise security claims
|
|
|
|
** Maturity
|
|
Production.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- Consumer/no-code focus. Not a developer coding agent.
|
|
--- No TUI, no extensibility, no memory persistence beyond project.
|
|
|
|
* 23. Void (formerly based on Codex)
|
|
|
|
** What it does
|
|
CLI coding agent for terminal-based AI code generation. Originally built on
|
|
OpenAI Codex.
|
|
|
|
** Architecture
|
|
- Model: OpenAI API-compatible
|
|
- Tools: terminal code generation, file writing
|
|
- Memory: session context
|
|
- Safety: user approval on file changes
|
|
|
|
** Differentiators
|
|
- CLI-native (similar to Codex CLI)
|
|
- Lightweight
|
|
|
|
** Maturity
|
|
Beta/early.
|
|
|
|
** Relevance to Passepartout: MEDIUM
|
|
--- CLI-native coding agent, directly comparable to PP
|
|
--- Less mature than Codex CLI or Aider
|
|
--- No unique differentiators
|
|
|
|
* 24. Cosine Genie
|
|
|
|
** What it does
|
|
Autonomous AI software engineer. Takes Jira tickets or PR descriptions, breaks
|
|
them down, writes code, delivers PRs. Works asynchronously.
|
|
|
|
** Architecture
|
|
- Model: Genie 2 (proprietary model); 72% on SWE-Lancer
|
|
- Tools: IDE/CLI, Slack, Jira, Linear, GitHub integration
|
|
- Memory: task-level context; works asynchronously in background
|
|
- Safety: sandboxed; desktop app runs locally; cloud service
|
|
|
|
** Differentiators
|
|
- Proprietary model (Genie 2) — not just wrapping an API
|
|
- SWE-Lancer leader: 72% pass rate (highest)
|
|
- Asynchronous: works without active session
|
|
- Slack/Jira/Linear integration for task intake
|
|
- CLI + Desktop app + Cloud
|
|
|
|
** Maturity
|
|
Production. Well-funded.
|
|
|
|
** Relevance to Passepartout: HIGH
|
|
--- Direct competitor: agent that takes tickets and delivers code
|
|
--- CLI-native operation
|
|
--- No contract-first, no neurosymbolic, no org-mode
|
|
--- PP's structured TDD cycle + deterministic rules are key differentiators
|
|
--- Cosine's Slack/Jira integration interesting for PP to consider
|
|
|
|
* 25. Mentat
|
|
|
|
** What it does
|
|
CLI coding assistant that coordinates edits across multiple files. Project
|
|
context understanding.
|
|
|
|
** Architecture
|
|
- Model: any LLM (OpenAI, Anthropic)
|
|
- Tools: file editing across multiple files, project context gathering
|
|
- Memory: session context, git awareness
|
|
- Safety: user approval
|
|
|
|
** Differentiators
|
|
- Multi-file editing focus
|
|
- Project-level understanding
|
|
|
|
** Maturity
|
|
Currently inactive/archived. GitHub 404.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- Inactive project. No meaningful differentiation from Aider/Codex CLI.
|
|
|
|
* 26. Ghostwriter by Replit (→ see Replit Agent #12)
|
|
* 27. Poolside
|
|
|
|
** What it does
|
|
Foundation models for software engineering. Builds models + agents for
|
|
enterprise. On-prem deployment, air-gapped. TUI, IDE extensions, agents.
|
|
|
|
** Architecture
|
|
- Model: Laguna XS.2 M.1 (proprietary foundation model)
|
|
- Tools: agents, TUI, IDE extensions, multi-agent orchestration
|
|
- Memory: enterprise context (connectors to repos, DBs, private data)
|
|
- Safety: on-prem/VPC/air-gapped, RBAC for humans and agents, audit trails,
|
|
executive governance, no data leaves customer boundary
|
|
|
|
** Differentiators
|
|
- Full-stack: builds foundation models + agents + enterprise deployment
|
|
- Forward Deployed Research Engineers embedded with customers
|
|
- Outcome ownership (not just model handoff)
|
|
- AGI thesis: software engineering as path to AGI
|
|
- Military/defense-grade security (not just compliance)
|
|
|
|
** Maturity
|
|
Production. Frontier lab status. $500M+ funding.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- Enterprise/military focus, completely different market
|
|
--- Building own models (PP uses existing LLMs)
|
|
--- TUI is just one surface among many
|
|
--- PP's individual-developer, open-source, TDD-first philosophy is opposite
|
|
|
|
* 28. Tabnine
|
|
|
|
** What it does
|
|
Enterprise AI code completion + agent platform. Code completions, chat, agents,
|
|
CLI, context engine. Gartner Visionary 2025.
|
|
|
|
** Architecture
|
|
- Model: multiple (code-specific small models + large model access)
|
|
- Tools: code completion, chat, CLI, agents (planning, coding, testing, docs),
|
|
context engine, provenance/attribution
|
|
- Memory: Enterprise Context Engine (indexes org codebase, architecture,
|
|
standards)
|
|
- Safety: on-prem/air-gapped/VPC, zero data retention, IP indemnity,
|
|
provenance tracking, admin controls, audit logs
|
|
|
|
** Differentiators
|
|
- Enterprise Context Engine: organizational intelligence layer for any agent
|
|
- Fine-tuning on private repos
|
|
- Provenance & attribution for IP compliance
|
|
- Gartner Visionary; Leader in Omdia Universe
|
|
- Zero Trust compliance (air-gapped)
|
|
|
|
** Maturity
|
|
Production. 15+ years in market (originally Codota). Millions of developers.
|
|
|
|
** Relevance to Passepartout: LOW
|
|
--- Enterprise-focused (compliance, IP, governance). PP is individual-agent.
|
|
--- Context Engine concept is interesting but proprietary
|
|
--- No neurosymbolic. No contract-first.
|
|
--- CLI is secondary to IDE completions for Tabnine
|
|
|
|
* 29. Factory Droid (→ see #6)
|
|
* 30. Devin (→ see #5)
|
|
|
|
* Emerging / Notable Others
|
|
|
|
** Conductor (Melty) — see #16
|
|
** Cline — see #7
|
|
** RooCode — see #8
|
|
** Augment Code CLI (Auggie) — see #19
|
|
** Cosine Genie — see #24
|
|
|
|
* Cross-Cutting Analysis
|
|
|
|
** Safety / Security Architecture
|
|
|
|
| System | Sandbox | Human-in-Loop | Diff Review | Gov/Enterprise | Notes |
|
|
|--------|---------|---------------|-------------|----------------|-------|
|
|
| Aider | Git undo| File approval | Git diff | No | Git as safety net |
|
|
| Cline | None | Every op | Diff view | SSO, on-prem | Best HIL in class |
|
|
| Codex CLI | Sandboxed exec | On write/command | Console output | No | Basic |
|
|
| Devin | Sandboxed env | Deployment gate | PR review | Enterprise plan | Cloud sandbox |
|
|
| Cursor | None | Configurable | Diff view | No | .cursorrules |
|
|
| Copilot| None | Configurable | Diff view | SSO, audit, MCP allowlist | Best enterprise |
|
|
| Cosine | None | PR review | PR review | Cloud + on-prem| Async operation |
|
|
| Tabnine| None | Configurable | Diff view | Air-gapped, on-prem, audit | Best air-gapped |
|
|
| Poolside| Deploy boundary | Enterprise governance | Platform | Air-gapped, defense | Most secure by design |
|
|
|
|
** Memory Persistence
|
|
|
|
Nearly ALL systems have session-only memory. Exceptions:
|
|
- Tabnine: Enterprise Context Engine (persistent codebase index)
|
|
- Devin/Windsurf: Spaces (bundled context across sessions)
|
|
- Poolside: enterprise connectors to repos+DBs
|
|
- AutoGPT: vector DB persistent memory
|
|
|
|
Passepartout's approach: git + .org files as source of truth is unique and
|
|
powerful — no other system uses literate programming as memory.
|
|
|
|
** TUI / CLI Interface
|
|
|
|
CLI-native systems: Aider, Codex CLI, Cosine Genie, Void, Augment CLI
|
|
IDE-first: Cursor, Windsurf, Copilot, Cline, Continue, PearAI
|
|
Both: GitHub Copilot CLI, Poolside TUI
|
|
|
|
PP is CLI + Emacs. Closest in philosophy: Aider, Codex CLI, Cosine Genie.
|
|
|
|
** Extensibility Model
|
|
|
|
- MCP (Model Context Protocol): Cline, Cursor, Windsurf, Copilot, AutoGen
|
|
- Custom prompts/rules: .cursorrules, .clinerules, AGENTS.md, CLAUDE.md
|
|
- Python scripting: Aider
|
|
- Agent creation: AutoGPT (block builder), CrewAI (role-based)
|
|
- MCP server creation: Cline ("add a tool" via LLM)
|
|
|
|
PP's extensibility: Common Lisp macros + skill system. Unique: hot-reloadable
|
|
skills, self-repair capability. No other system has this.
|
|
|
|
** Neurosymbolic / Deterministic Rule Components
|
|
|
|
NONE of the surveyed systems have a neurosymbolic architecture or deterministic
|
|
rule engine. The closest approximations:
|
|
- .cursorrules / .clinerules / AGENTS.md / CLAUDE.md: plain-text instructions
|
|
to the LLM (zero enforcement)
|
|
- Cline's MCP tools: deterministic tool execution but no rule reasoning
|
|
- Tabnine's Provenance: deterministic code matching but not rules
|
|
- AutoGen's event-driven core: deterministic workflow but not rule-based
|
|
reasoning
|
|
|
|
Passepartout's contract-first TDD (machine-verifiable contracts + Fiveam tests)
|
|
is architecturally unique. No competitor has anything like it.
|
|
|
|
* Key Takeaways for Passepartout
|
|
|
|
1. NO competitor has neurosymbolic architecture or deterministic rule
|
|
enforcement. This is PP's strongest differentiator.
|
|
|
|
2. NO competitor uses literate programming (.org as source of truth) or
|
|
org-babel tangle workflow. This is PP's second strongest differentiator.
|
|
|
|
3. NO competitor has hot-reloadable, self-repairable skills. PP's skill system
|
|
(Lisp macros + fboundp guards) is unique.
|
|
|
|
4. Memory persistence is universally weak. PP's git + .org approach is
|
|
arguably more robust than any competitor's session-only model.
|
|
|
|
5. CLI-native agent space is growing: Codex CLI, Aider, Cosine Genie, Auggie.
|
|
PP must match or exceed their terminal UX quality.
|
|
|
|
6. MCP is becoming the universal extensibility standard. PP should support MCP.
|
|
|
|
7. Async/background operation (Cosine Genie, Devin) is a growing expectation.
|
|
PP's REPL-based daemon architecture is well-positioned for this.
|
|
|
|
8. Enterprise features (SSO, on-prem, audit) are table stakes for enterprise
|
|
but irrelevant for PP's individual-agent use case.
|
|
|
|
9. Multi-agent orchestration (AutoGen, CrewAI, Conductor) is a separate
|
|
concern. PP should focus on single-agent excellence first.
|
|
|
|
10. The "contract-first TDD" workflow from .org → write test → watch fail →
|
|
implement → watch pass → tangle is UNIQUE in the entire competitive
|
|
landscape.
|