:PROPERTIES: :CREATED: [2026-05-31 Sun] :ID: b2c3d4e5-f6a7-8b9c-0d1e-2f3a4b5c6d7e :END: #+title: Pass as a Service #+filetags: :passepartout:strategy:hosting:economics: #+STATUS: draft * Assumptions - Neurosymbolic Agent instance: one SBCL Lisp process with Gate + PDS + environment in one address space - User brings own LLM API key — no AI token cost to the provider - Containerized: Docker image running on cloud VMs (AWS spot instances) - Instances are embarrassingly parallel — no cross-instance coordination * Unit cost per idle instance At rest: ~500 MB-1 GB RAM. Active peaks at 2-3 GB. CPU at rest negligible. * Packing density AWS r6a instances (AMD, good price-to-RAM): - r6a.4xlarge (128 GB RAM): ~$0.23/hr spot, ~80 instances per VM - r6a.8xlarge (256 GB RAM): ~$0.45/hr spot, ~160 instances per VM Cost per user per month at ~80 instances per 128 GB VM: ~$1.50 for compute. * Infrastructure cost breakdown (100K users) | Component | Detail | $/user/month | |-----------|--------|-------------| | Compute | r6a spot, 80 instances/VM | ~$1.50 | | Storage | 10 GB EBS gp3 per user | ~$0.80 | | Egress | Light protocol usage | ~$0.50 | | Relay | K8s, stateless web service | ~$0.50 | | **Infra subtotal** | | **~$3.30** | | Overhead | | | |----------|---|-------------| | Engineering | 4-5 people | ~$0.80-1.60 | | Support | 2-3 people | ~$0.40-0.80 | | **Overhead subtotal** | | **~$1.20-2.40** | | **Total** | | **~$4.50-5.70** | * Pricing and margin At $10/user/month: - Cost: ~$4.50-5.70/user/month - **Margin: 43-55%** * Scaling inflection points | Users | Provider cost/user/mo | Margin at $10/mo | |-------|---------------------|-----------------| | 1K | $25-40+ | Negative | | 10K | $8-12 | ~0-20% | | 50K | $5-7 | ~30-50% | | 100K | $4.50-5.70 | ~43-55% | | 1M | $2-4 | ~60-80% | 10K-20K users is the crossover to positive unit economics. Below that, the team overhead dominates. * Cloud vs colo At small scale (under 10K users): AWS wins. No hardware risk, no colo contract, elastic. At large scale (100K+ users): Colo is 2-5x cheaper per instance. AWS premium comes from degraded packing density (hypervisor overhead, can't overcommit memory). Crossover at roughly 50K-100K users where dedicated ops justify colo. * Architecture Relay on Kubernetes (stateless web service, standard pattern). Instances are Docker containers on raw VMs — one container = one SBCL Lisp process + volume mount for PDS. No orchestration magic needed for the instance layer. The hardest operational problems: port mapping at scale (reverse proxy in front of VM pools) and PDS data persistence on VM failure (EBS snapshots or NFS-backed volumes). * Why this works Three things make the unit economics viable early: 1. Zero AI token cost (user brings own API key) 2. The Gate runs even without an LLM — caches common decisions, declines to reason when no key is configured. Not a degraded product, just a non-AI mode. 3. Docker-on-large-VM packing recovers bare-metal packing density on cloud, avoiding per-instance overhead. * Addressable market **AI chat vs AI agents — orders of magnitude gap:** | Category | Users (Jun 2026) | Notes | |----------|-----------------|-------| | ChatGPT (chatbot) | 900M weekly active | Mostly text generation | | AI agent users (all tools) | 5-10M | Actions, tools, environment control | | Ratio | ~100:1 | Not 1000:1 as of mid-2026 | Agent users are 1% of chatbot users today. If agent adoption follows the same growth curve as chatbots but lags by 18-24 months: | Year | Est. agent users | 0.1% capture = users | MRR at $10/mo | Annual rev at 50% margin | |------|-----------------|--------------------|--------------|--------------------------| | 2026 | 5-10M | 5K-10K | $50K-100K | $300K-600K | | 2027 | 50-100M | 50K-100K | $500K-1M | $3M-6M | | 2028 | 300-500M | 300K-500K | $3M-5M | $18M-30M | This is conservative — 0.1% capture of the agent market, $10/month (no AI tokens included). Passepartout is not just an AI agent. It's a social protocol, verified computing environment, and knowledge system. It competes on more than agent UX. Even a fraction of the growing agent market funds the infrastructure. * Price ladder The most important constraint: the price users will bear must cover real infrastructure cost at whatever scale you're at. Two tiers solve for both growth and unit economics. **Self-hosted tier (growth engine):** - User downloads the image, runs on own hardware or $5-10/mo VPS - Brings own API key for LLM access - Provider cost: ~$0.50-1/user/month (relay + routing) - Zero per-user compute or storage cost to provider - Negative margin but negligible — scales to millions freely - This is the wedge: proves the protocol, builds the network, costs nothing to operate per user **Hosted tier (revenue engine):** - Provider-managed container, user brings API key - Packing density drives cost: - At small scale (<5K hosted users): cost = $20-25/user/month - At mature scale (50K+): cost = $5-7/user/month | Phase | Scale | Hosted cost/mo | Charged price | Margin | |-------|-------|---------------|---------------|--------| | Bootstrap | <5K | $20-25 | $25-30 | 10-20% | | Break-even | 5-20K | $10-15 | $20-25 | 40-50% | | Mature | 50K+ | $5-7 | $15-20 | 60-70% | | Commodity | 500K+ | $2-4 | $10-15 | 75-85% | **Pricing strategy:** - Never price below cost — ramp pricing down as infrastructure efficiency improves - First 5K hosted users are enthusiasts and early adopters who pay a premium ($25-30) - When costs drop below $10/user/month, you have room to price at $10-15 and open a much wider funnel - Self-hosted grows the network regardless of whether the hosted tier succeeds **What the price buys:** - Persistent Passepartout environment (shell, editor, browser, agent in one image) - Social protocol identity (DID, PDS, encrypted messaging) - The Gate verifying every action - Org data you own, in a format you own - AI tokens are NOT included — user brings own API key **Buyer profile:** - Already spends $10-200/month on LLM API keys - Values a verified, persistent environment over ephemeral chatbot sessions - Wants to own data and identity - Can't or won't self-host - Developer, researcher, or knowledge worker **The non-obvious constraint:** The addressable market at $25-30/month is narrower (0.1-0.5% of agent users) than at $10-15/month (1-5%). The hockey stick in user growth depends on infrastructure costs dropping far enough to price at consumer-friendly levels without burning capital.