153 lines
6.4 KiB
Org Mode
153 lines
6.4 KiB
Org Mode
:PROPERTIES:
|
|
:CREATED: [2026-05-31 Sun]
|
|
:ID: b2c3d4e5-f6a7-8b9c-0d1e-2f3a4b5c6d7e
|
|
:END:
|
|
#+title: Pass as a Service
|
|
#+filetags: :passepartout:strategy:hosting:economics:
|
|
#+STATUS: draft
|
|
|
|
* Assumptions
|
|
|
|
- Neurosymbolic Agent instance: one SBCL Lisp process with Gate + PDS + environment in one address space
|
|
- User brings own LLM API key — no AI token cost to the provider
|
|
- Containerized: Docker image running on cloud VMs (AWS spot instances)
|
|
- Instances are embarrassingly parallel — no cross-instance coordination
|
|
|
|
* Unit cost per idle instance
|
|
|
|
At rest: ~500 MB-1 GB RAM. Active peaks at 2-3 GB. CPU at rest negligible.
|
|
|
|
* Packing density
|
|
|
|
AWS r6a instances (AMD, good price-to-RAM):
|
|
- r6a.4xlarge (128 GB RAM): ~$0.23/hr spot, ~80 instances per VM
|
|
- r6a.8xlarge (256 GB RAM): ~$0.45/hr spot, ~160 instances per VM
|
|
|
|
Cost per user per month at ~80 instances per 128 GB VM: ~$1.50 for compute.
|
|
|
|
* Infrastructure cost breakdown (100K users)
|
|
|
|
| Component | Detail | $/user/month |
|
|
|-----------|--------|-------------|
|
|
| Compute | r6a spot, 80 instances/VM | ~$1.50 |
|
|
| Storage | 10 GB EBS gp3 per user | ~$0.80 |
|
|
| Egress | Light protocol usage | ~$0.50 |
|
|
| Relay | K8s, stateless web service | ~$0.50 |
|
|
| **Infra subtotal** | | **~$3.30** |
|
|
|
|
| Overhead | | |
|
|
|----------|---|-------------|
|
|
| Engineering | 4-5 people | ~$0.80-1.60 |
|
|
| Support | 2-3 people | ~$0.40-0.80 |
|
|
| **Overhead subtotal** | | **~$1.20-2.40** |
|
|
|
|
| **Total** | | **~$4.50-5.70** |
|
|
|
|
* Pricing and margin
|
|
|
|
At $10/user/month:
|
|
- Cost: ~$4.50-5.70/user/month
|
|
- **Margin: 43-55%**
|
|
|
|
* Scaling inflection points
|
|
|
|
| Users | Provider cost/user/mo | Margin at $10/mo |
|
|
|-------|---------------------|-----------------|
|
|
| 1K | $25-40+ | Negative |
|
|
| 10K | $8-12 | ~0-20% |
|
|
| 50K | $5-7 | ~30-50% |
|
|
| 100K | $4.50-5.70 | ~43-55% |
|
|
| 1M | $2-4 | ~60-80% |
|
|
|
|
10K-20K users is the crossover to positive unit economics. Below that, the team overhead dominates.
|
|
|
|
* Cloud vs colo
|
|
|
|
At small scale (under 10K users): AWS wins. No hardware risk, no colo contract, elastic.
|
|
|
|
At large scale (100K+ users): Colo is 2-5x cheaper per instance. AWS premium comes from degraded packing density (hypervisor overhead, can't overcommit memory).
|
|
|
|
Crossover at roughly 50K-100K users where dedicated ops justify colo.
|
|
|
|
* Architecture
|
|
|
|
Relay on Kubernetes (stateless web service, standard pattern). Instances are Docker containers on raw VMs — one container = one SBCL Lisp process + volume mount for PDS. No orchestration magic needed for the instance layer.
|
|
|
|
The hardest operational problems: port mapping at scale (reverse proxy in front of VM pools) and PDS data persistence on VM failure (EBS snapshots or NFS-backed volumes).
|
|
|
|
* Why this works
|
|
|
|
Three things make the unit economics viable early:
|
|
1. Zero AI token cost (user brings own API key)
|
|
2. The Gate runs even without an LLM — caches common decisions, declines to reason when no key is configured. Not a degraded product, just a non-AI mode.
|
|
3. Docker-on-large-VM packing recovers bare-metal packing density on cloud, avoiding per-instance overhead.
|
|
|
|
* Addressable market
|
|
|
|
**AI chat vs AI agents — orders of magnitude gap:**
|
|
|
|
| Category | Users (Jun 2026) | Notes |
|
|
|----------|-----------------|-------|
|
|
| ChatGPT (chatbot) | 900M weekly active | Mostly text generation |
|
|
| AI agent users (all tools) | 5-10M | Actions, tools, environment control |
|
|
| Ratio | ~100:1 | Not 1000:1 as of mid-2026 |
|
|
|
|
Agent users are 1% of chatbot users today. If agent adoption follows the same growth curve as chatbots but lags by 18-24 months:
|
|
|
|
| Year | Est. agent users | 0.1% capture = users | MRR at $10/mo | Annual rev at 50% margin |
|
|
|------|-----------------|--------------------|--------------|--------------------------|
|
|
| 2026 | 5-10M | 5K-10K | $50K-100K | $300K-600K |
|
|
| 2027 | 50-100M | 50K-100K | $500K-1M | $3M-6M |
|
|
| 2028 | 300-500M | 300K-500K | $3M-5M | $18M-30M |
|
|
|
|
This is conservative — 0.1% capture of the agent market, $10/month (no AI tokens included).
|
|
|
|
Passepartout is not just an AI agent. It's a social protocol, verified computing environment, and knowledge system. It competes on more than agent UX. Even a fraction of the growing agent market funds the infrastructure.
|
|
|
|
* Price ladder
|
|
|
|
The most important constraint: the price users will bear must cover real infrastructure cost at whatever scale you're at. Two tiers solve for both growth and unit economics.
|
|
|
|
**Self-hosted tier (growth engine):**
|
|
- User downloads the image, runs on own hardware or $5-10/mo VPS
|
|
- Brings own API key for LLM access
|
|
- Provider cost: ~$0.50-1/user/month (relay + routing)
|
|
- Zero per-user compute or storage cost to provider
|
|
- Negative margin but negligible — scales to millions freely
|
|
- This is the wedge: proves the protocol, builds the network, costs nothing to operate per user
|
|
|
|
**Hosted tier (revenue engine):**
|
|
- Provider-managed container, user brings API key
|
|
- Packing density drives cost:
|
|
- At small scale (<5K hosted users): cost = $20-25/user/month
|
|
- At mature scale (50K+): cost = $5-7/user/month
|
|
|
|
| Phase | Scale | Hosted cost/mo | Charged price | Margin |
|
|
|-------|-------|---------------|---------------|--------|
|
|
| Bootstrap | <5K | $20-25 | $25-30 | 10-20% |
|
|
| Break-even | 5-20K | $10-15 | $20-25 | 40-50% |
|
|
| Mature | 50K+ | $5-7 | $15-20 | 60-70% |
|
|
| Commodity | 500K+ | $2-4 | $10-15 | 75-85% |
|
|
|
|
**Pricing strategy:**
|
|
- Never price below cost — ramp pricing down as infrastructure efficiency improves
|
|
- First 5K hosted users are enthusiasts and early adopters who pay a premium ($25-30)
|
|
- When costs drop below $10/user/month, you have room to price at $10-15 and open a much wider funnel
|
|
- Self-hosted grows the network regardless of whether the hosted tier succeeds
|
|
|
|
**What the price buys:**
|
|
- Persistent Passepartout environment (shell, editor, browser, agent in one image)
|
|
- Social protocol identity (DID, PDS, encrypted messaging)
|
|
- The Gate verifying every action
|
|
- Org data you own, in a format you own
|
|
- AI tokens are NOT included — user brings own API key
|
|
|
|
**Buyer profile:**
|
|
- Already spends $10-200/month on LLM API keys
|
|
- Values a verified, persistent environment over ephemeral chatbot sessions
|
|
- Wants to own data and identity
|
|
- Can't or won't self-host
|
|
- Developer, researcher, or knowledge worker
|
|
|
|
**The non-obvious constraint:** The addressable market at $25-30/month is narrower (0.1-0.5% of agent users) than at $10-15/month (1-5%). The hockey stick in user growth depends on infrastructure costs dropping far enough to price at consumer-friendly levels without burning capital.
|