Combined all three under verification-monopoly.org with title: 'The Evaluation Harness — Collective Regression Suite as Certification Monopoly' Structure: (1) vision from monopoly, (2) service from harness, (3) spec from collective-regression. All three IDs preserved in PROPERTIES. Deleted evaluation-harness.org and collective-regression-suite.org.
28 lines
2.0 KiB
Org Mode
28 lines
2.0 KiB
Org Mode
:PROPERTIES:
|
|
:CREATED: [2026-05-24 Sun]
|
|
:ID: efc76898-03f7-57ba-923d-35d65da88bb7
|
|
:END:
|
|
#+title: The Per-Domain Sufficiency Flip
|
|
#+filetags: :passepartout:sufficiency:flip:domain:knowledge:
|
|
|
|
The sufficiency flip is not a single event — it happens independently for each domain, and some domains never flip.
|
|
|
|
**Knowledge types required:**
|
|
- **Structural** (published rules, configs, specs) — LLM translation + ACL2 verification. Flips in days.
|
|
- **Empirical** (what happens when X?) — active sandboxed probing. Flips in days to weeks.
|
|
- **Performance** (latency, throughput) — benchmark harness. Flips in hours.
|
|
- **Transfer** from related domains — ontology alignment. Flips in days.
|
|
- **Tacit** (craft expertise, organizational culture) — requires Phase 3 archivist loop over time. May never fully flip.
|
|
- **Aesthetic** (poetry, creative writing) — never flips. The system is honest about its frontier.
|
|
|
|
**Fastest path to flip any domain:**
|
|
1. Ingest all published text (hours)
|
|
2. Run benchmark harness (hours)
|
|
3. Run active sandboxed probes (automated)
|
|
4. Generate contrastive queries for the 5% uncertain rules (one human session, a few hours)
|
|
5. Start serving real interactions (empirical loop tightens from first interaction)
|
|
|
|
The macroeconomic [[id:5f55bbe6-d243-5766-8ccf-5c5cc88a6542][impact on the AI and GPU industry]] — where token demand compresses and verification hardware emerges — is the industry-level expression of this per-domain sufficiency flip.
|
|
|
|
For the Lisp Machine bootstrap, every subdomain is software (the most codifiable domain). The entire bootstrap can flip in days to weeks with one human review session. The [[id:45ea493b-94ad-5885-aa65-0c846e5c3c1d][gate rule encoding]] process feeds directly into this: each domain's rules are formally encoded and verified. The [[id:dc2e4f22-1c4c-5d4a-a151-f96e5d3b0d70][time estimates]] for the overall project are derived from the time to flip each subdomain. The [[id:0b5a8a74-cfd6-542d-bc88-4eb3cd8626f9][cost structure]] shifts from LLM-token-heavy to verification-heavy as more domains flip.
|