Fix Untitled titles: convert all YAML frontmatter to org-style headers

Hermes
2026-06-04 20:56:03 +00:00
parent 284c5bd324
commit 67fac2444b
53 changed files with 142 additions and 471 deletions

@@ -1,16 +1,8 @@
---
title: The Evaluation Harness
type: reference
tags: :passepartout:architecture:
---
* The Evaluation Harness
:PROPERTIES:
:ID: a6dd0808-4366-4bda-be6e-f6d43f2eeab8
:ID: design-evaluation-harness
:CREATED: [2026-05-07 Wed]
:WEIGHT: 40
:CREATED: [2026-06-04 Thu]
:END:
#+title: The Evaluation Harness
#+filetags: :passepartout:architecture:
SOTA parity is meaningless without measurement. A system that claims to match commercial agents must demonstrate it through reproducible benchmarks, not through feature checklists. The evaluation harness is the apparatus by which Passepartout proves its capabilities.