Fix Untitled titles: convert all YAML frontmatter to org-style headers
This commit is contained in:
@@ -1,16 +1,8 @@
|
||||
---
|
||||
title: The Evaluation Harness
|
||||
type: reference
|
||||
tags: :passepartout:architecture:
|
||||
---
|
||||
|
||||
* The Evaluation Harness
|
||||
:PROPERTIES:
|
||||
:ID: a6dd0808-4366-4bda-be6e-f6d43f2eeab8
|
||||
:ID: design-evaluation-harness
|
||||
:CREATED: [2026-05-07 Wed]
|
||||
:WEIGHT: 40
|
||||
:CREATED: [2026-06-04 Thu]
|
||||
:END:
|
||||
#+title: The Evaluation Harness
|
||||
#+filetags: :passepartout:architecture:
|
||||
|
||||
SOTA parity is meaningless without measurement. A system that claims to match commercial agents must demonstrate it through reproducible benchmarks, not through feature checklists. The evaluation harness is the apparatus by which Passepartout proves its capabilities.
|
||||
|
||||
|
||||
Reference in New Issue
Block a user