hermes-brain/projects/passepartout/architecture/design/validation/gaur--sheth-2023-crest-trustworthy-neurosymbolic-ai.org at 284c5bd324c62f7e36fcad879fa9e85d11141ec1

amr/hermes-brain

Fork 0

Files

Hermes 284c5bd324 Break design/ files into subfolders by ** heading (43 new files)

2026-06-04 20:47:17 +00:00

1.4 KiB

Raw Blame History

Gaur & Sheth (2023): CREST — Trustworthy Neurosymbolic AI

— title: Gaur & Sheth (2023): CREST — Trustworthy Neurosymbolic AI type: reference tags: :passepartout:architecture: —

Gaur & Sheth (2023): CREST — Trustworthy Neurosymbolic AI

Gaur and Sheth present the CREST framework: Consistency, Reliability, user-level Explainability, and Safety build Trust — and they argue these require neurosymbolic methods. Their empirical finding: GPT-3.5 breached safety constraints 30% of the time when asked identical questions repeatedly. Claude's 16 safety rules and Sparrow's 23 rules provide no inherent safety — they are heuristic guardrails that can be breached through prompt variation.

These findings validate three Passepartout design commitments: (1) prompt-level safety is insufficient — deterministic gates run in pure Lisp, cost 0 tokens, and cannot be evaded by prompt engineering; (2) inconsistency is the norm — the cardinality model expects contradiction and surfaces it with provenance; (3) knowledge infusion is required for trust — Passepartout's symbolic index IS the knowledge infusion layer, facts extracted from prose, verified by Screamer, and available for any LLM call.

Reference: Gaur, M., & Sheth, A. (2023). Building Trustworthy NeuroSymbolic AI Systems: Consistency, Reliability, Explainability, and Safety. arXiv:2312.06798.

1.4 KiB Raw Blame History

Gaur & Sheth (2023): CREST — Trustworthy Neurosymbolic AI

1.4 KiB

Raw Blame History