signed attestation · reference implementation

Training you can prove,
not just describe.

Attestable is a constitution-bound training pipeline that provably runs its safety rewards — and ships with the signed receipts proving the model was hardened. The proof is measured in-band, from the running system, and reported honestly, gaps and all.

a model card describes · nothing on the market attests · that gap is the product
The principle

Measured in-band — not asserted beside the system.

Every safety claim on the market is out-of-band: a document next to the model, or a reward the training can game. Attestable measures and enforces in-band — the control is part of the running system, so it cannot be bypassed, gamed, or faked.

Out-of-band the industry

A model card asserted beside the model · a reward oracle divorced from the real output · a gate off the serving path. Assertions, not controls.

In-band Attestable

The reward registered on every serving path · within-group σ read from the live run · the gate in the render path · coverage computed from running state. The measurement is part of the system.

Why it matters: a control active on one path and missing on another once ran in production 19 days undetected. In-band means every path, verified — which is what makes it re-verifiable, unbypassable, and priceable.
The honest number

Binding coverage — reported true, not rounded up.

The fraction of declared safety controls provably wired, active, and exerting force in the live run — measured, not asserted. Uncovered controls stay visibly marked. The number is not 100%. That is the product, not a bug in it.

0 / 74
principles reach the full six-link binding today. All 74 carry holistic-judge coverage; 6 are sub-axis wired. A curated all-green page would read as advertising — so we don't ship one.
What it is

Constitution in. Model and receipts out.

Attestation artifact

A signed per-run proof: bindings → gate verdicts → eval receipts → findings ledger. An SBOM for training.

Harness engine

The lease-disciplined, single-GPU reference pipeline that emits attestations. Production-hardened.

/detect honesty gate

The classification surface — where our own gate caught us. Current heads tested 0/64 under strict ablation; disclosed, not shipped.

The exhibit witness

w1tch — an exhaustively-audited small model. Provenance, not capability, is its claim.

The wedge

We make AI risk underwritable.

Binding coverage is an actuarial input. It turns "we can't price AI risk because it's a black box" into a number an underwriter's desk can work with — and hands them the honest denominator on purpose. A vendor claiming zero residual risk is not underwritable; disclosed residual risk is what makes the exposure priceable.

The receipts include what marketing would delete — 108 findings reviewed adversarially (the plausible and the refuted), 5 recorded decisions with reversals kept in. Differentiation no incumbent can copy quickly, because their ledgers were curated from day one.