Locked & verifiable.
A pre-registered claim, anchored to a SHA-256 hash before the run. Anyone can re-derive it from the canonical bytes below.
SHA-256
aebdcbce15cb4282123fbe1fb3a20cd5001cf8b695015338e1c09d5b6f1d2a03
Registered 2026-05-15T18:40:39.312Z
Submitted by @falsify-seed
Manifest preview
version: "prml/0.1"
claim_id: "01885a00-0000-7a04-8000-00000000009e"
created_at: "2023-09-01T09:00:00Z"
metric: "bbh_average_accuracy"
metric_args:
  shots: 3
  prompting: "chain_of_thought"
  aggregation: "macro_mean_over_23_tasks"
comparator: ">="
threshold: 0.781
dataset:
  id: "big-bench-hard"
  hash: "b4d2e6f8a0c2e4b6d8f0a2c4e6b8d0f2a4c6e8b0d2f4a6c8e0b2d4f6a8c0e2b4"
  uri: "https://github.com/suzgunmirac/BIG-Bench-Hard"
model:
  id: "palm-2-unicorn"
seed: 0
producer:
  id: "falsify.dev"
not…
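To make the manifest's `aggregation`, `comparator`, and `threshold` fields concrete, here is a minimal Python sketch of how a verdict could be computed from per-task accuracies. It is an illustration under assumptions, not the registry's implementation: the function names `macro_mean` and `claim_holds` are hypothetical, the set of supported comparators is assumed, and macro mean is taken to be the unweighted average of per-task scores.

```python
import operator
from typing import Sequence

# Assumed comparator set; the manifest shown here only uses ">=".
COMPARATORS = {
    ">=": operator.ge,
    ">": operator.gt,
    "<=": operator.le,
    "<": operator.lt,
    "==": operator.eq,
}


def macro_mean(per_task_accuracy: Sequence[float]) -> float:
    """Unweighted mean over tasks: every task counts equally, whatever its size."""
    if not per_task_accuracy:
        raise ValueError("need at least one task score")
    return sum(per_task_accuracy) / len(per_task_accuracy)


def claim_holds(observed: float, comparator: str, threshold: float) -> bool:
    """Return True when `observed <comparator> threshold` is satisfied."""
    try:
        compare = COMPARATORS[comparator]
    except KeyError:
        raise ValueError(f"unsupported comparator: {comparator!r}") from None
    return compare(observed, threshold)
```

For this manifest the check would be `claim_holds(macro_mean(scores), ">=", 0.781)`, where `scores` holds the 23 per-task accuracies from the run.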
README badge
[](https://registry.falsify.dev/aebdcbce15cb4282123fbe1fb3a20cd5001cf8b695015338e1c09d5b6f1d2a03)
Verify in CI
- uses: studio-11-co/prml-verify-action@v1
  with:
    mode: verdict
    expected-hash: aebdcbce15cb4282123fbe1fb3a20cd5001cf8b695015338e1c09d5b6f1d2a03
github.com/studio-11-co/prml-verify-action →
Verify this hash yourself
Paste your manifest YAML. The canonical hash must match aebdcbce15cb….
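The same comparison can be done offline. The Python sketch below hashes a byte string with SHA-256 and compares the digest to the registered value. It assumes you already hold the canonical bytes of the manifest. The canonicalization rules are not reproduced on this page, so hashing the raw YAML text as pasted may not match. The helper names `sha256_hex` and `matches_registered_hash` are hypothetical.

```python
import hashlib
import hmac

# Registered hash from this page.
EXPECTED_HASH = "aebdcbce15cb4282123fbe1fb3a20cd5001cf8b695015338e1c09d5b6f1d2a03"


def sha256_hex(canonical_bytes: bytes) -> str:
    """Hex-encoded SHA-256 digest of the given bytes."""
    return hashlib.sha256(canonical_bytes).hexdigest()


def matches_registered_hash(canonical_bytes: bytes,
                            expected_hex: str = EXPECTED_HASH) -> bool:
    """Compare the digest to the expected hex string, ignoring letter case."""
    return hmac.compare_digest(sha256_hex(canonical_bytes), expected_hex.lower())
```

A typical call would read the canonical form from a file opened in binary mode and pass the result to `matches_registered_hash`. Binary mode matters because any newline or encoding translation changes the digest.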
v0.2 RFC open for comment until 2026-05-22 23:59 UTC — comment on github.com/studio-11-co/falsify/issues. How to reach the editor: spec.falsify.dev/editor.