Locked & verifiable.
A pre-registered claim, anchored to a SHA-256 hash before the run. Anyone can re-derive it from the canonical bytes below.
SHA-256
43c20d9e6e55d9ea1db2d8acb7687cdab966f59498d9f637a56bd422f7db6390Registered2026-05-15T18:40:38.328Z
Submitted by@falsify-seed
Manifest preview
version: "prml/0.1" claim_id: "01900b40-0000-7a03-8000-00000000008d" created_at: "2024-05-10T11:00:00Z" metric: "accuracy" metric_args: shots: 5 comparator: ">=" threshold: 0.820 dataset: id: "mmlu-test" hash: "c1f9b6d6a3e7d4b2f0a18c5e7d2b9f4a6c8e1d3b5a7c9e2f4d6b8a0c2e4f6a8b" uri: "https://huggingface.co/datasets/cais/mmlu" model: id: "meta-llama-3-70b-instruct" seed: 0 producer: id: "falsify.dev" notes: "Retroactive demo lock — original claim is Meta's, not falsify.dev's. Dataset ha…
README badge
[](https://registry.falsify.dev/43c20d9e6e55d9ea1db2d8acb7687cdab966f59498d9f637a56bd422f7db6390)Verify in CI
- uses: studio-11-co/prml-verify-action@v1
with:
mode: verdict
expected-hash: 43c20d9e6e55d9ea1db2d8acb7687cdab966f59498d9f637a56bd422f7db6390github.com/studio-11-co/prml-verify-action →Verify this hash yourself
Paste your manifest YAML. The canonical hash must match 43c20d9e6e55….
v0.2 RFC open for comment until 2026-05-22 23:59 UTC — comment on github.com/studio-11-co/falsify/issues. How to reach the editor: spec.falsify.dev/editor.