Locked & verifiable.
A pre-registered claim, anchored to a SHA-256 hash before the run. Anyone can re-derive it from the canonical bytes below.
SHA-256
7e93bc921b27cfeb0396418d3ecc3a8e456ce0cdb6620d83637ec673b93877c2
Registered
2026-05-15T18:40:37.812Z
Submitted by @falsify-seed
Manifest preview
version: "prml/0.1"
claim_id: "01900b40-0000-7a02-8000-00000000007c"
created_at: "2024-07-02T14:00:00Z"
metric: "pass_at_k"
metric_args:
  k: 1
  shots: 0
comparator: ">="
threshold: 0.92
dataset:
  id: "openai-humaneval"
  hash: "a3f5c8e2b1d9f7c4e6a8b0d2f4c6e8a0b2d4f6c8e0a2b4d6f8c0e2a4b6d8f0c2"
  uri: "https://github.com/openai/human-eval"
model:
  id: "claude-3-5-sonnet-20240620"
seed: 0
producer:
  id: "falsify.dev"
notes: "Retroactive demo lock — original claim is Anthropic's, not falsify.de…
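The manifest pairs a metric with a comparator and a threshold, so a verdict reduces to one comparison against the observed score. The sketch below illustrates that idea only. The `verdict` function, the `COMPARATORS` table, and the set of supported comparator strings are assumptions made for illustration; the page does not describe how falsify.dev actually evaluates a claim.

```python
import operator

# Hypothetical comparator table; only ">=" appears in the manifest above.
COMPARATORS = {
    ">=": operator.ge,
    ">": operator.gt,
    "<=": operator.le,
    "<": operator.lt,
    "==": operator.eq,
}


def verdict(observed: float, comparator: str, threshold: float) -> bool:
    """Return True when the observed metric satisfies the locked claim."""
    try:
        compare = COMPARATORS[comparator]
    except KeyError:
        raise ValueError(f"unsupported comparator: {comparator!r}") from None
    return compare(observed, threshold)


if __name__ == "__main__":
    # Values taken from the manifest preview: comparator ">=", threshold 0.92.
    # The observed scores here are made-up inputs, not reported results.
    print(verdict(0.93, ">=", 0.92))
    print(verdict(0.90, ">=", 0.92))
```

Because the comparator and threshold are fixed in the manifest before the run, the only free input at verification time is the observed score.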
README badge
[](https://registry.falsify.dev/7e93bc921b27cfeb0396418d3ecc3a8e456ce0cdb6620d83637ec673b93877c2)
Verify in CI
- uses: studio-11-co/prml-verify-action@v1
  with:
    mode: verdict
    expected-hash: 7e93bc921b27cfeb0396418d3ecc3a8e456ce0cdb6620d83637ec673b93877c2
github.com/studio-11-co/prml-verify-action →
Verify this hash yourself
Paste your manifest YAML. The canonical hash must match 7e93bc921b27….
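A local check can be sketched in a few lines of Python. This is a minimal sketch that assumes you already hold the exact canonical bytes of the manifest; the canonicalization rules are not given on this page, so the code hashes the bytes as supplied and does not try to normalize YAML. The names `sha256_hex` and `matches_expected` are hypothetical helpers, not part of any falsify.dev tooling.

```python
import hashlib
import hmac


def sha256_hex(data: bytes) -> str:
    """Return the lowercase hex SHA-256 digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()


def matches_expected(manifest_bytes: bytes, expected_hex: str) -> bool:
    """Compare the digest of manifest_bytes with an expected hex hash.

    Assumption: manifest_bytes are already in canonical form. Any change in
    whitespace, key order, or line endings produces a different digest.
    """
    actual = sha256_hex(manifest_bytes)
    return hmac.compare_digest(actual, expected_hex.strip().lower())


if __name__ == "__main__":
    import sys

    # Usage: python verify_hash.py manifest.yaml <expected-sha256-hex>
    path, expected = sys.argv[1], sys.argv[2]
    with open(path, "rb") as fh:  # read raw bytes, no newline translation
        ok = matches_expected(fh.read(), expected)
    print("match" if ok else "mismatch")
    sys.exit(0 if ok else 1)
```

Reading the file in binary mode matters here: text mode may translate line endings, which would change the digest even when the manifest content looks identical.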
v0.2 RFC open for comment until 2026-05-22 23:59 UTC — comment on github.com/studio-11-co/falsify/issues. How to reach the editor: spec.falsify.dev/editor.