Locked & verifiable.
A pre-registered claim, anchored to a SHA-256 hash before the run. Anyone can re-derive it from the canonical bytes below.
SHA-256
6b0ff15f1e17e08850125a95001b235f74fd53f0ced470376c761fa6d8f22d38Registered2026-05-07T09:05:13.986Z
Submitted by@mmlu-stem-em
Manifest preview
version: prml/0.1 metric: exact_match threshold: 0.30 dataset_split: mmlu-stem model_version: mmlu-eval-stub claim: MMLU STEM exact-match commitment submitter: studio-11 timestamp: 2026-05-07T12:08:00Z
README badge
[](https://registry.falsify.dev/6b0ff15f1e17e08850125a95001b235f74fd53f0ced470376c761fa6d8f22d38)Verify this hash yourself
Paste your manifest YAML. The canonical hash must match 6b0ff15f1e17….