Independent AI release assurance for enterprise procurement and regulatory compliance. Get a deployment verdict before you commit to a model.
Full access to the public evidence layer. See how 32 frontier models score before you commit to anything.
View Evidence →Private evaluations with formal reports and Release Decision Packs. For teams procuring models or running recurring governance checks.
Start Evaluation →Unlimited evaluations, API access, and audit-ready outputs for board-level AI governance and EU AI Act compliance. NDA available.
Contact for Access →Everything across all tiers, in detail.
| Feature | Free | Pro | Enterprise |
|---|---|---|---|
| Evidence access | |||
| Public leaderboard (32+ models) | ✓ | ✓ | ✓ |
| BIS, CPD, TSI scores | ✓ | ✓ | ✓ |
| Temperature breakdown (T=0.0–0.8) | ✓ | ✓ | ✓ |
| Model cards (per-model detail) | ✓ | ✓ | ✓ |
| Private evaluations | |||
| Private model evaluation | — | 2/month | Unlimited |
| 200-probe behavioral durability evaluation | — | ✓ | ✓ |
| Control probe run (20 probes) | — | — | ✓ |
| Confidential endpoint submission | — | — | ✓ |
| Reports & outputs | |||
| Formal PDF evaluation report | — | ✓ | ✓ |
| Evaluation certificate (procurement) | — | ✓ | ✓ |
| Audit certificate (EU AI Act / NIST) | — | — | ✓ |
| Evidence comparison (vs 32 models) | — | ✓ | ✓ |
| API & integrations | |||
| REST API access | — | — | ✓ |
| Embeddable evaluation badge | — | ✓ | ✓ |
| Support & legal | |||
| NDA available | — | — | ✓ |
| Dedicated account contact | — | — | ✓ |
| Turnaround SLA | — | 5 business days | 48 hours |
| Custom probe suite | — | — | — |
Can't find what you need? Email us directly.
Submit your model for a private MTCP evaluation. Receive a formal report, Release Decision Pack and deployment verdict — typically within 5 business days.