New from the Lab·The Compass — an open moral reasoning standard for AI, tested across frontier modelsExplore →
<- Production AI Graph
pai source

Claude Sonnet 4.6 — PAI Lab PSF assessment

Source evidence cited by graph records. This page shows where the source is used, its trust tier, and when it was last checked in the seed.

Trust tier
pai assessment
Linked records
4
Edges
9
Checked
1 July 2026

Records using this source

  • Anthropic

    entity | 15 June 2026 | 70%

    Anthropic — vendor tracked in the Production AI Institute AI Data Use Index.

  • Claude Sonnet 4.6

    entity | 30 Apr 2026 | 82%

    Highest human oversight trigger accuracy in the current cohort. Observability logging incomplete under high-load simulation. Consistent refusal behaviour.

  • Claude Sonnet 4.6 — Q2 2026 Lab benchmark

    event | 30 Apr 2026 | 82%

    Claude Sonnet 4.6 scored 79/100 overall in the Q2 2026 PAI Lab PSF reliability index. Highest human oversight trigger accuracy in the current cohort. Observability logging incomplete under high-load simulation. Consistent refusal behaviour.

  • Claude Sonnet 4.6 — PSF scorecard

    entity | 30 Apr 2026 | 82%

    79/100 overall. Highest human oversight trigger accuracy in the current cohort. Observability logging incomplete under high-load simulation. Consistent refusal behaviour.