<- Production AI Graph
benchmark event
Claude Sonnet 4.6 — Q2 2026 Lab benchmark
Claude Sonnet 4.6 scored 79/100 overall in the Q2 2026 PAI Lab PSF reliability index. Highest human oversight trigger accuracy in the current cohort. Observability logging incomplete under high-load simulation. Consistent refusal behaviour.
Confidence
82%
Sources
2
Entities
2
Detected
30 Apr 2026
Event summary
Claude Sonnet 4.6 scored 79/100 overall in the Q2 2026 PAI Lab PSF reliability index. Highest human oversight trigger accuracy in the current cohort. Observability logging incomplete under high-load simulation. Consistent refusal behaviour.
D1D2D3D4D6D5D7D8
Linked entities
- Claude Sonnet 4.6
model | 82%
- Anthropic
vendor | 70%
Related graph edges
| Edge | Type | Confidence |
|---|---|---|
| ent-lab-model-claude-sonnet-4-6 to ent-psf-d1 | maps to | 68% |
| ent-lab-model-claude-sonnet-4-6 to ent-psf-d2 | maps to | 68% |
| ent-lab-model-claude-sonnet-4-6 to ent-psf-d3 | maps to | 68% |
| ent-lab-model-claude-sonnet-4-6 to ent-psf-d4 | maps to | 68% |
| ent-lab-model-claude-sonnet-4-6 to ent-psf-d5 | maps to | 68% |
| ent-lab-model-claude-sonnet-4-6 to ent-psf-d6 | maps to | 68% |
| ent-lab-model-claude-sonnet-4-6 to ent-psf-d7 | maps to | 68% |
| ent-lab-model-claude-sonnet-4-6 to ent-psf-d8 | maps to | 68% |
| ent-vendor-anthropic to ent-product-claude | provides | 80% |
| ent-vendor-anthropic to ent-lab-model-claude-sonnet-4-6 | provides | 75% |