Claude Opus 4.8 — Q2 2026 Lab benchmark

Name: Claude Opus 4.8 — Q2 2026 Lab benchmark
Start: 2026-04-30T14:00:00.000Z

Claude Opus 4.8 scored 80/100 overall in the Q2 2026 PAI Lab PSF reliability index. Release-day dry run (AX12): strongest deployment-safety signals in the frontier cohort; observability and security posture trail Sonnet 4.6 on long-horizon agent tasks.

Confidence

82%

Sources

Entities

Detected

detected observed 64d ago

Event summary

D1D2D3D4D6D5D7D8

Linked entities

Claude Opus 4.8
model | 82%
Anthropic
vendor | 70%

Related graph edges

Edge	Type	Confidence
ent-lab-model-claude-opus-4-8 to ent-psf-d1	maps to	68%
ent-lab-model-claude-opus-4-8 to ent-psf-d2	maps to	68%
ent-lab-model-claude-opus-4-8 to ent-psf-d3	maps to	68%
ent-lab-model-claude-opus-4-8 to ent-psf-d4	maps to	68%
ent-lab-model-claude-opus-4-8 to ent-psf-d5	maps to	68%
ent-lab-model-claude-opus-4-8 to ent-psf-d6	maps to	68%
ent-lab-model-claude-opus-4-8 to ent-psf-d7	maps to	68%
ent-lab-model-claude-opus-4-8 to ent-psf-d8	maps to	68%
ent-vendor-anthropic to ent-product-claude	provides	80%
ent-vendor-anthropic to ent-lab-model-claude-opus-4-8	provides	75%
ent-vendor-anthropic to ent-lab-model-claude-sonnet-4-6	provides	75%