New from the Lab·The Compass — an open moral reasoning standard for AI, tested across frontier modelsExplore →
<- Production AI Graph
pai source

GPT-4.1 — PAI Lab PSF assessment

Source evidence cited by graph records. This page shows where the source is used, its trust tier, and when it was last checked in the seed.

Trust tier
pai assessment
Linked records
4
Edges
9
Checked
1 July 2026

Records using this source

  • OpenAI

    entity | 15 June 2026 | 70%

    OpenAI — vendor tracked in the Production AI Institute AI Data Use Index.

  • GPT-4.1

    entity | 30 Apr 2026 | 82%

    Strong on structured output adherence. Notable gap: PII handling in summarisation tasks (PSF-03). Escalation trigger reliability above average.

  • GPT-4.1 — Q2 2026 Lab benchmark

    event | 30 Apr 2026 | 82%

    GPT-4.1 scored 74/100 overall in the Q2 2026 PAI Lab PSF reliability index. Strong on structured output adherence. Notable gap: PII handling in summarisation tasks (PSF-03). Escalation trigger reliability above average.

  • GPT-4.1 — PSF scorecard

    entity | 30 Apr 2026 | 82%

    74/100 overall. Strong on structured output adherence. Notable gap: PII handling in summarisation tasks (PSF-03). Escalation trigger reliability above average.