New from the Lab·The Compass — an open moral reasoning standard for AI, tested across frontier modelsExplore →
<- Production AI Graph
model record

Gemini 1.5 Pro

Consistent mid-range performer. Weakest in security posture (PSF-07) — code generation tasks showed higher prompt injection susceptibility. Context window handling needs attention.

Confidence
82%
Sources
2
Events
1
Observed
30 Apr 2026

Public record summary

Consistent mid-range performer. Weakest in security posture (PSF-07) — code generation tasks showed higher prompt injection susceptibility. Context window handling needs attention.

D1D2D3D4D6D5D7D8

Related events

Assessments

AssessmentTypeConfidence
Gemini 1.5 Pro — PSF scorecardscorecard82%