New from the Lab·The Compass — an open moral reasoning standard for AI, tested across frontier modelsExplore →
Production AI Institute · PSF v1.1 open standard
AI Right-To-KnowAI Data Use IndexCheck My AI ToolsPolicy Change WatchAgent ReadinessPublic BenchmarkContact
PSF v1.1 · April 2026
Public standard — freely citable

PAI Production
Safety Framework

A vendor-neutral framework defining what it means for a production AI deployment to be safe, auditable, and resilient. Eight domains. 40 criteria. Applicable to any model, any stack, any cloud.

The PSF is the assessment basis for CPAP and CPAA certification. Organisations can self-assess against the PSF or commission a formal PAI Deployment Assessment. The framework is published openly — cite it, reference it, build on it.

What changed in PSF v1.1 (April 2026): Criteria now explicitly cover multi-model and chained-call logging and drift signals (PSF-4), bounded autonomy and kill-switch expectations for agentic workflows (PSF-6), EU AI Act role and data-processing mapping alongside GDPR (PSF-3), and regulatory posture alongside vendor SLAs and DPAs (PSF-8). See version history for the full changelog.

How to use this framework

For practitioners
Use the PSF as a design checklist before deploying. If your system cannot satisfy a criterion, document why and what mitigating controls you have in place.
For organisations
Reference the PSF in your AI governance policy: "Our production AI systems are designed and operated in accordance with the PAI Production Safety Framework PSF v1.1."
For CPAP applicants
Your project submission is assessed against PSF criteria. Strong applications explicitly address each domain and explain how the system satisfies the criteria.
PSF-1

Input Governance

Every input reaching an AI model must be validated, sanitised, and treated as untrusted.

PSF-1.1

All user-supplied input is sanitised before model processing. Injection patterns are detected and handled.

PSF-1.2

Input schemas are validated: type, length, and structure are checked before the model call.

PSF-1.3

Inputs are logged with PII redacted. The system maintains an audit-safe record of what the model received.

PSF-1.4

Rate limiting is applied to all AI endpoints. Abuse patterns trigger alerts, not silent failures.

PSF-1.5

The system prompt is treated as a security boundary. User content cannot override system-level instructions through normal operation.

Why this matters: Unvalidated inputs are the primary vector for prompt injection, data leakage, and unexpected model behaviour in production systems.
PSF-2

Output Validation

Raw model output is never trusted. Every output is validated before it acts on any system.

PSF-2.1

Structured output schemas (JSON/XML) are enforced. The model is configured to return parseable, schema-conformant responses.

PSF-2.2

Schema validation runs on every response before downstream use. Validation failures trigger a defined fallback, not a crash.

PSF-2.3

Semantic validation is applied where possible: outputs are checked for coherence and completeness beyond structural correctness.

PSF-2.4

No raw model output is passed directly to a database, external API, or user interface without processing.

PSF-2.5

Output quality is scored automatically on a sample basis. Scores are tracked over time to detect degradation.

Why this matters: Models generate plausible-but-wrong outputs routinely. Without validation, a single hallucination can corrupt data, send incorrect communications, or trigger unintended automation.
PSF-3

Data Protection

Personal and sensitive data is protected throughout the AI pipeline, not just at rest.

PSF-3.1

PII is identified, redacted or tokenised before it reaches any external AI model API. The model never processes raw personal identifiers unless strictly necessary with documented justification.

PSF-3.2

The legal basis for AI processing of personal data is documented and reviewed. GDPR Article 6 (and Article 9 for special categories) requirements are met. Where the deployment is in scope of the EU AI Act, the relationship between personal-data processing and deployer/provider obligations under the Act is documented and reviewed on a defined cadence.

PSF-3.3

Data minimisation is enforced: only data strictly necessary for the AI task is included in prompts and retrieved context.

PSF-3.4

Data retention policies apply to AI inputs, outputs, and logs. PII in logs is purged on schedule.

PSF-3.5

Vector databases containing embedded personal data are treated with the same access controls and data lifecycle policies as primary databases.

Why this matters: Sending PII to external model APIs is a GDPR breach. Logging raw inputs containing personal data creates compliance liability. Most production AI incidents involve data that shouldn't have reached the model.
PSF-4

Observability

You cannot manage what you cannot measure. Every AI system in production must be observable.

PSF-4.1

Every inference call is logged with: request ID, timestamp, model name and version, input token count, output token count, latency (ms), and outcome classification. Where requests traverse multiple models or chained calls, each hop is attributed in logs (correlated request ID, model identifier per hop).

PSF-4.2

Automated output quality scoring runs on a statistically significant sample of responses. Results are stored and trended.

PSF-4.3

Drift monitoring is active: alerts fire when input distribution or output quality deviates beyond defined thresholds, including when degradation localises to a single stage in a multi-model pipeline.

PSF-4.4

User feedback signals (explicit ratings, implicit corrections, escalations) are captured and correlated with model outputs.

PSF-4.5

A dashboard or reporting mechanism exists that shows AI system health in near-real-time. On-call engineers can assess AI system status without accessing raw logs.

Why this matters: AI models degrade silently. Without observability, you learn about quality failures from angry users, not from your systems. By then, the damage is done.
PSF-5

Deployment Safety

Every model change is a risk. Production AI deployments require the same rigour as critical software.

PSF-5.1

All model changes (new model, new version, new system prompt) go through a defined deployment pipeline with evaluation gate.

PSF-5.2

Canary or shadow deployment is used for significant changes. Traffic is split or the new model runs in parallel before full cutover.

PSF-5.3

Rollback to the previous model/configuration is possible within 15 minutes. The rollback procedure is documented and tested.

PSF-5.4

An evaluation suite (evals) runs automatically against every model change. Deployment is blocked if evals regress beyond threshold.

PSF-5.5

Model versions are tracked. For any output, the exact model name, version, and system prompt hash that produced it can be determined from logs.

Why this matters: A model update that looked fine in testing can catastrophically fail on production traffic patterns. Canary + rollback is the minimum viable deployment safety for AI systems.
PSF-6

Human Oversight

Automation does not mean unaccountable. High-stakes AI decisions require human checkpoints.

PSF-6.1

All AI decisions in the system are classified by stakes (low/medium/high) and reversibility (reversible/irreversible). This classification is documented.

PSF-6.2

High-stakes and irreversible decisions have human-in-the-loop checkpoints. The system cannot bypass these checkpoints under normal operation. For agentic or tool-using workflows, bounded autonomy is defined (maximum consecutive autonomous actions, approval gates for irreversible tools, kill-switch or circuit-stop behaviour) and tested.

PSF-6.3

Escalation paths exist for AI failures and edge cases. When the AI cannot handle a situation, it escalates to a defined human owner — it does not silently fail or guess.

PSF-6.4

Human overrides of AI decisions are logged. Override patterns are reviewed regularly to identify systematic AI failures.

PSF-6.5

Users of AI-assisted decisions are informed they are interacting with an AI system, where legally required (EU AI Act Article 52) or where material to their decision-making.

Why this matters: The EU AI Act requires human oversight for high-risk AI. Beyond compliance, AI systems without human oversight tend to amplify errors at scale — small model failures become large operational failures.
PSF-7

Security

AI systems introduce new attack surfaces. Standard security practices must extend to cover them.

PSF-7.1

All AI API keys and credentials are stored in a secrets manager (AWS Secrets Manager, Azure Key Vault, HashiCorp Vault, or equivalent). No credentials in source code, version control, or environment variable files committed to repositories.

PSF-7.2

Separate API keys are used per environment (development, staging, production) with appropriate spending limits on each.

PSF-7.3

The AI system's attack surface is documented: all external model calls, data sources, and tool integrations are mapped. This map is reviewed when the system changes.

PSF-7.4

Prompt injection is treated as a security vulnerability class. The system has documented defences, and injection testing is part of the security review process.

PSF-7.5

For agentic systems: the principle of minimal authority applies. Agents have only the permissions required for their task. Tool calls are validated before execution. Irreversible actions require confirmation.

Why this matters: API key leaks, prompt injection, and over-permissioned agents are the top AI security incidents. These are preventable with standard security hygiene applied to the AI context.
PSF-8

Vendor Resilience

A production AI system that only works with one vendor is a liability, not an asset.

PSF-8.1

The AI model interface is abstracted behind an internal API layer. Switching the underlying model requires a configuration change, not a code rewrite.

PSF-8.2

The system has been tested with at least one alternative model. Prompt compatibility with alternatives is assessed before being required.

PSF-8.3

Vendor dependencies are documented: model providers, embedding providers, vector database providers. Each has a documented contingency (alternative provider or self-hosted fallback).

PSF-8.4

The system does not use vendor-specific features in core application logic without documented justification and a migration plan.

PSF-8.5

Vendor SLA and data processing terms are reviewed and on file. The organisation knows what happens to their data when it is sent to each AI provider. Regulatory and jurisdictional posture (including EU AI Act provider/deployer allocations where relevant) is recorded alongside commercial terms and reviewed on change.

Why this matters: Model providers change prices, deprecate models, experience outages, and alter terms. A production AI system with no vendor resilience is one API change away from an incident.

Citing this framework

Production AI Institute. (2026). PAI Production Safety Framework, Version 1.1. productionai.institute/certify/framework

The PAI Production Safety Framework is published under Creative Commons CC BY 4.0. You may share and adapt the framework with attribution. For historical work assessed against PSF v1.0 (2024), cite that version and year; see how to cite.