Production AI Institute · PSF v1.1 open standard
AI Right-To-KnowAI Data Use IndexCheck My AI ToolsAgent ReadinessPublic BenchmarkContactGlobal standard · Australia/APAC founded
Public ledger · reviewed May 2026

Does this company use my data to train AI?

The AI Data Use Index reads the public record: what companies say about training reuse, opt-outs, retention, human review, and what ordinary people can actually tell from the disclosure in front of them.

This is a transparency index, not legal advice. It records what is publicly stated today and where the public answer is still incomplete.

Check my tools ->Read methodology ->Publish a clearer disclosure ->
Indexed products

What the public can tell from the public record.

OpenAI

ChatGPT

User-controlled

OpenAI says ChatGPT conversations may be used to improve models unless the user turns off model improvement in Data Controls.

Consumer serviceReviewed 2026-05-15
Google

Gemini Apps

Uses data when activity is on

Google says that when Gemini Apps Activity is on, Gemini data is used to improve Google AI with help from human reviewers.

Consumer serviceReviewed 2026-05-15
Microsoft

Microsoft 365 Copilot

Not used to train foundation models

Microsoft says files, communications, prompts, responses, and Microsoft Graph data used with Microsoft 365 Copilot are not used to train foundation models.

Commercial serviceReviewed 2026-05-15
Meta

Meta AI

Uses eligible public data

Meta says it uses public information from adult accounts and interactions with AI at Meta features to develop and improve generative AI models, with a right to object.

Consumer serviceReviewed 2026-05-15
X / xAI

Grok on X

Uses public data and interactions

X says public X data plus interactions, inputs, and results with Grok may be shared with xAI to train and fine-tune Grok and other generative AI models.

Consumer serviceReviewed 2026-05-15
Perplexity

Perplexity

Training on by default for consumers

Perplexity says AI data retention is enabled by default for Free, Pro, and Max users, and that users can turn it off in account settings.

Consumer serviceReviewed 2026-05-15
Anthropic

Claude

User-controlled

Anthropic describes a model-improvement setting for consumer chats and says incognito chats are not used to improve Claude.

Consumer serviceReviewed 2026-05-15
GitHub

GitHub Copilot

No training by default

GitHub says that by default it, its affiliates, and third parties do not use individual-subscriber Copilot data, including prompts, suggestions, and code snippets, for AI model training.

Developer serviceReviewed 2026-05-15
Anysphere

Cursor

Depends on Privacy Mode

Cursor says code is not used for training when Privacy Mode is enabled; when Privacy Mode is off, Cursor may use stored codebase data, prompts, editor actions, and snippets to improve AI features and train models.

Developer serviceReviewed 2026-05-15
Notion

Notion AI

No customer-data training

Notion says it does not use Customer Data, including user content under personal terms, or permit others to use it to train the machine-learning models used to provide Notion AI.

Commercial serviceReviewed 2026-05-15
Slack

AI in Slack

No generative-AI training

Slack says Customer Data is not used to train generative AI models unless the customer gives affirmative opt-in consent.

Commercial serviceReviewed 2026-05-15
Zoom

Zoom AI Companion

No customer-content training

Zoom says it does not use customer audio, video, chat, screen sharing, attachments, or other communications-like customer content to train Zoom or third-party AI models.

Commercial serviceReviewed 2026-05-15
Grammarly

Grammarly

Training control varies by account

Grammarly says individual-account Product Improvement and Training is on by default, while enterprise and certain sales-led accounts have it off by default.

Consumer and commercial serviceReviewed 2026-05-15
Canva

Canva

Depends on privacy settings

Canva says privacy settings control whether general usage data and User Content can improve AI-powered features, and that Canva Education User Content is not used for AI training.

Consumer and commercial serviceReviewed 2026-05-15
Adobe

Adobe Firefly

No customer-content training

Adobe says Firefly does not train on customer data and that Firefly uses commercially safe datasets such as licensed content and public-domain material.

Creative serviceReviewed 2026-05-15