Public agent repositories, measured against visible PSF evidence.
PAI scans public GitHub metadata and file paths for signs of production AI discipline: evals, output schemas, observability, deployment gates, human oversight, security policy, and provider resilience. This is evidence coverage, not certification.
Projects are discovered through GitHub repository search, then scanned for visible PSF-aligned evidence in their public file tree. Higher coverage means more evidence was visible to the scanner, not that PAI has certified or endorsed the project.
personal AI runtime, local-first. MCP bridge giving Claude Code 170+ tools (LSP, debugger, terminal, git) inside VS Code, Cursor, Windsurf, or JetBrains. Optional Patchwork layer adds YAML recipes, an approval queue, and an oversight dashboard. Your models, your machine, your policy.
16 starsTypeScriptUpdated May 13, 2026ai-agentanthropicapproval-queue
79%
A
D12/2
D21/2
D32/2
D42/2
D52/2
D61/2
D71/2
D81/2
AI observability instrumentationdashboard/src/app/traces | dashboard/src/app/traces/error.tsx | dashboard/src/app/traces/layout.tsx
Human approval gatesdocs/adr/0006-approval-gate-design.md | src/__tests__/approvalGate.e2e.test.ts | src/__tests__/approvalQueue.test.ts
🤖 MateClaw — Your second brain with Multi-Agent Orchestration, MCP Protocol, Skills & Memory, Dream, and Multi-Channel Support. Built on Spring AI Alibaba.
454 starsJavaUpdated May 13, 2026agentai-agentdingtalk-robot
65%
A
D11/2
D22/2
D31/2
D41/2
D52/2
D61/2
D71/2
D81/2
AI observability instrumentationmateclaw-server/src/main/resources/skills/popular-web-designs/templates/sentry.md
Human approval gatesmateclaw-server/src/main/java/vip/mate/approval/ApprovalWorkflowService.java | mateclaw-server/src/main/java/vip/mate/approval/event/WorkflowApprovalResolvedEvent.java | mateclaw-server/src/main/java/vip/mate/workflow/runtime/ApprovalResumeBridge.java
Give your short business requirement, follow instructions, answer a few questions, get your specification, codes, tests.
28 starsHTMLUpdated May 13, 2026aiai-agentrequirements-engineering
23%
B
D11/2
D20/2
D31/2
D40/2
D51/2
D60/2
D71/2
D80/2
Security policy and secret hygiene.trae/skills/visual-spec/prompts/vspec_detail/data_permission.md | .trae/skills/visual-spec/prompts/vspec_detail/rbac.md | docs/en-US/tools/access-control-rbac.md
Prompt, policy, or model versioning.trae/skills/visual-spec/prompts/harness | .trae/skills/visual-spec/prompts/harness/post_append_test_coverage_check.md | .trae/skills/visual-spec/prompts/harness/post_impl_verify.md
Local-first AI desktop agent for Windows, macOS, Linux & Android. Codework, multi-agent teams, desktop automation, 15+ AI providers. No Docker. No terminal. AI Companion. Agent Skills (SKILL.md). Migration-Importer, BYOK, from 6 to 60+. Recurring Autonomous AI Agent Tasks.
929 starsTypeScriptUpdated May 13, 2026agentic-aiai-agentai-assistant
15%
C
D10/2
D20/2
D30/2
D41/2
D50/2
D60/2
D71/2
D80/2
AI observability instrumentationapps/web/src/app/api/telemetry | apps/web/src/app/api/telemetry/ping | apps/web/src/app/api/telemetry/ping/route.ts
Security policy and secret hygieneapps/web/src/actions/secrets.ts
A 30-day public U.S. stock challenge: follow a 5000 HKD 🦞 claw through live market days.
36 starsJavaScriptUpdated May 13, 2026ai-agentalgorithmic-tradingautotrader
12%
C
D10/2
D21/2
D30/2
D41/2
D50/2
D60/2
D70/2
D80/2
Schema or contract validationxhs-agent/schemas/public-snapshot.schema.json | xhs-agent/schemas/xhs-post-package.schema.json
Incident or drift evidencedocs/incidents | docs/incidents/.gitkeep
Unauthenticated GitHub runs are capped at 20 repositories. Set GITHUB_TOKEN to benchmark up to 100.
Where the repository list comes from
The benchmark uses GitHub's public repository search endpoint and rotates focused queries for AI agent, agentic AI, LLM agent, and MCP server repositories. The run de-duplicates repositories, excludes archived projects and forks when GitHub returns those flags, and sorts the published table by visible PSF evidence coverage.
Add a server-side GITHUB_TOKEN or GH_TOKEN, then request /api/agent-readiness/benchmark?limit=100. Without a token the public route is capped lower to respect GitHub rate limits. The scanner still reads only public repository metadata and file paths.
Use the benchmark page for a live public sample.
Use the API route for scheduled monthly reports.
Use opt-in reports and badges for maintainers who want a public profile.