promptfoo/promptfoo
Fork: 868 Star: 9914 (updated 2026-01-15 19:10:12)
license: MIT
Language: TypeScript
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
Latest release: 0.120.14 (2026-01-15 02:56:50)
Promptfoo: LLM evals & red teaming
promptfoo is a developer-friendly local tool for testing LLM applications. Stop the trial-and-error approach - start shipping secure, reliable AI apps.
Website · Getting Started · Red Teaming · Documentation · Discord
Quick Start
# Install and initialize project
npx promptfoo@latest init
# Run your first evaluation
npx promptfoo eval
See Getting Started (evals) or Red Teaming (vulnerability scanning) for more.
What can you do with Promptfoo?
- Test your prompts and models with automated evaluations
- Secure your LLM apps with red teaming and vulnerability scanning
- Compare models side-by-side (OpenAI, Anthropic, Azure, Bedrock, Ollama, and more)
- Automate checks in CI/CD
- Review pull requests for LLM-related security and compliance issues with code scanning
- Share results with your team
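To make the first and third items concrete, here is a minimal sketch of what an automated, side-by-side evaluation does conceptually: fill a prompt template with test variables, send it to each provider, and check each output against an assertion. Every name in it (`Provider`, `TestCase`, `render`, `evaluate`, the `stub:` providers) is hypothetical and invented for illustration. This is not promptfoo's API or config format, and the stub providers stand in for real model calls so the example runs offline.

```typescript
// Illustrative only: these types and functions are hypothetical, not promptfoo's API.
type Provider = { id: string; call: (prompt: string) => string };

interface TestCase {
  vars: Record<string, string>;
  expectContains: string; // substring the output must include to pass
}

// Substitute {{name}} placeholders in a prompt template with test variables.
function render(template: string, vars: Record<string, string>): string {
  return template.replace(/\{\{(\w+)\}\}/g, (_match: string, key: string) => vars[key] ?? "");
}

// Run every test case against every provider and return the pass rate per provider.
function evaluate(
  template: string,
  providers: Provider[],
  tests: TestCase[],
): Record<string, number> {
  const passRates: Record<string, number> = {};
  for (const provider of providers) {
    let passed = 0;
    for (const test of tests) {
      const output = provider.call(render(template, test.vars));
      if (output.includes(test.expectContains)) passed += 1;
    }
    passRates[provider.id] = passed / tests.length;
  }
  return passRates;
}

// Stub providers stand in for real model calls so the sketch runs offline.
const echoModel: Provider = { id: "stub:echo", call: (prompt) => prompt };
const shoutModel: Provider = { id: "stub:shout", call: (prompt) => prompt.toUpperCase() };

const summary = evaluate(
  "Say hello to {{name}}",
  [echoModel, shoutModel],
  [
    { vars: { name: "Ada" }, expectContains: "Ada" },
    { vars: { name: "Grace" }, expectContains: "Grace" },
  ],
);
console.log(summary);
```

The echo stub passes both checks while the upper-casing stub fails them, because the substring check here is case-sensitive. A per-provider pass rate like this is the kind of metric a side-by-side comparison rests on.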
Here's what it looks like in action:

It works on the command line too:

It can also generate security vulnerability reports:

Why Promptfoo?
- 🚀 Developer-first: Fast, with features like live reload and caching
- 🔒 Private: LLM evals run 100% locally - your prompts never leave your machine
- 🔧 Flexible: Works with any LLM API or programming language
- 💪 Battle-tested: Powers LLM apps serving 10M+ users in production
- 📊 Data-driven: Make decisions based on metrics, not gut feel
- 🤝 Open source: MIT licensed, with an active community
Learn More
- 📚 Full Documentation
- 🔐 Red Teaming Guide
- 🎯 Getting Started
- 💻 CLI Usage
- 📦 Node.js Package
- 🤖 Supported Models
- 🔬 Code Scanning Guide
Contributing
We welcome contributions! Check out our contributing guide to get started.
Join our Discord community for help and discussion.
Recent releases: (data updated 2026-01-15 19:09:57)
2026-01-15 02:56:50 0.120.14
2026-01-13 09:05:30 0.120.13
2026-01-13 07:42:21 0.120.12
2026-01-10 09:32:17 0.120.11
2026-01-07 05:25:55 0.120.10
2025-12-31 05:47:45 0.120.9
2025-12-22 01:00:53 0.120.8
2025-12-20 03:08:44 0.120.7
2025-12-20 02:25:58 0.120.6
2025-12-17 06:01:15 0.120.5
Topics:
llm testing rag cicd pentesting evaluation-framework llm-evaluation llm-evaluation-framework ci ci-cd prompt-engineering llm-eval llmops vulnerability-scanners evaluation prompts prompt-testing red-teaming
Recently updated repositories in the same language as promptfoo/promptfoo (TypeScript)
2026-01-16 10:28:49 OHIF/Viewers
2026-01-16 08:25:46 Expensify/App
2026-01-16 08:08:59 tinacms/tinacms
2026-01-16 07:25:30 BabylonJS/Babylon.js
2026-01-16 06:47:46 andreasgerstmayr/fava-dashboards
2026-01-16 02:32:17 aws/aws-cdk