Arena Tutorials
Step-by-step learning paths to master PromptArena from beginner to advanced.
Learning Path
Work through these tutorials in order for the best learning experience:
Beginner
Install PromptArena, write your first test scenario, configure a provider, and run tests. Perfect starting point for newcomers.
Time: 20 minutes
Intermediate
Run the same scenario against OpenAI, Claude, and Gemini. Learn to compare providers and balance cost against quality.
Time: 30 minutes
Build complex conversation flows with context retention. Test customer support dialogs, information gathering, and conversation branching.
Time: 45 minutes
Advanced
Test LLMs that use Model Context Protocol (MCP) tools for database queries, API calls, and system interactions.
Time: 45 minutes
Integrate Arena into GitHub Actions, GitLab CI, and other pipelines. Set up quality gates and deployment automation.
Time: 60 minutes
Test bidirectional voice conversations with real-time audio streaming. Learn duplex mode, turn detection, and self-play with TTS.
Time: 45 minutes
What’s the Difference?
Tutorials are learning-oriented lessons that guide you through completing projects:
- ✅ “Build your first LLM test from scratch”
- ✅ “Set up multi-provider testing step-by-step”
- ✅ “Create a CI/CD pipeline for LLM tests”
Looking for something else?
- How-To Guides - Task-focused recipes for specific problems
- Explanation - Conceptual understanding and design decisions
- Reference - Complete technical specifications
Prerequisites
Before starting, ensure you have:
- Go 1.23+ installed
- Git for cloning the repository
- At least one LLM API key (OpenAI, Anthropic, or Google)
- Basic command-line knowledge
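The checklist above can be verified with a short script before you start. The following Go program is a minimal sketch, not part of PromptArena: the helper names `goVersionOK` and `hasAnyKey` are hypothetical, and the environment variable names are assumptions based on common SDK conventions, so check the provider configuration docs for the names Arena actually reads.

```go
package main

import (
	"fmt"
	"os"
	"os/exec"
	"regexp"
	"strconv"
)

// goVersionRe extracts the major and minor numbers from output such as
// "go version go1.23.4 linux/amd64".
var goVersionRe = regexp.MustCompile(`go(\d+)\.(\d+)`)

// goVersionOK reports whether the given `go version` output names Go 1.23 or newer.
func goVersionOK(output string) bool {
	m := goVersionRe.FindStringSubmatch(output)
	if m == nil {
		return false
	}
	major, _ := strconv.Atoi(m[1])
	minor, _ := strconv.Atoi(m[2])
	return major > 1 || (major == 1 && minor >= 23)
}

// hasAnyKey reports whether at least one of the named variables is non-empty.
// The lookup function is injected so the check can be exercised without
// touching the real environment.
func hasAnyKey(getenv func(string) string, names []string) bool {
	for _, name := range names {
		if getenv(name) != "" {
			return true
		}
	}
	return false
}

func main() {
	out, err := exec.Command("go", "version").Output()
	fmt.Println("Go 1.23+ installed:", err == nil && goVersionOK(string(out)))

	_, err = exec.LookPath("git")
	fmt.Println("Git on PATH:", err == nil)

	// Assumed variable names; adjust to match your provider configuration.
	keys := []string{"OPENAI_API_KEY", "ANTHROPIC_API_KEY", "GEMINI_API_KEY", "GOOGLE_API_KEY"}
	fmt.Println("At least one API key set:", hasAnyKey(os.Getenv, keys))
}
```

Save it as `main.go` in an empty directory and run it with `go run main.go`. Any line that prints `false` points to a prerequisite to fix before starting the first tutorial.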
Getting Help
Stuck on a tutorial?
- Check the How-To Guides for specific solutions
- Review the Reference Documentation for complete specs
- Visit GitHub Discussions
- Check back for the FAQ (coming soon)
What You’ll Build
By completing all tutorials, you’ll have:
- ✅ A working LLM test suite
- ✅ Multi-provider comparison tests
- ✅ Complex conversation flows
- ✅ Tool/function calling tests
- ✅ Automated CI/CD pipeline
- ✅ Voice/duplex streaming tests
Ready to get started? Begin with Tutorial 1 →