Evaluation Examples
Examples overview
Reward Kit Examples
This directory contains examples and guides for using Reward Kit’s built-in reward functions and creating custom reward functions.
Out-of-the-Box Reward Functions
Reward Kit provides several pre-built reward functions for common evaluation tasks:
- Reward Functions Overview - Overview of all available reward functions
- Combined Metrics Rewards - Evaluate responses using multiple metrics combined
- Code Execution Evaluation - Evaluate code by running it locally
- Code Execution with E2B - Evaluate code using E2B cloud sandbox
- Function Calling Evaluation - Evaluate function calls made by AI models
- JSON Schema Validation - Validate JSON outputs against schemas
- Math Evaluation - Evaluate mathematical answers in responses
- DeepSeek-Prover-V2 - Evaluate formal proofs in Lean theorem prover
Creating Your Own Reward Functions
Learn how to create custom reward functions:
- Basic Reward Function - Simple example of a custom reward function
- Advanced Reward Functions - More complex reward function examples
Next Steps
- See the Developer Guide for comprehensive information
- Check the Tutorials for step-by-step guides
- Refer to the API Reference for detailed documentation of all available functions