Learn how to run evaluations using the Axiom CLI and interpret the results.
axiom eval
# By evaluation name axiom eval spam-classification # By file path axiom eval src/evals/spam-classification.eval.ts # By glob pattern axiom eval "**/*spam*.eval.ts"
axiom eval --list
axiom eval --debug
# Single flag axiom eval --flag.ticketClassification.model=gpt-4o # Multiple flags axiom eval \ --flag.ticketClassification.model=gpt-4o \ --flag.ticketClassification.temperature=0.3
axiom eval --flags-config=experiments/gpt4.json
✓ spam-classification (4/4 passed) ✓ Test case 1: spam detection ✓ Test case 2: legitimate question Scorers: category-match: 100% (4/4) high-confidence: 75% (3/4) Results: Total: 4 test cases Passed: 4 (100%) Duration: 3.2s Cost: $0.0024 View full report: https://app.axiom.co/your-org/ai-engineering/evaluations?runId=ABC123
Was this page helpful?