EAI Evals Handbook

Resources & Toolkit

Interactive tools, checklists, and templates to speed up your eval workflow. Each resource is a separate page, and together they form a complete toolkit for production AI evaluation.


Templates & Worksheets

Readiness Checklist

32-point interactive launch checklist

Golden Set Template

Curate your first test dataset

Weekly Report

Template for stakeholder updates

Consequence Scoring

Prioritize risks by impact

Calculators & Tools

LLM Judge Rubric

Builder for scoring criteria

Maturity Assessment

Score your team's eval capabilities

ROI Calculator

Quantify the monetary value of evals

Vendor Comparison

Feature matrix for top eval tools

Prompt Engineering

Cheatsheet for judge prompts