Video tutorial: Introducing Judges for AI Response Quality Monitoring

Published March 2, 2026

by Paul Loeb

Judges are a LaunchDarkly capability that automatically evaluates the responses generated by AI models in your applications. Judges score outputs based on metrics like relevance, accuracy, and toxicity, allowing you to monitor quality over time.

AI Configs are called AgentControl configs

LaunchDarkly AI Configs have been rebranded as AgentControl and AgentControl configs. Some references in this topic may be outdated.

In this video, you will learn how to:

Attach multiple judges to a config
Customize judges to fit your business needs
Control costs by adjusting sampling percentages in different environments
Track metrics to detect regressions and compare variations

Introducing Judges

Learn more

Online evaluations
Custom judges
Evaluate LLM Code Generation with Custom Judges and Online Evals