AI Safety & Agent EvaluationAdvancedAlmost Full

LLM-as-a-Judge for Enterprise AI Evaluation

Tuesday, June 9, 2026 · 10:00 AM – 11:00 AM · Hall D1

When and how to trust LLM judges. We discuss calibration, rubric design, jury-of-models patterns, and pitfalls when using models to evaluate other models in production.

Topic
Evaluation
Time of day
Morning
Speaker
Dr. Yuki Tanaka
Company
Polaris Research
Room
Hall D1
Session ID
S017