AI Safety & Agent Evaluation · Advanced · Almost Full

LLM-as-a-Judge for Enterprise AI Evaluation

Tuesday, June 9, 2026 · 10:00 AM – 11:00 AM · Hall D1

When and how to trust LLM judges. We discuss calibration, rubric design, jury-of-models patterns, and pitfalls when using models to evaluate other models in production.

Topic: Evaluation

Time of day: Morning

Speaker: Dr. Yuki Tanaka

Company: Polaris Research

Room: Hall D1

Session ID: S017