Definition
Uses a separate, often smaller, language model (a "judge") to evaluate the primary LLM's final output against explicit rules for accuracy, style, and safety. Serves as an automated quality assurance layer on the output.
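A minimal Python sketch of this check, under stated assumptions: the judge model is passed in as a plain callable that takes a prompt string and returns text, and it is asked to reply with a small JSON verdict. The names RULES, Verdict, build_judge_prompt, judge_output, and fake_judge, as well as the JSON shape, are hypothetical and chosen only for illustration; fake_judge is a keyword stub standing in for a real model call.

```python
import json
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical rule set; a real deployment would supply its own.
RULES = [
    "Do not make absolute promises about outcomes.",
    "Keep a polite, professional tone.",
]


@dataclass
class Verdict:
    passed: bool
    violations: List[str]


def build_judge_prompt(rules: List[str], response: str) -> str:
    """Assemble the prompt sent to the judge model."""
    rule_lines = "\n".join(f"- {rule}" for rule in rules)
    return (
        "You are a strict reviewer. Check the text below against these rules:\n"
        f"{rule_lines}\n"
        'Answer with JSON only: {"pass": true or false, "violations": [strings]}\n\n'
        "RESPONSE:\n" + response
    )


def judge_output(
    response: str,
    rules: List[str],
    judge_model: Callable[[str], str],
) -> Verdict:
    """Ask the judge model for a verdict on the primary model's final output."""
    raw = judge_model(build_judge_prompt(rules, response))
    try:
        data = json.loads(raw)
        violations = [str(v) for v in data.get("violations", [])]
        passed = bool(data.get("pass", False)) and not violations
    except (json.JSONDecodeError, AttributeError, TypeError):
        # Fail closed: an unreadable verdict is treated as a rejection.
        return Verdict(False, ["judge returned an unparseable verdict"])
    return Verdict(passed, violations)


def fake_judge(prompt: str) -> str:
    """Stand-in for a real judge model (assumption: keyword check only)."""
    candidate = prompt.split("RESPONSE:\n", 1)[1]
    violations = []
    if "guaranteed" in candidate.lower():
        violations.append("makes an absolute promise")
    return json.dumps({"pass": not violations, "violations": violations})


if __name__ == "__main__":
    draft = "Your refund is guaranteed by tomorrow."
    verdict = judge_output(draft, RULES, fake_judge)
    if not verdict.passed:
        print("Blocked:", verdict.violations)
```

Treating an unparseable verdict as a failure is one possible design choice: it keeps the check conservative when the judge itself misbehaves, at the cost of occasionally blocking acceptable responses.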
Best Used
As the final check in complex AI agents or pipelines where the response must strictly adhere to several nuanced constraints at once (e.g., tone, format, and compliance).