Quality and trustworthiness assessment of AI-generated responses, including output formatting, context grounding, and communication of uncertainty or knowledge gaps.