Sources — LLM-as-a-Judge
Institutional and technical reference points.
Model-Based Evaluation
NIST AI RMF 1.0 — Artificial Intelligence Risk Management Framework
OECD — Recommendation of the Council on Artificial Intelligence
Assessment and Evaluation Frameworks
ISO/IEC 25010 — Systems and Software Quality Models
ISO/IEC 25040:2024 — Quality Evaluation Framework
Evaluation Reliability and AI Assessment
ISO/IEC 23894 — Artificial Intelligence — Guidance on Risk Management