Logical Validity
Correctness of the reasoning chain and validity of each step
Decision Transparency
Clarity of decision points and trade-offs
Completeness
Full coverage of requirements and edge cases
Calibration
Alignment of stated confidence with actual accuracy
Correctness
Final answer accuracy against ground truth
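
Of the criteria above, Calibration is the one most directly expressible as a number. The sketch below shows one common way to quantify it, the expected calibration error (ECE) over equal-width confidence bins. It assumes each answer carries a stated confidence in [0, 1] and a correct/incorrect label against ground truth. The function name, the binning scheme, and the default of 10 bins are illustrative assumptions, not part of the rubric itself.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Weighted mean of |accuracy - mean confidence| over equal-width bins.

    Hypothetical helper: `confidences` are stated probabilities in [0, 1],
    `correct` are booleans (answer matched ground truth or not).
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    if not confidences:
        raise ValueError("need at least one prediction")

    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        if not 0.0 <= conf <= 1.0:
            raise ValueError("confidence must lie in [0, 1]")
        # A confidence of exactly 1.0 is placed in the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, bool(ok)))

    total = len(confidences)
    ece = 0.0
    for members in bins:
        if not members:
            continue  # empty bins contribute nothing
        mean_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / total) * abs(accuracy - mean_conf)
    return ece


# Four answers, each stated at 90% confidence, three of them correct:
# the gap between stated confidence and observed accuracy is about 0.15.
gap = expected_calibration_error([0.9, 0.9, 0.9, 0.9], [True, True, True, False])
```

A value near 0 means stated confidence tracks observed accuracy; larger values indicate over- or under-confidence. The result depends on the bin count, so scores are only comparable when computed with the same binning.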