incerto.llm.calibration_error

incerto.llm.calibration_error(confidences, correctness, n_bins=10)

Compute Expected Calibration Error (ECE) and Maximum Calibration Error (MCE).

Parameters:
  • confidences (Tensor) – Confidence scores, shape (batch,)

  • correctness (Tensor) – Binary correctness indicators, shape (batch,), one per confidence score

  • n_bins (int) – Number of bins (default: 10)

Return type:

dict

Returns:

Dictionary containing the ECE and MCE values
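
Example:

The following is a minimal reference sketch of how ECE and MCE are commonly computed. It is not the incerto implementation. It assumes equal-width bins over [0, 1], works on plain Python sequences rather than tensors, and returns a dict with the hypothetical keys "ece" and "mce"; the helper name reference_calibration_error is also hypothetical. The binning convention and key names used by the library may differ.

```python
def reference_calibration_error(confidences, correctness, n_bins=10):
    """Illustrative ECE/MCE with equal-width bins (not the library's code)."""
    if len(confidences) != len(correctness):
        raise ValueError("confidences and correctness must have the same length")
    n = len(confidences)
    if n == 0:
        raise ValueError("at least one sample is required")

    # Each bin accumulates [sample count, sum of confidences, sum of correctness].
    bins = [[0, 0.0, 0.0] for _ in range(n_bins)]
    for conf, correct in zip(confidences, correctness):
        if not 0.0 <= conf <= 1.0:
            raise ValueError("confidences are assumed to lie in [0, 1]")
        # Assumed convention: bin i covers [i/n_bins, (i+1)/n_bins);
        # a confidence of exactly 1.0 goes into the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx][0] += 1
        bins[idx][1] += conf
        bins[idx][2] += correct

    ece, mce = 0.0, 0.0
    for count, conf_sum, correct_sum in bins:
        if count == 0:
            continue  # empty bins contribute nothing
        # Gap between the bin's accuracy and its mean confidence.
        gap = abs(correct_sum / count - conf_sum / count)
        ece += (count / n) * gap  # ECE: gaps weighted by bin population
        mce = max(mce, gap)       # MCE: the largest gap over all bins
    return {"ece": ece, "mce": mce}  # hypothetical key names


result = reference_calibration_error([0.95, 0.85, 0.75, 0.25], [1, 1, 0, 0])
print(round(result["ece"], 6), result["mce"])
```

In this sketch each sample falls into its own bin, so ECE is the mean of the four per-sample gaps and MCE is the largest of them (the overconfident 0.75 prediction that turned out wrong).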