Some Notes of Calibration

Language Models (Mostly) Know What They Know

Introduction

Larger Models are Calibrated on Diverse Multiple Choice Questions

From Calibration to Knowing What You Know

Replacing an Option with ‘None of the Above’ Harms Performance and Calibration

Models are Well-Calibrated on True/False Tasks

RLHF Policy Miscalibration Can Be Remediated with a Temperature Tuning

Ask the AI: Is your proposed answer True or False?

Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback

Introduction

Evaluating Calibration in RLHF-LMs

Results