FEA add temperature scaling to CalibratedClassifierCV #31068

Open · wants to merge 91 commits into main

Conversation

@virchan (Member) commented Mar 25, 2025

Reference Issues/PRs

Closes #28574

What does this implement/fix? Explain your changes.

This PR adds temperature scaling to scikit-learn's CalibratedClassifierCV:

Temperature scaling can be enabled by setting `method="temperature"` in `CalibratedClassifierCV`:

from sklearn.datasets import make_classification
from sklearn.frozen import FrozenEstimator
from sklearn.model_selection import train_test_split
from sklearn.calibration import CalibratedClassifierCV
from sklearn.svm import LinearSVC

X, y = make_classification(random_state=42)

X_train, X_calib, y_train, y_calib = train_test_split(X, y, random_state=42)

clf = LinearSVC(random_state=42)
clf.fit(X_train, y_train)
cal_clf = CalibratedClassifierCV(
    FrozenEstimator(clf), method="temperature"
).fit(X_calib, y_calib)

This method supports both binary and multi-class classification.
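Conceptually, temperature scaling divides the model's logits by a single learned temperature `T > 0` before the softmax. The following is a standalone sketch of the idea (Guo et al., ICML 2017) using only NumPy/SciPy — it is not this PR's implementation, and the data is synthetic: labels are drawn from `softmax(z)` while the calibrator is handed deliberately overconfident logits `5 * z`, so it should recover a temperature near 5.

```python
import numpy as np
from scipy.optimize import minimize_scalar
from scipy.special import log_softmax, softmax

# Synthetic multi-class data: labels drawn from softmax(z_true), but the
# "model" reports overconfident scores 5 * z_true.
rng = np.random.default_rng(0)
z_true = rng.normal(size=(500, 3))
labels = np.array([rng.choice(3, p=softmax(row)) for row in z_true])
logits = 5.0 * z_true  # overconfident scores

def nll(log_T):
    # Mean negative log-likelihood of softmax(logits / T); optimising over
    # log(T) keeps the temperature strictly positive.
    scaled = logits / np.exp(log_T)
    return -log_softmax(scaled, axis=1)[np.arange(len(labels)), labels].mean()

T = np.exp(minimize_scalar(nll).x)
proba = softmax(logits / T, axis=1)  # calibrated probabilities
```

Because the logits were inflated by a factor of 5, the fitted `T` lands in that neighbourhood, and the rescaled probabilities are far less extreme than `softmax(logits)`.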

Any other comments?

Cc @adrinjalali, @lorentzenchr in advance.

github-actions bot commented Mar 25, 2025

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 7312b00.

@virchan (Member Author) left a comment

A follow-up to my comment on the Array API: I don't think we can support the Array API here, as scipy.optimize.minimize does not appear to support it.

If I missed anything, please let me know—I'd be happy to investigate further.

@virchan virchan marked this pull request as ready for review March 25, 2025 10:55
@ogrisel (Member) left a comment

Thanks for the PR. Here is a first pass of feedback:

virchan added 4 commits March 27, 2025 18:14
…fier`.

Updated constructor of `_TemperatureScaling` class.
Updated `test_temperature_scaling` in `test_calibration.py`.
Added `__sklearn_tags__` to `_TemperatureScaling` class.
@virchan (Member Author) left a comment

I'm still working on addressing the feedback, but I also wanted to share some findings related to it and provide an update.

@lorentzenchr (Member) left a comment

A few computational things seem off.

virchan added 2 commits April 25, 2025 22:16
Update `minimize` in `_temperature_scaling` to `minimize_scalar`.
Update `test_calibration.py` to check that the optimised inverse temperature is between 0.1 and 10.
@virchan (Member Author) left a comment

There are some CI failures; I'll fix those shortly.

I'm also considering adding a `verbose` parameter to `CalibratedClassifierCV` to optionally display convergence info when optimising the inverse temperature `beta`.

@virchan (Member Author) left a comment

The CI fails when checking that the ROC AUCs are equal up to 7 decimal places. I'll fix it later.

@virchan (Member Author) left a comment

CI passed!

@lorentzenchr (Member) left a comment

Close to the finish line.

@virchan (Member Author) left a comment

I updated the user guide and the docstrings in calibration.py. I also modified the test to check that the temperature parameter is close to 1 when the temperature scaler is fitted on the training set of the LogisticRegression classifier.

There are still some comments that need to be addressed, and I'll work on them later.
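The test described above can be illustrated outside the PR: logistic regression minimises log-loss directly, so rescaling its own decision values should gain almost nothing, and the fitted temperature should sit near 1. The sketch below hand-rolls a binary version of the fit (it is a hypothetical helper, not the PR's `_TemperatureScaling` class).

```python
import numpy as np
from scipy.optimize import minimize_scalar
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Logistic regression already minimises log-loss, so the temperature fitted
# on its own training data should be close to 1.
X, y = make_classification(n_samples=1000, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X, y)
z = clf.decision_function(X)
sign = 2 * y - 1  # +1 for the positive class, -1 otherwise

def nll(log_beta):
    # Binary log-loss of sigmoid(beta * z), in a numerically stable form
    return np.mean(np.logaddexp(0.0, -np.exp(log_beta) * z * sign))

beta = np.exp(minimize_scalar(nll).x)  # inverse temperature
T = 1.0 / beta
```

With default regularisation the weights are only slightly shrunk, so `T` stays close to 1 rather than exactly equal to it.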

@virchan (Member Author) left a comment

I've refactored the part that checks `response_method_name`:

if len(classes) == 2 and predictions.shape[-1] == 1:
    response_method_name = _check_response_method(
        clf,
        ["decision_function", "predict_proba"],
    ).__name__
    if response_method_name == "predict_proba":
        predictions = np.hstack([1 - predictions, predictions])

I think this only needs to be applied in two places: `_fit_calibrator` and `_CalibratedClassifier.predict_proba`. But please let me know if there's a better way to handle this.
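The conversion in the snippet above can be checked in isolation: a binary classifier whose `predict_proba`-style output carries only the positive-class column of shape `(n_samples, 1)` is expanded to the usual two-column layout by stacking the complementary column in front.

```python
import numpy as np

# Positive-class probabilities with shape (n_samples, 1), as a binary
# classifier might return them
p_pos = np.array([[0.2], [0.9], [0.5]])

# Prepend the negative-class column so each row is a full distribution
p_both = np.hstack([1 - p_pos, p_pos])
# p_both has shape (3, 2) and each row sums to 1
```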

I've also moved _temperature_scaling inside _TemperatureScaling.fit.

CI has passed, so it's ready for review!

Comment on lines +1085 to +1119
def _temperature_scaling(predictions, labels, sample_weight=None):
    """Calibrate the temperature of temperature scaling.

    Parameters
    ----------
    predictions : ndarray of shape (n_samples,) or (n_samples, n_classes)
        The output of `decision_function` or `predict_proba`. If the input
        appears to be probabilities (i.e., values between 0 and 1 that sum to 1
        across classes), it will be converted to logits using `np.log(p + eps)`.

        Binary decision function outputs (1D) will be converted to two-class
        logits of the form (-x, x). For shapes of the form (n_samples, 1), the
        same process applies.

    labels : ndarray of shape (n_samples,)
        True labels for the samples.

    sample_weight : array-like of shape (n_samples,), default=None
        Sample weights. If None, then samples are equally weighted.

    Returns
    -------
    beta : float
        The optimised inverse temperature parameter for probability calibration,
        with a value in the range (0, infinity).

    References
    ----------
    On Calibration of Modern Neural Networks,
    C. Guo, G. Pleiss, Y. Sun, & K. Q. Weinberger, ICML 2017.
    """
    check_consistent_length(predictions, labels)
    logits = _convert_to_logits(predictions)  # guarantees np.float64 or np.float32
Member

Suggested change

X, y = indexable(X, y)  # Is this really needed?
predictions, labels = X, y
check_consistent_length(predictions, labels)
logits = _convert_to_logits(predictions)  # guarantees np.float64 or np.float32

and so on.
So remove the function `_temperature_scaling` altogether and integrate it into `fit`.

It will end with

self.beta = np.exp(log_beta_minimizer.x)
return self
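The `np.exp` in that final line reflects an optimisation carried out over `log(beta)` rather than `beta` itself, which keeps the fitted inverse temperature strictly positive without bound constraints. A minimal sketch of that parameterisation on a toy binary problem (hypothetical data and loss, not the PR's code):

```python
import numpy as np
from scipy.optimize import minimize_scalar

# Toy binary problem: mixed-sign margins so the optimum is finite
logits = np.array([2.0, -1.0, 0.5, -3.0])
labels = np.array([1, 0, 1, 1])
sign = 2 * labels - 1

def nll(log_beta):
    # Binary log-loss of sigmoid(beta * logits); optimised over log(beta)
    return np.mean(np.logaddexp(0.0, -np.exp(log_beta) * logits * sign))

log_beta_minimizer = minimize_scalar(nll)
beta = np.exp(log_beta_minimizer.x)  # strictly positive by construction
```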

Comment on lines +724 to +730
if len(classes) == 2 and predictions.shape[-1] == 1:
    response_method_name = _check_response_method(
        clf,
        ["decision_function", "predict_proba"],
    ).__name__
    if response_method_name == "predict_proba":
        predictions = np.hstack([1 - predictions, predictions])
Member

Why not put it inside _TemperatureScaling.fit?

Comment on lines +822 to +828
if n_classes == 2 and predictions.shape[-1] == 1:
    response_method_name = _check_response_method(
        self.estimator,
        ["decision_function", "predict_proba"],
    ).__name__
    if response_method_name == "predict_proba":
        predictions = np.hstack([1 - predictions, predictions])
Member

Why not put it into _TemperatureScaling.predict?

Successfully merging this pull request may close these issues.

Implement temperature scaling for (multi-class) calibration
5 participants