I-trustworthy Models. A framework for trustworthiness evaluation of probabilistic classifiers

Vashistha, Ritwik; Farahi, Arya

arXiv:2501.15617 (stat)

[Submitted on 26 Jan 2025 (v1), last revised 1 May 2025 (this version, v2)]

Abstract:As probabilistic models continue to permeate various facets of our society and contribute to scientific advancements, it becomes necessary to go beyond traditional metrics such as predictive accuracy and error rates and to assess their trustworthiness. Grounded in the competence-based theory of trust, this work formalizes the I-trustworthy framework, a novel framework for assessing the trustworthiness of probabilistic classifiers for inference tasks by linking local calibration to trustworthiness. To assess I-trustworthiness, we use the local calibration error (LCE) and develop a hypothesis-testing method. This method uses a kernel-based test statistic, the Kernel Local Calibration Error (KLCE), to test the local calibration of a probabilistic classifier. We provide theoretical guarantees in the form of convergence bounds for an unbiased estimator of KLCE. Additionally, we present a diagnostic tool designed to identify and measure biases in cases of miscalibration. The effectiveness of the proposed test statistic is demonstrated through its application to both simulated and real-world datasets. Finally, we study the LCE of related recalibration methods and provide evidence that existing methods are insufficient to achieve I-trustworthiness.
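
To make the idea of a kernel-based test of local calibration more concrete, the following is a minimal sketch and not the paper's KLCE definition. It assumes binary labels, an RBF kernel on the input features, and a null distribution simulated by redrawing labels from the predicted probabilities; the function names (`rbf_gram`, `kernel_ustat`, `resampling_p_value`), the bandwidth, and the resampling scheme are all illustrative assumptions that do not come from the abstract.

```python
import math
import random


def rbf_gram(features, bandwidth):
    """Gram matrix of an RBF kernel over feature vectors (lists of floats).

    Assumption: the kernel acts on raw input features; the paper may use
    a different kernel or a different representation.
    """
    n = len(features)
    gram = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            sq = sum((a - b) ** 2 for a, b in zip(features[i], features[j]))
            gram[i][j] = gram[j][i] = math.exp(-sq / (2.0 * bandwidth ** 2))
    return gram


def kernel_ustat(gram, probs, labels):
    """Average of r_i * r_j * k(x_i, x_j) over all pairs i != j,
    where r_i = y_i - p_i is the residual of the predicted probability.

    Excluding the i == j terms makes this an unbiased U-statistic for
    E[r * r' * k(x, x')], which is zero when E[y - p | x] = 0.
    """
    n = len(labels)
    if n < 2:
        raise ValueError("need at least two samples")
    resid = [y - p for y, p in zip(labels, probs)]
    total = 0.0
    for i in range(n):
        row = gram[i]
        for j in range(i + 1, n):
            total += resid[i] * resid[j] * row[j]
    # Each unordered pair stands for two ordered pairs, hence the factor 2.
    return 2.0 * total / (n * (n - 1))


def resampling_p_value(features, probs, labels,
                       bandwidth=0.2, n_resamples=200, seed=0):
    """Return (observed statistic, p-value).

    Assumption: the null is simulated by redrawing each label as
    Bernoulli(p_i), i.e. pretending the classifier is calibrated at
    every input. This is a stand-in for whatever test the paper uses.
    """
    rng = random.Random(seed)
    gram = rbf_gram(features, bandwidth)
    observed = kernel_ustat(gram, probs, labels)
    exceed = 0
    for _ in range(n_resamples):
        simulated = [1 if rng.random() < p else 0 for p in probs]
        if kernel_ustat(gram, probs, simulated) >= observed:
            exceed += 1
    # Add-one correction keeps the p-value strictly positive.
    return observed, (exceed + 1) / (n_resamples + 1)
```

A case that shows why locality matters: if the true probability of the positive class equals a feature x drawn uniformly from [0, 1] and the classifier always predicts 0.5, the predictions are correct on average yet wrong at almost every individual x. A statistic of this kind is designed to pick up that pattern, because residuals at nearby inputs share the same sign and the kernel weights those pairs most heavily.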

Comments:	Accepted at AISTATS 2025 Conference
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
Cite as:	arXiv:2501.15617 [stat.ML]
	(or arXiv:2501.15617v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2501.15617
Journal reference:	International Conference on Artificial Intelligence and Statistics 2025 Apr 23 (pp. 4726-4734). PMLR

Submission history

From: Ritwik Vashistha
[v1] Sun, 26 Jan 2025 17:54:43 UTC (903 KB)
[v2] Thu, 1 May 2025 08:57:50 UTC (889 KB)
