ChatGPT Prompting Cannot Estimate Predictive Uncertainty in High-Resource Languages

Pelucchi, Martino; Valdenegro-Toro, Matias

Computer Science > Computation and Language

arXiv:2311.06427 (cs)

[Submitted on 10 Nov 2023]

Title:ChatGPT Prompting Cannot Estimate Predictive Uncertainty in High-Resource Languages

Authors:Martino Pelucchi, Matias Valdenegro-Toro

View PDF

Abstract:ChatGPT took the world by storm for its impressive abilities. Due to its release without documentation, scientists immediately attempted to identify its limits, mainly through its performance in natural language processing (NLP) tasks. This paper aims to join the growing literature regarding ChatGPT's abilities by focusing on its performance in high-resource languages and on its capacity to predict its answers' accuracy by giving a confidence level. The analysis of high-resource languages is of interest as studies have shown that low-resource languages perform worse than English in NLP tasks, but no study so far has analysed whether high-resource languages perform as well as English. The analysis of ChatGPT's confidence calibration has not been carried out before either and is critical to learn about ChatGPT's trustworthiness. In order to study these two aspects, five high-resource languages and two NLP tasks were chosen. ChatGPT was asked to perform both tasks in the five languages and to give a numerical confidence value for each answer. The results show that all the selected high-resource languages perform similarly and that ChatGPT does not have a good confidence calibration, often being overconfident and never giving low confidence values.

Comments:	14 pages, 4 figures, with appendix
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2311.06427 [cs.CL]
	(or arXiv:2311.06427v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.06427

Submission history

From: Matias Valdenegro-Toro [view email]
[v1] Fri, 10 Nov 2023 23:25:34 UTC (436 KB)

Computer Science > Computation and Language

Title:ChatGPT Prompting Cannot Estimate Predictive Uncertainty in High-Resource Languages

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ChatGPT Prompting Cannot Estimate Predictive Uncertainty in High-Resource Languages

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators