Using Zero-shot Prompting in the Automatic Creation and Expansion of Topic Taxonomies for Tagging Retail Banking Transactions

Moraes, Daniel de S.; Santos, Pedro T. C.; da Costa, Polyana B.; Pinto, Matheus A. S.; Pinto, Ivan de J. P.; da Veiga, Álvaro M. G.; Colcher, Sergio; Busson, Antonio J. G.; Rocha, Rafael H.; Gaio, Rennan; Miceli, Rafael; Tourinho, Gabriela; Rabaioli, Marcos; Santos, Leandro; Marques, Fellipe; Favaro, David

arXiv:2401.06790 (cs)

[Submitted on 8 Jan 2024 (v1), last revised 11 Feb 2024 (this version, v2)]

Abstract: This work presents an unsupervised method for automatically constructing and expanding topic taxonomies using instruction-based fine-tuned LLMs (Large Language Models). We apply topic modeling and keyword extraction techniques to create initial topic taxonomies, and use LLMs to post-process the resulting terms and arrange them into a hierarchy. To expand an existing taxonomy with new terms, we use zero-shot prompting to determine where to add new nodes; to our knowledge, this is the first work to apply such an approach to taxonomy tasks. We use the resulting taxonomies to assign tags that characterize merchants from a retail bank dataset. To evaluate our work, we asked 12 volunteers to answer a two-part form in which we first assessed the quality of the taxonomies created and then the tags assigned to merchants based on those taxonomies. The evaluation revealed a coherence rate exceeding 90% for the chosen taxonomies. Expanding the taxonomies with LLMs also showed promising results for parent node prediction, with an F1-score above 70% in our taxonomies.
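The zero-shot expansion step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy taxonomy, the prompt wording, and the helper names `candidate_parents`, `build_parent_prompt`, and `parse_parent` are all assumptions made for illustration, and the call to an actual LLM is deliberately left out.

```python
from typing import Dict, List, Optional

# Hypothetical toy taxonomy (parent -> children); not taken from the paper.
TAXONOMY: Dict[str, List[str]] = {
    "Food": ["Restaurants", "Supermarkets"],
    "Transport": ["Fuel", "Ride Hailing"],
}


def candidate_parents(taxonomy: Dict[str, List[str]]) -> List[str]:
    """Return every node of the taxonomy once, in a stable order."""
    nodes: List[str] = []
    for parent, children in taxonomy.items():
        for node in [parent, *children]:
            if node not in nodes:
                nodes.append(node)
    return nodes


def build_parent_prompt(taxonomy: Dict[str, List[str]], new_term: str) -> str:
    """Build a zero-shot prompt (no worked examples) asking for a parent node."""
    options = "\n".join(f"- {node}" for node in candidate_parents(taxonomy))
    return (
        "You are organizing a topic taxonomy for retail banking merchants.\n"
        f"Existing nodes:\n{options}\n"
        f"New term: {new_term}\n"
        "Answer with the single existing node that should be the parent "
        "of the new term."
    )


def parse_parent(response: str, taxonomy: Dict[str, List[str]]) -> Optional[str]:
    """Map a model response onto an existing node, or None if it matches nothing."""
    cleaned = response.strip().strip(".").lower()
    for node in candidate_parents(taxonomy):
        if node.lower() == cleaned:
            return node
    return None
```

In this sketch, the string returned by `build_parent_prompt(TAXONOMY, "Bakeries")` would be sent to an instruction-tuned model of the reader's choice, and the reply passed to `parse_parent`. Rejecting replies that do not name an existing node is one simple way to keep the model from inventing parents; the paper may handle this differently.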

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2401.06790 [cs.CL]
	(or arXiv:2401.06790v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.06790

Submission history

From: Antonio Busson
[v1] Mon, 8 Jan 2024 00:27:16 UTC (3,125 KB)
[v2] Sun, 11 Feb 2024 15:54:58 UTC (3,116 KB)
