Showing 1–5 of 5 results for author: Pinto, J P

  1. arXiv:2401.06790  [pdf, other]

    cs.CL cs.AI

    Using Zero-shot Prompting in the Automatic Creation and Expansion of Topic Taxonomies for Tagging Retail Banking Transactions

    Authors: Daniel de S. Moraes, Pedro T. C. Santos, Polyana B. da Costa, Matheus A. S. Pinto, Ivan de J. P. Pinto, Álvaro M. G. da Veiga, Sergio Colcher, Antonio J. G. Busson, Rafael H. Rocha, Rennan Gaio, Rafael Miceli, Gabriela Tourinho, Marcos Rabaioli, Leandro Santos, Fellipe Marques, David Favaro

    Abstract: This work presents an unsupervised method for automatically constructing and expanding topic taxonomies using instruction-based fine-tuned LLMs (Large Language Models). We apply topic modeling and keyword extraction techniques to create initial topic taxonomies and LLMs to post-process the resulting terms and create a hierarchy. To expand an existing taxonomy with new terms, we use zero-shot promp…

    Submitted 11 February, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  2. arXiv:2309.01764  [pdf, ps, other]

    stat.ME econ.EM stat.ML

    Generalized Information Criteria for Structured Sparse Models

    Authors: Eduardo F. Mendes, Gabriel J. P. Pinto

    Abstract: Regularized m-estimators are widely used due to their ability of recovering a low-dimensional model in high-dimensional scenarios. Some recent efforts on this subject focused on creating a unified framework for establishing oracle bounds, and deriving conditions for support recovery. Under this same framework, we propose a new Generalized Information Criteria (GIC) that takes into consideration th…

    Submitted 4 September, 2023; originally announced September 2023.

    MSC Class: 62F07

  3. arXiv:1812.10048  [pdf, other]

    cs.LG q-bio.GN stat.ML

    Parallel Clustering of Single Cell Transcriptomic Data with Split-Merge Sampling on Dirichlet Process Mixtures

    Authors: Tiehang Duan, José P. Pinto, Xiaohui Xie

    Abstract: Motivation: With the development of droplet based systems, massive single cell transcriptome data has become available, which enables analysis of cellular and molecular processes at single cell resolution and is instrumental to understanding many biological processes. While state-of-the-art clustering methods have been applied to the data, they face challenges in the following aspects: (1) the clu…

    Submitted 25 December, 2018; originally announced December 2018.

    Comments: Accepted for Bioinformatics Oxford

  4. Sudden change of quantum discord for a system of two qubits

    Authors: João P. G. Pinto, Goktug Karpat, Felipe F. Fanchini

    Abstract: It is known that quantum discord might experience a sudden transition in its dynamics when calculated for certain Bell-diagonal states (BDS) that are in interaction with their surroundings. We examine this phenomenon known as the sudden change of quantum discord, considering the case of two qubits independently interacting with dephasing reservoirs. We first numerically demonstrate that, for a cla…

    Submitted 30 September, 2013; originally announced September 2013.

    Comments: 5 pages, 2 figures (Published version)

    Journal ref: Phys. Rev. A 88, 034304 (2013)

  5. Relativistic deuteron structure function at large Q^2

    Authors: J. Paulo Pinto, A. Amorim, F. D. Santos

    Abstract: The deuteron deep inelastic unpolarized structure function F_2^D is calculated using the Wilson operator product expansion method. The long distance behaviour, related to the deuteron bound state properties, is evaluated using the Bethe-Salpeter equation with one particle on mass shell. The calculation of the ratio F_2^D/F_2^N is compared with other convolution models showing important deviation…

    Submitted 21 November, 1997; originally announced November 1997.

    Comments: 7 pages, 1 ps figure, RevTeX source, 1 tar.gz file. Submitted to Physical Letters

    Journal ref: Europhys.Lett.46:668-674,1999