Showing 1–2 of 2 results for author: de Oliveira, J P M

Search v0.5.6 released 2020-02-24

arXiv:1807.00303 [pdf, other]

cs.CL cs.IR

Modeling, comprehending and summarizing textual content by graphs

Authors: Vinicius Woloszyn, Guilherme Medeiros Machado, Leandro Krug Wives, José Palazzo Moreira de Oliveira

Abstract: Automatic Text Summarization strategies have been successfully employed to digest text collections and extract its essential content. Usually, summaries are generated using textual corpora that belongs to the same domain area where the summary will be used. Nonetheless, there are special cases where it is not found enough textual sources, and one possible alternative is to generate a summary from… ▽ More Automatic Text Summarization strategies have been successfully employed to digest text collections and extract its essential content. Usually, summaries are generated using textual corpora that belongs to the same domain area where the summary will be used. Nonetheless, there are special cases where it is not found enough textual sources, and one possible alternative is to generate a summary from a different domain. One manner to summarize texts consists of using a graph model. This model allows giving more importance to words corresponding to the main concepts from the target domain found in the summarized text. This gives the reader an overview of the main text concepts as well as their relationships. However, this kind of summarization presents a significant number of repeated terms when compared to human-generated summaries. In this paper, we present an approach to produce graph-model extractive summaries of texts, meeting the target domain exigences and treating the terms repetition problem. To evaluate the proposition, we performed a series of experiments showing that the proposed approach statistically improves the performance of a model based on Graph Centrality, achieving better coverage, accuracy, and recall. △ Less

Submitted 1 July, 2018; originally announced July 2018.
arXiv:1111.2829 [pdf, ps, other]

physics.soc-ph cs.DL

doi 10.1016/j.physa.2011.11.021

Universality in Bibliometrics

Authors: Roberto da Silva, Fahad Kalil, Alexandre Souto Martinez, Jose Palazzo Moreira de Oliveira

Abstract: Many discussions have enlarged the literature in Bibliometrics since the Hirsh proposal, the so called $h$-index. Ranking papers according to their citations, this index quantifies a researcher only by its greatest possible number of papers that are cited at least $h$ times. A closed formula for $h$-index distribution that can be applied for distinct databases is not yet known. In fact, to obtain… ▽ More Many discussions have enlarged the literature in Bibliometrics since the Hirsh proposal, the so called $h$-index. Ranking papers according to their citations, this index quantifies a researcher only by its greatest possible number of papers that are cited at least $h$ times. A closed formula for $h$-index distribution that can be applied for distinct databases is not yet known. In fact, to obtain such distribution, the knowledge of citation distribution of the authors and its specificities are required. Instead of dealing with researchers randomly chosen, here we address different groups based on distinct databases. The first group is composed by physicists and biologists, with data extracted from Institute of Scientific Information (ISI). The second group composed by computer scientists, which data were extracted from Google-Scholar system. In this paper, we obtain a general formula for the $h$-index probability density function (pdf) for groups of authors by using generalized exponentials in the context of escort probability. Our analysis includes the use of several statistical methods to estimate the necessary parameters. Also an exhaustive comparison among the possible candidate distributions are used to describe the way the citations are distributed among authors. The $h$-index pdf should be used to classify groups of researchers from a quantitative point of view, which is meaningfully interesting to eliminate obscure qualitative methods. △ Less

Submitted 11 November, 2011; originally announced November 2011.

Comments: To appear in Physica A (8 pages, 6 figures and 2 tables)

Journal ref: Physica A 391 (2012) 2119-2128

Search v0.5.6 released 2020-02-24