Skip to main content

Showing 1–4 of 4 results for author: Papageorgiou, H

Searching in archive cs. Search in all archives.
.
  1. SciNoBo : A Hierarchical Multi-Label Classifier of Scientific Publications

    Authors: Nikolaos Gialitsis, Sotiris Kotitsas, Haris Papageorgiou

    Abstract: Classifying scientific publications according to Field-of-Science (FoS) taxonomies is of crucial importance, allowing funders, publishers, scholars, companies and other stakeholders to organize scientific literature more effectively. Most existing works address classification either at venue level or solely based on the textual content of a research publication. We present SciNoBo, a novel classif… ▽ More

    Submitted 2 April, 2022; originally announced April 2022.

  2. arXiv:1401.6224  [pdf, other

    cs.CL physics.data-an

    Word-length entropies and correlations of natural language written texts

    Authors: Maria Kalimeri, Vassilios Constantoudis, Constantinos Papadimitriou, Konstantinos Karamanos, Fotis K. Diakonos, Harris Papageorgiou

    Abstract: We study the frequency distributions and correlations of the word lengths of ten European languages. Our findings indicate that a) the word-length distribution of short words quantified by the mean value and the entropy distinguishes the Uralic (Finnish) corpus from the others, b) the tails at long words, manifested in the high-order moments of the distributions, differentiate the Germanic languag… ▽ More

    Submitted 23 January, 2014; originally announced January 2014.

    Comments: 13 pages + 1 page of supporting information, 9 figures

  3. arXiv:1401.4205  [pdf, other

    cs.CL physics.data-an

    Entropy analysis of word-length series of natural language texts: Effects of text language and genre

    Authors: Maria Kalimeri, Vassilios Constantoudis, Constantinos Papadimitriou, Kostantinos Karamanos, Fotis K. Diakonos, Haris Papageorgiou

    Abstract: We estimate the $n$-gram entropies of natural language texts in word-length representation and find that these are sensitive to text language and genre. We attribute this sensitivity to changes in the probability distribution of the lengths of single words and emphasize the crucial role of the uniformity of probabilities of having words with length between five and ten. Furthermore, comparison wit… ▽ More

    Submitted 16 January, 2014; originally announced January 2014.

    Comments: 9 pages, 7 figures

    Journal ref: International Journal of Bifurcation and Chaos, 22, 1250223, (2012)

  4. A Matching Technique in Example-Based Machine Translation

    Authors: Lambros Cranias, Harris Papageorgiou, Stelios Piperidis

    Abstract: This paper addresses an important problem in Example-Based Machine Translation (EBMT), namely how to measure similarity between a sentence fragment and a set of stored examples. A new method is proposed that measures similarity according to both surface structure and content. A second contribution is the use of clustering to make retrieval of the best matching example from the database more effi… ▽ More

    Submitted 10 August, 1995; originally announced August 1995.

    Comments: 5 pages,LaTeX uses aclap.sty