Skip to main content

Showing 1–7 of 7 results for author: Faralli, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2105.01305  [pdf, other

    cs.AI cs.CL

    Large-scale Taxonomy Induction Using Entity and Word Embeddings

    Authors: Petar Ristoski, Stefano Faralli, Simone Paolo Ponzetto, Heiko Paulheim

    Abstract: Taxonomies are an important ingredient of knowledge organization, and serve as a backbone for more sophisticated knowledge representations in intelligent systems, such as formal ontologies. However, building taxonomies manually is a costly endeavor, and hence, automatic methods for taxonomy induction are a good alternative to build large-scale taxonomies. In this paper, we propose TIEmb, an approa… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: Published at IEEE/WIC/ACM International Conference on Web Intelligence 2017 (WI'17)

  2. arXiv:2005.06748  [pdf, other

    cs.IR cs.CY

    ECIR 2020 Workshops: Assessing the Impact of Going Online

    Authors: Sérgio Nunes, Suzanne Little, Sumit Bhatia, Ludovico Boratto, Guillaume Cabanac, Ricardo Campos, Francisco M. Couto, Stefano Faralli, Ingo Frommholz, Adam Jatowt, Alípio Jorge, Mirko Marras, Philipp Mayr, Giovanni Stilo

    Abstract: ECIR 2020 https://ecir2020.org/ was one of the many conferences affected by the COVID-19 pandemic. The Conference Chairs decided to keep the initially planned dates (April 14-17, 2020) and move to a fully online event. In this report, we describe the experience of organizing the ECIR 2020 Workshops in this scenario from two perspectives: the workshop organizers and the workshop participants. We pr… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: 10 pages, 3 figures, submitted to ACM SIGIR Forum

  3. arXiv:1803.05829  [pdf, other

    cs.CL

    Enriching Frame Representations with Distributionally Induced Senses

    Authors: Stefano Faralli, Alexander Panchenko, Chris Biemann, Simone Paolo Ponzetto

    Abstract: We introduce a new lexical resource that enriches the Framester knowledge graph, which links Framnet, WordNet, VerbNet and other resources, with semantic features from text corpora. These features are extracted from distributionally induced sense inventories and subsequently linked to the manually-constructed frame representations to boost the performance of frame disambiguation in context. Since… ▽ More

    Submitted 15 March, 2018; originally announced March 2018.

    Comments: In Proceedings of the 11th Conference on Language Resources and Evaluation (LREC 2018). Miyazaki, Japan. ELRA

  4. arXiv:1712.08819  [pdf, other

    cs.CL

    A Framework for Enriching Lexical Semantic Resources with Distributional Semantics

    Authors: Chris Biemann, Stefano Faralli, Alexander Panchenko, Simone Paolo Ponzetto

    Abstract: We present an approach to combining distributional semantic representations induced from text corpora with manually constructed lexical-semantic networks. While both kinds of semantic resources are available with high lexical coverage, our aligned resource combines the domain specificity and availability of contextual information from distributional models with the conciseness and high quality of… ▽ More

    Submitted 23 December, 2017; originally announced December 2017.

    Comments: Accepted for publication in the journal of Natural Language Engineering, 2018

  5. arXiv:1711.02918  [pdf, other

    cs.CL

    Improving Hypernymy Extraction with Distributional Semantic Classes

    Authors: Alexander Panchenko, Dmitry Ustalov, Stefano Faralli, Simone P. Ponzetto, Chris Biemann

    Abstract: In this paper, we show how distributionally-induced semantic classes can be helpful for extracting hypernyms. We present methods for inducing sense-aware semantic classes using distributional semantics and using these induced semantic classes for filtering noisy hypernymy relations. Denoising of hypernyms is performed by labeling each semantic class with its hypernyms. On the one hand, this allows… ▽ More

    Submitted 28 February, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

    Comments: In Proceedings of the 11th Conference on Language Resources and Evaluation (LREC 2018). Miyazaki, Japan

  6. arXiv:1710.01779  [pdf, other

    cs.CL

    Building a Web-Scale Dependency-Parsed Corpus from CommonCrawl

    Authors: Alexander Panchenko, Eugen Ruppert, Stefano Faralli, Simone Paolo Ponzetto, Chris Biemann

    Abstract: We present DepCC, the largest-to-date linguistically analyzed corpus in English including 365 million documents, composed of 252 billion tokens and 7.5 billion of named entity occurrences in 14.3 billion sentences from a web-scale crawl of the \textsc{Common Crawl} project. The sentences are processed with a dependency parser and with a named entity tagger and contain provenance information, enabl… ▽ More

    Submitted 28 February, 2018; v1 submitted 4 October, 2017; originally announced October 2017.

    Comments: In Proceedings of the 11th Conference on Language Resources and Evaluation (LREC'2018). Miyazaki, Japan

  7. Unsupervised, Knowledge-Free, and Interpretable Word Sense Disambiguation

    Authors: Alexander Panchenko, Fide Marten, Eugen Ruppert, Stefano Faralli, Dmitry Ustalov, Simone Paolo Ponzetto, Chris Biemann

    Abstract: Interpretability of a predictive model is a powerful feature that gains the trust of users in the correctness of the predictions. In word sense disambiguation (WSD), knowledge-based systems tend to be much more interpretable than knowledge-free counterparts as they rely on the wealth of manually-encoded elements representing word senses, such as hypernyms, usage examples, and images. We present a… ▽ More

    Submitted 21 July, 2017; originally announced July 2017.

    Comments: In Proceedings of the the Conference on Empirical Methods on Natural Language Processing (EMNLP 2017). 2017. Copenhagen, Denmark. Association for Computational Linguistics

    ACM Class: I.2.6; I.5.3; I.2.4