Skip to main content

Showing 1–11 of 11 results for author: Nastase, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.06622  [pdf, other

    cs.CL

    Exploring Italian sentence embeddings properties through multi-tasking

    Authors: Vivi Nastase, Giuseppe Samo, Chunyang Jiang, Paola Merlo

    Abstract: We investigate to what degree existing LLMs encode abstract linguistic information in Italian in a multi-task setting. We exploit curated synthetic data on a large scale -- several Blackbird Language Matrices (BLMs) problems in Italian -- and use them to study how sentence representations built using pre-trained language models encode specific syntactic and semantic information. We use a two-level… ▽ More

    Submitted 29 November, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: 11 pages, 6 figures, 4 tables

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2409.06567  [pdf, other

    cs.CL

    Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement

    Authors: Vivi Nastase, Chunyang Jiang, Giuseppe Samo, Paola Merlo

    Abstract: In this paper, our goal is to investigate to what degree multilingual pretrained language models capture cross-linguistically valid abstract linguistic representations. We take the approach of developing curated synthetic data on a large scale, with specific properties, and using them to study sentence representations built using pretrained language models. We use a new multiple-choice task and da… ▽ More

    Submitted 29 November, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: 13 pages, 5 tables, 6 figures

    MSC Class: 68T50 ACM Class: I.2.7

  3. arXiv:2407.18119  [pdf, other

    cs.CL

    Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification

    Authors: Vivi Nastase, Paola Merlo

    Abstract: Analyses of transformer-based models have shown that they encode a variety of linguistic information from their textual input. While these analyses have shed a light on the relation between linguistic information on one side, and internal architecture and parameters on the other, a question remains unanswered: how is this linguistic information reflected in sentence embeddings? Using datasets cons… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 12 pages, 9 figures, 1 table, published in RepL4NLP 2024

    MSC Class: 68T50 ACM Class: I.2.7

  4. arXiv:2406.16563  [pdf, other

    cs.CL

    Are there identifiable structural parts in the sentence embedding whole?

    Authors: Vivi Nastase, Paola Merlo

    Abstract: Sentence embeddings from transformer models encode in a fixed length vector much linguistic information. We explore the hypothesis that these embeddings consist of overlapping layers of information that can be separated, and on which specific types of information -- such as information about chunks and their structural and semantic properties -- can be detected. We show that this is the case using… ▽ More

    Submitted 2 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 17 pages, 14 figures, 5 tables

    MSC Class: 68T50 ACM Class: I.2.7

  5. arXiv:2312.11272  [pdf, other

    cs.CL

    Disentangling continuous and discrete linguistic signals in transformer-based sentence embeddings

    Authors: Vivi Nastase, Paola Merlo

    Abstract: Sentence and word embeddings encode structural and semantic information in a distributed manner. Part of the information encoded -- particularly lexical information -- can be seen as continuous, whereas other -- like structural information -- is most often discrete. We explore whether we can compress transformer-based sentence embeddings into a representation that separates different linguistic si… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    ACM Class: I.2.7

  6. arXiv:2312.09890  [pdf, other

    cs.CL

    Grammatical information in BERT sentence embeddings as two-dimensional arrays

    Authors: Vivi Nastase, Paola Merlo

    Abstract: Sentence embeddings induced with various transformer architectures encode much semantic and syntactic information in a distributed manner in a one-dimensional array. We investigate whether specific grammatical information can be accessed in these distributed representations. Using data from a task developed to test rule-like generalizations, our experiments on detecting subject-verb agreement yiel… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Published in RepL4NLP 2023

    ACM Class: I.2.7

    Journal ref: Proceedings of the 8th Workshop on Representation Learning for NLP (RepL4NLP 2023)

  7. arXiv:2009.05426  [pdf, other

    cs.CL

    Semantic Relations and Deep Learning

    Authors: Vivi Nastase, Stan Szpakowicz

    Abstract: The second edition of "Semantic Relations Between Nominals" by Vivi Nastase, Stan Szpakowicz, Preslav Nakov and Diarmuid Ó Séaghdha has been published in April 2021 by Morgan & Claypool (www.morganclaypoolpublishers.com/catalog_Orig/product_info.php?products_id=1627). A new Chapter 5 of the book, by Vivi Nastase and Stan Szpakowicz, discusses relation classification/extraction in the deep-learning… ▽ More

    Submitted 15 April, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: 86 pages

    ACM Class: I.2.7; H.3.3

  8. arXiv:1905.05538  [pdf, other

    cs.CL

    Assessing the Difficulty of Classifying ConceptNet Relations in a Multi-Label Classification Setting

    Authors: Maria Becker, Michael Staniek, Vivi Nastase, Anette Frank

    Abstract: Commonsense knowledge relations are crucial for advanced NLU tasks. We examine the learnability of such relations as represented in CONCEPTNET, taking into account their specific properties, which can make relation classification difficult: a given concept pair can be linked by multiple relation types, and relations can have multi-word arguments of diverse semantic types. We explore a neural open… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: RELATIONS - Workshop on meaning relations between phrases and sentences (co-located with IWCS). May 2019, Gothenburg, Sweden

  9. arXiv:1708.06816  [pdf, other

    cs.AI

    Analysis of the Impact of Negative Sampling on Link Prediction in Knowledge Graphs

    Authors: Bhushan Kotnis, Vivi Nastase

    Abstract: Knowledge graphs are large, useful, but incomplete knowledge repositories. They encode knowledge through entities and relations which define each other through the connective structure of the graph. This has inspired methods for the joint embedding of entities and relations in continuous low-dimensional vector spaces, that can be used to induce new edges in the graph, i.e., link prediction in know… ▽ More

    Submitted 2 March, 2018; v1 submitted 22 August, 2017; originally announced August 2017.

    Comments: 14 pages

  10. arXiv:1706.09278  [pdf, other

    cs.AI

    Learning Knowledge Graph Embeddings with Type Regularizer

    Authors: Bhushan Kotnis, Vivi Nastase

    Abstract: Learning relations based on evidence from knowledge bases relies on processing the available relation instances. Many relations, however, have clear domain and range, which we hypothesize could help learn a better, more generalizing, model. We include such information in the RESCAL model in the form of a regularization factor added to the loss function that takes into account the types (categories… ▽ More

    Submitted 2 March, 2018; v1 submitted 28 June, 2017; originally announced June 2017.

  11. arXiv:1411.7820  [pdf, other

    cs.CL

    Coarse-grained Cross-lingual Alignment of Comparable Texts with Topic Models and Encyclopedic Knowledge

    Authors: Vivi Nastase, Angela Fahrni

    Abstract: We present a method for coarse-grained cross-lingual alignment of comparable texts: segments consisting of contiguous paragraphs that discuss the same theme (e.g. history, economy) are aligned based on induced multilingual topics. The method combines three ideas: a two-level LDA model that filters out words that do not convey themes, an HMM that models the ordering of themes in the collection of d… ▽ More

    Submitted 28 November, 2014; originally announced November 2014.

    Comments: 9 pages, 4 figures