Skip to main content

Showing 1–17 of 17 results for author: Titov, I

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.19923  [pdf, other

    cs.AI cs.LG stat.ME

    Language Agents Meet Causality -- Bridging LLMs and Causal World Models

    Authors: John Gkountouras, Matthias Lindemann, Phillip Lippe, Efstratios Gavves, Ivan Titov

    Abstract: Large Language Models (LLMs) have recently shown great promise in planning and reasoning applications. These tasks demand robust systems, which arguably require a causal understanding of the environment. While LLMs can acquire and reflect common sense causal knowledge from their pretraining data, this information is often incomplete, incorrect, or inapplicable to a specific environment. In contras… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

    Comments: Project page: https://j0hngou.github.io/LLMCWM/

  2. arXiv:2109.03792  [pdf, other

    cs.CL cs.AI stat.ML

    Highly Parallel Autoregressive Entity Linking with Discriminative Correction

    Authors: Nicola De Cao, Wilker Aziz, Ivan Titov

    Abstract: Generative approaches have been recently shown to be effective for both Entity Disambiguation and Entity Linking (i.e., joint mention detection and disambiguation). However, the previously proposed autoregressive formulation for EL suffers from i) high computational cost due to a complex (deep) decoder, ii) non-parallelizable decoding that scales with the source sequence length, and iii) the need… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP2021 Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Code at https://github.com/nicola-decao/efficient-autoregressive-EL . 8 pages, 1 figure, 3 tables

  3. arXiv:2010.00577  [pdf, other

    cs.CL cs.LG stat.ML

    Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking

    Authors: Michael Sejr Schlichtkrull, Nicola De Cao, Ivan Titov

    Abstract: Graph neural networks (GNNs) have become a popular approach to integrating structural inductive biases into NLP models. However, there has been little work on interpreting them, and specifically on understanding which parts of the graphs (e.g. syntactic trees or co-reference structures) contribute to a prediction. In this work, we introduce a post-hoc method for interpreting the predictions of GNN… ▽ More

    Submitted 3 October, 2022; v1 submitted 1 October, 2020; originally announced October 2020.

  4. arXiv:2004.14992  [pdf, other

    cs.CL stat.ML

    How do Decisions Emerge across Layers in Neural Models? Interpretation with Differentiable Masking

    Authors: Nicola De Cao, Michael Schlichtkrull, Wilker Aziz, Ivan Titov

    Abstract: Attribution methods assess the contribution of inputs to the model prediction. One way to do so is erasure: a subset of inputs is considered irrelevant if it can be removed without affecting the prediction. Though conceptually simple, erasure's objective is intractable and approximate search remains expensive with modern deep NLP models. Erasure is also susceptible to the hindsight bias: the fact… ▽ More

    Submitted 2 March, 2021; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: Accepted at the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Source code available at https://github.com/nicola-decao/diffmask . 18 pages, 15 figures, 4 tables

  5. arXiv:2004.14884  [pdf, other

    cs.LG cs.CL cs.NE stat.ML

    Few-Shot Learning for Opinion Summarization

    Authors: Arthur Bražinskas, Mirella Lapata, Ivan Titov

    Abstract: Opinion summarization is the automatic creation of text reflecting subjective information expressed in multiple documents, such as user reviews of a product. The task is practically important and has attracted a lot of attention. However, due to the high cost of summary production, datasets large enough for training supervised models are lacking. Instead, the task has been traditionally approached… ▽ More

    Submitted 10 October, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: EMNLP 2020

  6. arXiv:2004.14758  [pdf, other

    cs.LG cs.CL stat.ML

    Preventing Posterior Collapse with Levenshtein Variational Autoencoder

    Authors: Serhii Havrylov, Ivan Titov

    Abstract: Variational autoencoders (VAEs) are a standard framework for inducing latent variable models that have been shown effective in learning text representations as well as in text generation. The key challenge with using VAEs is the {\it posterior collapse} problem: learning tends to converge to trivial solutions where the generators ignore latent variables. In our Levenstein VAE, we propose to replac… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

  7. arXiv:1911.02247  [pdf, other

    cs.CL cs.IR cs.LG stat.ML

    Unsupervised Opinion Summarization as Copycat-Review Generation

    Authors: Arthur Bražinskas, Mirella Lapata, Ivan Titov

    Abstract: Opinion summarization is the task of automatically creating summaries that reflect subjective information expressed in multiple documents, such as product reviews. While the majority of previous work has focused on the extractive setting, i.e., selecting fragments from input reviews to produce a summary, we let the model generate novel sentences and hence produce abstractive summaries. Recent prog… ▽ More

    Submitted 19 April, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: ACL 2020

  8. arXiv:1904.09585  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Obfuscation for Privacy-preserving Syntactic Parsing

    Authors: Zhifeng Hu, Serhii Havrylov, Ivan Titov, Shay B. Cohen

    Abstract: The goal of homomorphic encryption is to encrypt data such that another party can operate on it without being explicitly exposed to the content of the original data. We introduce an idea for a privacy-preserving transformation on natural language data, inspired by homomorphic encryption. Our primary tool is {\em obfuscation}, relying on the properties of natural language. Specifically, a given Eng… ▽ More

    Submitted 27 May, 2020; v1 submitted 21 April, 2019; originally announced April 2019.

    Comments: Accepted to IWPT 2020

  9. arXiv:1904.04676  [pdf, other

    stat.ML cs.LG

    Block Neural Autoregressive Flow

    Authors: Nicola De Cao, Ivan Titov, Wilker Aziz

    Abstract: Normalising flows (NFS) map two density functions via a differentiable bijection whose Jacobian determinant can be computed efficiently. Recently, as an alternative to hand-crafted bijections, Huang et al. (2018) proposed neural autoregressive flow (NAF) which is a universal approximator for density functions. Their flow is a neural network (NN) whose parameters are predicted by another NN. The la… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: 12 pages, 3 figures, 3 tables

  10. arXiv:1808.09920  [pdf, other

    cs.CL stat.ML

    Question Answering by Reasoning Across Documents with Graph Convolutional Networks

    Authors: Nicola De Cao, Wilker Aziz, Ivan Titov

    Abstract: Most research in reading comprehension has focused on answering questions based on individual documents or even single paragraphs. We introduce a neural model which integrates and reasons relying on information spread within documents and across multiple documents. We frame it as an inference problem on a graph. Mentions of entities are nodes of this graph while edges encode relations between diff… ▽ More

    Submitted 27 September, 2022; v1 submitted 29 August, 2018; originally announced August 2018.

    Journal ref: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) 2306--2317

  11. arXiv:1703.06103  [pdf, other

    stat.ML cs.AI cs.DB cs.LG

    Modeling Relational Data with Graph Convolutional Networks

    Authors: Michael Schlichtkrull, Thomas N. Kipf, Peter Bloem, Rianne van den Berg, Ivan Titov, Max Welling

    Abstract: Knowledge graphs enable a wide variety of applications, including question answering and information retrieval. Despite the great effort invested in their creation and maintenance, even the largest (e.g., Yago, DBPedia or Wikidata) remain incomplete. We introduce Relational Graph Convolutional Networks (R-GCNs) and apply them to two standard knowledge base completion tasks: Link prediction (recove… ▽ More

    Submitted 26 October, 2017; v1 submitted 17 March, 2017; originally announced March 2017.

  12. arXiv:1702.03121  [pdf, other

    cs.CL cs.AI stat.ML

    Modeling Semantic Expectation: Using Script Knowledge for Referent Prediction

    Authors: Ashutosh Modi, Ivan Titov, Vera Demberg, Asad Sayeed, Manfred Pinkal

    Abstract: Recent research in psycholinguistics has provided increasing evidence that humans predict upcoming content. Prediction also affects perception and might be a key to robustness in human language processing. In this paper, we investigate the factors that affect human prediction by building a computational model that can predict upcoming discourse referents based on linguistic knowledge alone vs. lin… ▽ More

    Submitted 10 February, 2017; originally announced February 2017.

    Comments: 14 pages, published at TACL, 2017, Volume-5, Pg 31-44, 2017

    Journal ref: Transactions of ACL, Volume-5, Pg 31-44 (2017)

  13. arXiv:1603.09128  [pdf, other

    cs.CL cs.LG stat.ML

    Bilingual Learning of Multi-sense Embeddings with Discrete Autoencoders

    Authors: Simon Šuster, Ivan Titov, Gertjan van Noord

    Abstract: We present an approach to learning multi-sense word embeddings relying both on monolingual and bilingual information. Our model consists of an encoder, which uses monolingual and bilingual context (i.e. a parallel sentence) to choose a sense for a given word, and a decoder which predicts context words based on the chosen sense. The two components are estimated jointly. We observe that the word rep… ▽ More

    Submitted 30 March, 2016; originally announced March 2016.

    Comments: 11 pages, to appear at NAACL 2016

  14. arXiv:1508.07709  [pdf, other

    cs.CL cs.LG stat.ML

    Word Representations, Tree Models and Syntactic Functions

    Authors: Simon Šuster, Gertjan van Noord, Ivan Titov

    Abstract: Word representations induced from models with discrete latent variables (e.g.\ HMMs) have been shown to be beneficial in many NLP applications. In this work, we exploit labeled syntactic dependency trees and formalize the induction problem as unsupervised learning of tree-structured hidden Markov models. Syntactic functions are used as additional observed variables in the model, influencing both t… ▽ More

    Submitted 5 February, 2016; v1 submitted 31 August, 2015; originally announced August 2015.

    Comments: Add github code repository link. Fix equation 4.1

  15. arXiv:1412.6418  [pdf, other

    cs.CL cs.LG stat.ML

    Inducing Semantic Representation from Text by Jointly Predicting and Factorizing Relations

    Authors: Ivan Titov, Ehsan Khoddam

    Abstract: In this work, we propose a new method to integrate two recent lines of work: unsupervised induction of shallow semantics (e.g., semantic roles) and factorization of relations in text and knowledge bases. Our model consists of two components: (1) an encoding component: a semantic role labeling model which predicts roles given a rich set of syntactic and lexical features; (2) a reconstruction compon… ▽ More

    Submitted 16 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: Accepted as a workshop contribution at ICLR 2015

  16. arXiv:1412.2812  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Unsupervised Induction of Semantic Roles within a Reconstruction-Error Minimization Framework

    Authors: Ivan Titov, Ehsan Khoddam

    Abstract: We introduce a new approach to unsupervised estimation of feature-rich semantic role labeling models. Our model consists of two components: (1) an encoding component: a semantic role labeling model which predicts roles given a rich set of syntactic and lexical features; (2) a reconstruction component: a tensor factorization model which relies on roles to predict argument fillers. When the componen… ▽ More

    Submitted 8 December, 2014; originally announced December 2014.

  17. arXiv:1312.5198  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Learning Semantic Script Knowledge with Event Embeddings

    Authors: Ashutosh Modi, Ivan Titov

    Abstract: Induction of common sense knowledge about prototypical sequences of events has recently received much attention. Instead of inducing this knowledge in the form of graphs, as in much of the previous work, in our method, distributed representations of event realizations are computed based on distributed representations of predicates and their arguments, and then these representations are used to pre… ▽ More

    Submitted 25 April, 2014; v1 submitted 18 December, 2013; originally announced December 2013.

    Comments: 4 Pages, 1 figure, ICLR Workshop

    ACM Class: I.2.6; I.2.7