Skip to main content

Showing 1–27 of 27 results for author: van Deemter, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.04142  [pdf

    cs.CL cs.AI

    My Life in Artificial Intelligence: People, anecdotes, and some lessons learnt

    Authors: Kees van Deemter

    Abstract: In this very personal workography, I relate my 40-year experiences as a researcher and educator in and around Artificial Intelligence (AI), more specifically Natural Language Processing. I describe how curiosity, and the circumstances of the day, led me to work in both industry and academia, and in various countries, including The Netherlands (Amsterdam, Eindhoven, and Utrecht), the USA (Stanford)… ▽ More

    Submitted 5 April, 2025; originally announced April 2025.

    Comments: 34 pages

  2. arXiv:2501.12011  [pdf, other

    cs.CL

    Reference-free Evaluation Metrics for Text Generation: A Survey

    Authors: Takumi Ito, Kees van Deemter, Jun Suzuki

    Abstract: A number of automatic evaluation metrics have been proposed for natural language generation systems. The most common approach to automatic evaluation is the use of a reference-based metric that compares the model's output with gold-standard references written by humans. However, it is expensive to create such references, and for some tasks, such as response generation in dialogue, creating referen… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: Work in progress

  3. arXiv:2403.04376  [pdf, other

    cs.CL

    Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases

    Authors: Yuqi Liu, Guanyi Chen, Kees van Deemter

    Abstract: Theoretical linguists have suggested that some languages (e.g., Chinese and Japanese) are "cooler" than other languages based on the observation that the intended meaning of phrases in these languages depends more on their contexts. As a result, many expressions in these languages are shortened, and their meaning is inferred from the context. In this paper, we focus on the omission of the pluralit… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

  4. arXiv:2402.07432  [pdf, other

    cs.CL

    Intrinsic Task-based Evaluation for Referring Expression Generation

    Authors: Guanyi Chen, Fahime Same, Kees van Deemter

    Abstract: Recently, a human evaluation study of Referring Expression Generation (REG) models had an unexpected conclusion: on \textsc{webnlg}, Referring Expressions (REs) generated by the state-of-the-art neural models were not only indistinguishable from the REs in \textsc{webnlg} but also from the REs generated by a simple rule-based system. Here, we argue that this limitation could stem from the use of a… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  5. arXiv:2401.09041  [pdf, other

    cs.CL

    Textual Summarisation of Large Sets: Towards a General Approach

    Authors: Kittipitch Kuptavanich, Ehud Reiter, Kees Van Deemter, Advaith Siddharthan

    Abstract: We are developing techniques to generate summary descriptions of sets of objects. In this paper, we present and evaluate a rule-based NLG technique for summarising sets of bibliographical references in academic papers. This extends our previous work on summarising sets of consumer products and shows how our model generalises across these two very different domains.

    Submitted 17 January, 2024; originally announced January 2024.

  6. arXiv:2401.07897  [pdf, ps, other

    cs.CL

    The Pitfalls of Defining Hallucination

    Authors: Kees van Deemter

    Abstract: Despite impressive advances in Natural Language Generation (NLG) and Large Language Models (LLMs), researchers are still unclear about important aspects of NLG evaluation. To substantiate this claim, I examine current classifications of hallucination and omission in Data-text NLG, and I propose a logic-based synthesis of these classfications. I conclude by highlighting some remaining limitations o… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in Computational Linguistics on 30 Dec. 2023. (9 Pages.)

  7. arXiv:2307.14817  [pdf, other

    cs.CL

    Models of reference production: How do they withstand the test of time?

    Authors: Fahime Same, Guanyi Chen, Kees van Deemter

    Abstract: In recent years, many NLP studies have focused solely on performance improvement. In this work, we focus on the linguistic and scientific aspects of NLP. We use the task of generating referring expressions in context (REG-in-context) as a case study and start our analysis from GREC, a comprehensive set of shared tasks in English that addressed this topic over a decade ago. We ask what the performa… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted to INLG 2023

  8. arXiv:2305.14020  [pdf, other

    cs.CL cs.AI

    Does ChatGPT have Theory of Mind?

    Authors: Bart Holterman, Kees van Deemter

    Abstract: Theory of Mind (ToM) is the ability to understand human thinking and decision-making, an ability that plays a crucial role in social interaction between people, including linguistic communication. This paper investigates to what extent recent Large Language Models in the ChatGPT tradition possess ToM. We posed six well-known problems that address biases in human reasoning and decision making to tw… ▽ More

    Submitted 13 September, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  9. arXiv:2305.01633  [pdf, other

    cs.CL

    Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

    Authors: Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M. Alonso-Moral, Mohammad Arvan, Anouck Braggaar, Mark Cieliebak, Elizabeth Clark, Kees van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Mingqi Gao, Albert Gatt, Dimitra Gkatzia, Javier González-Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D. Kelleher, Filip Klubicka, Emiel Krahmer, Huiyuan Lai , et al. (17 additional authors not shown)

    Abstract: We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible. We present our results and findings, which include that just 13\% of papers had (i) sufficiently low barriers to reproduction, and (ii) enough obtainable information, to be considered for reproduction, a… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 5 pages plus appendix, 4 tables, 1 figure. To appear at "Workshop on Insights from Negative Results in NLP" (co-located with EACL2023). Updated author list and acknowledgements

    MSC Class: 68 ACM Class: I.2.7

  10. Interpreting Vision and Language Generative Models with Semantic Visual Priors

    Authors: Michele Cafagna, Lina M. Rojas-Barahona, Kees van Deemter, Albert Gatt

    Abstract: When applied to Image-to-text models, interpretability methods often provide token-by-token explanations namely, they compute a visual explanation for each token of the generated sequence. Those explanations are expensive to compute and unable to comprehensively explain the model's output. Therefore, these models often require some sort of approximation that eventually leads to misleading explanat… ▽ More

    Submitted 4 May, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

  11. arXiv:2302.12189  [pdf, other

    cs.CL cs.CV

    HL Dataset: Visually-grounded Description of Scenes, Actions and Rationales

    Authors: Michele Cafagna, Kees van Deemter, Albert Gatt

    Abstract: Current captioning datasets focus on object-centric captions, describing the visible objects in the image, e.g. "people eating food in a park". Although these datasets are useful to evaluate the ability of Vision & Language models to recognize and describe visual content, they do not support controlled experiments involving model testing or fine-tuning, with more high-level captions, which humans… ▽ More

    Submitted 25 September, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

  12. arXiv:2211.04971  [pdf, other

    cs.CL cs.CV

    Understanding Cross-modal Interactions in V&L Models that Generate Scene Descriptions

    Authors: Michele Cafagna, Kees van Deemter, Albert Gatt

    Abstract: Image captioning models tend to describe images in an object-centric way, emphasising visible objects. But image descriptions can also abstract away from objects and describe the type of scene depicted. In this paper, we explore the potential of a state-of-the-art Vision and Language model, VinVL, to caption images at the scene level using (1) a novel dataset which pairs images with both object-ce… ▽ More

    Submitted 10 November, 2022; v1 submitted 9 November, 2022; originally announced November 2022.

  13. arXiv:2210.04828  [pdf, other

    cs.CL

    Assessing Neural Referential Form Selectors on a Realistic Multilingual Dataset

    Authors: Guanyi Chen, Fahime Same, Kees van Deemter

    Abstract: Previous work on Neural Referring Expression Generation (REG) all uses WebNLG, an English dataset that has been shown to reflect a very limited range of referring expression (RE) use. To tackle this issue, we build a dataset based on the OntoNotes corpus that contains a broader range of RE use in both English and Chinese (a language that uses zero pronouns). We build neural Referential Form Select… ▽ More

    Submitted 11 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Eval4NLP workshop

  14. arXiv:2209.11977  [pdf, other

    cs.CL

    Understanding the Use of Quantifiers in Mandarin

    Authors: Guanyi Chen, Kees van Deemter

    Abstract: We introduce a corpus of short texts in Mandarin, in which quantified expressions figure prominently. We illustrate the significance of the corpus by examining the hypothesis (known as Huang's "coolness" hypothesis) that speakers of East Asian Languages tend to speak more briefly but less informatively than, for example, speakers of West-European languages. The corpus results from an elicitation e… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: AACL-Findings 2022

  15. arXiv:2209.06169  [pdf, ps, other

    cs.CL

    The Role of Explanatory Value in Natural Language Processing

    Authors: Kees van Deemter

    Abstract: A key aim of science is explanation, yet the idea of explaining language phenomena has taken a backseat in mainstream Natural Language Processing (NLP) and many other areas of Artificial Intelligence. I argue that explanation of linguistic behaviour should be a main goal of NLP, and that this is not the same as making NLP models explainable. To illustrate these ideas, some recent models of human l… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 8 pages + bibliography

  16. arXiv:2205.13858  [pdf, other

    cs.CL

    Semeval-2022 Task 1: CODWOE -- Comparing Dictionaries and Word Embeddings

    Authors: Timothee Mickus, Kees van Deemter, Mathieu Constant, Denis Paperno

    Abstract: Word embeddings have advanced the state of the art in NLP across numerous tasks. Understanding the contents of dense neural representations is of utmost interest to the computational semantics community. We propose to focus on relating these opaque word vectors with human-readable definitions, as found in dictionaries. This problem naturally divides into two subtasks: converting definitions into e… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  17. arXiv:2204.12197  [pdf, other

    cs.LO

    Evaluating Automatic Difficulty Estimation of Logic Formalization Exercises

    Authors: Alexandra Mayn, Kees van Deemter

    Abstract: Teaching logic effectively requires an understanding of the factors which cause logic students to struggle. Formalization exercises, which require the student to produce a formula corresponding to the natural language sentence, are a good candidate for scrutiny since they tap into the students' understanding of various aspects of logic. We correlate the difficulty of formalization exercises predic… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: 21 pages, 1 figure

  18. arXiv:2203.08274  [pdf, other

    cs.CL

    Non-neural Models Matter: A Re-evaluation of Neural Referring Expression Generation Systems

    Authors: Fahime Same, Guanyi Chen, Kees van Deemter

    Abstract: In recent years, neural models have often outperformed rule-based and classic Machine Learning approaches in NLG. These classic approaches are now often disregarded, for example when new neural models are evaluated. We argue that they should not be overlooked, since, for some tasks, well-designed non-neural approaches achieve better performance than neural ones. In this paper, the task of generati… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  19. arXiv:2109.07301  [pdf, other

    cs.CL

    What Vision-Language Models `See' when they See Scenes

    Authors: Michele Cafagna, Kees van Deemter, Albert Gatt

    Abstract: Images can be described in terms of the objects they contain, or in terms of the types of scene or place that they instantiate. In this paper we address to what extent pretrained Vision and Language models can learn to align descriptions of both types with images. We compare 3 state-of-the-art models, VisualBERT, LXMERT and CLIP. We find that (i) V&L models are susceptible to stylistic biases acqu… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

  20. arXiv:2108.06806  [pdf, other

    cs.CL

    What can Neural Referential Form Selectors Learn?

    Authors: Guanyi Chen, Fahime Same, Kees van Deemter

    Abstract: Despite achieving encouraging results, neural Referring Expression Generation models are often thought to lack transparency. We probed neural Referential Form Selection (RFS) models to find out to what extent the linguistic features influencing the RE form are learnt and captured by state-of-the-art RFS models. The results of 8 probing tasks show that all the defined features were learnt to some e… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: Long paper accepted at INLG 2021

  21. arXiv:2011.07398  [pdf, other

    cs.CL

    Lessons from Computational Modelling of Reference Production in Mandarin and English

    Authors: Guanyi Chen, Kees van Deemter

    Abstract: Referring expression generation (REG) algorithms offer computational models of the production of referring expressions. In earlier work, a corpus of referring expressions (REs) in Mandarin was introduced. In the present paper, we annotate this corpus, evaluate classic REG algorithms on it, and compare the results with earlier results on the evaluation of REG for English referring expressions. Next… ▽ More

    Submitted 15 August, 2021; v1 submitted 14 November, 2020; originally announced November 2020.

    Comments: Long paper accepted at INLG 2020

  22. arXiv:2005.07988  [pdf, other

    cs.CL

    A Text Reassembling Approach to Natural Language Generation

    Authors: Xiao Li, Kees van Deemter, Chenghua Lin

    Abstract: Recent years have seen a number of proposals for performing Natural Language Generation (NLG) based in large part on statistical techniques. Despite having many attractive features, we argue that these existing approaches nonetheless have some important drawbacks, sometimes because the approach in question is not fully statistical (i.e., relies on a certain amount of handcrafting), sometimes becau… ▽ More

    Submitted 25 December, 2020; v1 submitted 16 May, 2020; originally announced May 2020.

  23. What do you mean, BERT? Assessing BERT as a Distributional Semantics Model

    Authors: Timothee Mickus, Denis Paperno, Mathieu Constant, Kees van Deemter

    Abstract: Contextualized word embeddings, i.e. vector representations for words in context, are naturally seen as an extension of previous noncontextual distributional semantic models. In this work, we focus on BERT, a deep neural network that produces contextualized embeddings and has set the state-of-the-art in several semantic tasks, and study the semantic coherence of its embedding space. While showing… ▽ More

    Submitted 8 May, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Journal ref: Proceedings of the Society for Computation in Linguistics: Vol. 3 (2020), Article 34

  24. arXiv:1809.02494  [pdf, other

    cs.CL cs.AI

    Meteorologists and Students: A resource for language grounding of geographical descriptors

    Authors: Alejandro Ramos-Soto, Ehud Reiter, Kees van Deemter, Jose M. Alonso, Albert Gatt

    Abstract: We present a data resource which can be useful for research purposes on language grounding tasks in the context of geographical referring expression generation. The resource is composed of two data sets that encompass 25 different geographical descriptors and a set of associated graphical representations, drawn as polygons on a map by two groups of human subjects: teenage students and expert meteo… ▽ More

    Submitted 7 September, 2018; originally announced September 2018.

    Comments: Resource paper, 5 pages, 6 figures, 1 table. Conference: INLG 2018

  25. arXiv:1703.10429  [pdf, other

    cs.AI

    An Empirical Approach for Modeling Fuzzy Geographical Descriptors

    Authors: Alejandro Ramos-Soto, Jose M. Alonso, Ehud Reiter, Kees van Deemter, Albert Gatt

    Abstract: We present a novel heuristic approach that defines fuzzy geographical descriptors using data gathered from a survey with human subjects. The participants were asked to provide graphical interpretations of the descriptors `north' and `south' for the Galician region (Spain). Based on these interpretations, our approach builds fuzzy descriptors that are able to compute membership degrees for geograph… ▽ More

    Submitted 30 March, 2017; originally announced March 2017.

    Comments: Conference paper: Accepted for FUZZIEEE-2017. One column version for arXiv (8 pages)

  26. arXiv:cs/0312052  [pdf, ps, other

    cs.CL cs.AI

    Dialogue as Discourse: Controlling Global Properties of Scripted Dialogue

    Authors: Paul Piwek, Kees van Deemter

    Abstract: This paper explains why scripted dialogue shares some crucial properties with discourse. In particular, when scripted dialogues are generated by a Natural Language Generation system, the generator can apply revision strategies that cannot normally be used when the dialogue results from an interaction between autonomous agents (i.e., when the dialogue is not scripted). The paper explains that the… ▽ More

    Submitted 22 December, 2003; originally announced December 2003.

    Report number: ITRI-03-04 ACM Class: I.2.7

    Journal ref: Proceedings of AAAI Spring Symposium on Natural Language Generation in Spoken and Written Dialogue, Stanford, 2003

  27. arXiv:cs/0312051  [pdf, ps, other

    cs.CL cs.AI

    Towards Automated Generation of Scripted Dialogue: Some Time-Honoured Strategies

    Authors: Paul Piwek, Kees van Deemter

    Abstract: The main aim of this paper is to introduce automated generation of scripted dialogue as a worthwhile topic of investigation. In particular the fact that scripted dialogue involves two layers of communication, i.e., uni-directional communication between the author and the audience of a scripted dialogue and bi-directional pretended communication between the characters featuring in the dialogue, i… ▽ More

    Submitted 22 December, 2003; originally announced December 2003.

    Report number: ITRI-02-11 ACM Class: I.2.7

    Journal ref: Proceedings of EDILOG: 6th Workshop on the Semantics and Pragmatics of Dialogue, 2002, pp. 141-148