Skip to main content

Showing 1–50 of 53 results for author: Ilievski, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.00056  [pdf, other

    cs.CL cs.IR cs.LG cs.MM

    Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity

    Authors: Tygo Bloem, Filip Ilievski

    Abstract: Meme clustering is critical for toxicity detection, virality modeling, and typing, but it has received little attention in previous research. Clustering similar Internet memes is challenging due to their multimodality, cultural context, and adaptability. Existing approaches rely on databases, overlook semantics, and struggle to handle diverse dimensions of similarity. This paper introduces a novel… ▽ More

    Submitted 2 May, 2025; v1 submitted 30 April, 2025; originally announced May 2025.

    Journal ref: ICWSM 2025

  2. arXiv:2502.17422  [pdf, other

    cs.CV cs.AI cs.CL

    MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs

    Authors: Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski

    Abstract: Multimodal Large Language Models (MLLMs) have experienced rapid progress in visual recognition tasks in recent years. Given their potential integration into many critical applications, it is important to understand the limitations of their visual perception. In this work, we study whether MLLMs can perceive small visual details as effectively as large ones when answering questions about images. We… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: Published as a conference paper at ICLR 2025. Code at: https://github.com/saccharomycetes/mllms_know

  3. arXiv:2502.04352  [pdf, ps, other

    cs.CL cs.AI

    Investigating the Robustness of Deductive Reasoning with Large Language Models

    Authors: Fabian Hoppe, Filip Ilievski, Jan-Christoph Kalo

    Abstract: Large Language Models (LLMs) have been shown to achieve impressive results for many reasoning-based Natural Language Processing (NLP) tasks, suggesting a degree of deductive reasoning capability. However, it remains unclear to which extent LLMs, in both informal and autoformalisation methods, are robust on logical deduction tasks. Moreover, while many LLM-based deduction methods have been proposed… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

  4. arXiv:2501.05069  [pdf, other

    cs.CV cs.AI

    Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning

    Authors: Huabin Liu, Filip Ilievski, Cees G. M. Snoek

    Abstract: This paper proposes the first video-grounded entailment tree reasoning method for commonsense video question answering (VQA). Despite the remarkable progress of large visual-language models (VLMs), there are growing concerns that they learn spurious correlations between videos and likely answers, reinforced by their black-box nature and remaining benchmarking biases. Our method explicitly grounds… ▽ More

    Submitted 24 March, 2025; v1 submitted 9 January, 2025; originally announced January 2025.

    Comments: Accepted by CVPR 2025

  5. arXiv:2411.15626  [pdf, other

    cs.AI

    Aligning Generalisation Between Humans and Machines

    Authors: Filip Ilievski, Barbara Hammer, Frank van Harmelen, Benjamin Paassen, Sascha Saralajew, Ute Schmid, Michael Biehl, Marianna Bolognesi, Xin Luna Dong, Kiril Gashteovski, Pascal Hitzler, Giuseppe Marra, Pasquale Minervini, Martin Mundt, Axel-Cyrille Ngonga Ngomo, Alessandro Oltramari, Gabriella Pasi, Zeynep G. Saribatur, Luciano Serafini, John Shawe-Taylor, Vered Shwartz, Gabriella Skitalinskaya, Clemens Stachl, Gido M. van de Ven, Thomas Villmann

    Abstract: Recent advances in AI -- including generative approaches -- have resulted in technology that can support humans in scientific discovery and forming decisions, but may also disrupt democracies and target individuals. The responsible use of AI and its participation in human-AI teams increasingly shows the need for AI alignment, that is, to make AI systems act according to our preferences. A crucial… ▽ More

    Submitted 27 May, 2025; v1 submitted 23 November, 2024; originally announced November 2024.

  6. arXiv:2409.04053  [pdf, other

    cs.CV cs.AI

    COLUMBUS: Evaluating COgnitive Lateral Understanding through Multiple-choice reBUSes

    Authors: Koen Kraaijveld, Yifan Jiang, Kaixin Ma, Filip Ilievski

    Abstract: While visual question-answering (VQA) benchmarks have catalyzed the development of reasoning techniques, they have focused on vertical thinking. Effective problem-solving also necessitates lateral thinking, which remains understudied in AI and has not been used to test visual perception systems. To bridge this gap, we formulate visual lateral thinking as a multiple-choice question-answering task a… ▽ More

    Submitted 20 December, 2024; v1 submitted 6 September, 2024; originally announced September 2024.

    Comments: 15 pages, 10 figures, accepted to AAAI-25

  7. arXiv:2404.16068  [pdf, other

    cs.AI cs.CL cs.LG

    SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense

    Authors: Yifan Jiang, Filip Ilievski, Kaixin Ma

    Abstract: While vertical thinking relies on logical and commonsense reasoning, lateral thinking requires systems to defy commonsense associations and overwrite them through unconventional thinking. Lateral thinking has been shown to be challenging for current models but has received little attention. A recent benchmark, BRAINTEASER, aims to evaluate current models' lateral thinking ability in a zero-shot se… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  8. arXiv:2404.13591  [pdf, other

    cs.CV cs.LG

    MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning

    Authors: Yifan Jiang, Jiarui Zhang, Kexuan Sun, Zhivar Sourati, Kian Ahrabian, Kaixin Ma, Filip Ilievski, Jay Pujara

    Abstract: While multi-modal large language models (MLLMs) have shown significant progress on many popular visual reasoning benchmarks, whether they possess abstract visual reasoning abilities remains an open question. Similar to the Sudoku puzzles, abstract visual reasoning (AVR) problems require finding high-level patterns (e.g., repetition constraints) that control the input shapes (e.g., digits) in a spe… ▽ More

    Submitted 24 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  9. arXiv:2404.03624  [pdf, other

    cs.AI cs.SE

    Standardizing Knowledge Engineering Practices with a Reference Architecture

    Authors: Bradley P. Allen, Filip Ilievski

    Abstract: Knowledge engineering is the process of creating and maintaining knowledge-producing systems. Throughout the history of computer science and AI, knowledge engineering workflows have been widely used given the importance of high-quality knowledge for reliable intelligent agents. Meanwhile, the scope of knowledge engineering, as apparent from its target tasks and use cases, has been shifting, togeth… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 23 pages, 4 figures, 2 tables, camera-ready version, accepted for Transactions on Graph Data and Knowledge (TGDK)

  10. arXiv:2403.17426  [pdf, other

    cs.AI

    Knowledge-Powered Recommendation for an Improved Diet Water Footprint

    Authors: Saurav Joshi, Filip Ilievski, Jay Pujara

    Abstract: According to WWF, 1.1 billion people lack access to water, and 2.7 billion experience water scarcity at least one month a year. By 2025, two-thirds of the world's population may be facing water shortages. This highlights the urgency of managing water usage efficiently, especially in water-intensive sectors like food. This paper proposes a recommendation engine, powered by knowledge graphs, aiming… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 3 pages, 1 figure, AAAI'24

  11. arXiv:2402.07384  [pdf, other

    cs.CV cs.AI cs.LG

    Exploring Perceptual Limitation of Multimodal Large Language Models

    Authors: Jiarui Zhang, Jinyi Hu, Mahyar Khayatkhoei, Filip Ilievski, Maosong Sun

    Abstract: Multimodal Large Language Models (MLLMs) have recently shown remarkable perceptual capability in answering visual questions, however, little is known about the limits of their perception. In particular, while prior works have provided anecdotal evidence of MLLMs' sensitivity to object size, this phenomenon and its underlying causes have not been explored comprehensively. In this work, we quantitat… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: 14 pages, 14 figures, 3 tables

  12. arXiv:2311.11157  [pdf, other

    cs.SI cs.AI cs.IR

    Contextualizing Internet Memes Across Social Media Platforms

    Authors: Saurav Joshi, Filip Ilievski, Luca Luceri

    Abstract: Internet memes have emerged as a novel format for communication and expressing ideas on the web. Their fluidity and creative nature are reflected in their widespread use, often across platforms and occasionally for unethical or harmful purposes. While computational work has already analyzed their high-level virality over time and developed specialized classifiers for hate speech detection, there h… ▽ More

    Submitted 26 February, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: 10 pages, 7 figures, 2 tables

  13. arXiv:2311.06647  [pdf, other

    cs.CL

    Robust Text Classification: Analyzing Prototype-Based Networks

    Authors: Zhivar Sourati, Darshan Deshpande, Filip Ilievski, Kiril Gashteovski, Sascha Saralajew

    Abstract: Downstream applications often require text classification models to be accurate and robust. While the accuracy of the state-of-the-art Language Models (LMs) approximates human performance, they often exhibit a drop in performance on noisy data found in the real world. This lack of robustness can be concerning, as even small perturbations in the text, irrelevant to the target task, can cause classi… ▽ More

    Submitted 27 October, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

    Comments: Published at EMNLP Findings 2024

  14. arXiv:2310.16033  [pdf, other

    cs.CV cs.CL

    Towards Perceiving Small Visual Details in Zero-shot Visual Question Answering with Multimodal LLMs

    Authors: Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski

    Abstract: Multimodal Large Language Models (MLLMs) have recently achieved promising zero-shot accuracy on visual question answering (VQA) -- a fundamental task affecting various downstream applications and domains. Given the great potential for the broad use of these models, it is important to investigate their limitations in dealing with different image and question properties. In this work, we investigate… ▽ More

    Submitted 12 February, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 20 pages, 12 figures, 7 tables

  15. arXiv:2310.05057  [pdf, other

    cs.CL

    BRAINTEASER: Lateral Thinking Puzzles for Large Language Models

    Authors: Yifan Jiang, Filip Ilievski, Kaixin Ma, Zhivar Sourati

    Abstract: The success of language models has inspired the NLP community to attend to tasks that require implicit and complex reasoning, relying on human-like commonsense mechanisms. While such vertical thinking tasks have been relatively popular, lateral thinking puzzles have received little attention. To bridge this gap, we devise BRAINTEASER: a multiple-choice Question Answering task designed to test the… ▽ More

    Submitted 9 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  16. arXiv:2310.00996  [pdf, other

    cs.CL

    ARN: Analogical Reasoning on Narratives

    Authors: Zhivar Sourati, Filip Ilievski, Pia Sommerauer, Yifan Jiang

    Abstract: As a core cognitive skill that enables the transferability of information across domains, analogical reasoning has been extensively studied for both humans and computational models. However, while cognitive theories of analogy often focus on narratives and study the distinction between surface, relational, and system similarities, existing work in natural language processing has a narrower focus a… ▽ More

    Submitted 3 September, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  17. arXiv:2308.14391  [pdf, other

    cs.CV cs.CL

    FIRE: Food Image to REcipe generation

    Authors: Prateek Chhikara, Dhiraj Chaurasia, Yifan Jiang, Omkar Masur, Filip Ilievski

    Abstract: Food computing has emerged as a prominent multidisciplinary field of research in recent years. An ambitious goal of food computing is to develop end-to-end intelligent systems capable of autonomously producing recipe information for a food image. Current image-to-recipe methods are retrieval-based and their success depends heavily on the dataset size and diversity, as well as the quality of learne… ▽ More

    Submitted 12 May, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Published at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) -- 2024

  18. arXiv:2306.15124  [pdf, other

    cs.SE cs.AI

    Identifying and Consolidating Knowledge Engineering Requirements

    Authors: Bradley P. Allen, Filip Ilievski, Saurav Joshi

    Abstract: Knowledge engineering is the process of creating and maintaining knowledge-producing systems. Throughout the history of computer science and AI, knowledge engineering workflows have been widely used because high-quality knowledge is assumed to be crucial for reliable intelligent agents. However, the landscape of knowledge engineering has changed, presenting four challenges: unaddressed stakeholder… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  19. arXiv:2306.02520  [pdf, other

    cs.CL cs.AI cs.LG

    A Study of Situational Reasoning for Traffic Understanding

    Authors: Jiarui Zhang, Filip Ilievski, Kaixin Ma, Aravinda Kollaa, Jonathan Francis, Alessandro Oltramari

    Abstract: Intelligent Traffic Monitoring (ITMo) technologies hold the potential for improving road safety/security and for enabling smart city infrastructure. Understanding traffic situations requires a complex fusion of perceptual information with domain-specific and causal commonsense knowledge. Whereas prior work has provided benchmarks and methods for traffic monitoring, it remains unclear whether model… ▽ More

    Submitted 15 July, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: 11 pages, 6 figures, 5 tables, camera ready version of SIGKDD 2023

  20. arXiv:2306.00228  [pdf, other

    cs.CV cs.AI cs.CL

    Using Visual Cropping to Enhance Fine-Detail Question Answering of BLIP-Family Models

    Authors: Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski

    Abstract: Visual Question Answering is a challenging task, as it requires seamless interaction between perceptual, linguistic, and background knowledge systems. While the recent progress of visual and natural language models like BLIP has led to improved performance on this task, we lack understanding of the ability of such models to perform on different kinds of questions and reasoning types. As our initia… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 16 pages, 5 figures, 7 tables

  21. arXiv:2305.12280  [pdf, other

    cs.CL

    Contextualizing Argument Quality Assessment with Relevant Knowledge

    Authors: Darshan Deshpande, Zhivar Sourati, Filip Ilievski, Fred Morstatter

    Abstract: Automatic assessment of the quality of arguments has been recognized as a challenging task with significant implications for misinformation and targeted speech. While real-world arguments are tightly anchored in context, existing computational methods analyze their quality in isolation, which affects their accuracy and generalizability. We propose SPARK: a novel method for scoring argument quality… ▽ More

    Submitted 17 June, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted at NAACL 2024

  22. arXiv:2305.05091  [pdf, other

    cs.CL cs.AI cs.HC

    Knowledge-enhanced Agents for Interactive Text Games

    Authors: Prateek Chhikara, Jiarui Zhang, Filip Ilievski, Jonathan Francis, Kaixin Ma

    Abstract: Communication via natural language is a key aspect of machine intelligence, and it requires computational models to learn and reason about world concepts, with varying levels of supervision. Significant progress has been made on fully-supervised non-interactive tasks, such as question-answering and procedural text understanding. Yet, various sequential interactive tasks, as in text-based games, ha… ▽ More

    Submitted 16 December, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Published at K-CAP '23

  23. arXiv:2304.13867  [pdf, other

    cs.CL

    Transferring Procedural Knowledge across Commonsense Tasks

    Authors: Yifan Jiang, Filip Ilievski, Kaixin Ma

    Abstract: Stories about everyday situations are an essential part of human communication, motivating the need to develop AI agents that can reliably understand these stories. Despite the long list of supervised methods for story completion and procedural understanding, current AI has no mechanisms to automatically track and explain procedures in unseen stories. To bridge this gap, we study the ability of AI… ▽ More

    Submitted 19 November, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

  24. arXiv:2301.11879  [pdf, other

    cs.AI cs.CL

    Case-Based Reasoning with Language Models for Classification of Logical Fallacies

    Authors: Zhivar Sourati, Filip Ilievski, Hông-Ân Sandlin, Alain Mermoud

    Abstract: The ease and speed of spreading misinformation and propaganda on the Web motivate the need to develop trustworthy technology for detecting fallacies in natural language arguments. However, state-of-the-art language modeling methods exhibit a lack of robustness on tasks like logical fallacy classification that require complex reasoning. In this paper, we propose a Case-Based Reasoning method that c… ▽ More

    Submitted 17 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  25. arXiv:2212.07798  [pdf, other

    cs.CL cs.AI

    Utilizing Background Knowledge for Robust Reasoning over Traffic Situations

    Authors: Jiarui Zhang, Filip Ilievski, Aravinda Kollaa, Jonathan Francis, Kaixin Ma, Alessandro Oltramari

    Abstract: Understanding novel situations in the traffic domain requires an intricate combination of domain-specific and causal commonsense knowledge. Prior work has provided sufficient perception-based modalities for traffic monitoring, in this paper, we focus on a complementary research aspect of Intelligent Transportation: traffic understanding. We scope our study to text-based methods and datasets given… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: Camera ready version of AAAI 2023 workshop on Knowledge-Augmented Methods for Natural Language Processing

  26. arXiv:2212.07425  [pdf, other

    cs.CL cs.AI

    Robust and Explainable Identification of Logical Fallacies in Natural Language Arguments

    Authors: Zhivar Sourati, Vishnu Priya Prasanna Venkatesh, Darshan Deshpande, Himanshu Rawlani, Filip Ilievski, Hông-Ân Sandlin, Alain Mermoud

    Abstract: The spread of misinformation, propaganda, and flawed argumentation has been amplified in the Internet era. Given the volume of data and the subtlety of identifying violations of argumentation norms, supporting information analytics tasks, like content moderation, with trustworthy methods that can identify logical fallacies is essential. In this paper, we formalize prior theoretical work on logical… ▽ More

    Submitted 25 September, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

  27. arXiv:2212.05613  [pdf, other

    cs.CL cs.AI

    A Study of Slang Representation Methods

    Authors: Aravinda Kolla, Filip Ilievski, Hông-Ân Sandlin, Alain Mermoud

    Abstract: Considering the large amount of content created online by the minute, slang-aware automatic tools are critically needed to promote social good, and assist policymakers and moderators in restricting the spread of offensive language, abuse, and hate speech. Despite the success of large language models and the spontaneous emergence of slang dictionaries, it is unclear how far their combination goes i… ▽ More

    Submitted 31 January, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

  28. arXiv:2212.05612  [pdf, other

    cs.AI cs.CL cs.LG

    Multimodal and Explainable Internet Meme Classification

    Authors: Abhinav Kumar Thakur, Filip Ilievski, Hông-Ân Sandlin, Zhivar Sourati, Luca Luceri, Riccardo Tommasini, Alain Mermoud

    Abstract: In the current context where online platforms have been effectively weaponized in a variety of geo-political events and social issues, Internet memes make fair content moderation at scale even more difficult. Existing work on meme classification and tracking has focused on black-box methods that do not explicitly consider the semantics of the memes or the context of their creation. In this paper,… ▽ More

    Submitted 6 April, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

  29. arXiv:2211.01562  [pdf, other

    cs.CL

    PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales

    Authors: Peifeng Wang, Aaron Chan, Filip Ilievski, Muhao Chen, Xiang Ren

    Abstract: Neural language models (LMs) have achieved impressive results on various language-based reasoning tasks by utilizing latent knowledge encoded in their own pretrained parameters. To make this reasoning process more explicit, recent works retrieve a rationalizing LM's internal knowledge by training or prompting it to generate free-text rationales, which can be used to guide task predictions made by… ▽ More

    Submitted 6 April, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: 17 pages, 5 figures. Accepted to ICLR 2023

  30. arXiv:2210.00620  [pdf, other

    cs.AI cs.CL

    Does Wikidata Support Analogical Reasoning?

    Authors: Filip Ilievski, Jay Pujara, Kartik Shenoy

    Abstract: Analogical reasoning methods have been built over various resources, including commonsense knowledge bases, lexical resources, language models, or their combination. While the wide coverage of knowledge about entities and events make Wikidata a promising resource for analogical reasoning across situations and domains, Wikidata has not been employed for this task yet. In this paper, we investigate… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

  31. arXiv:2208.12848  [pdf, other

    cs.CL

    Coalescing Global and Local Information for Procedural Text Understanding

    Authors: Kaixin Ma, Filip Ilievski, Jonathan Francis, Eric Nyberg, Alessandro Oltramari

    Abstract: Procedural text understanding is a challenging language reasoning task that requires models to track entity states across the development of a narrative. A complete procedural understanding solution should combine three core aspects: local and global views of the inputs, and global view of outputs. Prior methods considered a subset of these aspects, resulting in either low precision or low recall.… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: COLING 2022

  32. arXiv:2207.00143  [pdf, other

    cs.AI

    Enriching Wikidata with Linked Open Data

    Authors: Bohui Zhang, Filip Ilievski, Pedro Szekely

    Abstract: Large public knowledge graphs, like Wikidata, contain billions of statements about tens of millions of entities, thus inspiring various use cases to exploit such knowledge graphs. However, practice shows that much of the relevant information that fits users' needs is still missing in Wikidata, while current linked open data (LOD) tools are not suitable to enrich large graphs like Wikidata. In this… ▽ More

    Submitted 8 August, 2022; v1 submitted 30 June, 2022; originally announced July 2022.

  33. arXiv:2206.07167  [pdf, other

    cs.AI cs.CL

    Understanding Narratives through Dimensions of Analogy

    Authors: Thiloshon Nagarajah, Filip Ilievski, Jay Pujara

    Abstract: Analogical reasoning is a powerful qualitative reasoning tool that enables humans to connect two situations, and to generalize their knowledge from familiar to novel situations. Cognitive Science research provides valuable insights into the richness and complexity of analogical reasoning, together with implementations of expressive analogical reasoners with limited scalability. Modern scalable AI… ▽ More

    Submitted 27 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Journal ref: IJCAI 2022 workshop on Qualitative Reasoning

  34. arXiv:2205.10661  [pdf, other

    cs.CL cs.AI

    An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs

    Authors: Jiarui Zhang, Filip Ilievski, Kaixin Ma, Jonathan Francis, Alessandro Oltramari

    Abstract: Self-supervision based on the information extracted from large knowledge graphs has been shown to improve the generalization of language models, in zero-shot evaluation on various downstream language reasoning tasks. Since these improvements are reported in aggregate, however, little is known about (i) how to select the appropriate knowledge for solid performance across tasks, (ii) how to combine… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

  35. arXiv:2203.13965  [pdf, other

    cs.AI

    Augmenting Knowledge Graphs for Better Link Prediction

    Authors: Jiang Wang, Filip Ilievski, Pedro Szekely, Ke-Thia Yao

    Abstract: Embedding methods have demonstrated robust performance on the task of link prediction in knowledge graphs, by mostly encoding entity relationships. Recent methods propose to enhance the loss function with a literal-aware term. In this paper, we propose KGA: a knowledge graph augmentation method that incorporates literals in an embedding model without modifying its loss function. KGA discretizes qu… ▽ More

    Submitted 24 April, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

  36. Generalizable Neuro-symbolic Systems for Commonsense Question Answering

    Authors: Alessandro Oltramari, Jonathan Francis, Filip Ilievski, Kaixin Ma, Roshanak Mirzaee

    Abstract: This chapter illustrates how suitable neuro-symbolic models for language understanding can enable domain generalizability and robustness in downstream tasks. Different methods for integrating neural language models and knowledge graphs are discussed. The situations in which this combination is most appropriate are characterized, including quantitative evaluation and qualitative error analysis on a… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: In Pascal Hitzler, Md Kamruzzaman Sarker (eds.), Neuro-Symbolic Artificial Intelligence: The State of the Art. Frontiers in Artificial Intelligence and Applications Vol. 342, IOS Press, Amsterdam, 2022. arXiv admin note: text overlap with arXiv:2003.04707

  37. arXiv:2112.06318  [pdf, other

    cs.CL

    Contextualized Scene Imagination for Generative Commonsense Reasoning

    Authors: PeiFeng Wang, Jonathan Zamora, Junfeng Liu, Filip Ilievski, Muhao Chen, Xiang Ren

    Abstract: Humans use natural language to compose common concepts from their environment into plausible, day-to-day scene descriptions. However, such generative commonsense reasoning (GCSR) skills are lacking in state-of-the-art text generation methods. Descriptive sentences about arbitrary concepts generated by neural text generation models (e.g., pre-trained text-to-text Transformers) are often grammatical… ▽ More

    Submitted 7 March, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

    Comments: Accepted by ICLR 2022

  38. arXiv:2109.02837  [pdf, other

    cs.CL

    Exploring Strategies for Generalizable Commonsense Reasoning with Pre-trained Models

    Authors: Kaixin Ma, Filip Ilievski, Jonathan Francis, Satoru Ozaki, Eric Nyberg, Alessandro Oltramari

    Abstract: Commonsense reasoning benchmarks have been largely solved by fine-tuning language models. The downside is that fine-tuning may cause models to overfit to task-specific data and thereby forget their knowledge gained during pre-training. Recent works only propose lightweight model updates as models may already possess useful knowledge from past experience, but a challenge remains in understanding wh… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  39. arXiv:2108.07119  [pdf, ps, other

    cs.AI

    Creating and Querying Personalized Versions of Wikidata on a Laptop

    Authors: Hans Chalupsky, Pedro Szekely, Filip Ilievski, Daniel Garijo, Kartik Shenoy

    Abstract: Application developers today have three choices for exploiting the knowledge present in Wikidata: they can download the Wikidata dumps in JSON or RDF format, they can use the Wikidata API to get data about individual entities, or they can use the Wikidata SPARQL endpoint. None of these methods can support complex, yet common, query use cases, such as retrieval of large amounts of data or aggregati… ▽ More

    Submitted 18 August, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

    ACM Class: H.3.3; I.2

  40. arXiv:2108.05412  [pdf, ps, other

    cs.AI

    Analyzing Race and Country of Citizenship Bias in Wikidata

    Authors: Zaina Shaik, Filip Ilievski, Fred Morstatter

    Abstract: As an open and collaborative knowledge graph created by users and bots, it is possible that the knowledge in Wikidata is biased in regards to multiple factors such as gender, race, and country of citizenship. Previous work has mostly studied the representativeness of Wikidata knowledge in terms of genders of people. In this paper, we examine the race and citizenship bias in general and in regards… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

  41. arXiv:2108.05410  [pdf, other

    cs.AI

    User-friendly Comparison of Similarity Algorithms on Wikidata

    Authors: Filip Ilievski, Pedro Szekely, Gleb Satyukov, Amandeep Singh

    Abstract: While the similarity between two concept words has been evaluated and studied for decades, much less attention has been devoted to algorithms that can compute the similarity of nodes in very large knowledge graphs, like Wikidata. To facilitate investigations and head-to-head comparisons of similarity algorithms on Wikidata, we present a user-friendly interface that allows flexible computation of s… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

  42. arXiv:2107.00156  [pdf, other

    cs.AI

    A Study of the Quality of Wikidata

    Authors: Kartik Shenoy, Filip Ilievski, Daniel Garijo, Daniel Schwabe, Pedro Szekely

    Abstract: Wikidata has been increasingly adopted by many communities for a wide variety of applications, which demand high-quality knowledge to deliver successful results. In this paper, we develop a framework to detect and analyze low-quality statements in Wikidata by shedding light on the current practices exercised by the community. We explore three indicators of data quality in Wikidata, based on: 1) co… ▽ More

    Submitted 18 November, 2021; v1 submitted 30 June, 2021; originally announced July 2021.

    Comments: 12 pages

    Journal ref: Journal of Web Semantics, Special issue on Community-Based Knowledge Bases, 2021

  43. arXiv:2106.11533  [pdf, other

    cs.CL cs.AI

    Do Language Models Perform Generalizable Commonsense Inference?

    Authors: Peifeng Wang, Filip Ilievski, Muhao Chen, Xiang Ren

    Abstract: Inspired by evidence that pretrained language models (LMs) encode commonsense knowledge, recent work has applied LMs to automatically populate commonsense knowledge graphs (CKGs). However, there is a lack of understanding on their generalization to multiple CKGs, unseen relations, and novel entities. This paper analyzes the ability of LMs to perform generalizable commonsense inference, in terms of… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: 8 pages, 4 figures. Accepted to ACL'21 Findings

  44. arXiv:2104.08712  [pdf, other

    cs.CL

    PaCo: Preconditions Attributed to Commonsense Knowledge

    Authors: Ehsan Qasemi, Filip Ilievski, Muhao Chen, Pedro Szekely

    Abstract: Humans can seamlessly reason with circumstantial preconditions of commonsense knowledge. We understand that a glass is used for drinking water, unless the glass is broken or the water is toxic. Despite state-of-the-art (SOTA) language models' (LMs) impressive performance on inferring commonsense knowledge, it is unclear whether they understand the circumstantial preconditions. To address this gap,… ▽ More

    Submitted 13 August, 2023; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: EMNLP 2022 (Findings)

  45. arXiv:2103.13136  [pdf, other

    cs.CL cs.AI cs.LG

    Representing Numbers in NLP: a Survey and a Vision

    Authors: Avijit Thawani, Jay Pujara, Pedro A. Szekely, Filip Ilievski

    Abstract: NLP systems rarely give special consideration to numbers found in text. This starkly contrasts with the consensus in neuroscience that, in the brain, numbers are represented differently from words. We arrange recent NLP work on numeracy into a comprehensive taxonomy of tasks and methods. We break down the subjective notion of numeracy into 7 subtasks, arranged along two dimensions: granularity (ex… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted at NAACL 2021

    ACM Class: I.2.7

  46. arXiv:2101.04640  [pdf, other

    cs.AI

    Dimensions of Commonsense Knowledge

    Authors: Filip Ilievski, Alessandro Oltramari, Kaixin Ma, Bin Zhang, Deborah L. McGuinness, Pedro Szekely

    Abstract: Commonsense knowledge is essential for many AI applications, including those in natural language processing, visual processing, and planning. Consequently, many sources that include commonsense knowledge have been designed and constructed over the past decades. Recently, the focus has been on large text-based sources, which facilitate easier integration with neural (language) models and applicatio… ▽ More

    Submitted 29 July, 2021; v1 submitted 12 January, 2021; originally announced January 2021.

    Journal ref: Knowledge-Based Systems 2021

  47. arXiv:2012.11490  [pdf, other

    cs.AI cs.SI

    CSKG: The CommonSense Knowledge Graph

    Authors: Filip Ilievski, Pedro Szekely, Bin Zhang

    Abstract: Sources of commonsense knowledge support applications in natural language understanding, computer vision, and knowledge graphs. Given their complementarity, their integration is desired. Yet, their different foci, modeling approaches, and sparse overlap make integration difficult. In this paper, we consolidate commonsense knowledge by following five principles, which we apply to combine seven key… ▽ More

    Submitted 22 March, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2006.06114

    Journal ref: ESWC 2021 Resource Track

  48. arXiv:2011.03863  [pdf, other

    cs.CL cs.AI

    Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

    Authors: Kaixin Ma, Filip Ilievski, Jonathan Francis, Yonatan Bisk, Eric Nyberg, Alessandro Oltramari

    Abstract: Recent developments in pre-trained neural language modeling have led to leaps in accuracy on commonsense question-answering benchmarks. However, there is increasing concern that models overfit to specific tasks, without learning to utilize external knowledge or perform general semantic reasoning. In contrast, zero-shot evaluations have shown promise as a more robust measure of a model's general re… ▽ More

    Submitted 14 December, 2020; v1 submitted 7 November, 2020; originally announced November 2020.

    Comments: AAAI 2021

  49. arXiv:2008.08114  [pdf, other

    cs.AI

    Commonsense Knowledge in Wikidata

    Authors: Filip Ilievski, Pedro Szekely, Daniel Schwabe

    Abstract: Wikidata and Wikipedia have been proven useful for reason-ing in natural language applications, like question answering or entitylinking. Yet, no existing work has studied the potential of Wikidata for commonsense reasoning. This paper investigates whether Wikidata con-tains commonsense knowledge which is complementary to existing commonsense sources. Starting from a definition of common sense, we… ▽ More

    Submitted 15 October, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: WikiData Workshop at ISWC 2020

  50. arXiv:2006.06114  [pdf, other

    cs.AI cs.CL

    Consolidating Commonsense Knowledge

    Authors: Filip Ilievski, Pedro Szekely, Jingwei Cheng, Fu Zhang, Ehsan Qasemi

    Abstract: Commonsense reasoning is an important aspect of building robust AI systems and is receiving significant attention in the natural language understanding, computer vision, and knowledge graphs communities. At present, a number of valuable commonsense knowledge sources exist, with different foci, strengths, and weaknesses. In this paper, we list representative sources and their properties. Based on t… ▽ More

    Submitted 22 June, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: 14 pages