Skip to main content

Showing 1–11 of 11 results for author: Gkoumas, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.13984  [pdf, other

    cs.CL cs.MM

    Less for More: Enhanced Feedback-aligned Mixed LLMs for Molecule Caption Generation and Fine-Grained NLI Evaluation

    Authors: Dimitris Gkoumas, Maria Liakata

    Abstract: Scientific language models drive research innovation but require extensive fine-tuning on large datasets. This work enhances such models by improving their inference and evaluation capabilities with minimal or no additional training. Focusing on molecule caption generation, we explore post-training synergies between alignment fine-tuning and model merging in a cross-modal setup. We reveal intrigui… ▽ More

    Submitted 26 May, 2025; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: ACL25 Main

  2. arXiv:2405.08619  [pdf, other

    cs.CL cs.MM

    ALMol: Aligned Language-Molecule Translation LLMs through Offline Preference Contrastive Optimisation

    Authors: Dimitris Gkoumas

    Abstract: The field of chemistry and Artificial Intelligence (AI) intersection is an area of active research that aims to accelerate scientific discovery. The integration of large language models (LLMs) with scientific modalities has shown significant promise in this endeavour. However, challenges persist in effectively addressing training efficacy and the out-of-distribution problem, particularly as existi… ▽ More

    Submitted 15 July, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  3. arXiv:2310.09897  [pdf, other

    cs.CL

    Reformulating NLP tasks to Capture Longitudinal Manifestation of Language Disorders in People with Dementia

    Authors: Dimitris Gkoumas, Matthew Purver, Maria Liakata

    Abstract: Dementia is associated with language disorders which impede communication. Here, we automatically learn linguistic disorder patterns by making use of a moderately-sized pre-trained language model and forcing it to focus on reformulated natural language processing (NLP) tasks and associated linguistic patterns. Our experiments show that NLP tasks that encapsulate contextual information and enhance… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: It has been accepted to appear at EMNLP23

  4. arXiv:2310.09623  [pdf, other

    cs.CL

    A Digital Language Coherence Marker for Monitoring Dementia

    Authors: Dimitris Gkoumas, Adam Tsakalidis, Maria Liakata

    Abstract: The use of spontaneous language to derive appropriate digital markers has become an emergent, promising and non-intrusive method to diagnose and monitor dementia. Here we propose methods to capture language coherence as a cost-effective, human-interpretable digital marker for monitoring cognitive changes in people with dementia. We introduce a novel task to learn the temporal logical consistency o… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: It has been accepted to appear at EMNLP23

  5. arXiv:2109.01537  [pdf, other

    cs.CL cs.AI cs.DB cs.MM

    A Longitudinal Multi-modal Dataset for Dementia Monitoring and Diagnosis

    Authors: Dimitris Gkoumas, Bo Wang, Adam Tsakalidis, Maria Wolters, Arkaitz Zubiaga, Matthew Purver, Maria Liakata

    Abstract: Dementia affects cognitive functions of adults, including memory, language, and behaviour. Standard diagnostic biomarkers such as MRI are costly, whilst neuropsychological tests suffer from sensitivity issues in detecting dementia onset. The analysis of speech and language has emerged as a promising and non-intrusive technology to diagnose and monitor dementia. Currently, most work in this directi… ▽ More

    Submitted 23 December, 2023; v1 submitted 3 September, 2021; originally announced September 2021.

  6. arXiv:2103.10572  [pdf, other

    cs.MM

    Quantum-inspired Multimodal Fusion for Video Sentiment Analysis

    Authors: Qiuchi Li, Dimitris Gkoumas, Christina Lioma, Massimo Melucci

    Abstract: We tackle the crucial challenge of fusing different modalities of features for multimodal sentiment analysis. Mainly based on neural networks, existing approaches largely model multimodal interactions in an implicit and hard-to-understand manner. We address this limitation with inspirations from quantum theory, which contains principled methods for modeling complicated interactions and correlation… ▽ More

    Submitted 22 March, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: Post-print accepted by Information Fusion

  7. arXiv:2101.04406  [pdf, other

    cs.CL cs.AI

    Quantum Cognitively Motivated Decision Fusion for Video Sentiment Analysis

    Authors: Dimitris Gkoumas, Qiuchi Li, Shahram Dehdashti, Massimo Melucci, Yijun Yu, Dawei Song

    Abstract: Video sentiment analysis as a decision-making process is inherently complex, involving the fusion of decisions from multiple modalities and the so-caused cognitive biases. Inspired by recent advances in quantum cognition, we show that the sentiment judgment from one modality could be incompatible with the judgment from another, i.e., the order matters and they cannot be jointly measured to produce… ▽ More

    Submitted 18 May, 2021; v1 submitted 12 January, 2021; originally announced January 2021.

    Comments: The uploaded version is a preprint of the accepted AAAI-21 paper

  8. arXiv:2007.04357  [pdf, other

    cs.IR

    A Survey of Quantum Theory Inspired Approaches to Information Retrieval

    Authors: Sagar Uprety, Dimitris Gkoumas, Dawei Song

    Abstract: Since 2004, researchers have been using the mathematical framework of Quantum Theory (QT) in Information Retrieval (IR). QT offers a generalized probability and logic framework. Such a framework has been shown capable of unifying the representation, ranking and user cognitive aspects of IR, and helpful in developing more dynamic, adaptive and context-aware IR systems. Although Quantum-inspired IR… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: Accepted for publication at ACM Computing Surveys on May 20, 2020

  9. arXiv:1811.11422  [pdf, other

    cs.IR

    Exploiting "Quantum-like Interference" in Decision Fusion for Ranking Multimodal Documents

    Authors: Dimitris Gkoumas, Dawei Sogn

    Abstract: Fusing and ranking multimodal information remains always a challenging task. A robust decision-level fusion method should not only be dynamically adaptive for assigning weights to each representation but also incorporate inter-relationships among different modalities. In this paper, we propose a quantum-inspired model for fusing and ranking visual and textual information accounting for the depende… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.

  10. arXiv:1811.06645  [pdf, ps, other

    cs.IR

    Investigating Bell Inequalities for Multidimensional Relevance Judgments in Information Retrieval

    Authors: Sagar Uprety, Dimitris Gkoumas, Dawei Song

    Abstract: Relevance judgment in Information Retrieval is influenced by multiple factors. These include not only the topicality of the documents but also other user oriented factors like trust, user interest, etc. Recent works have identified these various factors into seven dimensions of relevance. In a previous work, these relevance dimensions were quantified and user's cognitive state with respect to a do… ▽ More

    Submitted 13 March, 2019; v1 submitted 15 November, 2018; originally announced November 2018.

    Comments: 11th Quantum Interaction Conference, Nice, France

  11. arXiv:1810.11303  [pdf, other

    cs.IR

    Investigating non-classical correlations between decision fused multi-modal documents

    Authors: Dimitris Gkoumas, Sagar Uprety, Dawei Song

    Abstract: Correlation has been widely used to facilitate various information retrieval methods such as query expansion, relevance feedback, document clustering, and multi-modal fusion. Especially, correlation and independence are important issues when fusing different modalities that influence a multi-modal information retrieval process. The basic idea of correlation is that an observable can help predict o… ▽ More

    Submitted 26 October, 2018; originally announced October 2018.