Skip to main content

Showing 1–37 of 37 results for author: Hakimov, S

.
  1. arXiv:2505.14425  [pdf, other

    cs.CL

    From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning

    Authors: Chalamalasetti Kranti, Sherzod Hakimov, David Schlangen

    Abstract: Instruction-tuned large language models (LLMs) have shown strong performance on a variety of tasks; however, generalizing from synthetic to human-authored instructions in grounded environments remains a challenge for them. In this work, we study generalization challenges in spatial grounding tasks where models interpret and translate instructions for building object arrangements on a $2.5$D grid.… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 4 pages

  2. arXiv:2505.05445  [pdf, other

    cs.CL

    clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations

    Authors: Chalamalasetti Kranti, Sherzod Hakimov, David Schlangen

    Abstract: The emergence of instruction-tuned large language models (LLMs) has advanced the field of dialogue systems, enabling both realistic user simulations and robust multi-turn conversational agents. However, existing research often evaluates these components in isolation-either focusing on a single user simulator or a specific system design-limiting the generalisability of insights across architectures… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: 30 pages

  3. arXiv:2504.08590  [pdf, other

    cs.CL

    Playpen: An Environment for Exploring Learning Through Conversational Interaction

    Authors: Nicola Horst, Davide Mazzaccara, Antonia Schmidt, Michael Sullivan, Filippo Momentè, Luca Franceschetti, Philipp Sadler, Sherzod Hakimov, Alberto Testoni, Raffaella Bernardi, Raquel Fernández, Alexander Koller, Oliver Lemon, David Schlangen, Mario Giulianelli, Alessandro Suglia

    Abstract: Interaction between learner and feedback-giver has come into focus recently for post-training of Large Language Models (LLMs), through the use of reward models that judge the appropriateness of a model's response. In this paper, we investigate whether Dialogue Games -- goal-directed and rule-governed activities driven predominantly by verbal actions -- can also serve as a source of feedback signal… ▽ More

    Submitted 23 May, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: Source code: https://github.com/lm-playpen/playpen Please send correspodence to: [email protected]

  4. arXiv:2502.11733  [pdf, ps, other

    cs.CL

    Plant in Cupboard, Orange on Rably, Inat Aphone. Benchmarking Incremental Learning of Situation and Language Model using a Text-Simulated Situated Environment

    Authors: Jonathan Jordan, Sherzod Hakimov, David Schlangen

    Abstract: Large Language Models (LLMs) serve not only as chatbots but as key components in agent systems, where their common-sense knowledge significantly impacts performance as language-based planners for situated or embodied action. We assess LLMs' incremental learning (based on feedback from the environment), and controlled in-context learning abilities using a text-based environment. We introduce challe… ▽ More

    Submitted 27 June, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Accepted at The 28th International Conference of Text, Speech and Dialogue (TSD2025)

  5. arXiv:2502.11707  [pdf, ps, other

    cs.CL

    Ad-hoc Concept Forming in the Game Codenames as a Means for Evaluating Large Language Models

    Authors: Sherzod Hakimov, Lara Pfennigschmidt, David Schlangen

    Abstract: This study utilizes the game Codenames as a benchmarking tool to evaluate large language models (LLMs) with respect to specific linguistic and cognitive skills. LLMs play each side of the game, where one side generates a clue word covering several target words and the other guesses those target words. We designed various experiments by controlling the choice of words (abstract vs. concrete words,… ▽ More

    Submitted 25 June, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Accepted at GemBench workshop co-located with ACL 2025

  6. arXiv:2409.11041  [pdf, other

    cs.CL

    Towards No-Code Programming of Cobots: Experiments with Code Synthesis by Large Code Models for Conversational Programming

    Authors: Chalamalasetti Kranti, Sherzod Hakimov, David Schlangen

    Abstract: While there has been a lot of research recently on robots in household environments, at the present time, most robots in existence can be found on shop floors, and most interactions between humans and robots happen there. ``Collaborative robots'' (cobots) designed to work alongside humans on assembly lines traditionally require expert programming, limiting ability to make changes, or manual guidan… ▽ More

    Submitted 18 September, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

  7. arXiv:2407.01384  [pdf, ps, other

    cs.CL

    Free-text Rationale Generation under Readability Level Control

    Authors: Yi-Sheng Hsu, Nils Feldhus, Sherzod Hakimov

    Abstract: Free-text rationales justify model decisions in natural language and thus become likable and accessible among approaches to explanation across many tasks. However, their effectiveness can be hindered by misinterpretation and hallucination. As a perturbation test, we investigate how large language models (LLMs) perform rationale generation under the effects of readability level control, i.e., being… ▽ More

    Submitted 3 June, 2025; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: ACL 2025 Workshop on Generation, Evaluation, and Metrics (GEM^2)

  8. arXiv:2406.17553  [pdf, other

    cs.CL

    Retrieval-Augmented Code Generation for Situated Action Generation: A Case Study on Minecraft

    Authors: Chalamalasetti Kranti, Sherzod Hakimov, David Schlangen

    Abstract: In the Minecraft Collaborative Building Task, two players collaborate: an Architect (A) provides instructions to a Builder (B) to assemble a specified structure using 3D blocks. In this work, we investigate the use of large language models (LLMs) to predict the sequence of actions taken by the Builder. Leveraging LLMs' in-context learning abilities, we use few-shot prompting techniques, that signi… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: under review

  9. arXiv:2406.14051  [pdf, other

    cs.CL cs.AI

    How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics

    Authors: Nidhir Bhavsar, Jonathan Jordan, Sherzod Hakimov, David Schlangen

    Abstract: What makes a good Large Language Model (LLM)? That it performs well on the relevant benchmarks -- which hopefully measure, with some validity, the presence of capabilities that are also challenged in real application. But what makes the model perform well? What gives a model its abilities? We take a recently introduced type of benchmark that is meant to challenge capabilities in a goal-directed, a… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: under review

  10. arXiv:2406.14035  [pdf, other

    cs.CL cs.AI

    Using Game Play to Investigate Multimodal and Conversational Grounding in Large Multimodal Models

    Authors: Sherzod Hakimov, Yerkezhan Abdullayeva, Kushal Koshti, Antonia Schmidt, Yan Weiser, Anne Beyer, David Schlangen

    Abstract: While the situation has improved for text-only models, it again seems to be the case currently that multimodal (text and image) models develop faster than ways to evaluate them. In this paper, we bring a recently developed evaluation paradigm from text models to multimodal models, namely evaluation through the goal-oriented game (self) play, complementing reference-based and preference-based evalu… ▽ More

    Submitted 11 December, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted at COLING 2025

  11. arXiv:2405.20859  [pdf, other

    cs.CL cs.AI

    clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework for LLMs as Multi-Action Agents

    Authors: Anne Beyer, Kranti Chalamalasetti, Sherzod Hakimov, Brielen Madureira, Philipp Sadler, David Schlangen

    Abstract: It has been established in recent work that Large Language Models (LLMs) can be prompted to "self-play" conversational games that probe certain capabilities (general instruction following, strategic goal orientation, language understanding abilities), where the resulting interactive game play can be automatically scored. In this paper, we take one of the proposed frameworks for setting up such gam… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: under review

  12. arXiv:2404.01753  [pdf, other

    cs.CL

    M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets

    Authors: Gaurish Thakkar, Sherzod Hakimov, Marko Tadić

    Abstract: In recent years, multimodal natural language processing, aimed at learning from diverse data types, has garnered significant attention. However, there needs to be more clarity when it comes to analysing multimodal tasks in multi-lingual contexts. While prior studies on sentiment analysis of tweets have predominantly focused on the English language, this paper addresses this gap by transforming an… ▽ More

    Submitted 12 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Journal ref: LREC-COLING 2024 - The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation

  13. arXiv:2403.17497  [pdf, other

    cs.CL cs.CV

    Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies

    Authors: Philipp Sadler, Sherzod Hakimov, David Schlangen

    Abstract: In collaborative goal-oriented settings, the participants are not only interested in achieving a successful outcome, but do also implicitly negotiate the effort they put into the interaction (by adapting to each other). In this work, we propose a challenging interactive reference game that requires two players to coordinate on vision and language observations. The learning signal in this game is a… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 9 pages, Accepted at LREC-COLING 2024

  14. arXiv:2402.04824  [pdf, other

    cs.CL

    Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game

    Authors: Philipp Sadler, Sherzod Hakimov, David Schlangen

    Abstract: Albrecht and Stone (2018) state that modeling of changing behaviors remains an open problem "due to the essentially unconstrained nature of what other agents may do". In this work we evaluate the adaptability of neural artificial agents towards assumed partner behaviors in a collaborative reference game. In this game success is achieved when a knowledgeable Guide can verbally lead a Follower to th… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Work presented at the "Cooperative Multi-Agent Systems Decision-making and Learning" workshop (AAAI'24)

  15. arXiv:2306.12886  [pdf, other

    cs.CL cs.DL

    Unveiling Global Narratives: A Multilingual Twitter Dataset of News Media on the Russo-Ukrainian Conflict

    Authors: Sherzod Hakimov, Gullal S. Cheema

    Abstract: The ongoing Russo-Ukrainian conflict has been a subject of intense media coverage worldwide. Understanding the global narrative surrounding this topic is crucial for researchers that aim to gain insights into its multifaceted dimensions. In this paper, we present a novel multimedia dataset that focuses on this topic by collecting and processing tweets posted by news or media companies on social me… ▽ More

    Submitted 7 April, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: ICMR 2024

    Journal ref: ICMR 2024 - ACM International Conference on Multimedia Retrieval 2024

  16. arXiv:2305.18599  [pdf, other

    cs.CL cs.IR cs.LG cs.MM

    Improving Generalization for Multimodal Fake News Detection

    Authors: Sahar Tahmasebi, Sherzod Hakimov, Ralph Ewerth, Eric Müller-Budack

    Abstract: The increasing proliferation of misinformation and its alarming impact have motivated both industry and academia to develop approaches for fake news detection. However, state-of-the-art approaches are usually trained on datasets of smaller size or with a limited set of specific topics. As a consequence, these models lack generalization capabilities and are not applicable to real-world data. In thi… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: This paper has been accepted for ICMR 2023

  17. arXiv:2305.13782  [pdf, other

    cs.CL

    Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks

    Authors: Sherzod Hakimov, David Schlangen

    Abstract: Large language models have demonstrated robust performance on various language tasks using zero-shot or few-shot learning paradigms. While being actively researched, multimodal models that can additionally handle images as input have yet to catch up in size and generality with language-only models. In this work, we ask whether language-only models can be utilised for tasks that require visual inpu… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 Findings

  18. arXiv:2305.13455  [pdf, other

    cs.CL

    Clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents

    Authors: Kranti Chalamalasetti, Jana Götze, Sherzod Hakimov, Brielen Madureira, Philipp Sadler, David Schlangen

    Abstract: Recent work has proposed a methodology for the systematic evaluation of "Situated Language Understanding Agents"-agents that operate in rich linguistic and non-linguistic contexts-through testing them in carefully constructed interactive settings. Other recent work has argued that Large Language Models (LLMs), if suitably set up, can be understood as (simulators of) such agents. A connection sugge… ▽ More

    Submitted 23 November, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  19. arXiv:2305.12880  [pdf, other

    cs.CV cs.CL

    Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers

    Authors: Philipp Sadler, Sherzod Hakimov, David Schlangen

    Abstract: The ability to pick up on language signals in an ongoing interaction is crucial for future machine learning models to collaborate and interact with humans naturally. In this paper, we present an initial study that evaluates intra-episodic feedback given in a collaborative setting. We use a referential language game as a controllable example of a task-oriented collaborative joint activity. A teache… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 5 pages, Accepted at Findings of ACL 2023

  20. arXiv:2211.08042  [pdf, other

    cs.IR

    MM-Locate-News: Multimodal Focus Location Estimation in News

    Authors: Golsa Tahmasebzadeh, Eric Müller-Budack, Sherzod Hakimov, Ralph Ewerth

    Abstract: The consumption of news has changed significantly as the Web has become the most influential medium for information. To analyze and contextualize the large amount of news published every day, the geographic focus of an article is an important aspect in order to enable content-based news retrieval. There are methods and datasets for geolocation estimation from text or photos, but they are typically… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  21. arXiv:2205.01989  [pdf, other

    cs.CL cs.AI cs.CV cs.MM cs.SI

    MM-Claims: A Dataset for Multimodal Claim Detection in Social Media

    Authors: Gullal S. Cheema, Sherzod Hakimov, Abdul Sittar, Eric Müller-Budack, Christian Otto, Ralph Ewerth

    Abstract: In recent years, the problem of misinformation on the web has become widespread across languages, countries, and various social media platforms. Although there has been much work on automated fake news detection, the role of images and their variety are not well explored. In this paper, we investigate the roles of image and text at an earlier stage of the fake news detection pipeline, called claim… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted to Findings of NAACL 2022

  22. arXiv:2204.06299  [pdf, other

    cs.CL cs.AI cs.CV

    TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes

    Authors: Sherzod Hakimov, Gullal S. Cheema, Ralph Ewerth

    Abstract: The detection of offensive, hateful content on social media is a challenging problem that affects many online users on a daily basis. Hateful content is often used to target a group of people based on ethnicity, gender, religion and other factors. The hate or contempt toward women has been increasing on social platforms. Misogynous content detection is especially challenging when textual and visua… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: Accepted for publication at SemEval-2022 Workshop, Task 5: MAMI - Multimedia Automatic Misogyny Identification co-located with NAACL 2022

  23. arXiv:2112.04803  [pdf, other

    cs.CL cs.LG

    Combining Textual Features for the Detection of Hateful and Offensive Language

    Authors: Sherzod Hakimov, Ralph Ewerth

    Abstract: The detection of offensive, hateful and profane language has become a critical challenge since many users in social networks are exposed to cyberbullying activities on a daily basis. In this paper, we present an analysis of combining different textual features for the detection of hateful or offensive posts on Twitter. We provide a detailed experimental evaluation to understand the impact of each… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: HASOC 2021, Forum for Information Retrieval Evaluation, 2021

  24. EduCOR: An Educational and Career-Oriented Recommendation Ontology

    Authors: Eleni Ilkou, Hasan Abu-Rasheed, Mohammadreza Tavakoli, Sherzod Hakimov, Gábor Kismihók, Sören Auer, Wolfgang Nejdl

    Abstract: With the increased dependence on online learning platforms and educational resource repositories, a unified representation of digital learning resources becomes essential to support a dynamic and multi-source learning experience. We introduce the EduCOR ontology, an educational, career-oriented ontology that provides a foundation for representing online learning resources for personalised learning… ▽ More

    Submitted 13 July, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Accepted in the The 20th International Semantic Web Conference (ISWC2021)

    ACM Class: E.2; I.2.4

  25. arXiv:2106.08829  [pdf, other

    cs.SI cs.CL cs.CV

    A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods

    Authors: Gullal S. Cheema, Sherzod Hakimov, Eric Müller-Budack, Ralph Ewerth

    Abstract: Opinion and sentiment analysis is a vital task to characterize subjective information in social media posts. In this paper, we present a comprehensive experimental evaluation and comparison with six state-of-the-art methods, from which we have re-implemented one of them. In addition, we investigate different textual and visual feature embeddings that cover different aspects of the content, as well… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted in Workshop on Multi-ModalPre-Training for Multimedia Understanding (MMPT 2021), co-located with ICMR 2021

  26. arXiv:2105.12532  [pdf, other

    cs.CV cs.AI

    Unsupervised Video Summarization via Multi-source Features

    Authors: Hussain Kanafani, Junaid Ahmed Ghauri, Sherzod Hakimov, Ralph Ewerth

    Abstract: Video summarization aims at generating a compact yet representative visual summary that conveys the essence of the original video. The advantage of unsupervised approaches is that they do not require human annotations to learn the summarization capability and generalize to a wider range of domains. Previous work relies on the same type of deep features, typically based on a model pre-trained on Im… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted for publication at the ACM International Conference on Multimedia Retrieval (ICMR) 2021

  27. arXiv:2104.14994  [pdf, other

    cs.IR cs.MM

    GeoWINE: Geolocation based Wiki, Image,News and Event Retrieval

    Authors: Golsa Tahmasebzadeh, Endri Kacupaj, Eric Müller-Budack, Sherzod Hakimov, Jens Lehmann, Ralph Ewerth

    Abstract: In the context of social media, geolocation inference on news or events has become a very important task. In this paper, we present the GeoWINE (Geolocation-based Wiki-Image-News-Event retrieval) demonstrator, an effective modular system for multimodal retrieval which expects only a single image as input. The GeoWINE system consists of five modules in order to retrieve related information from var… ▽ More

    Submitted 4 May, 2021; v1 submitted 30 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in: International ACM SIGIR Conference on Research and Development in Information Retrieval 2021

  28. arXiv:2104.11530  [pdf, other

    cs.CV cs.AI cs.IR cs.LG cs.MM

    Supervised Video Summarization via Multiple Feature Sets with Parallel Attention

    Authors: Junaid Ahmed Ghauri, Sherzod Hakimov, Ralph Ewerth

    Abstract: The assignment of importance scores to particular frames or (short) segments in a video is crucial for summarization, but also a difficult task. Previous work utilizes only one source of visual features. In this paper, we suggest a novel model architecture that combines three feature sets for visual content and motion to predict importance scores. The proposed architecture utilizes an attention me… ▽ More

    Submitted 13 May, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: Accepted in IEEE International Conference on Multimedia and Expo (ICME) 2021 (They have copyright to publish camera ready version of this work)

  29. arXiv:2103.09602  [pdf, other

    cs.SI cs.CL cs.CV

    On the Role of Images for Analyzing Claims in Social Media

    Authors: Gullal S. Cheema, Sherzod Hakimov, Eric Müller-Budack, Ralph Ewerth

    Abstract: Fake news is a severe problem in social media. In this paper, we present an empirical study on visual, textual, and multimodal models for the tasks of claim, claim check-worthiness, and conspiracy detection, all of which are related to fake news detection. Recent work suggests that images are more influential than text and often appear alongside fake text. To this end, several multimodal models ha… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

    Comments: CLEOPATRA-2021 Workshop co-located with The Web Conf 2021

  30. arXiv:2101.03529  [pdf, other

    cs.SI cs.CL

    TIB's Visual Analytics Group at MediaEval '20: Detecting Fake News on Corona Virus and 5G Conspiracy

    Authors: Gullal S. Cheema, Sherzod Hakimov, Ralph Ewerth

    Abstract: Fake news on social media has become a hot topic of research as it negatively impacts the discourse of real news in the public. Specifically, the ongoing COVID-19 pandemic has seen a rise of inaccurate and misleading information due to the surrounding controversies and unknown details at the beginning of the pandemic. The FakeNews task at MediaEval 2020 tackles this problem by creating a challenge… ▽ More

    Submitted 10 January, 2021; originally announced January 2021.

    Comments: MediaEval 2020 Fake News Task

  31. arXiv:2011.04714  [pdf, other

    cs.CV

    Ontology-driven Event Type Classification in Images

    Authors: Eric Müller-Budack, Matthias Springstein, Sherzod Hakimov, Kevin Mrutzek, Ralph Ewerth

    Abstract: Event classification can add valuable information for semantic search and the increasingly important topic of fact validation in news. So far, only few approaches address image classification for newsworthy event types such as natural disasters, sports events, or elections. Previous work distinguishes only between a limited number of event types and relies on rather small datasets for training. In… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted for publication in: IEEE Winter Conference on Applications of Computer Vision (WACV) 2021

  32. arXiv:2010.13626  [pdf, other

    cs.CV cs.LG

    Classification of Important Segments in Educational Videos using Multimodal Features

    Authors: Junaid Ahmed Ghauri, Sherzod Hakimov, Ralph Ewerth

    Abstract: Videos are a commonly-used type of content in learning during Web search. Many e-learning platforms provide quality content, but sometimes educational videos are long and cover many topics. Humans are good in extracting important sections from videos, but it remains a significant challenge for computers. In this paper, we address the problem of assigning importance scores to video segments, that i… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: Proceedings of the CIKM 2020 Workshops, October 19 to 20, Galway, Ireland

  33. arXiv:2007.10534  [pdf, other

    cs.CL cs.SI

    Check_square at CheckThat! 2020: Claim Detection in Social Media via Fusion of Transformer and Syntactic Features

    Authors: Gullal S. Cheema, Sherzod Hakimov, Ralph Ewerth

    Abstract: In this digital age of news consumption, a news reader has the ability to react, express and share opinions with others in a highly interactive and fast manner. As a consequence, fake news has made its way into our daily life because of very limited capacity to verify news on the Internet by large companies as well as individuals. In this paper, we focus on solving two problems which are part of t… ▽ More

    Submitted 20 September, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: CLEF2020-CheckThat!

  34. arXiv:2007.06390  [pdf, other

    cs.CL cs.IR cs.LG

    A Feature Analysis for Multimodal News Retrieval

    Authors: Golsa Tahmasebzadeh, Sherzod Hakimov, Eric Müller-Budack, Ralph Ewerth

    Abstract: Content-based information retrieval is based on the information contained in documents rather than using metadata such as keywords. Most information retrieval methods are either based on text or image. In this paper, we investigate the usefulness of multimodal features for cross-lingual news search in various domains: politics, health, environment, sport, and finance. To this end, we consider five… ▽ More

    Submitted 1 October, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: CLEOPATRA Workshop co-located with ESWC 2020

    Journal ref: CLEOPATRA Workshop co-located with ESWC 2020

  35. arXiv:2005.10595  [pdf, other

    cs.CY

    A Recommender System For Open Educational Videos Based On Skill Requirements

    Authors: Mohammadreza Tavakoli, Sherzod Hakimov, Ralph Ewerth, Gábor Kismihók

    Abstract: In this paper, we suggest a novel method to help learners find relevant open educational videos to master skills demanded on the labour market. We have built a prototype, which 1) applies text classification and text mining methods on job vacancy announcements to match jobs and their required skills; 2) predicts the quality of videos; and 3) creates an open educational video recommender system to… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: This paper has been accepted to be published in the proceedings of International Conference on Advanced Learning Technologies (ICALT) 2020 by IEEE Computer Society

  36. arXiv:1812.02536  [pdf, other

    cs.CL cs.AI

    Evaluating Architectural Choices for Deep Learning Approaches for Question Answering over Knowledge Bases

    Authors: Sherzod Hakimov, Soufian Jebbara, Philipp Cimiano

    Abstract: The task of answering natural language questions over knowledge bases has received wide attention in recent years. Various deep learning architectures have been proposed for this task. However, architectural design choices are typically not systematically compared nor evaluated under the same conditions. In this paper, we contribute to a better understanding of the impact of architectural design c… ▽ More

    Submitted 13 December, 2018; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: the longer version than the original publication at ICSC 2019

  37. arXiv:1802.09296  [pdf, ps, other

    cs.AI cs.CL

    AMUSE: Multilingual Semantic Parsing for Question Answering over Linked Data

    Authors: Sherzod Hakimov, Soufian Jebbara, Philipp Cimiano

    Abstract: The task of answering natural language questions over RDF data has received wide interest in recent years, in particular in the context of the series of QALD benchmarks. The task consists of mapping a natural language question to an executable form, e.g. SPARQL, so that answers from a given KB can be extracted. So far, most systems proposed are i) monolingual and ii) rely on a set of hard-coded ru… ▽ More

    Submitted 26 February, 2018; originally announced February 2018.

    Comments: International Semantic Web Conference, 2017