Skip to main content

Showing 1–17 of 17 results for author: Jones, G J F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.18024  [pdf, other

    cs.IR

    Report on the Workshop on Simulations for Information Access (Sim4IA 2024) at SIGIR 2024

    Authors: Timo Breuer, Christin Katharina Kreutz, Norbert Fuhr, Krisztian Balog, Philipp Schaer, Nolwenn Bernard, Ingo Frommholz, Marcel Gohsen, Kaixin Ji, Gareth J. F. Jones, Jüri Keller, Jiqun Liu, Martin Mladenov, Gabriella Pasi, Johanne Trippas, Xi Wang, Saber Zerhoudi, ChengXiang Zhai

    Abstract: This paper is a report of the Workshop on Simulations for Information Access (Sim4IA) workshop at SIGIR 2024. The workshop had two keynotes, a panel discussion, nine lightning talks, and two breakout sessions. Key takeaways were user simulation's importance in academia and industry, the possible bridging of online and offline evaluation, and the issues of organizing a companion shared task around… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Preprint of a SIGIR Forum submission for Vol. 58 No. 2 - December 2024

  2. Examining the Potential for Conversational Exploratory Search using a Smart Speaker Digital Assistant

    Authors: Abhishek Kaushik, Gareth J. F. Jones

    Abstract: Online Digital Assistants, such as Amazon Alexa, Google Assistant, Apple Siri are very popular and provide a range or services to their users, a key function is their ability to satisfy user information needs from the sources available to them. Users may often regard these applications as providing search services similar to Google type search engines. However, while it is clear that they are in g… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

    Journal ref: Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - HUCAPP, ISBN 978-989-758-634-7; ISSN 2184-4321, SciTePress, pages 305-317, 2023

  3. Comparing Conventional and Conversational Search Interaction using Implicit Evaluation Methods

    Authors: Abhishek Kaushik, Gareth J. F. Jones

    Abstract: Conversational search applications offer the prospect of improved user experience in information seeking via agent support. However, it is not clear how searchers will respond to this mode of engagement, in comparison to a conventional user-driven search interface, such as those found in a standard web search engine. We describe a laboratory-based study directly comparing user behaviour for a conv… ▽ More

    Submitted 18 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Journal ref: Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - (Volume 2)- February 19-21, 2023, in Lisbon, Portugal

  4. arXiv:2301.06056  [pdf, other

    cs.IR

    Improving Noise Robustness for Spoken Content Retrieval using Semi-supervised ASR and N-best Transcripts for BERT-based Ranking Models

    Authors: Yasufumi Moriya, Gareth. J. F. Jones

    Abstract: BERT-based re-ranking and dense retrieval (DR) systems have been shown to improve search effectiveness for spoken content retrieval (SCR). However, both methods can still show a reduction in effectiveness when using ASR transcripts in comparison to accurate manual transcripts. We find that a known-item search task on the How2 dataset of spoken instruction videos shows a reduction in mean reciproca… ▽ More

    Submitted 15 January, 2023; originally announced January 2023.

    Comments: accepted by SLT 2022

  5. arXiv:2203.05899  [pdf, other

    cs.CL

    Achieving Reliable Human Assessment of Open-Domain Dialogue Systems

    Authors: Tianbo Ji, Yvette Graham, Gareth J. F. Jones, Chenyang Lyu, Qun Liu

    Abstract: Evaluation of open-domain dialogue systems is highly challenging and development of better techniques is highlighted time and again as desperately needed. Despite substantial efforts to carry out reliable live evaluation of systems in recent competitions, annotations have been abandoned and reported as too unreliable to yield sensible results. This is a serious problem since automatic metrics are… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: to appear at ACL 2022 main conference

  6. arXiv:2105.03311  [pdf, other

    cs.CL

    Translation Quality Assessment: A Brief Survey on Manual and Automatic Methods

    Authors: Lifeng Han, Gareth J. F. Jones, Alan F. Smeaton

    Abstract: To facilitate effective translation modeling and translation studies, one of the crucial questions to address is how to assess translation quality. From the perspectives of accuracy, reliability, repeatability and cost, translation quality assessment (TQA) itself is a rich and challenging task. In this work, we present a high-level and concise survey of TQA methods, including both manual judgement… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: Accepted to 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021): Workshop on Modelling Translation: Translatology in the Digital Age (MoTra21). arXiv admin note: substantial text overlap with arXiv:1605.04515

  7. arXiv:2104.13473  [pdf, other

    cs.CV cs.AI

    TRECVID 2020: A comprehensive campaign for evaluating video retrieval tasks across multiple application domains

    Authors: George Awad, Asad A. Butt, Keith Curtis, Jonathan Fiscus, Afzal Godil, Yooyoung Lee, Andrew Delgado, Jesse Zhang, Eliot Godard, Baptiste Chocot, Lukas Diduch, Jeffrey Liu, Alan F. Smeaton, Yvette Graham, Gareth J. F. Jones, Wessel Kraaij, Georges Quenot

    Abstract: The TREC Video Retrieval Evaluation (TRECVID) is a TREC-style video analysis and retrieval evaluation with the goal of promoting progress in research and development of content-based exploitation and retrieval of information from digital video via open, metrics-based evaluation. Over the last twenty years this effort has yielded a better understanding of how systems can effectively accomplish such… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: TRECVID 2020 Workshop Overview Paper. arXiv admin note: substantial text overlap with arXiv:2009.09984

  8. arXiv:2104.04501  [pdf, ps, other

    cs.HC cs.IR

    Exploring Current User Web Search Behaviours in Analysis Tasks to be Supported in Conversational Search

    Authors: Abhishek Kaushik, Gareth J. F. Jones

    Abstract: Conversational search presents opportunities to support users in their search activities to improve the effectiveness and efficiency of search while reducing their cognitive load. Limitations of the potential competency of conversational agents restrict the situations for which conversational search agents can replace human intermediaries. It is thus more interesting, initially at least, to invest… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted in SIGIR 2018 Second International Workshop on Conversational Approaches to Information Retrieval (CAIR 18), July 12, 2018, Ann Arbor Michigan, USA

  9. arXiv:2104.04497  [pdf, other

    cs.CL cs.LG

    Chinese Character Decomposition for Neural MT with Multi-Word Expressions

    Authors: Lifeng Han, Gareth J. F. Jones, Alan F. Smeaton, Paolo Bolzoni

    Abstract: Chinese character decomposition has been used as a feature to enhance Machine Translation (MT) models, combining radicals into character and word level models. Recent work has investigated ideograph or stroke level embedding. However, questions remain about different decomposition levels of Chinese character representations, radical and strokes, best suited for MT. To investigate the impact of Chi… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted to publish in NoDaLiDa2021

  10. arXiv:2104.03940  [pdf, other

    cs.HC cs.IR

    A Conceptual Framework for Implicit Evaluation of Conversational Search Interfaces

    Authors: Abhishek Kaushik, Gareth J. F. Jones

    Abstract: Conversational search (CS) has recently become a significant focus of the information retrieval (IR) research community. Multiple studies have been conducted which explore the concept of conversational search. Understanding and advancing research in CS requires careful and detailed evaluation. Existing CS studies have been limited to evaluation based on simple user feedback on task completion. We… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: Accepted in MICROS (Mixed-Initiative ConveRsatiOnal Systems) Workshop at 43rd European Conference on Information Retrieval

  11. arXiv:2103.15953  [pdf, other

    cs.IR cs.CL

    TREC 2020 Podcasts Track Overview

    Authors: Rosie Jones, Ben Carterette, Ann Clifton, Maria Eskevich, Gareth J. F. Jones, Jussi Karlgren, Aasish Pappu, Sravana Reddy, Yongze Yu

    Abstract: The Podcast Track is new at the Text Retrieval Conference (TREC) in 2020. The podcast track was designed to encourage research into podcasts in the information retrieval and NLP research communities. The track consisted of two shared tasks: segment retrieval and summarization, both based on a dataset of over 100,000 podcast episodes (metadata, audio, and automatic transcripts) which was released c… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Journal ref: The Proceedings of the Twenty-Ninth Text REtrieval Conference Proceedings (TREC 2020)

  12. arXiv:2006.15679  [pdf, other

    cs.IR

    Kernel Density Estimation based Factored Relevance Model for Multi-Contextual Point-of-Interest Recommendation

    Authors: Anirban Chakraborty, Debasis Ganguly, Annalina Caputo, Gareth J. F. Jones

    Abstract: An automated contextual suggestion algorithm is likely to recommend contextually appropriate and personalized 'points-of-interest' (POIs) to a user, if it can extract information from the user's preference history (exploitation) and effectively blend it with the user's current contextual information (exploration) to predict a POI's 'appropriateness' in the current context. To balance this trade-of… ▽ More

    Submitted 25 November, 2021; v1 submitted 28 June, 2020; originally announced June 2020.

    Comments: To appear at Information Retrieval Journal

  13. arXiv:2006.03022  [pdf, other

    cs.CL cs.LG

    Response to LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts

    Authors: Hao Wu, Gareth J. F. Jones, Francois Pitie

    Abstract: Live video commenting systems are an emerging feature of online video sites. Recently the Chinese video sharing platform Bilibili, has popularised a novel captioning system where user comments are displayed as streams of moving subtitles overlaid on the video playback screen and broadcast to all viewers in real-time. LiveBot was recently introduced as a novel Automatic Live Video Commenting (ALVC)… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: 4 pages, 2 figures

    Report number: 06-04

  14. arXiv:2005.10583  [pdf, other

    cs.CL

    MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora

    Authors: Lifeng Han, Gareth J. F. Jones, Alan F. Smeaton

    Abstract: Multi-word expressions (MWEs) are a hot topic in research in natural language processing (NLP), including topics such as MWE detection, MWE decomposition, and research investigating the exploitation of MWEs in other NLP fields such as Machine Translation. However, the availability of bilingual or multi-lingual MWE corpora is very limited. The only bilingual MWE corpora that we are aware of is from… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: Accepted to LREC2020

  15. arXiv:1906.06147  [pdf, other

    cs.MM eess.IV

    Grounding Object Detections With Transcriptions

    Authors: Yasufumi Moriya, Ramon Sanabria, Florian Metze, Gareth J. F. Jones

    Abstract: A vast amount of audio-visual data is available on the Internet thanks to video streaming services, to which users upload their content. However, there are difficulties in exploiting available data for supervised statistical models due to the lack of labels. Unfortunately, generating labels for such amount of data through human annotation can be expensive, time-consuming and prone to annotation er… ▽ More

    Submitted 28 July, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

  16. arXiv:1606.07869  [pdf, other

    cs.IR

    Representing Documents and Queries as Sets of Word Embedded Vectors for Information Retrieval

    Authors: Dwaipayan Roy, Debasis Ganguly, Mandar Mitra, Gareth J. F. Jones

    Abstract: A major difficulty in applying word vector embeddings in IR is in devising an effective and efficient strategy for obtaining representations of compound units of text, such as whole documents, (in comparison to the atomic words), for the purpose of indexing and scoring documents. Instead of striving for a suitable method for obtaining a single vector representation of a large document of text, we… ▽ More

    Submitted 25 June, 2016; originally announced June 2016.

    Comments: Neu-IR '16 SIGIR Workshop on Neural Information Retrieval July 21, 2016, Pisa, Italy

  17. arXiv:1312.1913  [pdf, other

    cs.IR

    Adapting Binary Information Retrieval Evaluation Metrics for Segment-based Retrieval Tasks

    Authors: Robin Aly, Maria Eskevich, Roeland Ordelman, Gareth J. F. Jones

    Abstract: This report describes metrics for the evaluation of the effectiveness of segment-based retrieval based on existing binary information retrieval metrics. This metrics are described in the context of a task for the hyperlinking of video segments. This evaluation approach re-uses existing evaluation measures from the standard Cranfield evaluation paradigm. Our adaptation approach can in principle be… ▽ More

    Submitted 6 December, 2013; originally announced December 2013.

    Comments: Explanation of evaluation measures for the linking task of the MediaEval Workshop 2013