Skip to main content

Showing 1–28 of 28 results for author: Mackie, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.07722  [pdf, other

    cs.IR

    DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities

    Authors: Thong Nguyen, Shubham Chatterjee, Sean MacAvaney, Iain Mackie, Jeff Dalton, Andrew Yates

    Abstract: Learned Sparse Retrieval (LSR) models use vocabularies from pre-trained transformers, which often split entities into nonsensical fragments. Splitting entities can reduce retrieval accuracy and limits the model's ability to incorporate up-to-date world knowledge not included in the training data. In this work, we enhance the LSR vocabulary with Wikipedia concepts and entities, enabling the model t… ▽ More

    Submitted 15 October, 2024; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: https://github.com/thongnt99/DyVo

    Journal ref: EMNLP 2024

  2. arXiv:2409.19298   

    cs.LO cs.PL cs.SC

    Proceedings 13th International Workshop on Developments in Computational Models

    Authors: Sandra Alves, Ian Mackie

    Abstract: This volume contains the proceedings of DCM 2023, the 13th International Workshop on Developments in Computational Models held on 2 July 2023 in Rome, Italy. DCM 2023 was organised as a one-day satellite event of FSCD 2023, the 8th International Conference on Formal Structures for Computation and Deduction. The aim of this workshop is to bring together researchers who are currently developing new… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

    Journal ref: EPTCS 408, 2024

  3. arXiv:2403.00586  [pdf, other

    cs.IR

    Open Assistant Toolkit -- version 2

    Authors: Sophie Fischer, Federico Rossetto, Carlos Gemmell, Andrew Ramsay, Iain Mackie, Philip Zubel, Niklas Tecklenburg, Jeffrey Dalton

    Abstract: We present the second version of the Open Assistant Toolkit (OAT-v2), an open-source task-oriented conversational system for composing generative neural models. OAT-v2 is a scalable and flexible assistant platform supporting multiple domains and modalities of user interaction. It splits processing a user utterance into modular system components, including submodules such as action code generation,… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  4. GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants

    Authors: Sophie Fischer, Carlos Gemmell, Niklas Tecklenburg, Iain Mackie, Federico Rossetto, Jeffrey Dalton

    Abstract: We tackle the challenge of building real-world multimodal assistants for complex real-world tasks. We describe the practicalities and challenges of developing and deploying GRILLBot, a leading (first and second prize winning in 2022 and 2023) system deployed in the Alexa Prize TaskBot Challenge. Building on our Open Assistant Toolkit (OAT) framework, we propose a hybrid architecture that leverages… ▽ More

    Submitted 28 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 11 pages, KDD Preprint

  5. arXiv:2401.05939  [pdf, other

    cs.IR cs.AI

    DREQ: Document Re-Ranking Using Entity-based Query Understanding

    Authors: Shubham Chatterjee, Iain Mackie, Jeff Dalton

    Abstract: While entity-oriented neural IR models have advanced significantly, they often overlook a key nuance: the varying degrees of influence individual entities within a document have on its overall relevance. Addressing this gap, we present DREQ, an entity-oriented dense document re-ranking model. Uniquely, we emphasize the query-relevant entities within a document's representation while simultaneously… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: To be presented as a full paper at ECIR 2024 in Glasgpow, UK

  6. arXiv:2306.17082  [pdf, other

    cs.IR

    Adaptive Latent Entity Expansion for Document Retrieval

    Authors: Iain Mackie, Shubham Chatterjee, Sean MacAvaney, Jeffrey Dalton

    Abstract: Despite considerable progress in neural relevance ranking techniques, search engines still struggle to process complex queries effectively - both in terms of precision and recall. Sparse and dense Pseudo-Relevance Feedback (PRF) approaches have the potential to overcome limitations in recall, but are only effective with high precision in the top ranks. In this work, we tackle the problem of search… ▽ More

    Submitted 4 December, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

  7. arXiv:2306.09938  [pdf, other

    cs.IR

    GRM: Generative Relevance Modeling Using Relevance-Aware Sample Estimation for Document Retrieval

    Authors: Iain Mackie, Ivan Sekulic, Shubham Chatterjee, Jeffrey Dalton, Fabio Crestani

    Abstract: Recent studies show that Generative Relevance Feedback (GRF), using text generated by Large Language Models (LLMs), can enhance the effectiveness of query expansion. However, LLMs can generate irrelevant information that harms retrieval effectiveness. To address this, we propose Generative Relevance Modeling (GRM) that uses Relevance-Aware Sample Estimation (RASE) for more accurate weighting of ex… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  8. arXiv:2305.07477  [pdf, other

    cs.IR

    Generative and Pseudo-Relevant Feedback for Sparse, Dense and Learned Sparse Retrieval

    Authors: Iain Mackie, Shubham Chatterjee, Jeffrey Dalton

    Abstract: Pseudo-relevance feedback (PRF) is a classical approach to address lexical mismatch by enriching the query using first-pass retrieval. Moreover, recent work on generative-relevance feedback (GRF) shows that query expansion models using text generated from large language models can improve sparse retrieval without depending on first-pass retrieval effectiveness. This work extends GRF to dense and l… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  9. Generative Relevance Feedback with Large Language Models

    Authors: Iain Mackie, Shubham Chatterjee, Jeffrey Dalton

    Abstract: Current query expansion models use pseudo-relevance feedback to improve first-pass retrieval effectiveness; however, this fails when the initial results are not relevant. Instead of building a language model from retrieved results, we propose Generative Relevance Feedback (GRF) that builds probabilistic feedback models from long-form text generated from Large Language Models. We study the effectiv… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: SIGIR 2023 Preprint, 6 pages

    ACM Class: H.3.3

  10. arXiv:2211.06193  [pdf, other

    cs.CL

    DocuT5: Seq2seq SQL Generation with Table Documentation

    Authors: Elena Soare, Iain Mackie, Jeffrey Dalton

    Abstract: Current SQL generators based on pre-trained language models struggle to answer complex questions requiring domain context or understanding fine-grained table structure. Humans would deal with these unknowns by reasoning over the documentation of the tables. Based on this hypothesis, we propose DocuT5, which uses off-the-shelf language model architecture and injects knowledge from external `documen… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  11. arXiv:2211.04142  [pdf, other

    cs.IR cs.CL

    Query-Specific Knowledge Graphs for Complex Finance Topics

    Authors: Iain Mackie, Jeffrey Dalton

    Abstract: Across the financial domain, researchers answer complex questions by extensively "searching" for relevant information to generate long-form reports. This workshop paper discusses automating the construction of query-specific document and entity knowledge graphs (KGs) for complex research topics. We focus on the CODEC dataset, where domain experts (1) create challenging questions, (2) construct lon… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: AKBC 2022 Workshop, Knowledge Graphs in Finance and Economics

  12. arXiv:2208.14884  [pdf, other

    cs.CL cs.AI

    GRILLBot: An Assistant for Real-World Tasks with Neural Semantic Parsing and Graph-Based Representations

    Authors: Carlos Gemmell, Iain Mackie, Paul Owoicho, Federico Rossetto, Sophie Fischer, Jeffrey Dalton

    Abstract: GRILLBot is the winning system in the 2022 Alexa Prize TaskBot Challenge, moving towards the next generation of multimodal task assistants. It is a voice assistant to guide users through complex real-world tasks in the domains of cooking and home improvement. These are long-running and complex tasks that require flexible adjustment and adaptation. The demo highlights the core aspects, including a… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

  13. VILT: Video Instructions Linking for Complex Tasks

    Authors: Sophie Fischer, Carlos Gemmell, Iain Mackie, Jeffrey Dalton

    Abstract: This work addresses challenges in developing conversational assistants that support rich multimodal video interactions to accomplish real-world tasks interactively. We introduce the task of automatically linking instructional videos to task steps as "Video Instructions Linking for Complex Tasks" (VILT). Specifically, we focus on the domain of cooking and empowering users to cook meals interactivel… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: 7 pages, IMuR Preprint

    ACM Class: H.3.3

  14. arXiv:2205.04546  [pdf, other

    cs.IR

    CODEC: Complex Document and Entity Collection

    Authors: Iain Mackie, Paul Owoicho, Carlos Gemmell, Sophie Fischer, Sean MacAvaney, Jeffrey Dalton

    Abstract: CODEC is a document and entity ranking benchmark that focuses on complex research topics. We target essay-style information needs of social science researchers, i.e. "How has the UK's Open Banking Regulation benefited Challenger Banks?". CODEC includes 42 topics developed by researchers and a new focused web corpus with semantic annotations including entity links. This resource includes expert jud… ▽ More

    Submitted 17 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 10 pages, SIGIR 2022 Preprint

    ACM Class: H.3.3

  15. arXiv:2105.07975  [pdf, other

    cs.IR

    How Deep is your Learning: the DL-HARD Annotated Deep Learning Dataset

    Authors: Iain Mackie, Jeffery Dalton, Andrew Yates

    Abstract: Deep Learning Hard (DL-HARD) is a new annotated dataset designed to more effectively evaluate neural ranking models on complex topics. It builds on TREC Deep Learning (DL) topics by extensively annotating them with question intent categories, answer types, wikified entities, topic categories, and result type metadata from a commercial web search engine. Based on this data, we introduce a framework… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

  16. arXiv:1902.01510   

    cs.LO cs.PL

    Proceedings Tenth International Workshop on Computing with Terms and Graphs

    Authors: Maribel Fernández, Ian Mackie

    Abstract: This volume contains a selection of the papers presented at TERMGRAPH 2018, the tenth edition of the international workshop on computing with terms and graphs. Graphs, and graph transformation systems, are used in many areas within Computer Science: to represent data structures and algorithms, to define computation models, as a general modelling tool to study complex systems, etc. Research in thi… ▽ More

    Submitted 4 February, 2019; originally announced February 2019.

    Journal ref: EPTCS 288, 2019

  17. In-place Graph Rewriting with Interaction Nets

    Authors: Ian Mackie, Shinya Sato

    Abstract: An algorithm is in-place, or runs in-situ, when it does not need any additional memory to execute beyond a small constant amount. There are many algorithms that are efficient because of this feature, therefore it is an important aspect of an algorithm. In most programming languages, it is not obvious when an algorithm can run in-place, and moreover it is often not clear that the implementation re… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.

    Comments: In Proceedings TERMGRAPH 2016, arXiv:1609.03014

    Journal ref: EPTCS 225, 2016, pp. 15-24

  18. Compiling Process Networks to Interaction Nets

    Authors: Ian Mackie

    Abstract: Kahn process networks are a model of computation based on a collection of sequential, deterministic processes that communicate by sending messages through unbounded channels. They are well suited for modelling stream-based computations, but are in no way restricted to this application. Interaction nets are graph rewriting systems that have many interesting properties for implementation. In this pa… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.

    Comments: In Proceedings TERMGRAPH 2016, arXiv:1609.03014

    Journal ref: EPTCS 225, 2016, pp. 5-14

  19. An Implementation Model for Interaction Nets

    Authors: Abubakar Hassan, Ian Mackie, Shinya Sato

    Abstract: To study implementations and optimisations of interaction net systems we propose a calculus to allow us to reason about nets, a concrete data-structure that is in close correspondence with the calculus, and a low-level language to create and manipulate this data structure. These work together so that we can describe the compilation process for interaction nets, reason about the behaviours of the i… ▽ More

    Submitted 26 May, 2015; originally announced May 2015.

    Comments: In Proceedings TERMGRAPH 2014, arXiv:1505.06818

    Journal ref: EPTCS 183, 2015, pp. 66-80

  20. arXiv:1403.7685   

    cs.LO cs.PL

    Proceedings 9th International Workshop on Developments in Computational Models

    Authors: Mauricio Ayala-Rincón, Eduardo Bonelli, Ian Mackie

    Abstract: This volume contains a selection of the papers presented at the Ninth International Workshop on Developments in Computational Models (DCM 2013) held in Buenos Aires, Argentina on 26th August 2013, as a satellite event of CONCUR 2013. Several new models of computation have emerged in the last years, and many developments of traditional computational models have been proposed with the aim of taking… ▽ More

    Submitted 29 March, 2014; originally announced March 2014.

    Comments: EPTCS 144, 2014

  21. Bigraphical Nets

    Authors: Maribel Fernández, Ian Mackie, Matthew Walker

    Abstract: Interaction nets are a graphical model of computation, which has been used to define efficient evaluators for functional calculi, and specifically lambda calculi with patterns. However, the flat structure of interaction nets forces pattern matching and functional behaviour to be encoded at the same level, losing some potential parallelism. In this paper, we introduce bigraphical nets, or binets… ▽ More

    Submitted 26 February, 2013; originally announced February 2013.

    Comments: In Proceedings TERMGRAPH 2013, arXiv:1302.5997

    Journal ref: EPTCS 110, 2013, pp. 74-81

  22. arXiv:1211.3480   

    cs.LO cs.PL

    Proceedings 2nd International Workshop on Linearity

    Authors: Sandra Alves, Ian Mackie

    Abstract: This volume contains a selection of the papers presented at the 2nd International Workshop on Linearity (LINEARITY'2012), which took place 1 April 2012 in Tallinn, Estonia. The workshop was a one-day satellite event of ETAPS 2012, the 15th European Joint Conference on Theory and Practice of Software. The aim of this workshop was to bring together researchers who are currently developing theory a… ▽ More

    Submitted 14 November, 2012; originally announced November 2012.

    Journal ref: EPTCS 101, 2012

  23. A new graphical calculus of proofs

    Authors: Sandra Alves, Maribel Fernández, Ian Mackie

    Abstract: We offer a simple graphical representation for proofs of intuitionistic logic, which is inspired by proof nets and interaction nets (two formalisms originating in linear logic). This graphical calculus of proofs inherits good features from each, but is not constrained by them. By the Curry-Howard isomorphism, the representation applies equally to the lambda calculus, offering an alternative diagra… ▽ More

    Submitted 13 February, 2011; originally announced February 2011.

    Comments: In Proceedings TERMGRAPH 2011, arXiv:1102.2268

    Journal ref: EPTCS 48, 2011, pp. 69-84

  24. arXiv:1003.5758   

    cs.PL cs.LO

    Proceedings Tenth International Workshop on Rule-Based Programming

    Authors: Ian Mackie, Anamaria Martins Moreira

    Abstract: This volume contains the proceedings of RULE 2009: the tenth International Workshop on Rule-Based Programming. It took place in June 28th 2009, Brasilia, Brazil, as a satellite event of RDP 2009. The first Rule workshop was held in Montreal in 2000, and subsequent editions took place in Firenze, Pittsburgh, Valencia, Aachen, Nara, Seattle, Paris, and Hagenberg.

    Submitted 30 March, 2010; originally announced March 2010.

    Journal ref: EPTCS 21, 2010

  25. Proceedings First International Workshop on Linearity

    Authors: Mário Florido, Ian Mackie

    Abstract: This volume contains the proceedings of LINEARITY 2009: the first International Workshop on Linearity, which took place 12th September 2009 in Coimbra, Portugal. The workshop was a satellite event of CSL 2009, the 18th EACSL Annual Conference on Computer Science Logic.

    Submitted 29 March, 2010; originally announced March 2010.

    Journal ref: EPTCS 22, 2010

  26. arXiv:1003.5350  [pdf, ps, other

    cs.DB cs.LO cs.PL

    An Improved Algorithm for Generating Database Transactions from Relational Algebra Specifications

    Authors: Daniel J. Dougherty

    Abstract: Alloy is a lightweight modeling formalism based on relational algebra. In prior work with Fisler, Giannakopoulos, Krishnamurthi, and Yoo, we have presented a tool, Alchemy, that compiles Alloy specifications into implementations that execute against persistent databases. The foundation of Alchemy is an algorithm for rewriting relational algebra formulas into code for database transactions. In th… ▽ More

    Submitted 28 March, 2010; originally announced March 2010.

    Journal ref: EPTCS 21, 2010, pp. 77-89

  27. arXiv:1001.3368  [pdf, ps, other

    cs.LO cs.PL

    Linear Recursion

    Authors: Sandra Alves, Maribel Fernández, Mário Florido, Ian Mackie

    Abstract: We define two extensions of the typed linear lambda-calculus that yield minimal Turing-complete systems. The extensions are based on unbounded recursion in one case, and bounded recursion with minimisation in the other. We show that both approaches are compatible with linearity and typeability constraints. Both extensions of the typed linear lambda-calculus are minimal, in the sense that taking ou… ▽ More

    Submitted 25 November, 2016; v1 submitted 19 January, 2010; originally announced January 2010.

    Comments: 28 pages

    ACM Class: F.4.1; F.3.3

  28. arXiv:0910.3321  [pdf, other

    cs.PL

    Iterators, Recursors and Interaction Nets

    Authors: Ian Mackie, Jorge Sousa Pinto, Miguel Vilaca

    Abstract: We propose a method for encoding iterators (and recursion operators in general) using interaction nets (INs). There are two main applications for this: the method can be used to obtain a visual nota- tion for functional programs; and it can be used to extend the existing translations of the lambda-calculus into INs to languages with recursive types.

    Submitted 17 October, 2009; originally announced October 2009.

    Comments: ISBN: 978-972-9348-18-1