Showing 1–13 of 13 results for author: Rizvi, M

Searching in archive cs.
  1. arXiv:2506.15498  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling

    Authors: Md Imbesat Hassan Rizvi, Xiaodan Zhu, Iryna Gurevych

    Abstract: Process or step-wise supervision has played a crucial role in advancing complex multi-step reasoning capabilities of Large Language Models (LLMs). However, efficient, high-quality automated process annotation remains a significant challenge. To address this, we introduce Single-Pass Annotation with Reference-Guided Evaluation (SPARE), a novel structured framework that enables single-pass, per-step…

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 8 pages main content, 4 figures, 4 tables

  2. arXiv:2505.03593  [pdf, other]

    cs.CY

    A Unifying Bias-aware Multidisciplinary Framework for Investigating Socio-Technical Issues

    Authors: Sacha Hasan, Mehdi Rizvi, Yingfang Yuan, Kefan Chen, Lynne Baillie, Wei Pang

    Abstract: This paper aims to bring together the disciplines of social science (SS) and computer science (CS) in the design and implementation of a novel multidisciplinary framework for systematic, transparent, ethically-informed, and bias-aware investigation of socio-technical issues. For this, various analysis approaches from social science and machine learning (ML) were applied in a structured sequence to…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: First two authors with equal contribution

  3. arXiv:2503.23415  [pdf, other]

    cs.CL cs.AI

    An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering

    Authors: Alexander Murphy, Mohd Sanad Zaki Rizvi, Aden Haussmann, Ping Nie, Guifu Liu, Aryo Pradipta Gema, Pasquale Minervini

    Abstract: Large Language Models (LLMs) frequently produce factually inaccurate outputs - a phenomenon known as hallucination - which limits their accuracy in knowledge-intensive NLP tasks. Retrieval-augmented generation and agentic frameworks such as Reasoning and Acting (ReAct) can address this issue by giving the model access to external knowledge. However, LLMs often fail to remain faithful to retrieved…

    Submitted 30 March, 2025; originally announced March 2025.

  4. arXiv:2410.09399  [pdf, other]

    cs.CL cs.LG

    Text Classification using Graph Convolutional Networks: A Comprehensive Survey

    Authors: Syed Mustafa Haider Rizvi, Ramsha Imran, Arif Mahmood

    Abstract: Text classification is a quintessential and practical problem in natural language processing with applications in diverse domains such as sentiment analysis, fake news detection, medical diagnosis, and document classification. A sizable body of recent works exists where researchers have studied and tackled text classification from different angles with varying degrees of success. Graph convolution…

    Submitted 12 October, 2024; originally announced October 2024.

  5. arXiv:2409.16317  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    A Literature Review of Keyword Spotting Technologies for Urdu

    Authors: Syed Muhammad Aqdas Rizvi

    Abstract: This literature review surveys the advancements of keyword spotting (KWS) technologies, specifically focusing on Urdu, Pakistan's low-resource language (LRL), which has complex phonetics. Despite the global strides in speech technology, Urdu presents unique challenges requiring more tailored solutions. The review traces the evolution from foundational Gaussian Mixture Models to sophisticated neura…

    Submitted 16 September, 2024; originally announced September 2024.

  6. arXiv:2407.03133  [pdf, ps, other]

    cs.CY cs.AI cs.LG stat.ML

    Quantifying the Cross-sectoral Intersecting Discrepancies within Multiple Groups Using Latent Class Analysis Towards Fairness

    Authors: Yingfang Yuan, Kefan Chen, Mehdi Rizvi, Lynne Baillie, Wei Pang

    Abstract: The growing interest in fair AI development is evident. The "Leave No One Behind" initiative urges us to address multiple and intersecting forms of inequality in accessing services, resources, and opportunities, emphasising the significance of fairness in AI. This is particularly relevant as an increasing number of AI tools are applied to decision-making processes, such as resource allocation an…

    Submitted 3 July, 2025; v1 submitted 24 May, 2024; originally announced July 2024.

  7. arXiv:2406.04566  [pdf, other]

    cs.CL cs.AI cs.LG

    SpaRC and SpaRP: Spatial Reasoning Characterization and Path Generation for Understanding Spatial Reasoning Capability of Large Language Models

    Authors: Md Imbesat Hassan Rizvi, Xiaodan Zhu, Iryna Gurevych

    Abstract: Spatial reasoning is a crucial component of both biological and artificial intelligence. In this work, we present a comprehensive study of the capability of current state-of-the-art large language models (LLMs) on spatial reasoning. To support our study, we created and contribute a novel Spatial Reasoning Characterization (SpaRC) framework and Spatial Reasoning Paths (SpaRP) datasets, to enable an…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 (Main)

  8. arXiv:2403.09728  [pdf, other]

    cs.CL cs.AI cs.CC

    Simulating Weighted Automata over Sequences and Trees with Transformers

    Authors: Michael Rizvi, Maude Lizaire, Clara Lacroce, Guillaume Rabusseau

    Abstract: Transformers are ubiquitous models in the natural language processing (NLP) community and have shown impressive empirical successes in the past few years. However, little is understood about how they reason and the limits of their computational capabilities. These models do not process data sequentially, and yet outperform sequential neural models such as RNNs. Recent work has shown that these mod…

    Submitted 12 March, 2024; originally announced March 2024.

  9. arXiv:2209.07491  [pdf, other]

    cs.CR cs.NI

    Defending Root DNS Servers Against DDoS Using Layered Defenses

    Authors: A S M Rizvi, Jelena Mirkovic, John Heidemann, Wesley Hardaker, Robert Story

    Abstract: Distributed Denial-of-Service (DDoS) attacks exhaust resources, leaving a server unavailable to legitimate clients. The Domain Name System (DNS) is a frequent target of DDoS attacks. Since DNS is a critical infrastructure service, protecting it from DoS is imperative. Many prior approaches have focused on specific filters or anti-spoofing techniques to protect generic services. DNS root nameserver…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 9 pages, 3 figures

  10. arXiv:2206.08137  [pdf]

    eess.IV cs.LG q-bio.QM

    An AI tool for automated analysis of large-scale unstructured clinical cine CMR databases

    Authors: Jorge Mariscal-Harana, Clint Asher, Vittoria Vergani, Maleeha Rizvi, Louise Keehn, Raymond J. Kim, Robert M. Judd, Steffen E. Petersen, Reza Razavi, Andrew King, Bram Ruijsink, Esther Puyol-Antón

    Abstract: Artificial intelligence (AI) techniques have been proposed for automating analysis of short axis (SAX) cine cardiac magnetic resonance (CMR), but no CMR analysis tool exists to automatically analyse large (unstructured) clinical CMR datasets. We develop and validate a robust AI tool for start-to-end automatic quantification of cardiac function from SAX cine CMR in large clinical databases. Our pip…

    Submitted 5 July, 2023; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: Accepted at EHJ Digital Health; Bram Ruijsink and Esther Puyol-Antón are shared last authors

  11. arXiv:2111.01225  [pdf]

    cs.CL cs.AI cs.LG

    Identifying causal relations in tweets using deep learning: Use case on diabetes-related tweets from 2017-2021

    Authors: Adrian Ahne, Vivek Khetan, Xavier Tannier, Md Imbessat Hassan Rizvi, Thomas Czernichow, Francisco Orchard, Charline Bour, Andrew Fano, Guy Fagherazzi

    Abstract: Objective: Leveraging machine learning methods, we aim to extract both explicit and implicit cause-effect associations in patient-reported, diabetes-related tweets and provide a tool to better understand opinion, feelings and observations shared within the diabetes online community from a causality perspective. Materials and Methods: More than 30 million diabetes-related tweets in English were col…

    Submitted 24 February, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: 6 Figures, 4 Tables

  12. arXiv:2110.07090  [pdf, other]

    cs.CL

    MIMICause: Representation and automatic extraction of causal relation types from clinical notes

    Authors: Vivek Khetan, Md Imbesat Hassan Rizvi, Jessica Huber, Paige Bartusiak, Bogdan Sacaleanu, Andrew Fano

    Abstract: Understanding causal narratives communicated in clinical notes can help make strides towards personalized healthcare. Extracted causal information from clinical notes can be combined with structured EHR data such as patients' demographics, diagnoses, and medications. This will enhance healthcare providers' ability to identify aspects of a patient's story communicated in the clinical notes and help…

    Submitted 13 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted at the Findings of ACL 2022

    ACM Class: I.2.7; I.5.4

  13. arXiv:2006.14058  [pdf, other]

    cs.NI

    Anycast Agility: Network Playbooks to Fight DDoS

    Authors: A S M Rizvi, Leandro Bertholdo, Joao Ceron, John Heidemann

    Abstract: IP anycast is used for services such as DNS and Content Delivery Networks (CDN) to provide the capacity to handle Distributed Denial-of-Service (DDoS) attacks. During a DDoS attack service operators redistribute traffic between anycast sites to take advantage of sites with unused or greater capacity. Depending on site traffic and attack size, operators may instead concentrate attackers in a few si…

    Submitted 28 February, 2022; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 21 pages, 22 figures