
Showing 1–17 of 17 results for author: Siddique, Z

Searching in archive cs.
  1. arXiv:2505.06262  [pdf, ps, other]

    cs.LG cs.AI

    Dialz: A Python Toolkit for Steering Vectors

    Authors: Zara Siddique, Liam D. Turner, Luis Espinosa-Anke

    Abstract: We introduce Dialz, a framework for advancing research on steering vectors for open-source LLMs, implemented in Python. Steering vectors allow users to modify activations at inference time to amplify or weaken a 'concept', e.g. honesty or positivity, providing a more powerful alternative to prompting or fine-tuning. Dialz supports a diverse set of tasks, including creating contrastive pair dataset…

    Submitted 3 June, 2025; v1 submitted 4 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL System Demo 2025

  2. arXiv:2503.05371  [pdf, other]

    cs.LG cs.AI cs.CL

    Shifting Perspectives: Steering Vector Ensembles for Robust Bias Mitigation in LLMs

    Authors: Zara Siddique, Irtaza Khalid, Liam D. Turner, Luis Espinosa-Anke

    Abstract: We present a novel approach to bias mitigation in large language models (LLMs) by applying steering vectors to modify model activations in forward passes. We employ Bayesian optimization to systematically identify effective contrastive pair datasets across nine bias axes. When optimized on the BBQ dataset, our individually tuned steering vectors achieve average improvements of 12.2%, 4.7%, and 3.2…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: Submitted to ACL 2025

  3. arXiv:2412.15375  [pdf, other]

    cs.CL

    Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation

    Authors: Joanne Boisson, Zara Siddique, Hsuvas Borkakoty, Dimosthenis Antypas, Luis Espinosa Anke, Jose Camacho-Collados

    Abstract: Extracting metaphors and analogies from free text requires high-level reasoning abilities such as abstraction and language understanding. Our study focuses on the extraction of the concepts that form metaphoric analogies in literary texts. To this end, we construct a novel dataset in this domain with the help of domain experts. We compare the out-of-the-box ability of recent large language models…

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: Accepted to COLING 2025, long paper

  4. arXiv:2409.00877  [pdf, other]

    cs.CV

    Digital Twins in Additive Manufacturing: A Systematic Review

    Authors: Md Manjurul Ahsan, Yingtao Liu, Shivakumar Raman, Zahed Siddique

    Abstract: Digital Twins (DTs) are becoming popular in Additive Manufacturing (AM) due to their ability to create virtual replicas of physical components of AM machines, which helps in real-time production monitoring. Advanced techniques such as Machine Learning (ML), Augmented Reality (AR), and simulation-based models play key roles in developing intelligent and adaptable DTs in manufacturing processes. How…

    Submitted 1 November, 2024; v1 submitted 1 September, 2024; originally announced September 2024.

  5. arXiv:2408.10207  [pdf, other]

    cs.CV

    A Comprehensive Survey on Diffusion Models and Their Applications

    Authors: Md Manjurul Ahsan, Shivakumar Raman, Yingtao Liu, Zahed Siddique

    Abstract: Diffusion Models are probabilistic models that create realistic samples by simulating the diffusion process, gradually adding and removing noise from data. These models have gained popularity in domains such as image processing, speech synthesis, and natural language processing due to their ability to produce high-quality samples. As Diffusion Models are being adopted in various domains, existing…

    Submitted 1 July, 2024; originally announced August 2024.

  6. arXiv:2407.06917  [pdf, other]

    cs.CL cs.CY

    Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models

    Authors: Zara Siddique, Liam D. Turner, Luis Espinosa-Anke

    Abstract: Large language models (LLMs) have been shown to propagate and amplify harmful stereotypes, particularly those that disproportionately affect marginalised communities. To understand the effect of these stereotypes more comprehensively, we introduce GlobalBias, a dataset of 876k sentences incorporating 40 distinct gender-by-ethnicity groups alongside descriptors typically used in bias literature, wh…

    Submitted 9 October, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted to EMNLP Main 2024

  7. arXiv:2310.08645  [pdf, other]

    cs.CV cs.LG

    Defect Analysis of 3D Printed Cylinder Object Using Transfer Learning Approaches

    Authors: Md Manjurul Ahsan, Shivakumar Raman, Zahed Siddique

    Abstract: Additive manufacturing (AM) is gaining attention across various industries like healthcare, aerospace, and automotive. However, identifying defects early in the AM process can reduce production costs and improve productivity - a key challenge. This study explored the effectiveness of machine learning (ML) approaches, specifically transfer learning (TL) models, for defect detection in 3D-printed cy…

    Submitted 12 October, 2023; originally announced October 2023.

  8. arXiv:2305.09777  [pdf, other]

    cs.LG

    BSGAN: A Novel Oversampling Technique for Imbalanced Pattern Recognitions

    Authors: Md Manjurul Ahsan, Shivakumar Raman, Zahed Siddique

    Abstract: Class imbalanced problems (CIP) are one of the potential challenges in developing unbiased Machine Learning (ML) models for predictions. CIP occurs when data samples are not equally distributed between the two or multiple classes. Borderline-Synthetic Minority Oversampling Techniques (SMOTE) is one of the approaches that has been used to balance the imbalance data by oversampling the minor (limite…

    Submitted 16 May, 2023; originally announced May 2023.

  9. arXiv:2304.10582  [pdf, other]

    eess.IV cs.CV

    Invariant Scattering Transform for Medical Imaging

    Authors: Md Manjurul Ahsan, Shivakumar Raman, Zahed Siddique

    Abstract: Over the years, the Invariant Scattering Transform (IST) technique has become popular for medical image analysis, including using wavelet transform computation using Convolutional Neural Networks (CNN) to capture patterns' scale and orientation in the input signal. IST aims to be invariant to transformations that are common in medical images, such as translation, rotation, scaling, and deformation…

    Submitted 31 May, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted for Springer book chapter for a book "Data-driven approaches to Medical Imaging"

  10. arXiv:2210.12870  [pdf, other]

    cs.LG

    Imbalanced Class Data Performance Evaluation and Improvement using Novel Generative Adversarial Network-based Approach: SSG and GBO

    Authors: Md Manjurul Ahsan, Md Shahin Ali, Zahed Siddique

    Abstract: Class imbalance in a dataset is one of the major challenges that can significantly impact the performance of machine learning models resulting in biased predictions. Numerous techniques have been proposed to address class imbalanced problems, including, but not limited to, Oversampling, Undersampling, and cost-sensitive approaches. Due to its ability to generate synthetic data, oversampling techni…

    Submitted 23 October, 2022; originally announced October 2022.

  11. arXiv:2202.03147  [pdf, other]

    cs.RO

    The development of a portable elbow exoskeleton with a Twisted Strings Actuator to assist patients with upper limb inhabitation

    Authors: Rupal Roy, MM Rashid, Md Manjurul Ahsan, Zahed Siddique

    Abstract: Over the years, the number of exoskeleton devices utilized for upper-limb rehabilitation has increased dramatically, each with its own set of pros and cons. Most exoskeletons are not portable, limiting their utility to daily use for house patients. Additionally, the huge size of some grounded exoskeletons consumes space while maintaining a sophisticated structure and require more expensive materia…

    Submitted 25 January, 2022; originally announced February 2022.

  12. arXiv:2201.10685  [pdf, other]

    eess.SY cs.AR

    Design and Development of an Autonomous Surface Vehicle for Water Quality Monitoring

    Authors: MM Rashid, Rupal Roy, Md Manjurul Ahsan, Zahed Siddique

    Abstract: Manually monitoring water quality is very exhausting and requires several hours of sampling and laboratory testing for a particular body of water. This article presents a solution to test water properties like electrical conductivity and pH with a remote-controlled floating vehicle that minimizes time intervals. An autonomous surface vehicle (ASV) has been designed mathematically and operated via…

    Submitted 25 January, 2022; originally announced January 2022.

  13. arXiv:2201.06999  [pdf, other]

    cs.CY

    Industry 4.0 in Health care: A systematic review

    Authors: Md Manjurul Ahsan, Zahed Siddique

    Abstract: Industry 4.0 in health care has evolved drastically over the past century. In fact, it is evolving every day, with new tools and strategies being developed by physicians and researchers alike. Health care and technology have been intertwined together with the advancement of cloud computing and big data. This study aims to analyze the impact of industry 4.0 in health care systems. To do so, a syste…

    Submitted 13 January, 2022; originally announced January 2022.

  14. arXiv:2201.02755  [pdf, other]

    cs.LG cs.DL

    Machine Learning-Based Disease Diagnosis: A Bibliometric Analysis

    Authors: Md Manjurul Ahsan, Zahed Siddique

    Abstract: Machine Learning (ML) has garnered considerable attention from researchers and practitioners as a new and adaptable tool for disease diagnosis. With the advancement of ML and the proliferation of papers and research in this field, a complete examination of Machine Learning-Based Disease Diagnosis (MLBDD) is required. From a bibliometrics standpoint, this article comprehensively studies MLBDD paper…

    Submitted 7 January, 2022; originally announced January 2022.

  15. arXiv:2112.15538  [pdf, other]

    cs.LG

    Machine learning based disease diagnosis: A comprehensive review

    Authors: Md Manjurul Ahsan, Zahed Siddique

    Abstract: Globally, there is a substantial unmet need to diagnose various diseases effectively. The complexity of the different disease mechanisms and underlying symptoms of the patient population presents massive challenges to developing the early diagnosis tool and effective treatment. Machine Learning (ML), an area of Artificial Intelligence (AI), enables researchers, physicians, and patients to solve so…

    Submitted 31 December, 2021; originally announced December 2021.

  16. arXiv:2112.10298  [pdf, other]

    cs.CV

    Driver Drowsiness Detection Using Ensemble Convolutional Neural Networks on YawDD

    Authors: Rais Mohammad Salman, Mahbubur Rashid, Rupal Roy, Md Manjurul Ahsan, Zahed Siddique

    Abstract: Driver drowsiness detection using videos/images is one of the most essential areas in today's time for driver safety. The development of deep learning techniques, notably Convolutional Neural Networks (CNN), applied in computer vision applications such as drowsiness detection, has shown promising results due to the tremendous increase in technology in the recent few decades. Eyes that are closed o…

    Submitted 19 December, 2021; originally announced December 2021.

  17. arXiv:2112.06459  [pdf, other]

    cs.LG

    Machine Learning-Based Heart Disease Diagnosis: A Systematic Literature Review

    Authors: Md Manjurul Ahsan, Zahed Siddique

    Abstract: Heart disease is one of the significant challenges in today's world and one of the leading causes of many deaths worldwide. Recent advancement of machine learning (ML) application demonstrates that using electrocardiogram (ECG) and patient data, detecting heart disease during the early stage is feasible. However, both ECG and patient data are often imbalanced, which ultimately raises a challenge f…

    Submitted 13 December, 2021; originally announced December 2021.