Showing 1–27 of 27 results for author: Afzal, A

Searching in archive cs.
  1. arXiv:2506.02792  [pdf, ps, other]

    cs.DC physics.app-ph physics.comp-ph

    Exploring metrics for analyzing dynamic behavior in MPI programs via a coupled-oscillator model

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein

    Abstract: We propose a novel, lightweight, and physically inspired approach to modeling the dynamics of parallel distributed-memory programs. Inspired by the Kuramoto model, we represent MPI processes as coupled oscillators with topology-aware interactions, custom coupling potentials, and stochastic noise. The resulting system of nonlinear ordinary differential equations opens a path to modeling key perform…

    Submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2505.24362  [pdf, ps, other]

    cs.CL

    Knowing Before Saying: LLM Representations Encode Information About Chain-of-Thought Success Before Completion

    Authors: Anum Afzal, Florian Matthes, Gal Chechik, Yftah Ziser

    Abstract: We investigate whether the success of a zero-shot Chain-of-Thought (CoT) process can be predicted before completion. We discover that a probing classifier, based on LLM representations, performs well even before a single token is generated, suggesting that crucial information about the reasoning process is already present in the initial steps representations. In contrast, a strong BERT-base…

    Submitted 2 June, 2025; v1 submitted 30 May, 2025; originally announced May 2025.

  3. arXiv:2504.20849  [pdf, other]

    cs.CL

    JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry

    Authors: Anum Afzal, Alexandre Mercier, Florian Matthes

    Abstract: Online platforms are increasingly interested in using Data-to-Text technologies to generate content and help their users. Unfortunately, traditional generative methods often fall into repetitive patterns, resulting in monotonous galleries of texts after only a few iterations. In this paper, we investigate LLM-based data-to-text approaches to automatically generate marketing texts that are of suffi…

    Submitted 29 April, 2025; originally announced April 2025.

  4. arXiv:2503.08251  [pdf, other]

    eess.SP cs.AI cs.LG

    MT-NAM: An Efficient and Adaptive Model for Epileptic Seizure Detection

    Authors: Arshia Afzal, Volkan Cevher, Mahsa Shoaran

    Abstract: Enhancing the accuracy and efficiency of machine learning algorithms employed in neural interface systems is crucial for advancing next-generation intelligent therapeutic devices. However, current systems often utilize basic machine learning models that do not fully exploit the natural structure of brain signals. Additionally, existing learning models used for neural signal processing often demons…

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: Submitted to IEEE-TBME

  5. arXiv:2502.16249  [pdf, other]

    cs.LG cs.AI

    Linear Attention for Efficient Bidirectional Sequence Modeling

    Authors: Arshia Afzal, Elias Abad Rocamora, Leyla Naz Candogan, Pol Puigdemont, Francesco Tonin, Yongtao Wu, Mahsa Shoaran, Volkan Cevher

    Abstract: Transformers with linear attention enable fast and parallel training. Moreover, they can be formulated as Recurrent Neural Networks (RNNs), for efficient linear-time inference. While extensively evaluated in causal sequence modeling, they have yet to be extended to the bidirectional setting. This work introduces the LION framework, establishing new theoretical foundations for linear transformers i…

    Submitted 22 February, 2025; originally announced February 2025.

  6. arXiv:2412.08792  [pdf, other]

    cs.DC cs.PF

    Analytic Roofline Modeling and Energy Analysis of LULESH Proxy Application on Multi-Core Clusters

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein

    Abstract: We present a thorough performance and energy consumption analysis of the LULESH proxy application in its OpenMP and MPI variants on two different clusters based on Intel Ice Lake (ICL) and Sapphire Rapids (SPR) CPUs. We first study the strong scaling and power consumption characteristics of the six hot spot functions in the code on the node level, with a special focus on memory bandwidth utilizati…

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: 10 pages, 11 figures, 4 tables

  7. arXiv:2411.08438  [pdf, ps, other]

    cs.AI

    Towards Optimizing a Retrieval Augmented Generation using Large Language Model on Academic Data

    Authors: Anum Afzal, Juraj Vladika, Gentrit Fazlija, Andrei Staradubets, Florian Matthes

    Abstract: Given the growing trend of many organizations integrating Retrieval Augmented Generation (RAG) into their operations, we assess RAG on domain-specific data and test state-of-the-art models across various optimization techniques. We incorporate four optimizations: Multi-Query, Child-Parent-Retriever, Ensemble Retriever, and In-Context-Learning, to enhance the functionality and performance in the ac…

    Submitted 13 November, 2024; originally announced November 2024.

  8. arXiv:2407.11591  [pdf, other]

    cs.CL

    AdaptEval: Evaluating Large Language Models on Domain Adaptation for Text Summarization

    Authors: Anum Afzal, Ribin Chalumattu, Florian Matthes, Laura Mascarell

    Abstract: Despite the advances in the abstractive summarization task using Large Language Models (LLM), there is a lack of research that assesses their abilities to easily adapt to different domains. We evaluate the domain adaptation abilities of a wide range of LLMs on the summarization task across various domains in both fine-tuning and in-context learning settings. We also present AdaptEval, the first domai…

    Submitted 11 October, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  9. arXiv:2407.05925  [pdf, other]

    cs.CL cs.AI

    Towards Optimizing and Evaluating a Retrieval Augmented QA Chatbot using LLMs with Human in the Loop

    Authors: Anum Afzal, Alexander Kowsik, Rajna Fani, Florian Matthes

    Abstract: Large Language Models have found application in various mundane and repetitive tasks including Human Resource (HR) support. We worked with the domain experts of SAP SE to develop an HR support chatbot as an efficient and effective tool for addressing employee inquiries. We inserted a human-in-the-loop in various parts of the development cycles such as dataset collection, prompt optimization, and e…

    Submitted 8 July, 2024; originally announced July 2024.

  10. arXiv:2406.16906  [pdf, other]

    eess.SP cs.AI cs.LG

    REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates

    Authors: Arshia Afzal, Grigorios Chrysos, Volkan Cevher, Mahsa Shoaran

    Abstract: EEG-based seizure detection models face challenges in terms of inference speed and memory efficiency, limiting their real-time implementation in clinical devices. This paper introduces a novel graph-based residual state update mechanism (REST) for real-time EEG signal analysis in applications such as epileptic seizure detection. By leveraging a combination of graph neural networks and recurrent st…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted paper at International Conference on Machine Learning (ICML 2024). Visit our website: https://arshiaafzal.github.io/REST/

  11. arXiv:2310.05701  [pdf, other]

    cs.DC physics.comp-ph

    Physical Oscillator Model for Supercomputing

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein

    Abstract: A parallel program together with the parallel hardware it is running on is not only a vehicle to solve numerical problems, it is also a complex system with interesting dynamical behavior: resynchronization and desynchronization of parallel processes, propagating phases of idleness, and the peculiar effects of noise and system topology are just a few examples. We propose a physical oscillator model…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 5 pages, 2 figures

  12. SPEChpc 2021 Benchmarks on Ice Lake and Sapphire Rapids Infiniband Clusters: A Performance and Energy Case Study

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein

    Abstract: In this work, fundamental performance, power, and energy characteristics of the full SPEChpc 2021 benchmark suite are assessed on two different clusters based on Intel Ice Lake and Sapphire Rapids CPUs using the MPI-only codes' variants. We use memory bandwidth, data volume, and scalability metrics in order to categorize the benchmarks and pinpoint relevant performance and scalability bottlenecks…

    Submitted 14 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 9 pages, 6 figures; corrected links to system docs

  13. Challenges in Domain-Specific Abstractive Summarization and How to Overcome them

    Authors: Anum Afzal, Juraj Vladika, Daniel Braun, Florian Matthes

    Abstract: Large Language Models work quite well with general-purpose data and many tasks in Natural Language Processing. However, they show several limitations when used for a task such as domain-specific abstractive text summarization. This paper identifies three of those limitations as research problems in the context of abstractive text summarization: 1) Quadratic complexity of transformer-based models w…

    Submitted 3 July, 2023; originally announced July 2023.

  14. Making Applications Faster by Asynchronous Execution: Slowing Down Processes or Relaxing MPI Collectives

    Authors: Ayesha Afzal, Georg Hager, Stefano Markidis, Gerhard Wellein

    Abstract: Comprehending the performance bottlenecks at the core of the intricate hardware-software interactions exhibited by highly parallel programs on HPC clusters is crucial. This paper sheds light on the issue of automatically asynchronous MPI communication in memory-bound parallel programs on multicore clusters and how it can be facilitated. For instance, slowing down MPI processes by deliberate inject…

    Submitted 24 February, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 18 pages, 14 figures, 7 tables. Corrected Fig. 4 layout

  15. arXiv:2301.04098  [pdf, other]

    cs.CL

    Investigating Conversational Search Behavior For Domain Exploration

    Authors: Phillip Schneider, Anum Afzal, Juraj Vladika, Daniel Braun, Florian Matthes

    Abstract: Conversational search has evolved as a new information retrieval paradigm, marking a shift from traditional search systems towards interactive dialogues with intelligent search agents. This change especially affects exploratory information-seeking contexts, where conversational search systems can guide the discovery of unfamiliar domains. In these scenarios, users often find it difficult to expres…

    Submitted 27 February, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: Accepted to ECIR 2023

  16. Exploring Techniques for the Analysis of Spontaneous Asynchronicity in MPI-Parallel Applications

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein, Stefano Markidis

    Abstract: This paper studies the utility of using data analytics and machine learning techniques for identifying, classifying, and characterizing the dynamics of large-scale parallel (MPI) programs. To this end, we run microbenchmarks and realistic proxy applications with the regular compute-communicate structure on two different supercomputing platforms and choose the per-process performance and MPI time p…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 12 pages, 9 figures, 1 table

  17. The Role of Idle Waves, Desynchronization, and Bottleneck Evasion in the Performance of Parallel Programs

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein

    Abstract: The performance of highly parallel applications on distributed-memory systems is influenced by many factors. Analytic performance modeling techniques aim to provide insight into performance limitations and are often the starting point of optimization efforts. However, coupling analytic models across the system hierarchy (socket, node, network) fails to encompass the intricate interplay between the…

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: 13 pages, 7 figures, 6 tables

  18. arXiv:2204.02362  [pdf, other]

    cs.AI cs.AR cs.LG eess.SP

    Challenges and Opportunities of Edge AI for Next-Generation Implantable BMIs

    Authors: MohammadAli Shaeri, Arshia Afzal, Mahsa Shoaran

    Abstract: Neuroscience and neurotechnology are currently being revolutionized by artificial intelligence (AI) and machine learning. AI is widely used to study and interpret neural signals (analytical applications), assist people with disabilities (prosthetic applications), and treat underlying neurological symptoms (therapeutic applications). In this brief, we will review the emerging opportunities of on-ch…

    Submitted 13 April, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

  19. arXiv:2104.14272  [pdf, other]

    cs.CV

    Current Status and Performance Analysis of Table Recognition in Document Images with Deep Neural Networks

    Authors: Khurram Azeem Hashmi, Marcus Liwicki, Didier Stricker, Muhammad Adnan Afzal, Muhammad Ahtsham Afzal, Muhammad Zeshan Afzal

    Abstract: The first phase of table recognition is to detect the tabular area in a document. Subsequently, the tabular structures are recognized in the second phase in order to extract information from the respective cells. Table detection and structural recognition are pivotal problems in the domain of table understanding. However, table analysis is a perplexing task due to the colossal amount of diversity…

    Submitted 8 May, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: 23 pages, 14 figures

  20. arXiv:2104.08625  [pdf, other]

    cs.RO

    GzScenic: Automatic Scene Generation for Gazebo Simulator

    Authors: Afsoon Afzal, Claire Le Goues, Christopher S. Timperley

    Abstract: Testing robotic and cyberphysical systems in simulation requires specifications of the simulated environments (i.e., scenes). The Scenic domain-specific language provides a high-level probabilistic programming language that allows users to specify scenarios for simulation. Scenic automatically generates concrete scenes that can be rendered by simulators. However, Scenic is mainly designed for auton…

    Submitted 17 April, 2021; originally announced April 2021.

  21. Analytic Modeling of Idle Waves in Parallel Programs: Communication, Cluster Topology, and Noise Impact

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein

    Abstract: Most distributed-memory bulk-synchronous parallel programs in HPC assume that compute resources are available continuously and homogeneously across the allocated set of compute nodes. However, long one-off delays on individual processes can cause global disturbances, so-called idle waves, by rippling through the system. This process is mainly governed by the communication topology of the underlyin…

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: 19 pages, 10 figures, 2 tables

  22. arXiv:2011.00243  [pdf, other]

    cs.DC cs.PF

    An analytic performance model for overlapping execution of memory-bound loop kernels on multicore CPUs

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein

    Abstract: Complex applications running on multicore processors show a rich performance phenomenology. The growing number of cores per ccNUMA domain complicates performance analysis of memory-bound code since system noise, load imbalance, or task-based programming models can lead to thread desynchronization. Hence, the simplifying assumption that all cores execute the same loop cannot be upheld. Motivated b…

    Submitted 31 October, 2020; originally announced November 2020.

    Comments: 10 pages, 9 figures

  23. arXiv:2004.07368  [pdf, other]

    cs.RO cs.SE

    A Study on the Challenges of Using Robotics Simulators for Testing

    Authors: Afsoon Afzal, Deborah S. Katz, Claire Le Goues, Christopher S. Timperley

    Abstract: Robotics simulation plays an important role in the design, development, and verification and validation of robotic systems. Recent studies have shown that simulation may be used as a cheaper, safer, and more reliable alternative to the manual, and widely used, process of field testing. This is particularly important in the context of continuous integration pipelines, where integrated automated testing…

    Submitted 15 April, 2020; originally announced April 2020.

  24. Desynchronization and Wave Pattern Formation in MPI-Parallel and Hybrid Memory-Bound Programs

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein

    Abstract: Analytic, first-principles performance modeling of distributed-memory parallel codes is notoriously imprecise. Even for applications with extremely regular and homogeneous compute-communicate phases, simply adding communication time to computation time often does not yield a satisfactory prediction of parallel runtime due to deviations from the expected simple lockstep pattern caused by system noi…

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 18 pages, 8 figures

  25. Propagation and Decay of Injected One-Off Delays on Clusters: A Case Study

    Authors: Ayesha Afzal, Georg Hager, Gerhard Wellein

    Abstract: Analytic, first-principles performance modeling of distributed-memory applications is difficult due to a wide spectrum of random disturbances caused by the application and the system. These disturbances (commonly called "noise") destroy the assumptions of regularity that one usually employs when constructing simple analytic models. Despite numerous efforts to quantify, categorize, and reduce such…

    Submitted 28 August, 2019; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: 10 pages, 9 figures; title changed

  26. arXiv:1608.03429  [pdf, other]

    cs.NI

    Information-Centric Offloading in Cellular Networks with Coordinated Device-to-Device Communication

    Authors: Asma Afzal, Syed Ali Raza Zaidi, Des McLernon, Mounir Ghogho

    Abstract: In this paper, we develop a comprehensive analytical framework for cache enabled cellular networks overlaid with coordinated device-to-device (D2D) communication. We follow an approach similar to LTE Direct, where the base station (BS) is responsible for establishing D2D links. We consider that an arbitrary requesting user is offloaded to D2D mode to communicate with one of its 'k' closest D2D hel…

    Submitted 20 December, 2017; v1 submitted 11 August, 2016; originally announced August 2016.

    Comments: Submitted for possible journal publication

  27. arXiv:1411.6285  [pdf]

    q-bio.BM cs.LG stat.ML

    Target Fishing: A Single-Label or Multi-Label Problem?

    Authors: Avid M. Afzal, Hamse Y. Mussa, Richard E. Turner, Andreas Bender, Robert C. Glen

    Abstract: According to Cobanoglu et al. and Murphy, it is now widely acknowledged that the single target paradigm (one protein or target, one disease, one drug) that has been the dominant premise in drug development in the recent past is untenable. More often than not, a drug-like compound (ligand) can be promiscuous - that is, it can interact with more than one target protein. In recent years, in in silico…

    Submitted 23 November, 2014; originally announced November 2014.