Search | arXiv e-print repository

"I Wrote, I Paused, I Rewrote" Teaching LLMs to Read Between the Lines of Student Writing

Authors: Samra Zafar, Shaheer Minhas, Syed Ali Hassan Zaidi, Arfa Naeem, Zahra Ali

Abstract: Large language models(LLMs) like Gemini are becoming common tools for supporting student writing. But most of their feedback is based only on the final essay missing important context about how that text was written. In this paper, we explore whether using writing process data, collected through keystroke logging and periodic snapshots, can help LLMs give feedback that better reflects how learners… ▽ More Large language models(LLMs) like Gemini are becoming common tools for supporting student writing. But most of their feedback is based only on the final essay missing important context about how that text was written. In this paper, we explore whether using writing process data, collected through keystroke logging and periodic snapshots, can help LLMs give feedback that better reflects how learners think and revise while writing. We built a digital writing tool that captures both what students type and how their essays evolve over time. Twenty students used this tool to write timed essays, which were then evaluated in two ways: (i) LLM generated feedback using both the final essay and the full writing trace, and (ii) After the task, students completed surveys about how useful and relatable they found the feedback. Early results show that learners preferred the process-aware LLM feedback, finding it more in tune with their own thinking. We also found that certain types of edits, like adding new content or reorganizing paragraphs, aligned closely with higher scores in areas like coherence and elaboration. Our findings suggest that making LLMs more aware of the writing process can lead to feedback that feels more meaningful, personal, and supportive. △ Less

Submitted 9 June, 2025; originally announced June 2025.

Comments: 7 pages, 6 figures, 2 tables

arXiv:2412.00588 [pdf]

Enhancing creep resistance in refractory high-entropy alloys: role of grain size and local chemical order

Authors: Saifuddin Zafar, Mashaekh Tausif Ehsan, Sourav Das Suvro, Mohammad Nasim Hasan, Mahmudul Islam

Abstract: Refractory high-entropy alloys (RHEAs) are a promising class of materials with potential applications in extreme environments, where the dominant failure mode is thermal creep. The design of these alloys, therefore, requires an understanding of how their microstructure and local chemical distribution affect creep behavior. In this study, we performed high-fidelity atomistic simulations using machi… ▽ More Refractory high-entropy alloys (RHEAs) are a promising class of materials with potential applications in extreme environments, where the dominant failure mode is thermal creep. The design of these alloys, therefore, requires an understanding of how their microstructure and local chemical distribution affect creep behavior. In this study, we performed high-fidelity atomistic simulations using machine-learning interatomic potentials to explore the creep deformation of MoNbTaW RHEAs under a wide range of stress and temperature conditions. We parametrized grain size and local chemical order (LCO) to investigate the effects of these two important design variables, which are controllable during the alloy fabrication process. Our investigation revealed that resistance to creep deformation is enhanced by larger grain sizes and higher levels of LCO. This study highlights the importance of utilizing LCO in conjunction with other microstructural properties when designing RHEAs for extreme environment applications. △ Less

Submitted 27 February, 2025; v1 submitted 30 November, 2024; originally announced December 2024.

Comments: 23 pages, 6 figures

arXiv:2411.13670 [pdf]

Graph neural network framework for energy mapping of hybrid monte-carlo molecular dynamics simulations of Medium Entropy Alloys

Authors: Mashaekh Tausif Ehsan, Saifuddin Zafar, Apurba Sarker, Sourav Das Suvro, Mohammad Nasim Hasan

Abstract: Machine learning (ML) methods have drawn significant interest in material design and discovery. Graph neural networks (GNNs), in particular, have demonstrated strong potential for predicting material properties. The present study proposes a graph-based representation for modeling medium-entropy alloys (MEAs). Hybrid Monte-Carlo molecular dynamics (MC/MD) simulations are employed to achieve thermal… ▽ More Machine learning (ML) methods have drawn significant interest in material design and discovery. Graph neural networks (GNNs), in particular, have demonstrated strong potential for predicting material properties. The present study proposes a graph-based representation for modeling medium-entropy alloys (MEAs). Hybrid Monte-Carlo molecular dynamics (MC/MD) simulations are employed to achieve thermally stable structures across various annealing temperatures in an MEA. These simulations generate dump files and potential energy labels, which are used to construct graph representations of the atomic configurations. Edges are created between each atom and its 12 nearest neighbors without incorporating explicit edge features. These graphs then serve as input for a Graph Convolutional Neural Network (GCNN) based ML model to predict the system's potential energy. The GCNN architecture effectively captures the local environment and chemical ordering within the MEA structure. The GCNN-based ML model demonstrates strong performance in predicting potential energy at different steps, showing satisfactory results on both the training data and unseen configurations. Our approach presents a graph-based modeling framework for MEAs and high-entropy alloys (HEAs), which effectively captures the local chemical order (LCO) within the alloy structure. This allows us to predict key material properties influenced by LCO in both MEAs and HEAs, providing deeper insights into how atomic-scale arrangements affect the properties of these alloys. △ Less

Submitted 20 November, 2024; originally announced November 2024.

Comments: 28 pages, 9 figures

arXiv:2410.01475 [pdf, other]

Exploring Learning Rate Selection in Generalised Bayesian Inference using Posterior Predictive Checks

Authors: Schyan Zafar, Geoff K. Nicholls

Abstract: Generalised Bayesian Inference (GBI) attempts to address model misspecification in a standard Bayesian setup by tempering the likelihood. The likelihood is raised to a fractional power, called the learning rate, which reduces its importance in the posterior and has been established as a method to address certain kinds of model misspecification. Posterior Predictive Checks (PPC) attempt to detect m… ▽ More Generalised Bayesian Inference (GBI) attempts to address model misspecification in a standard Bayesian setup by tempering the likelihood. The likelihood is raised to a fractional power, called the learning rate, which reduces its importance in the posterior and has been established as a method to address certain kinds of model misspecification. Posterior Predictive Checks (PPC) attempt to detect model misspecification by locating a diagnostic, computed on the observed data, within the posterior predictive distribution of the diagnostic. This can be used to construct a hypothesis test where a small $p$-value indicates potential misfit. The recent Embedded Diachronic Sense Change (EDiSC) model suffers from misspecification and benefits from likelihood tempering. Using EDiSC as a case study, this exploratory work examines whether PPC could be used in a novel way to set the learning rate in a GBI setup. Specifically, the learning rate selected is the lowest value for which a hypothesis test using the log likelihood diagnostic is not rejected at the 10% level. The experimental results are promising, though not definitive, and indicate the need for further research along the lines suggested here. △ Less

Submitted 21 January, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

arXiv:2409.10932

Early Detection of Coronary Heart Disease Using Hybrid Quantum Machine Learning Approach

Authors: Mehroush Banday, Sherin Zafar, Parul Agarwal, M Afshar Alam, Abubeker K M

Abstract: Coronary heart disease (CHD) is a severe cardiac disease, and hence, its early diagnosis is essential as it improves treatment results and saves money on medical care. The prevailing development of quantum computing and machine learning (ML) technologies may bring practical improvement to the performance of CHD diagnosis. Quantum machine learning (QML) is receiving tremendous interest in various d… ▽ More Coronary heart disease (CHD) is a severe cardiac disease, and hence, its early diagnosis is essential as it improves treatment results and saves money on medical care. The prevailing development of quantum computing and machine learning (ML) technologies may bring practical improvement to the performance of CHD diagnosis. Quantum machine learning (QML) is receiving tremendous interest in various disciplines due to its higher performance and capabilities. A quantum leap in the healthcare industry will increase processing power and optimise multiple models. Techniques for QML have the potential to forecast cardiac disease and help in early detection. To predict the risk of coronary heart disease, a hybrid approach utilizing an ensemble machine learning model based on QML classifiers is presented in this paper. Our approach, with its unique ability to address multidimensional healthcare data, reassures the method's robustness by fusing quantum and classical ML algorithms in a multi-step inferential framework. The marked rise in heart disease and death rates impacts worldwide human health and the global economy. Reducing cardiac morbidity and mortality requires early detection of heart disease. In this research, a hybrid approach utilizes techniques with quantum computing capabilities to tackle complex problems that are not amenable to conventional machine learning algorithms and to minimize computational expenses. The proposed method has been developed in the Raspberry Pi 5 Graphics Processing Unit (GPU) platform and tested on a broad dataset that integrates clinical and imaging data from patients suffering from CHD and healthy controls. Compared to classical machine learning models, the accuracy, sensitivity, F1 score, and specificity of the proposed hybrid QML model used with CHD are manifold higher. △ Less

Submitted 1 October, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

Comments: I found a mistake in methodology presentation. Also I have observed more precised results with new dataset. So my research guide ask me to modify the current version

arXiv:2404.01696 [pdf, ps, other]

doi 10.1093/ptep/ptae080

Semileptonic $W$ Decay to the $B$ Meson with Lepton Pairs in Heavy Quark Effective Theory Factorization upto $\mathcal{O}$$(α_s)$

Authors: Saadi Ishaq, Sajawal Zafar, Abdur Rehman, Ishtiaq Ahmed

Abstract: Motivated by the study of heavy-light meson production within the framework of heavy quark effective theory (HQET) factorization, we extend the factorization formalism for a rather complicated process $W^+\to B^+\ell^+\ell^-$ in the limit of a non-zero invariant squared-mass of dilepton, $q^2$, at the lowest order in $1/m_b$ up to $\mathcal{O}(α_s)$. The purpose of the current study is to extend t… ▽ More Motivated by the study of heavy-light meson production within the framework of heavy quark effective theory (HQET) factorization, we extend the factorization formalism for a rather complicated process $W^+\to B^+\ell^+\ell^-$ in the limit of a non-zero invariant squared-mass of dilepton, $q^2$, at the lowest order in $1/m_b$ up to $\mathcal{O}(α_s)$. The purpose of the current study is to extend the HQET factorization formula for the $W^+\to B^+\ell^+\ell^-$ process and subsequently compute the form factors for this channel up to next-to-leading-order corrections in $α_s$. We explicitly show the amplitude of the $W^+\to B^+\ell^+\ell^-$ process can also be factorized into a convolution between the perturbatively calculable hard-scattering kernel and the non-perturbative yet universal light-cone distribution amplitude (LCDA) defined in HQET. The validity of HQET factorization depends on the assumed scale hierarchy $m_W \sim m_b \gg Λ_{\mathrm{QCD}}$. Within the HQET framework, we evaluate the form factors associated with the $W^+ \rightarrow B^+\ell^+\ell^-$ process, providing insights into its phenomenology. In addition, we also perform an exploratory phenomenological study on $W^+ \rightarrow B^+\ell^+\ell^-$ by employing an exponential model for the LCDAs for $B^+$ meson. Our findings reveal that the branching ratio for $W^+ \rightarrow B^+\ell^+\ell^-$ is below $10^{-10}$. Although the branching ratios are small, this channel in high luminosity LHC experiments may serve to further constraints the value of $λ_B$. △ Less

Submitted 6 August, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

Comments: 6 figures, 20 pages

arXiv:2403.16808 [pdf, other]

doi 10.1109/CAI59869.2024.00179

Navigating the EU AI Act: A Methodological Approach to Compliance for Safety-critical Products

Authors: J. Kelly, S. Zafar, L. Heidemann, J. Zacchi, D. Espinoza, N. Mata

Abstract: In December 2023, the European Parliament provisionally agreed on the EU AI Act. This unprecedented regulatory framework for AI systems lays out guidelines to ensure the safety, legality, and trustworthiness of AI products. This paper presents a methodology for interpreting the EU AI Act requirements for high-risk AI systems by leveraging product quality models. We first propose an extended produc… ▽ More In December 2023, the European Parliament provisionally agreed on the EU AI Act. This unprecedented regulatory framework for AI systems lays out guidelines to ensure the safety, legality, and trustworthiness of AI products. This paper presents a methodology for interpreting the EU AI Act requirements for high-risk AI systems by leveraging product quality models. We first propose an extended product quality model for AI systems, incorporating attributes relevant to the Act not covered by current quality models. We map the Act requirements to relevant quality attributes with the goal of refining them into measurable characteristics. We then propose a contract-based approach to derive technical requirements at the stakeholder level. This facilitates the development and assessment of AI systems that not only adhere to established quality standards, but also comply with the regulatory requirements outlined in the Act for high-risk (including safety-critical) AI systems. We demonstrate the applicability of this methodology on an exemplary automotive supply chain use case, where several stakeholders interact to achieve EU AI Act compliance. △ Less

Submitted 26 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

Comments: To be published in: 2024 IEEE Conference on Artificial Intelligence (CAI 2024)

arXiv:2401.17984 [pdf, other]

doi 10.1145/3642921.3642928

Makinote: An FPGA-Based HW/SW Platform for Pre-Silicon Emulation of RISC-V Designs

Authors: Elias Perdomo, Alexander Kropotov, Francelly Cano, Syed Zafar, Teresa Cervero, Xavier Martorell, Behzad Salami

Abstract: Emulating chip functionality before silicon production is crucial, especially with the increasing prevalence of RISC-V-based designs. FPGAs are promising candidates for such purposes due to their high-speed and reconfigurable architecture. In this paper, we introduce our Makinote, an FPGA-based Cluster platform, hosted at Barcelona Supercomputing Center (BSC-CNS), which is composed of a large numb… ▽ More Emulating chip functionality before silicon production is crucial, especially with the increasing prevalence of RISC-V-based designs. FPGAs are promising candidates for such purposes due to their high-speed and reconfigurable architecture. In this paper, we introduce our Makinote, an FPGA-based Cluster platform, hosted at Barcelona Supercomputing Center (BSC-CNS), which is composed of a large number of FPGAs (in total 96 AMD/Xilinx Alveo U55c) to emulate massive size RTL designs (up to 750M ASIC cells). In addition, we introduce our FPGA shell as a powerful tool to facilitate the utilization of such a large FPGA cluster with minimal effort needed by the designers. The proposed FPGA shell provides an easy-to-use interface for the RTL developers to rapidly port such design into several FPGAs by automatically connecting to the necessary ports, e.g., PCIe Gen4, DRAM (DDR4 and HBM), ETH10g/100g. Moreover, specific drivers for exploiting RISC-V based architectures are provided within the set of tools associated with the FPGA shell. We release the tool online for further extensions. We validate the efficiency of our hardware platform (i.e., FPGA cluster) and the software tool (i.e., FPGA Shell) by emulating a RISC-V processor and experimenting HPC Challenge application running on 32 FPGAs. Our results demonstrate that the performance improves by 8 times over the single-FPGA case. △ Less

Submitted 5 February, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

Comments: 7 pages, 5 figures, presented in Rapid Simulation and Performance Evaluation for Design 2024 (RAPIDO24) and published in ACM Proceedings of Rapid Simulation and Performance Evaluation for Design

arXiv:2311.00541 [pdf, other]

doi 10.1016/j.csda.2024.108011

An Embedded Diachronic Sense Change Model with a Case Study from Ancient Greek

Authors: Schyan Zafar, Geoff K. Nicholls

Abstract: Word meanings change over time, and word senses evolve, emerge or die out in the process. For ancient languages, where the corpora are often small and sparse, modelling such changes accurately proves challenging, and quantifying uncertainty in sense-change estimates consequently becomes important. GASC (Genre-Aware Semantic Change) and DiSC (Diachronic Sense Change) are existing generative models… ▽ More Word meanings change over time, and word senses evolve, emerge or die out in the process. For ancient languages, where the corpora are often small and sparse, modelling such changes accurately proves challenging, and quantifying uncertainty in sense-change estimates consequently becomes important. GASC (Genre-Aware Semantic Change) and DiSC (Diachronic Sense Change) are existing generative models that have been used to analyse sense change for target words from an ancient Greek text corpus, using unsupervised learning without the help of any pre-training. These models represent the senses of a given target word such as "kosmos" (meaning decoration, order or world) as distributions over context words, and sense prevalence as a distribution over senses. The models are fitted using Markov Chain Monte Carlo (MCMC) methods to measure temporal changes in these representations. This paper introduces EDiSC, an Embedded DiSC model, which combines word embeddings with DiSC to provide superior model performance. It is shown empirically that EDiSC offers improved predictive accuracy, ground-truth recovery and uncertainty quantification, as well as better sampling efficiency and scalability properties with MCMC methods. The challenges of fitting these models are also discussed. △ Less

Submitted 25 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

Journal ref: Computational Statistics & Data Analysis, Volume 199, 2024, 108011, ISSN 0167-9473

arXiv:2310.15333 [pdf, other]

Safe and Interpretable Estimation of Optimal Treatment Regimes

Authors: Harsh Parikh, Quinn Lanners, Zade Akras, Sahar F. Zafar, M. Brandon Westover, Cynthia Rudin, Alexander Volfovsky

Abstract: Recent statistical and reinforcement learning methods have significantly advanced patient care strategies. However, these approaches face substantial challenges in high-stakes contexts, including missing data, inherent stochasticity, and the critical requirements for interpretability and patient safety. Our work operationalizes a safe and interpretable framework to identify optimal treatment regim… ▽ More Recent statistical and reinforcement learning methods have significantly advanced patient care strategies. However, these approaches face substantial challenges in high-stakes contexts, including missing data, inherent stochasticity, and the critical requirements for interpretability and patient safety. Our work operationalizes a safe and interpretable framework to identify optimal treatment regimes. This approach involves matching patients with similar medical and pharmacological characteristics, allowing us to construct an optimal policy via interpolation. We perform a comprehensive simulation study to demonstrate the framework's ability to identify optimal policies even in complex settings. Ultimately, we operationalize our approach to study regimes for treating seizures in critically ill patients. Our findings strongly support personalized treatment strategies based on a patient's medical history and pharmacological features. Notably, we identify that reducing medication doses for patients with mild and brief seizure episodes while adopting aggressive treatment for patients in intensive care unit experiencing intense seizures leads to more favorable outcomes. △ Less

Submitted 1 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

Comments: Accepted for publication in the proceedings of AISTATS 2025

arXiv:2307.10704 [pdf]

Decentralized Smart Charging of Large-Scale EVs using Adaptive Multi-Agent Multi-Armed Bandits

Authors: Sharyal Zafar, Raphaël Feraud, Anne Blavette, Guy Camilleri, Hamid Ben

Abstract: The drastic growth of electric vehicles and photovoltaics can introduce new challenges, such as electrical current congestion and voltage limit violations due to peak load demands. These issues can be mitigated by controlling the operation of electric vehicles i.e., smart charging. Centralized smart charging solutions have already been proposed in the literature. But such solutions may lack scalab… ▽ More The drastic growth of electric vehicles and photovoltaics can introduce new challenges, such as electrical current congestion and voltage limit violations due to peak load demands. These issues can be mitigated by controlling the operation of electric vehicles i.e., smart charging. Centralized smart charging solutions have already been proposed in the literature. But such solutions may lack scalability and suffer from inherent drawbacks of centralization, such as a single point of failure, and data privacy concerns. Decentralization can help tackle these challenges. In this paper, a fully decentralized smart charging system is proposed using the philosophy of adaptive multi-agent systems. The proposed system utilizes multi-armed bandit learning to handle uncertainties in the system. The presented system is decentralized, scalable, real-time, model-free, and takes fairness among different players into account. A detailed case study is also presented for performance evaluation. △ Less

Submitted 20 July, 2023; originally announced July 2023.

Comments: CIRED 2023 International Conference & Exhibition on Electricity Distribution, Jun 2023, Rome, Italy

arXiv:2203.04920 [pdf]

doi 10.1016/S2589-7500(23)00088-2

Effects of Epileptiform Activity on Discharge Outcome in Critically Ill Patients

Authors: Harsh Parikh, Kentaro Hoffman, Haoqi Sun, Wendong Ge, Jin Jing, Rajesh Amerineni, Lin Liu, Jimeng Sun, Sahar Zafar, Aaron Struck, Alexander Volfovsky, Cynthia Rudin, M. Brandon Westover

Abstract: Epileptiform activity (EA) is associated with worse outcomes including increased risk of disability and death. However, the effect of EA on the neurologic outcome is confounded by the feedback between treatment with anti-seizure medications (ASM) and EA burden. A randomized clinical trial is challenging due to the sequential nature of EA-ASM feedback, as well as ethical reasons. However, some mech… ▽ More Epileptiform activity (EA) is associated with worse outcomes including increased risk of disability and death. However, the effect of EA on the neurologic outcome is confounded by the feedback between treatment with anti-seizure medications (ASM) and EA burden. A randomized clinical trial is challenging due to the sequential nature of EA-ASM feedback, as well as ethical reasons. However, some mechanistic knowledge is available, e.g., how drugs are absorbed. This knowledge together with observational data could provide a more accurate effect estimate using causal inference. We performed a retrospective cross-sectional study with 995 patients with the modified Rankin Scale (mRS) at discharge as the outcome and the EA burden defined as the mean or maximum proportion of time spent with EA in six-hour windows in the first 24 hours of electroencephalography as the exposure. We estimated the change in discharge mRS if everyone in the dataset had experienced a certain EA burden and were untreated. We combined pharmacological modeling with an interpretable matching method to account for confounding and EA-ASM feedback. Our matched groups' quality was validated by the neurologists. Having a maximum EA burden greater than 75% when untreated had a 22% increased chance of a poor outcome (severe disability or death), and mild but long-lasting EA increased the risk of a poor outcome by 14%. The effect sizes were heterogeneous depending on pre-admission profile, e.g., patients with hypoxic-ischemic encephalopathy (HIE) or acquired brain injury (ABI) were more affected. Interventions should put a higher priority on patients with an average EA burden higher than 10%, while treatment should be more conservative when the maximum EA burden is low. △ Less

Submitted 11 March, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

Comments: 4 Figures

arXiv:2202.00478 [pdf]

NeuraHealth: An Automated Screening Pipeline to Detect Undiagnosed Cognitive Impairment in Electronic Health Records with Deep Learning and Natural Language Processing

Authors: Tanish Tyagi, Colin G. Magdamo, Ayush Noori, Zhaozhi Li, Xiao Liu, Mayuresh Deodhar, Zhuoqiao Hong, Wendong Ge, Elissa M. Ye, Yi-han Sheu, Haitham Alabsi, Laura Brenner, Gregory K. Robbins, Sahar Zafar, Nicole Benson, Lidia Moura, John Hsu, Alberto Serrano-Pozo, Dimitry Prokopenko, Rudolph E. Tanzi, Bradley T. Hyman, Deborah Blacker, Shibani S. Mukerji, M. Brandon Westover, Sudeshna Das

Abstract: Dementia related cognitive impairment (CI) is a neurodegenerative disorder, affecting over 55 million people worldwide and growing rapidly at the rate of one new case every 3 seconds. 75% cases go undiagnosed globally with up to 90% in low-and-middle-income countries, leading to an estimated annual worldwide cost of USD 1.3 trillion, forecasted to reach 2.8 trillion by 2030. With no cure, a recurr… ▽ More Dementia related cognitive impairment (CI) is a neurodegenerative disorder, affecting over 55 million people worldwide and growing rapidly at the rate of one new case every 3 seconds. 75% cases go undiagnosed globally with up to 90% in low-and-middle-income countries, leading to an estimated annual worldwide cost of USD 1.3 trillion, forecasted to reach 2.8 trillion by 2030. With no cure, a recurring failure of clinical trials, and a lack of early diagnosis, the mortality rate is 100%. Information in electronic health records (EHR) can provide vital clues for early detection of CI, but a manual review by experts is tedious and error prone. Several computational methods have been proposed, however, they lack an enhanced understanding of the linguistic context in complex language structures of EHR. Therefore, I propose a novel and more accurate framework, NeuraHealth, to identify patients who had no earlier diagnosis. In NeuraHealth, using patient EHR from Mass General Brigham BioBank, I fine-tuned a bi-directional attention-based deep learning natural language processing model to classify sequences. The sequence predictions were used to generate structured features as input for a patient level regularized logistic regression model. This two-step framework creates high dimensionality, outperforming all existing state-of-the-art computational methods as well as clinical methods. Further, I integrate the models into a real-world product, a web app, to create an automated EHR screening pipeline for scalable and high-speed discovery of undetected CI in EHR, making early diagnosis viable in medical facilities and in regions with scarce health services. △ Less

Submitted 20 June, 2022; v1 submitted 12 January, 2022; originally announced February 2022.

arXiv:2111.09115 [pdf, other]

Using Deep Learning to Identify Patients with Cognitive Impairment in Electronic Health Records

Authors: Tanish Tyagi, Colin G. Magdamo, Ayush Noori, Zhaozhi Li, Xiao Liu, Mayuresh Deodhar, Zhuoqiao Hong, Wendong Ge, Elissa M. Ye, Yi-han Sheu, Haitham Alabsi, Laura Brenner, Gregory K. Robbins, Sahar Zafar, Nicole Benson, Lidia Moura, John Hsu, Alberto Serrano-Pozo, Dimitry Prokopenko, Rudolph E. Tanzi, Bradley T. Hyman, Deborah Blacker, Shibani S. Mukerji, M. Brandon Westover, Sudeshna Das

Abstract: Dementia is a neurodegenerative disorder that causes cognitive decline and affects more than 50 million people worldwide. Dementia is under-diagnosed by healthcare professionals - only one in four people who suffer from dementia are diagnosed. Even when a diagnosis is made, it may not be entered as a structured International Classification of Diseases (ICD) diagnosis code in a patient's charts. In… ▽ More Dementia is a neurodegenerative disorder that causes cognitive decline and affects more than 50 million people worldwide. Dementia is under-diagnosed by healthcare professionals - only one in four people who suffer from dementia are diagnosed. Even when a diagnosis is made, it may not be entered as a structured International Classification of Diseases (ICD) diagnosis code in a patient's charts. Information relevant to cognitive impairment (CI) is often found within electronic health records (EHR), but manual review of clinician notes by experts is both time consuming and often prone to errors. Automated mining of these notes presents an opportunity to label patients with cognitive impairment in EHR data. We developed natural language processing (NLP) tools to identify patients with cognitive impairment and demonstrate that linguistic context enhances performance for the cognitive impairment classification task. We fine-tuned our attention based deep learning model, which can learn from complex language structures, and substantially improved accuracy (0.93) relative to a baseline NLP model (0.84). Further, we show that deep learning NLP can successfully identify dementia patients without dementia-related ICD codes or medications. △ Less

Submitted 12 November, 2021; originally announced November 2021.

Comments: Machine Learning for Health (ML4H) - Extended Abstract

arXiv:2106.14273 [pdf, other]

doi 10.1109/ACCESS.2021.3093442

A Systematic Review of Bio-Cyber Interface Technologies and Security Issues for Internet of Bio-Nano Things

Authors: Sidra Zafar, Mohsin Nazir, Taimur Bakhshi, Hasan Ali Khattak, Sarmadullah Khan, Muhammad Bilal, Kim-Kwang Raymond Choo, Kyung-Sup Kwak7, Aneeqa Sabah

Abstract: Advances in synthetic biology and nanotechnology have contributed to the design of tools that can be used to control, reuse, modify, and re-engineer cells' structure, as well as enabling engineers to effectively use biological cells as programmable substrates to realize Bio-Nano Things (biological embedded computing devices). Bio-NanoThings are generally tiny, non-intrusive, and concealable device… ▽ More Advances in synthetic biology and nanotechnology have contributed to the design of tools that can be used to control, reuse, modify, and re-engineer cells' structure, as well as enabling engineers to effectively use biological cells as programmable substrates to realize Bio-Nano Things (biological embedded computing devices). Bio-NanoThings are generally tiny, non-intrusive, and concealable devices that can be used for in-vivo applications such as intra-body sensing and actuation networks, where the use of artificial devices can be detrimental. Such (nano-scale) devices can be used in various healthcare settings such as continuous health monitoring, targeted drug delivery, and nano-surgeries. These services can also be grouped to form a collaborative network (i.e., nanonetwork), whose performance can potentially be improved when connected to higher bandwidth external networks such as the Internet, say via 5G. However, to realize the IoBNT paradigm, it is also important to seamlessly connect the biological environment with the technological landscape by having a dynamic interface design to convert biochemical signals from the human body into an equivalent electromagnetic signal (and vice versa). This, unfortunately, risks the exposure of internal biological mechanisms to cyber-based sensing and medical actuation, with potential security and privacy implications. This paper comprehensively reviews bio-cyber interface for IoBNT architecture, focusing on bio-cyber interfacing options for IoBNT like biologically inspired bio-electronic devices, RFID enabled implantable chips, and electronic tattoos. This study also identifies known and potential security and privacy vulnerabilities and mitigation strategies for consideration in future IoBNT designs and implementations. △ Less

Submitted 27 June, 2021; originally announced June 2021.

Comments: 41 pages, 9 tables, 6 figures

arXiv:2105.00819 [pdf, other]

doi 10.1111/rssc.12591

Measuring diachronic sense change: new models and Monte Carlo methods for Bayesian inference

Authors: Schyan Zafar, Geoff Nicholls

Abstract: In a bag-of-words model, the senses of a word with multiple meanings, e.g. "bank" (used either in a river-bank or an institution sense), are represented as probability distributions over context words, and sense prevalence is represented as a probability distribution over senses. Both of these may change with time. Modelling and measuring this kind of sense change is challenging due to the typical… ▽ More In a bag-of-words model, the senses of a word with multiple meanings, e.g. "bank" (used either in a river-bank or an institution sense), are represented as probability distributions over context words, and sense prevalence is represented as a probability distribution over senses. Both of these may change with time. Modelling and measuring this kind of sense change is challenging due to the typically high-dimensional parameter space and sparse datasets. A recently published corpus of ancient Greek texts contains expert-annotated sense labels for selected target words. Automatic sense-annotation for the word "kosmos" (meaning decoration, order or world) has been used as a test case in recent work with related generative models and Monte Carlo methods. We adapt an existing generative sense change model to develop a simpler model for the main effects of sense and time, and give MCMC methods for Bayesian inference on all these models that are more efficient than existing methods. We carry out automatic sense-annotation of snippets containing "kosmos" using our model, and measure the time-evolution of its three senses and their prevalence. As far as we are aware, ours is the first analysis of this data, within the class of generative models we consider, that quantifies uncertainty and returns credible sets for evolving sense prevalence in good agreement with those given by expert annotation. △ Less

Submitted 1 March, 2022; v1 submitted 14 April, 2021; originally announced May 2021.

Comments: Additional results included in the appendix

Journal ref: Journal of the Royal Statistical Society Series C: Applied Statistics, Volume 71, Issue 5, November 2022, Pages 1569-1604,

arXiv:2011.06489 [pdf, other]

Natural Language Processing to Detect Cognitive Concerns in Electronic Health Records Using Deep Learning

Authors: Zhuoqiao Hong, Colin G. Magdamo, Yi-han Sheu, Prathamesh Mohite, Ayush Noori, Elissa M. Ye, Wendong Ge, Haoqi Sun, Laura Brenner, Gregory Robbins, Shibani Mukerji, Sahar Zafar, Nicole Benson, Lidia Moura, John Hsu, Bradley T. Hyman, Michael B. Westover, Deborah Blacker, Sudeshna Das

Abstract: Dementia is under-recognized in the community, under-diagnosed by healthcare professionals, and under-coded in claims data. Information on cognitive dysfunction, however, is often found in unstructured clinician notes within medical records but manual review by experts is time consuming and often prone to errors. Automated mining of these notes presents a potential opportunity to label patients wi… ▽ More Dementia is under-recognized in the community, under-diagnosed by healthcare professionals, and under-coded in claims data. Information on cognitive dysfunction, however, is often found in unstructured clinician notes within medical records but manual review by experts is time consuming and often prone to errors. Automated mining of these notes presents a potential opportunity to label patients with cognitive concerns who could benefit from an evaluation or be referred to specialist care. In order to identify patients with cognitive concerns in electronic medical records, we applied natural language processing (NLP) algorithms and compared model performance to a baseline model that used structured diagnosis codes and medication data only. An attention-based deep learning model outperformed the baseline model and other simpler models. △ Less

Submitted 12 November, 2020; originally announced November 2020.

Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

MSC Class: I.2.7

arXiv:2010.06505 [pdf, other]

A Lean and Highly-automated Model-Based Software Development Process Based on DO-178C/DO-331

Authors: Konstantin Dmitriev, Shanza Ali Zafar, Kevin Schmiechen, Yi Lai, Micheal Saleab, Pranav Nagarajan, Daniel Dollinger, Markus Hochstrasser, Stephan Myschik, Florian Holzapfel

Abstract: The emergence of a global market for urban air mobility and unmanned aerial systems has attracted many startups across the world. These organizations have little training or experience in the traditional processes used in civil aviation for the development of software and electronic hardware. They are also constrained in the resources they can allocate for dedicated teams of professionals to follo… ▽ More The emergence of a global market for urban air mobility and unmanned aerial systems has attracted many startups across the world. These organizations have little training or experience in the traditional processes used in civil aviation for the development of software and electronic hardware. They are also constrained in the resources they can allocate for dedicated teams of professionals to follow these standardized processes. To fill this gap, this paper presents a custom workflow based on a subset of objectives derived from the foundational standards for safety critical software DO-178C/DO-331. The selection of objectives from the standards is based on the importance, degree of automation, and reusability of specific objectives. This custom workflow is intended to establish a lean and highly automated development life cycle resulting in higher quality software with better maintainability characteristics for research and prototype aircraft. It can also be proposed as means of compliance for software of certain applications such as unmanned aircraft systems, urban air mobility and general aviation. By producing the essential set of development and verification artifacts, the custom workflow also provides a scalable basis for potential future certification in compliance with DO-178C/DO-331. The custom workflow is demonstrated in a case study of an Autopilot Manual Disconnection System. △ Less

Submitted 13 October, 2020; originally announced October 2020.

arXiv:1907.08041 [pdf, other]

Channel Impulse Response-based Physical Layer Authentication in a Diffusion-based Molecular Communication System

Authors: Sidra Zafar, Waqas Aman, Muhammad Mahboob Ur Rahman, Akram Alomainy, Qammer H. Abbasi

Abstract: Consider impersonation attack by an active malicious nano node (Eve) on a diffusion based molecular communication (DbMC) system---Eve transmits during the idle slots to deceive the nano receiver (Bob) that she is indeed the legitimate nano transmitter (Alice). To this end, this work exploits the 3-dimensional (3D) channel impulse response (CIR) with $L$ taps as device fingerprint for authenticatio… ▽ More Consider impersonation attack by an active malicious nano node (Eve) on a diffusion based molecular communication (DbMC) system---Eve transmits during the idle slots to deceive the nano receiver (Bob) that she is indeed the legitimate nano transmitter (Alice). To this end, this work exploits the 3-dimensional (3D) channel impulse response (CIR) with $L$ taps as device fingerprint for authentication of the nano transmitter during each slot. Specifically, Bob utilizes the Alice's CIR as ground truth to construct a binary hypothesis test to systematically accept/reject the data received in each slot. Simulation results highlight the great challenge posed by impersonation attack--i.e., it is not possible to simultaneously minimize the two error probabilities. In other words, one needs to tolerate on one error type in order to minimize the other error type. △ Less

Submitted 18 July, 2019; originally announced July 2019.

Comments: 2 pages, 3 figures, Accepted for publication in UCET-19 as an Extended Abstract

arXiv:1809.06218 [pdf, other]

Facial Recognition with Encoded Local Projections

Authors: Dhruv Sharma, Sarim Zafar, Morteza Babaie, H. R. Tizhoosh

Abstract: Encoded Local Projections (ELP) is a recently introduced dense sampling image descriptor which uses projections in small neighbourhoods to construct a histogram/descriptor for the entire image. ELP has shown to be as accurate as other state-of-the-art features in searching medical images while being time and resource efficient. This paper attempts for the first time to utilize ELP descriptor as pr… ▽ More Encoded Local Projections (ELP) is a recently introduced dense sampling image descriptor which uses projections in small neighbourhoods to construct a histogram/descriptor for the entire image. ELP has shown to be as accurate as other state-of-the-art features in searching medical images while being time and resource efficient. This paper attempts for the first time to utilize ELP descriptor as primary features for facial recognition and compare the results with LBP histogram on the Labeled Faces in the Wild dataset. We have evaluated descriptors by comparing the chi-squared distance of each image descriptor versus all others as well as training Support Vector Machines (SVM) with each feature vector. In both cases, the results of ELP were better than LBP in the same sub-image configuration. △ Less

Submitted 11 September, 2018; originally announced September 2018.

Comments: To be published at the 2018 IEEE Symp. Series on Comp. Intelligence (IEEE SSCI 2018), 18-21 NOV, 2018, BENGALURU, INDIA

arXiv:1807.02365 [pdf, ps, other]

On minimal edge version of doubly resolving sets of a graph

Authors: Muhammad Ahmad, Zohaib Zahid, Sohail Zafar

Abstract: In this paper, we introduce the edge version of doubly resolving set of a graph which is based on the edge distances of the graph. As a main result, we computed the minimum cardinality $ψ_E$ of edge version of doubly resolving sets of family of $n$-sunlet graph $S_n$ and prism graph $Y_n$. In this paper, we introduce the edge version of doubly resolving set of a graph which is based on the edge distances of the graph. As a main result, we computed the minimum cardinality $ψ_E$ of edge version of doubly resolving sets of family of $n$-sunlet graph $S_n$ and prism graph $Y_n$. △ Less

Submitted 6 July, 2018; originally announced July 2018.

arXiv:1704.07731 [pdf, ps, other]

Dynamical Classification of a Family of Birational Maps of C^2 via Algebraic Entropy

Authors: Anna Cima, Sundus Zafar

Abstract: This work dynamically classifies a 9-parametric family of birational maps f : C2 -> C2. From the sequence of the degrees dn of the iterates of f, we find the dynamical degree delta(f) of f. We identify when dn grows periodically, linearly, quadratically or exponentially. The considered family includes the birational maps studied by Bedford and Kim in [4] as one of its subfamilies. This work dynamically classifies a 9-parametric family of birational maps f : C2 -> C2. From the sequence of the degrees dn of the iterates of f, we find the dynamical degree delta(f) of f. We identify when dn grows periodically, linearly, quadratically or exponentially. The considered family includes the birational maps studied by Bedford and Kim in [4] as one of its subfamilies. △ Less

Submitted 25 April, 2017; originally announced April 2017.

Comments: arXiv admin note: text overlap with arXiv:1702.00959

MSC Class: 14E05; 26C15; 34K19; 37B40; 37C15; 39A23; 39A45

arXiv:1704.07108 [pdf, ps, other]

Zero Entropy for Some Birational Maps of C^2

Authors: Anna Cima, Sundus Zafar

Abstract: This work deals with a special case of family of birational maps f : C2 -> C2 dynamically classified in [9]. In this work we study the zero entropy sub families of f. The sequence of degrees dn associated to the iterates of f is found to grow periodically, linearly, quadratically or exponentially. Explicit invariant fibrations for zero entropy families and all the integrable and periodic mappings… ▽ More This work deals with a special case of family of birational maps f : C2 -> C2 dynamically classified in [9]. In this work we study the zero entropy sub families of f. The sequence of degrees dn associated to the iterates of f is found to grow periodically, linearly, quadratically or exponentially. Explicit invariant fibrations for zero entropy families and all the integrable and periodic mappings inside the family f are given. △ Less

Submitted 24 April, 2017; originally announced April 2017.

MSC Class: 14E05; 26C15; 34K19; 37B40; 37C15; 39A23; 39A45

arXiv:1702.00959 [pdf, ps, other]

Invariant Fibrations for some Birational Maps of C^2

Authors: Anna Cima, Sundus Zafar

Abstract: In this article we extract and study the zero entropy subfamilies of a certain family of birational maps of the plane. We find these zero entropy mappings and give the invariant fibrations associated to them. In this article we extract and study the zero entropy subfamilies of a certain family of birational maps of the plane. We find these zero entropy mappings and give the invariant fibrations associated to them. △ Less

Submitted 3 February, 2017; originally announced February 2017.

arXiv:1310.3981 [pdf, ps, other]

On the Betti numbers of some classes of Binomial edge ideals

Authors: Zohaib Zahid, Sohail Zafar

Abstract: We study the Betti numbers of binomial edge ideal associated to some classes of graphs with large Castelnuovo-Mumford regularity. As an application we give several lower bounds of the Castelnuovo-Mumford regularity of arbitrary graphs depending on induced subgraphs. We study the Betti numbers of binomial edge ideal associated to some classes of graphs with large Castelnuovo-Mumford regularity. As an application we give several lower bounds of the Castelnuovo-Mumford regularity of arbitrary graphs depending on induced subgraphs. △ Less

Submitted 15 October, 2013; originally announced October 2013.

MSC Class: 05E40; 16E30

arXiv:1308.0795 [pdf]

The Economic and Sustainability Future of Cellular Networks

Authors: Salman Zafar

Abstract: Global data traffic is expected to grow exponentially in the next few years with video and smartphone applications driving data growth. Many mobile network providers in the UK have either deployed or planning to deploy 4th generation Long-Term-Evolution (LTE) mobile technology as the solution to meet capacity demands. This study evaluates the technological improvements in 4G LTE in comparison to 3… ▽ More Global data traffic is expected to grow exponentially in the next few years with video and smartphone applications driving data growth. Many mobile network providers in the UK have either deployed or planning to deploy 4th generation Long-Term-Evolution (LTE) mobile technology as the solution to meet capacity demands. This study evaluates the technological improvements in 4G LTE in comparison to 3G High Speed Packet Access (HSPA) and further conducts a techno-economic analysis using primary researched tariff data to determine network operator profitability and mobile tariff strategy to meet user demand. To ensure holistic analysis, the study also considers the environmental impacts of LTE by determining the annual carbon emission for a network operator. The study results shows LTE will prove profitable; however a trade-off has to be made by network operators between meeting consumer tariff demands or increasing profitability. Analysis also shows a 63% reduced in carbon emissions is possible with migration to 4G services with implication of further financial benefits for network operators as a result. △ Less

Submitted 4 August, 2013; originally announced August 2013.

arXiv:1302.2460 [pdf, ps, other]

Scheme of 2-dimensional atom localization for a three-level atom via quantum coherence

Authors: Sajjad Zafar, Rizwan Ahmed, M. Khalid Khan

Abstract: We present a scheme for two-dimensional (2D) atom localization in a three-level atomic system. The scheme is based on quantum coherence via classical standing wave fields between the two excited levels. Our results show that conditional position probability is significantly phase dependent of the applied field and frequency detuning of spontaneously emitted photons. We obtain a single localization… ▽ More We present a scheme for two-dimensional (2D) atom localization in a three-level atomic system. The scheme is based on quantum coherence via classical standing wave fields between the two excited levels. Our results show that conditional position probability is significantly phase dependent of the applied field and frequency detuning of spontaneously emitted photons. We obtain a single localization peak having probability close to unity by manipulating the control parameters. The effect of atomic level coherence on the sub-wavelength localization has also been studied. Our scheme may be helpful in systems involving atom-field interaction. △ Less

Submitted 11 February, 2013; originally announced February 2013.

Comments: 1 tex file and 16 figures in .eps formate

arXiv:1301.0789 [pdf, ps, other]

Algebraic properties of the binomial edge ideal of complete bipartite graph

Authors: Peter Schenzel, Sohail Zafar

Abstract: Let $J_G$ denote the binomial edge ideal of a connected undirected graph on $n$ vertices. This is the ideal generated by the binomials $x_iy_j - x_jy_i, 1\leq i < j \leq n,$ in the polynomial ring $S= K[x_1,...,x_n,y_1,...,y_n]$ where $\{i,j\}$ is an edge of $G$. We study the arithmetic properties of $S/J_G$ for $G$, the complete bipartite graph. In particular we compute dimensions, depths, Castel… ▽ More Let $J_G$ denote the binomial edge ideal of a connected undirected graph on $n$ vertices. This is the ideal generated by the binomials $x_iy_j - x_jy_i, 1\leq i < j \leq n,$ in the polynomial ring $S= K[x_1,...,x_n,y_1,...,y_n]$ where $\{i,j\}$ is an edge of $G$. We study the arithmetic properties of $S/J_G$ for $G$, the complete bipartite graph. In particular we compute dimensions, depths, Castelnuovo-Mumford regularities, Hilbert functions and multiplicities of them. As main results we give an explicit description of the modules of deficiencies, the duals of local cohomology modules, and prove the purity of the minimal free resolution of $S/J_G$. △ Less

Submitted 4 January, 2013; originally announced January 2013.

Comments: 15 pages, Accepted in An. St. Univ. Ovidius Constanta, Ser. Mat

MSC Class: 05E40; 13H10; 13D45

arXiv:1301.0784 [pdf, ps, other]

On approximately Cohen-Macaulay binomial edge ideal

Authors: Sohail Zafar

Abstract: Binomial edge ideals IG of a graph G were introduced by [4]. They found some classes of graphs G with the property that IG is a Cohen-Macaulay ideal. This might happen only for few classes of graphs. A certain generalization of being Cohen-Macaulay, named approximately Cohen-Macaulay, was introduced by S. Goto in [3]. We study classes of graphs whose binomial edge ideal are approximately Cohen-Mac… ▽ More Binomial edge ideals IG of a graph G were introduced by [4]. They found some classes of graphs G with the property that IG is a Cohen-Macaulay ideal. This might happen only for few classes of graphs. A certain generalization of being Cohen-Macaulay, named approximately Cohen-Macaulay, was introduced by S. Goto in [3]. We study classes of graphs whose binomial edge ideal are approximately Cohen-Macaulay. Moreover we use some homological methods in order to compute their Hilbert series. △ Less

Submitted 4 January, 2013; originally announced January 2013.

Comments: 12 pages and 2 figures

MSC Class: 05C05; 05C38; 05E40; 13H10

Journal ref: Bull. Math. Soc. Sci. Math. Roumanie Tome 55(103) No. 4, 2012, 429-442

Showing 1–29 of 29 results for author: Zafar, S