Skip to main content

Showing 1–36 of 36 results for author: Khalifa, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.16828  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Process Reward Models That Think

    Authors: Muhammad Khalifa, Rishabh Agarwal, Lajanugen Logeswaran, Jaekyeom Kim, Hao Peng, Moontae Lee, Honglak Lee, Lu Wang

    Abstract: Step-by-step verifiers -- also known as process reward models (PRMs) -- are a key ingredient for test-time scaling. PRMs require step-level supervision, making them expensive to train. This work aims to build data-efficient PRMs as verbalized step-wise reward models that verify every step in the solution by generating a verification chain-of-thought (CoT). We propose ThinkPRM, a long CoT verifier… ▽ More

    Submitted 23 June, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

  2. arXiv:2504.09702  [pdf, other

    cs.AI

    MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?

    Authors: Yunxiang Zhang, Muhammad Khalifa, Shitanshu Bhushan, Grant D Murphy, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang

    Abstract: We introduce MLRC-Bench, a benchmark designed to quantify how effectively language agents can tackle challenging Machine Learning (ML) Research Competitions, with a focus on open research problems that demand novel methodologies. Unlike prior work, e.g., AI Scientist, which evaluates the end-to-end agentic pipeline by using LLM-as-a-judge, MLRC-Bench measures the key steps of proposing and impleme… ▽ More

    Submitted 18 May, 2025; v1 submitted 13 April, 2025; originally announced April 2025.

  3. arXiv:2412.04144  [pdf, other

    cs.CL cs.AI

    If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

    Authors: Muhammad Khalifa, Yi-Chern Tan, Arash Ahmadian, Tom Hosking, Honglak Lee, Lu Wang, Ahmet Üstün, Tom Sherborne, Matthias Gallé

    Abstract: Model merging has shown great promise at combining expert models, but the benefit of merging is unclear when merging "generalist" models trained on many tasks. We explore merging in the context of large (~100B) models, by recycling checkpoints that exhibit tradeoffs among different tasks. Such checkpoints are often created in the process of developing a frontier model, and the suboptimal ones are… ▽ More

    Submitted 3 February, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: 13 pages, 9 figures

  4. arXiv:2411.07130  [pdf, ps, other

    cs.CL

    On Many-Shot In-Context Learning for Long-Context Evaluation

    Authors: Kaijian Zou, Muhammad Khalifa, Lu Wang

    Abstract: Many-shot in-context learning (ICL) has emerged as a unique setup to both utilize and test the ability of large language models to handle long context. This paper delves into long-context language model (LCLM) evaluation through many-shot ICL. We first ask: what types of ICL tasks benefit from additional demonstrations, and how effective are they in evaluating LCLMs? We find that classification an… ▽ More

    Submitted 12 June, 2025; v1 submitted 11 November, 2024; originally announced November 2024.

    Comments: ACL 2025 Main Conference

  5. arXiv:2410.02899  [pdf, ps, other

    cs.CL

    FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs

    Authors: Deema Alnuhait, Neeraja Kirtane, Muhammad Khalifa, Hao Peng

    Abstract: Language models (LMs) hallucinate. We inquire: Can we detect and mitigate hallucinations before they happen? This work answers this research question in the positive, by showing that the internal representations of LMs provide rich signals that can be used for this purpose. We introduce FactCheckmate, which preemptively detects hallucinations by learning a classifier that predicts whether the LM w… ▽ More

    Submitted 24 June, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

  6. arXiv:2405.16337  [pdf, other

    cs.CL cs.AI

    Learning to Reason via Program Generation, Emulation, and Search

    Authors: Nathaniel Weir, Muhammad Khalifa, Linlu Qiu, Orion Weller, Peter Clark

    Abstract: Program synthesis with language models (LMs) has unlocked a large set of reasoning abilities; code-tuned LMs have proven adept at generating programs that solve a wide variety of algorithmic symbolic manipulation tasks (e.g. word concatenation). However, not all reasoning tasks are easily expressible as code, e.g. tasks involving commonsense reasoning, moral decision-making, and sarcasm understand… ▽ More

    Submitted 3 November, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024 camera ready

  7. arXiv:2404.17140  [pdf, other

    cs.CL

    Small Language Models Need Strong Verifiers to Self-Correct Reasoning

    Authors: Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang

    Abstract: Self-correction has emerged as a promising solution to boost the reasoning performance of large language models (LLMs), where LLMs refine their solutions using self-generated critiques that pinpoint the errors. This work explores whether small (<= 13B) language models (LMs) have the ability of self-correction on reasoning tasks with minimal inputs from stronger LMs. We propose a novel pipeline tha… ▽ More

    Submitted 5 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: ACL Findings 2024 - Camera Ready

  8. arXiv:2404.01019  [pdf, other

    cs.CL cs.AI

    Source-Aware Training Enables Knowledge Attribution in Language Models

    Authors: Muhammad Khalifa, David Wadden, Emma Strubell, Honglak Lee, Lu Wang, Iz Beltagy, Hao Peng

    Abstract: Large language models (LLMs) learn a vast amount of knowledge during pretraining, but they are often oblivious to the source(s) of such knowledge. We investigate the problem of intrinsic source citation, where LLMs are required to cite the pretraining source supporting a generated response. Intrinsic source citation can enhance LLM transparency, interpretability, and verifiability. To give LLMs su… ▽ More

    Submitted 12 August, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: COLM '24

  9. arXiv:2310.19208  [pdf, other

    cs.CL

    LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses

    Authors: Xin Liu, Muhammad Khalifa, Lu Wang

    Abstract: A model is considered well-calibrated when its probability estimate aligns with the actual likelihood of the output being correct. Calibrating language models (LMs) is crucial, as it plays a vital role in detecting and mitigating hallucinations of LMs as well as building more trustworthy models. However, standard calibration techniques may not be suited for LM calibration. For instance, post-proce… ▽ More

    Submitted 13 March, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: accepted to ICLR 2024

  10. arXiv:2310.14393  [pdf, other

    cs.CL cs.AI

    Merging Generated and Retrieved Knowledge for Open-Domain QA

    Authors: Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

    Abstract: Open-domain question answering (QA) systems are often built with retrieval modules. However, retrieving passages from a given source is known to suffer from insufficient knowledge coverage. Alternatively, prompting large language models (LLMs) to generate contextual passages based on their parametric knowledge has been shown to improve QA performance. Yet, LLMs tend to "hallucinate" content that c… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 - Camera Ready

  11. arXiv:2308.08780  [pdf, other

    cs.CL cs.AI

    Exploring Demonstration Ensembling for In-context Learning

    Authors: Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

    Abstract: In-context learning (ICL) operates by showing language models (LMs) examples of input-output pairs for a given task, i.e., demonstrations. The standard approach for ICL is to prompt the LM with concatenated demonstrations followed by the test input. This approach suffers from some issues. First, concatenation offers almost no control over the contribution of each demo to the model prediction. This… ▽ More

    Submitted 20 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: Published at ME-FoMo workshop at ICLR 2023. Arxiv version includes evaluation on 5 more tasks

  12. arXiv:2305.15629  [pdf, other

    cs.LG cs.AI

    Patient Outcome Predictions Improve Operations at a Large Hospital Network

    Authors: Liangyuan Na, Kimberly Villalobos Carballo, Jean Pauphilet, Ali Haddad-Sisakht, Daniel Kombert, Melissa Boisjoli-Langlois, Andrew Castiglione, Maram Khalifa, Pooja Hebbal, Barry Stein, Dimitris Bertsimas

    Abstract: Problem definition: Access to accurate predictions of patients' outcomes can enhance medical staff's decision-making, which ultimately benefits all stakeholders in the hospitals. A large hospital network in the US has been collaborating with academics and consultants to predict short-term and long-term outcomes for all inpatients across their seven hospitals. Methodology/results: We develop machin… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 41 pages, 13 figures

  13. arXiv:2305.14934  [pdf, other

    cs.CL cs.AI

    GRACE: Discriminator-Guided Chain-of-Thought Reasoning

    Authors: Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

    Abstract: In the context of multi-step reasoning, e.g., with chain-of-thought, language models (LMs) can easily assign a high likelihood to incorrect steps. As a result, decoding strategies that optimize for solution likelihood often yield incorrect solutions. To address this issue, we propose Guiding chain-of-thought ReAsoning with a CorrectnEss Discriminator (GRACE), a stepwise decoding approach that stee… ▽ More

    Submitted 23 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: To appear at Findings of EMNLP 2023

  14. arXiv:2305.12544  [pdf, other

    cs.CL cs.AI

    Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models

    Authors: Oana Ignat, Zhijing Jin, Artem Abzaliev, Laura Biester, Santiago Castro, Naihao Deng, Xinyi Gao, Aylin Gunal, Jacky He, Ashkan Kazemi, Muhammad Khalifa, Namho Koh, Andrew Lee, Siyang Liu, Do June Min, Shinka Mori, Joan Nwatu, Veronica Perez-Rosas, Siqi Shen, Zekun Wang, Winston Wu, Rada Mihalcea

    Abstract: Recent progress in large language models (LLMs) has enabled the deployment of many generative NLP applications. At the same time, it has also led to a misleading public discourse that ``it's all been solved.'' Not surprisingly, this has, in turn, made many NLP researchers -- especially those at the beginning of their careers -- worry about what NLP research area they should focus on. Has it all be… ▽ More

    Submitted 15 March, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted at COLING 2024

  15. arXiv:2305.12018  [pdf, other

    cs.CL

    BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases

    Authors: Xin Liu, Muhammad Khalifa, Lu Wang

    Abstract: Energy-based models (EBMs) have gained popularity for controlled text generation due to their high applicability to a wide range of constraints. However, sampling from EBMs is non-trivial, as it often requires a large number of iterations to converge to plausible text, which slows down the decoding process and makes it less practical for real-world applications. In this work, we propose BOLT, whic… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023

  16. arXiv:2302.08284  [pdf, other

    cs.LG eess.SY

    ClaPIM: Scalable Sequence CLAssification using Processing-In-Memory

    Authors: Marcel Khalifa, Barak Hoffer, Orian Leitersdorf, Robert Hanhan, Ben Perach, Leonid Yavits, Shahar Kvatinsky

    Abstract: DNA sequence classification is a fundamental task in computational biology with vast implications for applications such as disease prevention and drug design. Therefore, fast high-quality sequence classifiers are significantly important. This paper introduces ClaPIM, a scalable DNA sequence classification architecture based on the emerging concept of hybrid in-crossbar and near-crossbar memristive… ▽ More

    Submitted 5 November, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

  17. arXiv:2211.04903  [pdf, other

    cs.CL

    Novel Chapter Abstractive Summarization using Spinal Tree Aware Sub-Sentential Content Selection

    Authors: Hardy Hardy, Miguel Ballesteros, Faisal Ladhak, Muhammad Khalifa, Vittorio Castelli, Kathleen McKeown

    Abstract: Summarizing novel chapters is a difficult task due to the input length and the fact that sentences that appear in the desired summaries draw content from multiple places throughout the chapter. We present a pipelined extractive-abstractive approach where the extractive step filters the content that is passed to the abstractive component. Extremely lengthy input also results in a highly skewed data… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  18. arXiv:2210.05613  [pdf, other

    cs.CL cs.AI

    Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

    Authors: Muhammad Khalifa, Yogarshi Vyas, Shuai Wang, Graham Horwood, Sunil Mallya, Miguel Ballesteros

    Abstract: We investigate semi-structured document classification in a zero-shot setting. Classification of semi-structured documents is more challenging than that of standard unstructured documents, as positional, layout, and style information play a vital role in interpreting such documents. The standard classification setting where categories are fixed during both training and testing falls short in dynam… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  19. arXiv:2205.12650  [pdf, other

    cs.CL cs.IR

    Few-shot Reranking for Multi-hop QA via Language Model Prompting

    Authors: Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

    Abstract: We study few-shot reranking for multi-hop QA with open-domain questions. To alleviate the need for a large number of labeled question-document pairs for retriever training, we propose PromptRank, which relies on large language models prompting for multi-hop path reranking. PromptRank first constructs an instruction-based prompt that includes a candidate document path and then computes the relevanc… ▽ More

    Submitted 2 July, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: ACL 2023 - Camera Ready

  20. arXiv:2201.04205  [pdf, other

    cs.GR

    JSOL: JavaScript Open-source Library for Grammar of Graphics

    Authors: Waleed A. Yousef, Hisham E. Mohammed, Andrew A. Naguib, Rafat S. Eid, Sherif E. Emabrak, Ahmed F. Hamed, Yusuf M. Khalifa, Shrouk T. AbdElrheem, Eman A. Awad, Sara G. Gaafar, Alaa M. Mamdoh, Nada A. Shawky

    Abstract: In this paper, we introduce the JavaScript Open-source Library (\libname), a high-level grammar for representing data in visualization graphs and plots. \libname~perspective on the grammar of graphics is unique; it provides state-of-art rules for encoding visual primitives that can be used to generate a known scene or to invent a new one. \libname~has ton rules developed specifically for data-mung… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

  21. arXiv:2109.08232  [pdf, other

    cs.CL

    A Bag of Tricks for Dialogue Summarization

    Authors: Muhammad Khalifa, Miguel Ballesteros, Kathleen McKeown

    Abstract: Dialogue summarization comes with its own peculiar challenges as opposed to news or scientific articles summarization. In this work, we explore four different challenges of the task: handling and differentiating parts of the dialogue belonging to multiple speakers, negation understanding, reasoning about the situation, and informal language understanding. Using a pretrained sequence-to-sequence la… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 - short paper

  22. arXiv:2104.06591  [pdf, other

    cs.CL

    Zero-Resource Multi-Dialectal Arabic Natural Language Understanding

    Authors: Muhammad Khalifa, Hesham Hassan, Aly Fahmy

    Abstract: A reasonable amount of annotated data is required for fine-tuning pre-trained language models (PLM) on downstream tasks. However, obtaining labeled examples for different language varieties can be costly. In this paper, we investigate the zero-shot performance on Dialectal Arabic (DA) when fine-tuning a PLM on modern standard Arabic (MSA) data only -- identifying a significant performance drop whe… ▽ More

    Submitted 25 May, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2101.04758

  23. arXiv:2101.04758  [pdf, other

    cs.CL cs.AI cs.NE

    Self-Training Pre-Trained Language Models for Zero- and Few-Shot Multi-Dialectal Arabic Sequence Labeling

    Authors: Muhammad Khalifa, Muhammad Abdul-Mageed, Khaled Shaalan

    Abstract: A sufficient amount of annotated data is usually required to fine-tune pre-trained language models for downstream tasks. Unfortunately, attaining labeled data can be costly, especially for multiple language varieties and dialects. We propose to self-train pre-trained language models in zero- and few-shot scenarios to improve performance on data-scarce varieties using only resources from data-rich… ▽ More

    Submitted 2 February, 2021; v1 submitted 12 January, 2021; originally announced January 2021.

    Comments: Accepted at EACL 2021 (Camera Ready Version)

  24. arXiv:2012.11635  [pdf, other

    cs.CL cs.AI cs.LG

    A Distributional Approach to Controlled Text Generation

    Authors: Muhammad Khalifa, Hady Elsahar, Marc Dymetman

    Abstract: We propose a Distributional Approach for addressing Controlled Text Generation from pre-trained Language Models (LMs). This approach permits to specify, in a single formal framework, both "pointwise" and "distributional" constraints over the target LM -- to our knowledge, the first model with such generality -- while minimizing KL divergence from the initial LM distribution. The optimal target dis… ▽ More

    Submitted 6 May, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: ICLR 2021 camera-ready version

  25. arXiv:2012.00600  [pdf

    cs.CL cs.AI cs.IR

    Extracting Synonyms from Bilingual Dictionaries

    Authors: Mustafa Jarrar, Eman Karajah, Muhammad Khalifa, Khaled Shaalan

    Abstract: We present our progress in developing a novel algorithm to extract synonyms from bilingual dictionaries. Identification and usage of synonyms play a significant role in improving the performance of information access applications. The idea is to construct a translation graph from translation pairs, then to extract and consolidate cyclic paths to form bilingual sets of synonyms. The initial evaluat… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

    Comments: In Proceedings - 11th International Global Wordnet Conference (GWC2021). Global Wordnet Association (2021)

    Journal ref: In Proceedings of the11th International Global Wordnet Conference (GWC2021). (pp. 215-222). Global Wordnet Association. (2021)

  26. arXiv:2011.10255  [pdf

    cs.CR

    A lightweight cryptography (LWC) framework to secure memory heap in Internet of Things

    Authors: Mahmoud Khalifa, Fahad Algarni, Mohammad Ayoub Khan, Azmat Ullah, Khalid Aloufic

    Abstract: The extensive networking of devices and the large amount of data generated from the Internet of Things (IoT) has brought security issues to the attention of the researcher. Java is the most common platform for embedded applications such as IoT, Wireless Sensors Networks (WSN), Near Field Communications (NFC) and Radio Frequency Identification (RFID). The object programming languages such as Java,… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: Alexandria Engineering Journal

  27. arXiv:2007.11073  [pdf, other

    cs.CL cs.AI

    Book Success Prediction with Pretrained Sentence Embeddings and Readability Scores

    Authors: Muhammad Khalifa, Aminul Islam

    Abstract: Predicting the potential success of a book in advance is vital in many applications. This could help both publishers and readers in their decision-making process whether or not a book is worth publishing and reading, respectively. In this paper, we propose a model that leverages pretrained sentence embeddings along with various readability scores for book success prediction. Unlike previous method… ▽ More

    Submitted 5 October, 2021; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: To Appear at HICSS-55

  28. arXiv:2004.01184  [pdf

    eess.IV cs.CV cs.LG

    Detection of Coronavirus (COVID-19) Associated Pneumonia based on Generative Adversarial Networks and a Fine-Tuned Deep Transfer Learning Model using Chest X-ray Dataset

    Authors: Nour Eldeen M. Khalifa, Mohamed Hamed N. Taha, Aboul Ella Hassanien, Sally Elghamrawy

    Abstract: The COVID-19 coronavirus is one of the devastating viruses according to the world health organization. This novel virus leads to pneumonia, which is an infection that inflames the lungs' air sacs of a human. One of the methods to detect those inflames is by using x-rays for the chest. In this paper, a pneumonia chest x-ray detection based on generative adversarial networks (GAN) with a fine-tuned… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: 15 pages, 3 Tables and 10 Figures

  29. arXiv:1910.05983  [pdf, other

    cs.LG stat.ML

    On the Reduction of Variance and Overestimation of Deep Q-Learning

    Authors: Mohammed Sabry, Amr M. A. Khalifa

    Abstract: The breakthrough of deep Q-Learning on different types of environments revolutionized the algorithmic design of Reinforcement Learning to introduce more stable and robust algorithms, to that end many extensions to deep Q-Learning algorithm have been proposed to reduce the variance of the target values and the overestimation phenomena. In this paper, we examine new methodology to solve these issues… ▽ More

    Submitted 14 April, 2024; v1 submitted 14 October, 2019; originally announced October 2019.

  30. arXiv:1908.06738   

    cs.SE cs.CL

    Semantic Source Code Search: A Study of the Past and a Glimpse at the Future

    Authors: Muhammad Khalifa

    Abstract: With the recent explosion in the size and complexity of source codebases and software projects, the need for efficient source code search engines has increased dramatically. Unfortunately, existing information retrieval-based methods fail to capture the query semantics and perform well only when the query contains syntax-based keywords. Consequently, such methods will perform poorly when given hig… ▽ More

    Submitted 23 September, 2021; v1 submitted 15 August, 2019; originally announced August 2019.

    Comments: The paper is outdated as there have been new methods and I have little time to work on it

  31. arXiv:1908.02300  [pdf, other

    cs.CV eess.IV eess.SP

    Relative Afferent Pupillary Defect Screening through Transfer Learning

    Authors: Dogancan Temel, Melvin J. Mathew, Ghassan AlRegib, Yousuf M. Khalifa

    Abstract: Abnormalities in pupillary light reflex can indicate optic nerve disorders that may lead to permanent visual loss if not diagnosed in an early stage. In this study, we focus on relative afferent pupillary defect (RAPD), which is based on the difference between the reactions of the eyes when they are exposed to light stimuli. Incumbent RAPD assessment methods are based on subjective practices that… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: 8 pages, 7 figures, 4 tables. IEEE Journal of Biomedical and Health Informatics, 2019

    ACM Class: I.4

  32. arXiv:1907.11524  [pdf

    cs.CY

    Validating and Updating GRASP: A New Evidence-Based Framework for Grading and Assessment of Clinical Predictive Tools

    Authors: Mohamed Khalifa, Farah Magrabi, Blanca Gallego

    Abstract: Background: When selecting predictive tools, for implementation in clinical practice or for recommendation in guidelines, clinicians are challenged with an overwhelming and ever-growing number of tools. Many of these have never been implemented or evaluated for comparative effectiveness. The authors developed an evidence-based framework for grading and assessment of predictive tools (GRASP), based… ▽ More

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1907.03706, arXiv:1907.11523

  33. arXiv:1907.11523  [pdf

    cs.CY

    Evaluating the Impact of Using GRASP Framework on Clinicians and Healthcare Professionals Decisions in Selecting Clinical Predictive Tools

    Authors: Mohamed Khalifa, Farah Magrabi, Blanca Gallego

    Abstract: Background. When selecting predictive tools, clinicians and healthcare professionals are challenged with an overwhelming number of tools, most of which have never been evaluated for comparative effectiveness. To overcome this challenge, the authors developed and validated an evidence-based framework for grading and assessment of predictive tools (GRASP), based on the critical appraisal of publishe… ▽ More

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: 42 pages, 9 figures, and 13 tables. arXiv admin note: text overlap with arXiv:1907.03706, arXiv:1907.11524

  34. arXiv:1907.03706  [pdf

    cs.CY

    Developing an Evidence-Based Framework for Grading and Assessment of Predictive Tools for Clinical Decision Support

    Authors: Mohamed Khalifa, Farah Magrabi, Blanca Gallego

    Abstract: Background: Clinical predictive tools quantify contributions of relevant patient characteristics to derive likelihood of diseases or predict clinical outcomes. When selecting a predictive tool, for implementation at clinical practice or for recommendation in clinical guidelines, clinicians are challenged with an overwhelming and ever growing number of tools, most of which have never been implement… ▽ More

    Submitted 18 June, 2019; originally announced July 2019.

    Comments: 63 pages; 48 pages main text and 15 pages appendix. 6 figures and 12 tables

  35. arXiv:1905.08886  [pdf, other

    eess.IV cs.CV eess.SP

    Automated Pupillary Light Reflex Test on a Portable Platform

    Authors: Dogancan Temel, Melvin J. Mathew, Ghassan AlRegib, Yousuf M. Khalifa

    Abstract: In this paper, we introduce a portable eye imaging device denoted as lab-on-a-headset, which can automatically perform a swinging flashlight test. We utilized this device in a clinical study to obtain high-resolution recordings of eyes while they are exposed to a varying light stimuli. Half of the participants had relative afferent pupillary defect (RAPD) while the other half was a control group.… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

    Comments: 7 pages, 11 figures, 3 tables

    ACM Class: I.4

    Journal ref: International Symposium on Medical Robotics (ISMR), Atlanta, GA, USA, 2019, pp. 1-7

  36. arXiv:1709.02245  [pdf

    cs.CV

    Deep Galaxy: Classification of Galaxies based on Deep Convolutional Neural Networks

    Authors: Nour Eldeen M. Khalifa, Mohamed Hamed N. Taha, Aboul Ella Hassanien, I. M. Selim

    Abstract: In this paper, a deep convolutional neural network architecture for galaxies classification is presented. The galaxy can be classified based on its features into main three categories Elliptical, Spiral, and Irregular. The proposed deep galaxies architecture consists of 8 layers, one main convolutional layer for features extraction with 96 filters, followed by two principles fully connected layers… ▽ More

    Submitted 2 September, 2017; originally announced September 2017.

    Comments: 4 pages, 6 figures, 2 tables, Conference