Showing 1–50 of 106 results for author: Cohen, P

  1. arXiv:2506.00448  [pdf, ps, other]

    cs.CL

    Fact-Controlled Diagnosis of Hallucinations in Medical Text Summarization

    Authors: Suhas BN, Han-Chin Shing, Lei Xu, Mitch Strong, Jon Burnsky, Jessica Ofor, Jordan R. Mason, Susan Chen, Sundararajan Srinivasan, Chaitanya Shivade, Jack Moriarty, Joseph Paul Cohen

    Abstract: Hallucinations in large language models (LLMs) during summarization of patient-clinician dialogues pose significant risks to patient care and clinical decision-making. However, the phenomenon remains understudied in the clinical domain, with uncertainty surrounding the applicability of general-domain hallucination detectors. The rarity and randomness of hallucinations further complicate their inve…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: https://github.com/amazon-science/acibench-hallucination-annotations

  2. arXiv:2502.07156  [pdf, other]

    cs.CV cs.AI

    Explaining 3D Computed Tomography Classifiers with Counterfactuals

    Authors: Joseph Paul Cohen, Louis Blankemeier, Akshay Chaudhari

    Abstract: Counterfactual explanations enhance the interpretability of deep learning models in medical imaging, yet adapting them to 3D CT scans poses challenges due to volumetric complexity and resource demands. We extend the Latent Shift counterfactual generation method from 2D applications to explain 3D computed tomography (CT) scan classifiers. We address the challenges associated with 3D classifiers, s…

    Submitted 2 April, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: Code and models: https://github.com/ieee8023/ct-counterfactuals

  3. arXiv:2406.06512  [pdf, other]

    cs.CV cs.AI

    Merlin: A Vision Language Foundation Model for 3D Computed Tomography

    Authors: Louis Blankemeier, Joseph Paul Cohen, Ashwin Kumar, Dave Van Veen, Syed Jamal Safdar Gardezi, Magdalini Paschali, Zhihong Chen, Jean-Benoit Delbrouck, Eduardo Reis, Cesar Truyts, Christian Bluethgen, Malte Engmann Kjeldskov Jensen, Sophie Ostmeier, Maya Varma, Jeya Maria Jose Valanarasu, Zhongnan Fang, Zepeng Huo, Zaid Nabulsi, Diego Ardila, Wei-Hung Weng, Edson Amaro Junior, Neera Ahuja, Jason Fries, Nigam H. Shah, Andrew Johnston, et al. (6 additional authors not shown)

    Abstract: Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current radiologist shortage, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies. Prior state-of-the-art approaches for automated medical image interpretation leverage vision la…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  4. arXiv:2403.05651  [pdf, other]

    physics.atom-ph physics.acc-ph physics.chem-ph physics.plasm-ph

    High-energy polarized electron beams from the ionization of isolated spin polarized hydrogen atoms

    Authors: Dimitris Sofikitis, Lars Reichwein, Marios G. Stamatakis, Christos Zois, Dimitrios G. Papazoglou, Samuel Cohen, Markus Büscher, Alexander Pukhov, T. Peter Rakitzis

    Abstract: We propose a laser-based method for the preparation of high-energy polarized electrons, from the ionization of isolated spin-polarized hydrogen (SPH) atoms. The SPH atoms are prepared from the photodissociation of HCl, using two consecutive UV pulses of ps duration. By appropriately timing and focusing the pulses, we can spatially separate the highly polarized SPH from other unwanted photoproducts…

    Submitted 2 April, 2025; v1 submitted 8 March, 2024; originally announced March 2024.

  5. arXiv:2402.04609  [pdf, other]

    cs.CL

    Improving Cross-Domain Low-Resource Text Generation through LLM Post-Editing: A Programmer-Interpreter Approach

    Authors: Zhuang Li, Levon Haroutunian, Raj Tumuluri, Philip Cohen, Gholamreza Haffari

    Abstract: Post-editing has proven effective in improving the quality of text generated by large language models (LLMs) such as GPT-3.5 or GPT-4, particularly when direct updating of their parameters to enhance text quality is infeasible or expensive. However, relying solely on smaller language models for post-editing can limit the LLMs' ability to generalize across domains. Moreover, the editing strategies…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: EACL 2024 (findings), short paper, 5 pages

  6. arXiv:2401.12208  [pdf, other]

    cs.CV cs.CL

    A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray Interpretation

    Authors: Zhihong Chen, Maya Varma, Justin Xu, Magdalini Paschali, Dave Van Veen, Andrew Johnston, Alaa Youssef, Louis Blankemeier, Christian Bluethgen, Stephan Altmayer, Jeya Maria Jose Valanarasu, Mohamed Siddig Eltayeb Muneer, Eduardo Pontes Reis, Joseph Paul Cohen, Cameron Olsen, Tanishq Mathew Abraham, Emily B. Tsai, Christopher F. Beaulieu, Jenia Jitsev, Sergios Gatidis, Jean-Benoit Delbrouck, Akshay S. Chaudhari, Curtis P. Langlotz

    Abstract: Over 1.4 billion chest X-rays (CXRs) are performed annually due to their cost-effectiveness as an initial diagnostic test. This scale of radiological studies provides a significant opportunity to streamline CXR interpretation and documentation. While foundation models are a promising solution, the lack of publicly available large-scale datasets and benchmarks inhibits their iterative development a…

    Submitted 18 December, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 26 pages, 8 figures

  7. arXiv:2312.02186  [pdf, other]

    cs.CV cs.AI cs.LG

    Identifying Spurious Correlations using Counterfactual Alignment

    Authors: Joseph Paul Cohen, Louis Blankemeier, Akshay Chaudhari

    Abstract: Models driven by spurious correlations often yield poor generalization performance. We propose the counterfactual (CF) alignment method to detect and quantify spurious correlations of black box classifiers. Our methodology is based on counterfactual images generated with respect to one classifier being input into other classifiers to see if they also induce changes in the outputs of these classifi…

    Submitted 15 January, 2025; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted to Transactions on Machine Learning Research (TMLR), Code: https://github.com/ieee8023/latentshift

  8. arXiv:2309.12294  [pdf, other]

    cs.CL

    Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models

    Authors: Levon Haroutunian, Zhuang Li, Lucian Galescu, Philip Cohen, Raj Tumuluri, Gholamreza Haffari

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in natural language generation. However, their output quality can be inconsistent, posing challenges for generating natural language from logical forms (LFs). This task requires the generated outputs to embody the exact semantics of LFs, without missing any LF semantics or creating any hallucinations. In this work, we tackle th…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: IJCNLP-AACL 2023

  9. arXiv:2305.12737  [pdf, other]

    cs.CL cs.AI

    The Best of Both Worlds: Combining Human and Machine Translations for Multilingual Semantic Parsing with Active Learning

    Authors: Zhuang Li, Lizhen Qu, Philip R. Cohen, Raj V. Tumuluri, Gholamreza Haffari

    Abstract: Multilingual semantic parsing aims to leverage the knowledge from the high-resource languages to improve low-resource semantic parsing, yet commonly suffers from the data imbalance problem. Prior works propose to utilize the translations by either humans or machines to alleviate such issues. However, human translations are expensive, while machine translations are cheap but prone to error and bias…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  10. arXiv:2304.00487  [pdf, other]

    eess.IV cs.AI cs.CV cs.HC cs.LG

    The Effect of Counterfactuals on Reading Chest X-rays

    Authors: Joseph Paul Cohen, Rupert Brooks, Sovann En, Evan Zucker, Anuj Pareek, Matthew Lungren, Akshay Chaudhari

    Abstract: This study evaluates the effect of counterfactual explanations on the interpretation of chest X-rays. We conduct a reader study with two radiologists assessing 240 chest X-ray predictions to rate their confidence that the model's prediction is correct using a 5 point scale. Half of the predictions are false positives. Each prediction is explained twice, once using traditional attribution methods a…

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: Abstract submitted to CVPR XAI4CV 2023 based on longer version: arXiv:2102.09475

  11. arXiv:2302.09646  [pdf]

    cs.AI

    An Explainable Collaborative Dialogue System using a Theory of Mind

    Authors: Philip R. Cohen, Lucian Galescu, Maayan Shvo

    Abstract: Eva is a neuro-symbolic domain-independent multimodal collaborative dialogue system that takes seriously that the purpose of task-oriented dialogue is to assist the user. To do this, the system collaborates by inferring their intentions and plans, detects obstacles to success, finds plans to overcome them or to achieve higher-level goals, and plans its actions, including speech acts, to help users…

    Submitted 20 June, 2024; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: 46 pages, 7 figures, 2 appendices

    ACM Class: I.2.7; I.2.8; I.2.4; I.2.3; I.2.11

  12. arXiv:2211.14830  [pdf, other]

    eess.IV cs.CV

    Medical Image Segmentation Review: The success of U-Net

    Authors: Reza Azad, Ehsan Khodapanah Aghdam, Amelie Rauland, Yiwei Jia, Atlas Haddadi Avval, Afshin Bozorgpour, Sanaz Karimijafarbigloo, Joseph Paul Cohen, Ehsan Adeli, Dorit Merhof

    Abstract: Automatic medical image segmentation is a crucial topic in the medical domain and successively a critical counterpart in the computer-aided diagnosis paradigm. U-Net is the most widespread image segmentation architecture due to its flexibility, optimized modular design, and success in all medical image modalities. Over the years, the U-Net model achieved tremendous attention from academic and indu…

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Submitted to the IEEE Transactions on Pattern Analysis and Machine Intelligence Journal

  13. arXiv:2208.02625  [pdf, ps, other]

    math.NT

    On the moments of one-level densities in families of holomorphic cusp forms in the level aspect

    Authors: Peter Cohen, Justine Dell, Oscar E. González, Simran Khunger, Chung-Hang Kwan, Steven J. Miller, Alexander Shashkov, Alicia Smith Reina, Carsten Sprunger, Nicholas Triantafillou, Nhi Truong, Roger Van Peski, Stephen Willis

    Abstract: We study the $n^{\rm th}$ centered moments of the $1$-level density for the low-lying zeros of $L$-functions attached to holomorphic cuspidal newforms of large prime level and fixed weight. Assuming the Generalized Riemann Hypotheses, we compute this statistic for any $n\ge 1$ and for all test functions whose Fourier transforms are supported in $\left(-2/n, \, 2/n\right)$. This is believed to be t…

    Submitted 28 March, 2025; v1 submitted 27 July, 2022; originally announced August 2022.

    Comments: 58 pages. Revised version, to appear in Algebra & Number Theory

  14. arXiv:2203.09016  [pdf, other]

    cs.HC cs.CL

    Natural Language Communication with a Teachable Agent

    Authors: Rachel Love, Edith Law, Philip R. Cohen, Dana Kulić

    Abstract: Conversational teachable agents offer a promising platform to support learning, both in the classroom and in remote settings. In this context, the agent takes the role of the novice, while the student takes on the role of teacher. This framing is significant for its ability to elicit the Protégé effect in the student-teacher, a pedagogical phenomenon known to increase engagement in the teaching ta…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  15. The Lick Observatory Supernova Search follow-up program: photometry data release of 70 stripped-envelope supernovae

    Authors: WeiKang Zheng, Benjamin E. Stahl, Thomas de Jaeger, Alexei V. Filippenko, Shan-Qin Wang, Wen-Pei Gan, Thomas G. Brink, Ivan Altunin, Raphael Baer-Way, Andrew Bigley, Kyle Blanchard, Peter K. Blanchard, James Bradley, Samantha K. Cargill, Chadwick Casper, Teagan Chapman, Vidhi Chander, Sanyum Channa, Byung Yun Choi, Nick Choksi, Matthew Chu, Kelsey I. Clubb, Daniel P. Cohen, Paul A. Dalba, Asia deGraw, et al. (63 additional authors not shown)

    Abstract: We present BVRI and unfiltered Clear light curves of 70 stripped-envelope supernovae (SESNe), observed between 2003 and 2020, from the Lick Observatory Supernova Search (LOSS) follow-up program. Our SESN sample consists of 19 spectroscopically normal SNe Ib, two peculiar SNe Ib, six SN Ibn, 14 normal SNe Ic, one peculiar SN Ic, ten SNe Ic-BL, 15 SNe IIb, one ambiguous SN IIb/Ib/c, and two superlum…

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Accepted by MNRAS

  16. arXiv:2202.02833  [pdf, other]

    eess.IV cs.CV cs.LG

    CheXstray: Real-time Multi-Modal Data Concordance for Drift Detection in Medical Imaging AI

    Authors: Arjun Soin, Jameson Merkow, Jin Long, Joseph Paul Cohen, Smitha Saligrama, Stephen Kaiser, Steven Borg, Ivan Tarapov, Matthew P Lungren

    Abstract: Clinical Artificial Intelligence (AI) applications are rapidly expanding worldwide, and have the potential to impact all areas of medical practice. Medical imaging applications constitute a vast majority of approved clinical AI applications. Though healthcare systems are eager to adopt AI solutions, a fundamental question remains: \textit{what happens after the AI model goes into production?} We…

    Submitted 17 March, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: Added code url

  17. arXiv:2112.13734  [pdf, ps, other]

    cs.CV

    Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models

    Authors: Enoch Tetteh, Joseph Viviano, Yoshua Bengio, David Krueger, Joseph Paul Cohen

    Abstract: Learning models that generalize under different distribution shifts in medical imaging has been a long-standing research challenge. There have been several proposals for efficient and robust visual representation learning among vision research practitioners, especially in the sensitive and critical biomedical domain. In this paper, we propose an idea for out-of-distribution generalization of chest…

    Submitted 27 December, 2021; v1 submitted 27 December, 2021; originally announced December 2021.

    Comments: MED-NEURIPS 2021

  18. arXiv:2112.02064  [pdf, ps, other]

    math.PR math-ph

    Moments of discrete classical $q$-orthogonal polynomial ensembles

    Authors: Philip Cohen

    Abstract: We consider some discrete $q$-analogues of the classical continuous orthogonal polynomial ensembles. Building on results due to Morozov, Popolitov and Shakirov, we find representations for the moments of the discrete $q$-Hermite and discrete $q$-Laguerre ensembles in terms of basic hypergeometric series. We find that when the number of particles is suitably randomised, the moments may be represent…

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: 19 pages, 0 figures

    MSC Class: 60B20 (Primary) 33D45 (Secondary)

  19. arXiv:2111.00595  [pdf, other]

    eess.IV cs.AI cs.CV

    TorchXRayVision: A library of chest X-ray datasets and models

    Authors: Joseph Paul Cohen, Joseph D. Viviano, Paul Bertin, Paul Morrison, Parsa Torabian, Matteo Guarrera, Matthew P Lungren, Akshay Chaudhari, Rupert Brooks, Mohammad Hashir, Hadrien Bertrand

    Abstract: TorchXRayVision is an open source software library for working with chest X-ray datasets and deep learning models. It provides a common interface and common pre-processing chain for a wide set of publicly available chest X-ray datasets. In addition, a number of classification and representation learning models with different architectures, trained on different data combinations, are available thro…

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: Library source code: https://github.com/mlmed/torchxrayvision

  20. The Impact of the Coronavirus Pandemic on New York City Real Estate: First Evidence

    Authors: Jeffrey P. Cohen, Felix L. Friedt, Jackson P. Lautier

    Abstract: We investigate whether pandemic-induced contagion disamenities and income effects arising due to COVID-related unemployment adversely affected real estate prices of one- or two-family owner-occupied properties across New York City (NYC). First, OLS hedonic results indicate that greater COVID case numbers are concentrated in neighborhoods with lower-valued properties. Second, we use a repeat-sales…

    Submitted 27 January, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: 38 pages, 5 tables, 3 figures, Revised 1/27/2022

  21. arXiv:2103.02641  [pdf, other]

    astro-ph.GA astro-ph.SR

    Observing the influence of the youngest super star clusters in NGC 1569: Keck Brackett $α$ spectroscopy

    Authors: Daniel P. Cohen, Jean L. Turner, Sara C. Beck, S. Michelle Consiglio

    Abstract: We report Keck-NIRSPEC observations of the Brackett $α$ 4.05 $μ$m recombination line across the two candidate embedded super star clusters (SSCs) in NGC 1569. These SSCs power a bright HII region and have been previously detected as radio and mid-infrared sources. Supplemented with high resolution VLA mapping of the radio continuum along with IRTF-TEXES spectroscopy of the [SIV] 10.5 $μ$m line, th…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: Accepted for publication in MNRAS 2021 Feb 26; 9 pages, 7 figures

  22. arXiv:2102.09582  [pdf, other]

    cs.CV eess.IV

    Benefits of Linear Conditioning with Metadata for Image Segmentation

    Authors: Andreanne Lemay, Charley Gros, Olivier Vincent, Yaou Liu, Joseph Paul Cohen, Julien Cohen-Adad

    Abstract: Medical images are often accompanied by metadata describing the image (vendor, acquisition parameters) and the patient (disease type or severity, demographics, genomics). This metadata is usually disregarded by image segmentation methods. In this work, we adapt a linear conditioning method called FiLM (Feature-wise Linear Modulation) for image segmentation tasks. This FiLM adaptation enables integ…

    Submitted 26 April, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted at MIDL 2021

  23. arXiv:2102.09475  [pdf, other]

    cs.CV cs.AI eess.IV

    Gifsplanation via Latent Shift: A Simple Autoencoder Approach to Counterfactual Generation for Chest X-rays

    Authors: Joseph Paul Cohen, Rupert Brooks, Sovann En, Evan Zucker, Anuj Pareek, Matthew P. Lungren, Akshay Chaudhari

    Abstract: Motivation: Traditional image attribution methods struggle to satisfactorily explain predictions of neural networks. Prediction explanation is important, especially in medical imaging, for avoiding the unintended consequences of deploying AI systems when false positive predictions can impact patient care. Thus, there is a pressing need to develop improved models for model explainability and intros…

    Submitted 24 April, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Full paper at MIDL2021

  24. arXiv:2101.11506  [pdf, ps, other]

    physics.plasm-ph physics.comp-ph

    Multi-Group Discontinuous Asymptotic $P_1$ Approximation in Radiative Marshak Waves Experiments

    Authors: Avner P. Cohen, Shay I. Heizler

    Abstract: We study the propagation of radiative heat (Marshak) waves, using modified $P_1$-approximation equations. In relatively optically-thin media the heat propagation is supersonic, i.e. hydrodynamic motion is negligible, and thus can be described by the radiative transfer Boltzmann equation, coupled with the material energy equation. However, the exact thermal radiative transfer problem is still diffi…

    Submitted 10 February, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: 17 pages, 7 figures

    Journal ref: Journal of Quantitative Spectroscopy and Radiative Transfer, 272, 107822 (2021)

  25. arXiv:2012.09246  [pdf, ps, other]

    stat.ME

    No-harm calibration for generalized Oaxaca-Blinder estimators

    Authors: Peter L. Cohen, Colin B. Fogarty

    Abstract: In randomized experiments, adjusting for observed features when estimating treatment effects has been proposed as a way to improve asymptotic efficiency. However, only linear regression has been proven to form an estimate of the average treatment effect that is asymptotically no less efficient than the treated-minus-control difference in means regardless of the true data generating process. Random…

    Submitted 12 April, 2022; v1 submitted 16 December, 2020; originally announced December 2020.

    MSC Class: 62G99

  26. arXiv:2010.09984  [pdf, other]

    eess.IV cs.CV

    ivadomed: A Medical Imaging Deep Learning Toolbox

    Authors: Charley Gros, Andreanne Lemay, Olivier Vincent, Lucas Rouhier, Anthime Bucquet, Joseph Paul Cohen, Julien Cohen-Adad

    Abstract: ivadomed is an open-source Python package for designing, end-to-end training, and evaluating deep learning models applied to medical imaging data. The package includes APIs, command-line tools, documentation, and tutorials. ivadomed also includes pre-trained models such as spinal tumor segmentation and vertebral labeling. Original features of ivadomed include a data loader that can parse image met…

    Submitted 19 October, 2020; originally announced October 2020.

  27. arXiv:2009.08348  [pdf, other]

    cs.CV

    S2SD: Simultaneous Similarity-based Self-Distillation for Deep Metric Learning

    Authors: Karsten Roth, Timo Milbich, Björn Ommer, Joseph Paul Cohen, Marzyeh Ghassemi

    Abstract: Deep Metric Learning (DML) provides a crucial tool for visual similarity and zero-shot applications by learning generalizing embedding spaces, although recent work in DML has shown strong performance saturation across training objectives. However, generalization capacity is known to scale with the embedding space dimensionality. Unfortunately, high dimensional embeddings also create higher retriev…

    Submitted 4 June, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted to ICML2021

  28. arXiv:2007.13224  [pdf, other]

    eess.IV cs.CV

    Uniformizing Techniques to Process CT scans with 3D CNNs for Tuberculosis Prediction

    Authors: Hasib Zunair, Aimon Rahman, Nabeel Mohammed, Joseph Paul Cohen

    Abstract: A common approach to medical image analysis on volumetric data uses deep 2D convolutional neural networks (CNNs). This is largely attributed to the challenges imposed by the nature of the 3D data: variable volume size, GPU exhaustion during optimization. However, dealing with the individual slices independently in 2D CNNs deliberately discards the depth information which results in poor performanc…

    Submitted 26 July, 2020; originally announced July 2020.

    Comments: Accepted for publication at the MICCAI 2020 International Workshop on PRedictive Intelligence In MEdicine (PRIME)

  29. arXiv:2007.04250  [pdf, other]

    cs.LG cs.CV stat.ML

    A Benchmark of Medical Out of Distribution Detection

    Authors: Tianshi Cao, Chin-Wei Huang, David Yu-Tung Hui, Joseph Paul Cohen

    Abstract: Motivation: Deep learning models deployed for use on medical tasks can be equipped with Out-of-Distribution Detection (OoDD) methods in order to avoid erroneous predictions. However it is unclear which OoDD method should be used in practice. Specific Problem: Systems trained for one particular domain of images cannot be expected to perform accurately on images of a different domain. These images s…

    Submitted 4 August, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Submitted to Machine Learning for Biomedical Imaging Journal (MELBA)

  30. arXiv:2006.11988  [pdf, other]

    q-bio.QM cs.CV cs.LG eess.IV

    COVID-19 Image Data Collection: Prospective Predictions Are the Future

    Authors: Joseph Paul Cohen, Paul Morrison, Lan Dao, Karsten Roth, Tim Q Duong, Marzyeh Ghassemi

    Abstract: Across the world's coronavirus disease 2019 (COVID-19) hot spots, the need to streamline patient diagnosis and management has become more pressing than ever. As one of the main imaging tools, chest X-rays (CXRs) are common, fast, non-invasive, relatively cheap, and potentially bedside to monitor the progression of the disease. This paper describes the first public COVID-19 image data collection as…

    Submitted 14 December, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org. Code for baseline experiments can be found here: https://github.com/mlmed/covid-baselines

  31. arXiv:2005.13009  [pdf, ps, other]

    math.GN

    A Kuratowski closure-complement variant whose solution is independent of ZF

    Authors: Michael P. Cohen, Todd Johnson, Adam Kral, Aaron Li, Justin Soll

    Abstract: We pose the following new variant of the Kuratowski closure-complement problem: How many distinct sets may be obtained by starting with a set $A$ of a Polish space $X$, and applying only closure, complementation, and the $d$ operator, as often as desired, in any order? The set operator $d$ was studied by Kuratowski in his foundational text \textit{Topology: Volume I}; it assigns to $A$ the collect…

    Submitted 26 May, 2020; originally announced May 2020.

    Comments: 9 pages, 3 figures

  32. arXiv:2005.11856  [pdf, other]

    eess.IV cs.LG q-bio.QM stat.AP

    Predicting COVID-19 Pneumonia Severity on Chest X-ray with Deep Learning

    Authors: Joseph Paul Cohen, Lan Dao, Paul Morrison, Karsten Roth, Yoshua Bengio, Beiyi Shen, Almas Abbasi, Mahsa Hoshmand-Kochi, Marzyeh Ghassemi, Haifang Li, Tim Q Duong

    Abstract: Purpose: The need to streamline patient management for COVID-19 has become more pressing than ever. Chest X-rays provide a non-invasive (potentially bedside) tool to monitor the progression of the disease. In this study, we present a severity score prediction model for COVID-19 pneumonia for frontal chest X-ray images. Such a tool can gauge severity of COVID-19 lung infections (and pneumonia in ge…

    Submitted 30 June, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

  33. arXiv:2004.13458  [pdf, other]

    cs.CV

    DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning

    Authors: Timo Milbich, Karsten Roth, Homanga Bharadhwaj, Samarth Sinha, Yoshua Bengio, Björn Ommer, Joseph Paul Cohen

    Abstract: Visual Similarity plays an important role in many computer vision applications. Deep metric learning (DML) is a powerful framework for learning such similarities which not only generalize from training data to identically distributed test distributions, but in particular also translate to unknown test classes. However, its prevailing learning paradigm is class-discriminative supervised training, w…

    Submitted 10 September, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: published at ECCV 2020

  34. arXiv:2003.11597  [pdf, other]

    eess.IV cs.CV cs.LG q-bio.QM

    COVID-19 Image Data Collection

    Authors: Joseph Paul Cohen, Paul Morrison, Lan Dao

    Abstract: This paper describes the initial COVID-19 open image data collection. It was created by assembling medical images from websites and publications and currently contains 123 frontal view X-rays.

    Submitted 25 March, 2020; originally announced March 2020.

    Comments: Dataset available here: https://github.com/ieee8023/covid-chestxray-dataset

  35. arXiv:2003.08783  [pdf, other]

    cs.MA cs.AI cs.SI

    Redistribution Systems and PRAM

    Authors: Paul Cohen, Tomasz Loboda

    Abstract: Redistribution systems iteratively redistribute mass between groups under the control of rules. PRAM is a framework for building redistribution systems. We discuss the relationships between redistribution systems, agent-based systems, compartmental models and Bayesian models. PRAM puts agent-based models on a sound probabilistic footing by reformulating them as redistribution systems. This provide…

    Submitted 19 March, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1902.05677

  36. arXiv:2003.04387  [pdf, other]

    eess.IV cs.CV

    Spine intervertebral disc labeling using a fully convolutional redundant counting model

    Authors: Lucas Rouhier, Francisco Perdigon Romero, Joseph Paul Cohen, Julien Cohen-Adad

    Abstract: Labeling intervertebral discs is relevant as it notably enables clinicians to understand the relationship between a patient's symptoms (pain, paralysis) and the exact level of spinal cord injury. However manually labeling those discs is a tedious and user-biased task which would benefit from automated methods. While some automated methods already exist for MRI and CT-scan, they are either not publ…

    Submitted 11 March, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: MIDL 2020

  37. arXiv:2003.04377  [pdf, other]

    eess.IV cs.CV cs.LG

    Automatic segmentation of spinal multiple sclerosis lesions: How to generalize across MRI contrasts?

    Authors: Olivier Vincent, Charley Gros, Joseph Paul Cohen, Julien Cohen-Adad

    Abstract: Despite recent improvements in medical image segmentation, the ability to generalize across imaging contrasts remains an open issue. To tackle this challenge, we implement Feature-wise Linear Modulation (FiLM) to leverage physics knowledge within the segmentation model and learn the characteristics of each contrast. Interestingly, a well-optimised U-Net reached the same performance as our FiLMed-U…

    Submitted 3 June, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: Presented at OHBM 2020 (v2-3 : corrected typos)

  38. arXiv:2002.08473  [pdf, other]

    cs.CV

    Revisiting Training Strategies and Generalization Performance in Deep Metric Learning

    Authors: Karsten Roth, Timo Milbich, Samarth Sinha, Prateek Gupta, Björn Ommer, Joseph Paul Cohen

    Abstract: Deep Metric Learning (DML) is arguably one of the most influential lines of research for learning visual similarities with many proposed approaches every year. Although the field benefits from the rapid progress, the divergence in training protocols, architectures, and parameter choices make an unbiased comparison difficult. To provide a consistent reference point, we revisit the most widely used…

    Submitted 1 August, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: ICML 2020. Main paper 8.25 pages, 26 pages total

  39. arXiv:2002.06654  [pdf, other]

    math.ST stat.ME

    Gaussian Prepivoting for Finite Population Causal Inference

    Authors: Peter L. Cohen, Colin B. Fogarty

    Abstract: In finite population causal inference exact randomization tests can be constructed for sharp null hypotheses, i.e. hypotheses which fully impute the missing potential outcomes. Oftentimes inference is instead desired for the weak null that the sample average of the treatment effects takes on a particular value while leaving the subject-specific treatment effects unspecified. Without proper care, t…

    Submitted 13 June, 2021; v1 submitted 16 February, 2020; originally announced February 2020.

  40. arXiv:2002.02582  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Quantifying the Value of Lateral Views in Deep Learning for Chest X-rays

    Authors: Mohammad Hashir, Hadrien Bertrand, Joseph Paul Cohen

    Abstract: Most deep learning models in chest X-ray prediction utilize the posteroanterior (PA) view due to the lack of other views available. PadChest is a large-scale chest X-ray dataset that has almost 200 labels and multiple views available. In this work, we use PadChest to explore multiple approaches to merging the PA and lateral views for predicting the radiological labels associated with the X-ray ima…

    Submitted 6 February, 2020; originally announced February 2020.

    Comments: Under review at MIDL 2020

  41. arXiv:2002.02497  [pdf, other]

    eess.IV cs.LG q-bio.QM stat.ML

    On the limits of cross-domain generalization in automated X-ray prediction

    Authors: Joseph Paul Cohen, Mohammad Hashir, Rupert Brooks, Hadrien Bertrand

    Abstract: This large scale study focuses on quantifying what X-rays diagnostic prediction tasks generalize well across multiple different datasets. We present evidence that the issue of generalization is not due to a shift in the images but instead a shift in the labels. We study the cross-domain performance, agreement between models, and model representations. We find interesting discrepancies between perf…

    Submitted 24 May, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Full paper at MIDL2020

  42. arXiv:2001.11002  [pdf, other]

    astro-ph.GA astro-ph.SR

    Unveiling Kinematic Structure in the Starburst Heart of NGC 253

    Authors: Daniel P. Cohen, Jean L. Turner, S. Michelle Consiglio

    Abstract: We investigate the kinematics of ionized gas within the nuclear starburst of NGC 253 with observations of the Brackett $α$ recombination line at 4.05 $μ$m. The goal is to distinguish motions driven by star-formation feedback from gravitational motions induced by the central mass structure. Using NIRSPEC on Keck II, we obtained 30 spectra through a $0''.5$ slit stepped across the central $\sim$5…

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: Accepted for publication in MNRAS on Jan 24, 2020 ; 12 pages, 7 figures

  43. Key to understanding supersonic radiative Marshak waves using simple models and advanced simulations

    Authors: Avner P. Cohen, Guy Malamud, Shay I. Heizler

    Abstract: This article studies the propagation of supersonic radiative Marshak waves. These waves are radiation dominated, and play an important role in inertial confinement fusion and in astrophysical and laboratory systems. For that reason, this phenomenon has attracted considerable experimental attention in recent decades in several different facilities. The present study integrates the various experimen…

    Submitted 22 March, 2020; v1 submitted 16 November, 2019; originally announced November 2019.

    Comments: 33 pages, 17 figures

    Journal ref: Phys. Rev. Research 2, 023007 (2020)

  44. arXiv:1910.13249  [pdf, other]

    cs.CV cs.HC cs.LG

    Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments

    Authors: Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira E. Kahou, Joseph P. Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo, Chris Pal

    Abstract: Millions of blind and visually-impaired (BVI) people navigate urban environments every day, using smartphones for high-level path-planning and white canes or guide dogs for local information. However, many BVI people still struggle to travel to new places. In our endeavor to create a navigation assistant for the BVI, we found that existing Reinforcement Learning (RL) environments were unsuitable f…

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted at CoRL2019. Code & video available at https://mweiss17.github.io/SEVN/

  45. arXiv:1910.09600  [pdf, other]

    q-bio.GN cs.LG q-bio.QM

    Is graph-based feature selection of genes better than random?

    Authors: Mohammad Hashir, Paul Bertin, Martin Weiss, Vincent Frappier, Theodore J. Perkins, Geneviève Boucher, Joseph Paul Cohen

    Abstract: Gene interaction graphs aim to capture various relationships between genes and represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more consistent with biological common knowledge. In this work, we focus on assessing whether those graphs capture dep…

    Submitted 27 December, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Accepted to the Machine Learning in Computational Biology (MLCB) meeting 2019. 7 pages. 4 figures. arXiv admin note: substantial text overlap with arXiv:1905.02295

  46. arXiv:1910.09570  [pdf, other]

    q-bio.QM cs.CV eess.SP stat.AP stat.ML

    Icentia11K: An Unsupervised Representation Learning Dataset for Arrhythmia Subtype Discovery

    Authors: Shawn Tan, Guillaume Androz, Ahmad Chamseddine, Pierre Fecteau, Aaron Courville, Yoshua Bengio, Joseph Paul Cohen

    Abstract: We release the largest public ECG dataset of continuous raw signals for representation learning containing 11 thousand patients and 2 billion labelled beats. Our goal is to enable semi-supervised ECG models to be made as well as to discover unknown subtypes of arrhythmia and anomalous ECG signal events. To this end, we propose an unsupervised representation learning task, evaluated in a semi-super…

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: Under Review

  47. arXiv:1910.08636  [pdf, other]

    cs.LG q-bio.QM stat.ML

    The TCGA Meta-Dataset Clinical Benchmark

    Authors: Mandana Samiei, Tobias Würfl, Tristan Deleu, Martin Weiss, Francis Dutil, Thomas Fevens, Geneviève Boucher, Sebastien Lemieux, Joseph Paul Cohen

    Abstract: Machine learning is bringing a paradigm shift to healthcare by changing the process of disease diagnosis and prognosis in clinics and hospitals. This development equips doctors and medical staff with tools to evaluate their hypotheses and hence make more precise decisions. Although most current research in the literature seeks to develop techniques and methods for predicting one particular clinica…

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: 5 Pages, Submitted to MLCB 2019

  48. arXiv:1910.07655  [pdf, other]

    cs.CV cs.LG eess.IV

    Deep Semantic Segmentation of Natural and Medical Images: A Review

    Authors: Saeid Asgari Taghanaki, Kumar Abhishek, Joseph Paul Cohen, Julien Cohen-Adad, Ghassan Hamarneh

    Abstract: The semantic image segmentation task consists of classifying each pixel of an image into an instance, where each instance corresponds to a class. This task is a part of the concept of scene understanding or better explaining the global context of an image. In the medical image analysis domain, image segmentation can be used for image-guided interventions, radiotherapy, or improved radiological dia…

    Submitted 30 March, 2024; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: 45 pages, 16 figures. Accepted for publication in Springer Artificial Intelligence Review

  49. arXiv:1910.00199  [pdf, other]

    cs.CV cs.LG eess.IV

    Saliency is a Possible Red Herring When Diagnosing Poor Generalization

    Authors: Joseph D. Viviano, Becks Simpson, Francis Dutil, Yoshua Bengio, Joseph Paul Cohen

    Abstract: Poor generalization is one symptom of models that learn to predict target variables using spuriously-correlated image features present only in the training distribution instead of the true image features that denote a class. It is often thought that this can be diagnosed visually using attribution (aka saliency) maps. We study if this assumption is correct. In some prediction tasks, such as for me…

    Submitted 10 February, 2021; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: 25 pages, 27 figures, 5 tables, code in paper (https://github.com/josephdviviano/saliency-red-herring). Published at International Conference on Learning Representations (ICLR) 2021. Previously titled "Underwhelming Generalization Improvements from Controlling Feature Attribution"

  50. arXiv:1909.11140  [pdf, other]

    astro-ph.SR astro-ph.CO astro-ph.HE

    Lick Observatory Supernova Search Follow-Up Program: Photometry Data Release of 93 Type Ia Supernovae

    Authors: Benjamin E. Stahl, WeiKang Zheng, Thomas de Jaeger, Alexei V. Filippenko, Andrew Bigley, Kyle Blanchard, Peter K. Blanchard, Thomas G. Brink, Samantha K. Cargill, Chadwick Casper, Sanyum Channa, Byung Yun Choi, Nick Choksi, Jason Chu, Kelsey I. Clubb, Daniel P. Cohen, Michael Ellison, Edward Falcon, Pegah Fazeli, Kiera Fuller, Mohan Ganeshalingam, Elinor L. Gates, Carolina Gould, Goni Halevi, Kevin T. Hayakawa, et al. (30 additional authors not shown)

    Abstract: We present BVRI and unfiltered light curves of 93 Type Ia supernovae (SNe Ia) from the Lick Observatory Supernova Search (LOSS) follow-up program conducted between 2005 and 2018. Our sample consists of 78 spectroscopically normal SNe Ia, with the remainder divided between distinct subclasses (three SN 1991bg-like, three SN 1991T-like, four SNe Iax, two peculiar, and three super-Chandrasekhar event…

    Submitted 24 September, 2019; originally announced September 2019.

    Comments: 29 pages, 13 figures, accepted for publication in MNRAS