Skip to main content

Showing 1–46 of 46 results for author: Cohen, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.19367  [pdf, other

    cs.LG stat.ML

    dCMF: Learning interpretable evolving patterns from temporal multiway data

    Authors: Christos Chatzis, Carla Schenker, Jérémy E. Cohen, Evrim Acar

    Abstract: Multiway datasets are commonly analyzed using unsupervised matrix and tensor factorization methods to reveal underlying patterns. Frequently, such datasets include timestamps and could correspond to, for example, health-related measurements of subjects collected over time. The temporal dimension is inherently different from the other dimensions, requiring methods that account for its intrinsic pro… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

  2. arXiv:2501.10202  [pdf, ps, other

    stat.ML cs.LG

    Provably Safeguarding a Classifier from OOD and Adversarial Samples: an Extreme Value Theory Approach

    Authors: Nicolas Atienza, Christophe Labreuche, Johanne Cohen, Michele Sebag

    Abstract: This paper introduces a novel method, Sample-efficient Probabilistic Detection using Extreme Value Theory (SPADE), which transforms a classifier into an abstaining classifier, offering provable protection against out-of-distribution and adversarial samples. The approach is based on a Generalized Extreme Value (GEV) model of the training distribution in the classifier's latent space, enabling the f… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: under review

  3. arXiv:2410.24206  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Understanding Optimization in Deep Learning with Central Flows

    Authors: Jeremy M. Cohen, Alex Damian, Ameet Talwalkar, Zico Kolter, Jason D. Lee

    Abstract: Optimization in deep learning remains poorly understood, even in the simple setting of deterministic (i.e. full-batch) training. A key difficulty is that much of an optimizer's behavior is implicitly determined by complex oscillatory dynamics, referred to as the "edge of stability." The main contribution of this paper is to show that an optimizer's implicit behavior can be explicitly captured by a… ▽ More

    Submitted 31 October, 2024; originally announced October 2024.

    Comments: first two authors contributed equally; author order determined by coin flip

  4. arXiv:2408.16023  [pdf, other

    stat.ME math.ST

    Inferring the parameters of Taylor's law in ecology

    Authors: Lionel Truquet, Joel E. Cohen, Paul Doukhan

    Abstract: Taylor's power law (TL) or fluctuation scaling has been verified empirically for the abundances of many species, human and non-human, and in many other fields including physics, meteorology, computer science, and finance. TL asserts that the variance is directly proportional to a power of the mean, exactly for population moments and, whether or not population moments exist, approximately for sampl… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

  5. arXiv:2408.15805  [pdf, other

    stat.AP stat.CO

    Investigating Complex HPV Dynamics Using Emulation and History Matching

    Authors: Andrew Iskauskas, Jamie A. Cohen, Danny Scarponi, Ian Vernon, Michael Goldstein, Daniel Klein, Richard G. White, Nicky McCreesh

    Abstract: The study of transmission and progression of human papillomavirus (HPV) is crucial for understanding the incidence of cervical cancers, and has been identified as a priority worldwide. The complexity of the disease necessitates a detailed model of HPV transmission and its progression to cancer; to infer properties of the above we require a careful process that can match to imperfect or incomplete… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 21 pages, 15 figures; submitted to Epidemics

  6. arXiv:2304.00195  [pdf, other

    stat.ML cs.LG

    Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers

    Authors: Awni Altabaa, Taylor Webb, Jonathan Cohen, John Lafferty

    Abstract: An extension of Transformers is proposed that enables explicit relational reasoning through a novel module called the Abstractor. At the core of the Abstractor is a variant of attention called relational cross-attention. The approach is motivated by an architectural inductive bias for relational learning that disentangles relational information from object-level features. This enables explicit rel… ▽ More

    Submitted 12 April, 2024; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: Published at ICLR 2024

  7. arXiv:2209.10666  [pdf, other

    cs.LG physics.ao-ph stat.ML

    Adaptive Bias Correction for Improved Subseasonal Forecasting

    Authors: Soukayna Mouatadid, Paulo Orenstein, Genevieve Flaspohler, Judah Cohen, Miruna Oprescu, Ernest Fraenkel, Lester Mackey

    Abstract: Subseasonal forecasting -- predicting temperature and precipitation 2 to 6 weeks ahead -- is critical for effective water allocation, wildfire management, and drought and flood mitigation. Recent international research efforts have advanced the subseasonal capabilities of operational dynamical models, yet temperature and precipitation prediction skills remain poor, partly due to stubborn errors in… ▽ More

    Submitted 15 May, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

  8. arXiv:2206.10654  [pdf, other

    cs.LG stat.ML

    On the Maximum Hessian Eigenvalue and Generalization

    Authors: Simran Kaur, Jeremy Cohen, Zachary C. Lipton

    Abstract: The mechanisms by which certain training interventions, such as increasing learning rates and applying batch normalization, improve the generalization of deep networks remains a mystery. Prior works have speculated that "flatter" solutions generalize better than "sharper" solutions to unseen data, motivating several metrics for measuring flatness (particularly $λ_{max}$, the largest eigenvalue of… ▽ More

    Submitted 23 May, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Proceedings on "I Can't Believe It's Not Better! - Understanding Deep Learning Through Empirical Falsification" at NeurIPS 2022 Workshops, PMLR 187:51-65, 2023

  9. arXiv:2202.01650  [pdf, other

    stat.ME stat.AP

    Exposure Effects on Count Outcomes with Observational Data, with Application to Incarcerated Women

    Authors: Bonnie E. Shook-Sa, Michael G. Hudgens, Andrea K. Knittel, Andrew Edmonds, Catalina Ramirez, Stephen R. Cole, Mardge Cohen, Adebola Adedimeji, Tonya Taylor, Katherine G. Michel, Andrea Kovacs, Jennifer Cohen, Jessica Donohue, Antonina Foster, Margaret A. Fischl, Dustin Long, Adaora A. Adimora

    Abstract: Causal inference methods can be applied to estimate the effect of a point exposure or treatment on an outcome of interest using data from observational studies. For example, in the Women's Interagency HIV Study, it is of interest to understand the effects of incarceration on the number of sexual partners and the number of cigarettes smoked after incarceration. In settings like this where the outco… ▽ More

    Submitted 6 November, 2023; v1 submitted 3 February, 2022; originally announced February 2022.

  10. arXiv:2111.12399  [pdf, other

    cs.LG stat.ML

    Dictionary-based Low-Rank Approximations and the Mixed Sparse Coding problem

    Authors: Jeremy E. Cohen

    Abstract: Constrained tensor and matrix factorization models allow to extract interpretable patterns from multiway data. Therefore identifiability properties and efficient algorithms for constrained low-rank approximations are nowadays important research topics. This work deals with columns of factor matrices of a low-rank approximation being sparse in a known and possibly overcomplete basis, a model coined… ▽ More

    Submitted 21 January, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  11. arXiv:2110.01278  [pdf, other

    cs.LG math.OC stat.ML

    An AO-ADMM approach to constraining PARAFAC2 on all modes

    Authors: Marie Roald, Carla Schenker, Vince D. Calhoun, Tülay Adalı, Rasmus Bro, Jeremy E. Cohen, Evrim Acar

    Abstract: Analyzing multi-way measurements with variations across one mode of the dataset is a challenge in various fields including data mining, neuroscience and chemometrics. For example, measurements may evolve over time or have unaligned time profiles. The PARAFAC2 model has been successfully used to analyze such data by allowing the underlying factor matrices in one mode (i.e., the evolving mode) to ch… ▽ More

    Submitted 8 July, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    MSC Class: 15A69; 90C26

    Journal ref: SIAM J. Math. Data Sci. 4 (2022) 1191-1222

  12. arXiv:2109.10399  [pdf, other

    physics.ao-ph cs.LG stat.ML

    SubseasonalClimateUSA: A Dataset for Subseasonal Forecasting and Benchmarking

    Authors: Soukayna Mouatadid, Paulo Orenstein, Genevieve Flaspohler, Miruna Oprescu, Judah Cohen, Franklyn Wang, Sean Knight, Maria Geogdzhayeva, Sam Levang, Ernest Fraenkel, Lester Mackey

    Abstract: Subseasonal forecasting of the weather two to six weeks in advance is critical for resource allocation and advance disaster notice but poses many challenges for the forecasting community. At this forecast horizon, physics-based dynamical models have limited skill, and the targets for prediction depend in a complex manner on both local weather variables and global climate variables. Recently, machi… ▽ More

    Submitted 16 January, 2024; v1 submitted 21 September, 2021; originally announced September 2021.

  13. arXiv:2106.06885  [pdf, other

    cs.LG stat.ML

    Online Learning with Optimism and Delay

    Authors: Genevieve Flaspohler, Francesco Orabona, Judah Cohen, Soukayna Mouatadid, Miruna Oprescu, Paulo Orenstein, Lester Mackey

    Abstract: Inspired by the demands of real-time climate and weather forecasting, we develop optimistic online learning algorithms that require no parameter tuning and have optimal regret guarantees under delayed feedback. Our algorithms -- DORM, DORM+, and AdaHedgeD -- arise from a novel reduction of delayed online learning to optimistic online learning that reveals how optimistic hints can mitigate the regr… ▽ More

    Submitted 12 July, 2021; v1 submitted 12 June, 2021; originally announced June 2021.

    Comments: ICML 2021. 9 pages of main paper and 26 pages of appendix text

  14. arXiv:2105.00773  [pdf, other

    stat.AP cs.LG stat.ML

    Approximate Bayesian Computation for an Explicit-Duration Hidden Markov Model of COVID-19 Hospital Trajectories

    Authors: Gian Marco Visani, Alexandra Hope Lee, Cuong Nguyen, David M. Kent, John B. Wong, Joshua T. Cohen, Michael C. Hughes

    Abstract: We address the problem of modeling constrained hospital resources in the midst of the COVID-19 pandemic in order to inform decision-makers of future demand and assess the societal value of possible interventions. For broad applicability, we focus on the common yet challenging scenario where patient-level data for a region of interest are not available. Instead, given daily admissions counts, we mo… ▽ More

    Submitted 28 July, 2021; v1 submitted 28 April, 2021; originally announced May 2021.

    Comments: To appear in the Proceedings of the Machine Learning for Healthcare (MLHC) conference, 2021. 20 pages, 7 figures and 1 table. 26 additional pages of supplementary material

  15. arXiv:2104.09327  [pdf, other

    stat.ML cs.LG

    Forecasting COVID-19 Counts At A Single Hospital: A Hierarchical Bayesian Approach

    Authors: Alexandra Hope Lee, Panagiotis Lymperopoulos, Joshua T. Cohen, John B. Wong, Michael C. Hughes

    Abstract: We consider the problem of forecasting the daily number of hospitalized COVID-19 patients at a single hospital site, in order to help administrators with logistics and planning. We develop several candidate hierarchical Bayesian models which directly capture the count nature of data via a generalized Poisson likelihood, model time-series dependencies via autoregressive and Gaussian process latent… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: In ICLR 2021 Workshop on Machine Learning for Preventing and Combating Pandemics

  16. arXiv:2103.00065  [pdf, other

    cs.LG stat.ML

    Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability

    Authors: Jeremy M. Cohen, Simran Kaur, Yuanzhi Li, J. Zico Kolter, Ameet Talwalkar

    Abstract: We empirically demonstrate that full-batch gradient descent on neural network training objectives typically operates in a regime we call the Edge of Stability. In this regime, the maximum eigenvalue of the training loss Hessian hovers just above the numerical value $2 / \text{(step size)}$, and the training loss behaves non-monotonically over short timescales, yet consistently decreases over long… ▽ More

    Submitted 23 November, 2022; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: ICLR 2021. v3 moves several figures from the appendix into the main text, and adds more discussion regarding Jastrzębski et al (2020): https://doi.org/10.48550/arXiv.2002.09572

  17. PARAFAC2 AO-ADMM: Constraints in all modes

    Authors: Marie Roald, Carla Schenker, Jeremy E. Cohen, Evrim Acar

    Abstract: The PARAFAC2 model provides a flexible alternative to the popular CANDECOMP/PARAFAC (CP) model for tensor decompositions. Unlike CP, PARAFAC2 allows factor matrices in one mode (i.e., evolving mode) to change across tensor slices, which has proven useful for applications in different domains such as chemometrics, and neuroscience. However, the evolving mode of the PARAFAC2 model is traditionally m… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 5 pages, 4 figures, submitted to EUSIPCO21

  18. Matrix-wise $\ell_0$-constrained Sparse Nonnegative Least Squares

    Authors: Nicolas Nadisic, Jeremy E Cohen, Arnaud Vandaele, Nicolas Gillis

    Abstract: Nonnegative least squares problems with multiple right-hand sides (MNNLS) arise in models that rely on additive linear combinations. In particular, they are at the core of most nonnegative matrix factorization algorithms and have many applications. The nonnegativity constraint is known to naturally favor sparsity, that is, solutions with few non-zero entries. However, it is often useful to further… ▽ More

    Submitted 22 June, 2022; v1 submitted 22 November, 2020; originally announced November 2020.

    Comments: 25 pages + 18 pages supplementary material. This is the new version of a work originally called "A Homotopy-based Algorithm for Sparse Multiple Right-hand Sides Nonnegative Least Squares". Although the central concept is the same, the paper has been almost completely rewritten

    Journal ref: Machine Learning 111, pp. 4453-4495, 2022

  19. arXiv:2007.10527  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Navigating the Trade-Off between Multi-Task Learning and Learning to Multitask in Deep Neural Networks

    Authors: Sachin Ravi, Sebastian Musslick, Maia Hamin, Theodore L. Willke, Jonathan D. Cohen

    Abstract: The terms multi-task learning and multitasking are easily confused. Multi-task learning refers to a paradigm in machine learning in which a network is trained on various related tasks to facilitate the acquisition of tasks. In contrast, multitasking is used to indicate, especially in the cognitive science literature, the ability to execute multiple tasks simultaneously. While multi-task learning e… ▽ More

    Submitted 5 January, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

  20. arXiv:2007.09605  [pdf, other

    cs.LG math.OC stat.ML

    A Flexible Optimization Framework for Regularized Matrix-Tensor Factorizations with Linear Couplings

    Authors: Carla Schenker, Jeremy E. Cohen, Evrim Acar

    Abstract: Coupled matrix and tensor factorizations (CMTF) are frequently used to jointly analyze data from multiple sources, also called data fusion. However, different characteristics of datasets stemming from multiple sources pose many challenges in data fusion and require to employ various regularizations, constraints, loss functions and different types of coupling structures between datasets. In this pa… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

  21. arXiv:2007.04250  [pdf, other

    cs.LG cs.CV stat.ML

    A Benchmark of Medical Out of Distribution Detection

    Authors: Tianshi Cao, Chin-Wei Huang, David Yu-Tung Hui, Joseph Paul Cohen

    Abstract: Motivation: Deep learning models deployed for use on medical tasks can be equipped with Out-of-Distribution Detection (OoDD) methods in order to avoid erroneous predictions. However it is unclear which OoDD method should be used in practice. Specific Problem: Systems trained for one particular domain of images cannot be expected to perform accurately on images of a different domain. These images s… ▽ More

    Submitted 4 August, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Submitted to Machine Learning for Biomedical Imaging Journal (MELBA)

  22. arXiv:2006.07553  [pdf, other

    cs.LG cs.CV eess.SP math.OC stat.ML

    Sparse Separable Nonnegative Matrix Factorization

    Authors: Nicolas Nadisic, Arnaud Vandaele, Jeremy E. Cohen, Nicolas Gillis

    Abstract: We propose a new variant of nonnegative matrix factorization (NMF), combining separability and sparsity assumptions. Separability requires that the columns of the first NMF factor are equal to columns of the input matrix, while sparsity requires that the columns of the second NMF factor are sparse. We call this variant sparse separable NMF (SSNMF), which we prove to be NP-complete, as opposed to s… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: 20 pages, accepted in ECML 2020

  23. arXiv:2005.11856  [pdf, other

    eess.IV cs.LG q-bio.QM stat.AP

    Predicting COVID-19 Pneumonia Severity on Chest X-ray with Deep Learning

    Authors: Joseph Paul Cohen, Lan Dao, Paul Morrison, Karsten Roth, Yoshua Bengio, Beiyi Shen, Almas Abbasi, Mahsa Hoshmand-Kochi, Marzyeh Ghassemi, Haifang Li, Tim Q Duong

    Abstract: Purpose: The need to streamline patient management for COVID-19 has become more pressing than ever. Chest X-rays provide a non-invasive (potentially bedside) tool to monitor the progression of the disease. In this study, we present a severity score prediction model for COVID-19 pneumonia for frontal chest X-ray images. Such a tool can gauge severity of COVID-19 lung infections (and pneumonia in ge… ▽ More

    Submitted 30 June, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

  24. arXiv:2002.02582  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Quantifying the Value of Lateral Views in Deep Learning for Chest X-rays

    Authors: Mohammad Hashir, Hadrien Bertrand, Joseph Paul Cohen

    Abstract: Most deep learning models in chest X-ray prediction utilize the posteroanterior (PA) view due to the lack of other views available. PadChest is a large-scale chest X-ray dataset that has almost 200 labels and multiple views available. In this work, we use PadChest to explore multiple approaches to merging the PA and lateral views for predicting the radiological labels associated with the X-ray ima… ▽ More

    Submitted 6 February, 2020; originally announced February 2020.

    Comments: Under review at MIDL 2020

  25. arXiv:2002.02497  [pdf, other

    eess.IV cs.LG q-bio.QM stat.ML

    On the limits of cross-domain generalization in automated X-ray prediction

    Authors: Joseph Paul Cohen, Mohammad Hashir, Rupert Brooks, Hadrien Bertrand

    Abstract: This large scale study focuses on quantifying what X-rays diagnostic prediction tasks generalize well across multiple different datasets. We present evidence that the issue of generalization is not due to a shift in the images but instead a shift in the labels. We study the cross-domain performance, agreement between models, and model representations. We find interesting discrepancies between perf… ▽ More

    Submitted 24 May, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Full paper at MIDL2020

  26. arXiv:2001.04321  [pdf, other

    math.NA cs.LG math.OC stat.ML

    Accelerating Block Coordinate Descent for Nonnegative Tensor Factorization

    Authors: Andersen Man Shun Ang, Jeremy E. Cohen, Nicolas Gillis, Le Thi Khanh Hien

    Abstract: This paper is concerned with improving the empirical convergence speed of block-coordinate descent algorithms for approximate nonnegative tensor factorization (NTF). We propose an extrapolation strategy in-between block updates, referred to as heuristic extrapolation with restarts (HER). HER significantly accelerates the empirical convergence speed of most existing block-coordinate algorithms for… ▽ More

    Submitted 20 November, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: 32 pages, 24 figures

    Journal ref: Numerical Linear Algebra with Applications, e2373, 2021

  27. arXiv:1910.09570  [pdf, other

    q-bio.QM cs.CV eess.SP stat.AP stat.ML

    Icentia11K: An Unsupervised Representation Learning Dataset for Arrhythmia Subtype Discovery

    Authors: Shawn Tan, Guillaume Androz, Ahmad Chamseddine, Pierre Fecteau, Aaron Courville, Yoshua Bengio, Joseph Paul Cohen

    Abstract: We release the largest public ECG dataset of continuous raw signals for representation learning containing 11 thousand patients and 2 billion labelled beats. Our goal is to enable semi-supervised ECG models to be made as well as to discover unknown subtypes of arrhythmia and anomalous ECG signal events. To this end, we propose an unsupervised representation learning task, evaluated in a semi-super… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: Under Review

  28. arXiv:1910.08640  [pdf, other

    cs.LG cs.CV stat.ML

    Are Perceptually-Aligned Gradients a General Property of Robust Classifiers?

    Authors: Simran Kaur, Jeremy Cohen, Zachary C. Lipton

    Abstract: For a standard convolutional neural network, optimizing over the input pixels to maximize the score of some target class will generally produce a grainy-looking version of the original image. However, Santurkar et al. (2019) demonstrated that for adversarially-trained neural networks, this optimization produces images that uncannily resemble the target class. In this paper, we show that these "per… ▽ More

    Submitted 23 October, 2019; v1 submitted 18 October, 2019; originally announced October 2019.

    Comments: To appear in the "Science Meets Engineering of Deep Learning" Workshop at NeurIPS 2019

  29. arXiv:1910.08636  [pdf, other

    cs.LG q-bio.QM stat.ML

    The TCGA Meta-Dataset Clinical Benchmark

    Authors: Mandana Samiei, Tobias Würfl, Tristan Deleu, Martin Weiss, Francis Dutil, Thomas Fevens, Geneviève Boucher, Sebastien Lemieux, Joseph Paul Cohen

    Abstract: Machine learning is bringing a paradigm shift to healthcare by changing the process of disease diagnosis and prognosis in clinics and hospitals. This development equips doctors and medical staff with tools to evaluate their hypotheses and hence make more precise decisions. Although most current research in the literature seeks to develop techniques and methods for predicting one particular clinica… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: 5 Pages, Submitted to MLCB 2019

  30. arXiv:1909.06576  [pdf, ps, other

    cs.LG stat.ML

    Torchmeta: A Meta-Learning library for PyTorch

    Authors: Tristan Deleu, Tobias Würfl, Mandana Samiei, Joseph Paul Cohen, Yoshua Bengio

    Abstract: The constant introduction of standardized benchmarks in the literature has helped accelerating the recent advances in meta-learning research. They offer a way to get a fair comparison between different algorithms, and the wide range of datasets available allows full control over the complexity of this evaluation. However, for a large majority of code available online, the data pipeline is often sp… ▽ More

    Submitted 14 September, 2019; originally announced September 2019.

  31. arXiv:1905.11286  [pdf, other

    cs.LG stat.ML

    Stochastic Gradient Methods with Layer-wise Adaptive Moments for Training of Deep Networks

    Authors: Boris Ginsburg, Patrice Castonguay, Oleksii Hrinchuk, Oleksii Kuchaiev, Vitaly Lavrukhin, Ryan Leary, Jason Li, Huyen Nguyen, Yang Zhang, Jonathan M. Cohen

    Abstract: We propose NovoGrad, an adaptive stochastic gradient descent method with layer-wise gradient normalization and decoupled weight decay. In our experiments on neural networks for image classification, speech recognition, machine translation, and language modeling, it performs on par or better than well tuned SGD with momentum and Adam or AdamW. Additionally, NovoGrad (1) is robust to the choice of l… ▽ More

    Submitted 6 February, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: Preprint, under review

  32. arXiv:1904.04861  [pdf, other

    cs.LG stat.ML

    Universal Lipschitz Approximation in Bounded Depth Neural Networks

    Authors: Jeremy E. J. Cohen, Todd Huster, Ra Cohen

    Abstract: Adversarial attacks against machine learning models are a rather hefty obstacle to our increasing reliance on these models. Due to this, provably robust (certified) machine learning models are a major topic of interest. Lipschitz continuous models present a promising approach to solving this problem. By leveraging the expressive power of a variant of neural networks which maintain low Lipschitz co… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

  33. arXiv:1902.02918  [pdf, other

    cs.LG stat.ML

    Certified Adversarial Robustness via Randomized Smoothing

    Authors: Jeremy M Cohen, Elan Rosenfeld, J. Zico Kolter

    Abstract: We show how to turn any classifier that classifies well under Gaussian noise into a new classifier that is certifiably robust to adversarial perturbations under the $\ell_2$ norm. This "randomized smoothing" technique has been proposed recently in the literature, but existing guarantees are loose. We prove a tight robustness guarantee in $\ell_2$ norm for smoothing with Gaussian noise. We use rand… ▽ More

    Submitted 15 June, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

    Comments: ICML 2019

  34. arXiv:1810.03442  [pdf, other

    q-bio.GN cs.LG stat.ML

    Towards the Latent Transcriptome

    Authors: Assya Trofimov, Francis Dutil, Claude Perreault, Sebastien Lemieux, Yoshua Bengio, Joseph Paul Cohen

    Abstract: In this work we propose a method to compute continuous embeddings for kmers from raw RNA-seq data, without the need for alignment to a reference genome. The approach uses an RNN to transform kmers of the RNA-seq reads into a 2 dimensional representation that is used to predict abundance of each kmer. We report that our model captures information of both DNA sequence similarity as well as DNA seque… ▽ More

    Submitted 10 December, 2018; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: 7 figures

  35. arXiv:1810.00045  [pdf, other

    cs.LG q-bio.NC stat.ML

    Adversarial Domain Adaptation for Stable Brain-Machine Interfaces

    Authors: Ali Farshchian, Juan A. Gallego, Joseph P. Cohen, Yoshua Bengio, Lee E. Miller, Sara A. Solla

    Abstract: Brain-Machine Interfaces (BMIs) have recently emerged as a clinically viable option to restore voluntary movements after paralysis. These devices are based on the ability to extract information about movement intent from neural signals recorded using multi-electrode arrays chronically implanted in the motor cortices of the brain. However, the inherent loss and turnover of recorded neurons requires… ▽ More

    Submitted 15 January, 2019; v1 submitted 28 September, 2018; originally announced October 2018.

    Comments: 14 pages, 6 figures

  36. arXiv:1809.07394  [pdf, other

    stat.AP cs.CY stat.ML

    Improving Subseasonal Forecasting in the Western U.S. with Machine Learning

    Authors: Jessica Hwang, Paulo Orenstein, Judah Cohen, Karl Pfeiffer, Lester Mackey

    Abstract: Water managers in the western United States (U.S.) rely on longterm forecasts of temperature and precipitation to prepare for droughts and other wet weather extremes. To improve the accuracy of these longterm forecasts, the U.S. Bureau of Reclamation and the National Oceanic and Atmospheric Administration (NOAA) launched the Subseasonal Climate Forecast Rodeo, a year-long real-time forecasting cha… ▽ More

    Submitted 22 May, 2019; v1 submitted 19 September, 2018; originally announced September 2018.

  37. arXiv:1808.08765  [pdf, ps, other

    stat.ML cs.LG

    Identifiability of Complete Dictionary Learning

    Authors: Jérémy E. Cohen, Nicolas Gillis

    Abstract: Sparse component analysis (SCA), also known as complete dictionary learning, is the following problem: Given an input matrix $M$ and an integer $r$, find a dictionary $D$ with $r$ columns and a matrix $B$ with $k$-sparse columns (that is, each column of $B$ has at most $k$ non-zero entries) such that $M \approx DB$. A key issue in SCA is identifiability, that is, characterizing the conditions unde… ▽ More

    Submitted 28 March, 2019; v1 submitted 27 August, 2018; originally announced August 2018.

    Comments: 19 pages, 2 figures, new title, added references and discussions

    Journal ref: SIAM Journal on Mathematics of Data Science 1 (3), pp. 518-536, 2019

  38. arXiv:1806.06975  [pdf, other

    q-bio.GN cs.CE cs.LG stat.ML

    Towards Gene Expression Convolutions using Gene Interaction Graphs

    Authors: Francis Dutil, Joseph Paul Cohen, Martin Weiss, Georgy Derevyanko, Yoshua Bengio

    Abstract: We study the challenges of applying deep learning to gene expression data. We find experimentally that there exists non-linear signal in the data, however is it not discovered automatically given the noise and low numbers of samples used in most research. We discuss how gene interaction graphs (same pathway, protein-protein, co-expression, or research paper text association) can be used to impose… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: 4 pages +1 page references, To appear in the International Conference on Machine Learning Workshop on Computational Biology, 2018

  39. arXiv:1806.01984  [pdf, other

    cs.LG cs.AI stat.ML

    Learning to rank for censored survival data

    Authors: Margaux Luck, Tristan Sylvain, Joseph Paul Cohen, Heloise Cardinal, Andrea Lodi, Yoshua Bengio

    Abstract: Survival analysis is a type of semi-supervised ranking task where the target output (the survival time) is often right-censored. Utilizing this information is a challenge because it is not obvious how to correctly incorporate these censored examples into a model. We study how three categories of loss functions, namely partial likelihood methods, rank methods, and our classification method based on… ▽ More

    Submitted 8 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  40. arXiv:1802.05035  [pdf, other

    stat.ML

    Nonnegative PARAFAC2: a flexible coupling approach

    Authors: Jeremy E. Cohen, Rasmus Bro

    Abstract: Modeling variability in tensor decomposition methods is one of the challenges of source separation. One possible solution to account for variations from one data set to another, jointly analysed, is to resort to the PARAFAC2 model. However, so far imposing constraints on the mode with variability has not been possible. In the following manuscript, a relaxation of the PARAFAC2 model is introduced,… ▽ More

    Submitted 14 February, 2018; originally announced February 2018.

  41. arXiv:1802.03203  [pdf, other

    stat.ML cs.LG

    Curve Registered Coupled Low Rank Factorization

    Authors: Jeremy Emile Cohen, Rodrigo Cabral Farias, Bertrand Rivet

    Abstract: We propose an extension of the canonical polyadic (CP) tensor model where one of the latent factors is allowed to vary through data slices in a constrained way. The components of the latent factors, which we want to retrieve from data, can vary from one slice to another up to a diffeomorphism. We suppose that the diffeomorphisms are also unknown, thus merging curve registration and tensor decompos… ▽ More

    Submitted 9 February, 2018; originally announced February 2018.

  42. arXiv:1712.04120  [pdf, other

    stat.ML cs.LG

    GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

    Authors: Alex Lamb, Devon Hjelm, Yaroslav Ganin, Joseph Paul Cohen, Aaron Courville, Yoshua Bengio

    Abstract: Directed latent variable models that formulate the joint distribution as $p(x,z) = p(z) p(x \mid z)$ have the advantage of fast and exact sampling. However, these models have the weakness of needing to specify $p(z)$, often with a simple fixed prior that limits the expressiveness of the model. Undirected latent variable models discard the requirement that $p(z)$ be specified with a prior, yet samp… ▽ More

    Submitted 11 December, 2017; originally announced December 2017.

    Comments: NIPS 2017

  43. arXiv:1711.03058  [pdf, other

    stat.ML q-bio.NC

    Matrix-normal models for fMRI analysis

    Authors: Michael Shvartsman, Narayanan Sundaram, Mikio C. Aoi, Adam Charles, Theodore C. Wilke, Jonathan D. Cohen

    Abstract: Multivariate analysis of fMRI data has benefited substantially from advances in machine learning. Most recently, a range of probabilistic latent variable models applied to fMRI data have been successful in a variety of tasks, including identifying similarity patterns in neural data (Representational Similarity Analysis and its empirical Bayes variant, RSA and BRSA; Intersubject Functional Connecti… ▽ More

    Submitted 9 November, 2017; v1 submitted 8 November, 2017; originally announced November 2017.

  44. Dictionary-based Tensor Canonical Polyadic Decomposition

    Authors: Jérémy E. Cohen, Nicolas Gillis

    Abstract: To ensure interpretability of extracted sources in tensor decomposition, we introduce in this paper a dictionary-based tensor canonical polyadic decomposition which enforces one factor to belong exactly to a known dictionary. A new formulation of sparse coding is proposed which enables high dimensional tensors dictionary-based canonical polyadic decomposition. The benefits of using a dictionary in… ▽ More

    Submitted 8 November, 2017; v1 submitted 3 April, 2017; originally announced April 2017.

    Journal ref: IEEE Trans. on Signal Processing 66 (7), pp. 1876-1889, 2018

  45. arXiv:1703.08710  [pdf, other

    cs.CV cs.LG stat.ML

    Count-ception: Counting by Fully Convolutional Redundant Counting

    Authors: Joseph Paul Cohen, Genevieve Boucher, Craig A. Glastonbury, Henry Z. Lo, Yoshua Bengio

    Abstract: Counting objects in digital images is a process that should be replaced by machines. This tedious task is time consuming and prone to errors due to fatigue of human annotators. The goal is to have a system that takes as input an image and returns a count of the objects inside and justification for the prediction in the form of object localization. We repose a problem, originally posed by Lempitsky… ▽ More

    Submitted 23 July, 2017; v1 submitted 25 March, 2017; originally announced March 2017.

    Comments: Under Review

  46. arXiv:1702.00261  [pdf, other

    stat.AP q-bio.QM stat.ME

    Phenomenological forecasting of disease incidence using heteroskedastic Gaussian processes: a dengue case study

    Authors: Leah R. Johnson, Robert B. Gramacy, Jeremy Cohen, Erin Mordecai, Courtney Murdock, Jason Rohr, Sadie J. Ryan, Anna M. Stewart-Ibarra, Daniel Weikel

    Abstract: In 2015 the US federal government sponsored a dengue forecasting competition using historical case data from Iquitos, Peru and San Juan, Puerto Rico. Competitors were evaluated on several aspects of out-of-sample forecasts including the targets of peak week, peak incidence during that week and total season incidence across each of several seasons. Our team was one of the top performers of that com… ▽ More

    Submitted 1 August, 2017; v1 submitted 1 February, 2017; originally announced February 2017.

    Comments: 39 pages, 13 figures, 4 tables, including appendices