-
Exchangeable Neural ODE for Set Modeling
Authors:
Yang Li,
Haidong Yi,
Christopher M. Bender,
Siyuan Shan,
Junier B. Oliva
Abstract:
Reasoning over an instance composed of a set of vectors, like a point cloud, requires that one accounts for intra-set dependent features among elements. However, since such instances are unordered, the elements' features should remain unchanged when the input's order is permuted. This property, permutation equivariance, is a challenging constraint for most neural architectures. While recent work h…
▽ More
Reasoning over an instance composed of a set of vectors, like a point cloud, requires that one accounts for intra-set dependent features among elements. However, since such instances are unordered, the elements' features should remain unchanged when the input's order is permuted. This property, permutation equivariance, is a challenging constraint for most neural architectures. While recent work has proposed global pooling and attention-based solutions, these may be limited in the way that intradependencies are captured in practice. In this work we propose a more general formulation to achieve permutation equivariance through ordinary differential equations (ODE). Our proposed module, Exchangeable Neural ODE (ExNODE), can be seamlessly applied for both discriminative and generative tasks. We also extend set modeling in the temporal dimension and propose a VAE based model for temporal set modeling. Extensive experiments demonstrate the efficacy of our method over strong baselines.
△ Less
Submitted 6 August, 2020;
originally announced August 2020.
-
Deep Goal-Oriented Clustering
Authors:
Yifeng Shi,
Christopher M. Bender,
Junier B. Oliva,
Marc Niethammer
Abstract:
Clustering and prediction are two primary tasks in the fields of unsupervised and supervised learning, respectively. Although much of the recent advances in machine learning have been centered around those two tasks, the interdependent, mutually beneficial relationship between them is rarely explored. One could reasonably expect appropriately clustering the data would aid the downstream prediction…
▽ More
Clustering and prediction are two primary tasks in the fields of unsupervised and supervised learning, respectively. Although much of the recent advances in machine learning have been centered around those two tasks, the interdependent, mutually beneficial relationship between them is rarely explored. One could reasonably expect appropriately clustering the data would aid the downstream prediction task and, conversely, a better prediction performance for the downstream task could potentially inform a more appropriate clustering strategy. In this work, we focus on the latter part of this mutually beneficial relationship. To this end, we introduce Deep Goal-Oriented Clustering (DGC), a probabilistic framework that clusters the data by jointly using supervision via side-information and unsupervised modeling of the inherent data structure in an end-to-end fashion. We show the effectiveness of our model on a range of datasets by achieving prediction accuracies comparable to the state-of-the-art, while, more importantly in our setting, simultaneously learning congruent clustering strategies.
△ Less
Submitted 15 June, 2020; v1 submitted 7 June, 2020;
originally announced June 2020.
-
Defense Through Diverse Directions
Authors:
Christopher M. Bender,
Yang Li,
Yifeng Shi,
Michael K. Reiter,
Junier B. Oliva
Abstract:
In this work we develop a novel Bayesian neural network methodology to achieve strong adversarial robustness without the need for online adversarial training. Unlike previous efforts in this direction, we do not rely solely on the stochasticity of network weights by minimizing the divergence between the learned parameter distribution and a prior. Instead, we additionally require that the model mai…
▽ More
In this work we develop a novel Bayesian neural network methodology to achieve strong adversarial robustness without the need for online adversarial training. Unlike previous efforts in this direction, we do not rely solely on the stochasticity of network weights by minimizing the divergence between the learned parameter distribution and a prior. Instead, we additionally require that the model maintain some expected uncertainty with respect to all input covariates. We demonstrate that by encouraging the network to distribute evenly across inputs, the network becomes less susceptible to localized, brittle features which imparts a natural robustness to targeted perturbations. We show empirical robustness on several benchmark datasets.
△ Less
Submitted 23 March, 2020;
originally announced March 2020.
-
Assessing Health Care Interventions via an Interrupted Time Series Model: Study Power and Design Considerations
Authors:
Maricela Cruz,
Daniel L. Gillen,
Miriam Bender,
Hernando Ombao
Abstract:
The delivery and assessment of quality health care is complex with many interacting and interdependent components. In terms of research design and statistical analysis, this complexity and interdependency makes it difficult to assess the true impact of interventions designed to improve patient health care outcomes. Interrupted time series (ITS) is a quasi-experimental design developed for inferrin…
▽ More
The delivery and assessment of quality health care is complex with many interacting and interdependent components. In terms of research design and statistical analysis, this complexity and interdependency makes it difficult to assess the true impact of interventions designed to improve patient health care outcomes. Interrupted time series (ITS) is a quasi-experimental design developed for inferring the effectiveness of a health policy intervention while accounting for temporal dependence within a single system or unit. Current standardized ITS methods do not simultaneously analyze data for several units, nor are there methods to test for the existence of a change point and to assess statistical power for study planning purposes in this context. To address this limitation we propose the `Robust Multiple ITS' (R-MITS) model, appropriate for multi-unit ITS data, that allows for inference regarding the estimation of a global change point across units in the presence of a potentially lagged (or anticipatory) treatment effect. Under the R-MITS model, one can formally test for the existence of a change point and estimate the time delay between the formal intervention implementation and the over-all-unit intervention effect. We conducted empirical simulation studies to assess the type one error rate of the testing procedure, power for detecting specified change-point alternatives, and accuracy of the proposed estimating methodology. R-MITS is illustrated by analyzing patient satisfaction data from a hospital that implemented and evaluated a new care delivery model in multiple units.
△ Less
Submitted 29 November, 2018; v1 submitted 18 May, 2018;
originally announced May 2018.
-
A Robust Interrupted Time Series Model for Analyzing Complex Healthcare Intervention Data
Authors:
Maricela Cruz,
Miriam Bender,
Hernando Ombao
Abstract:
Current health policy calls for greater use of evidence based care delivery services to improve patient quality and safety outcomes. Care delivery is complex, with interacting and interdependent components that challenge traditional statistical analytic techniques, in particular when modeling a time series of outcomes data that might be "interrupted" by a change in a particular method of health ca…
▽ More
Current health policy calls for greater use of evidence based care delivery services to improve patient quality and safety outcomes. Care delivery is complex, with interacting and interdependent components that challenge traditional statistical analytic techniques, in particular when modeling a time series of outcomes data that might be "interrupted" by a change in a particular method of health care delivery. Interrupted time series (ITS) is a robust quasi-experimental design with the ability to infer the effectiveness of an intervention that accounts for data dependency. Current standardized methods for analyzing ITS data do not model changes in variation and correlation following the intervention. This is a key limitation since it is plausible for data variability and dependency to change because of the intervention. Moreover, present methodology either assumes a pre-specified interruption time point with an instantaneous effect or removes data for which the effect of intervention is not fully realized. In this paper, we describe and develop a novel `Robust-ITS' model that overcomes these omissions and limitations. The Robust-ITS model formally performs inference on: (a) identifying the change point; (b) differences in pre- and post-intervention correlation; (c) differences in the outcome variance pre- and post-intervention; and (d) differences in the mean pre- and post-intervention. We illustrate the proposed method by analyzing patient satisfaction data from a hospital that implemented and evaluated a new nursing care delivery model as the intervention of interest. The Robust-ITS model is implemented in a R Shiny toolbox which is freely available to the community.
△ Less
Submitted 25 March, 2019; v1 submitted 6 July, 2017;
originally announced July 2017.