Skip to main content

Showing 1–9 of 9 results for author: White, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2301.00557  [pdf, other

    cs.LG cs.IT stat.ML

    Learning to Maximize Mutual Information for Dynamic Feature Selection

    Authors: Ian Covert, Wei Qiu, Mingyu Lu, Nayoon Kim, Nathan White, Su-In Lee

    Abstract: Feature selection helps reduce data acquisition costs in ML, but the standard approach is to train models with static feature subsets. Here, we consider the dynamic feature selection (DFS) problem where a model sequentially queries features based on the presently available information. DFS is often addressed with reinforcement learning, but we explore a simpler approach of greedily selecting featu… ▽ More

    Submitted 8 June, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

    Comments: ICML 2023 camera-ready

  2. arXiv:2001.11552  [pdf

    physics.soc-ph cs.CL cs.CY cs.SI stat.AP stat.ML

    Unwanted Advances in Higher Education: Uncovering Sexual Harassment Experiences in Academia with Text Mining

    Authors: Amir Karami, Cynthia Nicole White, Kayla Ford, Suzanne Swan, Melek Yildiz Spinel

    Abstract: Sexual harassment in academia is often a hidden problem because victims are usually reluctant to report their experiences. Recently, a web survey was developed to provide an opportunity to share thousands of sexual harassment experiences in academia. Using an efficient approach, this study collected and investigated more than 2,000 sexual harassment experiences to better understand these unwanted… ▽ More

    Submitted 11 December, 2019; originally announced January 2020.

  3. arXiv:1910.02379  [pdf, other

    stat.AP

    Factors associated with injurious from falls in people with early stage Parkinson's disease

    Authors: Sarini Abdullah, James McGree, Nicole White, Kerrie Mengersen, Graham Kerr

    Abstract: Falls are common in people with Parkinson's disease (PD) and have detrimental effects which can lower the quality of life. While studies have been conducted to learn about falling in general, factors distinguishing injurious from non-injurious falls are less clear. We develop a two-stage Bayesian logistic regression model was used to model the association of falls and injurious falls with data mea… ▽ More

    Submitted 6 October, 2019; originally announced October 2019.

    Comments: 18 pages, 3 figures, 4 tables

    MSC Class: 62P10; 62-07; 62J12 ACM Class: J.3.2; G.3.2; G.3.6

  4. arXiv:1910.01864  [pdf, other

    stat.AP

    Profile regression for subgrouping patients with early stage Parkinson's disease

    Authors: Sarini Abdullah, James McGree, Nicole White, Kerrie Mengersen, Graham Kerr

    Abstract: Falls are detrimental to people with Parkinson's Disease (PD) because of the potentially severe consequences to the patients' quality of life. While many studies have attempted to predict falls/non-falls, this study aimed to determine factors related to falls frequency in people with early PD. Ninety nine participants with early stage PD were assessed based on two types of tests. The first type of… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: 30 pages, 11 figures, 4 tables

    MSC Class: 62-07; 62P10; 62H30 ACM Class: G.3.6; G.3.14; J.3.2

  5. arXiv:1910.01313  [pdf, other

    stat.AP

    Assessing the predictive ability of the UPDRS for falls classification in early stage Parkinson's disease

    Authors: Sarini Abdullah, Nicole White, James McGree, Kerrie Mengersen, Graham Kerr

    Abstract: Identification of risk factors associated with falls in people with Parkinson's Disease (PD) is important due to their high risk of falling. In this study, various ways of utilizing the Unified Parkinson's Disease Rating Scale (UPDRS) were assessed for the identification of risk factors and for the prediction of falls. Three statistical methods for classification were considered:decision trees, ra… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: 29 pages, 7 figures, 5 tables

    MSC Class: 62P10; 62-07; 62H30 ACM Class: G.3.6; G.3.7; J.3.2

  6. arXiv:1907.00510  [pdf

    cs.CY cs.CL stat.AP

    Hidden in Plain Sight For Too Long: Using Text Mining Techniques to Shine a Light on Workplace Sexism and Sexual Harassment

    Authors: Amir Karami, Suzanne C. Swan, Cynthia Nicole White, Kayla Ford

    Abstract: Objective: The goal of this study is to understand how people experience sexism and sexual harassment in the workplace by discovering themes in 2,362 experiences posted on the Everyday Sexism Project's website everydaysexism.com. Method: This study used both quantitative and qualitative methods. The quantitative method was a computational framework to collect and analyze a large number of workplac… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.

  7. arXiv:1602.02466  [pdf, other

    stat.ME math.ST

    Overfitting hidden Markov models with an unknown number of states

    Authors: Zoé van Havre, Judith Rousseau, Nicole White, Kerrie Mengersen

    Abstract: This paper presents new theory and methodology for the Bayesian estimation of overfitted hidden Markov models, with finite state space. The goal is then to achieve posterior emptying of extra states. A prior configuration is constructed which favours configurations where the hidden Markov chain remains ergodic although it empties out some of the states. Asymptotic posterior convergence rates are p… ▽ More

    Submitted 8 February, 2016; originally announced February 2016.

    Comments: Submitted to Bayesian Analysis on 04-August-2015

  8. arXiv:1602.01915  [pdf, ps, other

    stat.AP stat.ME

    Clustering action potential spikes: Insights on the use of overfitted finite mixture models and Dirichlet process mixture models

    Authors: Zoé van Havre, Nicole White, Judith Rousseau, Kerrie Mengersen

    Abstract: The modelling of action potentials from extracellular recordings, or spike sorting, is a rich area of neuroscience research in which latent variable models are often used. Two such models, Overfitted Finite Mixture models (OFMs) and Dirichlet Process Mixture models (DPMs) are considered to provide insights for unsupervised clustering of complex, multivariate medical data when the number of cluster… ▽ More

    Submitted 4 February, 2016; originally announced February 2016.

    Comments: Submitted to Australian & New Zealand Journal of Statistics on 31-Aug-2015

  9. Overfitting Bayesian Mixture Models with an Unknown Number of Components

    Authors: Zoe van Havre, Nicole White, Judith Rousseau, Kerrie Mengersen

    Abstract: This paper proposes solutions to three issues pertaining to the estimation of finite mixture models with an unknown number of components: the non-identifiability induced by overfitting the number of components, the mixing limitations of standard Markov Chain Monte Carlo (MCMC) sampling techniques, and the related label switching problem. An overfitting approach is used to estimate the number of co… ▽ More

    Submitted 24 August, 2015; v1 submitted 18 February, 2015; originally announced February 2015.

    Journal ref: Plos One, 10(7), e0131739 (2015)