Skip to main content

Showing 1–2 of 2 results for author: Lynch, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2009.00065  [pdf

    stat.AP

    Variable selection in social-environmental data: Sparse regression and tree ensemble machine learning approaches

    Authors: Elizabeth Handorf, Yinuo Yin, Michael Slifker, Shannon Lynch

    Abstract: Objective: Social-environmental data obtained from the U.S. Census is an important resource for understanding health disparities, but rarely is the full dataset utilized for analysis. A barrier to incorporating the full data is a lack of solid recommendations for variable selection, with researchers often hand-selecting a few variables. Thus, we evaluated the ability of empirical machine learning… ▽ More

    Submitted 31 August, 2020; originally announced September 2020.

    Comments: 22 pages, 1 figure, 4 tables

  2. arXiv:2008.12829  [pdf, other

    cs.LG stat.ML

    A Rigorous Machine Learning Analysis Pipeline for Biomedical Binary Classification: Application in Pancreatic Cancer Nested Case-control Studies with Implications for Bias Assessments

    Authors: Ryan J. Urbanowicz, Pranshu Suri, Yuhan Cui, Jason H. Moore, Karen Ruth, Rachael Stolzenberg-Solomon, Shannon M. Lynch

    Abstract: Machine learning (ML) offers a collection of powerful approaches for detecting and modeling associations, often applied to data having a large number of features and/or complex associations. Currently, there are many tools to facilitate implementing custom ML analyses (e.g. scikit-learn). Interest is also increasing in automated ML packages, which can make it easier for non-experts to apply ML and… ▽ More

    Submitted 8 September, 2020; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: 22 pages, 12 figures