Skip to main content

Showing 1–1 of 1 results for author: Bodine, C S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.08643  [pdf, other

    stat.ML cs.AI cs.LG

    SOAK: Same/Other/All K-fold cross-validation for estimating similarity of patterns in data subsets

    Authors: Toby Dylan Hocking, Gabrielle Thibault, Cameron Scott Bodine, Paul Nelson Arellano, Alexander F Shenkin, Olivia Jasmine Lindly

    Abstract: In many real-world applications of machine learning, we are interested to know if it is possible to train on the data that we have gathered so far, and obtain accurate predictions on a new test data subset that is qualitatively different in some respect (time period, geographic region, etc). Another question is whether data subsets are similar enough so that it is beneficial to combine subsets dur… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.