Showing 1–7 of 7 results for author: Rhodes, J S

  1. arXiv:2502.13257  [pdf, other]

    cs.LG

    Random Forest Autoencoders for Guided Representation Learning

    Authors: Adrien Aumon, Shuang Ni, Myriam Lizotte, Guy Wolf, Kevin R. Moon, Jake S. Rhodes

    Abstract: Extensive research has produced robust methods for unsupervised data visualization. Yet supervised visualization–where expert labels guide representations–remains underexplored, as most supervised approaches prioritize classification over visualization. Recently, RF-PHATE, a diffusion-based manifold learning method leveraging random forests and information geometry,…

    Submitted 18 May, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

  2. arXiv:2411.15179  [pdf, other]

    cs.LG stat.ML

    Random Forest-Supervised Manifold Alignment

    Authors: Jake S. Rhodes, Adam G. Rustad

    Abstract: Manifold alignment is a type of data fusion technique that creates a shared low-dimensional representation of data collected from multiple domains, enabling cross-domain learning and improved performance in downstream tasks. This paper presents an approach to manifold alignment using random forests as a foundation for semi-supervised alignment algorithms, leveraging the model's inherent strengths.…

    Submitted 18 November, 2024; originally announced November 2024.

    Comments: 4 pages, 3 figures, Accepted at MMAI 2024 (BigData 2024)

  3. arXiv:2410.22978  [pdf, other]

    stat.ML cs.LG

    Graph Integration for Diffusion-Based Manifold Alignment

    Authors: Jake S. Rhodes, Adam G. Rustad

    Abstract: Data from individual observations can originate from various sources or modalities but are often intrinsically linked. Multimodal data integration can enrich information content compared to single-source data. Manifold alignment is a form of data integration that seeks a shared, underlying low-dimensional representation of multiple data sources that emphasizes similarities between alternative repr…

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 8 pages, 4 figures, Accepted at ICMLA 2024

  4. arXiv:2406.04421  [pdf, other]

    cs.LG stat.ML

    Enhancing Supervised Visualization through Autoencoder and Random Forest Proximities for Out-of-Sample Extension

    Authors: Shuang Ni, Adrien Aumon, Guy Wolf, Kevin R. Moon, Jake S. Rhodes

    Abstract: The value of supervised dimensionality reduction lies in its ability to uncover meaningful connections between data features and labels. Common dimensionality reduction methods embed a set of fixed, latent points, but are not capable of generalizing to an unseen test set. In this paper, we provide an out-of-sample extension method for the random forest-based supervised dimensionality reduction met…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 7 pages, 3 figures

  5. arXiv:2307.01077  [pdf, other]

    stat.ML cs.LG

    Supervised Manifold Learning via Random Forest Geometry-Preserving Proximities

    Authors: Jake S. Rhodes

    Abstract: Manifold learning approaches seek the intrinsic, low-dimensional data structure within a high-dimensional space. Mainstream manifold learning algorithms, such as Isomap, UMAP, $t$-SNE, Diffusion Map, and Laplacian Eigenmaps do not use data labels and are thus considered unsupervised. Existing supervised extensions of these methods are limited to classification problems and fall short of uncovering…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 10 pages

  6. arXiv:2201.12682  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    Geometry- and Accuracy-Preserving Random Forest Proximities

    Authors: Jake S. Rhodes, Adele Cutler, Kevin R. Moon

    Abstract: Random forests are considered one of the best out-of-the-box classification and regression algorithms due to their high level of predictive performance with relatively little tuning. Pairwise proximities can be computed from a trained random forest and measure the similarity between data points relative to the supervised task. Random forest proximities have been used in many applications including…

    Submitted 28 February, 2023; v1 submitted 29 January, 2022; originally announced January 2022.

  7. arXiv:2006.08701  [pdf, other]

    stat.ML cs.HC cs.LG stat.AP

    Supervised Visualization for Data Exploration

    Authors: Jake S. Rhodes, Adele Cutler, Guy Wolf, Kevin R. Moon

    Abstract: Dimensionality reduction is often used as an initial step in data exploration, either as preprocessing for classification or regression or for visualization. Most dimensionality reduction techniques to date are unsupervised; they do not take class labels into account (e.g., PCA, MDS, t-SNE, Isomap). Such methods require large amounts of data and are often sensitive to noise that may obfuscate impo…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 21 pages, 9 figures