Skip to main content

Showing 1–8 of 8 results for author: Chlenski, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.04360  [pdf, ps, other

    cs.LG

    Even Faster Hyperbolic Random Forests: A Beltrami-Klein Wrapper Approach

    Authors: Philippe Chlenski, Itsik Pe'er

    Abstract: Decision trees and models that use them as primitives are workhorses of machine learning in Euclidean spaces. Recent work has further extended these models to the Lorentz model of hyperbolic space by replacing axis-parallel hyperplanes with homogeneous hyperplanes when partitioning the input space. In this paper, we show how the hyperDT algorithm can be elegantly reexpressed in the Beltrami-Klein… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 15 pages, 4 figures, 2 tables

  2. arXiv:2503.09576  [pdf, other

    cs.LG

    Manify: A Python Library for Learning Non-Euclidean Representations

    Authors: Philippe Chlenski, Kaizhu Du, Dylan Satow, Itsik Pe'er

    Abstract: We present Manify, an open-source Python library for non-Euclidean representation learning. Leveraging manifold learning techniques, Manify provides tools for learning embeddings in (products of) non-Euclidean spaces, performing classification and regression with data that lives in such spaces, and estimating the curvature of a manifold. Manify aims to advance research and applications in machine… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: 30 pages, 4 figures, 4 tables. Preprint

  3. arXiv:2501.17965  [pdf, other

    cs.LG stat.ML

    Variational Combinatorial Sequential Monte Carlo for Bayesian Phylogenetics in Hyperbolic Space

    Authors: Alex Chen, Philipe Chlenski, Kenneth Munyuza, Antonio Khalil Moretti, Christian A. Naesseth, Itsik Pe'er

    Abstract: Hyperbolic space naturally encodes hierarchical structures such as phylogenies (binary trees), where inward-bending geodesics reflect paths through least common ancestors, and the exponential growth of neighborhoods mirrors the super-exponential scaling of topologies. This scaling challenge limits the efficiency of Euclidean-based approximate inference methods. Motivated by the geometric connectio… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

    Comments: 24 pages, 10 figures

  4. arXiv:2410.13879  [pdf, ps, other

    cs.LG

    Mixed-curvature decision trees and random forests

    Authors: Philippe Chlenski, Quentin Chu, Raiyan R. Khan, Kaizhu Du, Antonio Khalil Moretti, Itsik Pe'er

    Abstract: Decision trees (DTs) and their random forest (RF) extensions are workhorses of classification and regression in Euclidean spaces. However, algorithms for learning in non-Euclidean spaces are still limited. We extend DT and RF algorithms to product manifolds: Cartesian products of several hyperbolic, hyperspherical, or Euclidean components. Such manifolds handle heterogeneous curvature while still… ▽ More

    Submitted 6 June, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: 30 pages, 12 figures, 13 tables. Camera-ready version for ICML 2025

  5. arXiv:2406.11944  [pdf, other

    cs.LG cs.CL

    Transcoders Find Interpretable LLM Feature Circuits

    Authors: Jacob Dunefsky, Philippe Chlenski, Neel Nanda

    Abstract: A key goal in mechanistic interpretability is circuit analysis: finding sparse subgraphs of models corresponding to specific behaviors or capabilities. However, MLP sublayers make fine-grained circuit analysis on transformer-based language models difficult. In particular, interpretable features -- such as those found by sparse autoencoders (SAEs) -- are typically linear combinations of extremely m… ▽ More

    Submitted 6 November, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 29 pages, 6 figures, 4 tables, 2 algorithms. NeurIPS 2024

  6. arXiv:2406.05227   

    cs.LG

    Mixed-Curvature Decision Trees and Random Forests

    Authors: Philippe Chlenski, Quentin Chu, Itsik Pe'er

    Abstract: We extend decision tree and random forest algorithms to product space manifolds: Cartesian products of Euclidean, hyperspherical, and hyperbolic manifolds. Such spaces have extremely expressive geometries capable of representing many arrangements of distances with low metric distortion. To date, all classifiers for product spaces fit a single linear decision boundary, and no regressor has been des… ▽ More

    Submitted 7 May, 2025; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: This paper has been replaced by a newer version at arXiv:2410.13879

  7. arXiv:2406.03242  [pdf, other

    cs.LG stat.CO

    Variational Pseudo Marginal Methods for Jet Reconstruction in Particle Physics

    Authors: Hanming Yang, Antonio Khalil Moretti, Sebastian Macaluso, Philippe Chlenski, Christian A. Naesseth, Itsik Pe'er

    Abstract: Reconstructing jets, which provide vital insights into the properties and histories of subatomic particles produced in high-energy collisions, is a main problem in data analyses in collider physics. This intricate task deals with estimating the latent structure of a jet (binary tree) and involves parameters such as particle energy, momentum, and types. While Bayesian methods offer a natural approa… ▽ More

    Submitted 30 December, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 21 pages, 9 figures

    Journal ref: Transactions on Machine Learning Research 2024

  8. arXiv:2310.13841  [pdf, other

    cs.LG

    Fast hyperboloid decision tree algorithms

    Authors: Philippe Chlenski, Ethan Turok, Antonio Moretti, Itsik Pe'er

    Abstract: Hyperbolic geometry is gaining traction in machine learning for its effectiveness at capturing hierarchical structures in real-world data. Hyperbolic spaces, where neighborhoods grow exponentially, offer substantial advantages and consistently deliver state-of-the-art results across diverse applications. However, hyperbolic classifiers often grapple with computational challenges. Methods reliant o… ▽ More

    Submitted 4 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Journal ref: International Conference on Learning Representations (2024)