
Showing 1–21 of 21 results for author: Obuchi, T

Searching in archive stat.
  1. arXiv:2506.05801  [pdf, ps, other]

    cs.LG stat.ML

    Neural Collapse in Cumulative Link Models for Ordinal Regression: An Analysis with Unconstrained Feature Model

    Authors: Chuang Ma, Tomoyuki Obuchi, Toshiyuki Tanaka

    Abstract: A phenomenon known as "Neural Collapse (NC)" in deep classification tasks, in which the penultimate-layer features and the final classifiers exhibit an extremely simple geometric structure, has recently attracted considerable attention, with the expectation that it can deepen our understanding of how deep neural networks behave. The Unconstrained Feature Model (UFM) has been proposed to explain…

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2411.19553  [pdf, other]

    cs.LG stat.ML

    Analysis of High-dimensional Gaussian Labeled-unlabeled Mixture Model via Message-passing Algorithm

    Authors: Xiaosi Gu, Tomoyuki Obuchi

    Abstract: Semi-supervised learning (SSL) is a machine learning methodology that leverages unlabeled data in conjunction with a limited amount of labeled data. Although SSL has been applied in various applications and its effectiveness has been empirically demonstrated, it is still not fully understood when and why SSL performs well. Some existing theoretical studies have attempted to address this issue by m…

    Submitted 12 March, 2025; v1 submitted 29 November, 2024; originally announced November 2024.

    Comments: 48 pages, 16 figures

  3. arXiv:2409.17704  [pdf, other]

    stat.ML cond-mat.dis-nn cond-mat.stat-mech cs.LG

    Transfer Learning in $\ell_1$ Regularized Regression: Hyperparameter Selection Strategy based on Sharp Asymptotic Analysis

    Authors: Koki Okajima, Tomoyuki Obuchi

    Abstract: Transfer learning techniques aim to leverage information from multiple related datasets to enhance prediction quality against a target dataset. Such methods have been adopted in the context of high-dimensional sparse regression, and some Lasso-based algorithms have been invented: Trans-Lasso and Pretraining Lasso are such examples. These algorithms require the statistician to select hyperparameter…

    Submitted 30 January, 2025; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: 23 pages, 9 figures

    Journal ref: Transactions on Machine Learning Research (2025). <https://openreview.net/forum?id=ccu0M3nmlF>

  4. arXiv:2409.05598  [pdf, ps, other]

    stat.ML cond-mat.dis-nn cs.IT cs.LG

    When resampling/reweighting improves feature learning in imbalanced classification?: A toy-model study

    Authors: Tomoyuki Obuchi, Toshiyuki Tanaka

    Abstract: A toy model of binary classification is studied with the aim of clarifying the class-wise resampling/reweighting effect on the feature learning performance under the presence of class imbalance. In the analysis, a high-dimensional limit of the input space is taken while keeping the ratio of the dataset size against the input dimension finite and the non-rigorous replica method from statistical mec…

    Submitted 22 April, 2025; v1 submitted 9 September, 2024; originally announced September 2024.

    Comments: 33 pages, 14 figures

    Journal ref: Transactions on Machine Learning Research, 2025. Available at: https://openreview.net/forum?id=spqbyeGyLR

  5. arXiv:2110.08500  [pdf, other]

    stat.ML cs.LG math.ST

    On Model Selection Consistency of Lasso for High-Dimensional Ising Models

    Authors: Xiangming Meng, Tomoyuki Obuchi, Yoshiyuki Kabashima

    Abstract: We theoretically analyze the model selection consistency of least absolute shrinkage and selection operator (Lasso), both with and without post-thresholding, for high-dimensional Ising models. For random regular (RR) graphs of size $p$ with regular node degree $d$ and uniform couplings $θ_0$, it is rigorously proved that Lasso \textit{without post-thresholding} is model selection consistent in the…

    Submitted 17 February, 2023; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: AISTATS2023, camera-ready version

  6. arXiv:2102.03988  [pdf, other]

    cs.LG cond-mat.dis-nn cs.AI stat.ML

    Ising Model Selection Using $\ell_{1}$-Regularized Linear Regression: A Statistical Mechanics Analysis

    Authors: Xiangming Meng, Tomoyuki Obuchi, Yoshiyuki Kabashima

    Abstract: We theoretically analyze the typical learning performance of $\ell_{1}$-regularized linear regression ($\ell_1$-LinR) for Ising model selection using the replica method from statistical mechanics. For typical random regular graphs in the paramagnetic phase, an accurate estimate of the typical sample complexity of $\ell_1$-LinR is obtained. Remarkably, despite the model misspecification, $\ell_1$-L…

    Submitted 1 November, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: Accepted to NeurIPS 2021. Camera-ready version with supplementary materials

  7. arXiv:2008.08342  [pdf, other]

    cond-mat.dis-nn cs.LG stat.ML

    Structure Learning in Inverse Ising Problems Using $\ell_2$-Regularized Linear Estimator

    Authors: Xiangming Meng, Tomoyuki Obuchi, Yoshiyuki Kabashima

    Abstract: The inference performance of the pseudolikelihood method is discussed in the framework of the inverse Ising problem when the $\ell_2$-regularized (ridge) linear regression is adopted. This setup is introduced for theoretically investigating the situation where the data generation model is different from the inference one, namely the model mismatch situation. In the teacher-student scenario under t…

    Submitted 23 November, 2020; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: 35 pages, 8 figures

  8. arXiv:2008.03175  [pdf, other]

    stat.ML cond-mat.dis-nn cs.LG

    Reconstructing Sparse Signals via Greedy Monte-Carlo Search

    Authors: Kao Hayashi, Tomoyuki Obuchi, Yoshiyuki Kabashima

    Abstract: We propose a Monte-Carlo-based method for reconstructing sparse signals in the formulation of sparse linear regression in a high-dimensional setting. The basic idea of this algorithm is to explicitly select variables or covariates to represent a given data vector or responses and accept randomly generated updates of that selection if and only if the energy or cost function decreases. This algorith…

    Submitted 29 January, 2021; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: 15 pages, 4 figures

  9. arXiv:1912.11591  [pdf, other]

    cond-mat.dis-nn cs.LG stat.ML

    Learning performance in inverse Ising problems with sparse teacher couplings

    Authors: Alia Abbara, Yoshiyuki Kabashima, Tomoyuki Obuchi, Yingying Xu

    Abstract: We investigate the learning performance of the pseudolikelihood maximization method for inverse Ising problems. In the teacher-student scenario under the assumption that the teacher's couplings are sparse and the student does not know the graphical structure, the learning curve and order parameters are assessed in the typical case using the replica and cavity methods from statistical mechanics. Ou…

    Submitted 1 May, 2020; v1 submitted 24 December, 2019; originally announced December 2019.

    Comments: 29 pages, 8 figures

  10. arXiv:1906.06002  [pdf, ps, other]

    stat.ML cond-mat.dis-nn cs.LG

    Empirical Bayes Method for Boltzmann Machines

    Authors: Muneki Yasuda, Tomoyuki Obuchi

    Abstract: In this study, we consider an empirical Bayes method for Boltzmann machines and propose an algorithm for it. The empirical Bayes method allows estimation of the values of the hyperparameters of the Boltzmann machine by maximizing a specific likelihood function referred to as the empirical Bayes likelihood function in this study. However, the maximization is computationally hard because the empiric…

    Submitted 7 September, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

    Journal ref: Journal of Physics A: Mathematical and Theoretical, vol.53, 014004, 2019

  11. arXiv:1902.10375  [pdf, ps, other]

    stat.ML cond-mat.dis-nn cs.LG

    Cross validation in sparse linear regression with piecewise continuous nonconvex penalties and its acceleration

    Authors: Tomoyuki Obuchi, Ayaka Sakata

    Abstract: We investigate the signal reconstruction performance of sparse linear regression in the presence of noise when piecewise continuous nonconvex penalties are used. Among such penalties, we focus on the SCAD penalty. The contributions of this study are three-fold: We first present a theoretical analysis of a typical reconstruction performance, using the replica method, under the assumption that each…

    Submitted 25 December, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: 33 pages, 18 figures. MATLAB codes implementing the proposed method are distributed in https://github.com/T-Obuchi/SLRpackage_AcceleratedCV_matlab

  12. arXiv:1902.07436  [pdf, other]

    stat.ML cs.LG

    Perfect reconstruction of sparse signals with piecewise continuous nonconvex penalties and nonconvexity control

    Authors: Ayaka Sakata, Tomoyuki Obuchi

    Abstract: We consider compressed sensing formulated as a minimization problem of nonconvex sparse penalties, Smoothly Clipped Absolute deviation (SCAD) and Minimax Concave Penalty (MCP). The nonconvexity of these penalties is controlled by nonconvexity parameters, and L1 penalty is contained as a limit with respect to these parameters. The analytically derived reconstruction limit overcomes that of L1 and t…

    Submitted 5 June, 2021; v1 submitted 20 February, 2019; originally announced February 2019.

    Comments: 25 pages, 17 figures

  13. arXiv:1810.11908  [pdf, other]

    cs.LG cs.SI physics.soc-ph stat.ML

    Mean-field theory of graph neural networks in graph partitioning

    Authors: Tatsuro Kawamoto, Masashi Tsubaki, Tomoyuki Obuchi

    Abstract: A theoretical performance analysis of the graph neural network (GNN) is presented. For classification tasks, the neural network approach has the advantage in terms of flexibility that it can be employed in a data-driven manner, whereas Bayesian inference requires the assumption of a specific model. A fundamental question is then whether GNN has a high accuracy in addition to this flexibility. More…

    Submitted 28 October, 2018; originally announced October 2018.

    Comments: 16 pages, 6 figures, Thirty-second Conference on Neural Information Processing Systems (NIPS2018)

  14. arXiv:1805.11259  [pdf, other]

    cond-mat.dis-nn cs.IT stat.ML

    Statistical mechanical analysis of sparse linear regression as a variable selection problem

    Authors: Tomoyuki Obuchi, Yoshinori Nakanishi-Ohno, Masato Okada, Yoshiyuki Kabashima

    Abstract: An algorithmic limit of compressed sensing or related variable-selection problems is analytically evaluated when a design matrix is given by an overcomplete random matrix. The replica method from statistical mechanics is employed to derive the result. The analysis is conducted through evaluation of the entropy, an exponential rate of the number of combinations of variables giving a specific value…

    Submitted 10 September, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: 39 pages, 14 figures

  15. arXiv:1802.10254  [pdf, ps, other]

    stat.ML cond-mat.dis-nn stat.ME

    Semi-Analytic Resampling in Lasso

    Authors: Tomoyuki Obuchi, Yoshiyuki Kabashima

    Abstract: An approximate method for conducting resampling in Lasso, the $\ell_1$ penalized linear regression, in a semi-analytic manner is developed, whereby the average over the resampled datasets is directly computed without repeated numerical sampling, thus enabling an inference free of the statistical fluctuations due to sampling finiteness, as well as a significant reduction of computational time. The…

    Submitted 10 December, 2018; v1 submitted 27 February, 2018; originally announced February 2018.

    Comments: 33 pages, 10 figures, MATLAB codes implementing the proposed method are distributed in https://github.com/T-Obuchi/AMPR_lasso_matlab

  16. arXiv:1711.05420  [pdf, ps, other]

    stat.ML cond-mat.dis-nn

    Accelerating Cross-Validation in Multinomial Logistic Regression with $\ell_1$-Regularization

    Authors: Tomoyuki Obuchi, Yoshiyuki Kabashima

    Abstract: We develop an approximate formula for evaluating a cross-validation estimator of predictive likelihood for multinomial logistic regression regularized by an $\ell_1$-norm. This allows us to avoid repeated optimizations required for literally conducting cross-validation; hence, the computational time can be significantly reduced. The formula is derived through a perturbative approach employing the…

    Submitted 18 September, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: 30 pages, 9 figures. MATLAB and python codes implementing the formula derived in the manuscript are distributed in https://github.com/T-Obuchi/AcceleratedCVonMLR_matlab and https://github.com/T-Obuchi/AcceleratedCVonMLR_python

  17. arXiv:1611.07197  [pdf, ps, other]

    stat.ME astro-ph.GA cond-mat.dis-nn

    Accelerating cross-validation with total variation and its application to super-resolution imaging

    Authors: Tomoyuki Obuchi, Shiro Ikeda, Kazunori Akiyama, Yoshiyuki Kabashima

    Abstract: We develop an approximation formula for the cross-validation error (CVE) of a sparse linear regression penalized by $\ell_1$-norm and total variation terms, which is based on a perturbative expansion utilizing the largeness of both the data dimensionality and the model. The developed formula allows us to reduce the necessary computational cost of the CVE evaluation significantly. The practicality…

    Submitted 20 November, 2017; v1 submitted 22 November, 2016; originally announced November 2016.

    Comments: 14 pages, 4 figures. A Matlab package implementing the approximation formula is available from https://github.com/T-Obuchi/AcceleratedCVon2DTVLR

    Journal ref: PLoS ONE 12(12): e0188012 (2017)

  18. arXiv:1610.07733  [pdf, ps, other]

    stat.ML cs.LG

    Approximate cross-validation formula for Bayesian linear regression

    Authors: Yoshiyuki Kabashima, Tomoyuki Obuchi, Makoto Uemura

    Abstract: Cross-validation (CV) is a technique for evaluating the ability of statistical models/learning systems based on a given data set. Despite its wide applicability, the rather heavy computational cost can prevent its use as the system size grows. To resolve this difficulty in the case of Bayesian linear regression, we develop a formula for evaluating the leave-one-out CV error approximately without a…

    Submitted 25 October, 2016; originally announced October 2016.

    Comments: 5 pages, 2 figures, invited paper for Allerton2016 conference

  19. arXiv:1603.01399  [pdf, ps, other]

    cs.IT cond-mat.dis-nn cond-mat.stat-mech stat.ME

    Sampling approach to sparse approximation problem: determining degrees of freedom by simulated annealing

    Authors: Tomoyuki Obuchi, Yoshiyuki Kabashima

    Abstract: The approximation of a high-dimensional vector by a small combination of column vectors selected from a fixed matrix has been actively debated in several different disciplines. In this paper, a sampling approach based on the Monte Carlo method is presented as an efficient solver for such problems. Especially, the use of simulated annealing (SA), a metaheuristic optimization algorithm, for determin…

    Submitted 4 October, 2016; v1 submitted 4 March, 2016; originally announced March 2016.

    Comments: 5 pages, 3 figures, Proceedings of Eusipco 2016

  20. arXiv:1503.02802  [pdf, ps, other]

    cond-mat.stat-mech stat.ME

    Learning probabilities from random observables in high dimensions: the maximum entropy distribution and others

    Authors: Tomoyuki Obuchi, Simona Cocco, Rémi Monasson

    Abstract: We consider the problem of learning a target probability distribution over a set of $N$ binary variables from the knowledge of the expectation values (with this target distribution) of $M$ observables, drawn uniformly at random. The space of all probability distributions compatible with these $M$ expectation values within some fixed accuracy, called version space, is studied. We introduce a biased…

    Submitted 21 July, 2015; v1 submitted 10 March, 2015; originally announced March 2015.

    Comments: 30 pages, 13 figures

  21. arXiv:1412.7012  [pdf, ps, other]

    stat.ML cond-mat.dis-nn cs.CV

    Boltzmann-Machine Learning of Prior Distributions of Binarized Natural Images

    Authors: Tomoyuki Obuchi, Hirokazu Koma, Muneki Yasuda

    Abstract: Prior distributions of binarized natural images are learned by using a Boltzmann machine. According to the results of this study, there emerges a structure with two sublattices in the interactions, and the nearest-neighbor and next-nearest-neighbor interactions correspondingly take two discriminative values, which reflects the individual characteristics of the three sets of pictures that we process.…

    Submitted 23 October, 2016; v1 submitted 15 December, 2014; originally announced December 2014.

    Comments: 32 pages, 33 figures

    Journal ref: J. Phys. Soc. Jpn. 85 (2016) 114803