Skip to main content

Showing 1–27 of 27 results for author: Kanamori, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.16923  [pdf, ps, other

    stat.ML cs.LG

    TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation

    Authors: Yuhui Zhang, Dongshen Wu, Yuichiro Wada, Takafumi Kanamori

    Abstract: A reliable uncertainty estimation method is the foundation of many modern out-of-distribution (OOD) detectors, which are critical for safe deployments of deep learning models in the open world. In this work, we propose TULiP, a theoretically-driven post-hoc uncertainty estimator for OOD detection. Our approach considers a hypothetical perturbation applied to the network before convergence. Based o… ▽ More

    Submitted 23 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

  2. arXiv:2410.20780  [pdf, other

    stat.ML cs.CV cs.LG

    Scaling-based Data Augmentation for Generative Models and its Theoretical Extension

    Authors: Yoshitaka Koike, Takumi Nakagawa, Hiroki Waida, Takafumi Kanamori

    Abstract: This paper studies stable learning methods for generative models that enable high-quality data generation. Noise injection is commonly used to stabilize learning. However, selecting a suitable noise distribution is challenging. Diffusion-GAN, a recently developed method, addresses this by using the diffusion process with a timestep-dependent discriminator. We investigate Diffusion-GAN and reveal t… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

  3. arXiv:2410.20760  [pdf, ps, other

    stat.ML cs.LG

    Robust Estimation for Kernel Exponential Families with Smoothed Total Variation Distances

    Authors: Takafumi Kanamori, Kodai Yokoyama, Takayuki Kawashima

    Abstract: In statistical inference, we commonly assume that samples are independent and identically distributed from a probability distribution included in a pre-specified statistical model. However, such an assumption is often violated in practice. Even an unexpected extreme sample called an {\it outlier} can significantly impact classical estimators. Robust statistics studies how to construct reliable sta… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

  4. arXiv:2309.12450  [pdf, other

    stat.ML cs.LG

    A Convex Framework for Confounding Robust Inference

    Authors: Kei Ishikawa, Niao He, Takafumi Kanamori

    Abstract: We study policy evaluation of offline contextual bandits subject to unobserved confounders. Sensitivity analysis methods are commonly used to estimate the policy value under the worst-case confounding over a given uncertainty set. However, existing work often resorts to some coarse relaxation of the uncertainty set for the sake of tractability, leading to overly conservative estimation of the poli… ▽ More

    Submitted 1 November, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: This is an extended version of the following work https://proceedings.mlr.press/v206/ishikawa23a.html. arXiv admin note: text overlap with arXiv:2302.13348

  5. arXiv:2304.09552  [pdf, other

    stat.ML cs.LG

    Denoising Cosine Similarity: A Theory-Driven Approach for Efficient Representation Learning

    Authors: Takumi Nakagawa, Yutaro Sanada, Hiroki Waida, Yuhui Zhang, Yuichiro Wada, Kōsaku Takanashi, Tomonori Yamada, Takafumi Kanamori

    Abstract: Representation learning has been increasing its impact on the research and practice of machine learning, since it enables to learn representations that can apply to various downstream tasks efficiently. However, recent works pay little attention to the fact that real-world datasets used during the stage of representation learning are commonly contaminated by noise, which can degrade the quality of… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  6. arXiv:2304.00395  [pdf, other

    cs.LG stat.ML

    Towards Understanding the Mechanism of Contrastive Learning via Similarity Structure: A Theoretical Analysis

    Authors: Hiroki Waida, Yuichiro Wada, Léo Andéol, Takumi Nakagawa, Yuhui Zhang, Takafumi Kanamori

    Abstract: Contrastive learning is an efficient approach to self-supervised representation learning. Although recent studies have made progress in the theoretical understanding of contrastive learning, the investigation of how to characterize the clusters of the learned representations is still limited. In this paper, we aim to elucidate the characterization from theoretical perspectives. To this end, we con… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

  7. arXiv:2303.03036  [pdf, other

    stat.ML cs.LG

    Deep Clustering with a Constraint for Topological Invariance based on Symmetric InfoNCE

    Authors: Yuhui Zhang, Yuichiro Wada, Hiroki Waida, Kaito Goto, Yusaku Hino, Takafumi Kanamori

    Abstract: We consider the scenario of deep clustering, in which the available prior knowledge is limited. In this scenario, few existing state-of-the-art deep clustering methods can perform well for both non-complex topology and complex topology datasets. To address the problem, we propose a constraint utilizing symmetric InfoNCE, which helps an objective of deep clustering method in the scenario train the… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 48 pages, 6 figures

  8. Learning Domain Invariant Representations by Joint Wasserstein Distance Minimization

    Authors: Léo Andeol, Yusei Kawakami, Yuichiro Wada, Takafumi Kanamori, Klaus-Robert Müller, Grégoire Montavon

    Abstract: Domain shifts in the training data are common in practical applications of machine learning; they occur for instance when the data is coming from different sources. Ideally, a ML model should work well independently of these shifts, for example, by learning a domain-invariant representation. However, common ML losses do not give strong guarantees on how consistently the ML model performs for diffe… ▽ More

    Submitted 21 August, 2023; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 23 pages + supplement

  9. arXiv:1910.08280  [pdf, other

    stat.ML cs.LG

    Robust modal regression with direct log-density derivative estimation

    Authors: Hiroaki Sasaki, Tomoya Sakai, Takafumi Kanamori

    Abstract: Modal regression is aimed at estimating the global mode (i.e., global maximum) of the conditional density function of the output variable given input variables, and has led to regression methods robust against heavy-tailed or skewed noises. The conditional mode is often estimated through maximization of the modal regression risk (MRR). In order to apply a gradient method for the maximization, the… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

  10. arXiv:1910.03834  [pdf, other

    stat.ML cs.LG math.ST

    Estimating Density Models with Truncation Boundaries using Score Matching

    Authors: Song Liu, Takafumi Kanamori, Daniel J. Williams

    Abstract: Truncated densities are probability density functions defined on truncated domains. They share the same parametric form with their non-truncated counterparts up to a normalizing constant. Since the computation of their normalizing constants is usually infeasible, Maximum Likelihood Estimation cannot be easily applied to estimate truncated density models. Score Matching (SM) is a powerful tool for… ▽ More

    Submitted 20 April, 2022; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: to be published in the Journal of Machine Learning Research

  11. arXiv:1901.07710  [pdf, other

    stat.ML cs.LG

    Unified estimation framework for unnormalized models with statistical efficiency

    Authors: Masatoshi Uehara, Takafumi Kanamori, Takashi Takenouchi, Takeru Matsuda

    Abstract: The parameter estimation of unnormalized models is a challenging problem. The maximum likelihood estimation (MLE) is computationally infeasible for these models since normalizing constants are not explicitly calculated. Although some consistent estimators have been proposed earlier, the problem of statistical efficiency remains. In this study, we propose a unified, statistically efficient estimati… ▽ More

    Submitted 5 June, 2020; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: To appear at AISTATS 2020

  12. arXiv:1806.00569  [pdf, other

    stat.ML cs.LG

    Variable Selection for Nonparametric Learning with Power Series Kernels

    Authors: Kota Matsui, Wataru Kumagai, Kenta Kanamori, Mitsuaki Nishikimi, Takafumi Kanamori

    Abstract: In this paper, we propose a variable selection method for general nonparametric kernel-based estimation. The proposed method consists of two-stage estimation: (1) construct a consistent estimator of the target function, (2) approximate the estimator using a few variables by l1-type penalized estimation. We see that the proposed method can be applied to various kernel nonparametric estimation such… ▽ More

    Submitted 4 December, 2018; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: 24 pages, 3 tables, 2 figures

  13. arXiv:1805.07454  [pdf, other

    stat.ML cs.LG

    Fisher Efficient Inference of Intractable Models

    Authors: Song Liu, Takafumi Kanamori, Wittawat Jitkrittum, Yu Chen

    Abstract: Maximum Likelihood Estimators (MLE) has many good properties. For example, the asymptotic variance of MLE solution attains equality of the asymptotic Cram{é}r-Rao lower bound (efficiency bound), which is the minimum possible variance for an unbiased estimator. However, obtaining such MLE solution requires calculating the likelihood function which may not be tractable due to the normalization term… ▽ More

    Submitted 1 November, 2019; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: Fixed typos in the text. To appear in Neural Information Process 2019

  14. arXiv:1707.01711  [pdf, other

    stat.ML

    Mode-Seeking Clustering and Density Ridge Estimation via Direct Estimation of Density-Derivative-Ratios

    Authors: Hiroaki Sasaki, Takafumi Kanamori, Aapo Hyvärinen, Gang Niu, Masashi Sugiyama

    Abstract: Modes and ridges of the probability density function behind observed data are useful geometric features. Mode-seeking clustering assigns cluster labels by associating data samples with the nearest modes, and estimation of density ridges enables us to find lower-dimensional structures hidden in data. A key technical challenge both in mode-seeking clustering and density ridge estimation is accurate… ▽ More

    Submitted 30 March, 2018; v1 submitted 6 July, 2017; originally announced July 2017.

  15. arXiv:1409.3912  [pdf, other

    stat.ML cs.LG

    Parallel Distributed Block Coordinate Descent Methods based on Pairwise Comparison Oracle

    Authors: Kota Matsui, Wataru Kumagai, Takafumi Kanamori

    Abstract: This paper provides a block coordinate descent algorithm to solve unconstrained optimization problems. In our algorithm, computation of function values or gradients is not required. Instead, pairwise comparison of function values is used. Our algorithm consists of two steps; one is the direction estimate step and the other is the search step. Both steps require only pairwise comparison of function… ▽ More

    Submitted 13 September, 2014; originally announced September 2014.

  16. arXiv:1409.0934  [pdf, other

    stat.ML cs.LG

    Breakdown Point of Robust Support Vector Machine

    Authors: Takafumi Kanamori, Shuhei Fujiwara, Akiko Takeda

    Abstract: The support vector machine (SVM) is one of the most successful learning methods for solving classification problems. Despite its popularity, SVM has a serious drawback, that is sensitivity to outliers in training samples. The penalty on misclassification is defined by a convex loss called the hinge loss, and the unboundedness of the convex loss causes the sensitivity to outliers. To deal wit… ▽ More

    Submitted 2 September, 2014; originally announced September 2014.

    Comments: 27 pages

  17. arXiv:1305.2473  [pdf, ps, other

    math.ST stat.ML

    Affine Invariant Divergences associated with Composite Scores and its Applications

    Authors: Takafumi Kanamori, Hironori Fujisawa

    Abstract: In statistical analysis, measuring a score of predictive performance is an important task. In many scientific fields, appropriate scores were tailored to tackle the problems at hand. A proper score is a popular tool to obtain statistically consistent forecasts. Furthermore, a mathematical characterization of the proper score was studied. As a result, it was revealed that the proper score correspon… ▽ More

    Submitted 11 May, 2013; originally announced May 2013.

    Comments: 24 pages

  18. arXiv:1207.0099  [pdf, ps, other

    cs.LG stat.ML

    Density-Difference Estimation

    Authors: Masashi Sugiyama, Takafumi Kanamori, Taiji Suzuki, Marthinus Christoffel du Plessis, Song Liu, Ichiro Takeuchi

    Abstract: We address the problem of estimating the difference between two probability densities. A naive approach is a two-step procedure of first estimating two densities separately and then computing their difference. However, such a two-step procedure does not necessarily work well because the first step is performed without regard to the second step and thus a small error incurred in the first stage can… ▽ More

    Submitted 30 June, 2012; originally announced July 2012.

  19. arXiv:1206.4599  [pdf

    cs.LG stat.ML

    A Unified Robust Classification Model

    Authors: Akiko Takeda, Hiroyuki Mitsugi, Takafumi Kanamori

    Abstract: A wide variety of machine learning algorithms such as support vector machine (SVM), minimax probability machine (MPM), and Fisher discriminant analysis (FDA), exist for binary classification. The purpose of this paper is to provide a unified classification model that includes the above models through a robust optimization approach. This unified model has several benefits. One is that the extension… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  20. arXiv:1204.6583  [pdf, ps, other

    stat.ML cs.LG

    A Conjugate Property between Loss Functions and Uncertainty Sets in Classification Problems

    Authors: Takafumi Kanamori, Akiko Takeda, Taiji Suzuki

    Abstract: In binary classification problems, mainly two approaches have been proposed; one is loss function approach and the other is uncertainty set approach. The loss function approach is applied to major learning algorithms such as support vector machine (SVM) and boosting methods. The loss function represents the penalty of the decision function on the training samples. In the learning algorithm, the em… ▽ More

    Submitted 30 April, 2012; originally announced April 2012.

    Comments: 41 pages, 4 figures. The shorter version is accepted by COLT2012

  21. arXiv:1204.3965  [pdf, ps, other

    stat.ML

    Semi-Supervised learning with Density-Ratio Estimation

    Authors: Masanori Kawakita, Takafumi Kanamori

    Abstract: In this paper, we study statistical properties of semi-supervised learning, which is considered as an important problem in the community of machine learning. In the standard supervised learning, only the labeled data is observed. The classification and regression problems are formalized as the supervised learning. In semi-supervised learning, unlabeled data is also obtained in addition to labeled… ▽ More

    Submitted 17 April, 2012; originally announced April 2012.

    Comments: 17 pages

  22. arXiv:1106.4729  [pdf, ps, other

    stat.ML math.ST stat.ME

    Relative Density-Ratio Estimation for Robust Distribution Comparison

    Authors: Makoto Yamada, Taiji Suzuki, Takafumi Kanamori, Hirotaka Hachiya, Masashi Sugiyama

    Abstract: Divergence estimators based on direct approximation of density-ratios without going through separate approximation of numerator and denominator densities have been successfully applied to machine learning tasks that involve distribution comparison such as outlier detection, transfer learning, and two-sample homogeneity test. However, since density-ratio functions often possess high fluctuation, di… ▽ More

    Submitted 23 June, 2011; originally announced June 2011.

  23. arXiv:1010.4945  [pdf, ps, other

    stat.ML

    f-divergence estimation and two-sample homogeneity test under semiparametric density-ratio models

    Authors: Takafumi Kanamori, Taiji Suzuki, Masashi Sugiyama

    Abstract: A density ratio is defined by the ratio of two probability densities. We study the inference problem of density ratios and apply a semi-parametric density-ratio estimator to the two-sample homogeneity test. In the proposed test procedure, the f-divergence between two probability densities is estimated using a density-ratio estimator. The f-divergence estimator is then exploited for the two-sample… ▽ More

    Submitted 24 October, 2010; originally announced October 2010.

    Comments: 28 pages, 3 tables

  24. arXiv:1010.2847  [pdf, ps, other

    stat.CO

    A Bregman Extension of quasi-Newton updates I: An Information Geometrical framework

    Authors: Takafumi Kanamori, Atsumi Ohara

    Abstract: We study quasi-Newton methods from the viewpoint of information geometry induced associated with Bregman divergences. Fletcher has studied a variational problem which derives the approximate Hessian update formula of the quasi-Newton methods. We point out that the variational problem is identical to optimization of the Kullback-Leibler divergence, which is a discrepancy measure between two probabi… ▽ More

    Submitted 14 October, 2010; originally announced October 2010.

    Comments: 34 pages, 4 figures

  25. arXiv:1010.2846  [pdf, ps, other

    stat.CO

    A Bregman Extension of quasi-Newton updates II: Convergence and Robustness Properties

    Authors: Takafumi Kanamori, Atsumi Ohara

    Abstract: We propose an extension of quasi-Newton methods, and investigate the convergence and the robustness properties of the proposed update formulae for the approximate Hessian matrix. Fletcher has studied a variational problem which derives the approximate Hessian update formula of the quasi-Newton methods. We point out that the variational problem is identical to optimization of the Kullback-Leibler d… ▽ More

    Submitted 14 October, 2010; originally announced October 2010.

    Comments: 39 pages, 1 figures, 3 tables

  26. arXiv:1004.4041  [pdf, ps, other

    stat.CO q-bio.QM

    Pooling Design and Bias Correction in DNA Library Screening

    Authors: Takafumi Kanamori, Hiroaki Uehara, Masakazu Jimbo

    Abstract: We study the group test for DNA library screening based on probabilistic approach. Group test is a method of detecting a few positive items from among a large number of items, and has wide range of applications. In DNA library screening, positive item corresponds to the clone having a specified DNA segment, and it is necessary to identify and isolate the positive clones for compiling the librarie… ▽ More

    Submitted 25 April, 2010; v1 submitted 22 April, 2010; originally announced April 2010.

    Comments: 18 pages, 1 figure, 8 tables, submitted.

  27. arXiv:0912.2800  [pdf, ps, other

    stat.ML math.ST

    Condition Number Analysis of Kernel-based Density Ratio Estimation

    Authors: Takafumi Kanamori, Taiji Suzuki, Masashi Sugiyama

    Abstract: The ratio of two probability densities can be used for solving various machine learning tasks such as covariate shift adaptation (importance sampling), outlier detection (likelihood-ratio test), and feature selection (mutual information). Recently, several methods of directly estimating the density ratio have been developed, e.g., kernel mean matching, maximum likelihood density ratio estimation… ▽ More

    Submitted 15 December, 2009; originally announced December 2009.

    Comments: 37 pages, 1 figure, submitted