Showing 1–16 of 16 results for author: Takeda, A

  1. arXiv:2506.06990  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Modified K-means Algorithm with Local Optimality Guarantees

    Authors: Mingyi Li, Michael R. Metel, Akiko Takeda

    Abstract: The K-means algorithm is one of the most widely studied clustering algorithms in machine learning. While extensive research has focused on its ability to achieve a globally optimal solution, there still lacks a rigorous analysis of its local optimality guarantees. In this paper, we first present conditions under which the K-means algorithm converges to a locally optimal solution. Based on this, we…

    Submitted 11 June, 2025; v1 submitted 8 June, 2025; originally announced June 2025.

    Comments: ICML 2025

  2. arXiv:2505.18909  [pdf, ps, other]

    stat.ML cs.LG

    On the Role of Label Noise in the Feature Learning Process

    Authors: Andi Han, Wei Huang, Zhanpeng Zhou, Gang Niu, Wuyang Chen, Junchi Yan, Akiko Takeda, Taiji Suzuki

    Abstract: Deep learning with noisy labels presents significant challenges. In this work, we theoretically characterize the role of label noise from a feature learning perspective. Specifically, we consider a signal-noise data distribution, where each sample comprises a label-dependent signal and label-independent noise, and rigorously analyze the training dynamics of a two-layer convolutional neural network…

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML 2025

  3. arXiv:2505.12378  [pdf, ps, other]

    math.OC cs.LG stat.ML

    Efficient Optimization with Orthogonality Constraint: a Randomized Riemannian Submanifold Method

    Authors: Andi Han, Pierre-Louis Poirion, Akiko Takeda

    Abstract: Optimization with orthogonality constraints frequently arises in various fields such as machine learning. Riemannian optimization offers a powerful framework for solving these problems by equipping the constraint set with a Riemannian manifold structure and performing optimization intrinsically on the manifold. This approach typically involves computing a search direction in the tangent space and…

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML 2025

  4. arXiv:2402.03883  [pdf, other]

    math.OC cs.LG stat.ML

    A Framework for Bilevel Optimization on Riemannian Manifolds

    Authors: Andi Han, Bamdev Mishra, Pratik Jawanpuria, Akiko Takeda

    Abstract: Bilevel optimization has gained prominence in various applications. In this study, we introduce a framework for solving bilevel optimization problems, where the variables in both the lower and upper levels are constrained on Riemannian manifolds. We present several hypergradient estimation strategies on manifolds and analyze their estimation errors. Furthermore, we provide comprehensive convergenc…

    Submitted 2 November, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  5. arXiv:2203.10215  [pdf, other]

    math.OC math.PR stat.ML

    Convergence Error Analysis of Reflected Gradient Langevin Dynamics for Globally Optimizing Non-Convex Constrained Problems

    Authors: Kanji Sato, Akiko Takeda, Reiichiro Kawai, Taiji Suzuki

    Abstract: Gradient Langevin dynamics and a variety of its variants have attracted increasing attention owing to their convergence towards the global optimal solution, initially in the unconstrained convex framework while recently even in convex constrained non-convex problems. In the present work, we extend those frameworks to non-convex problems on a non-convex feasible region with a global optimization al…

    Submitted 13 August, 2024; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: 16 pages, 10 figures

  6. Theory and Algorithms for Shapelet-based Multiple-Instance Learning

    Authors: Daiki Suehiro, Kohei Hatano, Eiji Takimoto, Shuji Yamamoto, Kenichi Bannai, Akiko Takeda

    Abstract: We propose a new formulation of Multiple-Instance Learning (MIL), in which a unit of data consists of a set of instances called a bag. The goal is to find a good classifier of bags based on the similarity with a "shapelet" (or pattern), where the similarity of a bag with a shapelet is the maximum similarity of instances in the bag. In previous work, some of the training instances are chosen as sha…

    Submitted 13 October, 2020; v1 submitted 31 May, 2020; originally announced June 2020.

    Comments: The full version of this paper is published in Neural Computation. arXiv admin note: substantial text overlap with arXiv:1811.08084

  7. arXiv:2002.06501  [pdf, other]

    stat.ML cs.AI cs.LG

    Convex Fairness Constrained Model Using Causal Effect Estimators

    Authors: Hikaru Ogura, Akiko Takeda

    Abstract: Recent years have seen much research on fairness in machine learning. Here, mean difference (MD) or demographic parity is one of the most popular measures of fairness. However, MD quantifies not only discrimination but also explanatory bias which is the difference of outcomes justified by explanatory features. In this paper, we devise novel models, called FairCEEs, which remove discrimination whil…

    Submitted 15 February, 2020; originally announced February 2020.

    Comments: 10 pages, 5 figures, Accepted for the 2nd Workshop on Fairness, Accountability, Transparency, Ethics and Society on the Web (FATES on the Web 2020), held in conjunction with the WWW'20

  8. arXiv:1811.08084  [pdf, other]

    cs.LG stat.ML

    Multiple-Instance Learning by Boosting Infinitely Many Shapelet-based Classifiers

    Authors: Daiki Suehiro, Kohei Hatano, Eiji Takimoto, Shuji Yamamoto, Kenichi Bannai, Akiko Takeda

    Abstract: We propose a new formulation of Multiple-Instance Learning (MIL). In typical MIL settings, a unit of data is given as a set of instances called a bag and the goal is to find a good classifier of bags based on similarity from a single or finitely many "shapelets" (or patterns), where the similarity of the bag from a shapelet is the maximum similarity of instances in the bag. Classifiers based on a…

    Submitted 9 December, 2018; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: The preliminary version of this paper is arXiv:1709.01300, which focuses only on shapelet-based time-series classification and not on Multiple-Instance Learning. Note that the preliminary version has not been published

  9. arXiv:1806.05924  [pdf, other]

    stat.AP stat.CO stat.ML

    Robust Bayesian Model Selection for Variable Clustering with the Gaussian Graphical Model

    Authors: Daniel Andrade, Akiko Takeda, Kenji Fukumizu

    Abstract: Variable clustering is important for explanatory analysis. However, only few dedicated methods for variable clustering with the Gaussian graphical model have been proposed. Even more severe, small insignificant partial correlations due to noise can dramatically change the clustering result when evaluating for example with the Bayesian Information Criteria (BIC). In this work, we try to address thi…

    Submitted 15 June, 2018; originally announced June 2018.

  10. arXiv:1804.07213  [pdf, other]

    math.OC stat.ML

    A refined convergence analysis of pDCA$_e$ with applications to simultaneous sparse recovery and outlier detection

    Authors: Tianxiang Liu, Ting Kei Pong, Akiko Takeda

    Abstract: We consider the problem of minimizing a difference-of-convex (DC) function, which can be written as the sum of a smooth convex function with Lipschitz gradient, a proper closed convex function and a continuous possibly nonsmooth concave function. We refine the convergence analysis in [38] for the proximal DC algorithm with extrapolation (pDCA$_e$) and show that the whole sequence generated by the…

    Submitted 19 April, 2018; originally announced April 2018.

  11. arXiv:1711.07511  [pdf, other]

    stat.ML cs.LG math.OC

    Optimistic Robust Optimization With Applications To Machine Learning

    Authors: Matthew Norton, Akiko Takeda, Alexander Mafusalov

    Abstract: Robust Optimization has traditionally taken a pessimistic, or worst-case viewpoint of uncertainty which is motivated by a desire to find sets of optimal policies that maintain feasibility under a variety of operating conditions. In this paper, we explore an optimistic, or best-case view of uncertainty and show that it can be a fruitful approach. We show that these techniques can be used to address…

    Submitted 20 November, 2017; originally announced November 2017.

  12. arXiv:1710.05778  [pdf, other]

    math.OC stat.ML

    A successive difference-of-convex approximation method for a class of nonconvex nonsmooth optimization problems

    Authors: Tianxiang Liu, Ting Kei Pong, Akiko Takeda

    Abstract: We consider a class of nonconvex nonsmooth optimization problems whose objective is the sum of a smooth function and a finite number of nonnegative proper closed possibly nonsmooth functions (whose proximal mappings are easy to compute), some of which are further composed with linear maps. This kind of problems arises naturally in various applications when different regularizers are introduced for…

    Submitted 26 May, 2018; v1 submitted 16 October, 2017; originally announced October 2017.

  13. arXiv:1703.03216  [pdf, other]

    stat.ML

    Trimmed Density Ratio Estimation

    Authors: Song Liu, Akiko Takeda, Taiji Suzuki, Kenji Fukumizu

    Abstract: Density ratio estimation is a vital tool in both machine learning and statistical community. However, due to the unbounded nature of density ratio, the estimation procedure can be vulnerable to corrupted data points, which often pushes the estimated ratio toward infinity. In this paper, we present a robust estimator which automatically identifies and trims outliers. The proposed estimator has a co…

    Submitted 6 November, 2017; v1 submitted 9 March, 2017; originally announced March 2017.

    Comments: Made minor revisions. Restructured the introductory sections

  14. arXiv:1409.0934  [pdf, other]

    stat.ML cs.LG

    Breakdown Point of Robust Support Vector Machine

    Authors: Takafumi Kanamori, Shuhei Fujiwara, Akiko Takeda

    Abstract: The support vector machine (SVM) is one of the most successful learning methods for solving classification problems. Despite its popularity, SVM has a serious drawback, that is sensitivity to outliers in training samples. The penalty on misclassification is defined by a convex loss called the hinge loss, and the unboundedness of the convex loss causes the sensitivity to outliers. To deal wit…

    Submitted 2 September, 2014; originally announced September 2014.

    Comments: 27 pages

  15. arXiv:1206.4599  [pdf]

    cs.LG stat.ML

    A Unified Robust Classification Model

    Authors: Akiko Takeda, Hiroyuki Mitsugi, Takafumi Kanamori

    Abstract: A wide variety of machine learning algorithms such as support vector machine (SVM), minimax probability machine (MPM), and Fisher discriminant analysis (FDA), exist for binary classification. The purpose of this paper is to provide a unified classification model that includes the above models through a robust optimization approach. This unified model has several benefits. One is that the extension…

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  16. arXiv:1204.6583  [pdf, ps, other]

    stat.ML cs.LG

    A Conjugate Property between Loss Functions and Uncertainty Sets in Classification Problems

    Authors: Takafumi Kanamori, Akiko Takeda, Taiji Suzuki

    Abstract: In binary classification problems, mainly two approaches have been proposed; one is loss function approach and the other is uncertainty set approach. The loss function approach is applied to major learning algorithms such as support vector machine (SVM) and boosting methods. The loss function represents the penalty of the decision function on the training samples. In the learning algorithm, the em…

    Submitted 30 April, 2012; originally announced April 2012.

    Comments: 41 pages, 4 figures. The shorter version is accepted by COLT2012