Skip to main content

Showing 1–13 of 13 results for author: Hayakawa, S

Searching in archive math. Search in all archives.
.
  1. arXiv:2410.08709  [pdf, other

    cs.LG math.NA stat.ML

    Distillation of Discrete Diffusion through Dimensional Correlations

    Authors: Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi, Hiromi Wakaki, Yuki Mitsufuji

    Abstract: Diffusion models have demonstrated exceptional performances in various fields of generative modeling, but suffer from slow sampling speed due to their iterative nature. While this issue is being addressed in continuous domains, discrete diffusion models face unique challenges, particularly in capturing dependencies between elements (e.g., pixel relationships in image, sequential dependencies in la… ▽ More

    Submitted 8 May, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: 39 pages, ICML 2025 accepted

  2. arXiv:2404.12219  [pdf, other

    cs.LG math.NA stat.ML

    A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Saad Hamid, Harald Oberhauser, Michael A. Osborne

    Abstract: Parallelisation in Bayesian optimisation is a common strategy but faces several challenges: the need for flexibility in acquisition functions and kernel choices, flexibility dealing with discrete and continuous variables simultaneously, model misspecification, and lastly fast massive parallelisation. To address these challenges, we introduce a versatile and modular framework for batch Bayesian opt… ▽ More

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: This work is the journal extension of the workshop paper (arXiv:2301.11832) and AISTATS paper (arXiv:2306.05843). 48 pages, 11 figures

    MSC Class: 62C10; 62F15

  3. arXiv:2306.05843  [pdf, other

    cs.LG cs.AI math.NA stat.CO stat.ML

    Adaptive Batch Sizes for Active Learning A Probabilistic Numerics Approach

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Xingchen Wan, Vu Nguyen, Harald Oberhauser, Michael A. Osborne

    Abstract: Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed -- larger batches are more costly, smaller batches lead to slower wall-clock run-times -- and the trade-off may change over the run (larger batches are often preferable earlier). To address… ▽ More

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at AISTATS 2024. 33 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: AISTATS 238, 496-504, 2024

  4. arXiv:2301.11832  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    SOBER: Highly Parallel Bayesian Optimization and Bayesian Quadrature over Discrete and Mixed Spaces

    Authors: Masaki Adachi, Satoshi Hayakawa, Saad Hamid, Martin Jørgensen, Harald Oberhauser, Micheal A. Osborne

    Abstract: Batch Bayesian optimisation and Bayesian quadrature have been shown to be sample-efficient methods of performing optimisation and quadrature where expensive-to-evaluate objective functions can be queried in parallel. However, current methods do not scale to large batch sizes -- a frequent desideratum in practice (e.g. drug discovery or simulation-based inference). We present a novel algorithm, SOB… ▽ More

    Submitted 5 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 34 pages, 12 figures

    MSC Class: 62C10; 62F15

  5. arXiv:2301.09517  [pdf, other

    math.NA cs.LG stat.ML

    Sampling-based Nyström Approximation and Kernel Quadrature

    Authors: Satoshi Hayakawa, Harald Oberhauser, Terry Lyons

    Abstract: We analyze the Nyström approximation of a positive definite kernel associated with a probability measure. We first prove an improved error bound for the conventional Nyström approximation with i.i.d. sampling and singular-value decomposition in the continuous regime; the proof techniques are borrowed from statistical learning theory. We further introduce a refined selection of subspaces in Nyström… ▽ More

    Submitted 22 May, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: 22 pages, ICML 2023 camera-ready version. Typos fixed

  6. arXiv:2210.05787  [pdf, ps, other

    math.PR math.NA

    Hypercontractivity Meets Random Convex Hulls: Analysis of Randomized Multivariate Cubatures

    Authors: Satoshi Hayakawa, Harald Oberhauser, Terry Lyons

    Abstract: Given a probability measure $μ$ on a set $\mathcal{X}$ and a vector-valued function $\varphi$, a common problem is to construct a discrete probability measure on $\mathcal{X}$ such that the push-forward of these two probability measures under $\varphi$ is the same. This construction is at the heart of numerical integration methods that run under various names such as quadrature, cubature, or recom… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: 20 pages

    Journal ref: Proceedings of the Royal Society A, 2023

  7. arXiv:2206.04734  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Harald Oberhauser, Michael A. Osborne

    Abstract: Calculation of Bayesian posteriors and model evidences typically requires numerical integration. Bayesian quadrature (BQ), a surrogate-model-based approach to numerical integration, is capable of superb sample efficiency, but its lack of parallelisation has hindered its practical applications. In this work, we propose a parallelised (batch) BQ method, employing techniques from kernel quadrature, t… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 38 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: NeurIPS 35, 16533--16547 (2022)

  8. arXiv:2107.09597  [pdf, other

    math.NA cs.LG stat.ML

    Positively Weighted Kernel Quadrature via Subsampling

    Authors: Satoshi Hayakawa, Harald Oberhauser, Terry Lyons

    Abstract: We study kernel quadrature rules with convex weights. Our approach combines the spectral properties of the kernel with recombination results about point measures. This results in effective algorithms that construct convex quadrature rules using only access to i.i.d. samples from the underlying measure and evaluation of the kernel and that result in a small worst-case error. In addition to our theo… ▽ More

    Submitted 11 October, 2022; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: 29 pages, NeurIPS 2022 camera-ready version

  9. Estimating the probability that a given vector is in the convex hull of a random sample

    Authors: Satoshi Hayakawa, Terry Lyons, Harald Oberhauser

    Abstract: For a $d$-dimensional random vector $X$, let $p_{n, X}(θ)$ be the probability that the convex hull of $n$ independent copies of $X$ contains a given point $θ$. We provide several sharp inequalities regarding $p_{n, X}(θ)$ and $N_X(θ)$ denoting the smallest $n$ for which $p_{n, X}(θ)\ge1/2$. As a main result, we derive the totally general inequality $1/2 \le α_X(θ)N_X(θ)\le 3d + 1$, where $α_X(θ)$… ▽ More

    Submitted 22 March, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: 34 pages

    Journal ref: Probability Theory and Related Fields, 2023

  10. Monte Carlo construction of cubature on Wiener space

    Authors: Satoshi Hayakawa, Ken'ichiro Tanaka

    Abstract: In this paper, we investigate application of mathematical optimization to construction of a cubature formula on Wiener space, which is a weak approximation method of stochastic differential equations introduced by Lyons and Victoir (Cubature on Wiener Space, Proc. R. Soc. Lond. A 460, 169--198). After giving a brief review of the cubature theory on Wiener space, we show that a cubature formula of… ▽ More

    Submitted 19 October, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: 25 pages

    Journal ref: Japan Journal of Industrial and Applied Mathematics, 2022

  11. Monte Carlo Cubature Construction

    Authors: Satoshi Hayakawa

    Abstract: In numerical integration, cubature methods are effective, especially when the integrands can be well-approximated by known test functions, such as polynomials. However, the construction of cubature formulas has not generally been known, and existing examples only represent the particular domains of integrands, such as hypercubes and spheres. In this study, we show that cubature formulas can be con… ▽ More

    Submitted 24 January, 2020; v1 submitted 3 January, 2020; originally announced January 2020.

    Comments: 10 pages

    Journal ref: Japan Journal of Industrial and Applied Mathematics, 2021

  12. Convergence analysis of approximation formulas for analytic functions via duality for potential energy minimization

    Authors: Satoshi Hayakawa, Ken'ichiro Tanaka

    Abstract: We investigate the approximation formulas that were proposed by Tanaka & Sugihara (2019), in weighted Hardy spaces, which are analytic function spaces with certain asymptotic decay. Under the criterion of minimum worst error of $n$-point approximation formulas, we demonstrate that the formulas are nearly optimal. We also obtain the upper bounds of the approximation errors that coincide with the ex… ▽ More

    Submitted 21 October, 2022; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: 17 pages

    Journal ref: Japan Journal of Industrial and Applied Mathematics, 2023

  13. arXiv:1905.09195  [pdf, other

    stat.ML cs.LG math.ST

    On the minimax optimality and superiority of deep neural network learning over sparse parameter spaces

    Authors: Satoshi Hayakawa, Taiji Suzuki

    Abstract: Deep learning has been applied to various tasks in the field of machine learning and has shown superiority to other common procedures such as kernel methods. To provide a better theoretical understanding of the reasons for its success, we discuss the performance of deep learning and other methods on a nonparametric regression problem with a Gaussian noise. Whereas existing theoretical studies of d… ▽ More

    Submitted 20 September, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: 33 pages

    MSC Class: 62G08

    Journal ref: Neural Networks, 2020