Skip to main content

Showing 1–19 of 19 results for author: Kato, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2210.09160  [pdf, other

    stat.ML cs.LG

    Statistical, Robustness, and Computational Guarantees for Sliced Wasserstein Distances

    Authors: Sloan Nietert, Ritwik Sadhu, Ziv Goldfeld, Kengo Kato

    Abstract: Sliced Wasserstein distances preserve properties of classic Wasserstein distances while being more scalable for computation and estimation in high dimensions. The goal of this work is to quantify this scalability from three key aspects: (i) empirical convergence rates; (ii) robustness to data contamination; and (iii) efficient computational methods. For empirical convergence, we derive fast rates… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  2. arXiv:2107.13494  [pdf, ps, other

    math.ST math.PR stat.ML

    Limit Distribution Theory for the Smooth 1-Wasserstein Distance with Applications

    Authors: Ritwik Sadhu, Ziv Goldfeld, Kengo Kato

    Abstract: The smooth 1-Wasserstein distance (SWD) $W_1^σ$ was recently proposed as a means to mitigate the curse of dimensionality in empirical approximation while preserving the Wasserstein structure. Indeed, SWD exhibits parametric convergence rates and inherits the metric and topological structure of the classic Wasserstein distance. Motivated by the above, this work conducts a thorough statistical study… ▽ More

    Submitted 24 February, 2022; v1 submitted 28 July, 2021; originally announced July 2021.

    MSC Class: 62E17; 60F05; 60F17; 62G10; 62F12; 62F40

  3. arXiv:2102.06586  [pdf, ps, other

    econ.EM stat.ME

    Linear programming approach to nonparametric inference under shape restrictions: with an application to regression kink designs

    Authors: Harold D. Chiang, Kengo Kato, Yuya Sasaki, Takuya Ura

    Abstract: We develop a novel method of constructing confidence bands for nonparametric regression functions under shape constraints. This method can be implemented via a linear programming, and it is thus computationally appealing. We illustrate a usage of our proposed method with an application to the regression kink design (RKD). Econometric analyses based on the RKD often suffer from wide confidence inte… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  4. arXiv:2101.04039  [pdf, other

    math.ST stat.ML

    Smooth $p$-Wasserstein Distance: Structure, Empirical Approximation, and Statistical Applications

    Authors: Sloan Nietert, Ziv Goldfeld, Kengo Kato

    Abstract: Discrepancy measures between probability distributions, often termed statistical distances, are ubiquitous in probability theory, statistics and machine learning. To combat the curse of dimensionality when estimating these distances from data, recent work has proposed smoothing out local irregularities in the measured distributions via convolution with a Gaussian kernel. Motivated by the scalabili… ▽ More

    Submitted 17 December, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: updated to match ICML 2021 paper

  5. arXiv:2007.15190  [pdf, other

    stat.ML cs.IT cs.LG

    Quantitative Understanding of VAE as a Non-linearly Scaled Isometric Embedding

    Authors: Akira Nakagawa, Keizo Kato, Taiji Suzuki

    Abstract: Variational autoencoder (VAE) estimates the posterior parameters (mean and variance) of latent variables corresponding to each input data. While it is used for many tasks, the transparency of the model is still an underlying issue. This paper provides a quantitative understanding of VAE property through the differential geometric and information-theoretic interpretations of VAE. According to the R… ▽ More

    Submitted 22 February, 2023; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: Accepted to the International Conference on Machine Learning (ICML) 2021. 40 pages, 29 figures

    ACM Class: I.2.4

  6. arXiv:2006.00952  [pdf, other

    math.ST stat.ME

    Bootstrap inference for quantile-based modal regression

    Authors: Tao Zhang, Kengo Kato, David Ruppert

    Abstract: In this paper, we develop uniform inference methods for the conditional mode based on quantile regression. Specifically, we propose to estimate the conditional mode by minimizing the derivative of the estimated conditional quantile function defined by smoothing the linear quantile regression estimator, and develop two bootstrap methods, a novel pivotal bootstrap and the nonparametric bootstrap, fo… ▽ More

    Submitted 12 April, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: 78 pages

  7. arXiv:1910.04329  [pdf, other

    cs.LG stat.ML

    Rate-Distortion Optimization Guided Autoencoder for Isometric Embedding in Euclidean Latent Space

    Authors: Keizo Kato, Jing Zhou, Tomotake Sasaki, Akira Nakagawa

    Abstract: To analyze high-dimensional and complex data in the real world, deep generative models, such as variational autoencoder (VAE) embed data in a low-dimensional space (latent space) and learn a probabilistic model in the latent space. However, they struggle to accurately reproduce the probability distribution function (PDF) in the input space from that in the latent space. If the embedding were isome… ▽ More

    Submitted 30 August, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: Accepted to the International Conference on Machine Learning (ICML) 2020

    MSC Class: 68T01

  8. arXiv:1908.03152  [pdf, other

    math.ST econ.EM stat.ME

    Analysis of Networks via the Sparse $β$-Model

    Authors: Mingli Chen, Kengo Kato, Chenlei Leng

    Abstract: Data in the form of networks are increasingly available in a variety of areas, yet statistical models allowing for parameter estimates with desirable statistical properties for sparse networks remain scarce. To address this, we propose the Sparse $β$-Model (S$β$M), a new network model that interpolates the celebrated Erdős-Rényi model and the $β$-model that assigns one different parameter to each… ▽ More

    Submitted 17 December, 2020; v1 submitted 8 August, 2019; originally announced August 2019.

    Comments: 36 pages

  9. arXiv:1901.01163  [pdf, ps, other

    math.ST stat.ME

    Approximating high-dimensional infinite-order $U$-statistics: statistical and computational guarantees

    Authors: Yanglei Song, Xiaohui Chen, Kengo Kato

    Abstract: We study the problem of distributional approximations to high-dimensional non-degenerate $U$-statistics with random kernels of diverging orders. Infinite-order $U$-statistics (IOUS) are a useful tool for constructing simultaneous prediction intervals that quantify the uncertainty of ensemble methods such as subbagging and random forests. A major obstacle in using the IOUS is their computational in… ▽ More

    Submitted 15 November, 2019; v1 submitted 4 January, 2019; originally announced January 2019.

    Journal ref: Electronic Journal of Statistics 2019, Vol. 13, No. 2, 4794-4848

  10. arXiv:1811.05379  [pdf, other

    math.ST stat.ME

    Quantile regression approach to conditional mode estimation

    Authors: Hirofumi Ota, Kengo Kato, Satoshi Hara

    Abstract: In this paper, we consider estimation of the conditional mode of an outcome variable given regressors. To this end, we propose and analyze a computationally scalable estimator derived from a linear quantile regression model and develop asymptotic distributional theory for the estimator. Specifically, we find that the pointwise limiting distribution is a scale transformation of Chernoff's distribut… ▽ More

    Submitted 29 July, 2019; v1 submitted 13 November, 2018; originally announced November 2018.

    Comments: This paper supersedes "On estimation of conditional modes using multiple quantile regressions" (Hirofumi Ohta and Satoshi Hara, arXiv:1712.08754)

  11. arXiv:1712.00771  [pdf, other

    math.ST math.PR stat.CO stat.ME

    Randomized incomplete $U$-statistics in high dimensions

    Authors: Xiaohui Chen, Kengo Kato

    Abstract: This paper studies inference for the mean vector of a high-dimensional $U$-statistic. In the era of Big Data, the dimension $d$ of the $U$-statistic and the sample size $n$ of the observations tend to be both large, and the computation of the $U$-statistic is prohibitively demanding. Data-dependent inferential procedures such as the empirical bootstrap for $U$-statistics is even more computational… ▽ More

    Submitted 27 January, 2019; v1 submitted 3 December, 2017; originally announced December 2017.

    MSC Class: 62E17; 62F40; 62H15

  12. arXiv:1708.02705  [pdf, other

    math.ST math.PR stat.ME

    Jackknife multiplier bootstrap: finite sample approximations to the $U$-process supremum with applications

    Authors: Xiaohui Chen, Kengo Kato

    Abstract: This paper is concerned with finite sample approximations to the supremum of a non-degenerate $U$-process of a general order indexed by a function class. We are primarily interested in situations where the function class as well as the underlying distribution change with the sample size, and the $U$-process itself is not weakly convergent as a process. Such situations arise in a variety of modern… ▽ More

    Submitted 13 February, 2019; v1 submitted 8 August, 2017; originally announced August 2017.

    MSC Class: 60F17; 62E17; 62F40; 62G10

  13. arXiv:1312.7614  [pdf, ps, other

    math.ST econ.EM stat.AP

    Inference on causal and structural parameters using many moment inequalities

    Authors: Victor Chernozhukov, Denis Chetverikov, Kengo Kato

    Abstract: This paper considers the problem of testing many moment inequalities where the number of moment inequalities, denoted by $p$, is possibly much larger than the sample size $n$. There is a variety of economic applications where solving this problem allows to carry out inference on causal and structural parameters, a notable example is the market structure model of Ciliberto and Tamer (2009) where… ▽ More

    Submitted 18 October, 2018; v1 submitted 29 December, 2013; originally announced December 2013.

    Comments: This paper was previously circulated under the title "Testing many moment inequalities"

  14. arXiv:1304.0282  [pdf, ps, other

    math.ST econ.EM stat.ME

    Uniform Post Selection Inference for LAD Regression and Other Z-estimation problems

    Authors: Alexandre Belloni, Victor Chernozhukov, Kengo Kato

    Abstract: We develop uniformly valid confidence regions for regression coefficients in a high-dimensional sparse median regression model with homoscedastic errors. Our methods are based on a moment equation that is immunized against non-regular estimation of the nuisance part of the median regression function by using Neyman's orthogonalization. We establish that the resulting instrumental median regression… ▽ More

    Submitted 18 October, 2020; v1 submitted 31 March, 2013; originally announced April 2013.

    Comments: includes supplementary material; 2 figures

    MSC Class: 62F03; 62F12; 62F40

  15. arXiv:1212.0442  [pdf, ps, other

    stat.ME econ.EM

    Some New Asymptotic Theory for Least Squares Series: Pointwise and Uniform Results

    Authors: Alexandre Belloni, Victor Chernozhukov, Denis Chetverikov, Kengo Kato

    Abstract: In applications it is common that the exact form of a conditional expectation is unknown and having flexible functional forms can lead to improvements. Series method offers that by approximating the unknown function based on $k$ basis functions, where $k$ is allowed to grow with the sample size $n$. We consider series estimators for the conditional mean in light of: (i) sharp LLNs for matrices der… ▽ More

    Submitted 17 June, 2015; v1 submitted 3 December, 2012; originally announced December 2012.

    Journal ref: Journal of Econometrics 186 (2015) 345-366

  16. arXiv:1207.5313  [pdf, ps, other

    math.ST stat.ME

    Two-step estimation of high dimensional additive models

    Authors: Kengo Kato

    Abstract: This paper investigates the two-step estimation of a high dimensional additive regression model, in which the number of nonparametric additive components is potentially larger than the sample size but the number of significant additive components is sufficiently small. The approach investigated consists of two steps. The first step implements the variable selection, typically by the group Lasso, a… ▽ More

    Submitted 29 January, 2013; v1 submitted 23 July, 2012; originally announced July 2012.

    Comments: 49 pages, 3 tables; minor errors corrected

  17. arXiv:1204.2108  [pdf, ps, other

    math.ST stat.ME

    Quasi-Bayesian analysis of nonparametric instrumental variables models

    Authors: Kengo Kato

    Abstract: This paper aims at developing a quasi-Bayesian analysis of the nonparametric instrumental variables model, with a focus on the asymptotic properties of quasi-posterior distributions. In this paper, instead of assuming a distributional assumption on the data generating process, we consider a quasi-likelihood induced from the conditional moment restriction, and put priors on the function-valued para… ▽ More

    Submitted 20 November, 2013; v1 submitted 10 April, 2012; originally announced April 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1150 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1150

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 5, 2359-2390

  18. arXiv:1202.4850  [pdf, ps, other

    math.ST stat.ME

    Estimation in functional linear quantile regression

    Authors: Kengo Kato

    Abstract: This paper studies estimation in functional linear quantile regression in which the dependent variable is scalar while the covariate is a function, and the conditional quantile for each fixed quantile index is modeled as a linear functional of the covariate. Here we suppose that covariates are discretely observed and sampling points may differ across subjects, where the number of measurements per… ▽ More

    Submitted 27 February, 2013; v1 submitted 22 February, 2012; originally announced February 2012.

    Comments: Published in at http://dx.doi.org/10.1214/12-AOS1066 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1066

    Journal ref: Annals of Statistics 2012, Vol. 40, No. 6, 3108-3136

  19. arXiv:1103.1458  [pdf, ps, other

    stat.ME math.ST

    Group Lasso for high dimensional sparse quantile regression models

    Authors: Kengo Kato

    Abstract: This paper studies the statistical properties of the group Lasso estimator for high dimensional sparse quantile regression models where the number of explanatory variables (or the number of groups of explanatory variables) is possibly much larger than the sample size while the number of variables in "active" groups is sufficiently small. We establish a non-asymptotic bound on the $\ell_{2}$-estima… ▽ More

    Submitted 25 March, 2011; v1 submitted 8 March, 2011; originally announced March 2011.

    Comments: 37 pages. Some errors are corrected

    MSC Class: 62G05; 62J99