-
Counting torsors for wild abelian groups
Authors:
Ratko Darda,
Takehiko Yasuda
Abstract:
Let $F$ be a global field of characteristic $p > 0$ and $G$ a finite abelian $p$-group. In this paper we treat the question of counting $G$-torsors over $F$ for certain heights developed in [DY25].
Submitted 19 May, 2025;
originally announced May 2025.
-
Partitioning Law of Polymer Chains into Flexible Polymer Networks
Authors:
Haruki Takarai,
Takashi Yasuda,
Naoyuki Sakumichi,
Takamasa Sakai
Abstract:
The equilibrium partitioning of linear polymer chains into flexible polymer networks is governed by intricate entropic constraints arising from configurational degrees of freedom of both chains and network, yet a quantitative understanding remains elusive. Using model hydrogels with precisely defined network structures, we experimentally reveal a universal law governing linear polymer partitioning into flexible polymer networks. We establish a novel label-free, contactless method to measure partition ratio, based on the increase in osmotic pressure induced by external polymer chains partitioning into the network. Moreover, we find a universal law in which the partition constant is solely determined by the squared ratio $(R_g / l_\mathrm{cycle})^2$, where $R_g$ is the gyration radius of the polymer chain and $l_\mathrm{cycle}$ is the characteristic mesh size of the network, as defined by the cycle length.
Submitted 8 May, 2025;
originally announced May 2025.
-
Thermal evolution model from cometary nuclei to asteroids considering contraction associated with ice sublimation
Authors:
Hitoshi Miura,
Takumi Yasuda
Abstract:
Comet--asteroid transition (CAT) objects are small solar system bodies in the process of evolving from cometary nuclei into asteroids, as they gradually lose volatile substances due to solar heating. The volatile material is mainly water ice, and the time required for its complete depletion is called the desiccation time. Estimating the desiccation time is important for examining the formation and evolution of small solar system bodies. Here, we propose a new theoretical model for evaluating the desiccation time as a function of orbital elements, considering the contraction of the entire cometary nucleus due to ice sublimation. First, we performed numerical calculations of the thermal evolution of a cometary nucleus in an eccentric orbit, considering the seasonal variation in the solar heating rate. Next, we derived the desiccation time analytically as a function of orbital elements based on a steady-state model considering the solar heating rate averaged over the seasons. We compared the numerical solutions for the desiccation time with the analytical solutions and clarified the conditions under which the analytical model can be applied. Additionally, based on the analytical model, we derived formulae for estimating the emission rates of water vapor and dust on the surface of the cometary nucleus, the maximum size of the emitted dust, and the dust emission velocity, by assuming the amount of ice remaining inside the nucleus. Using these analytical solutions, we considered the internal structure and evolution process of typical CAT objects. Our analytical model was generally consistent with the results of earlier observations of these objects. Our model provides a theoretical guideline for discussing the evolution of cometary nuclei and the possibility of retaining internal ice in asteroids.
Submitted 1 May, 2025;
originally announced May 2025.
-
The Batyrev-Manin conjecture for DM stacks II
Authors:
Ratko Darda,
Takehiko Yasuda
Abstract:
This is a sequel to a paper by the authors. We formulate a prototype of a Batyrev-Manin type conjecture for DM stacks in positive characteristic with emphasis on the wild case. The key to doing so is the notion of sectoroids, which plays a role similar to the role played by sectors in characteristic zero. We also check some compatibilities of the proposed prototype and see how it reduces to special cases. In the course of formulating the prototype, we introduce a new type of height function, which is more general and flexible than the one we considered in the preceding paper. Examples of this new height function include discriminants of torsors, minimal discriminants and conductors of elliptic curves in characteristic three. In appendices, we generalize the construction of a moduli stack by Tonini-Yasuda and construct a morphism between stacks of twisted arcs. These results are closely related to the main text, but also of independent interest.
Submitted 13 May, 2025; v1 submitted 10 February, 2025;
originally announced February 2025.
-
John Ellipsoids via Lazy Updates
Authors:
David P. Woodruff,
Taisuke Yasuda
Abstract:
We give a faster algorithm for computing an approximate John ellipsoid around $n$ points in $d$ dimensions. The best known prior algorithms are based on repeatedly computing the leverage scores of the points and reweighting them by these scores [CCLY19]. We show that this algorithm can be substantially sped up by delaying the computation of high accuracy leverage scores by using sampling, and then later computing multiple batches of high accuracy leverage scores via fast rectangular matrix multiplication. We also give low-space streaming algorithms for John ellipsoids using similar ideas.
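As a rough illustration of the baseline that the abstract refers to (repeatedly computing leverage scores and reweighting by them), the following pure-Python sketch runs the fixed-point iteration $w_i \leftarrow w_i \, a_i^\top (\sum_j w_j a_j a_j^\top)^{-1} a_i$ in two dimensions and averages the iterates. It is not the lazy-update algorithm of the paper. The function names, the restriction to $d = 2$, the iteration count, and the assumption of an origin-centered ellipsoid are all choices made here for illustration.

```python
# Sketch of leverage-score reweighting for an approximate John ellipsoid.
# Assumptions (not from the abstract): d = 2, ellipsoid centered at the
# origin, fixed number of iterations, hypothetical function names.

def leverage_scores_2d(points, weights):
    """Return w_i * a_i^T M^{-1} a_i, where M = sum_j w_j a_j a_j^T."""
    # Accumulate the weighted 2x2 Gram matrix M (symmetric).
    m00 = m01 = m11 = 0.0
    for (x, y), w in zip(points, weights):
        m00 += w * x * x
        m01 += w * x * y
        m11 += w * y * y
    det = m00 * m11 - m01 * m01
    if det <= 0.0:
        raise ValueError("weighted points must span the plane")
    # Closed-form inverse of a symmetric 2x2 matrix.
    i00, i01, i11 = m11 / det, -m01 / det, m00 / det
    scores = []
    for (x, y), w in zip(points, weights):
        quad = i00 * x * x + 2.0 * i01 * x * y + i11 * y * y
        scores.append(w * quad)
    return scores


def john_weights_2d(points, iterations=50):
    """Average the iterates of the reweighting map, starting from w_i = d/n."""
    n, d = len(points), 2
    weights = [d / n] * n
    running_sum = [0.0] * n
    for _ in range(iterations):
        # Each weight is replaced by the leverage score of its reweighted row.
        weights = leverage_scores_2d(points, weights)
        running_sum = [s + w for s, w in zip(running_sum, weights)]
    return [s / iterations for s in running_sum]
```

Because leverage scores always sum to the dimension, the returned weights sum to 2 in this sketch; each step requires a fresh set of leverage scores, which is the cost the abstract says can be reduced by sampling and batching.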
Submitted 3 January, 2025;
originally announced January 2025.
-
Quotient singularities by permutation actions are canonical
Authors:
Takehiko Yasuda
Abstract:
The quotient variety associated to a permutation representation of a finite group has only canonical singularities in arbitrary characteristic. Moreover, the log pair associated to such a representation is Kawamata log terminal except in characteristic two, and log canonical in arbitrary characteristic.
Submitted 19 February, 2025; v1 submitted 24 August, 2024;
originally announced August 2024.
-
Motivic versions of mass formulas by Krasner, Serre and Bhargava
Authors:
Takehiko Yasuda
Abstract:
We prove motivic versions of mass formulas by Krasner, Serre and Bhargava concerning (weighted) counts of extensions of local fields.
Submitted 24 August, 2024;
originally announced August 2024.
-
Ridge Leverage Score Sampling for $\ell_p$ Subspace Approximation
Authors:
David P. Woodruff,
Taisuke Yasuda
Abstract:
The $\ell_p$ subspace approximation problem is an NP-hard low rank approximation problem that generalizes the median hyperplane ($p = 1$), principal component analysis ($p = 2$), and center hyperplane problems ($p = \infty$). A popular approach to cope with the NP-hardness is to compute a strong coreset, which is a weighted subset of input points that simultaneously approximates the cost of every $k$-dimensional subspace, typically to $(1+ε)$ relative error for a small constant $ε$.
We obtain an algorithm for constructing a strong coreset for $\ell_p$ subspace approximation of size $\tilde O(kε^{-4/p})$ for $p<2$ and $\tilde O(k^{p/2}ε^{-p})$ for $p>2$. This offers the following improvements over prior work:
- We construct the first strong coresets with nearly optimal dependence on $k$ for all $p\neq 2$. In prior work, [SW18] constructed coresets of modified points with a similar dependence on $k$, while [HV20] constructed true coresets with polynomially worse dependence on $k$.
- We recover or improve the best known $ε$ dependence for all $p$. In particular, for $p > 2$, the [SW18] coreset of modified points had a dependence of $ε^{-p^2/2}$ and the [HV20] coreset had a dependence of $ε^{-3p}$.
Our algorithm is based on sampling by root ridge leverage scores, which admits fast algorithms, especially for sparse or structured matrices. Our analysis avoids the use of the representative subspace theorem [SW18], which is a critical component of all prior dimension-independent coresets for $\ell_p$ subspace approximation.
Our techniques also lead to the first nearly optimal online strong coresets for $\ell_p$ subspace approximation with similar bounds as the offline setting, resolving a problem of [WY23]. All prior approaches lose $\mathrm{poly}(k)$ factors in this setting, even when allowed to modify the original points.
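For readers unfamiliar with the sampling scores mentioned above, the standard ridge leverage score of the $i$-th row $a_i$ of a matrix $A$ with regularization parameter $\lambda > 0$ is recalled below. This is the textbook definition only. The symbols $\tau_i^{\lambda}$, $a_i$ and $\lambda$ are notation assumed here, and the abstract does not specify the paper's choice of $\lambda$ or the exact form of the "root" variant.

```latex
\tau_i^{\lambda}(A) \;=\; a_i^{\top}\bigl(A^{\top}A + \lambda I\bigr)^{-1} a_i,
\qquad i = 1, \dots, n .
```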
Submitted 2 April, 2025; v1 submitted 3 July, 2024;
originally announced July 2024.
-
Stabilising nonlinear travelling waves in pipe flow using time-delayed feedback
Authors:
Tatsuya Yasuda,
Dan Lucas
Abstract:
We demonstrate the first successful non-invasive stabilisation of nonlinear travelling waves in a straight cylindrical pipe using time-delayed feedback control (TDF) working in various symmetry subspaces. By using an approximate linear stability analysis and by analysing the frequency domain effect of the control using transfer functions, we find that solutions with well separated unstable eigenfrequencies can have narrow windows of stabilising time-delays. To mitigate this issue we employ a "multiple time-delayed feedback" (MTDF) approach, where several control terms are included to attenuate a broad range of unstable eigenfrequencies. We implement a gradient descent method to dynamically adjust the gain functions in order to reduce the need for tuning a high dimensional parameter space. This results in a novel control method where the properties of the target state are not needed in advance and speculative guesses can result in robust stabilisation. This enables travelling waves to be stabilised from generic turbulent states and unknown travelling waves to be obtained in highly symmetric subspaces.
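For orientation, the generic (Pyragas-type) form of a time-delayed feedback term, and its extension to several delays, can be written as below. This is the textbook form and not necessarily the formulation used in the paper. The symbols $\mathcal{N}$, $G$, $G_j$, $\tau_j$ and $m$ are assumed notation, and the spatial shift needed to target travelling waves is omitted.

```latex
\partial_t u = \mathcal{N}(u) + G\,\bigl[u(t-\tau) - u(t)\bigr]
\quad\text{(TDF)},
\qquad
\partial_t u = \mathcal{N}(u) + \sum_{j=1}^{m} G_j\,\bigl[u(t-\tau_j) - u(t)\bigr]
\quad\text{(MTDF)}.
```

The control term vanishes whenever $u(t-\tau) = u(t)$, so a target state with period $\tau$ is left unchanged by the control; this is the sense in which such feedback is non-invasive.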
Submitted 6 November, 2024; v1 submitted 21 June, 2024;
originally announced June 2024.
-
Coresets for Multiple $\ell_p$ Regression
Authors:
David P. Woodruff,
Taisuke Yasuda
Abstract:
A coreset of a dataset with $n$ examples and $d$ features is a weighted subset of examples that is sufficient for solving downstream data analytic tasks. Nearly optimal constructions of coresets for least squares and $\ell_p$ linear regression with a single response are known in prior work. However, for multiple $\ell_p$ regression where there can be $m$ responses, there are no known constructions with size sublinear in $m$. In this work, we construct coresets of size $\tilde O(\varepsilon^{-2}d)$ for $p<2$ and $\tilde O(\varepsilon^{-p}d^{p/2})$ for $p>2$ independently of $m$ (i.e., dimension-free) that approximate the multiple $\ell_p$ regression objective at every point in the domain up to $(1\pm\varepsilon)$ relative error. If we only need to preserve the minimizer subject to a subspace constraint, we improve these bounds by an $\varepsilon$ factor for all $p>1$. All of our bounds are nearly tight.
We give two applications of our results. First, we settle the number of uniform samples needed to approximate $\ell_p$ Euclidean power means up to a $(1+\varepsilon)$ factor, showing that $\tilde\Theta(\varepsilon^{-2})$ samples for $p = 1$, $\tilde\Theta(\varepsilon^{-1})$ samples for $1 < p < 2$, and $\tilde\Theta(\varepsilon^{1-p})$ samples for $p>2$ is tight, answering a question of Cohen-Addad, Saulpic, and Schwiegelshohn. Second, we show that for $1<p<2$, every matrix has a subset of $\tilde O(\varepsilon^{-1}k)$ rows which spans a $(1+\varepsilon)$-approximately optimal $k$-dimensional subspace for $\ell_p$ subspace approximation, which is also nearly optimal.
Submitted 4 June, 2024;
originally announced June 2024.
-
Reweighted Solutions for Weighted Low Rank Approximation
Authors:
David P. Woodruff,
Taisuke Yasuda
Abstract:
Weighted low rank approximation (WLRA) is an important yet computationally challenging primitive with applications including statistical analysis, model compression, and signal processing. To cope with the NP-hardness of this problem, prior work considers heuristics, bicriteria, or fixed parameter tractable algorithms. In this work, we introduce a new relaxed solution to WLRA which outputs a matrix that is not necessarily low rank, but can be stored using very few parameters and gives provable approximation guarantees when the weight matrix has low rank. Our central idea is to use the weight matrix itself to reweight a low rank solution, which gives an extremely simple algorithm with remarkable empirical performance in applications to model compression and on synthetic datasets. Our algorithm also gives nearly optimal communication complexity bounds for a natural distributed problem associated with this problem, for which we show matching communication lower bounds. Together, our communication complexity bounds show that the rank of the weight matrix provably parameterizes the communication complexity of WLRA. We also obtain the first relative error guarantees for feature selection with a weighted objective.
Submitted 4 June, 2024;
originally announced June 2024.
-
Martin's Maximum${}^{\ast, ++}_{\mathfrak{c}}$ in $\mathbb{P}_{\max}$ extensions of strong models of determinacy
Authors:
Ralf Schindler,
Taichi Yasuda
Abstract:
We study a strengthening of $\mathrm{MM}^{++}$ which is called $\mathrm{MM}^{\ast, ++}$ and which was introduced by Asperó and Schindler. We force its bounded version $\mathrm{MM}^{\ast, ++}_{\mathfrak{c}}$, which is stronger than both $\mathrm{MM}^{++}(\mathfrak{c})$ as well as $\mathrm{BMM}^{++}$, by $\mathbb{P}_{\max}$ forcing over a determinacy model $L^{F_{\mathrm{uB}}}({\mathbb R}^*,\mbox{Hom}^{\ast})$. The construction of the ground model $L^{F_{\mathrm{uB}}}({\mathbb R}^{\ast},\mbox{Hom}^{\ast})$ builds upon Gappo and Sargsyan, and the derived model construction of Larson, Sargsyan, and Wilson.
Submitted 19 April, 2024;
originally announced April 2024.
-
Estimation of migrate histories of the Japanese sardine in the Sea of Japan by combining the microscale stable isotope analysis of otoliths and a data assimilation model
Authors:
Tomoya Aono,
Tatsuya Sakamoto,
Toyoho Ishimura,
Motomitsu Takahashi,
Tohya Yasuda,
Satoshi Kitajima,
Kozue Nishida,
Takayoshi Matsuura,
Akito Ikari,
Shin-ichi Ito
Abstract:
The Japanese sardine (Sardinops melanostictus) is a small pelagic fish found in the Sea of Japan, a marginal sea of the western North Pacific. It is an important species for regional fisheries, but its transport and migration patterns during early life stages remain unclear. In this study, we analyzed the stable oxygen isotope ratios of otoliths of young-of-the-year (age 0) Japanese sardines collected from the northern offshore and southern coastal areas of the Sea of Japan in 2015 and 2016. The ontogenetic shifts of the geographic distribution were estimated by comparing the profiles of life-long isotope ratios with a temporally varying isoscape, which was calculated using the temperature and salinity fields produced by an ocean data assimilation model. Individuals that were collected in the northern and southern areas hatched and stayed in the southern areas (west offshore of Kyushu) until late June, and thereafter, they can be distinguished into two groups: one that migrated northward in the shallow layer and one that stayed around the southern area in the deep layer. A comparison of somatic growth trajectories of the two groups, which were reconstructed based on otolith microstructure analysis, suggested that individuals that migrated northward had significantly larger body lengths in late June than those that stayed in the southern area. These results indicate that young-of-the-year Japanese sardines that hatched in the southern area may have been forced to choose one of two strategies to avoid extremely high water temperatures within seasonal and geographical limits. These include migrating northward or moving to deeper layers. Our results indicate that environmental variability in the southern area could critically impact sardine population dynamics in the Sea of Japan.
Submitted 27 February, 2024;
originally announced February 2024.
-
SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization
Authors:
Taisuke Yasuda,
Kyriakos Axiotis,
Gang Fu,
MohammadHossein Bateni,
Vahab Mirrokni
Abstract:
Neural network pruning is a key technique towards engineering large yet scalable, interpretable, and generalizable models. Prior work on the subject has developed largely along two orthogonal directions: (1) differentiable pruning for efficiently and accurately scoring the importance of parameters, and (2) combinatorial optimization for efficiently searching over the space of sparse models. We unite the two approaches, both theoretically and empirically, to produce a coherent framework for structured neural network pruning in which differentiable pruning guides combinatorial optimization algorithms to select the most important sparse set of parameters. Theoretically, we show how many existing differentiable pruning techniques can be understood as nonconvex regularization for group sparse optimization, and prove that for a wide class of nonconvex regularizers, the global optimum is unique, group-sparse, and provably yields an approximate solution to a sparse convex optimization problem. The resulting algorithm that we propose, SequentialAttention++, advances the state of the art in large-scale neural network block-wise pruning tasks on the ImageNet and Criteo datasets.
Submitted 10 February, 2025; v1 submitted 27 February, 2024;
originally announced February 2024.
-
Polymer Network Diffusion in Charged Gels
Authors:
Shoei Sano,
Takashi Yasuda,
Takeshi Fujiyabu,
Naoyuki Sakumichi,
Takamasa Sakai
Abstract:
The swelling kinetics of charged polymer gels reflect the complex competition among elastic, mixing, and ionic contributions. Here, we used dynamic light scattering to investigate the collective diffusion coefficient of model gels, whose polymer network structure was controlled so that the three contributions were comparable. We demonstrate that the collective diffusion coefficient stems from the sum of elastic, mixing, and ionic contributions, without evident cross-correlations. The significant ionic contribution conforms to the Donnan equilibrium, which explains equilibrium electrical potential gradients in biological systems.
Submitted 30 January, 2024;
originally announced January 2024.
-
The Manin conjecture for toric stacks
Authors:
Ratko Darda,
Takehiko Yasuda
Abstract:
Split toric stacks over a number field $F$ are a natural generalization of split toric varieties over $F$. Notable examples are weighted projective stacks. In our previous work, we defined heights on Deligne-Mumford stacks using so-called raised line bundles and made predictions on asymptotic formulas of the number of rational points of bounded height. In this paper, we prove that the number of rational points of any split toric stack of bounded height with respect to the anti-canonical raised line bundle satisfies one of our predictions, the Manin conjecture for Deligne-Mumford stacks.
Submitted 3 November, 2023;
originally announced November 2023.
-
Sketching Algorithms for Sparse Dictionary Learning: PTAS and Turnstile Streaming
Authors:
Gregory Dexter,
Petros Drineas,
David P. Woodruff,
Taisuke Yasuda
Abstract:
Sketching algorithms have recently proven to be a powerful approach both for designing low-space streaming algorithms as well as fast polynomial time approximation schemes (PTAS). In this work, we develop new techniques to extend the applicability of sketching-based approaches to the sparse dictionary learning and the Euclidean $k$-means clustering problems. In particular, we initiate the study of the challenging setting where the dictionary/clustering assignment for each of the $n$ input points must be output, which has surprisingly received little attention in prior work. On the fast algorithms front, we obtain a new approach for designing PTAS's for the $k$-means clustering problem, which generalizes to the first PTAS for the sparse dictionary learning problem. On the streaming algorithms front, we obtain new upper bounds and lower bounds for dictionary learning and $k$-means clustering. In particular, given a design matrix $\mathbf A\in\mathbb R^{n\times d}$ in a turnstile stream, we show an $\tilde O(nr/ε^2 + dk/ε)$ space upper bound for $r$-sparse dictionary learning of size $k$, an $\tilde O(n/ε^2 + dk/ε)$ space upper bound for $k$-means clustering, as well as an $\tilde O(n)$ space upper bound for $k$-means clustering on random order row insertion streams with a natural "bounded sensitivity" assumption. On the lower bounds side, we obtain a general $\tilde\Omega(n/ε+ dk/ε)$ lower bound for $k$-means clustering, as well as an $\tilde\Omega(n/ε^2)$ lower bound for algorithms which can estimate the cost of a single fixed set of candidate centers.
Submitted 29 October, 2023;
originally announced October 2023.
-
Higher Jacobian ideals, contact equivalence and motivic zeta functions
Authors:
Quy Thuong Lê,
Takehiko Yasuda
Abstract:
We show basic properties of higher Jacobian matrices and higher Jacobian ideals for functions and apply them to obtain two results concerning singularities of functions. Firstly, we prove that a higher Nash blowup algebra is invariant under contact equivalences, which was recently conjectured by Hussain, Ma, Yau and Zuo. Secondly, we obtain an analogue of a result on motivic nearby cycles by Bussi, Joyce and Meinhardt.
Submitted 11 October, 2023;
originally announced October 2023.
-
Quantum reservoir computing with repeated measurements on superconducting devices
Authors:
Toshiki Yasuda,
Yudai Suzuki,
Tomoyuki Kubota,
Kohei Nakajima,
Qi Gao,
Wenlong Zhang,
Satoshi Shimono,
Hendra I. Nurdin,
Naoki Yamamoto
Abstract:
Reservoir computing is a machine learning framework that uses artificial or physical dissipative dynamics to predict time-series data using nonlinearity and memory properties of dynamical systems. Quantum systems are considered promising reservoirs, but conventional quantum reservoir computing (QRC) models suffer from long execution times. In this paper, we develop a quantum reservoir (QR) system that exploits repeated measurement to generate a time-series, which can effectively reduce the execution time. We experimentally implement the proposed QRC on IBM's superconducting quantum device and show that it achieves higher accuracy as well as shorter execution time than the conventional QRC method. Furthermore, we study the temporal information processing capacity to quantify the computational capability of the proposed QRC; in particular, we use this quantity to identify the measurement strength that best trades off the amount of available information against the strength of dissipation. An experimental demonstration with a soft robot is also provided, where the repeated measurement over 1000 timesteps was effectively applied. Finally, a preliminary result with a 120-qubit device is discussed.
Submitted 10 October, 2023;
originally announced October 2023.
-
Performance of $\ell_1$ Regularization for Sparse Convex Optimization
Authors:
Kyriakos Axiotis,
Taisuke Yasuda
Abstract:
Despite widespread adoption in practice, guarantees for the LASSO and Group LASSO are strikingly lacking in settings beyond statistical problems, and these algorithms are usually considered to be a heuristic in the context of sparse convex optimization on deterministic inputs. We give the first recovery guarantees for the Group LASSO for sparse convex optimization with vector-valued features. We show that if a sufficiently large Group LASSO regularization is applied when minimizing a strictly convex function $l$, then the minimizer is a sparse vector supported on vector-valued features with the largest $\ell_2$ norm of the gradient. Thus, repeating this procedure selects the same set of features as the Orthogonal Matching Pursuit algorithm, which admits recovery guarantees for any function $l$ with restricted strong convexity and smoothness via weak submodularity arguments. This answers open questions of Tibshirani et al. and Yasuda et al. Our result is the first to theoretically explain the empirical success of the Group LASSO for convex functions under general input instances assuming only restricted strong convexity and smoothness. Our result also generalizes provable guarantees for the Sequential Attention algorithm, which is a feature selection algorithm inspired by the attention mechanism proposed by Yasuda et al.
As an application of our result, we give new results for the column subset selection problem, which is well-studied when the loss is the Frobenius norm or other entrywise matrix losses. We give the first result for general loss functions for this problem that requires only restricted strong convexity and smoothness.
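As a reminder of the objects involved, the Group LASSO objective and the selection rule described above can be written as follows. The notation is assumed here rather than taken from the paper: $\beta_i$ denotes the block of coefficients of the $i$-th vector-valued feature, $\lambda > 0$ is the regularization strength, and $\nabla_i$ is the gradient with respect to $\beta_i$.

```latex
\min_{\beta}\; l(\beta) + \lambda \sum_{i=1}^{n} \lVert \beta_i \rVert_2 ,
\qquad
i^{\star} \in \arg\max_{1 \le i \le n} \bigl\lVert \nabla_i\, l(0) \bigr\rVert_2 .
```

Read this way, the claim in the abstract is that for sufficiently large $\lambda$ the minimizer is supported on features attaining the maximum on the right, which is the same feature a single step of Orthogonal Matching Pursuit would pick.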
Submitted 14 July, 2023;
originally announced July 2023.
-
Sharper Bounds for $\ell_p$ Sensitivity Sampling
Authors:
David P. Woodruff,
Taisuke Yasuda
Abstract:
In large scale machine learning, random sampling is a popular way to approximate datasets by a small representative subset of examples. In particular, sensitivity sampling is an intensely studied technique which provides provable guarantees on the quality of approximation, while reducing the number of examples to the product of the VC dimension $d$ and the total sensitivity $\mathfrak S$ in remarkably general settings. However, guarantees going beyond this general bound of $\mathfrak S d$ are known in perhaps only one setting, for $\ell_2$ subspace embeddings, despite intense study of sensitivity sampling in prior work. In this work, we show the first bounds for sensitivity sampling for $\ell_p$ subspace embeddings for $p > 2$ that improve over the general $\mathfrak S d$ bound, achieving a bound of roughly $\mathfrak S^{2-2/p}$ for $2<p<\infty$. Furthermore, our techniques yield further new results in the study of sampling algorithms, showing that the root leverage score sampling algorithm achieves a bound of roughly $d$ for $1\leq p<2$, and that a combination of leverage score and sensitivity sampling achieves an improved bound of roughly $d^{2/p}\mathfrak S^{2-4/p}$ for $2<p<\infty$. Our sensitivity sampling results yield the best known sample complexity for a wide class of structured matrices that have small $\ell_p$ sensitivity.
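For orientation only, the $\ell_2$ setting mentioned in the abstract can be sketched in a few lines, since there the sensitivity of a row coincides with its leverage score. The sketch below is not the paper's algorithm: the names `solve`, `leverage_scores`, `sample_rows`, and the `oversample` parameter are hypothetical, and the matrix is assumed to have full column rank.

```python
import random


def solve(M, y):
    """Solve the square system M z = y by Gaussian elimination with partial pivoting."""
    n = len(y)
    aug = [list(row) + [rhs] for row, rhs in zip(M, y)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[pivot] = aug[pivot], aug[c]
        for r in range(c + 1, n):
            f = aug[r][c] / aug[c][c]
            for k in range(c, n + 1):
                aug[r][k] -= f * aug[c][k]
    z = [0.0] * n
    for c in reversed(range(n)):
        tail = sum(aug[c][k] * z[k] for k in range(c + 1, n))
        z[c] = (aug[c][n] - tail) / aug[c][c]
    return z


def leverage_scores(rows):
    """Return tau_i = a_i^T (A^T A)^{-1} a_i for each row a_i of A (assumed full column rank)."""
    d = len(rows[0])
    gram = [[sum(r[i] * r[j] for r in rows) for j in range(d)] for i in range(d)]
    scores = []
    for r in rows:
        z = solve(gram, r)  # z = (A^T A)^{-1} a_i
        scores.append(sum(ri * zi for ri, zi in zip(r, z)))
    return scores


def sample_rows(rows, oversample, rng):
    """Keep row i with probability p_i = min(1, oversample * tau_i), rescaled by p_i^(-1/2)."""
    kept = []
    for r, tau in zip(rows, leverage_scores(rows)):
        p = min(1.0, oversample * tau)
        if rng.random() < p:
            kept.append([x / p ** 0.5 for x in r])
    return kept


A = [[1, 0], [0, 1], [1, 1], [2, 0]]
scores = leverage_scores(A)
subset = sample_rows(A, oversample=1.5, rng=random.Random(0))
```

For a full-column-rank matrix the leverage scores sum to $d$, which is the $\ell_2$ analogue of the total sensitivity $\mathfrak S$ appearing in the bounds above.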
Submitted 3 January, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
-
Non-commutative resolutions of linearly reductive quotient singularities
Authors:
Christian Liedtke,
Takehiko Yasuda
Abstract:
We prove existence of non-commutative crepant resolutions (in the sense of van den Bergh) of quotient singularities by finite and linearly reductive group schemes in positive characteristic. In dimension two, we relate these to resolutions of singularities provided by G-Hilbert schemes and F-blowups. As an application, we establish and recover results concerning resolutions for toric singularities, as well as canonical, log terminal, and F-regular singularities in dimension 2.
Submitted 19 May, 2023; v1 submitted 28 April, 2023;
originally announced April 2023.
-
F-blowups and essential divisors for toric varieties
Authors:
Enrique Chávez-Martínez,
Daniel Duarte,
Takehiko Yasuda
Abstract:
We investigate the relation between essential divisors and F-blowups and, in particular, address the problem of whether all essential divisors appear on the $e$-th F-blowup for large enough $e$. Focusing on the case of normal affine toric varieties, we establish a simple sufficient condition for a divisor over the given toric variety to appear on the normalized limit F-blowup as a prime divisor. As a corollary, we show that if a normal toric variety has a crepant resolution, then the above problem has a positive answer, provided that we use the notion of essential divisors in the sense of Bouvier and Gonzalez-Sprinberg. We also provide an example of toric threefold singularities for which a non-essential divisor appears on an F-blowup.
Submitted 19 April, 2024; v1 submitted 25 April, 2023;
originally announced April 2023.
-
New Subset Selection Algorithms for Low Rank Approximation: Offline and Online
Authors:
David P. Woodruff,
Taisuke Yasuda
Abstract:
Subset selection for the rank $k$ approximation of an $n\times d$ matrix $A$ offers improvements in the interpretability of matrices, as well as a variety of computational savings. This problem is well-understood when the error measure is the Frobenius norm, with various tight algorithms known even in challenging models such as the online model, where an algorithm must select the column subset irrevocably when the columns arrive one by one. In contrast, for other matrix losses, optimal trade-offs between the subset size and approximation quality have not been settled, even in the offline setting. We give a number of results towards closing these gaps.
In the offline setting, we achieve nearly optimal bicriteria algorithms in two settings. First, we remove a $\sqrt k$ factor from a result of [SWZ19] when the loss function is any entrywise loss with an approximate triangle inequality and at least linear growth. Our result is tight for the $\ell_1$ loss. We give a similar improvement for entrywise $\ell_p$ losses for $p>2$, improving a previous distortion of $k^{1-1/p}$ to $k^{1/2-1/p}$. Our results come from a technique which replaces the use of a well-conditioned basis with a slightly larger spanning set for which any vector can be expressed as a linear combination with small Euclidean norm. We show that this technique also gives the first oblivious $\ell_p$ subspace embeddings for $1<p<2$ with $\tilde O(d^{1/p})$ distortion, which is nearly optimal and closes a long line of work.
In the online setting, we give the first online subset selection algorithm for $\ell_p$ subspace approximation and entrywise $\ell_p$ low rank approximation by implementing sensitivity sampling online, which is challenging due to the sequential nature of sensitivity sampling. Our main technique is an online algorithm for detecting when an approximately optimal subspace changes substantially.
Submitted 18 April, 2023;
originally announced April 2023.
-
Universality of Osmotic Equation of State in Star Polymer Solutions
Authors:
Takashi Yasuda,
Masanobu Ino,
Takamasa Sakai,
Naoyuki Sakumichi
Abstract:
We experimentally measure the osmotic pressures of linear polymers and three-, four-, and eight-arm star polymers in a good solvent via membrane osmometry. These results reveal that the osmotic equations of state in the star polymer solutions are universally described by the same scaling function that describes linear polymer solutions. This universality is achieved by canceling increasing overlap concentrations and decreasing osmotic pressure, owing to the increased arm number. We further clarify the molar mass and arm number dependencies of the gyration radius and interpenetration factor, ensuring universality in star polymer solutions.
Submitted 27 February, 2023;
originally announced February 2023.
-
Log Centres of Noncommutative Crepant Resolutions are Kawamata Log Terminal: Remarks on a paper of Stafford and van den Bergh
Authors:
Colin Ingalls,
Takehiko Yasuda
Abstract:
We show that if a finitely generated prime algebra $Δ$ is a finitely generated maximal Cohen-Macaulay module over its centre $Z$, and has global dimension equal to $\dim Z$, then the pair given by its centre and ramification divisor is Kawamata log terminal.
Submitted 7 April, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Semidilute Principle for Gels
Authors:
Naoyuki Sakumichi,
Takashi Yasuda,
Takamasa Sakai
Abstract:
Polymer gels such as jellies and soft contact lenses are soft solids consisting of three-dimensional polymer networks swollen with a large amount of solvent. For approximately 80 years, the swelling of polymer gels has been described using the Flory--Huggins mean-field theory. However, this theory is problematic when applied to polymer gels with large solvent contents owing to the significant fluctuations in polymer concentration. In this study, we experimentally demonstrate the superiority of the semidilute scaling law over the mean-field theory for predicting the swelling of polymer gels. Using the semidilute scaling law, we experimentally determine the universal critical exponent $ν$ of the self-avoiding walk via swelling experiments on polymer gels. The experimentally obtained value $ν\simeq 0.589$ is consistent with the previously reported value of $ν\simeq 0.588$, which was obtained by precise numerical calculations. Furthermore, we theoretically derive and experimentally demonstrate a scaling law that governs the equilibrium concentrations. This scaling law contradicts the predictions made by de Gennes' $c^{*}$ theorem. A major deficiency of the $c^*$ theorem is that the network elasticity, which depends on the as-prepared state, is neglected. These findings reveal that the semidilute scaling law is a fundamental principle for accurately predicting and controlling the equilibrium swelling of polymer gels.
Submitted 8 December, 2022; v1 submitted 27 October, 2022;
originally announced October 2022.
-
Quantitative inverse Galois problem for semicommutative finite group schemes
Authors:
Ratko Darda,
Takehiko Yasuda
Abstract:
A semicommutative finite group scheme is a finite group scheme which can be obtained from commutative finite group schemes by iteratively performing semidirect products with commutative kernels and taking quotients by normal subgroups. In this article, for an étale tame semicommutative finite group scheme $G$, we give a lower bound on the number of connected $G$-torsors of bounded height (such as discriminant).
Submitted 4 November, 2022; v1 submitted 4 October, 2022;
originally announced October 2022.
-
Sequential Attention for Feature Selection
Authors:
Taisuke Yasuda,
MohammadHossein Bateni,
Lin Chen,
Matthew Fahrbach,
Gang Fu,
Vahab Mirrokni
Abstract:
Feature selection is the problem of selecting a subset of features for a machine learning model that maximizes model quality subject to a budget constraint. For neural networks, prior methods, including those based on $\ell_1$ regularization, attention, and other techniques, typically select the entire feature subset in one evaluation round, ignoring the residual value of features during selection, i.e., the marginal contribution of a feature given that other features have already been selected. We propose a feature selection algorithm called Sequential Attention that achieves state-of-the-art empirical results for neural networks. This algorithm is based on an efficient one-pass implementation of greedy forward selection and uses attention weights at each step as a proxy for feature importance. We give theoretical insights into our algorithm for linear regression by showing that an adaptation to this setting is equivalent to the classical Orthogonal Matching Pursuit (OMP) algorithm, and thus inherits all of its provable guarantees. Our theoretical and empirical analyses offer new explanations towards the effectiveness of attention and its connections to overparameterization, which may be of independent interest.
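The classical Orthogonal Matching Pursuit procedure referenced in the abstract can be sketched as follows for least-squares regression. This is a minimal illustration of textbook OMP with scalar features, not the Sequential Attention algorithm itself; the names `dot`, `solve`, and `omp` are hypothetical, and the sketch assumes `k` does not exceed the number of columns and that the selected columns are linearly independent.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def solve(M, y):
    """Solve the square system M z = y by Gaussian elimination with partial pivoting."""
    n = len(y)
    aug = [list(row) + [rhs] for row, rhs in zip(M, y)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[pivot] = aug[pivot], aug[c]
        for r in range(c + 1, n):
            f = aug[r][c] / aug[c][c]
            for k in range(c, n + 1):
                aug[r][k] -= f * aug[c][k]
    z = [0.0] * n
    for c in reversed(range(n)):
        tail = sum(aug[c][k] * z[k] for k in range(c + 1, n))
        z[c] = (aug[c][n] - tail) / aug[c][c]
    return z


def omp(columns, b, k):
    """Greedily select k columns for min_x ||Ax - b||_2^2, with A given column by column."""
    selected, coef = [], []
    residual = list(b)
    for _ in range(k):
        # The gradient of 0.5 * ||Ax - b||^2 in coordinate j is -<a_j, residual>,
        # so pick the unselected column with the largest absolute correlation.
        candidates = [j for j in range(len(columns)) if j not in selected]
        best = max(candidates, key=lambda j: abs(dot(columns[j], residual)))
        selected.append(best)
        # Refit on all selected columns via the normal equations.
        sub = [columns[j] for j in selected]
        gram = [[dot(u, v) for v in sub] for u in sub]
        coef = solve(gram, [dot(u, b) for u in sub])
        residual = [bi - sum(c * u[i] for c, u in zip(coef, sub))
                    for i, bi in enumerate(b)]
    return selected, coef


# Three orthonormal features; the target uses features 0 and 2 only.
cols = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
selected, coef = omp(cols, [3, 0, 1], k=2)
```

Each round re-evaluates the remaining features against the current residual, which is the "marginal contribution given what is already selected" idea the abstract contrasts with one-shot selection.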
Submitted 25 April, 2023; v1 submitted 29 September, 2022;
originally announced September 2022.
-
Online Lewis Weight Sampling
Authors:
David P. Woodruff,
Taisuke Yasuda
Abstract:
The seminal work of Cohen and Peng introduced Lewis weight sampling to the theoretical computer science community, yielding fast row sampling algorithms for approximating $d$-dimensional subspaces of $\ell_p$ up to $(1+ε)$ error. Several works have extended this important primitive to other settings, including the online coreset and sliding window models. However, these results are only for $p\in\{1,2\}$, and results for $p=1$ require a suboptimal $\tilde O(d^2/ε^2)$ samples.
In this work, we design the first nearly optimal $\ell_p$ subspace embeddings for all $p\in(0,\infty)$ in the online coreset and sliding window models. In both models, our algorithms store $\tilde O(d^{1\lor(p/2)}/ε^2)$ rows. This answers a substantial generalization of the main open question of [BDMMUWZ2020], and gives the first results for all $p\notin\{1,2\}$.
Towards our result, we give the first analysis of "one-shot" Lewis weight sampling, i.e., sampling rows proportionally to their Lewis weights, with sample complexity $\tilde O(d^{p/2}/ε^2)$ for $p>2$. Previously, this scheme was only known to have sample complexity $\tilde O(d^{p/2}/ε^5)$, whereas $\tilde O(d^{p/2}/ε^2)$ is known if a more sophisticated recursive sampling is used. The recursive sampling cannot be implemented online, thus necessitating an analysis of one-shot Lewis weight sampling. Our analysis uses a novel connection to online numerical linear algebra.
As an application, we obtain the first one-pass streaming coreset algorithms for $(1+ε)$ approximation of important generalized linear models, such as logistic regression and $p$-probit regression. Our upper bounds are parameterized by a complexity parameter $μ$ introduced by [MSSW2018], and we show the first lower bounds showing that a linear dependence on $μ$ is necessary.
Submitted 17 December, 2022; v1 submitted 17 July, 2022;
originally announced July 2022.
-
The Batyrev-Manin conjecture for DM stacks
Authors:
Ratko Darda,
Takehiko Yasuda
Abstract:
We define a new height function on rational points of a DM (Deligne-Mumford) stack over a number field. This generalizes a generalized discriminant of Ellenberg-Venkatesh, the height function recently introduced by Ellenberg-Satriano-Zureick-Brown (as far as DM stacks over number fields are concerned), and the quasi-toric height function on weighted projective stacks by Darda. Generalizing the Manin conjecture and the more general Batyrev-Manin conjecture, we formulate a few conjectures on the asymptotic behavior of the number of rational points of a DM stack with bounded height. To formulate the Batyrev-Manin conjecture for DM stacks, we introduce the orbifold versions of the so-called $a$- and $b$-invariants. When applied to the classifying stack of a finite group, these conjectures specialize to the Malle conjecture, except that we remove certain thin subsets from counting. More precisely, we remove breaking thin subsets, which have been studied in the case of varieties by people including Hassett, Tschinkel, Tanimoto, Lehmann and Sengupta, and can be generalized to DM stacks thanks to our generalization of $a$- and $b$-invariants. The breaking thin subset enables us to reinterpret Klüners' counterexample to the Malle conjecture.
Submitted 10 January, 2024; v1 submitted 7 July, 2022;
originally announced July 2022.
-
Torsors for finite group schemes of bounded height
Authors:
Ratko Darda,
Takehiko Yasuda
Abstract:
Let $F$ be a global field. Let $G$ be a nontrivial finite étale tame $F$-group scheme. We define height functions on the set of $G$-torsors over $F$, which generalize the usual heights such as discriminant. As an analogue of the Malle conjecture for group schemes, we formulate a conjecture on the asymptotic behavior of the number of $G$-torsors over $F$ of bounded height. This is a special case of our more general Stacky Batyrev-Manin conjecture from arXiv:2207.03645. The conjectured asymptotic is proven in the case where $G$ is commutative. When $F$ is a number field, the leading constant is expressed as a product of certain arithmetic invariants of $G$ and a volume of a space attached to $G$. Moreover, an equidistribution property of $G$-torsors in the space is established.
Submitted 19 October, 2022; v1 submitted 7 July, 2022;
originally announced July 2022.
-
Three-Pass Identification Scheme Based on MinRank Problem with Half Cheating Probability
Authors:
Bagus Santoso,
Yasuhiko Ikematsu,
Shuhei Nakamura,
Takanori Yasuda
Abstract:
In Asiacrypt 2001, Courtois proposed the first three-pass zero-knowledge identification (ID) scheme based on the MinRank problem. However, in a single round of Courtois' ID scheme, the cheating probability, i.e., the success probability of the cheating prover, is 2/3, which is larger than half. Although Courtois also proposed a variant scheme which is claimed to have half cheating probability, its security is not formally proven, and it requires another hardness assumption on a specific one-way function and that the verifier always generates challenges according to a specific non-uniform distribution. In this paper, we propose the first three-pass zero-knowledge ID scheme based on the MinRank problem with a cheating probability of exactly half for each round, even with only a two-bit challenge space, without any additional assumption. Our proposed ID scheme requires fewer rounds and a lower total average communication cost compared to Courtois' under the same security level against impersonation.
Submitted 6 May, 2022;
originally announced May 2022.
-
High-Dimensional Geometric Streaming in Polynomial Space
Authors:
David P. Woodruff,
Taisuke Yasuda
Abstract:
Many existing algorithms for streaming geometric data analysis have been plagued by exponential dependencies in the space complexity, which are undesirable for processing high-dimensional data sets. In particular, once $d\geq\log n$, there are no known non-trivial streaming algorithms for problems such as maintaining convex hulls and Löwner-John ellipsoids of $n$ points, despite a long line of work in streaming computational geometry since [AHV04].
We simultaneously improve these results to $\mathrm{poly}(d,\log n)$ bits of space by trading off with a $\mathrm{poly}(d,\log n)$ factor distortion. We achieve these results in a unified manner, by designing the first streaming algorithm for maintaining a coreset for $\ell_\infty$ subspace embeddings with $\mathrm{poly}(d,\log n)$ space and $\mathrm{poly}(d,\log n)$ distortion. Our algorithm also gives similar guarantees in the \emph{online coreset} model. Along the way, we sharpen results for online numerical linear algebra by replacing a log condition number dependence with a $\log n$ dependence, answering a question of [BDM+20]. Our techniques provide a novel connection between leverage scores, a fundamental object in numerical linear algebra, and computational geometry.
For $\ell_p$ subspace embeddings, we give nearly optimal trade-offs between space and distortion for one-pass streaming algorithms. For instance, we give a deterministic coreset using $O(d^2\log n)$ space and $O((d\log n)^{1/2-1/p})$ distortion for $p>2$, whereas previous deterministic algorithms incurred a $\mathrm{poly}(n)$ factor in the space or the distortion [CDW18].
Our techniques have implications in the offline setting, where we give optimal trade-offs between the space complexity and distortion of subspace sketch data structures. To do this, we give an elementary proof of a "change of density" theorem of [LT80] and make it algorithmic.
Submitted 26 September, 2022; v1 submitted 7 April, 2022;
originally announced April 2022.
-
Active Linear Regression for $\ell_p$ Norms and Beyond
Authors:
Cameron Musco,
Christopher Musco,
David P. Woodruff,
Taisuke Yasuda
Abstract:
We study active sampling algorithms for linear regression, which aim to query only a few entries of a target vector $b\in\mathbb R^n$ and output a near minimizer to $\min_{x\in\mathbb R^d} \|Ax-b\|$, for a design matrix $A\in\mathbb R^{n \times d}$ and loss $\|\cdot\|$.
For $p$ norm regression for any $0<p<\infty$, we give an algorithm based on Lewis weight sampling outputting a $(1+ε)$-approximate solution using just $\tilde O(d/ε^2)$ queries to $b$ for $p\in(0,1)$, $\tilde{O}(d/ε)$ queries for $1<p<2$, and $\tilde{O}(d^{p/2}/ε^p)$ queries for $2<p<\infty$. For $0<p<2$, our bounds are optimal up to log factors, settling the query complexity for this range. For $2<p<\infty$, our dependence on $d$ is optimal, while our dependence on $ε$ is off by at most $ε$, up to log factors. Our result resolves an open question of [CD21], who gave near optimal bounds for the $1$ norm, but required $d^2/ε^2$ samples for $\ell_p$ regression with $1<p<2$, and gave no bounds for $2<p<\infty$ or $0<p<1$.
We also give the first total sensitivity bound of $O(d^{\max\{1,p/2\}}\log^2n)$ for loss functions of degree $p$ polynomial growth, improving a result of [TMF20]. By combining this with our techniques for $\ell_p$ regression, we obtain an active regression algorithm making $\tilde O(d^{1+\max\{1,p/2\}}/\mathrm{poly}(ε))$ queries for such loss functions, including the Tukey and Huber losses, answering another question of [CD21]. For the Huber loss, we further improve our bound to $\tilde O(d^{4-2\sqrt2}/\mathrm{poly}(ε))$ samples. Our sensitivity bounds also have many applications, including Orlicz norm subspace embeddings, robust subspace approximation, and dimension reduction for smoothed $p$-norms.
Finally, our active sampling results give the first sublinear time algorithms for Kronecker product regression under every $p$ norm.
Submitted 26 September, 2022; v1 submitted 8 November, 2021;
originally announced November 2021.
-
Improved Algorithms for Low Rank Approximation from Sparsity
Authors:
David P. Woodruff,
Taisuke Yasuda
Abstract:
We overcome two major bottlenecks in the study of low rank approximation by assuming the low rank factors themselves are sparse. Specifically, (1) for low rank approximation with spectral norm error, we show how to improve the best known $\mathsf{nnz}(\mathbf A) k / \sqrt{\varepsilon}$ running time to $\mathsf{nnz}(\mathbf A)/\sqrt{\varepsilon}$ running time plus low order terms depending on the sparsity of the low rank factors, and (2) for streaming algorithms for Frobenius norm error, we show how to bypass the known $Ω(nk/\varepsilon)$ memory lower bound and obtain an $s k (\log n)/ \mathrm{poly}(\varepsilon)$ memory bound, where $s$ is the number of non-zeros of each low rank factor. Although this algorithm is inefficient, as it must be under standard complexity theoretic assumptions, we also present polynomial time algorithms using $\mathrm{poly}(s,k,\log n,\varepsilon^{-1})$ memory that output rank $k$ approximations supported on a $O(sk/\varepsilon)\times O(sk/\varepsilon)$ submatrix.
Both the prior $\mathsf{nnz}(\mathbf A) k / \sqrt{\varepsilon}$ running time and the $nk/\varepsilon$ memory for these problems were long-standing barriers; our results give a natural way of overcoming them assuming sparsity of the low rank factors.
Submitted 31 October, 2021;
originally announced November 2021.
-
The isomorphism problem of projective schemes and related algorithmic problems
Authors:
Takehiko Yasuda
Abstract:
We discuss the isomorphism problem of projective schemes; given two projective schemes, can we algorithmically decide whether they are isomorphic? We give affirmative answers in the case of one-dimensional projective schemes, the case of smooth irreducible varieties with a big canonical sheaf or a big anti-canonical sheaf, and the case of K3 surfaces with a finite automorphism group. As related algorithmic problems, we also discuss decidability of positivity properties of invertible sheaves, and approximation of the nef cone and the pseudo-effective cone.
Submitted 26 December, 2021; v1 submitted 20 July, 2021;
originally announced July 2021.
-
Open problems in the wild McKay correspondence and related fields
Authors:
Takehiko Yasuda
Abstract:
The wild McKay correspondence is a form of McKay correspondence in terms of stringy invariants that is generalized to arbitrary characteristics. It gives rise to an interesting connection between the geometry of wild quotient varieties and arithmetic on extensions of local fields.
The principal purpose of this article is to collect open problems on the wild McKay correspondence, as well as those in related fields that the author believes are interesting or important. It also serves as a survey on the present state of these fields.
Submitted 14 July, 2021;
originally announced July 2021.
-
On the behavior of stringy motives under Galois quasi-étale covers
Authors:
Javier Carvajal-Rojas,
Takehiko Yasuda
Abstract:
We investigate the behavior of stringy motives under Galois quasi-étale covers. We prove that they descend under such covers in a sense defined via their Poincaré realizations. Further, we show that such descent is strict in the presence of ramification. As a corollary, we reduce the problem regarding the finiteness of the étale fundamental group of KLT singularities to a DCC property for their stringy motives. We verify such DCC property for surfaces in arbitrary characteristic. As an application, we give a characteristic-free proof for the finiteness of the étale fundamental group of log terminal surface singularities, which was unknown in equal characteristics 2 and 3 and in mixed characteristics.
Submitted 27 March, 2025; v1 submitted 11 May, 2021;
originally announced May 2021.
-
Exponentially Improved Dimensionality Reduction for $\ell_1$: Subspace Embeddings and Independence Testing
Authors:
Yi Li,
David P. Woodruff,
Taisuke Yasuda
Abstract:
Despite many applications, dimensionality reduction in the $\ell_1$-norm is much less understood than in the Euclidean norm. We give two new oblivious dimensionality reduction techniques for the $\ell_1$-norm which improve exponentially over prior ones:
1. We design a distribution over random matrices $S \in \mathbb{R}^{r \times n}$, where $r = 2^{\tilde O(d/(\varepsilon δ))}$, such that given any matrix $A \in \mathbb{R}^{n \times d}$, with probability at least $1-δ$, simultaneously for all $x$, $\|SAx\|_1 = (1 \pm \varepsilon)\|Ax\|_1$. Note that $S$ is linear, does not depend on $A$, and maps $\ell_1$ into $\ell_1$. Our distribution provides an exponential improvement on the previous best known map of Wang and Woodruff (SODA, 2019), which required $r = 2^{2^{Ω(d)}}$, even for constant $\varepsilon$ and $δ$. Our bound is optimal, up to a polynomial factor in the exponent, given a known $2^{\sqrt d}$ lower bound for constant $\varepsilon$ and $δ$.
2. We design a distribution over matrices $S \in \mathbb{R}^{k \times n}$, where $k = 2^{O(q^2)}(\varepsilon^{-1} q \log d)^{O(q)}$, such that given any $q$-mode tensor $A \in (\mathbb{R}^{d})^{\otimes q}$, one can estimate the entrywise $\ell_1$-norm $\|A\|_1$ from $S(A)$. Moreover, $S = S^1 \otimes S^2 \otimes \cdots \otimes S^q$ and so given vectors $u_1, \ldots, u_q \in \mathbb{R}^d$, one can compute $S(u_1 \otimes u_2 \otimes \cdots \otimes u_q)$ in time $2^{O(q^2)}(\varepsilon^{-1} q \log d)^{O(q)}$, which is much faster than the $d^q$ time required to form $u_1 \otimes u_2 \otimes \cdots \otimes u_q$. Our linear map gives a streaming algorithm for independence testing using space $2^{O(q^2)}(\varepsilon^{-1} q \log d)^{O(q)}$, improving the previous doubly exponential $(\varepsilon^{-1} \log d)^{q^{O(q)}}$ space bound of Braverman and Ostrovsky (STOC, 2010).
Submitted 5 August, 2021; v1 submitted 26 April, 2021;
originally announced April 2021.
-
Comparison of $pp$ and $p \bar{p}$ differential elastic cross sections and observation of the exchange of a colorless $C$-odd gluonic compound
Authors:
V. M. Abazov,
B. Abbott,
B. S. Acharya,
M. Adams,
T. Adams,
J. P. Agnew,
G. D. Alexeev,
G. Alkhazov,
A. Alton,
G. A. Alves,
G. Antchev,
A. Askew,
P. Aspell,
A. C. S. Assis Jesus,
I. Atanassov,
S. Atkins,
K. Augsten,
V. Aushev,
Y. Aushev,
V. Avati,
C. Avila,
F. Badaud,
J. Baechler,
L. Bagby,
C. Baldenegro Barrera, et al. (451 additional authors not shown)
Abstract:
We describe an analysis comparing the $p\bar{p}$ elastic cross section as measured by the D0 Collaboration at a center-of-mass energy of 1.96 TeV to that in $pp$ collisions as measured by the TOTEM Collaboration at 2.76, 7, 8, and 13 TeV using a model-independent approach. The TOTEM cross sections extrapolated to a center-of-mass energy of $\sqrt{s} =$ 1.96 TeV are compared with the D0 measurement in the region of the diffractive minimum and the second maximum of the $pp$ cross section. The two data sets disagree at the 3.4$σ$ level and thus provide evidence for the $t$-channel exchange of a colorless, $C$-odd gluonic compound, also known as the odderon. We combine these results with a TOTEM analysis of the same $C$-odd exchange based on the total cross section and the ratio of the real to imaginary parts of the forward elastic scattering amplitude in $pp$ scattering. The combined significance of these results is larger than 5$σ$ and is interpreted as the first observation of the exchange of a colorless, $C$-odd gluonic compound.
Submitted 25 June, 2021; v1 submitted 7 December, 2020;
originally announced December 2020.
-
Stabilisation of exact coherent structures in two-dimensional turbulence using time-delayed feedback
Authors:
Dan Lucas,
Tatsuya Yasuda
Abstract:
Time-delayed feedback control, attributed to Pyragas (1992 Physics Letters 170(6) 421-428), is a method known to stabilise periodic orbits in low dimensional chaotic dynamical systems. A system of the form $\dot{\mathbf{x}}(t)=f(\mathbf{x})$ has an additional term $G(\mathbf{x}(t)-\mathbf{x}(t-T))$ introduced where $G$ is some `gain matrix' and $T$ a time delay. The form of the delay term is such that it will vanish for any orbit of period $T,$ therefore making it also an orbit of the uncontrolled system. This non-invasive feature makes the method attractive for stabilising exact coherent structures in fluid turbulence. Here we begin by validating the method for the basic flow in Kolmogorov flow; a two-dimensional incompressible Navier-Stokes flow with a sinusoidal body force. The linear predictions for stabilisation are well captured by direct numerical simulation. By applying an adaptive method to adjust the streamwise translation of the delay, a known travelling wave solution is able to be stabilised up to relatively high Reynolds number. We discover that the famous `odd-number' limitation of this time-delayed feedback method can be overcome in the fluid problem by using the symmetries of the system. This leads to the discovery of 8 additional exact coherent structures which can be stabilised with this approach. This means that certain unstable exact coherent structures can be obtained by simply time-stepping a modified set of equations, thus circumventing the usual convergence algorithms.
Submitted 20 January, 2022; v1 submitted 19 August, 2020;
originally announced August 2020.
-
The Wild McKay Correspondence for Cyclic Groups of Prime Power Order
Authors:
Mahito Tanno,
Takehiko Yasuda
Abstract:
The $\boldsymbol{v}$-function is a key ingredient in the wild McKay correspondence. In this paper, we give a formula to compute it in terms of valuations of Witt vectors, when the given group is a cyclic group of prime power order. We apply it to study singularities of a quotient variety by a cyclic group of prime square order. We give a criterion for whether the stringy motive of the quotient variety converges. Furthermore, if the given representation is indecomposable, then we also give a simple criterion for the quotient variety being terminal, canonical, log canonical, and not log canonical.
Submitted 17 June, 2021; v1 submitted 22 June, 2020;
originally announced June 2020.
-
Charge correlation in V$_2$OPO$_4$ probed by hard x-ray photoemission spectroscopy
Authors:
K. Murota,
E. Pachoud,
J. P. Attfield,
R. Glaum,
T. Yasuda,
D. Ootsuki,
Y. Takagi,
A. Yasui,
D. I. Khomskii,
T. Mizokawa
Abstract:
Electronic properties of V$_2$OPO$_4$ have been investigated by means of hard x-ray photoemission spectroscopy (HAXPES) and subsequent theoretical calculations. The V 1$s$ and 2$p$ HAXPES spectra are consistent with the charge ordering of V$^{2+}$ and V$^{3+}$. The binding energy difference between the V$^{2+}$ and V$^{3+}$ components is unexpectedly large indicating large bonding-antibonding splitting between them in the final states of core level photoemission. The V 1$s$ HAXPES spectrum exhibits a charge transfer satellite which can be analyzed by configuration interaction calculations on a V$_2$O$_9$ cluster. The V 3$d$ spectral weight near the Fermi level is assigned to the 3$d$ $t_{2g}$ orbitals of the V$^{2+}$ site. The broad V 3$d$ spectral distribution is consistent with the strong hybridization between V$^{2+}$ and V$^{3+}$ in the ground state. The core level and valence band HAXPES results indicate substantial charge transfer from the V$^{2+}$ site to the V$^{3+}$ site. (7 figures)
Submitted 7 June, 2020;
originally announced June 2020.
-
Universal equation of state describes osmotic pressure throughout gelation process
Authors:
Takashi Yasuda,
Naoyuki Sakumichi,
Ung-il Chung,
Takamasa Sakai
Abstract:
The equation of state of the osmotic pressure for linear-polymer solutions in good solvents is universally described by a scaling function. We experimentally measure the osmotic pressure of the gelation process via osmotic deswelling. We find that the same scaling function for linear-polymer solutions also describes the osmotic pressure throughout the gelation process involving both the sol and gel states. Furthermore, we reveal that the osmotic pressure of polymer gels is universally governed by the semidilute scaling law of linear-polymer solutions.
Submitted 15 December, 2020; v1 submitted 7 May, 2020;
originally announced May 2020.
-
Graph Spanners in the Message-Passing Model
Authors:
Manuel Fernandez,
David P. Woodruff,
Taisuke Yasuda
Abstract:
Graph spanners are sparse subgraphs which approximately preserve all pairwise shortest-path distances in an input graph. The notion of approximation can be additive, multiplicative, or both, and many variants of this problem have been extensively studied. We study the problem of computing a graph spanner when the edges of the input graph are distributed across two or more sites in an arbitrary, possibly worst-case partition, and the goal is for the sites to minimize the communication used to output a spanner. We assume the message-passing model of communication, for which there is a point-to-point link between all pairs of sites as well as a coordinator who is responsible for producing the output. We stress that the subset of edges that each site has is not related to the network topology, which is fixed to be point-to-point. While this model has been extensively studied for related problems such as graph connectivity, it has not been systematically studied for graph spanners. We present the first tradeoffs for total communication versus the quality of the spanners computed, for two or more sites, as well as for additive and multiplicative notions of distortion. We show separations in the communication complexity when edges are allowed to occur on multiple sites, versus when each edge occurs on at most one site. We obtain nearly tight bounds (up to polylog factors) for the communication of additive $2$-spanners in both the with and without duplication models, multiplicative $(2k-1)$-spanners in the with duplication model, and multiplicative $3$ and $5$-spanners in the without duplication model. Our lower bound for multiplicative $3$-spanners employs biregular bipartite graphs rather than the usual Erdős girth conjecture graphs and may be of wider interest.
Submitted 16 November, 2019; v1 submitted 14 November, 2019;
originally announced November 2019.
-
The Query Complexity of Mastermind with $\ell_p$ Distances
Authors:
Manuel Fernandez,
David P. Woodruff,
Taisuke Yasuda
Abstract:
Consider a variant of the Mastermind game in which queries are $\ell_p$ distances, rather than the usual Hamming distance. That is, a codemaker chooses a hidden vector $\mathbf{y}\in\{-k,-k+1,\dots,k-1,k\}^n$ and answers to queries of the form $\Vert\mathbf{y}-\mathbf{x}\Vert_p$ where $\mathbf{x}\in\{-k,-k+1,\dots,k-1,k\}^n$. The goal is to minimize the number of queries made in order to correctly guess $\mathbf{y}$.
Motivated by this question, in this work, we develop a nonadaptive polynomial time algorithm that works for a natural class of separable distance measures, i.e. coordinate-wise sums of functions of the absolute value. This in particular includes distances such as the smooth max (LogSumExp) as well as many widely-studied $M$-estimator losses, such as $\ell_p$ norms, the $\ell_1$-$\ell_2$ loss, the Huber loss, and the Fair estimator loss. When we apply this result to $\ell_p$ queries, we obtain an upper bound of $O\left(\min\left\{n,\frac{n\log k}{\log n}\right\}\right)$ queries for any real $1\leq p<\infty$. We also show matching lower bounds up to constant factors for the $\ell_p$ problem, even for adaptive algorithms for the approximation version of the problem, in which the problem is to output $\mathbf{y}'$ such that $\Vert\mathbf{y}'-\mathbf{y}\Vert_p\leq R$ for any $R\leq k^{1-\varepsilon}n^{1/p}$ for constant $\varepsilon>0$. Thus, essentially any approximation of this problem is as hard as finding the hidden vector exactly, up to constant factors. Finally, we show that for the noisy version of the problem, i.e. the setting when the codemaker answers queries with any $q = (1\pm\varepsilon)\Vert\mathbf{y}-\mathbf{x}\Vert_p$, there is no query efficient algorithm.
Submitted 23 September, 2019;
originally announced September 2019.
-
Moduli of formal torsors II
Authors:
Fabio Tonini,
Takehiko Yasuda
Abstract:
Applying the authors' preceding work, we construct a version of the moduli space of $G$-torsors over the formal punctured disk for a finite group $G$. To do so, we introduce two Grothendieck topologies, the sur (surjective) and luin (locally universally injective) topologies, and define P-schemes using them as variants of schemes. Our moduli space is defined as a P-scheme approximating the relevant moduli functor. We then prove that Fröhlich's module resolvent gives a locally constructible function on this moduli space, which implies that motivic integrals appearing in the wild McKay correspondence are well-defined.
Submitted 28 June, 2021; v1 submitted 19 September, 2019;
originally announced September 2019.
-
Motivic integration over wild Deligne-Mumford stacks
Authors:
Takehiko Yasuda
Abstract:
We develop the motivic integration theory over formal Deligne-Mumford stacks over a power series ring of arbitrary characteristic. This is a generalization of the corresponding theory for tame and smooth Deligne-Mumford stacks constructed in earlier papers of the author. As an application, we obtain the wild motivic McKay correspondence for linear actions of arbitrary finite groups, which has been known only for cyclic groups of prime order. In particular, this implies the motivic version of Bhargava's mass formula as a special case. In fact, we prove a more general result, the invariance of stringy motives of (stacky) log pairs under crepant morphisms.
Submitted 27 May, 2020; v1 submitted 8 August, 2019;
originally announced August 2019.
-
Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel $k$-means Clustering
Authors:
Manuel Fernandez,
David P. Woodruff,
Taisuke Yasuda
Abstract:
We present tight lower bounds on the number of kernel evaluations required to approximately solve kernel ridge regression (KRR) and kernel $k$-means clustering (KKMC) on $n$ input points. For KRR, our bound for relative error approximation to the minimizer of the objective function is $Ω(nd_{\mathrm{eff}}^λ/\varepsilon)$ where $d_{\mathrm{eff}}^λ$ is the effective statistical dimension, which is tight up to a $\log(d_{\mathrm{eff}}^λ/\varepsilon)$ factor. For KKMC, our bound for finding a $k$-clustering achieving a relative error approximation of the objective function is $Ω(nk/\varepsilon)$, which is tight up to a $\log(k/\varepsilon)$ factor. Our KRR result resolves a variant of an open question of El Alaoui and Mahoney, asking whether the effective statistical dimension is a lower bound on the sampling complexity or not. Furthermore, for the important practical case when the input is a mixture of Gaussians, we provide a KKMC algorithm which bypasses the above lower bound.
Submitted 15 May, 2019;
originally announced May 2019.