Search | arXiv e-print repository

Revisiting Le Cam's Equation: Exact Minimax Rates over Convex Density Classes

Authors: Shamindra Shrotriya, Matey Neykov

Abstract: We study the classical problem of deriving minimax rates for density estimation over convex density classes. Building on the pioneering work of Le Cam (1973), Birge (1983, 1986), Wong and Shen (1995), Yang and Barron (1999), we determine the exact (up to constants) minimax rate over any convex density class. This work thus extends these known results by demonstrating that the local metric entropy… ▽ More We study the classical problem of deriving minimax rates for density estimation over convex density classes. Building on the pioneering work of Le Cam (1973), Birge (1983, 1986), Wong and Shen (1995), Yang and Barron (1999), we determine the exact (up to constants) minimax rate over any convex density class. This work thus extends these known results by demonstrating that the local metric entropy of the density class always captures the minimax optimal rates under such settings. Our bounds provide a unifying perspective across both parametric and nonparametric convex density classes, under weaker assumptions on the richness of the density class than previously considered. Our proposed `multistage sieve' MLE applies to any such convex density class. We further demonstrate that this estimator is also adaptive to the true underlying density of interest. We apply our risk bounds to rederive known minimax rates including bounded total variation, and Holder density classes. We further illustrate the utility of the result by deriving upper bounds for less studied classes, e.g., convex mixture of densities. △ Less

Submitted 23 October, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

Comments: Total paper (46 pages, 2 figures): Main paper (17 pages, 2 figures) + Appendix (29 pages). Updated to include proof of adaptivity of estimator

arXiv:2207.07075 [pdf, other]

Adversarial Sign-Corrupted Isotonic Regression

Authors: Shamindra Shrotriya, Matey Neykov

Abstract: Classical univariate isotonic regression involves nonparametric estimation under a monotonicity constraint of the true signal. We consider a variation of this generating process, which we term adversarial sign-corrupted isotonic (\texttt{ASCI}) regression. Under this \texttt{ASCI} setting, the adversary has full access to the true isotonic responses, and is free to sign-corrupt them. Estimating th… ▽ More Classical univariate isotonic regression involves nonparametric estimation under a monotonicity constraint of the true signal. We consider a variation of this generating process, which we term adversarial sign-corrupted isotonic (\texttt{ASCI}) regression. Under this \texttt{ASCI} setting, the adversary has full access to the true isotonic responses, and is free to sign-corrupt them. Estimating the true monotonic signal given these sign-corrupted responses is a highly challenging task. Notably, the sign-corruptions are designed to violate monotonicity, and possibly induce heavy dependence between the corrupted response terms. In this sense, \texttt{ASCI} regression may be viewed as an adversarial stress test for isotonic regression. Our motivation is driven by understanding whether efficient robust estimation of the monotone signal is feasible under this adversarial setting. We develop \texttt{ASCIFIT}, a three-step estimation procedure under the \texttt{ASCI} setting. The \texttt{ASCIFIT} procedure is conceptually simple, easy to implement with existing software, and consists of applying the \texttt{PAVA} with crucial pre- and post-processing corrections. We formalize this procedure, and demonstrate its theoretical guarantees in the form of sharp high probability upper bounds and minimax lower bounds. We illustrate our findings with detailed simulations. △ Less

Submitted 14 July, 2022; originally announced July 2022.

Comments: Total paper (52 pages, 2 figures): Main paper (13 pages, 2 figures) + Appendix (39 pages)

arXiv:2110.10825 [pdf, other]

$\ell_{\infty}$-Bounds of the MLE in the BTL Model under General Comparison Graphs

Authors: Wanshan Li, Shamindra Shrotriya, Alessandro Rinaldo

Abstract: The Bradley-Terry-Luce (BTL) model is a popular statistical approach for estimating the global ranking of a collection of items using pairwise comparisons. To ensure accurate ranking, it is essential to obtain precise estimates of the model parameters in the $\ell_{\infty}$-loss. The difficulty of this task depends crucially on the topology of the pairwise comparison graph over the given items. Ho… ▽ More The Bradley-Terry-Luce (BTL) model is a popular statistical approach for estimating the global ranking of a collection of items using pairwise comparisons. To ensure accurate ranking, it is essential to obtain precise estimates of the model parameters in the $\ell_{\infty}$-loss. The difficulty of this task depends crucially on the topology of the pairwise comparison graph over the given items. However, beyond very few well-studied cases, such as the complete and Erdös-Rényi comparison graphs, little is known about the performance of the maximum likelihood estimator MLE) of the BTL model parameters in the $\ell_{\infty}$-loss under more general graph topologies. In this paper, we derive novel, general upper bounds on the $\ell_{\infty}$ estimation error of the BTL MLE that depend explicitly on the algebraic connectivity of the comparison graph, the maximal performance gap across items and the sample complexity. We demonstrate that the derived bounds perform well and in some cases are sharper compared to known results obtained using different loss functions and more restricted assumptions and graph topologies. We carefully compare our results to Yan et al. (2012), which is closest in spirit to our work. We further provide minimax lower bounds under $\ell_{\infty}$-error that nearly match the upper bounds over a class of sufficiently regular graph topologies. Finally, we study the implications of our $\ell_{\infty}$-bounds for efficient (offline) tournament design. We illustrate and discuss our findings through various examples and simulations. △ Less

Submitted 22 June, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

Comments: Accepted for the 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022), 43 pages, 7 figures

arXiv:2106.11188 [pdf, other]

maars: Tidy Inference under the 'Models as Approximations' Framework in R

Authors: Riccardo Fogliato, Shamindra Shrotriya, Arun Kumar Kuchibhotla

Abstract: Linear regression using ordinary least squares (OLS) is a critical part of every statistician's toolkit. In R, this is elegantly implemented via lm() and its related functions. However, the statistical inference output from this suite of functions is based on the assumption that the model is well specified. This assumption is often unrealistic and at best satisfied approximately. In the statistics… ▽ More Linear regression using ordinary least squares (OLS) is a critical part of every statistician's toolkit. In R, this is elegantly implemented via lm() and its related functions. However, the statistical inference output from this suite of functions is based on the assumption that the model is well specified. This assumption is often unrealistic and at best satisfied approximately. In the statistics and econometrics literature, this has long been recognized and a large body of work provides inference for OLS under more practical assumptions. This can be seen as model-free inference. In this paper, we introduce our package maars ("models as approximations") that aims at bringing research on model-free inference to R via a comprehensive workflow. The maars package differs from other packages that also implement variance estimation, such as sandwich, in three key ways. First, all functions in maars follow a consistent grammar and return output in tidy format, with minimal deviation from the typical lm() workflow. Second, maars contains several tools for inference including empirical, multiplier, residual bootstrap, and subsampling, for easy comparison. Third, maars is developed with pedagogy in mind. For this, most of its functions explicitly return the assumptions under which the output is valid. This key innovation makes maars useful in teaching inference under misspecification and also a powerful tool for applied researchers. We hope our default feature of explicitly presenting assumptions will become a de facto standard for most statistical modeling in R. △ Less

Submitted 21 June, 2021; originally announced June 2021.

Comments: The first two authors contributed equally to this work and are ordered alphabetically

arXiv:2003.00083 [pdf, other]

Nonparametric Estimation in the Dynamic Bradley-Terry Model

Authors: Heejong Bong, Wanshan Li, Shamindra Shrotriya, Alessandro Rinaldo

Abstract: We propose a time-varying generalization of the Bradley-Terry model that allows for nonparametric modeling of dynamic global rankings of distinct teams. We develop a novel estimator that relies on kernel smoothing to pre-process the pairwise comparisons over time and is applicable in sparse settings where the Bradley-Terry may not be fit. We obtain necessary and sufficient conditions for the exist… ▽ More We propose a time-varying generalization of the Bradley-Terry model that allows for nonparametric modeling of dynamic global rankings of distinct teams. We develop a novel estimator that relies on kernel smoothing to pre-process the pairwise comparisons over time and is applicable in sparse settings where the Bradley-Terry may not be fit. We obtain necessary and sufficient conditions for the existence and uniqueness of our estimator. We also derive time-varying oracle bounds for both the estimation error and the excess risk in the model-agnostic setting where the Bradley-Terry model is not necessarily the true data generating process. We thoroughly test the practical effectiveness of our model using both simulated and real world data and suggest an efficient data-driven approach for bandwidth tuning. △ Less

Submitted 28 February, 2020; originally announced March 2020.

Comments: To appear in AISTATS 2020

arXiv:1701.07032 [pdf, other]

doi 10.1051/0004-6361/201629685

Delay-time distribution of core-collapse supernovae with late events resulting from binary interaction

Authors: E. Zapartas, S. E. de Mink, R. G. Izzard, S. -C. Yoon, C. Badenes, Y. Gotberg, A. de Koter, C. J. Neijssel, M. Renzo, A. Schootemeijer, T. S. Shrotriya

Abstract: Most massive stars, the progenitors of core-collapse supernovae, are in close binary systems and may interact with their companion through mass transfer or merging. We undertake a population synthesis study to compute the delay-time distribution of core-collapse supernovae, that is, the supernova rate versus time following a starburst, taking into account binary interactions. We test the systemati… ▽ More Most massive stars, the progenitors of core-collapse supernovae, are in close binary systems and may interact with their companion through mass transfer or merging. We undertake a population synthesis study to compute the delay-time distribution of core-collapse supernovae, that is, the supernova rate versus time following a starburst, taking into account binary interactions. We test the systematic robustness of our results by running various simulations to account for the uncertainties in our standard assumptions. We find that a significant fraction, $15^{+9}_{-8}$%, of core-collapse supernovae are `late', that is, they occur 50-200 Myrs after birth, when all massive single stars have already exploded. These late events originate predominantly from binary systems with at least one, or, in most cases, with both stars initially being of intermediate mass ($4-8M_{\odot}$). The main evolutionary channels that contribute often involve either the merging of the initially more massive primary star with its companion or the engulfment of the remaining core of the primary by the expanding secondary that has accreted mass at an earlier evolutionary stage. Also, the total number of core-collapse supernovae increases by $14^{+15}_{-14}$% because of binarity for the same initial stellar mass. The high rate implies that we should have already observed such late core-collapse supernovae, but have not recognized them as such. We argue that $φ$ Persei is a likely progenitor and that eccentric neutron star - white dwarf systems are likely descendants. Late events can help explain the discrepancy in the delay-time distributions derived from supernova remnants in the Magellanic Clouds and extragalactic type Ia events, lowering the contribution of prompt Ia events. We discuss ways to test these predictions and speculate on the implications for supernova feedback in simulations of galaxy evolution. △ Less

Submitted 24 January, 2017; originally announced January 2017.

Comments: Accepted for publication in Astronomy & Astrophysics

Journal ref: A&A 601, A29 (2017)

arXiv:1602.01851 [pdf, other]

doi 10.3847/0004-637X/820/1/56

The K2-ESPRINT Project II: Spectroscopic follow-up of three exoplanet systems from Campaign 1 of K2

Authors: Vincent Van Eylen, Grzegorz Nowak, Simon Albrecht, Enric Palle, Ignasi Ribas, Hans Bruntt, Manuel Perger, Davide Gandolfi, Teriyuki Hirano, Roberto Sanchis-Ojeda, Amanda Kiilerich, Jorge P. Arranz, Mariona Badenas, Fei Dai, Hans J. Deeg, Eike W. Guenther, Pilar Montanes-Rodriguez, Norio Narita, Leslie A. Rogers, Victor J. S. Bejar, Tushar S. Shrotriya, Joshua N. Winn, Daniel Sebastian

Abstract: We report on Doppler observations of three transiting planet candidates that were detected during Campaign 1 of the K2 mission. The Doppler observations were conducted with FIES, HARPS-N and HARPS. We measure the mass of K2-27b (EPIC 201546283b), and provide constraints and upper limits for EPIC 201295312b and EPIC 201577035b. K2-27b is a warm Neptune orbiting its host star in 6.77 days and has a… ▽ More We report on Doppler observations of three transiting planet candidates that were detected during Campaign 1 of the K2 mission. The Doppler observations were conducted with FIES, HARPS-N and HARPS. We measure the mass of K2-27b (EPIC 201546283b), and provide constraints and upper limits for EPIC 201295312b and EPIC 201577035b. K2-27b is a warm Neptune orbiting its host star in 6.77 days and has a radius of $4.45^{+0.33}_{-0.33}~\mathrm{R_\oplus}$ and a mass of $29.1^{+7.5}_{-7.4}~\mathrm{M_\oplus}$, which leads to a mean density of $1.80^{+0.70}_{-0.55}~\mathrm{g~cm^{-3}}$. EPIC 201295312b is smaller than Neptune with an orbital period of 5.66 days, radius $2.75^{+0.24}_{-0.22}~\mathrm{R_\oplus}$ and we constrain the mass to be below $12~\mathrm{M_\oplus}$ at 95% confidence. We also find a long-term trend indicative of another body in the system. EPIC 201577035b, previously confirmed as the planet K2-10b, is smaller than Neptune orbiting its host star in 19.3 days, with radius $3.84^{+0.35}_{-0.34}~\mathrm{R_\oplus}$. We determine its mass to be $27^{+17}_{-16}~\mathrm{M_\oplus}$, with a 95% confidence uppler limit at $57~\mathrm{M_\oplus}$, and mean density $2.6^{+2.1}_{-1.6}~{\rm g~cm}^{-3}$. These measurements join the relatively small collection of planets smaller than Neptune with measurements or constraints of the mean density. Our code for performing K2 photometry and detecting planetary transits is now publicly available. △ Less

Submitted 4 February, 2016; originally announced February 2016.

Comments: Accepted for publication in ApJ

Showing 1–7 of 7 results for author: Shrotriya, S