Skip to main content

Showing 1–11 of 11 results for author: Moka, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.10099  [pdf, other

    stat.ML cs.LG math.OC q-fin.PM

    A Scalable Gradient-Based Optimization Framework for Sparse Minimum-Variance Portfolio Selection

    Authors: Sarat Moka, Matias Quiroz, Vali Asimit, Samuel Muller

    Abstract: Portfolio optimization involves selecting asset weights to minimize a risk-reward objective, such as the portfolio variance in the classical minimum-variance framework. Sparse portfolio selection extends this by imposing a cardinality constraint: only $k$ assets from a universe of $p$ may be included. The standard approach models this problem as a mixed-integer quadratic program and relies on comm… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

  2. arXiv:2504.10530  [pdf, other

    math.PR stat.CO

    Efficient Rare-Event Simulation for Random Geometric Graphs via Importance Sampling

    Authors: Sarat Moka, Christian Hirsch, Volker Schmidt, Dirk Kroese

    Abstract: Random geometric graphs defined on Euclidean subspaces, also called Gilbert graphs, are widely used to model spatially embedded networks across various domains. In such graphs, nodes are located at random in Euclidean space, and any two nodes are connected by an edge if they lie within a certain distance threshold. Accurately estimating rare-event probabilities related to key properties of these g… ▽ More

    Submitted 15 April, 2025; v1 submitted 12 April, 2025; originally announced April 2025.

    Comments: 29 Pages, 2 figures

  3. arXiv:2409.05052  [pdf, other

    stat.AP

    Rating Players of Counter-Strike: Global Offensive Based on Plus/Minus value

    Authors: Hongyu Xu, Sarat Moka

    Abstract: We propose a player rating mechanism for Counter-Strike: Global Offensive (CS ), a popular e-sport, by analyzing players' Plus/Minus values. The Plus/Minus value represents the average point difference between a player's team and the opponent's team across all matches the player has participated in. Using models such as regularized linear regression, logistic regression, and Bayesian linear models… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 8 pages

  4. arXiv:2407.03383  [pdf, other

    stat.ME stat.CO stat.ML

    Continuous Optimization for Offline Change Point Detection and Estimation

    Authors: Hans Reimann, Sarat Moka, Georgy Sofronov

    Abstract: This work explores use of novel advances in best subset selection for regression modelling via continuous optimization for offline change point detection and estimation in univariate Gaussian data sequences. The approach exploits reformulating the normal mean multiple change point model into a regularized statistical inverse problem enforcing sparsity. After introducing the problem statement, crit… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2404.13339  [pdf, other

    stat.ME stat.CO

    Group COMBSS: Group Selection via Continuous Optimization

    Authors: Anant Mathur, Sarat Moka, Benoit Liquet, Zdravko Botev

    Abstract: We present a new optimization method for the group selection problem in linear regression. In this problem, predictors are assumed to have a natural group structure and the goal is to select a small set of groups that best fits the response. The incorporation of group structure in a predictor matrix is a key factor in obtaining better estimators and identifying associations between response and pr… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  6. arXiv:2403.20007  [pdf, other

    stat.ME stat.CO stat.OT

    Best Subset Solution Path for Linear Dimension Reduction Models using Continuous Optimization

    Authors: Benoit Liquet, Sarat Moka, Samuel Muller

    Abstract: The selection of best variables is a challenging problem in supervised and unsupervised learning, especially in high dimensional contexts where the number of variables is usually much larger than the number of observations. In this paper, we focus on two multivariate statistical methods: principal components analysis and partial least squares. Both approaches are popular linear dimension-reduction… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Main paper 26 pages including references and 17 pages for the supplementary material

  7. arXiv:2403.13076  [pdf, other

    stat.ME

    Spatial Autoregressive Model on a Dirichlet Distribution

    Authors: Teo Nguyen, Sarat Moka, Kerrie Mengersen, Benoit Liquet

    Abstract: Compositional data find broad application across diverse fields due to their efficacy in representing proportions or percentages of various components within a whole. Spatial dependencies often exist in compositional data, particularly when the data represents different land uses or ecological variables. Ignoring the spatial autocorrelations in modelling of compositional data may lead to incorrect… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 33 pages, 2 figures, submitted to "Computational Statistics & Data Analysis"

  8. arXiv:2311.11236  [pdf, other

    stat.ME stat.CO

    Generalized Linear Models via the Lasso: To Scale or Not to Scale?

    Authors: Anant Mathur, Sarat Moka, Zdravko Botev

    Abstract: The Lasso regression is a popular regularization method for feature selection in statistics. Prior to computing the Lasso estimator in both linear and generalized linear models, it is common to conduct a preliminary rescaling of the feature matrix to ensure that all the features are standardized. Without this standardization, it is argued, the Lasso estimate will unfortunately depend on the units… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

  9. arXiv:2304.09678  [pdf, other

    stat.ME stat.CO

    Column Subset Selection and Nyström Approximation via Continuous Optimization

    Authors: Anant Mathur, Sarat Moka, Zdravko Botev

    Abstract: We propose a continuous optimization algorithm for the Column Subset Selection Problem (CSSP) and Nyström approximation. The CSSP and Nyström method construct low-rank approximations of matrices based on a predetermined subset of columns. It is well known that choosing the best column subset of size $k$ is a difficult combinatorial problem. In this work, we show how one can approximate the optimal… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  10. arXiv:2205.02617  [pdf, other

    stat.ME stat.CO

    COMBSS: Best Subset Selection via Continuous Optimization

    Authors: Sarat Moka, Benoit Liquet, Houying Zhu, Samuel Muller

    Abstract: The problem of best subset selection in linear regression is considered with the aim to find a fixed size subset of features that best fits the response. This is particularly challenging when the total available number of features is very large compared to the number of data samples. Existing optimal methods for solving this problem tend to be slow while fast methods tend to have low accuracy. Ide… ▽ More

    Submitted 24 November, 2023; v1 submitted 5 May, 2022; originally announced May 2022.

  11. arXiv:2106.14565   

    stat.ML cs.LG stat.CO

    Variance Reduction for Matrix Computations with Applications to Gaussian Processes

    Authors: Anant Mathur, Sarat Moka, Zdravko Botev

    Abstract: In addition to recent developments in computing speed and memory, methodological advances have contributed to significant gains in the performance of stochastic simulation. In this paper, we focus on variance reduction for matrix computations via matrix factorization. We provide insights into existing variance reduction methods for estimating the entries of large matrices. Popular methods do not e… ▽ More

    Submitted 26 March, 2023; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Unable to be updated