Showing 1–7 of 7 results for author: Nguyen, T N

  1. arXiv:2411.01808  [pdf, ps, other]

    cs.LG stat.ML

    Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification

    Authors: Kapilan Balagopalan, Tuan Ngo Nguyen, Yao Zhao, Kwang-Sung Jun

    Abstract: The best arm identification problem requires identifying the best alternative (i.e., arm) in active experimentation using the smallest number of experiments (i.e., arm pulls), which is crucial for cost-efficient and timely decision-making processes. In the fixed confidence setting, an algorithm must stop data-dependently and return the estimated best arm with a correctness guarantee. Since this st…

    Submitted 14 June, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: Accepted by ICML 2025. This version fixes a minor typo in Lemma C.3 of the camera-ready version.

  2. arXiv:2411.00405  [pdf, other]

    stat.ML cs.LG

    HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search

    Authors: Tuan Ngo Nguyen, Jay Barrett, Kwang-Sung Jun

    Abstract: We study the problem of estimating the \emph{value} of the largest mean among K distributions via samples from them (rather than estimating \emph{which} distribution has the largest mean), which arises from various machine learning tasks including Q-learning and Monte Carlo Tree Search (MCTS). While there have been a few proposed algorithms, their performance analyses have been limited to their bi…

    Submitted 28 April, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: In Proceedings of the Artificial Intelligence and Statistics (AISTATS) 2025

  3. arXiv:2208.10536  [pdf]

    stat.AP cs.LG

    A Meta-Analysis of Solar Forecasting Based on Skill Score

    Authors: Thi Ngoc Nguyen, Felix Müsgens

    Abstract: We conduct the first comprehensive meta-analysis of deterministic solar forecasting based on skill score, screening 1,447 papers from Google Scholar and reviewing the full texts of 320 papers for data extraction. A database of 4,687 points was built and analyzed with multivariate adaptive regression spline modelling, partial dependence plots, and linear regression. The marginal impacts on skill sc…

    Submitted 12 April, 2023; v1 submitted 22 August, 2022; originally announced August 2022.

  4. arXiv:2111.02092  [pdf]

    stat.AP econ.EM stat.ME

    What drives the accuracy of PV output forecasts?

    Authors: Thi Ngoc Nguyen, Felix Müsgens

    Abstract: Due to the stochastic nature of photovoltaic (PV) power generation, there is high demand for forecasting PV output to better integrate PV generation into power grids. Systematic knowledge regarding the factors influencing forecast accuracy is crucially important, but still mostly unknown. In this paper, we review 180 papers on PV forecasts and extract a database of forecast errors for statistical…

    Submitted 3 November, 2021; originally announced November 2021.

  5. arXiv:2102.09030  [pdf, other]

    cs.LG math.OC stat.ML

    Proactive DP: A Multiple Target Optimization Framework for DP-SGD

    Authors: Marten van Dijk, Nhuong V. Nguyen, Toan N. Nguyen, Lam M. Nguyen, Phuong Ha Nguyen

    Abstract: We introduce a multiple target optimization framework for DP-SGD referred to as pro-active DP. In contrast to traditional DP accountants, which are used to track the expenditure of privacy budgets, the pro-active DP scheme allows one to a-priori select parameters of DP-SGD based on a fixed privacy budget (in terms of $ε$ and $δ$) in such a way to optimize the anticipated utility (test accuracy) th…

    Submitted 4 June, 2024; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: text overlap with arXiv:2007.09208, changes in contents and title

  6. arXiv:2010.14763  [pdf, other]

    cs.LG math.OC stat.ML

    Hogwild! over Distributed Local Data Sets with Linearly Increasing Mini-Batch Sizes

    Authors: Marten van Dijk, Nhuong V. Nguyen, Toan N. Nguyen, Lam M. Nguyen, Quoc Tran-Dinh, Phuong Ha Nguyen

    Abstract: Hogwild! implements asynchronous Stochastic Gradient Descent (SGD) where multiple threads in parallel access a common repository containing training data, perform SGD iterations and update shared state that represents a jointly learned (global) model. We consider big data analysis where training data is distributed among local data sets in a heterogeneous way -- and we wish to move SGD computation…

    Submitted 26 February, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: AISTATS 2021. arXiv admin note: substantial text overlap with arXiv:2007.09208

  7. arXiv:2007.09208  [pdf, other]

    cs.LG cs.CR math.OC stat.ML

    Asynchronous Federated Learning with Reduced Number of Rounds and with Differential Privacy from Less Aggregated Gaussian Noise

    Authors: Marten van Dijk, Nhuong V. Nguyen, Toan N. Nguyen, Lam M. Nguyen, Quoc Tran-Dinh, Phuong Ha Nguyen

    Abstract: The feasibility of federated learning is highly constrained by the server-clients infrastructure in terms of network communication. Most newly launched smartphones and IoT devices are equipped with GPUs or sufficient computing hardware to run powerful AI models. However, in case of the original synchronous federated learning, client devices suffer waiting times and regular communication between cl…

    Submitted 17 July, 2020; originally announced July 2020.