Showing 1–19 of 19 results for author: Subedi, U

  1. arXiv:2505.18288  [pdf, ps, other]

    stat.ML cs.LG

    Operator Learning for Schrödinger Equation: Unitarity, Error Bounds, and Time Generalization

    Authors: Yash Patel, Unique Subedi, Ambuj Tewari

    Abstract: We consider the problem of learning the evolution operator for the time-dependent Schrödinger equation, where the Hamiltonian may vary with time. Existing neural network-based surrogates often ignore fundamental properties of the Schrödinger equation, such as linearity and unitarity, and lack theoretical guarantees on prediction error or time generalization. To address this, we introduce a linear…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 25 pages

  2. arXiv:2505.17288  [pdf, ps, other]

    stat.ML cs.LG

    Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation

    Authors: Seamus Somerstep, Vinod Raman, Unique Subedi, Yuekai Sun

    Abstract: Using the bit string generation problem as a case study, we theoretically compare two standard methods for adapting large language models to new tasks. The first, referred to as supervised fine-tuning, involves training a new next token predictor on good generations. The second method, Best-of-N, trains a reward model to select good responses from a collection generated by an unaltered base model.…

    Submitted 22 May, 2025; originally announced May 2025.

  3. arXiv:2504.03503  [pdf, other]

    stat.ML cs.LG

    Operator Learning: A Statistical Perspective

    Authors: Unique Subedi, Ambuj Tewari

    Abstract: Operator learning has emerged as a powerful tool in scientific computing for approximating mappings between infinite-dimensional function spaces. A primary application of operator learning is the development of surrogate models for the solution operators of partial differential equations (PDEs). These methods can also be used to develop black-box simulators to model system behavior from experiment…

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 28 pages, 6 figures

  4. arXiv:2411.01634  [pdf, ps, other]

    cs.LG stat.ML

    Multiclass Transductive Online Learning

    Authors: Steve Hanneke, Vinod Raman, Amirreza Shaeiri, Unique Subedi

    Abstract: We consider the problem of multiclass transductive online learning when the number of labels can be unbounded. Previous works by Ben-David et al. [1997] and Hanneke et al. [2023b] only consider the case of binary and finite label spaces, respectively. The latter work determined that their techniques fail to extend to the case of unbounded label spaces, and they pose the question of characterizing…

    Submitted 3 November, 2024; originally announced November 2024.

    Comments: 23 pages

  5. arXiv:2410.19725  [pdf, other]

    stat.ML cs.LG

    On the Benefits of Active Data Collection in Operator Learning

    Authors: Unique Subedi, Ambuj Tewari

    Abstract: We study active data collection strategies for operator learning when the target operator is linear and the input functions are drawn from a mean-zero stochastic process with continuous covariance kernels. With an active data collection strategy, we establish an error convergence rate in terms of the decay rate of the eigenvalues of the covariance kernel. We can achieve arbitrarily fast error conv…

    Submitted 6 February, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

    Comments: Moved Proofs to the Appendix

  6. arXiv:2408.09004  [pdf, other]

    stat.ML cs.LG math.NA

    Controlling Statistical, Discretization, and Truncation Errors in Learning Fourier Linear Operators

    Authors: Unique Subedi, Ambuj Tewari

    Abstract: We study learning-theoretic foundations of operator learning, using the linear layer of the Fourier Neural Operator architecture as a model problem. First, we identify three main errors that occur during the learning process: statistical error due to finite sample size, truncation error from finite rank approximation of the operator, and discretization error from handling functional data on a fini…

    Submitted 6 February, 2025; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: Added Experiments

  7. arXiv:2405.15424  [pdf, ps, other]

    cs.LG

    Smoothed Online Classification can be Harder than Batch Classification

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We study online classification under smoothed adversaries. In this setting, at each time point, the adversary draws an example from a distribution that has a bounded density with respect to a fixed base measure, which is known a priori to the learner. For binary classification and scalar-valued regression, previous works (Haghtalab et al., 2020; Block et al., 2022) have shown that smoothed onli…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 18 pages

  8. arXiv:2402.06614  [pdf, ps, other]

    cs.LG stat.ML

    The Complexity of Sequential Prediction in Dynamical Systems

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We study the problem of learning to predict the next state of a dynamical system when the underlying evolution function is unknown. Unlike previous work, we place no parametric assumptions on the dynamical system, and study the problem from a learning theory perspective. We define new combinatorial measures and dimensions and show that they quantify the optimal mistake and regret bounds in the rea…

    Submitted 2 June, 2025; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: L4DC Camera Ready

  9. arXiv:2310.19064  [pdf, other]

    cs.LG stat.ML

    Apple Tasting: Combinatorial Dimensions and Minimax Rates

    Authors: Vinod Raman, Unique Subedi, Ananth Raman, Ambuj Tewari

    Abstract: In online binary classification under apple tasting feedback, the learner only observes the true label if it predicts "1". We revisit this classical partial-feedback setting, first studied by Helmbold et al. [2000], and study online learnability from a combinatorial perspective. We show that the Littlestone dimension continues to provide a tight quantitative characterization of apple tast…

    Submitted 18 June, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: 21 pages, COLT 2024 Camera Ready

  10. arXiv:2309.06548  [pdf, ps, other]

    stat.ML cs.LG

    Online Infinite-Dimensional Regression: Learning Linear Operators

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We consider the problem of learning linear operators under squared loss between two infinite-dimensional Hilbert spaces in the online setting. We show that the class of linear operators with uniformly bounded $p$-Schatten norm is online learnable for any $p \in [1, \infty)$. On the other hand, we prove an impossibility result by showing that the class of uniformly bounded linear operators with res…

    Submitted 24 January, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: 21 pages, ALT 2024 Camera Ready

  11. arXiv:2308.04620  [pdf, other]

    cs.LG stat.ML

    Multiclass Online Learnability under Bandit Feedback

    Authors: Ananth Raman, Vinod Raman, Unique Subedi, Idan Mehalel, Ambuj Tewari

    Abstract: We study online multiclass classification under bandit feedback. We extend the results of Daniely and Helbertal [2013] by showing that the finiteness of the Bandit Littlestone dimension is necessary and sufficient for bandit online learnability even when the label space is unbounded. Moreover, we show that, unlike the full-information setting, sequential uniform convergence is necessary but not su…

    Submitted 20 January, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: 16 pages, ALT 2024 Camera Ready

  12. arXiv:2307.03816  [pdf, ps, other]

    cs.LG

    A Combinatorial Characterization of Supervised Online Learnability

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We study the online learnability of hypothesis classes with respect to arbitrary, but bounded loss functions. No characterization of online learnability is known at this level of generality. We give a new scale-sensitive combinatorial dimension, named the sequential minimax dimension, and show that it gives a tight quantitative characterization of online learnability. In addition, we show that the…

    Submitted 9 February, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: 20 pages. arXiv admin note: text overlap with arXiv:2306.06247

  13. arXiv:2306.06247  [pdf, ps, other]

    cs.LG stat.ML

    Online Learning with Set-Valued Feedback

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We study a variant of online multiclass classification where the learner predicts a single label but receives a set of labels as feedback. In this model, the learner is penalized for not outputting a label contained in the revealed set. We show that unlike online multiclass learning with single-label feedback, deterministic and randomized online learnability are not equivalent ev…

    Submitted 18 June, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted to COLT 2024

  14. arXiv:2304.03337  [pdf, ps, other]

    cs.LG stat.ML

    On the Learnability of Multilabel Ranking

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: Multilabel ranking is a central task in machine learning. However, the most fundamental question of learnability in a multilabel ranking setting with relevance-score feedback remains unanswered. In this work, we characterize the learnability of multilabel ranking problems in both batch and online settings for a large family of ranking losses. Along the way, we give two equivalence classes of ranki…

    Submitted 25 May, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 28 pages

  15. arXiv:2303.17716  [pdf, ps, other]

    cs.LG stat.ML

    Multiclass Online Learning and Uniform Convergence

    Authors: Steve Hanneke, Shay Moran, Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We study multiclass classification in the agnostic adversarial online learning setting. As our main result, we prove that any multiclass concept class is agnostically learnable if and only if its Littlestone dimension is finite. This solves an open problem studied by Daniely, Sabato, Ben-David, and Shalev-Shwartz (2011, 2015) who handled the case when the number of classes (or labels) is bounded. W…

    Submitted 7 July, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: COLT Camera-Ready, 15 pages

  16. arXiv:2301.02729  [pdf, ps, other]

    cs.LG stat.ML

    A Characterization of Multioutput Learnability

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: We consider the problem of learning multioutput function classes in the batch and online settings. In both settings, we show that a multioutput function class is learnable if and only if each single-output restriction of the function class is learnable. This provides a complete characterization of the learnability of multilabel classification and multioutput regression in both batch and online set…

    Submitted 24 November, 2024; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: 54 pages; JMLR version

  17. arXiv:2211.05656  [pdf, other]

    cs.LG stat.ML

    On Proper Learnability between Average- and Worst-case Robustness

    Authors: Vinod Raman, Unique Subedi, Ambuj Tewari

    Abstract: Recently, Montasser et al. [2019] showed that finite VC dimension is not sufficient for proper adversarially robust PAC learning. In light of this hardness, there is a growing effort to study what type of relaxations to the adversarially robust PAC learning setup can enable proper learnability. In this work, we initiate the study of proper learning under relaxations of the worst-case robust loss.…

    Submitted 25 May, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: 19 pages

  18. A conjectural asymptotic formula for multiplicative chaos in number theory

    Authors: Daksh Aggarwal, Unique Subedi, William Verreault, Asif Zaman, Chenghui Zheng

    Abstract: We investigate a special sequence of random variables $A(N)$ defined by an exponential power series with independent standard complex Gaussians $(X(k))_{k \geq 1}$. Introduced by Hughes, Keating, and O'Connell in the study of random matrix theory, this sequence relates to Gaussian multiplicative chaos (in particular "holomorphic multiplicative chaos" per Najnudel, Paquette, and Simm) and random m…

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: 18 pages

    MSC Class: 60-08 (Primary); 60-11; 60F99; 11K65 (Secondary)

    Journal ref: Res. Number Theory 8, 35 (2022)

  19. Sums of random multiplicative functions over function fields with few irreducible factors

    Authors: Daksh Aggarwal, Unique Subedi, William Verreault, Asif Zaman, Chenghui Zheng

    Abstract: We establish a normal approximation for the limiting distribution of partial sums of random Rademacher multiplicative functions over function fields, provided the number of irreducible factors of the polynomials is small enough. This parallels work of Harper for random Rademacher multiplicative functions over the integers.

    Submitted 28 January, 2022; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: 10 pages. Simplification of the proof of Lemma 5 and typos corrected, one reference added

    MSC Class: 11K65 (Primary) 60F05; 60G50 (Secondary)

    Journal ref: Math. Proc. Camb. Phil. Soc. (2022), 1-12