Showing 1–23 of 23 results for author: Nevmyvaka, Y

  1. arXiv:2505.11821  [pdf, other]

    cs.LG

    Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment

    Authors: Siliang Zeng, Quan Wei, William Brown, Oana Frunza, Yuriy Nevmyvaka, Mingyi Hong

    Abstract: This paper investigates approaches to enhance the reasoning capabilities of Large Language Model (LLM) agents using Reinforcement Learning (RL). Specifically, we focus on multi-turn tool-use scenarios, which can be naturally modeled as Markov Decision Processes (MDPs). While existing approaches often train multi-turn LLM agents with trajectory-level advantage estimation in bandit settings, they st…

    Submitted 17 May, 2025; originally announced May 2025.

    Comments: work in progress

  2. arXiv:2503.13709  [pdf, other]

    cs.LG

    Multi-modal Time Series Analysis: A Tutorial and Survey

    Authors: Yushan Jiang, Kanghui Ning, Zijie Pan, Xuyang Shen, Jingchao Ni, Wenchao Yu, Anderson Schneider, Haifeng Chen, Yuriy Nevmyvaka, Dongjin Song

    Abstract: Multi-modal time series analysis has recently emerged as a prominent research area in data mining, driven by the increasing availability of diverse data modalities, such as text, images, and structured tabular data from real-world sources. However, effective analysis of multi-modal time series is hindered by data heterogeneity, modality gap, misalignment, and inherent noise. Recent advancements in…

    Submitted 17 March, 2025; originally announced March 2025.

  3. arXiv:2503.07649  [pdf, other]

    cs.LG cs.AI

    TS-RAG: Retrieval-Augmented Generation based Time Series Foundation Models are Stronger Zero-Shot Forecaster

    Authors: Kanghui Ning, Zijie Pan, Yu Liu, Yushan Jiang, James Y. Zhang, Kashif Rasul, Anderson Schneider, Lintao Ma, Yuriy Nevmyvaka, Dongjin Song

    Abstract: Large Language Models (LLMs) and Foundation Models (FMs) have recently become prevalent for time series forecasting tasks. While fine-tuning LLMs enables domain adaptation, they often struggle to generalize across diverse and unseen datasets. Moreover, existing Time Series Foundation Models (TSFMs) still face challenges in handling non-stationary dynamics and distribution shifts, largely due to th…

    Submitted 27 May, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

  4. arXiv:2502.02410  [pdf, ps, other]

    cs.LG cs.CR stat.ML

    Privacy Amplification by Structured Subsampling for Deep Differentially Private Time Series Forecasting

    Authors: Jan Schuchardt, Mina Dalirrooyfard, Jed Guzelkabaagac, Anderson Schneider, Yuriy Nevmyvaka, Stephan Günnemann

    Abstract: Many forms of sensitive data, such as web traffic, mobility data, or hospital occupancy, are inherently sequential. The standard method for training machine learning models while ensuring privacy for units of sensitive information, such as individual hospital visits, is differentially private stochastic gradient descent (DP-SGD). However, we observe in this work that the formal guarantees of DP-SG…

    Submitted 29 May, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: Accepted as ICML 2025 Spotlight

  5. arXiv:2501.16675  [pdf, other]

    stat.ML cs.LG

    Variational Schrödinger Momentum Diffusion

    Authors: Kevin Rojas, Yixin Tan, Molei Tao, Yuriy Nevmyvaka, Wei Deng

    Abstract: The momentum Schrödinger Bridge (mSB) has emerged as a leading method for accelerating generative diffusion processes and reducing transport costs. However, the lack of simulation-free properties inevitably results in high training costs and affects scalability. To obtain a trade-off between transport properties and scalability, we introduce variational Schrödinger momentum diffusion (VSMD), which…

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: AISTATS 25

  6. arXiv:2501.02353  [pdf, other]

    cs.LG stat.ML

    Reweighting Improves Conditional Risk Bounds

    Authors: Yikai Zhang, Jiahe Lin, Fengpei Li, Songzhu Zheng, Anant Raj, Anderson Schneider, Yuriy Nevmyvaka

    Abstract: In this work, we study the weighted empirical risk minimization (weighted ERM) schema, in which an additional data-dependent weight function is incorporated when the empirical risk function is being minimized. We show that under a general ``balanceable" Bernstein condition, one can design a weighted ERM estimator to achieve superior performance in certain sub-regions over the one obtained from sta…

    Submitted 4 January, 2025; originally announced January 2025.

    Comments: 33 pages

    ACM Class: G.3; I.3

  7. arXiv:2412.11257  [pdf, ps, other]

    stat.ML cs.CE cs.LG q-fin.PR

    Prediction-Enhanced Monte Carlo: A Machine Learning View on Control Variate

    Authors: Fengpei Li, Haoxian Chen, Jiahe Lin, Arkin Gupta, Xiaowei Tan, Honglei Zhao, Gang Xu, Yuriy Nevmyvaka, Agostino Capponi, Henry Lam

    Abstract: For many complex simulation tasks spanning areas such as healthcare, engineering, and finance, Monte Carlo (MC) methods are invaluable due to their unbiased estimates and precise error quantification. Nevertheless, Monte Carlo simulations often become computationally prohibitive, especially for nested, multi-level, or path-dependent evaluations lacking effective variance reduction techniques. Whil…

    Submitted 7 June, 2025; v1 submitted 15 December, 2024; originally announced December 2024.

  8. arXiv:2408.01798  [pdf, ps, other]

    cs.DS cs.CR

    Differentially Private Gomory-Hu Trees

    Authors: Anders Aamand, Justin Y. Chen, Mina Dalirrooyfard, Slobodan Mitrović, Yuriy Nevmyvaka, Sandeep Silwal, Yinzhan Xu

    Abstract: Given an undirected, weighted $n$-vertex graph $G = (V, E, w)$, a Gomory-Hu tree $T$ is a weighted tree on $V$ such that for any pair of distinct vertices $s, t \in V$, the Min-$s$-$t$-Cut on $T$ is also a Min-$s$-$t$-Cut on $G$. Computing a Gomory-Hu tree is a well-studied problem in graph algorithms and has received considerable attention. In particular, a long line of work recently culminated i…

    Submitted 3 August, 2024; originally announced August 2024.

  9. arXiv:2405.04795  [pdf, other]

    cs.LG

    Variational Schrödinger Diffusion Models

    Authors: Wei Deng, Weijian Luo, Yixin Tan, Marin Biloš, Yu Chen, Yuriy Nevmyvaka, Ricky T. Q. Chen

    Abstract: Schrödinger bridge (SB) has emerged as the go-to method for optimizing transportation plans in diffusion models. However, SB requires estimating the intractable forward score functions, inevitably resulting in the costly implicit training loss based on simulated trajectories. To improve the scalability while preserving efficient transportation plans, we leverage variational inference to linearize…

    Submitted 24 May, 2025; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  10. arXiv:2404.07377  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.IT

    Deep Generative Sampling in the Dual Divergence Space: A Data-efficient & Interpretative Approach for Generative AI

    Authors: Sahil Garg, Anderson Schneider, Anant Raj, Kashif Rasul, Yuriy Nevmyvaka, Sneihil Gopal, Amit Dhurandhar, Guillermo Cecchi, Irina Rish

    Abstract: Building on the remarkable achievements in generative sampling of natural images, we propose an innovative challenge, potentially overly ambitious, which involves generating samples of entire multivariate time series that resemble images. However, the statistical challenge lies in the small sample size, sometimes consisting of a few hundred subjects. This issue is especially problematic for deep g…

    Submitted 10 April, 2024; originally announced April 2024.

  11. arXiv:2403.05798  [pdf, other]

    cs.LG

    $\textbf{S}^2$IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series Forecasting

    Authors: Zijie Pan, Yushan Jiang, Sahil Garg, Anderson Schneider, Yuriy Nevmyvaka, Dongjin Song

    Abstract: Recently, there has been a growing interest in leveraging pre-trained large language models (LLMs) for various time series applications. However, the semantic space of LLMs, established through the pre-training, is still underexplored and may help yield more distinctive and informative representations to facilitate time series forecasting. To this end, we propose Semantic Space Informed Prompt lea…

    Submitted 7 July, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  12. arXiv:2402.15485  [pdf, other]

    cs.DS

    Graph Partitioning With Limited Moves

    Authors: Majid Behbahani, Mina Dalirrooyfard, Elaheh Fata, Yuriy Nevmyvaka

    Abstract: In many real world networks, there already exists a (not necessarily optimal) $k$-partitioning of the network. Oftentimes, one aims to find a $k$-partitioning with a smaller cut value for such networks by moving only a few nodes across partitions. The number of nodes that can be moved across partitions is often a constraint forced by budgetary limitations. Motivated by such real-world applications…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: shortened version accepted in AISTATS 2024 as oral

  13. arXiv:2402.12722  [pdf, other]

    cs.LG

    Structural Knowledge Informed Continual Multivariate Time Series Forecasting

    Authors: Zijie Pan, Yushan Jiang, Dongjin Song, Sahil Garg, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka

    Abstract: Recent studies in multivariate time series (MTS) forecasting reveal that explicitly modeling the hidden dependencies among different time series can yield promising forecasting performance and reliable explanations. However, modeling variable dependencies remains underexplored when MTS is continuously accumulated under different regimes (stages). Due to the potential distribution and dependency di…

    Submitted 20 February, 2024; originally announced February 2024.

  14. arXiv:2402.03182  [pdf, other]

    cs.LG

    Empowering Time Series Analysis with Large Language Models: A Survey

    Authors: Yushan Jiang, Zijie Pan, Xikun Zhang, Sahil Garg, Anderson Schneider, Yuriy Nevmyvaka, Dongjin Song

    Abstract: Recently, remarkable progress has been made over large language models (LLMs), demonstrating their unprecedented capability in varieties of natural language tasks. However, completely training a large general-purpose model from the scratch is challenging for time series analysis, due to the large volumes and varieties of time series data, as well as the non-stationarity that leads to concept drift…

    Submitted 5 February, 2024; originally announced February 2024.

  15. arXiv:2312.16370  [pdf, other]

    cs.DS

    Nearly Tight Bounds For Differentially Private Min $s$-$t$ and Multiway Cut

    Authors: Mina Dalirrooyfard, Slobodan Mitrović, Yuriy Nevmyvaka

    Abstract: Finding min $s$-$t$ cuts in graphs is a basic algorithmic tool with applications in image segmentation, community detection, reinforcement learning, and data clustering. In this problem, we are given two nodes as terminals, and the goal is to remove the smallest number of edges from the graph so that these two terminals are disconnected. We study the complexity of differential privacy for the min…

    Submitted 26 December, 2023; originally announced December 2023.

  16. arXiv:2310.08278  [pdf, other]

    cs.LG cs.AI

    Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

    Authors: Kashif Rasul, Arjun Ashok, Andrew Robert Williams, Hena Ghonia, Rishika Bhagwatkar, Arian Khorasani, Mohammad Javad Darvishi Bayazi, George Adamopoulos, Roland Riachi, Nadhir Hassen, Marin Biloš, Sahil Garg, Anderson Schneider, Nicolas Chapados, Alexandre Drouin, Valentina Zantedeschi, Yuriy Nevmyvaka, Irina Rish

    Abstract: Over the past years, foundation models have caused a paradigm shift in machine learning due to their unprecedented capabilities for zero-shot and few-shot generalization. However, despite the success of foundation models in modalities such as natural language processing and computer vision, the development of foundation models for time series forecasting has lagged behind. We present Lag-Llama, a…

    Submitted 8 February, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: First two authors contributed equally. All data, models and code used are open-source. GitHub: https://github.com/time-series-foundation-models/lag-llama

  17. arXiv:2309.14240  [pdf, other]

    cs.LG

    Learning to Abstain From Uninformative Data

    Authors: Yikai Zhang, Songzhu Zheng, Mina Dalirrooyfard, Pengxiang Wu, Anderson Schneider, Anant Raj, Yuriy Nevmyvaka, Chao Chen

    Abstract: Learning and decision-making in domains with naturally high noise-to-signal ratio, such as Finance or Healthcare, is often challenging, while the stakes are very high. In this paper, we study the problem of learning and acting under a general noisy generative process. In this problem, the data distribution has a significant proportion of uninformative samples with high noise in the label, while pa…

    Submitted 25 September, 2023; originally announced September 2023.

  18. arXiv:2305.18412  [pdf, other]

    stat.AP cs.LG

    Short-term Temporal Dependency Detection under Heterogeneous Event Dynamic with Hawkes Processes

    Authors: Yu Chen, Fengpei Li, Anderson Schneider, Yuriy Nevmyvaka, Asohan Amarasingham, Henry Lam

    Abstract: Many event sequence data exhibit mutually exciting or inhibiting patterns. Reliable detection of such temporal dependency is crucial for scientific investigation. The de facto model is the Multivariate Hawkes Process (MHP), whose impact function naturally encodes a causal structure in Granger causality. However, the vast majority of existing methods use direct or nonlinear transform of standard MH…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Conference on Uncertainty in Artificial Intelligence 2023

  19. arXiv:2305.07247  [pdf, other]

    cs.LG

    Provably Convergent Schrödinger Bridge with Applications to Probabilistic Time Series Imputation

    Authors: Yu Chen, Wei Deng, Shikai Fang, Fengpei Li, Nicole Tianjiao Yang, Yikai Zhang, Kashif Rasul, Shandian Zhe, Anderson Schneider, Yuriy Nevmyvaka

    Abstract: The Schrödinger bridge problem (SBP) is gaining increasing attention in generative modeling and showing promising potential even in comparison with the score-based generative models (SGMs). SBP can be interpreted as an entropy-regularized optimal transport problem, which conducts projections onto every other marginal alternatingly. However, in practice, only approximated projections are accessible…

    Submitted 10 September, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted by ICML 2023

  20. arXiv:2211.02590  [pdf, other]

    cs.LG

    Modeling Temporal Data as Continuous Functions with Stochastic Process Diffusion

    Authors: Marin Biloš, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka, Stephan Günnemann

    Abstract: Temporal data such as time series can be viewed as discretized measurements of the underlying function. To build a generative model for such data we have to model the stochastic process that governs it. We propose a solution by defining the denoising diffusion model in the function space which also allows us to naturally handle irregularly-sampled observations. The forward process gradually adds n…

    Submitted 19 May, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: International Conference on Machine Learning (ICML), 2023

  21. arXiv:2205.13423  [pdf, other]

    q-fin.TR

    Do price trajectory data increase the efficiency of market impact estimation?

    Authors: Fengpei Li, Vitalii Ihnatiuk, Ryan Kinnear, Anderson Schneider, Yuriy Nevmyvaka

    Abstract: Market impact is an important problem faced by large institutional investor and active market participant. In this paper, we rigorously investigate whether price trajectory data from the metaorder increases the efficiency of estimation, from an asymptotic view of statistical estimation. We show that, for popular market impact models, estimation methods based on partial price trajectory data, espec…

    Submitted 30 March, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

  22. arXiv:1205.2646  [pdf]

    cs.LG cs.GT

    Censored Exploration and the Dark Pool Problem

    Authors: Kuzman Ganchev, Michael Kearns, Yuriy Nevmyvaka, Jennifer Wortman Vaughan

    Abstract: We introduce and analyze a natural algorithm for multi-venue exploration from censored data, which is motivated by the Dark Pool Problem of modern quantitative finance. We prove that our algorithm converges in polynomial time to a near-optimal allocation policy; prior results for similar problems in stochastic inventory control guaranteed only asymptotic convergence and examined variants in which…

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-185-194

  23. arXiv:1007.2593  [pdf, ps, other]

    q-fin.TR q-fin.CP

    Empirical Limitations on High Frequency Trading Profitability

    Authors: Michael Kearns, Alex Kulesza, Yuriy Nevmyvaka

    Abstract: Addressing the ongoing examination of high-frequency trading practices in financial markets, we report the results of an extensive empirical study estimating the maximum possible profitability of the most aggressive such practices, and arrive at figures that are surprisingly modest. By "aggressive" we mean any trading strategy exclusively employing market orders and relatively short holding period…

    Submitted 14 September, 2010; v1 submitted 15 July, 2010; originally announced July 2010.