Showing 1–7 of 7 results for author: Perakis, G

  1. arXiv:2505.16037  [pdf, other]

    cs.AI cs.CL cs.LG stat.ML

    Causal LLM Routing: End-to-End Regret Minimization from Observational Data

    Authors: Asterios Tsiourvas, Wei Sun, Georgia Perakis

    Abstract: LLM routing aims to select the most appropriate model for each query, balancing competing performance metrics such as accuracy and cost across a pool of language models. Prior approaches typically adopt a decoupled strategy, where the metrics are first predicted and the model is then selected based on these estimates. This setup is prone to compounding errors and often relies on full-feedback data…

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2505.11360  [pdf, ps, other]

    cs.LG

    Efficient End-to-End Learning for Decision-Making: A Meta-Optimization Approach

    Authors: Rares Cristian, Pavithra Harsha, Georgia Perakis, Brian Quanz

    Abstract: End-to-end learning has become a widely applicable and studied problem in training predictive ML models to be aware of their impact on downstream decision-making tasks. These end-to-end models often outperform traditional methods that separate training from the optimization and only myopically focus on prediction error. However, the computational complexity of end-to-end frameworks poses a signifi…

    Submitted 16 May, 2025; originally announced May 2025.

  3. arXiv:2502.15983  [pdf, other]

    cs.LG stat.ML

    CoRe: Coherency Regularization for Hierarchical Time Series

    Authors: Rares Cristian, Pavithra Harsha, Georgia Perakis, Brian Quanz

    Abstract: Hierarchical time series forecasting presents unique challenges, particularly when dealing with noisy data that may not perfectly adhere to aggregation constraints. This paper introduces a novel approach to soft coherency in hierarchical time series forecasting using neural networks. We present a network coherency regularization method, which we denote as CoRe (Coherency Regularization), a techniq…

    Submitted 21 February, 2025; originally announced February 2025.

  4. arXiv:2408.03872  [pdf, other]

    cs.LG cs.AI

    Inter-Series Transformer: Attending to Products in Time Series Forecasting

    Authors: Rares Cristian, Pavithra Harsha, Clemente Ocejo, Georgia Perakis, Brian Quanz, Ioannis Spantidakis, Hamza Zerhouni

    Abstract: Time series forecasting is an important task in many fields ranging from supply chain management to weather forecasting. Recently, Transformer neural network architectures have shown promising results in forecasting on common time series benchmark datasets. However, application to supply chain demand forecasting, which can have challenging characteristics such as sparsity and cross-series effects,…

    Submitted 7 August, 2024; originally announced August 2024.

    ACM Class: I.2.6; G.3; I.5.1

  5. arXiv:2406.05633  [pdf, other]

    stat.ML cs.LG econ.EM

    Heterogeneous Treatment Effects in Panel Data

    Authors: Retsef Levi, Elisabeth Paulson, Georgia Perakis, Emily Zhang

    Abstract: We address a core problem in causal inference: estimating heterogeneous treatment effects using panel data with general treatment patterns. Many existing methods either do not utilize the potential underlying structure in panel data or have limitations in the allowable treatment patterns. In this work, we propose and evaluate a new method that first partitions observations into disjoint clusters w…

    Submitted 9 June, 2024; originally announced June 2024.

  6. arXiv:2302.14744  [pdf, other]

    math.OC cs.LG

    Tight Mixed-Integer Optimization Formulations for Prescriptive Trees

    Authors: Max Biggs, Georgia Perakis

    Abstract: We focus on modeling the relationship between an input feature vector and the predicted outcome of a trained decision tree using mixed-integer optimization. This can be used in many practical applications where a decision tree or tree ensemble is incorporated into an optimization problem to model the predicted outcomes of a decision. We propose tighter mixed-integer optimization formulations than…

    Submitted 19 May, 2025; v1 submitted 28 February, 2023; originally announced February 2023.

  7. arXiv:2205.14189  [pdf, other]

    math.OC cs.LG stat.ML

    Optimizing Objective Functions from Trained ReLU Neural Networks via Sampling

    Authors: Georgia Perakis, Asterios Tsiourvas

    Abstract: This paper introduces scalable, sampling-based algorithms that optimize trained neural networks with ReLU activations. We first propose an iterative algorithm that takes advantage of the piecewise linear structure of ReLU neural networks and reduces the initial mixed-integer optimization problem (MIP) into multiple easy-to-solve linear optimization problems (LPs) through sampling. Subsequently, we…

    Submitted 6 June, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Review 2: Fixed typo in Table 1 and page 7. Bold values in Tables 2 and 4