Skip to main content

Showing 1–12 of 12 results for author: Rabanser, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.04203  [pdf, ps, other

    cs.DC

    Cascadia: A Cascade Serving System for Large Language Models

    Authors: Youhe Jiang, Fangcheng Fu, Wanru Zhao, Stephan Rabanser, Nicholas D. Lane, Binhang Yuan

    Abstract: Recent advances in large language models (LLMs) have intensified the need to deliver both rapid responses and high-quality answers. More powerful models yield better results but incur higher inference latency, whereas smaller models are faster yet less capable. Recent work proposes balancing this latency-quality trade-off using model cascades, which route simpler queries to smaller models and more… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  2. arXiv:2505.23968  [pdf, other

    cs.CR cs.AI cs.CY cs.LG stat.ML

    Confidential Guardian: Cryptographically Prohibiting the Abuse of Model Abstention

    Authors: Stephan Rabanser, Ali Shahin Shamsabadi, Olive Franzese, Xiao Wang, Adrian Weller, Nicolas Papernot

    Abstract: Cautious predictions -- where a machine learning model abstains when uncertain -- are crucial for limiting harmful errors in safety-critical applications. In this work, we identify a novel threat: a dishonest institution can exploit these mechanisms to discriminate or unjustly deny services under the guise of uncertainty. We demonstrate the practicality of this threat by introducing an uncertainty… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Proceedings of the 42nd International Conference on Machine Learning

  3. arXiv:2505.22356  [pdf, ps, other

    cs.LG cs.AI cs.CY stat.ML

    Suitability Filter: A Statistical Framework for Classifier Evaluation in Real-World Deployment Settings

    Authors: Angéline Pouget, Mohammad Yaghini, Stephan Rabanser, Nicolas Papernot

    Abstract: Deploying machine learning models in safety-critical domains poses a key challenge: ensuring reliable model performance on downstream user data without access to ground truth labels for direct validation. We propose the suitability filter, a novel framework designed to detect performance deterioration by utilizing suitability signals -- model output features that are sensitive to covariate shifts… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML 2025

  4. arXiv:2502.19335  [pdf, ps, other

    cs.LG

    Gatekeeper: Improving Model Cascades Through Confidence Tuning

    Authors: Stephan Rabanser, Nathalie Rauschmayr, Achin Kulshrestha, Petra Poklukar, Wittawat Jitkrittum, Sean Augenstein, Congchao Wang, Federico Tombari

    Abstract: Large-scale machine learning models deliver strong performance across a wide range of tasks but come with significant computational and resource constraints. To mitigate these challenges, local smaller models are often deployed alongside larger models, relying on routing and deferral mechanisms to offload complex tasks. However, existing approaches inadequately balance the capabilities of these mo… ▽ More

    Submitted 16 June, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: Presented at the TTODLer-FM workshop at the International Conference on Machine Learning (ICML) 2025

  5. arXiv:2310.16678  [pdf, other

    cs.LG cs.CR

    Robust and Actively Secure Serverless Collaborative Learning

    Authors: Olive Franzese, Adam Dziedzic, Christopher A. Choquette-Choo, Mark R. Thomas, Muhammad Ahmad Kaleem, Stephan Rabanser, Congyu Fang, Somesh Jha, Nicolas Papernot, Xiao Wang

    Abstract: Collaborative machine learning (ML) is widely used to enable institutions to learn better models from distributed data. While collaborative approaches to learning intuitively protect user data, they remain vulnerable to either the server, the clients, or both, deviating from the protocol. Indeed, because the protocol is asymmetric, a malicious server can abuse its power to reconstruct client data… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023

  6. arXiv:2305.18393  [pdf, other

    cs.LG cs.CR

    Training Private Models That Know What They Don't Know

    Authors: Stephan Rabanser, Anvith Thudi, Abhradeep Thakurta, Krishnamurthy Dvijotham, Nicolas Papernot

    Abstract: Training reliable deep learning models which avoid making overconfident but incorrect predictions is a longstanding challenge. This challenge is further exacerbated when learning has to be differentially private: protection provided to sensitive data comes at the price of injecting additional randomness into the learning process. In this work, we conduct a thorough empirical investigation of selec… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  7. arXiv:2207.12545  [pdf, other

    cs.LG stat.ML

    $p$-DkNN: Out-of-Distribution Detection Through Statistical Testing of Deep Representations

    Authors: Adam Dziedzic, Stephan Rabanser, Mohammad Yaghini, Armin Ale, Murat A. Erdogdu, Nicolas Papernot

    Abstract: The lack of well-calibrated confidence estimates makes neural networks inadequate in safety-critical domains such as autonomous driving or healthcare. In these settings, having the ability to abstain from making a prediction on out-of-distribution (OOD) data can be as important as correctly classifying in-distribution data. We introduce $p$-DkNN, a novel inference procedure that takes a trained de… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  8. arXiv:2206.14342  [pdf, other

    cs.LG stat.ML

    Intrinsic Anomaly Detection for Multi-Variate Time Series

    Authors: Stephan Rabanser, Tim Januschowski, Kashif Rasul, Oliver Borchert, Richard Kurle, Jan Gasthaus, Michael Bohlke-Schneider, Nicolas Papernot, Valentin Flunkert

    Abstract: We introduce a novel, practically relevant variation of the anomaly detection problem in multi-variate time series: intrinsic anomaly detection. It appears in diverse practical scenarios ranging from DevOps to IoT, where we want to recognize failures of a system that operates under the influence of a surrounding environment. Intrinsic anomalies are changes in the functional dependency structure be… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

  9. arXiv:2205.13532  [pdf, ps, other

    cs.LG stat.ML

    Selective Prediction via Training Dynamics

    Authors: Stephan Rabanser, Anvith Thudi, Kimia Hamidieh, Adam Dziedzic, Israfil Bahceci, Akram Bin Sediq, Hamza Sokun, Nicolas Papernot

    Abstract: Selective Prediction is the task of rejecting inputs a model would predict incorrectly on. This involves a trade-off between input space coverage (how many data points are accepted) and model utility (how good is the performance on accepted data points). Current methods for selective prediction typically impose constraints on either the model architecture or the optimization objective; this inhibi… ▽ More

    Submitted 6 July, 2025; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  10. arXiv:2005.10111  [pdf, other

    cs.LG stat.ML

    The Effectiveness of Discretization in Forecasting: An Empirical Study on Neural Time Series Models

    Authors: Stephan Rabanser, Tim Januschowski, Valentin Flunkert, David Salinas, Jan Gasthaus

    Abstract: Time series modeling techniques based on deep learning have seen many advancements in recent years, especially in data-abundant settings and with the central aim of learning global models that can extract patterns across multiple time series. While the crucial importance of appropriate data pre-processing and scaling has often been noted in prior work, most studies focus on improving model archite… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

  11. arXiv:1810.11953  [pdf, other

    stat.ML cs.LG

    Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift

    Authors: Stephan Rabanser, Stephan Günnemann, Zachary C. Lipton

    Abstract: We might hope that when faced with unexpected inputs, well-designed software systems would fire off warnings. Machine learning (ML) systems, however, which depend strongly on properties of their inputs (e.g. the i.i.d. assumption), tend to fail silently. This paper explores the problem of building ML systems that fail loudly, investigating methods for detecting dataset shift, identifying exemplars… ▽ More

    Submitted 28 October, 2019; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: Advances in Neural Information Processing Systems (NeurIPS) 2019

  12. arXiv:1711.10781  [pdf, other

    stat.ML cs.LG

    Introduction to Tensor Decompositions and their Applications in Machine Learning

    Authors: Stephan Rabanser, Oleksandr Shchur, Stephan Günnemann

    Abstract: Tensors are multidimensional arrays of numerical values and therefore generalize matrices to multiple dimensions. While tensors first emerged in the psychometrics community in the $20^{\text{th}}$ century, they have since then spread to numerous other disciplines, including machine learning. Tensors and their decompositions are especially beneficial in unsupervised learning settings, but are gaini… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

    Comments: 13 pages, 12 figures