Skip to main content

Showing 1–9 of 9 results for author: Rasul, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.19731  [pdf, ps, other

    stat.ML cs.LG

    Accelerating Nash Learning from Human Feedback via Mirror Prox

    Authors: Daniil Tiapkin, Daniele Calandriello, Denis Belomestny, Eric Moulines, Alexey Naumov, Kashif Rasul, Michal Valko, Pierre Menard

    Abstract: Traditional Reinforcement Learning from Human Feedback (RLHF) often relies on reward models, frequently assuming preference structures like the Bradley-Terry model, which may not accurately capture the complexities of real human preferences (e.g., intransitivity). Nash Learning from Human Feedback (NLHF) offers a more direct alternative by framing the problem as finding a Nash equilibrium of a gam… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2409.11684  [pdf, other

    cs.LG stat.ML

    Recurrent Interpolants for Probabilistic Time Series Prediction

    Authors: Yu Chen, Marin Biloš, Sarthak Mittal, Wei Deng, Kashif Rasul, Anderson Schneider

    Abstract: Sequential models like recurrent neural networks and transformers have become standard for probabilistic multivariate time series forecasting across various domains. Despite their strengths, they struggle with capturing high-dimensional distributions and cross-feature dependencies. Recent work explores generative approaches using diffusion or flow-based models, extending to time series imputation… ▽ More

    Submitted 4 October, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

  3. arXiv:2405.07836  [pdf, other

    cs.LG stat.ME

    Forecasting with Hyper-Trees

    Authors: Alexander März, Kashif Rasul

    Abstract: We introduce the concept of Hyper-Trees and offer a new direction in applying tree-based models to time series data. Unlike conventional applications of decision trees that forecast time series directly, Hyper-Trees are designed to learn the parameters of time series models. Our framework combines the effectiveness of gradient boosted trees on tabular data with the advantages of established time s… ▽ More

    Submitted 14 October, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: Forecasting, Gradient Boosting Machines, Hyper-Networks, Parameter Non-Stationarity, Time Series

  4. arXiv:2206.14342  [pdf, other

    cs.LG stat.ML

    Intrinsic Anomaly Detection for Multi-Variate Time Series

    Authors: Stephan Rabanser, Tim Januschowski, Kashif Rasul, Oliver Borchert, Richard Kurle, Jan Gasthaus, Michael Bohlke-Schneider, Nicolas Papernot, Valentin Flunkert

    Abstract: We introduce a novel, practically relevant variation of the anomaly detection problem in multi-variate time series: intrinsic anomaly detection. It appears in diverse practical scenarios ranging from DevOps to IoT, where we want to recognize failures of a system that operates under the influence of a surrounding environment. Intrinsic anomalies are changes in the functional dependency structure be… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

  5. arXiv:2002.06103  [pdf, other

    cs.LG stat.ML

    Multivariate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows

    Authors: Kashif Rasul, Abdul-Saboor Sheikh, Ingmar Schuster, Urs Bergmann, Roland Vollgraf

    Abstract: Time series forecasting is often fundamental to scientific and engineering problems and enables decision making. With ever increasing data set sizes, a trivial solution to scale up predictions is to assume independence between interacting time series. However, modeling statistical dependencies can improve accuracy and enable analysis of interaction effects. Deep learning methods are well suited fo… ▽ More

    Submitted 14 January, 2021; v1 submitted 14 February, 2020; originally announced February 2020.

  6. arXiv:1909.02775  [pdf, other

    cs.LG stat.ML

    Set Flow: A Permutation Invariant Normalizing Flow

    Authors: Kashif Rasul, Ingmar Schuster, Roland Vollgraf, Urs Bergmann

    Abstract: We present a generative model that is defined on finite sets of exchangeable, potentially high dimensional, data. As the architecture is an extension of RealNVPs, it inherits all its favorable properties, such as being invertible and allowing for exact log-likelihood evaluation. We show that this architecture is able to learn finite non-i.i.d. set data distributions, learn statistical dependencies… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

  7. arXiv:1902.03657  [pdf, other

    cs.LG stat.ML

    A Bandit Framework for Optimal Selection of Reinforcement Learning Agents

    Authors: Andreas Merentitis, Kashif Rasul, Roland Vollgraf, Abdul-Saboor Sheikh, Urs Bergmann

    Abstract: Deep Reinforcement Learning has been shown to be very successful in complex games, e.g. Atari or Go. These games have clearly defined rules, and hence allow simulation. In many practical applications, however, interactions with the environment are costly and a good simulator of the environment is not available. Further, as environments differ by application, the optimal inductive bias (architectur… ▽ More

    Submitted 10 February, 2019; originally announced February 2019.

    Comments: Published at the 32nd Conference on Neural Information Processing Systems (NIPS 2018), Montreal, Canada. Deep Reinforcement Learning Workshop

  8. arXiv:1712.01141  [pdf, other

    stat.ML cs.LG

    Stochastic Maximum Likelihood Optimization via Hypernetworks

    Authors: Abdul-Saboor Sheikh, Kashif Rasul, Andreas Merentitis, Urs Bergmann

    Abstract: This work explores maximum likelihood optimization of neural networks through hypernetworks. A hypernetwork initializes the weights of another network, which in turn can be employed for typical functional tasks such as regression and classification. We optimize hypernetworks to directly maximize the conditional likelihood of target variables given input. Using this approach we obtain competitive e… ▽ More

    Submitted 12 January, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: To appear at NIPS 2017 Workshop on Bayesian Deep Learning

  9. arXiv:1708.07747  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

    Authors: Han Xiao, Kashif Rasul, Roland Vollgraf

    Abstract: We present Fashion-MNIST, a new dataset comprising of 28x28 grayscale images of 70,000 fashion products from 10 categories, with 7,000 images per category. The training set has 60,000 images and the test set has 10,000 images. Fashion-MNIST is intended to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms, as it shares the same image s… ▽ More

    Submitted 15 September, 2017; v1 submitted 25 August, 2017; originally announced August 2017.

    Comments: Dataset is freely available at https://github.com/zalandoresearch/fashion-mnist Benchmark is available at http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/