Skip to main content

Showing 1–9 of 9 results for author: Oreshkin, B N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.17451  [pdf, other

    cs.LG stat.ML

    Any-Quantile Probabilistic Forecasting of Short-Term Electricity Demand

    Authors: Slawek Smyl, Boris N. Oreshkin, Paweł Pełka, Grzegorz Dudek

    Abstract: Power systems operate under uncertainty originating from multiple factors that are impossible to account for deterministically. Distributional forecasting is used to control and mitigate risks associated with this uncertainty. Recent progress in deep learning has helped to significantly improve the accuracy of point forecasts, while accurate distributional forecasting still presents a significant… ▽ More

    Submitted 4 October, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

  2. arXiv:2109.09705  [pdf, other

    cs.LG cs.DC cs.NE stat.ML

    Neural forecasting at scale

    Authors: Philippe Chatigny, Shengrui Wang, Jean-Marc Patenaude, Boris N. Oreshkin

    Abstract: We study the problem of efficiently scaling ensemble-based deep neural networks for multi-step time series (TS) forecasting on a large set of time series. Current state-of-the-art deep ensemble models have high memory and computational requirements, hampering their use to forecast millions of TS in practical scenarios. We propose N-BEATS(P), a global parallel variant of the N-BEATS model designed… ▽ More

    Submitted 28 January, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

  3. arXiv:2007.15531  [pdf, other

    cs.LG stat.ML

    FC-GAGA: Fully Connected Gated Graph Architecture for Spatio-Temporal Traffic Forecasting

    Authors: Boris N. Oreshkin, Arezou Amini, Lucy Coyle, Mark J. Coates

    Abstract: Forecasting of multivariate time-series is an important problem that has applications in traffic management, cellular network configuration, and quantitative finance. A special case of the problem arises when there is a graph available that captures the relationships between the time-series. In this paper we propose a novel learning architecture that achieves performance competitive with or better… ▽ More

    Submitted 14 December, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

  4. arXiv:2002.02887  [pdf, other

    cs.LG stat.ML

    Meta-learning framework with applications to zero-shot time-series forecasting

    Authors: Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, Yoshua Bengio

    Abstract: Can meta-learning discover generic ways of processing time series (TS) from a diverse dataset so as to greatly improve generalization on new TS coming from different datasets? This work provides positive evidence to this using a broad meta-learning framework which we show subsumes many existing meta-learning algorithms. Our theoretical analysis suggests that residual connections act as a meta-lear… ▽ More

    Submitted 14 December, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  5. arXiv:1906.11892  [pdf, other

    cs.CV cs.LG stat.ML

    CLAREL: Classification via retrieval loss for zero-shot learning

    Authors: Boris N. Oreshkin, Negar Rostamzadeh, Pedro O. Pinheiro, Christopher Pal

    Abstract: We address the problem of learning fine-grained cross-modal representations. We propose an instance-based deep metric learning approach in joint visual and textual space. The key novelty of this paper is that it shows that using per-image semantic supervision leads to substantial improvement in zero-shot performance over using class-only supervision. On top of that, we provide a probabilistic just… ▽ More

    Submitted 5 April, 2020; v1 submitted 31 May, 2019; originally announced June 2019.

  6. arXiv:1905.10437  [pdf, other

    cs.LG stat.ML

    N-BEATS: Neural basis expansion analysis for interpretable time series forecasting

    Authors: Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, Yoshua Bengio

    Abstract: We focus on solving the univariate times series point forecasting problem using deep learning. We propose a deep neural architecture based on backward and forward residual links and a very deep stack of fully-connected layers. The architecture has a number of desirable properties, being interpretable, applicable without modification to a wide array of target domains, and fast to train. We test the… ▽ More

    Submitted 20 February, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

  7. arXiv:1902.07104  [pdf, other

    cs.LG stat.ML

    Adaptive Cross-Modal Few-Shot Learning

    Authors: Chen Xing, Negar Rostamzadeh, Boris N. Oreshkin, Pedro O. Pinheiro

    Abstract: Metric-based meta-learning techniques have successfully been applied to few-shot classification problems. In this paper, we propose to leverage cross-modal information to enhance metric-based few-shot learning methods. Visual and semantic feature spaces have different structures by definition. For certain concepts, visual features might be richer and more discriminative than text ones. While for o… ▽ More

    Submitted 17 February, 2020; v1 submitted 19 February, 2019; originally announced February 2019.

  8. arXiv:1805.10123  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    TADAM: Task dependent adaptive metric for improved few-shot learning

    Authors: Boris N. Oreshkin, Pau Rodriguez, Alexandre Lacoste

    Abstract: Few-shot learning has become essential for producing models that generalize from few examples. In this work, we identify that metric scaling and metric task conditioning are important to improve the performance of few-shot algorithms. Our analysis reveals that simple metric scaling completely changes the nature of few-shot algorithm parameter updates. Metric scaling provides improvements up to 14%… ▽ More

    Submitted 25 January, 2019; v1 submitted 23 May, 2018; originally announced May 2018.

    Journal ref: Advances in Neural Information Processing Systems 31, 2018

  9. Efficient delay-tolerant particle filtering

    Authors: Boris N. Oreshkin, Xuan Liu, Mark J. Coates

    Abstract: This paper proposes a novel framework for delay-tolerant particle filtering that is computationally efficient and has limited memory requirements. Within this framework the informativeness of a delayed (out-of-sequence) measurement (OOSM) is estimated using a lightweight procedure and uninformative measurements are immediately discarded. The framework requires the identification of a threshold tha… ▽ More

    Submitted 22 September, 2010; originally announced September 2010.