Showing 1–7 of 7 results for author: Kahou, S E

  1. arXiv:2210.13583  [pdf, other]

    cs.LG cs.AI stat.ME

    Learning Latent Structural Causal Models

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Nan Rosemary Ke, Tristan Deleu, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Causal learning has long concerned itself with the accurate recovery of underlying causal mechanisms. Such causal modelling enables better explanations of out-of-distribution data. Prior works on causal learning assume that the high-level causal variables are given. However, in machine learning tasks, one often operates on low-level data like image pixels or high-dimensional vectors. In such setti…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 21 pages, 19 figures

  2. arXiv:2207.05723  [pdf, other]

    cs.LG cs.AI stat.ML

    Latent Variable Models for Bayesian Causal Discovery

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Learning predictors that do not rely on spurious correlations involves building causal representations. However, learning such a representation is very challenging. We, therefore, formulate the problem of learning a causal representation from high dimensional data and study causal recovery with synthetic data. This work introduces a latent variable decoder model, Decoder BCD, for Bayesian causal d…

    Submitted 10 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 7 figures, Published at the ICML 2022 Workshop on Spurious Correlations, Invariance, and Stability

  3. arXiv:2103.03098  [pdf, other]

    cs.LG stat.ML

    Accounting for Variance in Machine Learning Benchmarks

    Authors: Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Naz Sepah, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Dmitriy Serdyuk, Tal Arbel, Chris Pal, Gaël Varoquaux, Pascal Vincent

    Abstract: Strong empirical evidence that one machine-learning algorithm A outperforms another one B ideally calls for multiple trials optimizing the learning pipeline over sources of variation such as data sampling, data augmentation, parameter initialization, and hyperparameter choices. This is prohibitively expensive, and corners are cut to reach conclusions. We model the whole benchmarking process, reve…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Submitted to MLSys2021

  4. arXiv:2002.06460  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    HighRes-net: Recursive Fusion for Multi-Frame Super-Resolution of Satellite Imagery

    Authors: Michel Deudon, Alfredo Kalaitzis, Israel Goytom, Md Rifat Arefin, Zhichao Lin, Kris Sankaran, Vincent Michalski, Samira E. Kahou, Julien Cornebise, Yoshua Bengio

    Abstract: Generative deep learning has sparked a new wave of Super-Resolution (SR) algorithms that enhance single images with impressive aesthetic results, albeit with imaginary details. Multi-frame Super-Resolution (MFSR) offers a more grounded approach to the ill-posed problem, by conditioning on multiple low-resolution views. This is important for satellite monitoring of human impact on the planet -- fro…

    Submitted 15 February, 2020; originally announced February 2020.

    Comments: 15 pages, 5 figures

  5. arXiv:1902.06704  [pdf, other]

    cs.NE cs.LG stat.ML

    Towards Non-saturating Recurrent Units for Modelling Long-term Dependencies

    Authors: Sarath Chandar, Chinnadhurai Sankar, Eugene Vorontsov, Samira Ebrahimi Kahou, Yoshua Bengio

    Abstract: Modelling long-term dependencies is a challenge for recurrent neural networks. This is primarily due to the fact that gradients vanish during training, as the sequence length increases. Gradients can be attenuated by transition operators and are attenuated or dropped by activation functions. Canonical architectures like LSTM alleviate this issue by skipping information through a memory mechanism.…

    Submitted 22 January, 2019; originally announced February 2019.

    Comments: In Proceedings of AAAI 2019

  6. arXiv:1612.02095  [pdf, other]

    cs.CV stat.ML

    ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events

    Authors: Evan Racah, Christopher Beckham, Tegan Maharaj, Samira Ebrahimi Kahou, Prabhat, Christopher Pal

    Abstract: The detection and identification of extreme weather events in large-scale climate simulations is an important problem for risk management, informing governmental policy decisions and advancing our basic understanding of the climate system. Recent work has shown that fully supervised convolutional neural networks (CNNs) can yield acceptable accuracy for classifying well-known types of extreme weat…

    Submitted 25 November, 2017; v1 submitted 6 December, 2016; originally announced December 2016.

  7. arXiv:1603.05691  [pdf, other]

    stat.ML cs.LG

    Do Deep Convolutional Nets Really Need to be Deep and Convolutional?

    Authors: Gregor Urban, Krzysztof J. Geras, Samira Ebrahimi Kahou, Ozlem Aslan, Shengjie Wang, Rich Caruana, Abdelrahman Mohamed, Matthai Philipose, Matt Richardson

    Abstract: Yes, they do. This paper provides the first empirical demonstration that deep convolutional models really need to be both deep and convolutional, even when trained with methods such as distillation that allow small or shallow models of high accuracy to be trained. Although previous research showed that shallow feed-forward nets sometimes can learn the complex functions previously learned by deep n…

    Submitted 3 March, 2017; v1 submitted 17 March, 2016; originally announced March 2016.