Showing 1–9 of 9 results for author: Sotelo, J

Searching in archive cs.
  1. arXiv:2506.00727  [pdf, ps, other]

    cs.LG cs.CV

    Adaptive Plane Reformatting for 4D Flow MRI using Deep Reinforcement Learning

    Authors: Javier Bisbal, Julio Sotelo, Maria I Valdés, Pablo Irarrazaval, Marcelo E Andia, Julio García, José Rodriguez-Palomarez, Francesca Raimondi, Cristián Tejos, Sergio Uribe

    Abstract: Deep reinforcement learning (DRL) algorithms have shown robust results in plane reformatting tasks. In these methods, an agent sequentially adjusts the position and orientation of an initial plane towards an objective location. This process allows accurate plane reformatting, without the need for detailed landmarks, which makes it suitable for images with limited contrast and resolution, such as 4…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: 11 pages, 4 figures, submitted to IEEE Transactions on Medical Imaging

    ACM Class: I.4.0

  2. arXiv:2412.14969  [pdf, other]

    cs.CV

    PhotoHolmes: a Python library for forgery detection in digital images

    Authors: Julián O'Flaherty, Rodrigo Paganini, Juan Pablo Sotelo, Julieta Umpiérrez, Marina Gardella, Matías Tailanian, Pablo Musé

    Abstract: In this paper, we introduce PhotoHolmes, an open-source Python library designed to easily run and benchmark forgery detection methods on digital images. The library includes implementations of popular and state-of-the-art methods, dataset integration tools, and evaluation metrics. Utilizing the Benchmark tool in PhotoHolmes, users can effortlessly compare various methods. This facilitates an accur…

    Submitted 19 December, 2024; originally announced December 2024.

  3. arXiv:1910.06711  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

    Authors: Kundan Kumar, Rithesh Kumar, Thibault de Boissiere, Lucas Gestin, Wei Zhen Teoh, Jose Sotelo, Alexandre de Brebisson, Yoshua Bengio, Aaron Courville

    Abstract: Previous works (Donahue et al., 2018a; Engel et al., 2019a) have found that generating coherent raw audio waveforms with GANs is challenging. In this paper, we show that it is possible to train GANs reliably to generate high quality coherent waveforms by introducing a set of architectural changes and simple training techniques. Subjective evaluation metric (Mean Opinion Score, or MOS) shows the ef…

    Submitted 8 December, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

  4. arXiv:1807.09232  [pdf]

    cs.CV cs.AI

    Deep Learning on Retina Images as Screening Tool for Diagnostic Decision Support

    Authors: Maria Camila Alvarez Trivino, Jeremie Despraz, Jesus Alfonso Lopez Sotelo, Carlos Andres Pena

    Abstract: In this project, we developed a deep learning system applied to human retina images for medical diagnostic decision support. The retina images were provided by EyePACS. These images were used in the framework of a Kaggle contest, whose purpose was to identify diabetic retinopathy signs through an automatic detection system. Using as inspiration one of the solutions proposed in the contest, we implemen…

    Submitted 24 July, 2018; originally announced July 2018.

  5. arXiv:1801.06700  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot (Short Version)

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeswar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including neural network and template-based…

    Submitted 20 January, 2018; originally announced January 2018.

    Comments: 9 pages, 1 figure, 2 tables; presented at NIPS 2017, Conversational AI: "Today's Practice and Tomorrow's Potential" Workshop

    ACM Class: I.5.1; I.2.7

  6. arXiv:1801.01442  [pdf, other]

    cs.CV

    ObamaNet: Photo-realistic lip-sync from text

    Authors: Rithesh Kumar, Jose Sotelo, Kundan Kumar, Alexandre de Brebisson, Yoshua Bengio

    Abstract: We present ObamaNet, the first architecture that generates both audio and synchronized photo-realistic lip-sync videos from any new text. Contrary to other published lip-sync approaches, ours is only composed of fully trainable neural modules and does not rely on any traditional computer graphics methods. More precisely, we use three main modules: a text-to-speech network based on Char2Wav, a time…

    Submitted 6 December, 2017; originally announced January 2018.

  7. arXiv:1709.02349  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeshwar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including template-based models, bag-of-wor…

    Submitted 5 November, 2017; v1 submitted 7 September, 2017; originally announced September 2017.

    Comments: 40 pages, 9 figures, 11 tables

    ACM Class: I.5.1; I.2.7

  8. arXiv:1703.00788  [pdf, other]

    cs.LG

    A Robust Adaptive Stochastic Gradient Method for Deep Learning

    Authors: Caglar Gulcehre, Jose Sotelo, Marcin Moczulski, Yoshua Bengio

    Abstract: Stochastic gradient algorithms are the main focus of large-scale optimization problems and have led to important successes in the recent advancement of deep learning algorithms. The convergence of SGD depends on the careful choice of learning rate and the amount of noise in stochastic estimates of the gradients. In this paper, we propose an adaptive learning rate algorithm, which utilizes stoch…

    Submitted 2 March, 2017; originally announced March 2017.

    Comments: IJCNN 2017 Accepted Paper, An extension of our paper, "ADASECANT: Robust Adaptive Secant Method for Stochastic Gradient"

  9. arXiv:1612.07837  [pdf, other]

    cs.SD cs.AI

    SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

    Authors: Soroush Mehri, Kundan Kumar, Ishaan Gulrajani, Rithesh Kumar, Shubham Jain, Jose Sotelo, Aaron Courville, Yoshua Bengio

    Abstract: In this paper we propose a novel model for unconditional audio generation based on generating one audio sample at a time. We show that our model, which profits from combining memory-less modules, namely autoregressive multilayer perceptrons, and stateful recurrent neural networks in a hierarchical structure, is able to capture underlying sources of variations in the temporal sequences over very lon…

    Submitted 11 February, 2017; v1 submitted 22 December, 2016; originally announced December 2016.

    Comments: Published as a conference paper at ICLR 2017