Skip to main content

Showing 1–13 of 13 results for author: Elhoseiny, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2201.01942  [pdf, other

    cs.LG stat.ML

    Efficiently Disentangle Causal Representations

    Authors: Yuanpeng Li, Joel Hestness, Mohamed Elhoseiny, Liang Zhao, Kenneth Church

    Abstract: This paper proposes an efficient approach to learning disentangled representations with causal mechanisms based on the difference of conditional probabilities in original and new distributions. We approximate the difference with models' generalization abilities so that it fits in the standard machine learning framework and can be efficiently computed. In contrast to the state-of-the-art approach,… ▽ More

    Submitted 1 January, 2024; v1 submitted 6 January, 2022; originally announced January 2022.

    Comments: 17 pages, 7 figures

    Report number: Causal-01

  2. arXiv:2010.01916  [pdf, other

    cs.LG cs.AI stat.ML

    Temporal Positive-unlabeled Learning for Biomedical Hypothesis Generation via Risk Estimation

    Authors: Uchenna Akujuobi, Jun Chen, Mohamed Elhoseiny, Michael Spranger, Xiangliang Zhang

    Abstract: Understanding the relationships between biomedical terms like viruses, drugs, and symptoms is essential in the fight against diseases. Many attempts have been made to introduce the use of machine learning to the scientific process of hypothesis generation(HG), which refers to the discovery of meaningful implicit connections between biomedical terms. However, most existing methods fail to truly cap… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted for Neurips 2020

  3. arXiv:2006.11328  [pdf, other

    cs.LG cs.CV stat.ML

    Class Normalization for (Continual)? Generalized Zero-Shot Learning

    Authors: Ivan Skorokhodov, Mohamed Elhoseiny

    Abstract: Normalization techniques have proved to be a crucial ingredient of successful training in a traditional supervised learning regime. However, in the zero-shot learning (ZSL) world, these ideas have received only marginal attention. This work studies normalization in ZSL scenario from both theoretical and practical perspectives. First, we give a theoretical explanation to two popular tricks used in… ▽ More

    Submitted 14 April, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 22 pages, 7 figures, 7 tables

  4. arXiv:2006.08305  [pdf, other

    cs.LG stat.ML

    Inner Ensemble Networks: Average Ensemble as an Effective Regularizer

    Authors: Abduallah Mohamed, Muhammed Mohaimin Sadiq, Ehab AlBadawy, Mohamed Elhoseiny, Christian Claudel

    Abstract: We introduce Inner Ensemble Networks (IENs) which reduce the variance within the neural network itself without an increase in the model complexity. IENs utilize ensemble parameters during the training phase to reduce the network variance. While in the testing phase, these parameters are removed without a change in the enhanced performance. IENs reduce the variance of an ordinary deep model by a fa… ▽ More

    Submitted 9 October, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

  5. arXiv:2004.00436  [pdf, other

    cs.CV cs.LG stat.ML

    Exploring Long Tail Visual Relationship Recognition with Large Vocabulary

    Authors: Sherif Abdelkarim, Aniket Agarwal, Panos Achlioptas, Jun Chen, Jiaji Huang, Boyang Li, Kenneth Church, Mohamed Elhoseiny

    Abstract: Several approaches have been proposed in recent literature to alleviate the long-tail problem, mainly in object classification tasks. In this paper, we make the first large-scale study concerning the task of Long-Tail Visual Relationship Recognition (LTVRR). LTVRR aims at improving the learning of structured visual relationships that come from the long-tail (e.g., "rabbit grazing on grass"). In th… ▽ More

    Submitted 25 September, 2021; v1 submitted 25 March, 2020; originally announced April 2020.

    ACM Class: I.2.10; I.5.0; I.4.0

  6. arXiv:1906.02425  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Uncertainty-guided Continual Learning with Bayesian Neural Networks

    Authors: Sayna Ebrahimi, Mohamed Elhoseiny, Trevor Darrell, Marcus Rohrbach

    Abstract: Continual learning aims to learn new tasks without forgetting previously learned ones. This is especially challenging when one cannot access data from previous tasks and when the model has a fixed capacity. Current regularization-based continual learning algorithms need an external representation and extra computation to measure the parameters' \textit{importance}. In contrast, we propose Uncertai… ▽ More

    Submitted 19 February, 2020; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: Accepted at ICLR 2020

  7. arXiv:1903.02164  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Few-Shot Learning with Prototypical Random Walks

    Authors: Ahmed Ayyad, Yuchen Li, Nassir Navab, Shadi Albarqouni, Mohamed Elhoseiny

    Abstract: Recent progress has shown that few-shot learning can be improved with access to unlabelled data, known as semi-supervised few-shot learning(SS-FSL). We introduce an SS-FSL approach, dubbed as Prototypical Random Walk Networks(PRWN), built on top of Prototypical Networks (PN). We develop a random walk semi-supervised loss that enables the network to learn representations that are compact and well-s… ▽ More

    Submitted 9 February, 2021; v1 submitted 5 March, 2019; originally announced March 2019.

    Comments: Accepted by AAAI 2021 Workshop (Oral)

  8. arXiv:1902.10486  [pdf, other

    cs.LG stat.ML

    On Tiny Episodic Memories in Continual Learning

    Authors: Arslan Chaudhry, Marcus Rohrbach, Mohamed Elhoseiny, Thalaiyasingam Ajanthan, Puneet K. Dokania, Philip H. S. Torr, Marc'Aurelio Ranzato

    Abstract: In continual learning (CL), an agent learns from a stream of tasks leveraging prior experience to transfer knowledge to future tasks. It is an ideal framework to decrease the amount of supervision in the existing learning algorithms. But for a successful knowledge transfer, the learner needs to remember how to perform previous tasks. One way to endow the learner the ability to perform tasks seen i… ▽ More

    Submitted 4 June, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: Making the main point of the paper more clear

  9. arXiv:1812.00420  [pdf, other

    cs.LG stat.ML

    Efficient Lifelong Learning with A-GEM

    Authors: Arslan Chaudhry, Marc'Aurelio Ranzato, Marcus Rohrbach, Mohamed Elhoseiny

    Abstract: In lifelong learning, the learner is presented with a sequence of tasks, incrementally building a data-driven prior which may be leveraged to speed up learning of a new task. In this work, we investigate the efficiency of current lifelong approaches, in terms of sample complexity, computational and memory cost. Towards this end, we first introduce a new and a more realistic evaluation protocol, wh… ▽ More

    Submitted 9 January, 2019; v1 submitted 2 December, 2018; originally announced December 2018.

    Comments: Published as a conference paper at ICLR 2019

  10. arXiv:1812.00068  [pdf, other

    cs.LG stat.ML

    GDPP: Learning Diverse Generations Using Determinantal Point Process

    Authors: Mohamed Elfeki, Camille Couprie, Morgane Riviere, Mohamed Elhoseiny

    Abstract: Generative models have proven to be an outstanding tool for representing high-dimensional probability distributions and generating realistic-looking images. An essential characteristic of generative models is their ability to produce multi-modal outputs. However, while training, they are often susceptible to mode collapse, that is models are limited in mapping input noise to only a few modes of th… ▽ More

    Submitted 24 November, 2019; v1 submitted 30 November, 2018; originally announced December 2018.

    Journal ref: International Conference on Machine Learning 2019

  11. arXiv:1804.00921  [pdf, other

    cs.LG stat.ML

    DeSIGN: Design Inspiration from Generative Networks

    Authors: Othman Sbai, Mohamed Elhoseiny, Antoine Bordes, Yann LeCun, Camille Couprie

    Abstract: Can an algorithm create original and compelling fashion designs to serve as an inspirational assistant? To help answer this question, we design and investigate different image generation models associated with different loss functions to boost creativity in fashion generation. The dimensions of our explorations include: (i) different Generative Adversarial Networks architectures that start from no… ▽ More

    Submitted 14 September, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

  12. arXiv:1711.09601  [pdf, other

    cs.CV cs.AI stat.ML

    Memory Aware Synapses: Learning what (not) to forget

    Authors: Rahaf Aljundi, Francesca Babiloni, Mohamed Elhoseiny, Marcus Rohrbach, Tinne Tuytelaars

    Abstract: Humans can learn in a continuous manner. Old rarely utilized knowledge can be overwritten by new incoming information while important, frequently used knowledge is prevented from being erased. In artificial learning systems, lifelong learning so far has focused mainly on accumulating knowledge over tasks and overcoming catastrophic forgetting. In this paper, we argue that, given the limited model… ▽ More

    Submitted 5 October, 2018; v1 submitted 27 November, 2017; originally announced November 2017.

    Comments: ECCV 2018

  13. arXiv:1409.7480  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Generalized Twin Gaussian Processes using Sharma-Mittal Divergence

    Authors: Mohamed Elhoseiny, Ahmed Elgammal

    Abstract: There has been a growing interest in mutual information measures due to their wide range of applications in Machine Learning and Computer Vision. In this paper, we present a generalized structured regression framework based on Shama-Mittal divergence, a relative entropy measure, which is introduced to the Machine Learning community in this work. Sharma-Mittal (SM) divergence is a generalized mutua… ▽ More

    Submitted 1 June, 2015; v1 submitted 26 September, 2014; originally announced September 2014.

    Comments: This work got accepted for Publication in the Machine Learning Journal 2015. The work is scheduled for presentation at ECML-PKDD 2015 journal track papers