Skip to main content

Showing 1–3 of 3 results for author: Bennani-Smires, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:1902.08232  [pdf, other

    cs.LG stat.ML

    Overcoming Multi-Model Forgetting

    Authors: Yassine Benyahia, Kaicheng Yu, Kamil Bennani-Smires, Martin Jaggi, Anthony Davison, Mathieu Salzmann, Claudiu Musat

    Abstract: We identify a phenomenon, which we refer to as multi-model forgetting, that occurs when sequentially training multiple deep networks with partially-shared parameters; the performance of previously-trained models degrades as one optimizes a subsequent one, due to the overwriting of shared parameters. To overcome this, we introduce a statistically-justified weight plasticity loss that regularizes th… ▽ More

    Submitted 2 March, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

  2. arXiv:1801.05159  [pdf, other

    cs.LG cs.AI

    GitGraph - Architecture Search Space Creation through Frequent Computational Subgraph Mining

    Authors: Kamil Bennani-Smires, Claudiu Musat, Andreea Hossmann, Michael Baeriswyl

    Abstract: The dramatic success of deep neural networks across multiple application areas often relies on experts painstakingly designing a network architecture specific to each task. To simplify this process and make it more accessible, an emerging research effort seeks to automate the design of neural network architectures, using e.g. evolutionary algorithms or reinforcement learning or simple search in a… ▽ More

    Submitted 16 January, 2018; originally announced January 2018.

  3. arXiv:1801.04470  [pdf, other

    cs.CL

    Simple Unsupervised Keyphrase Extraction using Sentence Embeddings

    Authors: Kamil Bennani-Smires, Claudiu Musat, Andreea Hossmann, Michael Baeriswyl, Martin Jaggi

    Abstract: Keyphrase extraction is the task of automatically selecting a small set of phrases that best describe a given free text document. Supervised keyphrase extraction requires large amounts of labeled training data and generalizes very poorly outside the domain of the training data. At the same time, unsupervised systems have poor accuracy, and often do not generalize well, as they require the input do… ▽ More

    Submitted 5 September, 2018; v1 submitted 13 January, 2018; originally announced January 2018.