Skip to main content

Showing 1–13 of 13 results for author: Baeriswyl, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:1903.12157  [pdf, other

    cs.CL

    Resilient Combination of Complementary CNN and RNN Features for Text Classification through Attention and Ensembling

    Authors: Athanasios Giannakopoulos, Maxime Coriou, Andreea Hossmann, Michael Baeriswyl, Claudiu Musat

    Abstract: State-of-the-art methods for text classification include several distinct steps of pre-processing, feature extraction and post-processing. In this work, we focus on end-to-end neural architectures and show that the best performance in text classification is obtained by combining information from different neural modules. Concretely, we combine convolution, recurrent and attention modules with ense… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: 5 pages, 1 figure, SDS 2019 - The 6th Swiss Conference on Data Science

  2. arXiv:1903.09878  [pdf, other

    cs.CL

    Expanding the Text Classification Toolbox with Cross-Lingual Embeddings

    Authors: Meryem M'hamdi, Robert West, Andreea Hossmann, Michael Baeriswyl, Claudiu Musat

    Abstract: Most work in text classification and Natural Language Processing (NLP) focuses on English or a handful of other languages that have text corpora of hundreds of millions of words. This is creating a new version of the digital divide: the artificial intelligence (AI) divide. Transfer-based approaches, such as Cross-Lingual Text Classification (CLTC) - the task of categorizing texts written in differ… ▽ More

    Submitted 26 March, 2019; v1 submitted 23 March, 2019; originally announced March 2019.

  3. arXiv:1902.09713  [pdf, other

    cs.CL cs.AI cs.LG

    Interpretable Structure-aware Document Encoders with Hierarchical Attention

    Authors: Khalil Mrini, Claudiu Musat, Michael Baeriswyl, Martin Jaggi

    Abstract: We propose a method to create document representations that reflect their internal structure. We modify Tree-LSTMs to hierarchically merge basic elements such as words and sentences into blocks of increasing complexity. Our Structure Tree-LSTM implements a hierarchical attention mechanism over individual components and combinations thereof. We thus emphasize the usefulness of Tree-LSTMs for texts… ▽ More

    Submitted 5 October, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

  4. arXiv:1811.00633  [pdf, other

    cs.CL

    Embedding Individual Table Columns for Resilient SQL Chatbots

    Authors: Bojan Petrovski, Ignacio Aguado, Andreea Hossmann, Michael Baeriswyl, Claudiu Musat

    Abstract: Most of the world's data is stored in relational databases. Accessing these requires specialized knowledge of the Structured Query Language (SQL), putting them out of the reach of many people. A recent research thread in Natural Language Processing (NLP) aims to alleviate this problem by automatically translating natural language questions into SQL queries. While the proposed solutions are a great… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: SCAI, 2018

  5. arXiv:1808.08432  [pdf, other

    cs.CL

    Churn Intent Detection in Multilingual Chatbot Conversations and Social Media

    Authors: Christian Abbet, Meryem M'hamdi, Athanasios Giannakopoulos, Robert West, Andreea Hossmann, Michael Baeriswyl, Claudiu Musat

    Abstract: We propose a new method to detect when users express the intent to leave a service, also known as churn. While previous work focuses solely on social media, we show that this intent can be detected in chatbot conversations. As companies increasingly rely on chatbots they need an overview of potentially churny users. To this end, we crowdsource and publish a dataset of churn intent expressions in c… ▽ More

    Submitted 25 August, 2018; originally announced August 2018.

    Comments: 10 pages

  6. arXiv:1802.01457  [pdf, other

    cs.CL

    Diverse Beam Search for Increased Novelty in Abstractive Summarization

    Authors: André Cibils, Claudiu Musat, Andreea Hossman, Michael Baeriswyl

    Abstract: Text summarization condenses a text to a shorter version while retaining the important informations. Abstractive summarization is a recent development that generates new phrases, rather than simply copying or rephrasing sentences within the original text. Recently neural sequence-to-sequence models have achieved good results in the field of abstractive summarization, which opens new possibilities… ▽ More

    Submitted 5 February, 2018; originally announced February 2018.

  7. arXiv:1802.00757  [pdf, other

    cs.CL

    Submodularity-Inspired Data Selection for Goal-Oriented Chatbot Training Based on Sentence Embeddings

    Authors: Mladen Dimovski, Claudiu Musat, Vladimir Ilievski, Andreea Hossmann, Michael Baeriswyl

    Abstract: Spoken language understanding (SLU) systems, such as goal-oriented chatbots or personal assistants, rely on an initial natural language understanding (NLU) module to determine the intent and to extract the relevant information from the user queries they take as input. SLU systems usually help users to solve problems in relatively narrow domains and require a large amount of in-domain training data… ▽ More

    Submitted 8 July, 2018; v1 submitted 2 February, 2018; originally announced February 2018.

  8. arXiv:1802.00500  [pdf, other

    cs.CL

    Goal-Oriented Chatbot Dialog Management Bootstrapping with Transfer Learning

    Authors: Vladimir Ilievski, Claudiu Musat, Andreea Hossmann, Michael Baeriswyl

    Abstract: Goal-Oriented (GO) Dialogue Systems, colloquially known as goal oriented chatbots, help users achieve a predefined goal (e.g. book a movie ticket) within a closed domain. A first step is to understand the user's goal by using natural language understanding techniques. Once the goal is known, the bot must manage a dialogue to achieve that goal, which is conducted with respect to a learnt policy. Th… ▽ More

    Submitted 24 July, 2018; v1 submitted 1 February, 2018; originally announced February 2018.

    Comments: 7 pages (6 pages plus 1 page of references), 5 figures, 1 pseudocode figure

  9. arXiv:1801.05159  [pdf, other

    cs.LG cs.AI

    GitGraph - Architecture Search Space Creation through Frequent Computational Subgraph Mining

    Authors: Kamil Bennani-Smires, Claudiu Musat, Andreea Hossmann, Michael Baeriswyl

    Abstract: The dramatic success of deep neural networks across multiple application areas often relies on experts painstakingly designing a network architecture specific to each task. To simplify this process and make it more accessible, an emerging research effort seeks to automate the design of neural network architectures, using e.g. evolutionary algorithms or reinforcement learning or simple search in a… ▽ More

    Submitted 16 January, 2018; originally announced January 2018.

  10. arXiv:1801.04470  [pdf, other

    cs.CL

    Simple Unsupervised Keyphrase Extraction using Sentence Embeddings

    Authors: Kamil Bennani-Smires, Claudiu Musat, Andreea Hossmann, Michael Baeriswyl, Martin Jaggi

    Abstract: Keyphrase extraction is the task of automatically selecting a small set of phrases that best describe a given free text document. Supervised keyphrase extraction requires large amounts of labeled training data and generalizes very poorly outside the domain of the training data. At the same time, unsupervised systems have poor accuracy, and often do not generalize well, as they require the input do… ▽ More

    Submitted 5 September, 2018; v1 submitted 13 January, 2018; originally announced January 2018.

  11. arXiv:1710.11035  [pdf, other

    cs.CL

    Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German

    Authors: Pierre-Edouard Honnet, Andrei Popescu-Belis, Claudiu Musat, Michael Baeriswyl

    Abstract: The goal of this work is to design a machine translation (MT) system for a low-resource family of dialects, collectively known as Swiss German, which are widely spoken in Switzerland but seldom written. We collected a significant number of parallel written resources to start with, up to a total of about 60k words. Moreover, we identified several other promising data sources for Swiss German. Then,… ▽ More

    Submitted 6 February, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: 11th Language Resources and Evaluation Conference (LREC), 7-12 May 2018, Miyazaki (Japan)

  12. arXiv:1709.09220  [pdf, other

    cs.CL

    Dataset Construction via Attention for Aspect Term Extraction with Distant Supervision

    Authors: Athanasios Giannakopoulos, Diego Antognini, Claudiu Musat, Andreea Hossmann, Michael Baeriswyl

    Abstract: Aspect Term Extraction (ATE) detects opinionated aspect terms in sentences or text spans, with the end goal of performing aspect-based sentiment analysis. The small amount of available datasets for supervised ATE and the fact that they cover only a few domains raise the need for exploiting other data sources in new and creative ways. Publicly available review corpora contain a plethora of opiniona… ▽ More

    Submitted 26 September, 2017; originally announced September 2017.

  13. arXiv:1709.05094  [pdf, other

    cs.CL

    Unsupervised Aspect Term Extraction with B-LSTM & CRF using Automatically Labelled Datasets

    Authors: Athanasios Giannakopoulos, Claudiu Musat, Andreea Hossmann, Michael Baeriswyl

    Abstract: Aspect Term Extraction (ATE) identifies opinionated aspect terms in texts and is one of the tasks in the SemEval Aspect Based Sentiment Analysis (ABSA) contest. The small amount of available datasets for supervised ATE and the costly human annotation for aspect term labelling give rise to the need for unsupervised ATE. In this paper, we introduce an architecture that achieves top-ranking performan… ▽ More

    Submitted 15 September, 2017; originally announced September 2017.

    Comments: 9 pages, 3 figures, 2 tables 8th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis (WASSA), EMNLP 2017