Showing 1–10 of 10 results for author: Schamoni, S

  1. arXiv:2311.03037 [pdf, other]

    cs.LG q-bio.QM stat.AP stat.ML

    Validity problems in clinical machine learning by indirect data labeling using consensus definitions

    Authors: Michael Hagmann, Shigehiko Schamoni, Stefan Riezler

    Abstract: We demonstrate a validity problem of machine learning in the vital application area of disease diagnosis in medicine. It arises when target labels in training data are determined by an indirect measurement, and the fundamental measurements needed to determine this indirect measurement are included in the input data representation. Machine learning models trained on this data will learn nothing els…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 11 pages

  2. Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation

    Authors: Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler

    Abstract: Data augmentation is a technique to generate new training data based on existing data. We evaluate the simple and cost-effective method of concatenating the original data examples to build new training instances. Continued training with such augmented data is able to improve off-the-shelf Transformer and Conformer models that were optimized on the original data only. We demonstrate considerable im…

    Submitted 14 April, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: Accepted at ICASSP 2023

  3. arXiv:2209.00439 [pdf, ps, other]

    cs.LG

    Ensembling Neural Networks for Improved Prediction and Privacy in Early Diagnosis of Sepsis

    Authors: Shigehiko Schamoni, Michael Hagmann, Stefan Riezler

    Abstract: Ensembling neural networks is a long-standing technique for improving the generalization error of neural networks by combining networks with orthogonal properties via a committee decision. We show that this technique is an ideal fit for machine learning on medical data: First, ensembles are amenable to parallel and asynchronous learning, thus enabling efficient training of patient-specific compone…

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: Accepted at MLHC 2022

    Journal ref: Proceedings of the 7th Machine Learning for Healthcare Conference, PMLR 182:123-145, 2022

  4. Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation

    Authors: Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler

    Abstract: End-to-end speech translation relies on data that pair source-language speech inputs with corresponding translations into a target language. Such data are notoriously scarce, making synthetic data augmentation by back-translation or knowledge distillation a necessary ingredient of end-to-end training. In this paper, we present a novel approach to data augmentation that leverages audio alignments,…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022

  5. On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR

    Authors: Tsz Kin Lam, Mayumi Ohta, Shigehiko Schamoni, Stefan Riezler

    Abstract: We propose an on-the-fly data augmentation method for automatic speech recognition (ASR) that uses alignment information to generate effective training samples. Our method, called Aligned Data Augmentation (ADA) for ASR, replaces transcribed tokens and the speech representations in an aligned manner to generate previously unseen training pairs. The speech representations are sampled from an audio…

    Submitted 9 June, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    Comments: Accepted at INTERSPEECH 2021

  6. Embedding Meta-Textual Information for Improved Learning to Rank

    Authors: Toshitaka Kuwa, Shigehiko Schamoni, Stefan Riezler

    Abstract: Neural approaches to learning term embeddings have led to improved computation of similarity and ranking in information retrieval (IR). So far neural representation learning has not been extended to meta-textual information that is readily available for many IR tasks, for example, patent classes in prior-art retrieval, topical information in Wikipedia articles, or product categories in e-commerce…

    Submitted 30 October, 2020; originally announced October 2020.

    Comments: Accepted as a long paper at COLING 2020, Barcelona, Spain

  7. Cascaded Models With Cyclic Feedback For Direct Speech Translation

    Authors: Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler

    Abstract: Direct speech translation describes a scenario where only speech inputs and corresponding translations are available. Such data are notoriously limited. We present a technique that allows cascades of automatic speech recognition (ASR) and machine translation (MT) to exploit in-domain direct speech translation data in addition to out-of-domain MT and ASR data. After pre-training MT and ASR, we use…

    Submitted 11 February, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Accepted at ICASSP 2021

  8. arXiv:1909.09557 [pdf, other]

    q-bio.QM cs.LG stat.ML

    Leveraging Implicit Expert Knowledge for Non-Circular Machine Learning in Sepsis Prediction

    Authors: Shigehiko Schamoni, Holger A. Lindner, Verena Schneider-Lindner, Manfred Thiel, Stefan Riezler

    Abstract: Sepsis is the leading cause of death in non-coronary intensive care units. Moreover, a delay of antibiotic treatment of patients with severe sepsis by only few hours is associated with increased mortality. This insight makes accurate models for early prediction of sepsis a key task in machine learning for healthcare. Previous approaches have achieved high AUROC by learning from electronic health r…

    Submitted 20 September, 2019; originally announced September 2019.

    Comments: Accepted for publication in Journal of Artificial Intelligence in Medicine

    Journal ref: Artificial Intelligence in Medicine, Volume 100, September 2019, Pages 101725

  9. arXiv:1907.02326 [pdf, other]

    cs.CL

    Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation

    Authors: Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler

    Abstract: We propose an interactive-predictive neural machine translation framework for easier model personalization using reinforcement and imitation learning. During the interactive translation process, the user is asked for feedback on uncertain locations identified by the system. Responses are weak feedback in the form of "keep" and "delete" edits, and expert demonstrations in the form of "substitute" e…

    Submitted 5 July, 2019; v1 submitted 4 July, 2019; originally announced July 2019.

    Comments: Machine Translation Summit 2019 (MTSUMMIT XVII), Dublin, Ireland

  10. Multimodal Pivots for Image Caption Translation

    Authors: Julian Hitschler, Shigehiko Schamoni, Stefan Riezler

    Abstract: We present an approach to improve statistical machine translation of image descriptions by multimodal pivots defined in visual space. The key idea is to perform image retrieval over a database of images that are captioned in the target language, and use the captions of the most similar images for crosslingual reranking of translation outputs. Our approach does not depend on the availability of lar…

    Submitted 13 June, 2016; v1 submitted 15 January, 2016; originally announced January 2016.

    Comments: Final version, accepted at ACL 2016. New section on Human Evaluation