Skip to main content

Showing 1–5 of 5 results for author: Odermatt, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.02577  [pdf, other

    cs.CL

    A comparison of translation performance between DeepL and Supertext

    Authors: Alex Flückiger, Chantal Amrhein, Tim Graf, Frédéric Odermatt, Martin Pömsl, Philippe Schläpfer, Florian Schottmann, Samuel Läubli

    Abstract: As strong machine translation (MT) systems are increasingly based on large language models (LLMs), reliable quality benchmarking requires methods that capture their ability to leverage extended context. This study compares two commercial MT systems -- DeepL and Supertext -- by assessing their performance on unsegmented texts. We evaluate translation quality across four language directions with pro… ▽ More

    Submitted 20 May, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: Paper accepted at MT Summit 2025

  2. A Scalable and Transferable Time Series Prediction Framework for Demand Forecasting

    Authors: Young-Jin Park, Donghyun Kim, Frédéric Odermatt, Juho Lee, Kyung-Min Kim

    Abstract: Time series forecasting is one of the most essential and ubiquitous tasks in many business problems, including demand forecasting and logistics optimization. Traditional time series forecasting methods, however, have resulted in small models with limited expressive power because they have difficulty in scaling their model size up while maintaining high accuracy. In this paper, we propose Forecasti… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Published as a full paper at ICDM 2022

  3. Gradient descent-based programming of analog in-memory computing cores

    Authors: Julian Büchel, Athanasios Vasilopoulos, Benedikt Kersting, Frederic Odermatt, Kevin Brew, Injo Ok, Sam Choi, Iqbal Saraf, Victor Chan, Timothy Philip, Nicole Saulnier, Vijay Narayanan, Manuel Le Gallo, Abu Sebastian

    Abstract: The precise programming of crossbar arrays of unit-cells is crucial for obtaining high matrix-vector-multiplication (MVM) accuracy in analog in-memory computing (AIMC) cores. We propose a radically different approach based on directly minimizing the MVM error using gradient descent with synthetic random input data. Our method significantly reduces the MVM error compared with conventional unit-cell… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Journal ref: 2022 International Electron Devices Meeting (IEDM), San Francisco, CA, USA, 2022, pp. 33.1.1-33.1.4

  4. arXiv:2305.14538  [pdf, other

    cs.CL cs.AI

    Cascaded Beam Search: Plug-and-Play Terminology-Forcing For Neural Machine Translation

    Authors: Frédéric Odermatt, Béni Egressy, Roger Wattenhofer

    Abstract: This paper presents a plug-and-play approach for translation with terminology constraints. Terminology constraints are an important aspect of many modern translation pipelines. In both specialized domains and newly emerging domains (such as the COVID-19 pandemic), accurate translation of technical terms is crucial. Recent approaches often train models to copy terminologies from the input into the… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 14 pages, 7 figures

  5. arXiv:2302.08469  [pdf, ps, other

    cs.LG cs.ET

    Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators

    Authors: Malte J. Rasch, Charles Mackin, Manuel Le Gallo, An Chen, Andrea Fasoli, Frederic Odermatt, Ning Li, S. R. Nandakumar, Pritish Narayanan, Hsinyu Tsai, Geoffrey W. Burr, Abu Sebastian, Vijay Narayanan

    Abstract: Analog in-memory computing (AIMC) -- a promising approach for energy-efficient acceleration of deep learning workloads -- computes matrix-vector multiplications (MVMs) but only approximately, due to nonidealities that often are non-deterministic or nonlinear. This can adversely impact the achievable deep neural network (DNN) inference accuracy as compared to a conventional floating point (FP) impl… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 35 pages, 7 figures, 5 tables