
Showing 1–10 of 10 results for author: Schmid, P

Searching in archive cs.
  1. arXiv:2503.19786  [pdf, other]

    cs.CL cs.AI

    Gemma 3 Technical Report

    Authors: Gemma Team, Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin, et al. (191 additional authors not shown)

    Abstract: We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achie…

    Submitted 25 March, 2025; originally announced March 2025.

  2. arXiv:2502.18966  [pdf, other]

    cs.LG

    One Set to Rule Them All: How to Obtain General Chemical Conditions via Bayesian Optimization over Curried Functions

    Authors: Stefan P. Schmid, Ella Miray Rajaonson, Cher Tian Ser, Mohammad Haddadnia, Shi Xuan Leong, Alán Aspuru-Guzik, Agustinus Kristiadi, Kjell Jorner, Felix Strieth-Kalthoff

    Abstract: General parameters are highly desirable in the natural sciences - e.g., chemical reaction conditions that enable high yields across a range of related transformations. This has a significant practical impact since those general parameters can be transferred to related tasks without the need for laborious and time-intensive re-optimization. While Bayesian optimization (BO) is widely applied to find…

    Submitted 26 February, 2025; originally announced February 2025.

    ACM Class: J.2

  3. arXiv:2311.10550  [pdf, other]

    physics.flu-dyn cs.LG

    RONAALP: Reduced-Order Nonlinear Approximation with Active Learning Procedure

    Authors: Clément Scherding, Georgios Rigas, Denis Sipp, Peter J Schmid, Taraneh Sayadi

    Abstract: Many engineering applications rely on the evaluation of expensive, non-linear high-dimensional functions. In this paper, we propose the RONAALP algorithm (Reduced Order Nonlinear Approximation with Active Learning Procedure) to incrementally learn a fast and accurate reduced-order surrogate model of a target function on-the-fly as the application progresses. First, the combination of nonlinear aut…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 38 pages, 16 figures

  4. arXiv:2310.10745  [pdf, other]

    cs.LG math.DS physics.flu-dyn stat.ML

    Mori-Zwanzig latent space Koopman closure for nonlinear autoencoder

    Authors: Priyam Gupta, Peter J. Schmid, Denis Sipp, Taraneh Sayadi, Georgios Rigas

    Abstract: The Koopman operator presents an attractive approach to achieve global linearization of nonlinear systems, making it a valuable method for simplifying the understanding of complex dynamics. While data-driven methodologies have exhibited promise in approximating finite Koopman operators, they grapple with various challenges, such as the judicious selection of observables, dimensionality reduction,…

    Submitted 7 May, 2025; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 23 pages, 11 figures

  5. arXiv:2210.04269  [pdf, ps, other]

    physics.flu-dyn cs.LG

    Data-driven framework for input/output lookup tables reduction: Application to hypersonic flows in chemical non-equilibrium

    Authors: Clément Scherding, Georgios Rigas, Denis Sipp, Peter J. Schmid, Taraneh Sayadi

    Abstract: In this paper, we present a novel model-agnostic machine learning technique to extract a reduced thermochemical model for reacting hypersonic flows simulation. A first simulation gathers all relevant thermodynamic states and the corresponding gas properties via a given model. The states are embedded in a low-dimensional space and clustered to identify regions with different levels of thermochemica…

    Submitted 17 February, 2023; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: 28 pages, 19 figures, 3 tables

    Journal ref: Physical Review Fluids, 8(2023), 023201

  6. arXiv:2111.02893  [pdf, ps, other]

    physics.flu-dyn cs.LG math.DS

    Symmetry-Aware Autoencoders: s-PCA and s-nlPCA

    Authors: Simon Kneer, Taraneh Sayadi, Denis Sipp, Peter Schmid, Georgios Rigas

    Abstract: Nonlinear principal component analysis (NLPCA) via autoencoders has attracted attention in the dynamical systems community due to its larger compression rate when compared to linear principal component analysis (PCA). These model reduction methods experience an increase in the dimensionality of the latent space when applied to datasets that exhibit invariant samples due to the presence of symmetri…

    Submitted 14 November, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: 12 pages, 8 Figures, 2 Tables

    MSC Class: 37E99 ACM Class: I.2.10

  7. arXiv:2109.02846  [pdf, other]

    cs.CL

    Datasets: A Community Library for Natural Language Processing

    Authors: Quentin Lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, Suraj Patil, Julien Chaumond, Mariama Drame, Julien Plu, Lewis Tunstall, Joe Davison, Mario Šaško, Gunjan Chhablani, Bhavitvya Malik, Simon Brandeis, Teven Le Scao, Victor Sanh, Canwen Xu, Nicolas Patry, Angelina McMillan-Major, Philipp Schmid, Sylvain Gugger, Clément Delangue, Théo Matussière, Lysandre Debut, et al. (7 additional authors not shown)

    Abstract: The scale, variety, and quantity of publicly-available NLP datasets has grown rapidly as researchers propose new tasks, larger models, and novel benchmarks. Datasets is a community library for contemporary NLP designed to support this ecosystem. Datasets aims to standardize end-user interfaces, versioning, and documentation, while providing a lightweight front-end that behaves similarly for small…

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: EMNLP Demo 2021

  8. arXiv:2105.02922  [pdf, other]

    cs.CV

    SkyCam: A Dataset of Sky Images and their Irradiance values

    Authors: Evangelos Ntavelis, Jan Remund, Philipp Schmid

    Abstract: Recent advances in Computer Vision and Deep Learning have enabled astonishing results in a variety of fields and applications. Motivated by this success, the SkyCam Dataset aims to enable image-based Deep Learning solutions for short-term, precise prediction of solar radiation on a local level. For the span of a year, three different cameras in three topographically different locations in Switzerl…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: https://github.com/vglsd/SkyCam

  9. arXiv:2010.09854  [pdf, other]

    cs.DC

    High-Performance Distributed RMA Locks

    Authors: Patrick Schmid, Maciej Besta, Torsten Hoefler

    Abstract: We propose a topology-aware distributed Reader-Writer lock that accelerates irregular workloads for supercomputers and data centers. The core idea behind the lock is a modular design that is an interplay of three distributed data structures: a counter of readers/writers in the critical section, a set of queues for ordering writers waiting for the lock, and a tree that binds all the queues and sync…

    Submitted 23 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Best Paper Award at ACM HPDC'16 (1/129)

    Journal ref: Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing (ACM HPDC'16), 2016

  10. arXiv:1902.03154  [pdf, other]

    cs.DC

    SimFS: A Simulation Data Virtualizing File System Interface

    Authors: Salvatore Di Girolamo, Pirmin Schmid, Thomas Schulthess, Torsten Hoefler

    Abstract: Nowadays simulations can produce petabytes of data to be stored in parallel filesystems or large-scale databases. This data is accessed over the course of decades often by thousands of analysts and scientists. However, storing these volumes of data for long periods of time is not cost effective and, in some cases, practically impossible. We propose to transparently virtualize the simulation data,…

    Submitted 24 January, 2019; originally announced February 2019.