-
Micelle Forming Linear-Dendritic Block Copolymers: A Theoretical Comparison between Random Hyperbranched and Precise Dendrimer Polymer Architectures
Authors:
Marios Giannakou,
Oleg V. Borisov,
Friederike Schmid
Abstract:
Hyperbranched block copolymers offer a simpler and more efficient synthesis route compared to more traditional dendritic systems, while still providing exceptional control over surface functionality and self-assembly. This makes them ideal candidates for engineering nanoparticles with tailored properties for applications such as drug delivery and sensing. Here we use self-consistent field calculations to compare the micelle structures formed by copolymers with a polydisperse hyperbranched (LHBC), monodisperse dendritic (LDBC), and linear solvophilic blocks. Representative LHBC structures were generated by molecular dynamics simulations mimicking the slow-monomer addition protocol. We find that LHBC micelles are more stable, have a lower critical micelle concentration, and are better at accommodating larger drug payloads than LDBC micelles, and these properties further improve with increasing polydispersity. LHBC micelles also offer more terminal ends for functionalization than LDBC micelles for LDBCs with up to four branching generations, with the number of terminal ends being surprisingly independent of the LHBC polydispersity. Our findings highlight the superiority of LHBC micelles in flexibility and performance over LDBC micelles.
Submitted 2 June, 2025;
originally announced June 2025.
-
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
Authors:
Paul Primus,
Florian Schmid,
Gerhard Widmer
Abstract:
Learning to associate audio with textual descriptions is valuable for a range of tasks, including pretraining, zero-shot classification, audio retrieval, audio captioning, and text-conditioned audio generation. Existing contrastive language-audio pretrained models are typically trained using global, clip-level descriptions, which provide only weak temporal supervision. We hypothesize that CLAP-like language-audio models - particularly, if they are expected to produce frame-level embeddings - can benefit from a stronger temporal supervision. To confirm our hypothesis, we curate a novel dataset of approximately 12,000 audio recordings from Freesound, each annotated with single-sentence free-text descriptions linked to a specific temporal segment in an audio recording. We use large language models to clean these annotations by removing references to non-audible events, transcribed speech, typos, and annotator language bias. We further propose a frame-wise contrastive training strategy that learns to align text descriptions with temporal regions in an audio recording and demonstrate that our model has better temporal text-audio alignment abilities compared to models trained only on global captions when evaluated on the AudioSet Strong benchmark. The dataset and our source code are available on Zenodo and GitHub, respectively.
Submitted 12 May, 2025;
originally announced May 2025.
-
Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 Challenge
Authors:
Florian Schmid,
Paul Primus,
Toni Heittola,
Annamaria Mesaros,
Irene Martín-Morató,
Gerhard Widmer
Abstract:
This paper presents the Low-Complexity Acoustic Scene Classification with Device Information Task of the DCASE 2025 Challenge and its baseline system. Continuing the focus on low-complexity models, data efficiency, and device mismatch from previous editions (2022--2024), this year's task introduces a key change: recording device information is now provided at inference time. This enables the development of device-specific models that leverage device characteristics -- reflecting real-world deployment scenarios in which a model is designed with awareness of the underlying hardware. The training set matches the 25% subset used in the corresponding DCASE 2024 challenge, with no restrictions on external data use, highlighting transfer learning as a central topic. The baseline achieves 50.72% accuracy on this ten-class problem with a device-general model, improving to 51.89% when using the available device information.
Submitted 3 May, 2025;
originally announced May 2025.
-
Towards a fast and robust deep hedging approach
Authors:
Fabienne Schmid,
Daniel Oeltz
Abstract:
We present a robust Deep Hedging framework for the pricing and hedging of option portfolios that significantly improves training efficiency and model robustness. In particular, we propose a neural model for training model embeddings which utilizes the paths of several advanced equity option models with stochastic volatility in order to learn the relationships that exist between hedging strategies. A key advantage of the proposed method is its ability to rapidly and reliably adapt to new market regimes through the recalibration of a low-dimensional embedding vector, rather than retraining the entire network. Moreover, we examine the observed Profit and Loss distributions on the parameter space of the models used to learn the embeddings. The results show that the proposed framework works well with data generated by complex models and can serve as a construction basis for an efficient and robust simulation tool for the systematic development of an entirely model-independent hedging strategy.
Submitted 23 April, 2025;
originally announced April 2025.
-
From Heteropolymer Stiffness Distributions to Effective Homopolymers: A Conformational Analysis of Intrinsically Disordered Proteins
Authors:
Yannick Witzky,
Friederike Schmid,
Arash Nikoubashman
Abstract:
Intrinsically disordered proteins (IDPs) are characterized by a lack of defined secondary and tertiary structures, and are thus well-suited for descriptions within polymer theory. However, the intrinsic heterogeneity of proteins, stemming from their diverse amino acid building blocks, introduces local variations in chain stiffness, which can impact conformational behavior at larger scales. To investigate this effect, we developed a heterogeneous worm-like chain model in which the local persistence length follows a Gaussian distribution. We demonstrate that these heterogeneous chains can be effectively mapped to homogeneous chains with a single effective persistence length. To assess whether this mapping can be extended to naturally occurring IDPs, we performed simulations using various coarse-grained IDP models, finding that the simulated IDPs have shapes similar to those of the corresponding homogeneous and heterogeneous worm-like chains. However, the IDPs are systematically larger than ideal worm-like chains, yet slightly more compact when excluded volume interactions are considered. We attribute these differences to intramolecular interactions between non-bonded monomers, which our theoretical models do not account for.
Submitted 15 April, 2025;
originally announced April 2025.
-
Exploring Performance-Complexity Trade-Offs in Sound Event Detection
Authors:
Tobias Morocutti,
Florian Schmid,
Jonathan Greif,
Francesco Foscarin,
Gerhard Widmer
Abstract:
We target the problem of developing new low-complexity networks for the sound event detection task. Our goal is to meticulously analyze the performance-complexity trade-off, aiming to be competitive with the large state-of-the-art models, at a fraction of the computational requirements. We find that low-complexity convolutional models previously proposed for audio tagging can be effectively adapted for event detection (which requires frame-wise prediction) by adjusting convolutional strides, removing the global pooling, and, importantly, adding a sequence model before the (now frame-wise) classification heads. Systematic experiments reveal that the best choice for the sequence model type depends on which complexity metric is most important for the given application. We also investigate the impact of enhanced training strategies such as knowledge distillation. In the end, we show that combined with an optimized training strategy, we can reach event detection performance comparable to state-of-the-art transformers while requiring only around 5% of the parameters. We release all our pre-trained models and the code for reproducing this work to support future research in low-complexity sound event detection at https://github.com/theMoro/EfficientSED.
Submitted 14 March, 2025;
originally announced March 2025.
-
Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification
Authors:
Tobias Morocutti,
Florian Schmid,
Khaled Koutini,
Gerhard Widmer
Abstract:
Knowledge Distillation (KD) is a widespread technique for compressing the knowledge of large models into more compact and efficient models. KD has proved to be highly effective in building well-performing low-complexity Acoustic Scene Classification (ASC) systems and was used in all the top-ranked submissions to this task of the annual DCASE challenge in the past three years. There is extensive research available on establishing the KD process, designing efficient student models, and forming well-performing teacher ensembles. However, less research has been conducted on investigating which teacher model attributes are beneficial for low-complexity students. In this work, we try to close this gap by studying the effects on the student's performance when using different teacher network architectures, varying the teacher model size, training them with different device generalization methods, and applying different ensembling strategies. The results show that teacher model sizes, device generalization methods, the ensembling strategy and the ensemble size are key factors for a well-performing student network.
Submitted 14 March, 2025;
originally announced March 2025.
-
Sol-gel transition in heteroassociative RNA-protein solutions: A quantitative comparison of coarse-grained simulations and the Semenov-Rubinstein theory
Authors:
Xinxiang Chen,
Jude Ann Vishnu,
Pol Besenius,
Julian König,
Friederike Schmid
Abstract:
Protein RNA-binding domains selectively interact with specific RNA sites, a key interaction that determines the emergent cooperative behaviors in RNA-protein mixtures. Through molecular dynamics simulations, we investigate the impact of the specific binding interactions on the phase transitions of an exemplary RNA-protein system and compare it with predictions of the Semenov-Rubinstein theory of associative polymers. Our findings reveal a sol-gel (percolation) transition without phase separation, characterized by double reentrant behavior as the RNA or protein concentration increases. We highlight the crucial role of bridge formations in driving these transitions, particularly when binding sites are saturated. The theory quantitatively predicts the binding numbers at equilibrium in the semidilute regime, but it significantly overestimates the size of the concentration range where percolation is observed. This can partly be traced back to the fact that the mean-field assumption in the theory is not valid in the dilute regime, and that the theory neglects the existence of cycles in the connectivity graph of the percolating cluster at the sol-gel transition. Our study enriches the understanding of RNA-protein phase behaviors, providing valuable insights for the interpretation of experimental observations.
Submitted 21 February, 2025;
originally announced February 2025.
-
Structure and Dynamic Evolution of Interfaces between Polymer Solutions and Gels and Polymer Interdiffusion: A Molecular Dynamics Study
Authors:
Jude Ann Vishnu,
Torsten Gereon Linder,
Sebastian Seiffert,
Friederike Schmid
Abstract:
Letting free polymers diffuse from solution into a crosslinked polymer gel is often a crucial processing step in the synthesis of multiphase polymer-based gels, e.g., core-shell microgels. Here we use coarse-grained molecular dynamics simulations to obtain molecular insights into this process. We consider idealized situations where the gel is modeled as a regular polymer network with the topology of a diamond lattice, and all free polymers and strands have the same length and consist of the same type of monomer. After bringing the gel and the polymer solution into contact, two time regimes are observed: An initial compression of the gel caused by the osmotic pressure of the solution, followed by an expansion due to swelling. We characterize the time evolution of density profiles, the penetration of free polymers into the gel and the connection between the gel and solution phase. The interfacial structure locally equilibrates after roughly 100 chain relaxation times. At late times, the free chains inside the gel undergo a percolation transition if the polymer concentration in the gel exceeds a critical value, which is of the same order as the overlap concentration. The fluctuations of the interface can be described by a capillary wave model that accounts for the elasticity of the gel. Based on this, we extract the interfacial tension of the gel-solution interface. Interestingly, both the interfacial tension and the local interfacial width increase with increasing free polymer concentration - in contrast to liquid-liquid interfaces, where these two quantities are typically anticorrelated.
Submitted 15 December, 2024;
originally announced December 2024.
-
Nonlinear calcium King plot constrains new bosons and nuclear properties
Authors:
A. Wilzewski,
L. I. Huber,
M. Door,
J. Richter,
A. Mariotti,
L. J. Spieß,
M. Wehrheim,
S. Chen,
S. A. King,
P. Micke,
M. Filzinger,
M. R. Steinel,
N. Huntemann,
E. Benkler,
P. O. Schmidt,
J. Flannery,
R. Matt,
M. Stadler,
R. Oswald,
F. Schmid,
D. Kienzler,
J. Home,
D. P. L. Aude Craik,
S. Eliseev,
P. Filianin
, et al. (17 additional authors not shown)
Abstract:
Nonlinearities in King plots (KP) of isotope shifts (IS) can reveal the existence of beyond-Standard-Model (BSM) interactions that couple electrons and neutrons. However, it is crucial to distinguish higher-order Standard Model (SM) effects from BSM physics. We measure the IS of the transitions ${{}^{3}P_{0}~\rightarrow~{}^{3}P_{1}}$ in $\mathrm{Ca}^{14+}$ and ${{}^{2}S_{1/2} \rightarrow {}^{2}D_{5/2}}$ in $\mathrm{Ca}^{+}$ with sub-Hz precision as well as the nuclear mass ratios with relative uncertainties below $4\times10^{-11}$ for the five stable, even isotopes of calcium (${}^{40,42,44,46,48}\mathrm{Ca}$). Combined, these measurements yield a calcium KP nonlinearity with a significance of $\sim 900\sigma$. Precision calculations show that the nonlinearity cannot be fully accounted for by the expected largest higher-order SM effect, the second-order mass shift, and identify the little-studied nuclear polarization as the only remaining SM contribution that may be large enough to explain it. Despite the observed nonlinearity, we improve existing KP-based constraints on a hypothetical Yukawa interaction for most of the new boson masses between $10~\mathrm{eV/c^2}$ and $10^7~\mathrm{eV/c^2}$.
Submitted 13 December, 2024;
originally announced December 2024.
-
Relaxation Dynamics of Entangled Linear Polymer Melts via Molecular Dynamics Simulations
Authors:
Alireza F. Behbahani,
Friederike Schmid
Abstract:
We present an extensive analysis of the relaxation dynamics of entangled linear polymer melts via long-time molecular dynamics simulations of a generic bead-spring model. We study the mean-squared displacements, the autocorrelation function of the end-to-end vector, $P(t)$, the single-chain dynamic structure factor, $S(q,t)$, and the linear viscoelastic properties, especially the shear stress relaxation modulus, $G(t)$. The simulation data are compared with the theoretically expected scaling laws for different time regimes of entangled melts, and with analytical expressions that account for different relaxation mechanisms in the tube model, namely, reptation, contour length fluctuation (CLF), and constraint release (CR). CLF involves a $t^{1/4}$ scaling regime in the time-dependence of $(1-P(t))$. With increasing chain length, a gradual development of this scaling regime is observed. In the absence of CR, the tube model further predicts that at long times, the chain dynamics is governed by one central quantity, the ``surviving tube fraction'' $\mu(t)$. As a result, one expects $S(q,t) \propto G(t) \propto P(t)$ in that time regime. We test this prediction by comparing $S(q,t)$ and $G(t)$ with $P(t)$. For both quantities, proportionality with $P(t)$ is not observed, indicating that CR has an important effect on the relaxation of these two quantities. Instead, to a very good approximation, we find $G(t)\propto P(t)^{2}$ at late times, which is consistent with the dynamic tube dilation or double reptation approximations for the CR process. In addition, we calculate non-local mobility functions, which can be used in dynamic density functional theories for entangled inhomogeneous polymer blends, and discuss the effect of entanglements on the shape of these functions.
Submitted 26 November, 2024;
originally announced November 2024.
-
Effective Pre-Training of Audio Transformers for Sound Event Detection
Authors:
Florian Schmid,
Tobias Morocutti,
Francesco Foscarin,
Jan Schlüter,
Paul Primus,
Gerhard Widmer
Abstract:
We propose a pre-training pipeline for audio spectrogram transformers for frame-level sound event detection tasks. On top of common pre-training steps, we add a meticulously designed training routine on AudioSet frame-level annotations. This includes a balanced sampler, aggressive data augmentation, and ensemble knowledge distillation. For five transformers, we obtain a substantial performance improvement over previously available checkpoints both on AudioSet frame-level predictions and on frame-level sound event detection downstream tasks, confirming our pipeline's effectiveness. We publish the resulting checkpoints that researchers can directly fine-tune to build high-performance models for sound event detection tasks.
Submitted 28 November, 2024; v1 submitted 14 September, 2024;
originally announced September 2024.
-
Quantum control of a single $\mathrm{H}_2^+$ molecular ion
Authors:
David Holzapfel,
Fabian Schmid,
Nick Schwegler,
Oliver Stadler,
Martin Stadler,
Alexander Ferk,
Jonathan P. Home,
Daniel Kienzler
Abstract:
Science is founded on the benchmarking of theoretical models against experimental measurements, with the challenge that for all but the simplest systems, the calculations required for high precision become extremely challenging. $\mathrm{H}_2^+$ is the simplest stable molecule, and its internal structure is calculable to high precision from first principles. This allows tests of theoretical models and the determination of fundamental constants. However, studying $\mathrm{H}_2^+$ experimentally presents significant challenges. Standard control methods such as laser cooling, fluorescence detection and optical pumping are not applicable to $\mathrm{H}_2^+$ due to the very long lifetimes of its excited rotational and vibrational states. Here we solve this issue by using Quantum Logic Spectroscopy techniques to demonstrate full quantum control of a single $\mathrm{H}_2^+$ molecule by co-trapping it with an atomic 'helper' ion and performing quantum operations between the two ions. This enables us to perform pure quantum state preparation, coherent control and non-destructive readout, which we use to perform high-resolution microwave spectroscopy of $\mathrm{H}_2^+$. Our results pave the way for high precision spectroscopy of $\mathrm{H}_2^+$ in both the microwave and optical domains, while offering techniques which are transferable to other molecular ions.
Submitted 10 September, 2024;
originally announced September 2024.
-
Estimated Audio-Caption Correspondences Improve Language-Based Audio Retrieval
Authors:
Paul Primus,
Florian Schmid,
Gerhard Widmer
Abstract:
Dual-encoder-based audio retrieval systems are commonly optimized with contrastive learning on a set of matching and mismatching audio-caption pairs. This leads to a shared embedding space in which corresponding items from the two modalities end up close together. Since audio-caption datasets typically only contain matching pairs of recordings and descriptions, it has become common practice to create mismatching pairs by pairing the audio with a caption randomly drawn from the dataset. This is not ideal because the randomly sampled caption could, just by chance, partly or entirely describe the audio recording. However, correspondence information for all possible pairs is costly to annotate and thus typically unavailable; we, therefore, suggest substituting it with estimated correspondences. To this end, we propose a two-staged training procedure in which multiple retrieval models are first trained as usual, i.e., without estimated correspondences. In the second stage, the audio-caption correspondences predicted by these models then serve as prediction targets. We evaluate our method on the ClothoV2 and the AudioCaps benchmark and show that it improves retrieval performance, even in a restricting self-distillation setting where a single model generates and then learns from the estimated correspondences. We further show that our method outperforms the current state of the art by 1.6 pp. mAP@10 on the ClothoV2 benchmark.
Submitted 21 August, 2024;
originally announced August 2024.
-
Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining
Authors:
Jonathan Greif,
Florian Schmid,
Paul Primus,
Gerhard Widmer
Abstract:
Query-by-Vocal Imitation (QBV) is about searching audio files within databases using vocal imitations created by the user's voice. Since most humans can effectively communicate sound concepts through voice, QBV offers a more intuitive and convenient approach than text-based search. To fully leverage QBV, developing robust audio feature representations for both the vocal imitation and the original sound is crucial. In this paper, we present a new system for QBV that utilizes the feature extraction capabilities of Convolutional Neural Networks pre-trained with large-scale general-purpose audio datasets. We integrate these pre-trained models into a dual encoder architecture and fine-tune them end-to-end using contrastive learning. A distinctive aspect of our proposed method is the fine-tuning strategy of pre-trained models using an adapted NT-Xent loss for contrastive learning, creating a shared embedding space for reference recordings and vocal imitations. The proposed system significantly enhances audio retrieval performance, establishing a new state of the art on both coarse- and fine-grained QBV tasks.
Submitted 21 August, 2024;
originally announced August 2024.
-
Improving Audio Spectrogram Transformers for Sound Event Detection Through Multi-Stage Training
Authors:
Florian Schmid,
Paul Primus,
Tobias Morocutti,
Jonathan Greif,
Gerhard Widmer
Abstract:
This technical report describes the CP-JKU team's submission for Task 4 Sound Event Detection with Heterogeneous Training Datasets and Potentially Missing Labels of the DCASE 24 Challenge. We fine-tune three large Audio Spectrogram Transformers, PaSST, BEATs, and ATST, on the joint DESED and MAESTRO datasets in a two-stage training procedure. The first stage closely matches the baseline system setup and trains a CRNN model while keeping the large pre-trained transformer model frozen. In the second stage, both CRNN and transformer are fine-tuned using heavily weighted self-supervised losses. After the second stage, we compute strong pseudo-labels for all audio clips in the training set using an ensemble of all three fine-tuned transformers. Then, in a second iteration, we repeat the two-stage training process and include a distillation loss based on the pseudo-labels, boosting single-model performance substantially. Additionally, we pre-train PaSST and ATST on the subset of AudioSet that comes with strong temporal labels, before fine-tuning them on the Task 4 datasets.
Submitted 17 July, 2024;
originally announced August 2024.
-
Multi-Iteration Multi-Stage Fine-Tuning of Transformers for Sound Event Detection with Heterogeneous Datasets
Authors:
Florian Schmid,
Paul Primus,
Tobias Morocutti,
Jonathan Greif,
Gerhard Widmer
Abstract:
A central problem in building effective sound event detection systems is the lack of high-quality, strongly annotated sound event datasets. For this reason, Task 4 of the DCASE 2024 challenge proposes learning from two heterogeneous datasets, including audio clips labeled with varying annotation granularity and with different sets of possible events. We propose a multi-iteration, multi-stage procedure for fine-tuning Audio Spectrogram Transformers on the joint DESED and MAESTRO Real datasets. The first stage closely matches the baseline system setup and trains a CRNN model while keeping the pre-trained transformer model frozen. In the second stage, both CRNN and transformer are fine-tuned using heavily weighted self-supervised losses. After the second stage, we compute strong pseudo-labels for all audio clips in the training set using an ensemble of fine-tuned transformers. Then, in a second iteration, we repeat the two-stage training process and include a distillation loss based on the pseudo-labels, achieving a new single-model, state-of-the-art performance on the public evaluation set of DESED with a PSDS1 of 0.692. A single model and an ensemble, both based on our proposed training procedure, ranked first in Task 4 of the DCASE Challenge 2024.
Submitted 17 July, 2024;
originally announced July 2024.
-
Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge
Authors:
Florian Schmid,
Paul Primus,
Toni Heittola,
Annamaria Mesaros,
Irene Martín-Morató,
Khaled Koutini,
Gerhard Widmer
Abstract:
This article describes the Data-Efficient Low-Complexity Acoustic Scene Classification Task in the DCASE 2024 Challenge and the corresponding baseline system. The task setup is a continuation of previous editions (2022 and 2023), which focused on recording device mismatches and low-complexity constraints. This year's edition introduces an additional real-world problem: participants must develop data-efficient systems for five scenarios, which progressively limit the available training data. The provided baseline system is based on an efficient, factorized CNN architecture constructed from inverted residual blocks and uses Freq-MixStyle to tackle the device mismatch problem. The task received 37 submissions from 17 teams, with the large majority of systems outperforming the baseline. The top-ranked system's accuracy ranges from 54.3% on the smallest to 61.8% on the largest subset, corresponding to relative improvements of approximately 23% and 9% over the baseline system on the evaluation set.
Submitted 17 July, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
Stability and Elasticity of Ultrathin Sphere-Patterned Block Copolymer Films
Authors:
Le Qiao,
Daniel A. Vega,
Friederike Schmid
Abstract:
Sphere-patterned ultrathin block copolymer films are potentially interesting for a variety of applications in nanotechnology. We use self-consistent field theory to investigate the elastic response of sphere monolayer films with respect to in-plane shear, in-plane extension and compression deformations, and with respect to bending. The relations between the in-plane elastic moduli are roughly compatible with the expectations for two-dimensional elastic systems with hexagonal symmetry, with one notable exception: the pure shear and the simple shear moduli differ from each other by roughly 20%. Even more importantly, the bending constants are found to be negative, indicating that free-standing block copolymer membranes made of only a sphere monolayer are inherently unstable above the glass transition. Our results are discussed in view of experimental findings.
Submitted 28 April, 2024;
originally announced April 2024.
-
Strong stretching theory of polydisperse curved polymer brushes
Authors:
Marios Giannakou,
Oleg V. Borisov,
Friederike Schmid
Abstract:
We investigate the effect of polydispersity on the properties of curved linear brushes in good solvent and for molten brushes. To this end, we extend the strong stretching theory for polydisperse brushes to curved geometries and investigate the polymer chain end profiles, bending moduli and other properties for experimentally relevant polymer chain length distributions of the Schulz-Zimm type. We also investigate the properties of End Exclusion Zones (EEZ) that may appear in convex geometries under certain conditions, and show that their position in the brush can be engineered by careful selection of the polymer length distribution. Lastly, we propose a method to engineer chain end profiles by engineering the polymer length distribution.
Submitted 24 June, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Cloaking Transition of Droplets on Lubricated Brushes
Authors:
Rodrique G. M. Badr,
Lukas Hauer,
Doris Vollmer,
Friederike Schmid
Abstract:
We study the equilibrium properties and the wetting behavior of a simple liquid on a polymer brush, with and without presence of lubricant by multibody Dissipative Particle Dynamics simulations. The lubricant is modelled as a polymeric liquid consisting of short chains that are chemically identical to the brush polymers. We investigate the behavior of the brush in terms of the grafting density and the amount of lubricant present. Regarding the wetting behavior, we study a sessile droplet on top of the brush. The droplet consists of non-bonded particles that form a dense phase. Our model and choice of parameters result in the formation of a wetting ridge and in the cloaking of the droplet by the lubricant, i.e. the lubricant chains creep up onto the droplet and eventually cover its surface completely. Cloaking is a phenomenon that is observed experimentally and is of integral importance to the dynamics of sliding droplets. We quantify the cloaking in terms of its thickness, which increases with the amount of lubricant present. The analysis reveals a well-defined transition point where the cloaking sets in. We propose a thermodynamic theory to explain this behavior. In addition we investigate the dependence of the contact angles on the size of the droplet and the possible effect of line tension. We quantify the variation of the contact angle with the curvature of the contact line on a lubricant free brush and find a negative value for the line tension. Finally we investigate the effect of cloaking/lubrication on the contact angles and the wetting ridge. We find that lubrication and cloaking reduce the contact angles by a couple of degrees. The effect on the wetting ridge is a reduction in the extension of the brush chains near the three phase contact line, an effect that was also observed in experiments of droplets on crosslinked gels.
Submitted 17 March, 2024;
originally announced March 2024.
-
Dynamics of Droplets Moving on Lubricated Polymer Brushes
Authors:
Rodrique G. M. Badr,
Lukas Hauer,
Doris Vollmer,
Friederike Schmid
Abstract:
Understanding the dynamics of drops on polymer-coated surfaces is crucial for optimizing applications such as self-cleaning materials or microfluidic devices. While the static and dynamic properties of deposited drops have been well characterised, a microscopic understanding of the underlying dynamics is missing. In particular, it is unclear how drop dynamics depends on the amount of uncrosslinked chains in the brush, because experimental techniques fail to quantify those. Here we use coarse-grained simulations to study droplets moving on a lubricated polymer brush substrate under the influence of an external body force. The simulation model is based on the many body dissipative particle dynamics (mDPD) method and designed to mimic a system of water droplets on polydimethylsiloxane (PDMS) brushes with chemically identical PDMS lubricant. In agreement with experiments, we find a sublinear power law dependence between the external force $F$ and the droplet velocity $v$, $F \propto v^α$ with $α<1$; however, the exponents differ ($α\sim 0.6-0.7$ in simulations versus $α\sim 0.25$ in experiments). With increasing velocity, the droplets elongate and the receding contact angle decreases, whereas the advancing contact angle remains roughly constant. Analyzing the flow profiles inside the droplet reveals that the droplets do not slide, but roll, with vanishing slip at the substrate surface. Surprisingly, adding lubricant has very little effect on the effective friction force between the droplet and the substrate, even though it has a pronounced effect on the size and structure of the wetting ridge, especially above the cloaking transition.
Submitted 14 March, 2024;
originally announced March 2024.
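The abstract above reports a power law $F \propto v^α$ between driving force and droplet velocity. As a minimal sketch of how such an exponent can be estimated, the snippet below fits a straight line to log(F) versus log(v) by ordinary least squares. The function name and the data are hypothetical: the values are synthetic, generated with an assumed exponent of 0.65 (inside the range quoted for the simulations), and do not come from the paper.

```python
import math


def fit_power_law_exponent(velocities, forces):
    """Return the least-squares slope of log(F) versus log(v).

    For data following F = c * v**alpha, this slope is alpha.
    """
    xs = [math.log(v) for v in velocities]
    ys = [math.log(f) for f in forces]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Synthetic, noise-free data with an assumed exponent (not data from the paper).
ASSUMED_ALPHA = 0.65
velocities = [0.1, 0.2, 0.4, 0.8, 1.6]
forces = [2.0 * v ** ASSUMED_ALPHA for v in velocities]

alpha = fit_power_law_exponent(velocities, forces)
print(f"fitted exponent: {alpha:.3f}")
```

With real, noisy data the fitted value depends on the velocity window chosen, so the range over which the power law holds should be checked before quoting an exponent.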
-
Tracing Dirac points of topological surface states by ferromagnetic resonance
Authors:
Laura Pietanesi,
Magdalena Marganska,
Thomas Mayer,
Michael Barth,
Lin Chen,
Ji Zou,
Adrian Weindl,
Alexander Liebig,
Rebeca Díaz-Pardo,
Dhavala Suri,
Florian Schmid,
Franz J. Gießibl,
Klaus Richter,
Yaroslav Tserkovnyak,
Matthias Kronseder,
Christian H. Back
Abstract:
Ferromagnetic resonance is used to reveal features of the buried electronic band structure at interfaces between ferromagnetic metals and topological insulators. By monitoring the evolution of magnetic damping, the application of this method to a hybrid structure consisting of a ferromagnetic layer and a 3D topological insulator reveals a clear fingerprint of the Dirac point and exhibits additional features of the interfacial band structure not otherwise observable. The underlying spin-pumping mechanism is discussed in the framework of dissipation of angular momentum by topological surface states (TSSs). Tuning of the Fermi level within the TSS was verified both by varying the stoichiometry of the topological insulator layer and by electrostatic backgating and the damping values obtained in both cases show a remarkable agreement. The high energy resolution of this method additionally allows us to resolve the energetic shift of the local Dirac points generated by local variations of the electrostatic potential. Calculations based on the chiral tunneling process naturally occurring in TSS agree well with the experimental results.
Submitted 7 March, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
A Comprehensive Approach to Characterize Navigation Instruments for Magnetic Guidance in Biological Systems
Authors:
Peter Blümler,
Fabian Raudzus,
Friederike Schmid
Abstract:
The non-invasive spatiotemporal control of cellular functions, organization of tissues, and even the behavior of small animals has become paramount for advanced therapies. As magnetic fields do not interact with biological matter, their application is not only suitable for in vitro experiments but also for in vivo applications, even in deep tissues. Particularly, the remote manipulation of paramagnetic entities through magnetic instruments has emerged as a promising approach across various biological contexts. Despite similarities in basic experimental concepts, variations in the properties and descriptions of those magnetic instruments among the authors and studies resulted in a lack of reproducibility and comparability. Therefore, this article addresses the question of how to standardize the characterization of magnetic instruments. Our emphasis lies on the ability of magnetic systems to control the movement of paramagnetic objects such as ferro- or superparamagnetic particles, within organisms. This movement is achieved by exerting a force on magnetic particles by exposing them to a locally varying magnetic field. While it is well-known that the exerted force depends on the spatial variation (i.e. the gradient) of the magnetic field, the magnitude of the field is equally important. However, this second factor is often neglected in the literature. Therefore, we conduct a comprehensive analysis and discussion of both factors. Furthermore, we propose a novel descriptor, termed "effective gradient", which combines both dependencies. To illustrate, we characterize different magnet systems by calculating and comparing the different quantities and relating them to two experiments with different superparamagnetic nanoparticles.
Submitted 19 January, 2024;
originally announced January 2024.
-
A numerical approach for calculating exact non-adiabatic terms in quantum dynamics
Authors:
Ewen D C Lawrence,
Sebastian F J Schmid,
Ieva Čepaitė,
Peter Kirton,
Callum W Duncan
Abstract:
Understanding how non-adiabatic terms affect quantum dynamics is fundamental to improving various protocols for quantum technologies. We present a novel approach to computing the Adiabatic Gauge Potential (AGP), which gives information on the non-adiabatic terms that arise from time dependence in the Hamiltonian. Our approach uses commutators of the Hamiltonian to build up an appropriate basis of the AGP, which can be easily truncated to give an approximate form when the exact result is intractable. We use this approach to study the AGP obtained for the transverse field Ising model on a variety of graphs, showing how the different underlying graph structures can give rise to very different scaling for the number of terms required in the AGP.
Submitted 19 September, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
How boundary interactions dominate emergent driving of passive probes in active matter
Authors:
Jeanine Shea,
Gerhard Jung,
Friederike Schmid
Abstract:
Colloidal probes immersed in an active bath have been found to behave like active particles themselves. Here, we use coarse-grained simulations to investigate the mechanisms behind this behavior. We find that the active motion of the colloid cannot be simply attributed to the convective motion in the bath. Instead, the boundary of the probe contributes significantly to these adopted dynamics by causing active bath particles to spontaneously accumulate at the probe. This gathering of active bath particles then pushes the probe, thus promoting its emergent active-particle-like behavior. Furthermore, we find that the dynamic properties of the probe depend on its size in a non-monotonic way, which further highlights the non-trivial interplay between probe and bath.
Submitted 17 January, 2024;
originally announced January 2024.
-
Passive particle in an active bath: How can we tell it is out of equilibrium?
Authors:
Jeanine Shea,
Gerhard Jung,
Friederike Schmid
Abstract:
We study a passive probe immersed in a fluid of active particles. Despite the system's non-equilibrium nature, the trajectory of the probe does not exhibit non-equilibrium signatures: its velocity distribution remains Gaussian, the second fluctuation dissipation theorem is not fundamentally violated, and the motion does not indicate breaking of time reversal symmetry. To tell that the probe is out of equilibrium requires examination of its behavior in tandem with that of the active fluid: the kinetic temperature of the probe does not equilibrate to that of the surrounding active particles. As a strategy to diagnose non-equilibrium from probe trajectories alone, we propose to examine their response to a small perturbation which reveals a non-equilibrium signature through a violation of the first fluctuation dissipation theorem.
Submitted 15 January, 2024;
originally announced January 2024.
-
Stability of branched tubular membrane structures
Authors:
Maike Jung,
Gerhard Jung,
Friederike Schmid
Abstract:
We study the energetics and stability of branched tubular membrane structures by computer simulations of a triangulated network model. We find that triple (Y-)junctions can be created and stabilized by applying mechanical forces, if the angle between branches is 120°. The same holds for tetrahedral junctions with tetrahedral angles. If the wrong angles are enforced, the branches coalesce to a linear structure, a pure tube. After releasing the mechanical force, Y-branched structures remain metastable if one constrains the enclosed volume and the average curvature (the area difference) to a fixed value; tetrahedral junctions, however, split up into two Y-junctions. Somewhat counterintuitively, the energy cost of adding a Y-branch is negative in structures with fixed surface area and tube diameter, even if one accounts for the positive contribution of the additional branch end. For fixed average curvature, however, adding a branch also enforces a thinning of tubes, so the overall curvature energy cost is positive. Possible implications for the stability of branched network structures in cells are discussed.
Submitted 15 January, 2024;
originally announced January 2024.
-
Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models
Authors:
Florian Schmid,
Khaled Koutini,
Gerhard Widmer
Abstract:
The introduction of large-scale audio datasets, such as AudioSet, paved the way for Transformers to conquer the audio domain and replace CNNs as the state-of-the-art neural network architecture for many tasks. Audio Spectrogram Transformers are excellent at exploiting large datasets, creating powerful pre-trained models that surpass CNNs when fine-tuned on downstream tasks. However, current popular Audio Spectrogram Transformers are demanding in terms of computational complexity compared to CNNs. Recently, we have shown that, by employing Transformer-to-CNN Knowledge Distillation, efficient CNNs can catch up with and even outperform Transformers on large datasets. In this work, we extend this line of research and increase the capacity of efficient CNNs by introducing dynamic CNN blocks, constructed of dynamic non-linearities, dynamic convolutions and attention mechanisms. We show that these dynamic CNNs outperform traditional efficient CNNs, in terms of the performance-complexity trade-off and parameter efficiency, at the task of audio tagging on the large-scale AudioSet. Our experiments further indicate that the introduced dynamic CNNs achieve better performance on downstream tasks and scale up well, attaining Transformer performance and even outperforming them on AudioSet and several downstream tasks.
Submitted 24 October, 2023;
originally announced October 2023.
-
Force renormalization for probes immersed in an active bath
Authors:
Jeanine Shea,
Gerhard Jung,
Friederike Schmid
Abstract:
Langevin equations or generalized Langevin equations (GLEs) are popular models for describing the motion of a particle in a fluid medium in an effective manner. Here we examine particles immersed in an inherently nonequilibrium fluid, i.e., an active bath, which are subject to an external force. Specifically, we consider two types of forces that are highly relevant for microrheological studies: A harmonic, trapping force and a constant, "drag" force. We study such systems by molecular simulations and use the simulation data to derive an effective GLE description. We find that, in an active bath, the external force in the GLE is not equal to the physical external force, but rather a renormalized external force, which can be significantly smaller. The effect cannot be attributed to the mere temperature renormalization, which is also observed.
Submitted 2 May, 2024; v1 submitted 4 October, 2023;
originally announced October 2023.
-
A low repetition rate optical frequency comb
Authors:
Francesco Canella,
Johannes Weitenberg,
Muhammad Thariq,
Fabian Schmid,
Paras Dwivedi,
Gianluca Galzerano,
Theodor W. Haensch,
Thomas Udem,
Akira Ozawa
Abstract:
Reducing the pulse repetition rate of an optical frequency comb increases the pulse energy for a given average power. This enhances the efficiency of nonlinear frequency conversion and it facilitates extending the accessible wavelength range, for example into the extreme ultraviolet (XUV). The resulting spectrally dense frequency comb can still be used for precision spectroscopy of narrow atomic or molecular transitions. In this article, we demonstrate a low-noise infrared frequency comb with a repetition rate as low as 40 kHz using a Yb:KYW mode-locked laser, pulse picking, and subsequent amplification. The frequency comb structure is confirmed by generating a beat note with a continuous wave reference laser. A comb mode is actively stabilized to the reference laser, and the integrated rms phase noise from 20 Hz to 20 kHz is measured to be 195 mrad.
Submitted 18 September, 2023;
originally announced September 2023.
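The first sentence of the abstract above rests on a simple relation: energy per pulse equals average power divided by repetition rate. The sketch below works through that arithmetic. The 40 kHz rate is taken from the abstract; the 1 W average power and the 100 MHz comparison rate are illustrative assumptions, not figures from the paper.

```python
def pulse_energy_joules(average_power_w: float, repetition_rate_hz: float) -> float:
    """Energy per pulse (J) = average power (W) / repetition rate (Hz)."""
    if repetition_rate_hz <= 0:
        raise ValueError("repetition rate must be positive")
    return average_power_w / repetition_rate_hz


# Assumed average power, chosen only for illustration.
ASSUMED_AVERAGE_POWER_W = 1.0

# 40 kHz is the repetition rate quoted in the abstract.
energy_at_40_khz = pulse_energy_joules(ASSUMED_AVERAGE_POWER_W, 40e3)

# 100 MHz is an assumed comparison value for a higher-repetition-rate comb.
energy_at_100_mhz = pulse_energy_joules(ASSUMED_AVERAGE_POWER_W, 100e6)

print(f"40 kHz:  {energy_at_40_khz * 1e6:.1f} microjoules per pulse")
print(f"100 MHz: {energy_at_100_mhz * 1e9:.1f} nanojoules per pulse")
```

At equal average power the pulse energy scales inversely with the repetition rate, which is why lowering the rate benefits nonlinear frequency conversion.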
-
Nanodroplet Flight Control in Electrohydrodynamic Redox 3D Printing
Authors:
Maxence Menétrey,
Lukáš Zezulka,
Pascal Fandré,
Fabian Schmid,
Ralph Spolenak
Abstract:
Electrohydrodynamic 3D printing is an additive manufacturing technique with enormous potential in plasmonics, microelectronics, and sensing applications, thanks to its broad materials palette, high voxel deposition rate, and compatibility with various substrates. However, the electric field used to deposit material is concentrated at the depositing structure resulting in the focusing of the charged droplets and geometry-dependent landing positions, which complicates the fabrication of complex 3D shapes. The low level of concordance between design and printout seriously impedes the development of electrohydrodynamic 3D printing and rationalizes the simplicity of the designs reported so far. In this work, we break the electric field centrosymmetry to study the resulting deviation in the flight trajectory of the droplets. Comparison of experimental outcomes with predictions of an FEM model provides new insights into the droplet characteristics and unveils how the product of droplet size and charge uniquely governs its kinematics. From these insights, we develop reliable predictions of the jet trajectory and allow the computation of optimized printing paths counterbalancing the electric field distortion, thereby enabling the fabrication of geometries with unprecedented complexity.
Submitted 16 August, 2023;
originally announced August 2023.
-
Viscosity of flexible and semiflexible ring melts -- molecular origins and flow-induced segregation
Authors:
Ranajay Datta,
Fabian Berressem,
Friederike Schmid,
Arash Nikoubashman,
Peter Virnau
Abstract:
We investigate with numerical simulations the molecular origin of viscosity in melts of flexible and semiflexible oligomer rings in comparison to corresponding systems with linear chains. The strong increase of viscosity with ring stiffness is linked to the formation of entangled clusters, which dissolve under shear. This shear-induced breakup and alignment of rings in the flow direction lead to pronounced shear-thinning and non-Newtonian behavior. In melts of linear chains, the viscosity can be associated with the (average) number of entanglements between chains, which also dissolve under shear. While blends of flexible and semiflexible rings are mixed at rest, the two species separate under flow. This phenomenon has potential applications in microfluidic devices to segregate ring polymers of similar mass and chemical composition by their bending rigidity.
Submitted 27 July, 2023; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Device-Robust Acoustic Scene Classification via Impulse Response Augmentation
Authors:
Tobias Morocutti,
Florian Schmid,
Khaled Koutini,
Gerhard Widmer
Abstract:
The ability to generalize to a wide range of recording devices is a crucial performance factor for audio classification models. The characteristics of different types of microphones introduce distributional shifts in the digitized audio signals due to their varying frequency responses. If this domain shift is not taken into account during training, the model's performance could degrade severely when it is applied to signals recorded by unseen devices. In particular, training a model on audio signals recorded with a small number of different microphones can make generalization to unseen devices difficult. To tackle this problem, we convolve audio signals in the training set with pre-recorded device impulse responses (DIRs) to artificially increase the diversity of recording devices. We systematically study the effect of DIR augmentation on the task of Acoustic Scene Classification using CNNs and Audio Spectrogram Transformers. The results show that DIR augmentation in isolation performs similarly to the state-of-the-art method Freq-MixStyle. However, we also show that DIR augmentation and Freq-MixStyle are complementary, achieving a new state-of-the-art performance on signals recorded by devices unseen during training.
Submitted 27 June, 2023; v1 submitted 12 May, 2023;
originally announced May 2023.
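The augmentation described in the abstract above convolves each training signal with a pre-recorded device impulse response (DIR). A minimal sketch of that operation follows, written in plain Python so that it is self-contained. The function names, the toy signal and the toy impulse response are hypothetical, and truncating the result to the input length is an assumption made here for convenience; the abstract does not say how the authors handle the convolution tail.

```python
def convolve(signal, impulse_response):
    """Full discrete linear convolution of two sequences of samples."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def apply_device_impulse_response(signal, impulse_response):
    """Convolve a signal with a DIR and keep the original number of samples.

    Dropping the tail keeps clip lengths unchanged (an assumption of this sketch).
    """
    return convolve(signal, impulse_response)[: len(signal)]


# Toy values for illustration only; real DIRs are measured recordings.
audio = [1.0, 0.0, 0.0, 0.5]
toy_dir = [1.0, 0.5]

augmented = apply_device_impulse_response(audio, toy_dir)
print(augmented)
```

For real audio an FFT-based convolution would be far faster than this quadratic loop; the direct form is used here only to make the operation explicit.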
-
Improving Resolution and Resolvability of Single Particle CryoEM using Gaussian Mixture Models
Authors:
Muyuan Chen,
Michael F. Schmid,
Wah Chiu
Abstract:
Cryogenic electron microscopy is widely used in structural biology, but its resolution is often limited by the dynamics of the macromolecule. Here, we developed a refinement protocol based on Gaussian mixture models that integrates particle orientation and conformation estimation, and improves the alignment for flexible domains of protein structures. We demonstrated this protocol on multiple datasets, resulting in improved resolution and resolvability, locally and globally, by visual and quantitative measures.
Submitted 29 August, 2023; v1 submitted 31 March, 2023;
originally announced March 2023.
-
Low-Complexity Audio Embedding Extractors
Authors:
Florian Schmid,
Khaled Koutini,
Gerhard Widmer
Abstract:
Solving tasks such as speaker recognition, music classification, or semantic audio event tagging with deep learning models typically requires computationally demanding networks. General-purpose audio embeddings (GPAEs) are dense representations of audio signals that allow lightweight, shallow classifiers to tackle various audio tasks. The idea is that a single complex feature extractor would extract dense GPAEs, while shallow MLPs can produce task-specific predictions. If the extracted dense representations are general enough to allow the simple downstream classifiers to generalize to a variety of tasks in the audio domain, a single costly forward pass suffices to solve multiple tasks in parallel. In this work, we try to reduce the cost of GPAE extractors to make them suitable for resource-constrained devices. We use efficient MobileNets trained on AudioSet using Knowledge Distillation from a Transformer ensemble as efficient GPAE extractors. We explore how to obtain high-quality GPAEs from the model, study how model complexity relates to the quality of extracted GPAEs, and conclude that low-complexity models can generate competitive GPAEs, paving the way for analyzing audio streams on edge devices w.r.t. multiple audio classification and recognition tasks.
Submitted 23 June, 2023; v1 submitted 3 March, 2023;
originally announced March 2023.
-
Understanding and modeling polymers: The challenge of multiple scales
Authors:
Friederike Schmid
Abstract:
Polymer materials have the characteristic feature that they are multiscale systems by definition. Already the description of a single molecule involves a multitude of different scales, and cooperative processes in polymer assemblies are governed by the interplay of these scales. Polymers have been among the first materials for which systematic multiscale techniques were developed, yet they continue to present extraordinary challenges for modellers. In this perspective, we review popular models that are used to describe polymers on different scales and discuss scale bridging strategies such as static and dynamic coarse-graining methods and multiresolution approaches. We close with a list of hard problems which still need to be solved in order to gain a comprehensive quantitative understanding of polymer systems on all scales.
Submitted 17 December, 2022;
originally announced December 2022.
-
Learning General Audio Representations with Large-Scale Training of Patchout Audio Transformers
Authors:
Khaled Koutini,
Shahed Masoudian,
Florian Schmid,
Hamid Eghbal-zadeh,
Jan Schlüter,
Gerhard Widmer
Abstract:
The success of supervised deep learning methods is largely due to their ability to learn relevant features from raw data. Deep Neural Networks (DNNs) trained on large-scale datasets are capable of capturing a diverse set of features, and learning a representation that can generalize onto unseen tasks and datasets that are from the same domain. Hence, these models can be used as powerful feature extractors, in combination with shallower models as classifiers, for smaller tasks and datasets where the amount of training data is insufficient for learning an end-to-end model from scratch. During the past years, Convolutional Neural Networks (CNNs) have largely been the method of choice for audio processing. However, recently attention-based transformer models have demonstrated great potential in supervised settings, outperforming CNNs. In this work, we investigate the use of audio transformers trained on large-scale datasets to learn general-purpose representations. We study how the different setups in these audio transformers affect the quality of their embeddings. We experiment with the models' time resolution, extracted embedding level, and receptive fields in order to see how they affect performance on a variety of tasks and datasets, following the HEAR 2021 NeurIPS challenge evaluation setup. Our results show that representations extracted by audio transformers outperform CNN representations. Furthermore, we will show that transformers trained on Audioset can be extremely effective representation extractors for a wide range of downstream tasks.
Submitted 2 March, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
Efficient Large-scale Audio Tagging via Transformer-to-CNN Knowledge Distillation
Authors:
Florian Schmid,
Khaled Koutini,
Gerhard Widmer
Abstract:
Audio Spectrogram Transformer models rule the field of Audio Tagging, outrunning previously dominating Convolutional Neural Networks (CNNs). Their superiority is based on the ability to scale up and exploit large-scale datasets such as AudioSet. However, Transformers are demanding in terms of model size and computational requirements compared to CNNs. We propose a training procedure for efficient CNNs based on offline Knowledge Distillation (KD) from high-performing yet complex transformers. The proposed training schema and the efficient CNN design based on MobileNetV3 result in models outperforming previous solutions in terms of parameter and computational efficiency and prediction performance. We provide models of different complexity levels, scaling from low-complexity models up to a new state-of-the-art performance of .483 mAP on AudioSet. Source Code available at: https://github.com/fschmid56/EfficientAT
Submitted 23 June, 2023; v1 submitted 9 November, 2022;
originally announced November 2022.
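The offline knowledge distillation idea in this abstract can be illustrated with a small loss function. What follows is a sketch under assumptions, not the training code from the linked repository: it assumes a multi-label setting with sigmoid outputs, a weighted sum of a label loss and a distillation loss, and teacher logits that were precomputed and stored (the "offline" part). The weight `lam` and all function names are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def bce(probs, targets, eps=1e-7):
    """Mean binary cross-entropy; targets may be soft values in [0, 1]."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
    return total / len(probs)


def distillation_loss(student_logits, labels, teacher_logits, lam=0.1):
    """Weighted sum of label loss and teacher-matching loss.

    teacher_logits are assumed to be precomputed, so no teacher forward
    pass is needed while training the student.
    """
    student = [sigmoid(z) for z in student_logits]
    teacher = [sigmoid(z) for z in teacher_logits]
    return lam * bce(student, labels) + (1.0 - lam) * bce(student, teacher)
```

A small `lam` makes the student follow the teacher's soft predictions more closely than the hard labels; the value used here is only a placeholder.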
-
Number-Resolved Detection of Dark Ions in Coulomb Crystals
Authors:
Fabian Schmid,
Johannes Weitenberg,
Jorge Moreno,
Theodor W. Hänsch,
Thomas Udem,
Akira Ozawa
Abstract:
While it is straightforward to count laser-cooled trapped ions by fluorescence imaging, detecting the number of dark ions embedded and sympathetically cooled in a mixed ion crystal is more challenging. We demonstrate a method to track the number of dark ions in real time with single-particle sensitivity. This is achieved by observing discrete steps in the amount of fluorescence emitted from the coolant ions while exciting secular motional resonances of dark ions. By counting the number of fluorescence steps, we can identify the number of dark ions without calibration and without relying on any physical model of the motional excitation. We demonstrate the scheme by detecting H$_2^+$ and H$_3^+$ ions embedded in a Be$^+$ ion Coulomb crystal in a linear radio frequency trap. Our method allows observing the generation and destruction of individual ions simultaneously for different types of ions. Besides high-resolution spectroscopy of dark ions, another application is the detection of chemical reactions in real time with single-particle sensitivity. This is demonstrated in this work.
Submitted 5 October, 2022;
originally announced October 2022.
-
Adsorption-active polydisperse brush with tunable molecular mass distribution
Authors:
Anna S. Ivanova,
Alexey A. Polotsky,
Alexander M. Skvortsov,
Leonid I. Klushin,
Friederike Schmid
Abstract:
Recently a novel class of responsive uncharged polymer brushes has been proposed [Klushin et al, J. Chem. Phys. 154, 074904 (2021)] where the brush-forming chains have an affinity to the substrate. For sufficiently strong surface interactions, a fraction of chains condenses into a near-surface layer, while the remaining ones form the outer brush with a reduced grafting density. The dense layer and the more tenuous outer brush can be seen as coexisting microphases. The effective grafting density of the outer brush is controlled by the adsorption strength and can be changed reversibly as a response to changes in environmental parameters.
In this paper we use numerical self-consistent field calculations and theoretical considerations to study this phenomenon in polydisperse brushes. Our results reveal an unexpected effect: Although all chains are chemically identical, shorter chains are adsorbed preferentially. Hence, with the increase in the surface affinity parameter, a reduction in the surface grafting density of the residual brush is accompanied by a change in the shape of its molecular mass distribution. In particular, an originally bidisperse brush can be effectively transformed into a nearly monodisperse one containing only the longer chain fraction.
Submitted 10 December, 2021;
originally announced December 2021.
-
Deep Metric Learning for Ground Images
Authors:
Raaghav Radhakrishnan,
Jan Fabian Schmid,
Randolf Scholz,
Lars Schmidt-Thieme
Abstract:
Ground texture based localization methods are potential prospects for low-cost, high-accuracy self-localization solutions for robots. These methods estimate the pose of a given query image, i.e. the current observation of the ground from a downward-facing camera, with respect to a set of reference images whose poses are known in the application area. In this work, we deal with the initial localization task, in which we have no prior knowledge about the current robot positioning. In this situation, the localization method would have to consider all available reference images. However, in order to reduce computational effort and the risk of receiving a wrong result, we would like to consider only those reference images that are actually overlapping with the query image. For this purpose, we propose a deep metric learning approach that retrieves the most similar reference images to the query image. In contrast to existing approaches to image retrieval for ground images, our approach achieves significantly better recall performance and improves the localization performance of a state-of-the-art ground texture based localization method.
Submitted 3 September, 2021;
originally announced September 2021.
-
Model-Based Parameter Optimization for Ground Texture Based Localization Methods
Authors:
Jan Fabian Schmid,
Stephan F. Simon,
Rudolf Mester
Abstract:
A promising approach to accurate positioning of robots is ground texture based localization. It is based on the observation that visual features of ground images enable fingerprint-like place recognition. We tackle the issue of efficient parametrization of such methods, deriving a prediction model for localization performance, which requires only a small collection of sample images of an application area. In a first step, we examine whether the model can predict the effects of changing one of the most important parameters of feature-based localization methods: the number of extracted features. We examine two localization methods, and in both cases our evaluation shows that the predictions are sufficiently accurate. Since this model can be used to find suitable values for any parameter, we then present a holistic parameter optimization framework, which finds suitable texture-specific parameter configurations, using only the model to evaluate the considered parameter configurations.
Submitted 3 September, 2021;
originally announced September 2021.
-
Fluctuation-Dissipation Relations Far from Equilibrium: A Case Study
Authors:
Gerhard Jung,
Friederike Schmid
Abstract:
Fluctuation-dissipation relations or "theorems" (FDTs) are fundamental for statistical physics and can be rigorously derived for equilibrium systems. Their applicability to non-equilibrium systems is, however, debated. Here, we simulate an active microrheology experiment, in which a spherical colloid is pulled with a constant external force through a fluid, creating near-equilibrium and far-from-equilibrium systems. We characterize the structural and dynamical properties of these systems, and reconstruct an effective generalized Langevin equation (GLE) for the colloid dynamics. Specifically, we test the validity of two FDTs: The first FDT relates the non-equilibrium response of a system to equilibrium correlation functions, and the second FDT relates the memory friction kernel in the GLE to the stochastic force. We find that the validity of the first FDT depends strongly on the strength of the external driving: it is fulfilled close to equilibrium and breaks down far from it. In contrast, we observe that the second FDT is always fulfilled. We provide a mathematical argument why this generally holds for memory kernels reconstructed from a deterministic Volterra equation for correlation functions, even for non-stationary non-equilibrium systems.
Motivated by the Mori-Zwanzig formalism, we therefore suggest imposing an orthogonality constraint on the stochastic force, which is in fact equivalent to the validity of this Volterra equation. Such GLEs automatically satisfy the second FDT and are unique, which is desirable when using GLEs for coarse-grained modeling.
Submitted 14 September, 2021; v1 submitted 1 June, 2021;
originally announced June 2021.
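For readers unfamiliar with the objects named in this abstract, the textbook forms of the generalized Langevin equation, the second FDT, and the Volterra equation are sketched below. The notation ($M$, $K$, $\eta$, $C$) is assumed for illustration, and the second FDT is written in its standard equilibrium form; the paper's non-equilibrium formulation (for instance, velocities measured relative to the mean drift) may differ in detail.

```latex
% Generalized Langevin equation for the colloid velocity v(t), mass M,
% memory kernel K, stochastic force \eta:
M \dot{v}(t) = -\int_0^t K(t-s)\, v(s)\, \mathrm{d}s + \eta(t)

% Second FDT (standard equilibrium form): the noise correlation is fixed
% by the memory kernel
\langle \eta(t)\, \eta(t') \rangle = k_B T\, K(|t-t'|)

% Orthogonality constraint on the stochastic force:
\langle \eta(t)\, v(0) \rangle = 0

% Multiplying the GLE by v(0) and averaging, the noise term drops out by
% orthogonality, leaving a deterministic Volterra equation for the
% velocity autocorrelation C(t) = \langle v(t)\, v(0) \rangle:
M \dot{C}(t) = -\int_0^t K(t-s)\, C(s)\, \mathrm{d}s
```

The last line shows why the orthogonality constraint and the Volterra equation go together: without $\langle \eta(t)\, v(0) \rangle = 0$, an extra noise-velocity correlation term would remain on the right-hand side.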
-
Dynamic coarse-graining of polymer systems using mobility functions
Authors:
Bing Li,
Kostas Daoulas,
Friederike Schmid
Abstract:
We propose a dynamic coarse-graining (CG) scheme for mapping heterogeneous polymer fluids onto extremely CG models in a dynamically consistent manner. The idea is to use as target function for the mapping a wave-vector dependent mobility function derived from the single-chain dynamic structure factor, which is calculated in the microscopic reference system. In previous work, we have shown that dynamic density functional calculations based on this mobility function can accurately reproduce the order/disorder kinetics in polymer melts, thus it is a suitable starting point for dynamic mapping. To enable the mapping over a range of relevant wave vectors, we propose to modify the CG dynamics by introducing internal friction parameters that slow down the CG monomer dynamics on local scales, without affecting the static equilibrium structure of the system. We illustrate and discuss the method using the example of infinitely long linear Rouse polymers mapped onto ultrashort CG chains. We show that our method can be used to construct dynamically consistent CG models for homopolymers with CG chain length N=4, whereas for copolymers, longer CG chain lengths are necessary.
Submitted 7 March, 2021; v1 submitted 26 February, 2021;
originally announced February 2021.
-
Motional resonances of three-dimensional dual-species Coulomb crystals
Authors:
Byoung-moo Ann,
Fabian Schmid,
Jonas Krause,
Theodor W. Hänsch,
Thomas Udem,
Akira Ozawa
Abstract:
We investigate the motional resonances of dual-species Coulomb crystals comprised of $^9$Be$^+$ and $^{24}$Mg$^+$ ions held in a 4-rod linear Paul trap. Our experimental data and simulations show that the secular motion of such mixed crystals has rich dynamics. Their secular spectra can differ significantly from those of pure ion crystals. We propose a simple model based on mechanical coupling with Coulomb interactions between the two different ion species that explains many features of the secular spectrum. Our findings contribute to a more reliable identification of the ion species in mixed crystals.
Submitted 15 February, 2021;
originally announced February 2021.
-
Simple phase noise measurement scheme for cavity-stabilized laser systems
Authors:
Fabian Schmid,
Johannes Weitenberg,
Theodor W. Hänsch,
Thomas Udem,
Akira Ozawa
Abstract:
We describe a simple method for measuring the residual fast phase noise of a cavity-stabilized laser using the cavity as a reference. The method is based on generating a beat note between the laser output and the strongly filtered light transmitted through the cavity. The beat note can be directly analyzed without requiring further calibration of system parameters. We apply the method to measure the residual phase noise of an external-cavity diode laser (ECDL) locked to a reference cavity and compare the results with an analysis of the in-loop error signal of the feedback system.
Submitted 15 February, 2021;
originally announced February 2021.
-
Polymer brushes with reversibly tunable grafting density
Authors:
Leonid I. Klushin,
Alexander M. Skvortsov,
Alexey A. Polotsky,
Anna S. Ivanova,
Friederike Schmid
Abstract:
We propose a novel class of responsive polymer brushes, where the effective grafting density can be controlled by external stimuli. This is achieved by using end-grafted polymer chains that have an affinity to the substrate. For sufficiently strong surface interactions, a fraction of chains condenses into a near-surface layer, while the remaining ones form the outer brush. The dense layer and the more tenuous outer brush can be seen as coexisting microphases. The effective grafting density of the outer brush is controlled by the adsorption strength and can be changed reversibly and in a controlled way as a response to changes in environmental parameters. The effect is demonstrated by numerical SCF calculations and analyzed by scaling arguments. Since the thickness of the denser layer is about a few monomer sizes, its capacity to form a microphase is limited by the product of the brush chain length and the grafting density. We explore the range of chain lengths and grafting densities where the effect is most pronounced. In this range, the SCF studies suggest that individual chains inside the brush show large rapid fluctuations between two states that are separated by only a small free energy barrier. The behavior of the brush as a whole, however, does not reflect these large fluctuations, and the effective grafting density varies smoothly as a function of the control parameters.
Submitted 11 February, 2021;
originally announced February 2021.
-
A multivariate extension of the Lorenz curve based on copulas and a related multivariate Gini coefficient
Authors:
Oliver Grothe,
Fabian Kächele,
Friedrich Schmid
Abstract:
We propose an extension of the univariate Lorenz curve and of the Gini coefficient to the multivariate case, i.e., to simultaneously measure inequality in more than one variable. Our extensions are based on copulas and measure inequality stemming from inequality in every single variable as well as inequality stemming from the dependence structure of the variables. We derive simple nonparametric estimators for both instruments and apply them, by way of example, to data on individual income and wealth for various countries.
Submitted 22 April, 2022; v1 submitted 12 January, 2021;
originally announced January 2021.
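As background for this abstract, the univariate Lorenz curve and Gini coefficient that the paper extends can be computed empirically as below. This sketch covers only the classical one-variable case, not the copula-based multivariate extension; the function names are hypothetical, and the input is assumed to be non-negative with a positive total.

```python
def lorenz_curve(values):
    """Empirical Lorenz curve as a list of (population share, value share) points."""
    xs = sorted(values)
    total = sum(xs)
    n = len(xs)
    points = [(0.0, 0.0)]
    running = 0.0
    for i, x in enumerate(xs, start=1):
        running += x
        points.append((i / n, running / total))
    return points


def gini(values):
    """Gini coefficient: 1 minus twice the trapezoidal area under the Lorenz curve."""
    pts = lorenz_curve(values)
    area = 0.0
    for (p0, l0), (p1, l1) in zip(pts, pts[1:]):
        area += (p1 - p0) * (l0 + l1) / 2.0
    return 1.0 - 2.0 * area
```

With this estimator, a perfectly equal sample gives a Gini coefficient of 0, and a sample of size n in which one unit holds everything gives (n - 1) / n.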
-
Shear-thinning in Polymer Melts -- Molecular Origins and Hybrid Multiscale Simulations
Authors:
Ranajay Datta,
Leonid Yelash,
Friederike Schmid,
Florian Kummer,
Martin Oberlack,
Maria Lukáčová-Medvid'ová,
Peter Virnau
Abstract:
We investigate the molecular origin of shear-thinning in melts of flexible, semiflexible and rigid oligomers with coarse-grained simulations of a sheared melt. Alignment, stretching and tumbling modes or suppression of the latter all contribute to understanding how macroscopic flow properties emerge from the molecular level. By performing simulations of single chains in a shear flow, we identify which of these phenomena are of collective nature and arise through interchain interactions and which are already present in dilute systems. Building upon these microscopic simulations we identify by means of the Irving-Kirkwood formula the corresponding macroscopic stress tensor for a non-Newtonian polymer fluid. Shear-thinning effects in oligomer melts are also demonstrated by macroscopic simulations of a channel flow. The latter have been obtained by the discontinuous Galerkin method approximating macroscopic polymer flows. Our study confirms the influence of microscopic details in the molecular structure of short polymers such as chain flexibility on macroscopic polymer flows.
Submitted 14 July, 2021; v1 submitted 10 January, 2021;
originally announced January 2021.