-
Hateful Person or Hateful Model? Investigating the Role of Personas in Hate Speech Detection by Large Language Models
Authors:
Shuzhou Yuan,
Ercong Nie,
Mario Tawfelis,
Helmut Schmid,
Hinrich Schütze,
Michael Färber
Abstract:
Hate speech detection is a socially sensitive and inherently subjective task, with judgments often varying based on personal traits. While prior work has examined how socio-demographic factors influence annotation, the impact of personality traits on Large Language Models (LLMs) remains largely unexplored. In this paper, we present the first comprehensive study on the role of persona prompts in hate speech classification, focusing on MBTI-based traits. A human annotation survey confirms that MBTI dimensions significantly affect labeling behavior. Extending this to LLMs, we prompt four open-source models with MBTI personas and evaluate their outputs across three hate speech datasets. Our analysis uncovers substantial persona-driven variation, including inconsistencies with ground truth, inter-persona disagreement, and logit-level biases. These findings highlight the need to carefully define persona prompts in LLM-based annotation workflows, with implications for fairness and alignment with human values.
Submitted 10 June, 2025;
originally announced June 2025.
-
XToM: Exploring the Multilingual Theory of Mind for Large Language Models
Authors:
Chunkit Chan,
Yauwai Yim,
Hongchuan Zeng,
Zhiying Zou,
Xinyuan Cheng,
Zhifan Sun,
Zheye Deng,
Kawai Chung,
Yuzhuo Ao,
Yixiang Fan,
Cheng Jiayang,
Ercong Nie,
Ginny Y. Wong,
Helmut Schmid,
Hinrich Schütze,
Simon See,
Yangqiu Song
Abstract:
Theory of Mind (ToM), the ability to infer mental states in others, is pivotal for human social cognition. Existing evaluations of ToM in LLMs are largely limited to English, neglecting the linguistic diversity that shapes human cognition. This limitation raises a critical question: can LLMs exhibit Multilingual Theory of Mind, which is the capacity to reason about mental states across diverse linguistic contexts? To address this gap, we present XToM, a rigorously validated multilingual benchmark that evaluates ToM across five languages and incorporates diverse, contextually rich task scenarios. Using XToM, we systematically evaluate LLMs (e.g., DeepSeek R1), revealing a pronounced dissonance: while models excel in multilingual language understanding, their ToM performance varies across languages. Our findings expose limitations in LLMs' ability to replicate human-like mentalizing across linguistic contexts.
Submitted 3 June, 2025;
originally announced June 2025.
-
LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification
Authors:
Shuzhou Yuan,
Ercong Nie,
Lukas Kouba,
Ashish Yashwanth Kangen,
Helmut Schmid,
Hinrich Schütze,
Michael Färber
Abstract:
Detoxification, the task of rewriting harmful language into non-toxic text, has become increasingly important amid the growing prevalence of toxic content online. However, high-quality parallel datasets for detoxification, especially for hate speech, remain scarce due to the cost and sensitivity of human annotation. In this paper, we propose a novel LLM-in-the-loop pipeline leveraging GPT-4o-mini for automated detoxification. We first replicate the ParaDetox pipeline by replacing human annotators with an LLM and show that the LLM performs comparably to human annotation. Building on this, we construct ParaDeHate, a large-scale parallel dataset specifically for hate speech detoxification. We release ParaDeHate as a benchmark of over 8K hate/non-hate text pairs and evaluate a wide range of baseline methods. Experimental results show that models such as BART, fine-tuned on ParaDeHate, achieve better performance in style accuracy, content preservation, and fluency, demonstrating the effectiveness of LLM-generated detoxification text as a scalable alternative to human annotation.
Submitted 6 June, 2025; v1 submitted 2 June, 2025;
originally announced June 2025.
-
EXECUTE: A Multilingual Benchmark for LLM Token Understanding
Authors:
Lukas Edman,
Helmut Schmid,
Alexander Fraser
Abstract:
The CUTE benchmark showed that LLMs struggle with character understanding in English. We extend it to more languages with diverse scripts and writing systems, introducing EXECUTE. Our simplified framework allows easy expansion to any language. Tests across multiple LLMs reveal that challenges in other languages are not always on the character level as in English. Some languages show word-level processing issues, some show no issues at all. We also examine sub-character tasks in Chinese, Japanese, and Korean to assess LLMs' understanding of character components.
Submitted 23 May, 2025;
originally announced May 2025.
-
Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models
Authors:
Ercong Nie,
Helmut Schmid,
Hinrich Schütze
Abstract:
Language confusion -- where large language models (LLMs) generate unintended languages against the user's need -- remains a critical challenge, especially for English-centric models. We present the first mechanistic interpretability (MI) study of language confusion, combining behavioral benchmarking with neuron-level analysis. Using the Language Confusion Benchmark (LCB), we show that confusion points (CPs) -- specific positions where language switches occur -- are central to this phenomenon. Through layer-wise analysis with TunedLens and targeted neuron attribution, we reveal that transition failures in the final layers drive confusion. We further demonstrate that editing a small set of critical neurons, identified via comparative analysis with multilingual-tuned models, substantially mitigates confusion without harming general competence or fluency. Our approach matches multilingual alignment in confusion reduction for most languages and yields cleaner, higher-quality outputs. These findings provide new insights into the internal dynamics of LLMs and highlight neuron-level interventions as a promising direction for robust, interpretable multilingual language modeling.
Submitted 22 May, 2025;
originally announced May 2025.
-
Odd-parity ground state in dilute Yu-Shiba-Rusinov dimers and chains
Authors:
Lisa M. Rütten,
Harald Schmid,
Werner M. J. van Weerdenburg,
Eva Liebhaber,
Kai Rossnagel,
Katharina J. Franke
Abstract:
Magnetic adatoms on superconductors induce Yu-Shiba-Rusinov (YSR) states, which are key to the design of low-dimensional correlated systems and topological superconductivity. Competing magnetic interactions and superconducting pairing lead to a rich phase diagram. Using a scanning tunneling microscope (STM), we position Fe atoms on 2H-NbSe$_2$ to build a dimer with an odd-parity ground state, i.e., a partially screened YSR channel with the hybridized states spanning the Fermi level. This ground state makes the dimer a promising precursor for a topological YSR chain. By adding one atom at a time, we track the formation of YSR bands. The lowest-energy band crosses the Fermi level and we find strong site-dependent spectral variations especially at the chain's terminations. We attribute these features to quantum spin effects and ferromagnetic coupling influenced by the local chemical environment, rather than topological superconductivity or Majorana modes.
Submitted 8 April, 2025;
originally announced April 2025.
-
High-contrast spectroscopy with the new VLT/ERIS instrument: Molecular maps and radial velocity of the gas giant AF Lep b
Authors:
Jean Hayoz,
Markus Johannes Bonse,
Felix Dannert,
Emily Omaya Garvin,
Gabriele Cugno,
Polychronis Patapis,
Timothy D. Gebhard,
William O. Balmer,
Robert J. De Rosa,
Alexander Agudo Berbel,
Yixian Cao,
Gilles Orban de Xivry,
Tomas Stolker,
Richard Davies,
Olivier Absil,
Hans Martin Schmid,
Sascha Patrick Quanz,
Guido Agapito,
Andrea Baruffolo,
Martin Black,
Marco Bonaglia,
Runa Briguglio,
Luca Carbonaro,
Giovanni Cresci,
Yigit Dallilar
, et al. (44 additional authors not shown)
Abstract:
The Enhanced Resolution Imager and Spectrograph (ERIS) is the new Adaptive-Optics (AO) assisted Infrared instrument at the Very Large Telescope (VLT). Its refurbished Integral Field Spectrograph (IFS) SPIFFIER leverages a new AO module, enabling high-contrast imaging applications and giving access to the orbital and atmospheric characterisation of super-Jovian exoplanets. We test the detection limits of ERIS and demonstrate its scientific potential by exploring the atmospheric composition of the young super-Jovian AF Lep b and improving its orbital solution by measuring its radial velocity relative to its host star. We present new spectroscopic observations of AF Lep b in $K$-band at $R\sim 11000$ obtained with ERIS/SPIFFIER at the VLT. We reduce the data using the standard pipeline together with a custom wavelength calibration routine, and remove the stellar PSF using principal component analysis along the spectral axis. We compute molecular maps by cross-correlating the residuals with molecular spectral templates and measure the radial velocity of the planet relative to the star. Furthermore, we compute contrast grids for molecular mapping by injecting fake planets. We detect a strong signal from H$_{2}$O and CO but not from CH$_{4}$ or CO$_{2}$. This result corroborates the hypothesis of chemical disequilibrium in the atmosphere of AF Lep b. Our measurement of the RV of the planet yields $\Delta v_{\mathrm{R,P\star}} = 7.8 \pm 1.7$ km s$^{-1}$. This enables us to disentangle the degeneracy of the orbital solution, namely the correct longitude of the ascending node is $\Omega=248^{+0.4}_{-0.7}$ deg and the argument of periapsis is $\omega=109^{+13}_{-21}$ deg. Our results demonstrate the competitiveness of the new ERIS/SPIFFIER instrument for the orbital and atmospheric characterisation of exoplanets at high contrast and small angular separation.
Submitted 3 June, 2025; v1 submitted 27 February, 2025;
originally announced February 2025.
-
XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs
Authors:
Linyang He,
Ercong Nie,
Sukru Samet Dindar,
Arsalan Firoozi,
Adrian Florea,
Van Nguyen,
Corentin Puffay,
Riki Shimizu,
Haotian Ye,
Jonathan Brennan,
Helmut Schmid,
Hinrich Schütze,
Nima Mesgarani
Abstract:
We introduce XCOMPS in this work, a multilingual conceptual minimal pair dataset covering 17 languages. Using this dataset, we evaluate LLMs' multilingual conceptual understanding through metalinguistic prompting, direct probability measurement, and neurolinguistic probing. By comparing base, instruction-tuned, and knowledge-distilled models, we find that: 1) LLMs exhibit weaker conceptual understanding for low-resource languages, and accuracy varies across languages despite being tested on the same concept sets. 2) LLMs excel at distinguishing concept-property pairs that are visibly different but exhibit a marked performance drop when negative pairs share subtle semantic similarities. 3) Instruction tuning improves performance in concept understanding but does not enhance internal competence; knowledge distillation can enhance internal competence in conceptual understanding for low-resource languages with limited gains in explicit task performance. 4) More morphologically complex languages yield lower concept understanding scores and require deeper layers for conceptual reasoning.
Submitted 26 February, 2025;
originally announced February 2025.
-
Language Model Re-rankers are Steered by Lexical Similarities
Authors:
Lovisa Hagström,
Ercong Nie,
Ruben Halifa,
Helmut Schmid,
Richard Johansson,
Alexander Junge
Abstract:
Language model (LM) re-rankers are used to refine retrieval results for retrieval-augmented generation (RAG). They are more expensive than lexical matching methods like BM25 but assumed to better process semantic information. To understand whether LM re-rankers always live up to this assumption, we evaluate 6 different LM re-rankers on the NQ, LitQA2 and DRUID datasets. Our results show that LM re-rankers struggle to outperform a simple BM25 re-ranker on DRUID. Leveraging a novel separation metric based on BM25 scores, we explain and identify re-ranker errors stemming from lexical dissimilarities. We also investigate different methods to improve LM re-ranker performance and find these methods mainly useful for NQ. Taken together, our work identifies and explains weaknesses of LM re-rankers and points to the need for more adversarial and realistic datasets for their evaluation.
Submitted 24 February, 2025;
originally announced February 2025.
-
Orientation Dependent Resistivity Scaling in Mesoscopic NbP Crystals
Authors:
Gianluca Mariani,
Federico Balduini,
Nathan Drucker,
Lorenzo Rocchino,
Vicky Hasse,
Claudia Felser,
Heinz Schmid,
Cezar Zota,
Bernd Gotsmann
Abstract:
The scaling of Si transistor technology has resulted in a remarkable improvement in the performance of integrated circuits over the last decades. However, scaled transistors also require reduced electrical interconnect dimensions, which lead to greater losses and power dissipation at circuit level. This is mainly caused by enhanced surface scattering of charge carriers in copper interconnect wires at dimensions below 30 nm. A promising approach to mitigate this issue is to use directional conductors, i.e. materials with anisotropic Fermi surface, where proper alignment of crystalline orientation and transport direction can minimize surface scattering. In this work, we perform a resistivity scaling study of the anisotropic semimetal NbP as a function of crystalline orientation. We use here focused ion beam to pattern and scale down NbP crystallites to dimensions comparable to the electron scattering length at cryogenic temperatures. The experimental transport properties are correlated with the Fermi surface characteristics through a theoretical model, thus identifying the physical mechanisms that influence the resistivity scaling of anisotropic conductors. Our methodology provides an effective approach for early evaluation of anisotropic materials as future ultra-scalable interconnects, even when they are unavailable as epitaxial films.
Submitted 18 February, 2025;
originally announced February 2025.
-
Subharmonic spin correlations and spectral pairing in Floquet time crystals
Authors:
Alexander-Georg Penner,
Harald Schmid,
Leonid I. Glazman,
Felix von Oppen
Abstract:
Floquet time crystals are characterized by subharmonic behavior of temporal correlation functions. Studying the paradigmatic time crystal based on the disordered Floquet quantum Ising model, we show that its temporal spin correlations are directly related to spectral characteristics and that this relation provides analytical expressions for the correlation function of finite chains, which compare favorably with numerical simulations. Specifically, we show that the disorder-averaged temporal spin correlations are proportional to the Fourier transform of the splitting distribution of the pairs of eigenvalues of the Floquet operator, which differ by $\pi$ to exponential accuracy in the chain length. We find that the splittings are well described by a log-normal distribution, implying that the temporal spin correlations are characterized by two parameters. We discuss possible implications for the phase diagram of the Floquet time crystals.
Submitted 30 January, 2025;
originally announced January 2025.
-
Engineering Magnetotransport Through Hierarchical Symmetry in Weyl Semimetal Superlattices
Authors:
Nathan C. Drucker,
Federico Balduini,
Jules Schadt,
Lorenzo Rocchino,
Tathagata Paul,
Vicky Hasse,
Claudia Felser,
Heinz Schmid,
Cezar B. Zota,
Bernd Gotsmann
Abstract:
Superlattice engineering is a powerful way to tune the transport properties of a material. In this work we show that magnetotransport can be modified by superlattices in 3D materials based on the relative symmetry between the Fermi-surface and superlattice. We demonstrate commensuration oscillations in the ballistic transport regime of a nanostructured 3D material with the Weyl semimetal NbP, a signature typically limited to superlattices in 2D materials. The behavior of the oscillations encodes information about the shared properties between the quasiparticles at the Fermi-surface--including their momentum, charge, mass, and rotational symmetry--and the structure of the superlattice. The magnetic field and temperature dependence of the commensuration oscillations enables us to extract the Fermi-momenta and quasiparticle mass at an order of magnitude lower magnetic field and higher temperature than Shubnikov-de Haas quantum oscillations. Furthermore, we use a chiral superlattice to engineer asymmetric longitudinal magnetoresistance based on the charge of the quasiparticles and superlattice enantiomer. These results demonstrate nanopatterned superlattices as an effective method for fermiology, and also point towards new ways of engineering quantum transport in these systems based on the mutual properties of the superlattice and Fermi-surface.
Submitted 2 May, 2025; v1 submitted 27 January, 2025;
originally announced January 2025.
-
The SPHERE infrared survey for exoplanets (SHINE). V. Complete observations, data reduction and analysis, detection performances, and final results
Authors:
A. Chomez,
P. Delorme,
A. -M. Lagrange,
R. Gratton,
O. Flasseur,
G. Chauvin,
M. Langlois,
J. Mazoyer,
A. Zurlo,
S. Desidera,
D. Mesa,
M. Bonnefoy,
M. Feldt,
J. Hagelberg,
M. Meyer,
A. Vigan,
C. Ginski,
M. Kenworthy,
D. Albert,
S. Bergeon,
J. -L. Beuzit,
B. Biller,
T. Bhowmik,
A. Boccaletti,
M. Bonavita
, et al. (95 additional authors not shown)
Abstract:
During the past decade, state-of-the-art planet-finder instruments like SPHERE@VLT, coupling coronagraphic devices and extreme adaptive optics systems, unveiled, thanks to large surveys, around 20 planetary mass companions at semi-major axis greater than 10 astronomical units. Direct imaging being the only detection technique to be able to probe this outer region of planetary systems, the SPHERE infrared survey for exoplanets (SHINE) was designed and conducted from 2015 to 2021 to study the demographics of such young gas giant planets around 400 young nearby solar-type stars. In this paper, we present the observing strategy, the data quality, and the point sources analysis of the full SHINE statistical sample as well as snapSHINE. Both surveys used the SPHERE@VLT instrument with the IRDIS dual band imager in conjunction with the integral field spectrograph IFS and the angular differential imaging observing technique. All SHINE data (650 datasets), corresponding to 400 stars, including the targets of the F150 survey, are processed in a uniform manner with an advanced post-processing algorithm called PACO ASDI. An emphasis is put on the classification and identification of the most promising candidate companions. Compared to the previous early analysis SHINE F150, the use of advanced post-processing techniques significantly improved by one or 2 magnitudes (x3-x6) the contrast detection limits, which will allow us to put even tighter constraints on the radial distribution of young gas giants. This increased sensitivity directly places SHINE as the largest and deepest direct imaging survey ever conducted. We detected and classified more than 3500 physical sources. One additional substellar companion has been confirmed during the second phase of the survey (HIP 74865 B), and several new promising candidate companions are awaiting second epoch confirmations.
Submitted 21 January, 2025;
originally announced January 2025.
-
Digital N-of-1 Trials and their Application in Experimental Physiology
Authors:
Stefan Konigorski,
Mathias Ried-Larsen,
Christopher H Schmid
Abstract:
Traditionally, studies in experimental physiology have been conducted in small groups of human participants, animal models or cell lines. Identifying optimal study designs that achieve sufficient power for drawing proper statistical inferences to detect group level effects with small sample sizes has been challenging. Moreover, average effects derived from traditional group-level inference do not necessarily apply to individual participants. Here, we introduce N-of-1 trials as an innovative study design that can be used to draw valid statistical inference about the effects of interventions on individual participants and can be aggregated across multiple study participants to provide population-level inferences more efficiently than standard group randomized trials. In this manuscript, we introduce the key components and design features of N-of-1 trials, describe statistical analysis and interpretations of the results, and describe some available digital tools to facilitate their use using examples from experimental physiology.
Submitted 22 February, 2025; v1 submitted 19 December, 2024;
originally announced December 2024.
-
Meta-analysis models relaxing the random effects normality assumption: methodological systematic review and simulation study
Authors:
Kanella Panagiotopoulou,
Theodoros Evrenoglou,
Christopher H Schmid,
Silvia Metelli,
Anna Chaimani
Abstract:
Random effects meta-analysis is widely used for synthesizing studies under the assumption that underlying effects come from a normal distribution. However, under certain conditions the use of alternative distributions might be more appropriate. We conducted a systematic review to identify articles introducing alternative meta-analysis models assuming non-normal between-study distributions. We identified 27 eligible articles suggesting 24 alternative meta-analysis models based on long-tail and skewed distributions, on mixtures of distributions, and on Dirichlet process priors. Subsequently, we performed a simulation study to evaluate the performance of these models and to compare them with the standard normal model. We considered 22 scenarios varying the amount of between-study variance, the shape of the true distribution, and the number of included studies. We compared 15 models implemented in the Frequentist or in the Bayesian framework. We found small differences with respect to bias between the different models but larger differences in the level of coverage probability. In scenarios with large between-study variance, all models were substantially biased in the estimation of the mean treatment effect. This implies that focusing only on the mean treatment effect of random effects meta-analysis can be misleading when substantial heterogeneity is suspected or outliers are present.
Submitted 17 December, 2024;
originally announced December 2024.
-
Multiwavelength high-resolution polarimetric imaging of second-generation disc around post-AGB binary IRAS 08544-4431 with SPHERE
Authors:
Kateryna Andrych,
Devika Kamath,
Hans Van Winckel,
Jacques Kluska,
Hans Martin Schmid,
Akke Corporaal,
Julien Milli
Abstract:
Exploring the formation and evolution of second-generation circumbinary discs around evolved binary stars, such as post-Asymptotic Giant Branch (post-AGB) and post-Red Giant Branch (post-RGB) binaries, provides valuable insights into the complex binary interaction process that concludes the red-giant phase of evolution in these systems. Additionally, it offers a novel opportunity to investigate the formation of second-generation planets within dusty discs surrounding evolved stars. We present a pilot multi-wavelength polarimetric imaging study of the post-AGB binary system IRAS 08544-4431 using the European Southern Observatory-Very Large Telescope/SPHERE instrument. This study is focused on optical V- and I'-band ZIMPOL data to complement near-infrared H-band IRDIS data presented previously. The study aims to investigate the dust scattering properties and surface morphology of the post-AGB circumbinary disc as a function of wavelength. We successfully resolved the extended disc structure of IRAS 08544-4431, revealing a complex disc morphology, high polarimetric disc brightness (up to ~1.5%), and significant forward scattering at optical wavelengths. Additionally, we found that the disc shows a grey polarimetric colour in both optical and near-infrared. The findings highlight similarities between post-AGB circumbinary discs and protoplanetary discs, suggesting submicron-size porous aggregates as the dominant surface dust composition, and indicating potential warping within the disc. However, further expansion of the multi-wavelength analysis to a larger sample of post-AGB binary systems, as well as high-resolution observations of dust continuum and gas emission, is necessary to fully explore the underlying structure of post-AGB circumbinary discs and associated physical mechanisms.
Submitted 25 November, 2024;
originally announced November 2024.
-
OrigamiPlot: An R Package and Shiny Web App Enhanced Visualizations for Multivariate Data
Authors:
Yiwen Lu,
Jiayi Tong,
Yuqing Lei,
Alex J. Sutton,
Haitao Chu,
Lisa D. Levine,
Thomas Lumley,
David A. Asch,
Rui Duan,
Christopher H. Schmid,
Yong Chen
Abstract:
We introduce OrigamiPlot, an open-source R package and Shiny web application designed to enhance the visualization of multivariate data. This package implements the origami plot, a novel visualization technique proposed by Duan et al. in 2023, which improves upon traditional radar charts by ensuring that the area of the connected region is invariant to the ordering of attributes, addressing a key limitation of radar charts. The software facilitates multivariate decision-making by supporting comparisons across multiple objects and attributes, offering customizable features such as auxiliary axes and weighted attributes for enhanced clarity. Through the R package and user-friendly Shiny interface, researchers can efficiently create and customize plots without requiring extensive programming knowledge. Demonstrated using network meta-analysis as a real-world example, OrigamiPlot proves to be a versatile tool for visualizing multivariate data across various fields. This package opens new opportunities for simplifying decision-making processes with complex data.
Submitted 19 November, 2024;
originally announced November 2024.
-
Large Language Models as Neurolinguistic Subjects: Discrepancy in Performance and Competence for Form and Meaning
Authors:
Linyang He,
Ercong Nie,
Helmut Schmid,
Hinrich Schütze,
Nima Mesgarani,
Jonathan Brennan
Abstract:
This study investigates the linguistic understanding of Large Language Models (LLMs) regarding signifier (form) and signified (meaning) by distinguishing two LLM assessment paradigms: psycholinguistic and neurolinguistic. Traditional psycholinguistic evaluations often reflect statistical rules that may not accurately represent LLMs' true linguistic competence. We introduce a neurolinguistic approach, utilizing a novel method that combines minimal pair and diagnostic probing to analyze activation patterns across model layers. This method allows for a detailed examination of how LLMs represent form and meaning, and whether these representations are consistent across languages. We found: (1) Psycholinguistic and neurolinguistic methods reveal that language performance and competence are distinct; (2) Direct probability measurement may not accurately assess linguistic competence; (3) Instruction tuning does little to change competence but improves performance; (4) LLMs exhibit higher competence and performance in form compared to meaning. Additionally, we introduce new conceptual minimal pair datasets for Chinese (COMPS-ZH) and German (COMPS-DE), complementing existing English datasets.
Submitted 25 February, 2025; v1 submitted 11 November, 2024;
originally announced November 2024.
-
A generalization of the second Pappus-Guldin theorem
Authors:
Harald Schmid
Abstract:
This paper deals with the question of how to calculate the volume of a body in the three-dimensional Euclidean space when it is cut into slices perpendicular to a given curve. The answer is provided by a formula that can be considered as a generalized version of the second Pappus-Guldin theorem. It turns out that the computation becomes very simple if the curve passes directly through the centroids of the perpendicular cross-sections. In this context, the question arises whether a curve with this centroid property exists. We investigate this problem for a convex body $K$ by using the volume distance and certain features of the so-called floating bodies of $K$. As an example, we further determine the non-trivial centroid curves of a triaxial ellipsoid, and finally we apply our results to derive a rather simple formula for determining the centroid of a bent rod.
Submitted 7 November, 2024;
originally announced November 2024.
-
Temporal and chromatic variation of polarized scattered light in the outer disk of PDS 70
Authors:
J. Ma,
C. Ginski,
R. Tazaki,
C. Dominik,
H. M. Schmid,
F. Ménard
Abstract:
PDS 70 is a unique system as it hosts a protoplanetary disk with two confirmed forming planets, making it an ideal target for characterizing dust in such disks. We present new high-contrast polarimetric differential imaging of PDS 70 using the $N_R$ filter on SPHERE/ZIMPOL, combined with archival VLT/SPHERE data across five wavelengths ($N_R$, $VBB$, $J$, $H$, and $Ks$) spanning seven epochs over eight years. For each epoch, we corrected smearing effects from instrument resolution, analyzed azimuthal brightness profiles, and derived intrinsic disk-integrated polarized reflectivity and brightness contrasts. Our analysis reveals significant temporal variability in both integrated polarized reflectivity and azimuthal brightness profiles, suggesting variable shadowing on the outer disk from unresolved inner disk structures. Nonetheless, we observe a systematic wavelength-dependent contrast between the near and far sides of the inclined disk, highlighting the need to consider shadowing from the inner disk and surface geometry of the reflecting disk in data interpretation.
Submitted 6 November, 2024;
originally announced November 2024.
-
Self-similar phase diagram of the Fibonacci-driven quantum Ising model
Authors:
Harald Schmid,
Yang Peng,
Gil Refael,
Felix von Oppen
Abstract:
We study a stroboscopic quantum Ising model with Fibonacci dynamics. Focusing on boundary spin correlation functions in long but finite chains, our simulations as well as analytical arguments reveal a self-similar phase diagram exhibiting regions with Majorana zero modes (MZM) as well as Majorana golden-ratio modes (MGM). We identify the self-similarity transform which governs the evolution of the phase diagram with increasing simulation time. Integrability-breaking perturbations lead to a temporal decay of the boundary spin correlations, ultimately limiting the self-similarity of the phase diagram. Our predictions are testable with current quantum information processors.
Submitted 23 October, 2024;
originally announced October 2024.
-
CUTE: Measuring LLMs' Understanding of Their Tokens
Authors:
Lukas Edman,
Helmut Schmid,
Alexander Fraser
Abstract:
Large Language Models (LLMs) show remarkable performance on a wide variety of tasks. Most LLMs split text into multi-character tokens and process them as atomic units without direct access to individual characters. This raises the question: To what extent can LLMs learn orthographic information? To answer this, we propose a new benchmark, CUTE, which features a collection of tasks designed to test the orthographic knowledge of LLMs. We evaluate popular LLMs on CUTE, finding that most of them seem to know the spelling of their tokens, yet fail to use this information effectively to manipulate text, calling into question how much of this knowledge is generalizable.
Submitted 2 October, 2024; v1 submitted 23 September, 2024;
originally announced September 2024.
-
Experimental demonstration of photonic phase correctors based on grating coupler arrays and thermo-optic shifters
Authors:
Momen Diab,
Ross Cheriton,
Jacob Taylor,
Dhwanil Patel,
Libertad Rojas,
Mark Barnet,
Polina Zavyalova,
Dan-Xia Xu,
Pavel Cheben,
Siegfried Janz,
Jens H. Schmid,
Suresh Sivanandam
Abstract:
In ground-based astronomy, the ability to couple light into single-mode fibers (SMFs) is limited by atmospheric turbulence, which prohibits the use of many astrophotonic instruments. We propose a silicon-on-insulator photonic chip capable of coherently coupling the out-of-phase beamlets from the subapertures of a telescope pupil into an SMF. The photonic integrated circuit (PIC) consists of an array of grating couplers that are used to inject light from free space into single-mode waveguides on the chip. Metallic heaters modulate the refractive index of a coiled section of the waveguides, facilitating the co-phasing of the propagating modes. The phased beamlets can then be coherently combined to efficiently deliver the light to an output SMF. In an adaptive optics (AO) system, the phase corrector acts as a deformable mirror (DM) commanded by a controller that takes phase measurements from a wavefront sensor (WFS). We present experimental results for the PIC tested on an AO testbed and compare the performance to simulations.
Submitted 14 August, 2024;
originally announced August 2024.
-
Retinomorphic Machine Vision in a Network Laser
Authors:
Wai Kit Ng,
Jakub Dranczewski,
Anna Fischer,
T V Raziman,
Dhruv Saxena,
Tobias Farchy,
Kilian Stenning,
Jonathan Peters,
Heinz Schmid,
Will R Branford,
Mauricio Barahona,
Kirsten Moselund,
Riccardo Sapienza,
Jack C. Gartside
Abstract:
With the growing prevalence of AI, demand increases for efficient machine learning hardware. Physical systems are sought which combine image feature detection with the essential nonlinearity for tasks such as image classification. Existing physical hardware typically detects features linearly, then employs digital processing for nonlinear activation. Biological vision systems excel at nonlinear image processing. The retina detects features in ganglion cells via lateral inhibition, where cells nonlinearly compete for neuronal firing while suppressing neighbouring cells.
We present a bio-inspired 'retinomorphic' machine vision platform using an on-chip semiconductor network laser. The system detects multiple features in parallel via spatially-overlapping lasing modes, with integrated nonlinearity provided by antagonistic gain competition between modes - a photonic analogue of retinal inhibition. Parallel feature-detection enhances efficiency relative to feature-detection schemes which operate sequentially or via multiple device copies, with Si-compatible processing and a compact micron-scale footprint relative to existing mm-scale systems. We report 98.05% accuracy on MNIST-digits and 87.85% on Fashion-MNIST, with strong performance on short training datasets.
Submitted 2 August, 2024; v1 submitted 22 July, 2024;
originally announced July 2024.
-
End-to-end simulations of photonic phase correctors for adaptive optics systems
Authors:
Dhwanil Patel,
Momen Diab,
Ross Cheriton,
Jacob Taylor,
Libertad Rojas,
Martin Vachon,
Dan-Xia Xu,
Jens H. Schmid,
Pavel Cheben,
Siegfried Janz,
Suresh Sivanandam
Abstract:
Optical beams and starlight distorted by atmospheric turbulence can be corrected with adaptive optics systems to enable efficient coupling into single-mode fibers. Deformable mirrors, used to flatten the wavefront in astronomical telescopes, are costly, sensitive, and complex mechanical components that require careful calibration to enable high-quality imaging in astronomy, microscopy, and vision science. They are also impractical to deploy in large numbers for non-imaging applications like free-space optical communication. Here, we propose a photonic integrated circuit capable of spatially sampling the wavefront collected by the telescope and co-phasing the subapertures to maximize the flux delivered to an output single-mode fiber as the integrated photonic implementation of a deformable mirror. We present the results of end-to-end simulations to quantify the performance of the proposed photonic solution under varying atmospheric conditions toward realizing an adaptive optics system without a deformable mirror for free-space optical receivers.
Submitted 15 July, 2024;
originally announced July 2024.
-
BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning
Authors:
Ercong Nie,
Bo Shao,
Zifeng Ding,
Mingyang Wang,
Helmut Schmid,
Hinrich Schütze
Abstract:
This paper introduces BMIKE-53, a comprehensive benchmark for cross-lingual in-context knowledge editing (IKE) across 53 languages, unifying three knowledge editing (KE) datasets: zsRE, CounterFact, and WikiFactDiff. Cross-lingual KE, which requires knowledge edited in one language to generalize across others while preserving unrelated knowledge, remains underexplored. To address this gap, we systematically evaluate IKE under zero-shot, one-shot, and few-shot setups, incorporating tailored metric-specific demonstrations. Our findings reveal that model scale and demonstration alignment critically govern cross-lingual IKE efficacy, with larger models and tailored demonstrations significantly improving performance. Linguistic properties, particularly script type, strongly influence performance variation across languages, with non-Latin languages underperforming due to issues like language confusion. Code and data are publicly available at: https://github.com/ercong21/MultiKnow/.
Submitted 31 May, 2025; v1 submitted 25 June, 2024;
originally announced June 2024.
-
SPHERE RefPlanets: Search for epsilon Eridani b and warm dust
Authors:
C. Tschudi,
H. M. Schmid,
M. Nowak,
H. Le Coroller,
S. Hunziker,
R. G. van Holstein,
C. Perrot,
D. Mouillet,
J. -C. Augereau,
A. Bazzon,
J. L. Beuzit,
A. Boccaletti,
M. J. Bonse,
G. Chauvin,
S. Desidera,
K. Dohlen,
C. Dominik,
N. Engler,
M. Feldt,
J. H. Girard,
R. Gratton,
Th. Henning,
M. Kasper,
P. Kervella,
A. -M. Lagrange
, et al. (13 additional authors not shown)
Abstract:
We carried out very deep VLT/SPHERE imaging polarimetry of the nearby system Eps Eri based on 38.5 hours of integration time with a 600 - 900 nm broadband filter to search for polarized scattered light from a planet or from circumstellar dust using AO, coronagraphy, high precision differential polarimetry, and angular differential imaging. We have improved several data reduction and post-processing techniques and also developed new ones to further increase the sensitivity of SPHERE/ZIMPOL. The data provide unprecedented contrast limits, but no significant detection of a point source or an extended signal from circumstellar dust. For each observing epoch, we obtained a point source contrast for the polarized intensity between $2\cdot 10^{-8}$ and $4\cdot 10^{-8}$ at the expected separation of the planet Eps Eri b of 1'' near quadrature phase. The polarimetric contrast limits are about six to 50 times better than the intensity limits because polarimetric imaging is much more efficient in speckle suppression. Combining the entire 14-month data set to the search for a planet moving on a Keplerian orbit with the K-Stacker software further improves the contrast limits by a factor of about two, to about $8 \cdot 10^{-9}$ at 1''. This would allow the detection of a planet with a radius of about 2.5 Jupiter radii. The surface brightness contrast limits achieved for the polarized intensity from an extended scattering region are about 15 mag arcsec$^{-2}$ at 1'', or up to 3 mag arcsec$^{-2}$ deeper than previous limits. For Eps Eri, these limits exclude the presence of a narrow dust ring and they constrain the dust properties. This study shows that the polarimetric contrast limits for reflecting planets with SPHERE/ZIMPOL can be improved to a level $<10^{-8}$ simply by collecting more data over many nights and using the K-Stacker software.
Submitted 30 April, 2024;
originally announced April 2024.
-
Intrinsic negative magnetoresistance from the chiral anomaly of multifold fermions
Authors:
F. Balduini,
A. Molinari,
L. Rocchino,
V. Hasse,
C. Felser,
M. Sousa,
C. Zota,
H. Schmid,
A. G. Grushin,
B. Gotsmann
Abstract:
The chiral anomaly, a hallmark of chiral spin-1/2 Weyl fermions, is an imbalance between left- and right-moving particles that underpins both high and low energy phenomena, including particle decay and negative longitudinal magnetoresistance in Weyl semimetals. The discovery that chiral crystals can host higher-spin generalizations of Weyl quasiparticles without high-energy counterparts, known as multifold fermions, raises the fundamental question of whether the chiral anomaly is a more general phenomenon. Answering this question requires materials with chiral quasiparticles within a sizable energy window around the Fermi level, that are unaffected by trivial extrinsic effects such as current jetting. Here we report the chiral anomaly of multifold fermions in CoSi, which features multifold bands within about 0.85 eV around the Fermi level. By excluding current jetting through the squeezing test, we measure an intrinsic, longitudinal negative magnetoresistance. We develop the semiclassical theory of magnetotransport of multifold fermions that shows that the negative magnetoresistance originates in their chiral anomaly, despite a sizable and detrimental orbital magnetic moment contribution, previously unaccounted for. A concomitant nonlinear Hall effect supports the multifold-fermion origin of magnetotransport. Our work confirms the chiral anomaly of higher-spin generalizations of Weyl fermions, currently inaccessible outside the solid-state.
Submitted 30 April, 2024;
originally announced April 2024.
-
Wave-function engineering on superconducting substrates: Chiral Yu-Shiba-Rusinov molecules
Authors:
Lisa M. Rütten,
Harald Schmid,
Eva Liebhaber,
Giada Franceschi,
Ali Yazdani,
Gael Reecht,
Kai Rossnagel,
Felix von Oppen,
Katharina J. Franke
Abstract:
Magnetic adatoms on superconductors give rise to Yu-Shiba-Rusinov (YSR) states that hold considerable interest for the design of topological superconductivity. Here, we show that YSR states are also an ideal platform to engineer structures with intricate wave-function symmetries. We assemble structures of iron atoms on the quasi-two-dimensional superconductor $2H$-NbSe$_2$. The Yu-Shiba-Rusinov wave functions of individual atoms extend over several nanometers enabling hybridization even at large adatom spacing. We show that the substrate can be exploited to deliberately break symmetries of the adatom structure in ways unachievable in the gas phase. We highlight this potential by designing chiral wave functions of triangular adatom structures confined within a plane. Our results significantly expand the range of interesting quantum states that can be engineered using arrays of magnetic adatoms on superconductors.
Submitted 25 April, 2024;
originally announced April 2024.
-
Surface lattice resonance lasers with epitaxial InP gain medium
Authors:
Anna Fischer,
Toby Severs Millard,
Xiaofei Xiao,
T. V. Raziman,
Jakub Dranczewski,
Ross C. Schofield,
Heinz Schmid,
Kirsten Moselund,
Riccardo Sapienza,
Rupert Oulton
Abstract:
Surface lattice resonance (SLR) lasers, where gain is supplied by a thin film active material and the feedback comes from multiple scattering by plasmonic nanoparticles, have shown both low threshold lasing and tunability of the angular and spectral emission. However, typically used materials such as organic dyes and QD films suffer from photo-degradation which hampers practical applications. Here, we demonstrate photo-stable single-mode lasing of SLR modes sustained in an epitaxial solid-state InP slab waveguide. The nanoparticle array is weakly coupled to the optical modes, which decreases the scattering losses and hence the experimental lasing threshold is as low as 90 $μ$J/cm$^{2}$. The nanoparticle periodicity defines the lasing wavelength and enables tuneable emission wavelengths over a 70 nm spectral range. Combining plasmonic nanoparticles with an epitaxial solid-state gain medium paves the way for large-area on-chip integrated SLR lasers for applications including optical communication, optical computing, sensing, and LiDAR.
Submitted 11 March, 2024;
originally announced March 2024.
-
The SPHERE view of the Taurus star-forming region
Authors:
A. Garufi,
C. Ginski,
R. G. van Holstein,
M. Benisty,
C. F. Manara,
S. Pérez,
P. Pinilla,
Á. Ribas,
P. Weber,
J. Williams,
L. Cieza,
C. Dominik,
S. Facchini,
J. Huang,
A. Zurlo,
J. Bae,
J. Hagelberg,
Th. Henning,
M. R. Hogerheijde,
M. Janson,
F. Ménard,
S. Messina,
M. R. Meyer,
C. Pinte,
S. P. Quanz
, et al. (9 additional authors not shown)
Abstract:
The sample of planet-forming disks observed by high-contrast imaging campaigns over the last decade is mature enough to enable the demographical analysis of individual star-forming regions. We present the full census of Taurus sources with VLT/SPHERE polarimetric images available. The whole sample sums up to 43 targets (of which 31 have not been previously published) corresponding to one-fifth of the Class II population in Taurus and about half of such objects that are observable. A large fraction of the sample is apparently made up of isolated faint disks (equally divided between small and large self-shadowed disks). Ambient signal is visible in about one-third of the sample. This probes the interaction with the environment and with companions or the outflow activity of the system. The central portion of the Taurus region almost exclusively hosts faint disks, while the periphery also hosts bright disks interacting with their surroundings. The few bright disks are found around apparently older stars. The overall picture is that the Taurus region is in an early evolutionary stage of planet formation. Yet, some objects are discussed individually, as in an intermediate or exceptional stage of the disk evolution. This census provides a first benchmark for the comparison of the disk populations in different star forming regions.
Submitted 4 March, 2024;
originally announced March 2024.
-
Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models
Authors:
Ercong Nie,
Shuzhou Yuan,
Bolei Ma,
Helmut Schmid,
Michael Färber,
Frauke Kreuter,
Hinrich Schütze
Abstract:
Despite the predominance of English in their training data, English-centric Large Language Models (LLMs) like GPT-3 and LLaMA display a remarkable ability to perform multilingual tasks, raising questions about the depth and nature of their cross-lingual capabilities. This paper introduces the decomposed prompting approach to probe the linguistic structure understanding of these LLMs in sequence labeling tasks. Diverging from the single text-to-text prompt, our method generates for each token of the input sentence an individual prompt which asks for its linguistic label. We assess our method on the Universal Dependencies part-of-speech tagging dataset for 38 languages, utilizing both English-centric and multilingual LLMs. Our findings show that decomposed prompting surpasses the iterative prompting baseline in efficacy and efficiency under zero- and few-shot settings. Further analysis reveals the influence of evaluation methods and the use of instructions in prompts. Our multilingual investigation shows that English-centric language models perform better on average than multilingual models. Our study offers insights into the multilingual transferability of English-centric LLMs, contributing to the understanding of their multilingual linguistic knowledge.
Submitted 28 February, 2024;
originally announced February 2024.
-
GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network
Authors:
Shuzhou Yuan,
Ercong Nie,
Michael Färber,
Helmut Schmid,
Hinrich Schütze
Abstract:
Large Language Models (LLMs) exhibit strong In-Context Learning (ICL) capabilities when prompts with demonstrations are used. However, fine-tuning still remains crucial to further enhance their adaptability. Prompt-based fine-tuning proves to be an effective fine-tuning method in low-data scenarios, but high demands on computing resources limit its practicality. We address this issue by introducing a prompt-based parameter-efficient fine-tuning (PEFT) approach. GNNavi leverages insights into ICL's information flow dynamics, which indicates that label words act in prompts as anchors for information propagation. GNNavi employs a Graph Neural Network (GNN) layer to precisely guide the aggregation and distribution of information flow during the processing of prompts by hardwiring the desired information flow into the GNN. Our experiments on text classification tasks with GPT-2 and Llama2 show GNNavi surpasses standard prompt-based fine-tuning methods in few-shot settings by updating just 0.2% to 0.5% of parameters. We compare GNNavi with prevalent PEFT approaches, such as prefix tuning, LoRA and Adapter in terms of performance and efficiency. Our analysis reveals that GNNavi enhances information flow and ensures a clear aggregation process.
Submitted 7 June, 2024; v1 submitted 18 February, 2024;
originally announced February 2024.
-
On the eigenvalues of the spheroidal wave equation
Authors:
Harald Schmid
Abstract:
This paper presents some new results on the eigenvalues of the spheroidal wave equation. We study the angular and Coulomb spheroidal wave equation as a special case of a more general linear Hamiltonian system depending on three parameters. We prove that the eigenvalues of this system satisfy a first-order quasilinear partial differential equation with respect to the parameters. This relation offers a new insight on how the eigenvalues of the spheroidal wave equation depend on the spheroidal parameter. Apart from analytical considerations, the PDE we obtain can also be used for a numerical computation of spheroidal eigenvalues.
Submitted 11 February, 2024;
originally announced February 2024.
-
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks
Authors:
Bolei Ma,
Ercong Nie,
Shuzhou Yuan,
Helmut Schmid,
Michael Färber,
Frauke Kreuter,
Hinrich Schütze
Abstract:
Prompt-based methods have been successfully applied to multilingual pretrained language models for zero-shot cross-lingual understanding. However, most previous studies primarily focused on sentence-level classification tasks, and only a few considered token-level labeling tasks such as Named Entity Recognition (NER) and Part-of-Speech (POS) tagging. In this paper, we propose Token-Level Prompt Decomposition (ToPro), which facilitates the prompt-based method for token-level sequence labeling tasks. The ToPro method decomposes an input sentence into single tokens and applies one prompt template to each token. Our experiments on multilingual NER and POS tagging datasets demonstrate that ToPro-based fine-tuning outperforms Vanilla fine-tuning and Prompt-Tuning in zero-shot cross-lingual transfer, especially for languages that are typologically different from the source language English. Our method also attains state-of-the-art performance when employed with the mT5 model. In addition, our exploratory study on multilingual large language models shows that ToPro performs much better than the current in-context learning method. Overall, the performance improvements show that ToPro could potentially serve as a novel and simple benchmarking method for sequence labeling tasks.
Submitted 13 March, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
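The decomposition step described in the ToPro abstract (one prompt per token, with the same template applied to each) can be illustrated with a minimal sketch. The template wording, the mask string, and the function name below are hypothetical and chosen only for illustration; the abstract does not specify them.

```python
from typing import List

# Hypothetical template; the abstract does not give the actual wording.
TEMPLATE = 'Sentence: {sentence} The tag of "{token}" is {mask}'


def decompose_into_prompts(tokens: List[str], mask: str = "<mask>") -> List[str]:
    """Build one prompt per token, each carrying the full sentence as context."""
    sentence = " ".join(tokens)
    return [
        TEMPLATE.format(sentence=sentence, token=tok, mask=mask)
        for tok in tokens
    ]


prompts = decompose_into_prompts(["Anna", "visits", "Paris"])
for prompt in prompts:
    print(prompt)
```

A sentence of n tokens yields n prompts, so a model that fills the mask position can label each token independently.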
-
Probing the shape of the Weyl Fermi surface of NbP using transverse electron focusing
Authors:
F. Balduini,
L. Rocchino,
A. Molinari,
T. Paul,
G. Mariani,
V. Hasse,
C. Felser,
C. Zota,
H. Schmid,
B. Gotsmann
Abstract:
The topology of the Fermi surface significantly influences the transport properties of a material. First measured through quantum oscillation experiments, the Fermi surfaces of crystals are now commonly characterized using angle-resolved photoemission spectroscopy (ARPES), given the larger information volume it provides. In the case of Weyl semimetals, ARPES has proven remarkably successful in verifying the existence of the Weyl points and the Fermi arcs, which define a Weyl Fermi surface. However, ARPES is limited in resolution, leading to significant uncertainty when measuring relevant features such as the distance between the Weyl points. While quantum oscillation measurements offer higher resolution, they do not reveal insights into the cross-sectional shape of a Fermi surface. Moreover, both techniques lack critical information about transport, like the carriers' mean free path. Here, we report measurements unveiling the distinctive peanut-shaped cross-section of the Fermi surface of Weyl fermions and accurately determine the separation between Weyl points in the Weyl semimetal NbP. To surpass the resolution of ARPES, we combine quantum oscillation measurements with transverse electron focusing (TEF) experiments, conducted on microstructured single crystals. The TEF spectrum relates to the Fermi surface shape, while the frequency of the quantum oscillations relates to its area. Together, these techniques offer complementary information, enabling the reconstruction of the distinctive Weyl Fermi surface geometry. Concurrently, we extract the electrical transport properties of the bulk Weyl fermions. Our work showcases the integration of quantum oscillations and transverse electron focusing in a single experiment, allowing for the measurement of complex Fermi surface geometries in high-mobility quantum materials.
Submitted 19 April, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
Robust spectral $π$ pairing in the random-field Floquet quantum Ising model
Authors:
Harald Schmid,
Alexander-Georg Penner,
Kang Yang,
Leonid Glazman,
Felix von Oppen
Abstract:
Motivated by an experiment on a superconducting quantum processor [Mi et al., Science 378, 785 (2022)], we study level pairings in the many-body spectrum of the random-field Floquet quantum Ising model. The pairings derive from Majorana zero and $π$ modes when writing the spin model in Jordan-Wigner fermions. Both splittings have lognormal distributions with random transverse fields. In contrast, random longitudinal fields affect the zero and $π$ splittings in drastically different ways. While zero pairings are rapidly lifted, the $π$ pairings are remarkably robust, or even strengthened, up to vastly larger disorder strengths. We explain our results within a self-consistent Floquet perturbation theory and study implications for boundary spin correlations. The robustness of $π$ pairings against longitudinal disorder may be useful for quantum information processing.
Submitted 9 January, 2024;
originally announced January 2024.
-
Color measurements of the polarized light scattered by the dust in protoplanetary disks
Authors:
J. Ma,
H. M. Schmid,
T. Stolker
Abstract:
Ground-based high-contrast instruments have yielded reflected light images of protoplanetary disks. Quantitative measurements of the reflected radiation provide strong constraints on the scattering dust which can clarify the dust particle evolution in these disks and the composition of the forming planets. This study aimed to derive the wavelength dependence of polarized reflectivity $(\hat{Q}_{\varphi}/I_\star)_λ$ for 11 disks, constraining dust properties and identifying systematic differences. Using ESO archive data from SPHERE/ZIMPOL and SPHERE/IRDIS instruments, we obtained accurate intrinsic polarized reflectivity $\hat{Q}_\varphi/I_\star$ values at wavelengths from 0.62$μ$m to 2.2$μ$m.
Polarized reflectivities ranged from $Q_\varphi/I_\star\approx 0.1\%$ to 1.0$\%$, with PSF-corrected values averaging 1.6 times higher than observed. Accurate PSF calibrations reduced systematic errors to $Δ\hat{Q}_\varphi/\hat{Q}_\varphi\approx 10\%$ or less. For each disk, we derived a polarized reflectivity color $η_{\rm V/IR}$ between a visible band $λ<1~μ$m and a near-IR band $λ>1~μ$m and other wavelength combinations. Wavelength gradients $η$ varied significantly among objects. Disks around Herbig stars (HD 169142, HD 135334B, HD 100453, MWC 758, and HD 142527) showed a red color $η_{\rm V/IR}>0.5$, suggesting rather compact dust grains. T-Tauri star disks (PDS 70, TW Hya, RX J1615, and PDS 66) were predominantly gray $-0.5<η_{\rm V/IR}<0.5$, with an absence of blue colors incompatible with porous aggregates. Exceptional red colors for LkCa 15 and MWC 758 were attributed to potential extra reddening from hot dust near the star. Future studies incorporating parameters like fractional polarization $\langle p_\varphi \rangle$ hold promise for advancing our understanding of dust properties within protoplanetary disks.
Submitted 21 December, 2023;
originally announced December 2023.
-
The polarisation properties of the HD 181327 debris ring. Evidence for sub-micron particles from scattered light observations
Authors:
Julien Milli,
Elodie Choquet,
Ryo Tazaki,
François Ménard,
Jean-Charles Augereau,
Johan Olofsson,
Philippe Thébault,
Olivier Poch,
Anny-Chantal Levasseur-Regourd,
Jérémie Lasue,
Jean-Baptiste Renard,
Edith Hadamcik,
Clément Baruteau,
Hans Martin Schmid,
Natalia Engler,
Rob G. van Holstein,
Evgenij Zubko,
Anne-Marie Lagrange,
Sebastian Marino,
Christophe Pinte,
Carsten Dominik,
Anthony Boccaletti,
Maud Langlois,
Alice Zurlo,
Célia Desgrange
, et al. (4 additional authors not shown)
Abstract:
Polarisation is a powerful remote-sensing tool to study the nature of particles scattering the starlight. It is widely used to characterise interplanetary dust particles in the Solar System and increasingly employed to investigate extrasolar dust in debris disc systems. We aim to measure the scattering properties of the dust from the debris ring around HD 181327 at near-infrared wavelengths. We obtained high-contrast polarimetric images of HD 181327 in the H band with the SPHERE / IRDIS instrument on the Very Large Telescope (ESO). We complemented them with archival data from HST / NICMOS in the F110W filter reprocessed in the context of the Archival Legacy Investigations of Circumstellar Environments (ALICE) project. We developed a combined forward-modelling framework to simultaneously retrieve the scattering phase function in polarisation and intensity. We detected the debris disc around HD 181327 in polarised light and total intensity. We measured the scattering phase function and the degree of linear polarisation of the dust at 1.6 micron in the birth ring. The maximum polarisation is 23.6% +/- 2.6% and occurs between scattering angles of 70 deg and 82 deg. We show that compact spherical particles made of a highly refractive and relatively absorbing material in a differential power-law size distribution of exponent $-3.5$ can simultaneously reproduce the polarimetric and total intensity scattering properties of the dust. This type of material cannot be obtained with a mixture of silicates, amorphous carbon, water ice, and porosity, and requires a more refracting component such as iron-bearing minerals. We reveal a striking analogy between the near-infrared polarisation of comets and that of HD 181327. The methodology developed here combining VLT/SPHERE and HST/NICMOS may be applicable in the future to combine the polarimetric capabilities of SPHERE with the sensitivity of JWST.
Submitted 4 December, 2023;
originally announced December 2023.
-
Perfectly vertical silicon metamaterial grating couplers with large segmentation periods up to 650 nm
Authors:
Jianhao Zhang,
Daniele Melati,
Yuri Grinberg,
Martin Vachon,
Shurui Wang,
Muhammad Al-Digeil,
Siegfried Janz,
Jens H. Schmid,
Pavel Cheben,
Dan-Xia Xu
Abstract:
Perfectly vertical grating couplers leveraging metamaterials can achieve both high coupling efficiency and minimal back reflection. The fabricability of these designs, with segmentations in both the longitudinal and transverse dimensions, hinges on the minimum feature size offered by cutting-edge fabrication technologies. In this work we present both numerical and experimental evidence that high performance devices can be obtained while using large transverse segmentation periods of up to 650 nm, thereby increasing the critical feature sizes. For single-step etched couplers produced on the 220 nm silicon-on-insulator platform, we demonstrate coupling efficiencies of nearly 50% in the C-band and remarkably low back reflections of -22 dB at zero-degree incidence angle. Notably, the duty cycles used in our optimized designs deviate significantly from those predicted by traditional effective medium models, even for small periods. Our findings promise to expand the range of optical properties achievable in metamaterials and offer fresh insights into the fine-tuning of nanophotonic devices.
Submitted 1 November, 2024; v1 submitted 18 November, 2023;
originally announced November 2023.
-
Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration
Authors:
Ercong Nie,
Helmut Schmid,
Hinrich Schütze
Abstract:
Pretrained multilingual encoder models can directly perform zero-shot multilingual tasks or linguistic probing by reformulating the input examples into cloze-style prompts. This is accomplished by predicting the probabilities of the label words at the masked token position, without requiring any updates to the model parameters. However, the performance of this method is limited by the model's bias toward predicting label words that occurred frequently during pretraining. These words typically receive high probabilities. To address this issue, we combine the models with calibration techniques which modify the probabilities of label words predicted by the models. We first validate the effectiveness of a proposed simple calibration method together with other existing techniques on monolingual encoders in both zero- and few-shot scenarios. We subsequently employ these calibration techniques on multilingual encoders, resulting in substantial performance improvements across a wide range of tasks.
Submitted 19 October, 2023; v1 submitted 8 October, 2023;
originally announced October 2023.
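To make the idea of calibrating label-word probabilities concrete, here is a minimal sketch of one common style of calibration: dividing each label word's predicted probability by a prior estimate of how strongly the model favors that word, then renormalizing. This is an assumption for illustration and not necessarily the method proposed in the paper; the label words, the numbers, and the function name are hypothetical.

```python
from typing import Dict


def calibrate(probs: Dict[str, float], prior: Dict[str, float]) -> Dict[str, float]:
    """Divide each label-word probability by its prior and renormalize."""
    scores = {word: probs[word] / prior[word] for word in probs}
    total = sum(scores.values())
    return {word: score / total for word, score in scores.items()}


# Illustrative numbers only: the model prefers "good" for this input,
# but it also prefers "good" in general (high prior).
raw = {"good": 0.60, "bad": 0.40}
prior = {"good": 0.80, "bad": 0.20}

calibrated = calibrate(raw, prior)
print(calibrated)
```

In this toy case the uncalibrated prediction is "good", while the calibrated prediction is "bad", because the raw preference for "good" is weaker than the model's general bias toward that word.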
-
Optical wavefront phase-tilt measurement using Si-photonic waveguide grating couplers
Authors:
Siegfried Janz,
Dan-Xia Xu,
Yuri Grinberg,
Shurui Wang,
Martin Vachon,
Pavel Cheben,
Jens H. Schmid,
Daniele Melati
Abstract:
Silicon photonic wavefront phase-tilt sensors for wavefront monitoring using surface coupling grating arrays are demonstrated. The first design employs the intrinsic angle dependence of the grating coupling efficiency to determine local wavefront tilt, with a measured sensitivity of 7 dB/degree. A second design connects four gratings in an interferometric waveguide circuit to determine incident wavefront phase variation across the sensor area. In this device, one fringe spacing corresponds to a wavefront tilt change of approximately 2 degrees. These sensor elements can sample a wavefront incident on the chip surface without the use of bulk optic elements, fiber arrays, or imaging arrays. Both sensor elements are less than 60 µm across, and can be combined into larger arrays to monitor wavefront tilt and distortion across an image or pupil plane in adaptive optics systems for free space optical communications, astronomy and beam pointing applications.
Submitted 20 September, 2023;
originally announced September 2023.
-
Single mode laser in the telecom range by deterministic amplification of the topological interface mode
Authors:
Markus Scherrer,
Chang-Won Lee,
Heinz Schmid,
Kirsten E. Moselund
Abstract:
Photonic integrated circuits are paving the way for novel on-chip functionalities with diverse applications in communication, computing, and beyond. The integration of on-chip light sources, especially single-mode lasers, is crucial for advancing those photonic chips to their full potential. Recently, novel concepts involving topological designs introduced a variety of options for tuning device properties such as the desired single-mode emission. Here we introduce a novel cavity design that allows amplification of the topological interface mode by deterministic placement of gain material within the topological lattice. The proposed design is experimentally implemented by a selective epitaxy process resulting in Si and InGaAs nanorods embedded within the same topological lattice. This results in the first demonstration of a single-mode laser in the telecom band using the concept of amplified topological modes.
Submitted 12 September, 2023;
originally announced September 2023.
-
On the connection coefficients for linear differential systems with applications to the spheroidal and ellipsoidal wave equation
Authors:
Harald Schmid
Abstract:
This paper is concerned with the connection coefficients between the local fundamental solutions of a $2\times 2$ linear ordinary differential system with two neighboring regular singular points at $z=0$ and $z=1$. We derive an asymptotic formula for the connection coefficients which can be used for numerical calculations and, in particular, for determining the eigenvalues of some spectral problems arising in mathematical physics. As an application, new algorithms for computing the eigenvalues of the ellipsoidal wave equation and the spheroidal wave equation are presented.
Submitted 12 August, 2023;
originally announced August 2023.
-
Cross-Lingual Constituency Parsing for Middle High German: A Delexicalized Approach
Authors:
Ercong Nie,
Helmut Schmid,
Hinrich Schütze
Abstract:
Constituency parsing plays a fundamental role in advancing natural language processing (NLP) tasks. However, training an automatic syntactic analysis system for ancient languages solely relying on annotated parse data is a formidable task due to the inherent challenges in building treebanks for such languages. It demands extensive linguistic expertise, leading to a scarcity of available resources. To overcome this hurdle, cross-lingual transfer techniques which require minimal or even no annotated data for low-resource target languages offer a promising solution. In this study, we focus on building a constituency parser for $\mathbf{M}$iddle $\mathbf{H}$igh $\mathbf{G}$erman ($\mathbf{MHG}$) under realistic conditions, where no annotated MHG treebank is available for training. In our approach, we leverage the linguistic continuity and structural similarity between MHG and $\mathbf{M}$odern $\mathbf{G}$erman ($\mathbf{MG}$), along with the abundance of MG treebank resources. Specifically, by employing the $\mathit{delexicalization}$ method, we train a constituency parser on MG parse datasets and perform cross-lingual transfer to MHG parsing. Our delexicalized constituency parser demonstrates remarkable performance on the MHG test set, achieving an F1-score of 67.3%. It outperforms the best zero-shot cross-lingual baseline by a margin of 28.6 percentage points. These encouraging results underscore the practicality and potential for automatic syntactic analysis in other ancient languages that face challenges similar to those of MHG.
Submitted 29 August, 2023; v1 submitted 8 August, 2023;
originally announced August 2023.
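The delexicalization idea mentioned in the abstract above can be sketched as follows: word forms are replaced by part-of-speech tags, so that a parser trained on Modern German tag sequences never sees language-specific vocabulary. The example sentences, the tag names, and the function name are assumptions for illustration; the abstract does not state which tag set or preprocessing the authors use.

```python
from typing import List, Tuple

TaggedSentence = List[Tuple[str, str]]  # (word form, POS tag) pairs


def delexicalize(tagged: TaggedSentence) -> List[str]:
    """Drop the word forms and keep only the POS tags."""
    return [pos for _word, pos in tagged]


# Hypothetical, hand-tagged toy examples with assumed coarse tags.
modern_german = [("der", "DET"), ("Ritter", "NOUN"), ("reitet", "VERB")]
middle_high_german = [("der", "DET"), ("ritter", "NOUN"), ("rîtet", "VERB")]

print(delexicalize(modern_german))
print(delexicalize(middle_high_german))
```

After delexicalization both toy sentences map to the same tag sequence, which is what lets a parser trained on one language be applied to the other.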
-
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding
Authors:
Bolei Ma,
Ercong Nie,
Helmut Schmid,
Hinrich Schütze
Abstract:
Multilingual pretrained language models (MPLMs) have demonstrated substantial performance improvements in zero-shot cross-lingual transfer across various natural language understanding tasks by finetuning MPLMs on task-specific labelled data of a source language (e.g. English) and evaluating on a wide range of target languages. Recent studies show that prompt-based finetuning surpasses regular finetuning in few-shot scenarios. However, the exploration of prompt-based learning in multilingual tasks remains limited. In this study, we propose the ProFiT pipeline to investigate the cross-lingual capabilities of Prompt-based Finetuning. We conduct comprehensive experiments on diverse cross-lingual language understanding tasks (sentiment classification, paraphrase identification, and natural language inference) and empirically analyze the variation trends of prompt-based finetuning performance in cross-lingual transfer across different few-shot and full-data settings. Our results reveal the effectiveness and versatility of prompt-based finetuning in cross-lingual language understanding. Our findings indicate that prompt-based finetuning outperforms vanilla finetuning in full-data scenarios and exhibits greater advantages in few-shot scenarios, with different performance patterns dependent on task types. Additionally, we analyze underlying factors such as language similarity and pretraining data size that impact the cross-lingual performance of prompt-based finetuning. Overall, our work provides valuable insights into the cross-lingual prowess of prompt-based finetuning.
Submitted 15 July, 2023;
originally announced July 2023.
-
Controlling lasing around Exceptional Points in Coupled Nanolasers
Authors:
Anna Fischer,
T. V. Raziman,
Wai Kit Ng,
Jente Clarysse,
Jakub Dranczewski,
Dhruv Saxena,
Stefano Vezzoli,
Heinz Schmid,
Kirsten Moselund,
Riccardo Sapienza
Abstract:
Coupled nanolasers are of growing interest for on-chip optical computation and data transmission, which requires an understanding of how lasers interact to form complex systems. The non-Hermitian interaction between two coupled resonators, when excited selectively, can lead to parity-time symmetry, the formation of exceptional points, and subsequently spectral control and increased sensitivity. These investigations have been limited to pump energies close to the lasing threshold, and large or narrow-line lasers. Here, by programmable optical excitation we study two coupled nanolasers significantly above threshold, where mode instability plays an important role. We map the mode evolution around two exceptional points, and observe lasing gaps due to reversed pump dependence which compare well with nonlinear theory. Finally, the coupling can be exploited to control the lasing threshold and wavelength, and for frequency switching around the lasing gap. Controlled and integrated nanolasers constitute a promising platform for future highly sensitive and programmable on-chip laser sources.
Submitted 28 June, 2023;
originally announced June 2023.
-
Instrumental Variable Approach to Estimating Individual Causal Effects in N-of-1 Trials: Application to ISTOP Study
Authors:
Kexin Qu,
Christopher H. Schmid,
Tao Liu
Abstract:
An N-of-1 trial is a multiple crossover trial conducted in a single individual to provide evidence to directly inform personalized treatment decisions. Advancements in wearable devices have greatly improved the feasibility of adopting these trials to identify optimal individual treatment plans, particularly when treatments differ among individuals and responses are highly heterogeneous. Our work was motivated by the I-STOP-AFib Study, which examined the impact of different triggers on atrial fibrillation (AF) occurrence. We described a causal framework for N-of-1 trials using potential treatment selection paths and potential outcome paths. Two estimands of individual causal effect were defined: (a) the effect of continuous exposure, and (b) the effect of an individual observed behavior. We addressed three challenges: (a) imperfect compliance to the randomized treatment assignment; (b) binary treatments and binary outcomes which led to the 'non-collapsibility' issue of estimating odds ratios; and (c) serial inference in the longitudinal observations. We adopted the Bayesian IV approach where the study randomization was the IV as it impacted the choice of exposure of a subject but not directly the outcome. Estimations were through a system of two parametric Bayesian models to estimate the individual causal effect. Our model got around the non-collapsibility and non-consistency by modeling the confounding mechanism through latent structural models and by inferring with Bayesian posterior of functionals. Autocorrelation present in the repeated measurements was also accounted for. The simulation study showed our method largely reduced bias and greatly improved the coverage of the estimated causal effect, compared to existing methods (ITT, PP, and AT). We applied the method to the I-STOP-AFib Study to estimate the individual effect of alcohol on AF occurrence.
Submitted 12 October, 2024; v1 submitted 24 June, 2023;
originally announced June 2023.
-
Quantitative polarimetry for the transition disk in RX J1604.3-213010
Authors:
Jie Ma,
Hans Martin Schmid,
Christian Tschudi
Abstract:
The bright disk of RX J1604 has a very simple axisymmetric structure and is well suited as a benchmark object for accurate photo-polarimetric measurements. We used archival data of RX J1604 from the ESO archive and carefully corrected the polarization signal for instrumental effects, also taking the interstellar polarization into account. We derive accurate radial disk profiles for the intrinsic polarized intensity, ${\hat{Q}}_{\varphi}(r)/I_{\star}$, and measure different profile peak radii for different bands because of the wavelength dependence of the dust opacity. The disk-integrated polarization is $\hat{Q}_{\varphi}/I_{\star} = 0.92 \pm 0.04\%$ for the R band and $1.51 \pm 0.11\%$ for the J band, indicating a red color for the polarized reflectivity of the disk. The intensity of the disk is $I_{\rm disk}/I_{\star} = 3.9 \pm 0.5 \%$ in the J band, and the fractional polarization is $\hat{p}_{\varphi} = 38 \pm 4\%$ for the J band and $42 \pm 2\%$ for the H band. The comparison with the IR excess for RX J1604 yields an apparent disk albedo of about $Λ_{I} \approx 0.16 \pm 0.08$. We also find that previously described shadows seen in the R band data are likely affected by calibration errors. Using dust scattering models for transition disks, we derive approximate J band values for the scattering albedo $ω\approx 0.5$, scattering asymmetry $g \approx 0.5$, and scattering polarization $p_{\rm max} \approx 0.7$ for the dust. The positive R to J band color for the polarized reflectivity is mainly a result of the wavelength dependence of dust parameters because the scattering geometry is expected to be very similar for different colors. This work demonstrates the potential of accurate photo-polarimetric measurements of the circumstellar disk RX J1604 for the determination of dust scattering parameters that strongly constrain the physical properties of the dust.
Submitted 21 June, 2023;
originally announced June 2023.
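As a quick consistency check on the J-band numbers quoted in the RX J1604 abstract above, the fractional polarization can be recomputed from the two disk-integrated quantities. The sketch assumes that the fractional polarization is the ratio of polarized intensity to total disk intensity, both normalized to the stellar intensity; this definition is an assumption, since the abstract does not spell it out, and the variable names are illustrative.

```python
# Disk-integrated J-band values quoted in the abstract, in percent of I_star.
q_phi_over_i_star = 1.51   # polarized intensity, Q_phi / I_star
i_disk_over_i_star = 3.9   # total disk intensity, I_disk / I_star

# Assumed definition: p_phi = (Q_phi / I_star) / (I_disk / I_star).
p_phi = q_phi_over_i_star / i_disk_over_i_star

print(f"fractional polarization: {100 * p_phi:.1f}%")
```

Under this assumption the ratio comes out near 39%, which agrees with the quoted J-band value of 38 ± 4% within its uncertainty.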
-
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
Authors:
Ayyoob Imani,
Peiqin Lin,
Amir Hossein Kargaran,
Silvia Severini,
Masoud Jalili Sabet,
Nora Kassner,
Chunlan Ma,
Helmut Schmid,
André F. T. Martins,
François Yvon,
Hinrich Schütze
Abstract:
The NLP community has mainly focused on scaling Large Language Models (LLMs) vertically, i.e., making them better for about 100 languages. We instead scale LLMs horizontally: we create, through continued pretraining, Glot500-m, an LLM that covers 511 predominantly low-resource languages. An important part of this effort is to collect and clean Glot500-c, a corpus that covers these 511 languages and allows us to train Glot500-m. We evaluate Glot500-m on five diverse tasks across these languages. We observe large improvements for both high-resource and low-resource languages compared to an XLM-R baseline. Our analysis shows that no single factor explains the quality of multilingual LLM representations. Rather, a combination of factors determines quality including corpus size, script, "help" from related languages and the total capacity of the model. Our work addresses an important goal of NLP research: we should not limit NLP to a small fraction of the world's languages and instead strive to support as many languages as possible to bring the benefits of NLP technology to all languages and cultures. Code, data and models are available at https://github.com/cisnlp/Glot500.
Submitted 26 May, 2023; v1 submitted 20 May, 2023;
originally announced May 2023.