-
Light-driven lattice metastability for enhanced superconductivity in FeSe/SrTiO3
Authors:
Qiang Zou,
Zhan Su,
Andres Tellez Mora,
Na Wu,
Joseph Benigno,
Christopher L. Jacobs,
Aldo H. Romero,
Subhasish Mandal,
Yaxian Wang,
Sheng Meng,
Michael Weinert,
Hua Zhou,
Lian Li,
Cheng Cen
Abstract:
Driven quantum materials with on-demand properties controlled by external stimuli are critical for emergent quantum technology. In optically tunable superconducting heterostructures, the lattice responses at the buried interface may hold the key to the light susceptibility but are very challenging to detect. In this work, a nondestructive synchrotron-based X-ray scattering phase-retrieval technique is implemented in monolayer-FeSe/SrTiO3 heterostructures to capture the three-dimensional interfacial atomic displacements in-situ as the interface superconductivity is actively manipulated by light. It is found that the interlayer sliding between FeSe and SrTiO3 can drastically alter how the lattice responds to the light. In domains with selected stacking configurations, the interface transforms the very weak photoexcitation in SrTiO3 into significant Fe-atom displacements in FeSe and generates metastable interfacial structures that can lead to a persistent superconductivity enhancement. These findings demonstrate an effective strategy for achieving greatly amplified light-lattice coupling for efficient quantum phase manipulations at designed interfaces.
Submitted 24 April, 2025;
originally announced April 2025.
-
A Bayesian account of pronoun and neopronoun acquisition
Authors:
Cassandra L. Jacobs,
Morgan Grobol
Abstract:
A major challenge to equity among members of queer communities is the use of one's chosen forms of reference, such as personal names or pronouns. Speakers often dismiss their misuses of pronouns as "unintentional", and claim that their errors reflect many decades of fossilized mainstream language use, as well as attitudes or expectations about the relationship between one's appearance and acceptable forms of reference. We argue for explicitly modeling individual differences in pronoun selection and present a probabilistic graphical modeling approach based on the nested Chinese Restaurant Franchise Process (nCRFP) (Ahmed et al., 2013) to account for flexible pronominal reference such as chosen names and neopronouns while moving beyond form-to-meaning mappings and without lexical co-occurrence statistics to learn referring expressions, as in contemporary language models. We show that such a model can account for variability in how quickly pronouns or names are integrated into symbolic knowledge and can empower computational systems to be both flexible and respectful of queer people with diverse gender expression.
Submitted 3 April, 2025;
originally announced April 2025.
-
A Topological Superconductor Tuned by Electronic Correlations
Authors:
Haoran Lin,
Christopher L. Jacobs,
Chenhui Yan,
Gillian M. Nolan,
Gabriele Berruto,
Patrick Singleton,
Khanh Duy Nguyen,
Yunhe Bai,
Qiang Gao,
Xianxin Wu,
Chao-Xing Liu,
Gangbin Yan,
Suin Choi,
Chong Liu,
Nathan P. Guisinger,
Pinshane Y. Huang,
Subhasish Mandal,
Shuolong Yang
Abstract:
A topological superconductor, characterized by either a chiral order parameter or a chiral topological surface state in proximity to bulk superconductivity, is foundational to topological quantum computing. As in other topological phases of matter, electronic correlations can tune topological superconductivity via modifications of the low-energy Fermiology. Such tuning has not been realized so far. Here we uncover a unique topological superconducting phase in competition with electronic correlations in 10-unit-cell thick FeTe$_{x}$Se$_{1-x}$ films grown on SrTiO$_{3}$ substrates. When the Te content $x$ exceeds $0.7$, we observe a rapid increase of the effective mass for the Fe $d_{xy}$ band, with the emergence of a superconducting topological surface state confirmed by high-resolution angle-resolved photoemission spectroscopy; however, near the FeTe limit, the system enters an incoherent regime where the topological surface state becomes unidentifiable and superconductivity is suppressed. Theory suggests that the electron-electron interactions in the odd-parity $xy^-$ band with a strong $d_{xy}$ character lead to an orbital-selective correlated phase. Our work establishes FeTe$_{x}$Se$_{1-x}$ thin films as a unique platform where electronic correlations sensitively modulate topological superconductivity, suggesting opportunities to use tunable electron-electron interactions to engineer new topological phases in a broad class of materials.
Submitted 28 March, 2025;
originally announced March 2025.
-
Large-scale cloze evaluation reveals that token prediction tasks are neither lexically nor semantically aligned
Authors:
Cassandra L. Jacobs,
Loïc Grobol,
Alvin Tsang
Abstract:
In this work we compare the generative behavior at the next token prediction level in several language models by comparing them to human productions in the cloze task. We find that while large models trained for longer are typically better estimators of human productions, they reliably under-estimate the probabilities of human responses, over-rank rare responses, under-rank top responses, and produce highly distinct semantic spaces. Altogether, this work demonstrates in a tractable, interpretable domain that LM generations cannot be used as replacements for or models of the cloze task.
Submitted 28 October, 2024; v1 submitted 15 October, 2024;
originally announced October 2024.
-
Incorporating Annotator Uncertainty into Representations of Discourse Relations
Authors:
S. Magalí López Cortez,
Cassandra L. Jacobs
Abstract:
Annotation of discourse relations is a known difficult task, especially for non-expert annotators. In this paper, we investigate novice annotators' uncertainty on the annotation of discourse relations on spoken conversational data. We find that dialogue context (single turn, pair of turns within speaker, and pair of turns across speakers) is a significant predictor of confidence scores. We compute distributed representations of discourse relations from co-occurrence statistics that incorporate information about confidence scores and dialogue context. We perform a hierarchical clustering analysis using these representations and show that weighting discourse relation representations with information about confidence and dialogue context coherently models our annotators' uncertainty about discourse relation labels.
Submitted 14 August, 2023;
originally announced August 2023.
-
The distribution of discourse relations within and across turns in spontaneous conversation
Authors:
S. Magalí López Cortez,
Cassandra L. Jacobs
Abstract:
Time pressure and topic negotiation may impose constraints on how people leverage discourse relations (DRs) in spontaneous conversational contexts. In this work, we adapt a system of DRs for written language to spontaneous dialogue using crowdsourced annotations from novice annotators. We then test whether discourse relations are used differently across several types of multi-utterance contexts. We compare the patterns of DR annotation within and across speakers and within and across turns. Ultimately, we find that different discourse contexts produce distinct distributions of discourse relations, with single-turn annotations creating the most uncertainty for annotators. Additionally, we find that the discourse relation annotations are of sufficient quality to predict from embeddings of discourse units.
Submitted 7 July, 2023;
originally announced July 2023.
-
Lost in Space Marking
Authors:
Cassandra L. Jacobs,
Yuval Pinter
Abstract:
We look at a decision taken early in training a subword tokenizer, namely whether it should be the word-initial token that carries a special mark, or the word-final one. Based on surface-level considerations of efficiency and cohesion, as well as morphological coverage, we find that a Unigram LM tokenizer trained on pre-tokenized English text is better off marking the word-initial token, while one trained on raw text benefits from marking word ends. Our findings generalize across domains.
Submitted 2 August, 2022;
originally announced August 2022.
-
Will it Unblend?
Authors:
Yuval Pinter,
Cassandra L. Jacobs,
Jacob Eisenstein
Abstract:
Natural language processing systems often struggle with out-of-vocabulary (OOV) terms, which do not appear in training data. Blends, such as "innoventor", are one particularly challenging class of OOV, as they are formed by fusing together two or more bases that relate to the intended meaning in unpredictable manners and degrees. In this work, we run experiments on a novel dataset of English OOV blends to quantify the difficulty of interpreting the meanings of blends by large-scale contextual language models such as BERT. We first show that BERT's processing of these blends does not fully access the component meanings, leaving their contextual representations semantically impoverished. We find this is mostly due to the loss of characters resulting from blend formation. Then, we assess how easily different models can recognize the structure and recover the origin of blends, and find that context-aware embedding systems outperform character-level and context-free embeddings, although their results are still far from satisfactory.
Submitted 18 September, 2020;
originally announced September 2020.
-
NYTWIT: A Dataset of Novel Words in the New York Times
Authors:
Yuval Pinter,
Cassandra L. Jacobs,
Max Bittker
Abstract:
We present the New York Times Word Innovation Types dataset, or NYTWIT, a collection of over 2,500 novel English words published in the New York Times between November 2017 and March 2019, manually annotated for their class of novelty (such as lexical derivation, dialectal variation, blending, or compounding). We present baseline results for both uncontextual and contextual prediction of novelty class, showing that there is room for improvement even for state-of-the-art NLP systems. We hope this resource will prove useful for linguists and NLP practitioners by providing a real-world environment of novel word appearance.
Submitted 23 October, 2020; v1 submitted 6 March, 2020;
originally announced March 2020.