Why Linguistics Will Thrive in the 21st Century: A Reply to Piantadosi (2023)

Authors: Jordan Kodner, Sarah Payne, Jeffrey Heinz

Abstract: We present a critical assessment of Piantadosi's (2023) claim that "Modern language models refute Chomsky's approach to language," focusing on four main points. First, despite the impressive performance and utility of large language models (LLMs), humans achieve their capacity for language after exposure to several orders of magnitude less data. The fact that young children become competent, fluent speakers of their native languages with relatively little exposure to them is the central mystery of language learning to which Chomsky initially drew attention, and LLMs currently show little promise of solving this mystery. Second, what can the artificial reveal about the natural? Put simply, the implications of LLMs for our understanding of the cognitive structures and mechanisms underlying language and its acquisition are like the implications of airplanes for understanding how birds fly. Third, LLMs cannot constitute scientific theories of language for several reasons, not least of which is that scientific theories must provide interpretable explanations, not just predictions. This leads to our final point: to even determine whether the linguistic and cognitive capabilities of LLMs rival those of humans requires explicating what humans' capacities actually are. In other words, it requires a separate theory of language and cognition; generative linguistics provides precisely such a theory. As such, we conclude that generative linguistics as a scientific discipline will remain indispensable throughout the 21st century and beyond.

Submitted 6 August, 2023; originally announced August 2023.

arXiv:2306.16858 [pdf, other]

Methods for non-proportional hazards in clinical trials: A systematic review

Authors: Maximilian Bardo, Cynthia Huber, Norbert Benda, Jonas Brugger, Tobias Fellinger, Vaidotas Galaune, Judith Heinz, Harald Heinzl, Andrew C. Hooker, Florian Klinglmüller, Franz König, Tim Mathes, Martina Mittlböck, Martin Posch, Robin Ristl, Tim Friede

Abstract: For the analysis of time-to-event data, frequently used methods such as the log-rank test or the Cox proportional hazards model are based on the proportional hazards assumption, which is often debatable. Although a wide range of parametric and non-parametric methods for non-proportional hazards (NPH) has been proposed, there is no consensus on the best approaches. To close this gap, we conducted a systematic literature search to identify statistical methods and software appropriate under NPH. Our literature search identified 907 abstracts, out of which we included 211 articles, mostly methodological ones. Review articles and applications were less frequently identified. The articles discuss effect measures, effect estimation and regression approaches, hypothesis tests, and sample size calculation approaches, which are often tailored to specific NPH situations. Using a unified notation, we provide an overview of methods available. Furthermore, we derive some guidance from the identified articles. We summarized the contents from the literature review in a concise way in the main text and provide more detailed explanations in the supplement.

Submitted 29 January, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

MSC Class: 62Nxx; 62Pxx

arXiv:2304.07687 [pdf, other]

MLRegTest: A Benchmark for the Machine Learning of Regular Languages

Authors: Sam van der Poel, Dakotah Lambert, Kalina Kostyszyn, Tiantian Gao, Rahul Verma, Derek Andersen, Joanne Chau, Emily Peterson, Cody St. Clair, Paul Fodor, Chihiro Shibata, Jeffrey Heinz

Abstract: Synthetic datasets constructed from formal languages allow fine-grained examination of the learning and generalization capabilities of machine learning systems for sequence classification. This article presents a new benchmark for machine learning systems on sequence classification called MLRegTest, which contains training, development, and test sets from 1,800 regular languages. Different kinds of formal languages represent different kinds of long-distance dependencies, and correctly identifying long-distance dependencies in sequences is a known challenge for ML systems to generalize successfully. MLRegTest organizes its languages according to their logical complexity (monadic second order, first order, propositional, or monomial expressions) and the kind of logical literals (string, tier-string, subsequence, or combinations thereof). The logical complexity and choice of literal provide a systematic way to understand different kinds of long-distance dependencies in regular languages, and therefore to understand the capacities of different ML systems to learn such long-distance dependencies. Finally, the performance of different neural networks (simple RNN, LSTM, GRU, transformer) on MLRegTest is examined. The main conclusion is that performance depends significantly on the kind of test set, the class of language, and the neural network architecture.

Submitted 1 September, 2024; v1 submitted 15 April, 2023; originally announced April 2023.

Comments: Accepted for publication in the Journal of Machine Learning Research. Dataset available at https://doi.org/10.5061/dryad.dncjsxm4h , code available at https://github.com/heinz-jeffrey/subregular-learning

arXiv:2210.07604 [pdf, other]

High-Order Non-Conforming Discontinuous Galerkin Methods for the Acoustic Conservation Equations

Authors: Johannes Heinz, Peter Munch, Manfred Kaltenbacher

Abstract: This work compares two Nitsche-type approaches to treat non-conforming triangulations for a high-order discontinuous Galerkin (DG) solver for the acoustic conservation equations. The first approach (point-to-point interpolation) uses inexact integration with quadrature points prescribed by a primary element. The second approach uses exact integration (mortaring) by choosing quadratures depending on the intersection between non-conforming elements. In the literature, some excellent properties regarding performance and ease of implementation are reported for point-to-point interpolation. However, we show that this approach cannot safely be used for DG discretizations of the acoustic conservation equations since, in our setting, it yields spurious oscillations that lead to instabilities. This work presents a test case in which we can observe the instabilities and shows that exact integration is required to maintain a stable method. Additionally, we provide a detailed analysis of the method with exact integration. We show optimal spatial convergence rates globally and in each mesh region separately. The method is constructed such that it can natively treat overlaps between elements. Finally, we highlight the benefits of non-conforming discretizations in acoustic computations by a numerical test case with different fluids.

Submitted 14 October, 2022; originally announced October 2022.

arXiv:2003.07166 [pdf, other]

doi 10.1016/j.jcp.2020.110040

High-order arbitrary Lagrangian-Eulerian discontinuous Galerkin methods for the incompressible Navier-Stokes equations

Authors: Niklas Fehn, Johannes Heinz, Wolfgang A. Wall, Martin Kronbichler

Abstract: This paper presents robust discontinuous Galerkin methods for the incompressible Navier-Stokes equations on moving meshes. High-order accurate arbitrary Lagrangian-Eulerian formulations are proposed in a unified framework for both monolithic as well as projection or splitting-type Navier-Stokes solvers. The framework is flexible, allows implicit and explicit formulations of the convective term, and adaptive time-stepping. The Navier-Stokes equations with ALE transport term are solved on the deformed geometry storing one instance of the mesh that is updated from one time step to the next. Discretization in space is applied to the time discrete equations so that all weak forms and mass matrices are evaluated at the end of the current time step. This design ensures that the proposed formulations fulfill the geometric conservation law automatically, as is shown theoretically and demonstrated numerically by the example of the free-stream preservation test. We discuss the peculiarities related to the imposition of boundary conditions in intermediate steps of projection-type methods and the ingredients needed to preserve high-order accuracy. We show numerically that the formulations proposed in this work maintain the formal order of accuracy of the Navier-Stokes solvers. Moreover, we demonstrate robustness and accuracy for under-resolved turbulent flows.

Submitted 16 March, 2020; originally announced March 2020.

arXiv:1910.11493 [pdf, ps, other]

doi 10.18653/v1/W19-4226

The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection

Authors: Arya D. McCarthy, Ekaterina Vylomova, Shijie Wu, Chaitanya Malaviya, Lawrence Wolf-Sonkin, Garrett Nicolai, Christo Kirov, Miikka Silfverberg, Sabrina J. Mielke, Jeffrey Heinz, Ryan Cotterell, Mans Hulden

Abstract: The SIGMORPHON 2019 shared task on cross-lingual transfer and contextual analysis in morphology examined transfer learning of inflection between 100 language pairs, as well as contextual lemmatization and morphosyntactic description in 66 languages. The first task evolves past years' inflection tasks by examining transfer of morphological inflection knowledge from a high-resource language to a low-resource language. This year also presents a new second challenge on lemmatization and morphological feature analysis in context. All submissions featured a neural component and built on either this year's strong baselines or highly ranked systems from previous years' shared tasks. Every participating team improved in accuracy over the baselines for the inflection task (though not Levenshtein distance), and every team in the contextual analysis task improved on both state-of-the-art neural and non-neural baselines.

Submitted 25 February, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

Comments: Presented at SIGMORPHON 2019

Journal ref: Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology (2019) 229-244

arXiv:1908.06340 [pdf]

doi 10.1186/s12931-019-1163-2

Decline of COPD exacerbations in clinical trials over two decades -- a systematic review and meta-regression

Authors: Stefan Andreas, Christian Röver, Judith Heinz, Sebastian Straube, Henrik Watz, Tim Friede

Abstract: BACKGROUND: An important goal of chronic obstructive pulmonary disease (COPD) treatment is to reduce the frequency of exacerbations. Some observations suggest a decline in exacerbation rates in clinical trials over time. A more systematic understanding would help to improve the design and interpretation of COPD trials. METHODS: We performed a systematic review and meta-regression of the placebo groups in published randomized controlled trials reporting exacerbations as an outcome. A Bayesian negative binomial model was developed to accommodate results that are reported in different formats; results are reported with credible intervals (CI) and posterior tail probabilities ($p_B$). RESULTS: Of 1114 studies identified by our search, 55 were ultimately included. Exacerbation rates decreased by 6.7% (95% CI (4.4, 9.0); $p_B$ < 0.001) per year, or 50% (95% CI (36, 61)) per decade. Adjusting for available study and baseline characteristics such as forced expiratory volume in 1 s (FEV1) did not alter the observed trend considerably. Two subsets of studies, one using a true placebo group and the other allowing inhaled corticosteroids in the "placebo" group, also yielded consistent results. CONCLUSIONS: In conclusion, this meta-regression indicates that the rate of COPD exacerbations decreased over the past two decades to a clinically relevant extent independent of important prognostic factors. This suggests that care is needed in the design of new trials or when comparing results from older trials with more recent ones. Also a considerable effect of adjunct therapy on COPD exacerbations can be assumed.

Submitted 17 August, 2019; originally announced August 2019.

Comments: 11 pages, 4 figures

Journal ref: Respiratory Research 20:186, 2019

arXiv:1906.07886 [pdf, other]

Learning with Partially Ordered Representations

Authors: Jane Chandlee, Remi Eyraud, Jeffrey Heinz, Adam Jardine, Jonathan Rawski

Abstract: This paper examines the characterization and learning of grammars defined with enriched representational models. Model-theoretic approaches to formal language theory traditionally assume that each position in a string belongs to exactly one unary relation. We consider unconventional string models where positions can have multiple, shared properties, which are arguably useful in many applications. We show the structures given by these models are partially ordered, and present a learning algorithm that exploits this ordering relation to effectively prune the hypothesis space. We prove this learning algorithm, which takes positive examples as input, finds the most general grammar which covers the data.

Submitted 23 June, 2019; v1 submitted 18 June, 2019; originally announced June 2019.

Comments: to appear in Proceedings of Mathematics of Language (ACL SIGMOL 2019)

arXiv:1705.05940 [pdf, ps, other]

Subregular Complexity and Deep Learning

Authors: Enes Avcu, Chihiro Shibata, Jeffrey Heinz

Abstract: This paper argues that the judicious use of formal language theory and grammatical inference are invaluable tools in understanding how deep neural networks can and cannot represent and learn long-term dependencies in temporal sequences. Learning experiments were conducted with two types of Recurrent Neural Networks (RNNs) on six formal languages drawn from the Strictly Local (SL) and Strictly Piecewise (SP) classes. The networks were Simple RNNs (s-RNNs) and Long Short-Term Memory RNNs (LSTMs) of varying sizes. The SL and SP classes are among the simplest in a mathematically well-understood hierarchy of subregular classes. They encode local and long-term dependencies, respectively. The grammatical inference algorithm Regular Positive and Negative Inference (RPNI) provided a baseline. According to earlier research, the LSTM architecture should be capable of learning long-term dependencies and should outperform s-RNNs. The results of these experiments challenge this narrative. First, the LSTMs' performance was generally worse in the SP experiments than in the SL ones. Second, the s-RNNs out-performed the LSTMs on the most complex SP experiment and performed comparably to them on the others.

Submitted 14 October, 2017; v1 submitted 16 May, 2017; originally announced May 2017.

arXiv:1210.1630 [pdf, ps, other]

Symbolic Planning and Control Using Game Theory and Grammatical Inference

Authors: Jie Fu, Herbert G. Tanner, Jeffrey Heinz, Jane Chandlee, Konstantinos Karydis, Cesar Koirala

Abstract: This paper presents an approach that brings together game theory with grammatical inference and discrete abstractions in order to synthesize control strategies for hybrid dynamical systems performing tasks in partially unknown but rule-governed adversarial environments. The combined formulation guarantees that a system specification is met if (a) the true model of the environment is in the class of models inferable from a positive presentation, (b) a characteristic sample is observed, and (c) the task specification is satisfiable given the capabilities of the system (agent) and the environment.

Submitted 4 October, 2012; originally announced October 2012.

Showing 1–10 of 10 results for author: Heinz, J