Skip to main content

Showing 1–14 of 14 results for author: Earls, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.17071  [pdf, other

    cs.CL

    What's in a prompt? Language models encode literary style in prompt embeddings

    Authors: Raphaël Sarfati, Haley Moller, Toni J. B. Liu, Nicolas Boullé, Christopher Earls

    Abstract: Large language models use high-dimensional latent spaces to encode and process textual information. Much work has investigated how the conceptual content of words translates into geometrical relationships between their vector representations. Fewer studies analyze how the cumulative information of an entire prompt becomes condensed into individual embeddings under the action of transformer layers.… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2504.17065  [pdf, other

    cs.LG

    Antenna Near-Field Reconstruction from Far-Field Data Using Convolutional Neural Networks

    Authors: Sahar Bagherkhani, Jackson Christopher Earls, Franco De Flaviis, Pierre Baldi

    Abstract: Electromagnetic field reconstruction is crucial in many applications, including antenna diagnostics, electromagnetic interference analysis, and system modeling. This paper presents a deep learning-based approach for Far-Field to Near-Field (FF-NF) transformation using Convolutional Neural Networks (CNNs). The goal is to reconstruct near-field distributions from the far-field data of an antenna wit… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  3. arXiv:2501.18715  [pdf, other

    cs.LG math.NA

    chebgreen: Learning and Interpolating Continuous Empirical Green's Functions from Data

    Authors: Harshwardhan Praveen, Jacob Brown, Christopher Earls

    Abstract: In this work, we present a mesh-independent, data-driven library, chebgreen, to mathematically model one-dimensional systems, possessing an associated control parameter, and whose governing partial differential equation is unknown. The proposed method learns an Empirical Green's Function for the associated, but hidden, boundary value problem, in the form of a Rational Neural Network from which we… ▽ More

    Submitted 2 May, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

    Comments: Code is available at https://github.com/hsharsh/chebgreen

  4. arXiv:2412.15113  [pdf, other

    cs.NE cs.AI cs.CL

    Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture

    Authors: Thomas F Burns, Tomoki Fukai, Christopher J Earls

    Abstract: Large language models (LLMs) demonstrate an impressive ability to utilise information within the context of their input sequences to appropriately respond to data unseen by the LLM during its training procedure. This ability is known as in-context learning (ICL). Humans and non-human animals demonstrate similar abilities, however their neural architectures differ substantially from LLMs. Despite t… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: 18 pages, 6 figures, 3 tables

    MSC Class: 92B20; 68T01; 68T37; 68T50 ACM Class: I.2; I.5; I.7; J.2; J.3

  5. arXiv:2410.05218  [pdf, other

    cs.LG cs.CL stat.ML

    Density estimation with LLMs: a geometric investigation of in-context learning trajectories

    Authors: Toni J. B. Liu, Nicolas Boullé, Raphaël Sarfati, Christopher J. Earls

    Abstract: Large language models (LLMs) demonstrate remarkable emergent abilities to perform in-context learning across various tasks, including time series forecasting. This work investigates LLMs' ability to estimate probability density functions (PDFs) from data observed in-context; such density estimation (DE) is a fundamental task underlying many probabilistic modeling problems. We leverage the Intensiv… ▽ More

    Submitted 3 March, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

  6. arXiv:2410.01545  [pdf, other

    cs.LG physics.data-an

    Lines of Thought in Large Language Models

    Authors: Raphaël Sarfati, Toni J. B. Liu, Nicolas Boullé, Christopher J. Earls

    Abstract: Large Language Models achieve next-token prediction by transporting a vectorized piece of text (prompt) across an accompanying embedding space under the action of successive transformer layers. The resulting high-dimensional trajectories realize different contextualization, or 'thinking', steps, and fully determine the output probability distribution. We aim to characterize the statistical propert… ▽ More

    Submitted 13 February, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  7. arXiv:2402.00795  [pdf, other

    cs.LG cs.AI

    LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law

    Authors: Toni J. B. Liu, Nicolas Boullé, Raphaël Sarfati, Christopher J. Earls

    Abstract: Pretrained large language models (LLMs) are surprisingly effective at performing zero-shot tasks, including time-series forecasting. However, understanding the mechanisms behind such capabilities remains highly challenging due to the complexity of the models. We study LLMs' ability to extrapolate the behavior of dynamical systems whose evolution is governed by principles of physical interest. Our… ▽ More

    Submitted 9 October, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  8. arXiv:2309.04699  [pdf, other

    cs.LG

    Weak-PDE-LEARN: A Weak Form Based Approach to Discovering PDEs From Noisy, Limited Data

    Authors: Robert Stephany, Christopher Earls

    Abstract: We introduce Weak-PDE-LEARN, a Partial Differential Equation (PDE) discovery algorithm that can identify non-linear PDEs from noisy, limited measurements of their solutions. Weak-PDE-LEARN uses an adaptive loss function based on weak forms to train a neural network, $U$, to approximate the PDE solution while simultaneously identifying the governing PDE. This approach yields an algorithm that is ro… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: 29 pages, 8 figures

  9. arXiv:2212.04971  [pdf, other

    cs.LG

    PDE-LEARN: Using Deep Learning to Discover Partial Differential Equations from Noisy, Limited Data

    Authors: Robert Stephany, Christopher Earls

    Abstract: In this paper, we introduce PDE-LEARN, a novel deep learning algorithm that can identify governing partial differential equations (PDEs) directly from noisy, limited measurements of a physical system of interest. PDE-LEARN uses a Rational Neural Network, $U$, to approximate the system response function and a sparse, trainable vector, $ξ$, to characterize the hidden PDE that the system response fun… ▽ More

    Submitted 9 February, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 25 pages, 7 figures, 9 tables

  10. arXiv:2111.00998  [pdf, other

    cs.LG

    PDE-READ: Human-readable Partial Differential Equation Discovery using Deep Learning

    Authors: Robert Stephany, Christopher Earls

    Abstract: PDE discovery shows promise for uncovering predictive models of complex physical systems but has difficulty when measurements are sparse and noisy. We introduce a new approach for PDE discovery that uses two Rational Neural Networks and a principled sparse regression algorithm to identify the hidden dynamics that govern a system's response. The first network learns the system response function, wh… ▽ More

    Submitted 17 June, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: 41 pages, 18 figures

  11. arXiv:2108.04085  [pdf, other

    math.NA cs.LG

    Bayesian Deep Learning for Partial Differential Equation Parameter Discovery with Sparse and Noisy Data

    Authors: Christophe Bonneville, Christopher J. Earls

    Abstract: Scientific machine learning has been successfully applied to inverse problems and PDE discovery in computational physics. One caveat concerning current methods is the need for large amounts of ("clean") data, in order to characterize the full system response and discover underlying physical models. Bayesian methods may be particularly promising for overcoming these challenges, as they are naturall… ▽ More

    Submitted 29 November, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

  12. arXiv:2105.00266  [pdf, other

    cs.LG cs.AI math.NA

    Data-driven discovery of Green's functions with human-understandable deep learning

    Authors: Nicolas Boullé, Christopher J. Earls, Alex Townsend

    Abstract: There is an opportunity for deep learning to revolutionize science and technology by revealing its findings in a human interpretable manner. To do this, we develop a novel data-driven approach for creating a human-machine partnership to accelerate scientific discovery. By collecting physical system responses under excitations drawn from a Gaussian process, we train rational neural networks to lear… ▽ More

    Submitted 11 March, 2022; v1 submitted 1 May, 2021; originally announced May 2021.

    Comments: 54 pages, 23 figures

  13. arXiv:2008.09687  [pdf, other

    cs.CE math.NA physics.flu-dyn

    A Principled Approach to Design Using High Fidelity Fluid-Structure Interaction Simulations

    Authors: Wensi Wu, Christophe Bonneville, Christopher J. Earls

    Abstract: A high fidelity fluid-structure interaction simulation may require many days to run, on hundreds of cores. This poses a serious burden, both in terms of time and economic considerations, when repetitions of such simulations may be required (e.g. for the purpose of design optimization). In this paper we present strategies based on (constrained) Bayesian optimization (BO) to alleviate this burden. B… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

  14. arXiv:1905.07622  [pdf, other

    math.NA cs.MS

    Analysis of heterogeneous computing approaches to simulating heat transfer in heterogeneous material

    Authors: Andrew Loeb, Christopher Earls

    Abstract: The simulation of heat flow through heterogeneous material is important for the design of structural and electronic components. Classical analytical solutions to the heat equation PDE are not known for many such domains, even those having simple geometries. The finite element method can provide approximations to a weak form continuum solution, with increasing accuracy as the number of degrees of f… ▽ More

    Submitted 18 May, 2019; originally announced May 2019.