Skip to main content

Showing 1–7 of 7 results for author: Kantamneni, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.18530  [pdf, other

    cs.AI cs.CY cs.LG

    Scaling Laws For Scalable Oversight

    Authors: Joshua Engels, David D. Baek, Subhash Kantamneni, Max Tegmark

    Abstract: Scalable oversight, the process by which weaker AI systems supervise stronger ones, has been proposed as a key strategy to control future superintelligent systems. However, it is still unclear how scalable oversight itself scales. To address this gap, we propose a framework that quantifies the probability of successful oversight as a function of the capabilities of the overseer and the system bein… ▽ More

    Submitted 9 May, 2025; v1 submitted 25 April, 2025; originally announced April 2025.

    Comments: 32 pages, 18 figures; The first three authors contributed equally

  2. arXiv:2502.16681  [pdf, other

    cs.LG cs.AI

    Are Sparse Autoencoders Useful? A Case Study in Sparse Probing

    Authors: Subhash Kantamneni, Joshua Engels, Senthooran Rajamanoharan, Max Tegmark, Neel Nanda

    Abstract: Sparse autoencoders (SAEs) are a popular method for interpreting concepts represented in large language model (LLM) activations. However, there is a lack of evidence regarding the validity of their interpretations due to the lack of a ground truth for the concepts used by an LLM, and a growing number of works have presented problems with current SAEs. One alternative source of evidence would be de… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  3. arXiv:2502.00873  [pdf, other

    cs.AI cs.CL cs.LG

    Language Models Use Trigonometry to Do Addition

    Authors: Subhash Kantamneni, Max Tegmark

    Abstract: Mathematical reasoning is an increasingly important indicator of large language model (LLM) capabilities, yet we lack understanding of how LLMs process even simple mathematical tasks. To address this, we reverse engineer how three mid-sized LLMs compute addition. We first discover that numbers are represented in these LLMs as a generalized helix, which is strongly causally implicated for the tasks… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

  4. arXiv:2405.17209  [pdf, other

    cs.LG cond-mat.dis-nn cs.AI

    How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator

    Authors: Subhash Kantamneni, Ziming Liu, Max Tegmark

    Abstract: How do transformers model physics? Do transformers model systems with interpretable analytical solutions, or do they create "alien physics" that are difficult for humans to decipher? We take a step in demystifying this larger puzzle by investigating the simple harmonic oscillator (SHO), $\ddot{x}+2γ\dot{x}+ω_0^2x=0$, one of the most fundamental systems in physics. Our goal is to identify the metho… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 9 pages, 9 figures

  5. arXiv:2405.04484  [pdf, other

    cs.LG physics.comp-ph

    OptPDE: Discovering Novel Integrable Systems via AI-Human Collaboration

    Authors: Subhash Kantamneni, Ziming Liu, Max Tegmark

    Abstract: Integrable partial differential equation (PDE) systems are of great interest in natural science, but are exceedingly rare and difficult to discover. To solve this, we introduce OptPDE, a first-of-its-kind machine learning approach that Optimizes PDEs' coefficients to maximize their number of conserved quantities, $n_{\rm CQ}$, and thus discover new integrable systems. We discover four families of… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2312.12610  [pdf, other

    physics.plasm-ph cs.LG physics.comp-ph

    Enhancing predictive capabilities in fusion burning plasmas through surrogate-based optimization in core transport solvers

    Authors: P. Rodriguez-Fernandez, N. T. Howard, A. Saltzman, S. Kantamneni, J. Candy, C. Holland, M. Balandat, S. Ament, A. E. White

    Abstract: This work presents the PORTALS framework, which leverages surrogate modeling and optimization techniques to enable the prediction of core plasma profiles and performance with nonlinear gyrokinetic simulations at significantly reduced cost, with no loss of accuracy. The efficiency of PORTALS is benchmarked against standard methods, and its full potential is demonstrated on a unique, simultaneous 5-… ▽ More

    Submitted 9 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  7. arXiv:2306.06099  [pdf, other

    nucl-th cs.LG nucl-ex

    NuCLR: Nuclear Co-Learned Representations

    Authors: Ouail Kitouni, Niklas Nolte, Sokratis Trifinopoulos, Subhash Kantamneni, Mike Williams

    Abstract: We introduce Nuclear Co-Learned Representations (NuCLR), a deep learning model that predicts various nuclear observables, including binding and decay energies, and nuclear charge radii. The model is trained using a multi-task approach with shared representations and obtains state-of-the-art performance, achieving levels of precision that are crucial for understanding fundamental phenomena in nucle… ▽ More

    Submitted 21 July, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 7 pages, 5 figures. Accepted after peer review at the ICML 2023 1st workshop on Synergy of Scientific and Machine Learning Modeling (SynS & ML)