-
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens
Authors:
Jiacheng Liu,
Taylor Blanton,
Yanai Elazar,
Sewon Min,
YenSung Chen,
Arnavi Chheda-Kothary,
Huy Tran,
Byron Bischoff,
Eric Marsh,
Michael Schmitz,
Cassidy Trier,
Aaron Sarnat,
Jenna James,
Jon Borchardt,
Bailey Kuehl,
Evie Cheng,
Karen Farley,
Sruthi Sreeram,
Taira Anderson,
David Albright,
Carissa Schoenick,
Luca Soldaini,
Dirk Groeneveld,
Rock Yuren Pang,
Pang Wei Koh
, et al. (6 additional authors not shown)
Abstract:
We present OLMoTrace, the first system that traces the outputs of language models back to their full, multi-trillion-token training data in real time. OLMoTrace finds and shows verbatim matches between segments of language model output and documents in the training text corpora. Powered by an extended version of infini-gram (Liu et al., 2024), our system returns tracing results within a few seconds. OLMoTrace can help users understand the behavior of language models through the lens of their training data. We showcase how it can be used to explore fact checking, hallucination, and the creativity of language models. OLMoTrace is publicly available and fully open-source.
Submitted 9 April, 2025;
originally announced April 2025.
-
In Situ Geochronology for the Next Decade: Mission Designs for the Moon, Mars, and Vesta
Authors:
Barbara A. Cohen,
Kelsey E. Young,
Nicolle E. B. Zellner,
Kris Zacny,
R. Aileen Yingst,
Ryan N. Watkins,
Richard Warwick,
Sarah N. Valencia,
Timothy D. Swindle,
Stuart J. Robbins,
Noah E. Petro,
Anthony Nicoletti,
Daniel P. Moriarty, III,
Richard Lynch,
Stephen J. Indyk,
Juliane Gross,
Jennifer A. Grier,
John A. Grant,
Amani Ginyard,
Caleb I. Fassett,
Kenneth A. Farley,
Benjamin J. Farcy,
Bethany L. Ehlmann,
M. Darby Dyar,
Gerard Daelemans
, et al. (4 additional authors not shown)
Abstract:
Geochronology, or determination of absolute ages for geologic events, underpins many inquiries into the formation and evolution of planets and our Solar System. Absolute ages of ancient and recent magmatic products provide strong constraints on the dynamics of magma oceans and crustal formation, as well as the longevity and evolution of interior heat engines and distinct mantle/crustal source regions. Absolute dating also relates habitability markers to the timescale of evolution of life on Earth. However, the number of geochronologically-significant terrains across the inner Solar System far exceeds our ability to conduct sample return from all of them. In preparation for the upcoming Decadal Survey, our team formulated a set of medium-class (New Frontiers) mission concepts to three different locations (the Moon, Mars, and Vesta) where sites that record Solar System bombardment, magmatism, and/or habitability are uniquely preserved and accessible. We developed a notional payload to directly date planetary surfaces, consisting of two instruments capable of measuring radiometric ages in situ, an imaging spectrometer, optical cameras to provide site geologic context and sample characterization, a trace element analyzer to augment sample contextualization, and a sample acquisition and handling system. Landers carrying this payload to the Moon, Mars, and Vesta would likely fit into the New Frontiers cost cap in our study (~$1B). A mission of this type would provide crucial constraints on planetary history while also enabling a broad suite of investigations such as basic geologic characterization, geomorphologic analysis, ground truth for remote sensing analyses, analyses of major, minor, trace, and volatile elements, atmospheric and other long-lived monitoring, organic molecule analyses, and soil and geotechnical properties.
Submitted 4 January, 2021;
originally announced January 2021.
-
High-precision measurements of krypton and xenon isotopes with a new static-mode Quadrupole Ion Trap Mass Spectrometer
Authors:
G. Avice,
A. Belousov,
K. A. Farley,
S. M. Madzunkov,
J. Simcic,
D. Nikolić,
M. R. Darrach,
C. Sotin
Abstract:
Measuring the abundance and isotopic composition of noble gases in planetary atmospheres can answer fundamental questions in cosmochemistry and comparative planetology. However, noble gases are rare elements, a feature making their measurement challenging even on Earth. Furthermore, in space applications, power consumption, volume and mass constraints on spacecraft instrument accommodations require the development of compact innovative instruments able to meet the engineering requirements of the mission while still meeting the science requirements. Here we demonstrate the ability of the quadrupole ion trap mass spectrometer (QITMS) developed at the Jet Propulsion Laboratory (Caltech, Pasadena) to measure low quantities of heavy noble gases (Kr, Xe) in static operating mode and in the absence of a buffer gas such as helium. The sensitivity reaches 1E13 cps Torr$^{-1}$ (about $10^{11}$ cps/Pa) of gas (Kr or Xe). The instrument is able to measure gas in static mode for extended periods of time (up to 48 h) enabling the acquisition of thousands of isotope ratios per measurement. Errors on isotope ratios follow predictions of the counting statistics and the instrument provides reproducible results over several days of measurements. For example, 1.7E-10 Torr (2.3E-8 Pa) of Kr measured continuously for 7 hours yielded a 0.6 permil precision on the $^{86}$Kr/$^{84}$Kr ratio. Measurements of terrestrial and extraterrestrial samples reproduce values from the literature. A compact instrument based upon the QITMS design would have a sensitivity high enough to reach the precision on isotope ratios (e.g. better than 1 percent for $^{129,131-136}$Xe/$^{130}$Xe ratios) necessary for a scientific payload measuring noble gases collected in the Venus atmosphere.
Submitted 13 December, 2020;
originally announced December 2020.
-
Non-monotonic resistance noise in the charge density wave pinned state in single nanoribbons of CDW conductor NbSe$_{3}$
Authors:
Zhenzhong Shi,
Peter M. Marley,
Katie Farley,
Sarbajit Banerjee,
G. Sambandamurthy
Abstract:
Electrical transport and broadband resistance noise measurements in an ultra-low-frequency window (30 mHz - 8 Hz) are carried out in single nanoribbon devices of charge density wave (CDW) conductor NbSe$_{3}$. In the temperature and electric field range where the CDW is expected to be completely pinned by residual impurities, a hitherto unseen non-monotonic behavior in the noise magnitude vs. electric field is observed. This behavior can be attributed to the proliferation of thermally activated phase slip events, and this idea is supported by the observation of a smeared activated behavior described by the Dutta-Horn relation. Certain features of the temperature dependence of the noise magnitude do not follow an activated behavior, pointing to a complex origin of the fluctuations in a CDW system.
Submitted 30 October, 2014;
originally announced October 2014.