Skip to main content

Showing 1–6 of 6 results for author: Wheeler, T J

.
  1. arXiv:2505.06395  [pdf, other

    q-bio.OT

    Contributions of the Petabyte Scale Sequence Search Codeathon toward efforts to scale sequence-based searches on SRA

    Authors: Priyanka Ghosh, Kjiersten Fagnan, Ryan Connor, Ravinder Pannu, Travis J. Wheeler, Mihai Pop, C. Titus Brown, Tessa Pierce-Ward, Rob Patro, Jacquelyn S. Michaelis, Thomas L. Madden, Christiam Camacho, Olaitan I. Awe, Arianna I. Krinos, René KM Xavier, Rodrigo Ortega Polo, Jack W. Roddy, Adelaide Rhodes, Alexander Sweeten, Adrian Viehweger, Bariş Ekim, Harihara Subrahmaniam Muralidharan, Amatur Rahman, Vinícius W. Salazar, Andrew Tritt , et al. (13 additional authors not shown)

    Abstract: The volume of biological data being generated by the scientific community is growing exponentially, reflecting technological advances and research activities. The National Institutes of Health's (NIH) Sequence Read Archive (SRA), which is maintained by the National Center for Biotechnology Information (NCBI) at the National Library of Medicine (NLM), is a rapidly growing public database that resea… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

  2. arXiv:2503.22133  [pdf

    q-bio.PE

    Describing the Persistence Landscape for Introducing Microbes into Complex Communities

    Authors: Jason E. McDermott, William C. Nelson, Amy E. Zimmerman, Winston Anthony, Devin Coleman-Derr, Joshua Elmore, Tara Nitka, Ryan S. McClure, Pubudu P. Handakumbura, Adam Guss, Travis J. Wheeler, Robert G. Egbert

    Abstract: The introduction of non-native organisms into complex microbiome communities holds enormous potential to benefit society. However, microbiome engineering faces several challenges including successful establishment of the organism into the community, its persistence in the microbiome to serve a specified purpose, and constraint of the organism and its activity to the intended environment. A theoret… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

    Comments: 26 pages, 6 figures

  3. The need to implement FAIR principles in biomolecular simulations

    Authors: Rommie Amaro, Johan Åqvist, Ivet Bahar, Federica Battistini, Adam Bellaiche, Daniel Beltran, Philip C. Biggin, Massimiliano Bonomi, Gregory R. Bowman, Richard Bryce, Giovanni Bussi, Paolo Carloni, David Case, Andrea Cavalli, Chie-En A. Chang, Thomas E. Cheatham III, Margaret S. Cheung, Cris Chipot, Lillian T. Chong, Preeti Choudhary, Gerardo Andres Cisneros, Cecilia Clementi, Rosana Collepardo-Guevara, Peter Coveney, Roberto Covino , et al. (103 additional authors not shown)

    Abstract: This letter illustrates the opinion of the molecular dynamics (MD) community on the need to adopt a new FAIR paradigm for the use of molecular simulations. It highlights the necessity of a collaborative effort to create, establish, and sustain a database that allows findability, accessibility, interoperability, and reusability of molecular dynamics simulation data. Such a development would democra… ▽ More

    Submitted 3 April, 2025; v1 submitted 23 July, 2024; originally announced July 2024.

    Journal ref: Nat Methods (2025)

  4. arXiv:2406.12159  [pdf, other

    cs.CL cs.LG

    Exploring the Impact of a Transformer's Latent Space Geometry on Downstream Task Performance

    Authors: Anna C. Marbut, John W. Chandler, Travis J. Wheeler

    Abstract: It is generally thought that transformer-based large language models benefit from pre-training by learning generic linguistic knowledge that can be focused on a specific task during fine-tuning. However, we propose that much of the benefit from pre-training may be captured by geometric characteristics of the latent space representations, divorced from any specific linguistic knowledge. In this wor… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2212.08172  [pdf, other

    cs.LG cs.CL

    Reliable Measures of Spread in High Dimensional Latent Spaces

    Authors: Anna C. Marbut, Katy McKinney-Bock, Travis J. Wheeler

    Abstract: Understanding geometric properties of natural language processing models' latent spaces allows the manipulation of these properties for improved performance on downstream tasks. One such property is the amount of data spread in a model's latent space, or how fully the available latent space is being used. In this work, we define data spread and demonstrate that the commonly used measures of data s… ▽ More

    Submitted 31 July, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 24 pages, 11 figures, 13 tables

  6. SODA: a TypeScript/JavaScript Library for Visualizing Biological Sequence Annotation

    Authors: Jack W. Roddy, George T. Lesica, Travis J. Wheeler

    Abstract: We present SODA, a lightweight and open-source visualization library for biological sequence annotations that enables straightforward development of flexible, dynamic, and interactive web graphics. SODA is implemented in TypeScript and can be used as a library within TypeScript and JavaScript.

    Submitted 12 May, 2022; originally announced May 2022.