-
An improved, high yield method for isolating nuclei from individual zebrafish embryos for single-nucleus RNA sequencing
Authors:
Clifford Rostomily,
Heidi Lee,
Amy Tresenrider,
Riza Daza,
Andrew Mullen,
Jay Shendure,
David Kimelman,
Cole Trapnell
Abstract:
Zebrafish are an ideal system to study the effect(s) of chemical, genetic, and environmental perturbations on development due to their high fecundity and fast growth. Recently, single cell sequencing has emerged as a powerful tool to measure the effect of these perturbations at a whole embryo scale. These types of experiments rely on the ability to isolate nuclei from a large number of individuall…
▽ More
Zebrafish are an ideal system to study the effect(s) of chemical, genetic, and environmental perturbations on development due to their high fecundity and fast growth. Recently, single cell sequencing has emerged as a powerful tool to measure the effect of these perturbations at a whole embryo scale. These types of experiments rely on the ability to isolate nuclei from a large number of individually barcoded zebrafish embryos in parallel. Here we report a method for efficiently isolating high-quality nuclei from zebrafish embryos in a 96-well plate format by bead homogenization in a lysis buffer. Through head-to-head sciPlex-RNA-seq experiments, we demonstrate that this method represents a substantial improvement over enzymatic dissociation and that it is compatible with a wide range of developmental stages.
△ Less
Submitted 22 November, 2024;
originally announced November 2024.
-
Forecasting Opioid Incidents for Rapid Actionable Data for Opioid Response in Kentucky
Authors:
Aaron D. Mullen,
Daniel Harris,
Peter Rock,
Svetla Slavova,
Jeffery Talbert,
V. K. Cody Bumgardner
Abstract:
We present efforts in the fields of machine learning and time series forecasting to accurately predict counts of future opioid overdose incidents recorded by Emergency Medical Services (EMS) in the state of Kentucky. Forecasts are useful to state government agencies to properly prepare and distribute resources related to opioid overdoses effectively. Our approach uses county and district level agg…
▽ More
We present efforts in the fields of machine learning and time series forecasting to accurately predict counts of future opioid overdose incidents recorded by Emergency Medical Services (EMS) in the state of Kentucky. Forecasts are useful to state government agencies to properly prepare and distribute resources related to opioid overdoses effectively. Our approach uses county and district level aggregations of EMS opioid overdose encounters and forecasts future counts for each month. A variety of additional covariates were tested to determine their impact on the model's performance. Models with different levels of complexity were evaluated to optimize training time and accuracy. Our results show that when special precautions are taken to address data sparsity, useful predictions can be generated with limited error by utilizing yearly trends and covariance with additional data sources.
△ Less
Submitted 21 October, 2024;
originally announced October 2024.
-
Toward Automated Clinical Transcriptions
Authors:
Mitchell A. Klusty,
W. Vaiden Logan,
Samuel E. Armstrong,
Aaron D. Mullen,
Caroline N. Leach,
Jeff Talbert,
V. K. Cody Bumgardner
Abstract:
Administrative documentation is a major driver of rising healthcare costs and is linked to adverse outcomes, including physician burnout and diminished quality of care. This paper introduces a secure system that applies recent advancements in speech-to-text transcription and speaker-labeling (diarization) to patient-provider conversations. This system is optimized to produce accurate transcription…
▽ More
Administrative documentation is a major driver of rising healthcare costs and is linked to adverse outcomes, including physician burnout and diminished quality of care. This paper introduces a secure system that applies recent advancements in speech-to-text transcription and speaker-labeling (diarization) to patient-provider conversations. This system is optimized to produce accurate transcriptions and highlight potential errors to promote rapid human verification, further reducing the necessary manual effort. Applied to over 40 hours of simulated conversations, this system offers a promising foundation for automating clinical transcriptions.
△ Less
Submitted 20 September, 2024;
originally announced September 2024.
-
State-Based Automation for Time-Restricted Eating Adherence
Authors:
Samuel E. Armstrong,
Aaron D. Mullen,
J. Matthew Thomas,
Dorothy D. Sears,
Julie S. Pendergast,
Jeffrey Talbert,
Cody Bumgardner
Abstract:
Developing and enforcing study protocols is a foundational component of medical research. As study complexity for participant interactions increases, translating study protocols to supporting application code becomes challenging. A collaboration exists between the University of Kentucky and Arizona State University to determine the efficacy of time-restricted eating in improving metabolic risk amo…
▽ More
Developing and enforcing study protocols is a foundational component of medical research. As study complexity for participant interactions increases, translating study protocols to supporting application code becomes challenging. A collaboration exists between the University of Kentucky and Arizona State University to determine the efficacy of time-restricted eating in improving metabolic risk among postmenopausal women. This study utilizes a graph-based approach to monitor and support adherence to a designated schedule, enabling the validation and step-wise audit of participants' statuses to derive dependable conclusions. A texting service, driven by a participant graph, automatically manages interactions and collects data. Participant data is then accessible to the research study team via a website, which enables viewing, management, and exportation. This paper presents a system for automatically managing participants in a time-restricted eating study that eliminates time-consuming interactions with participants.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Multi-Modal Machine Learning Framework for Automated Seizure Detection in Laboratory Rats
Authors:
Aaron Mullen,
Samuel E. Armstrong,
Jasmine Perdeh,
Bjorn Bauer,
Jeffrey Talbert,
V. K. Cody Bumgardner
Abstract:
A multi-modal machine learning system uses multiple unique data sources and types to improve its performance. This article proposes a system that combines results from several types of models, all of which are trained on different data signals. As an example to illustrate the efficacy of the system, an experiment is described in which multiple types of data are collected from rats suffering from s…
▽ More
A multi-modal machine learning system uses multiple unique data sources and types to improve its performance. This article proposes a system that combines results from several types of models, all of which are trained on different data signals. As an example to illustrate the efficacy of the system, an experiment is described in which multiple types of data are collected from rats suffering from seizures. This data includes electrocorticography readings, piezoelectric motion sensor data, and video recordings. Separate models are trained on each type of data, with the goal of classifying each time frame as either containing a seizure or not. After each model has generated its classification predictions, these results are combined. While each data signal works adequately on its own for prediction purposes, the significant imbalance in class labels leads to increased numbers of false positives, which can be filtered and removed by utilizing all data sources. This paper will demonstrate that, after postprocessing and combination techniques, classification accuracy is improved with this multi-modal system when compared to the performance of each individual data source.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
CLASSify: A Web-Based Tool for Machine Learning
Authors:
Aaron D. Mullen,
Samuel E. Armstrong,
Jeff Talbert,
V. K. Cody Bumgardner
Abstract:
Machine learning classification problems are widespread in bioinformatics, but the technical knowledge required to perform model training, optimization, and inference can prevent researchers from utilizing this technology. This article presents an automated tool for machine learning classification problems to simplify the process of training models and producing results while providing informative…
▽ More
Machine learning classification problems are widespread in bioinformatics, but the technical knowledge required to perform model training, optimization, and inference can prevent researchers from utilizing this technology. This article presents an automated tool for machine learning classification problems to simplify the process of training models and producing results while providing informative visualizations and insights into the data. This tool supports both binary and multiclass classification problems, and it provides access to a variety of models and methods. Synthetic data can be generated within the interface to fill missing values, balance class labels, or generate entirely new datasets. It also provides support for feature evaluation and generates explainability scores to indicate which features influence the output the most. We present CLASSify, an open-source tool for simplifying the user experience of solving classification problems without the need for knowledge of machine learning.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Local Large Language Models for Complex Structured Medical Tasks
Authors:
V. K. Cody Bumgardner,
Aaron Mullen,
Sam Armstrong,
Caylin Hickey,
Jeff Talbert
Abstract:
This paper introduces an approach that combines the language reasoning capabilities of large language models (LLMs) with the benefits of local training to tackle complex, domain-specific tasks. Specifically, the authors demonstrate their approach by extracting structured condition codes from pathology reports. The proposed approach utilizes local LLMs, which can be fine-tuned to respond to specifi…
▽ More
This paper introduces an approach that combines the language reasoning capabilities of large language models (LLMs) with the benefits of local training to tackle complex, domain-specific tasks. Specifically, the authors demonstrate their approach by extracting structured condition codes from pathology reports. The proposed approach utilizes local LLMs, which can be fine-tuned to respond to specific generative instructions and provide structured outputs. The authors collected a dataset of over 150k uncurated surgical pathology reports, containing gross descriptions, final diagnoses, and condition codes. They trained different model architectures, including LLaMA, BERT and LongFormer and evaluated their performance. The results show that the LLaMA-based models significantly outperform BERT-style models across all evaluated metrics, even with extremely reduced precision. The LLaMA models performed especially well with large datasets, demonstrating their ability to handle complex, multi-label tasks. Overall, this work presents an effective approach for utilizing LLMs to perform domain-specific tasks using accessible hardware, with potential applications in the medical domain, where complex data extraction and classification are required.
△ Less
Submitted 3 August, 2023;
originally announced August 2023.
-
SmartState: An Automated Research Protocol Adherence System
Authors:
Samuel E. Armstrong,
Mitchell A. Klusty,
Aaron D. Mullen,
Jeffery C. Talbert,
V. K. Cody Bumgardner
Abstract:
Developing and enforcing study protocols is crucial in medical research, especially as interactions with participants become more intricate. Traditional rules-based systems struggle to provide the automation and flexibility required for real-time, personalized data collection. We introduce SmartState, a state-based system designed to act as a personal agent for each participant, continuously manag…
▽ More
Developing and enforcing study protocols is crucial in medical research, especially as interactions with participants become more intricate. Traditional rules-based systems struggle to provide the automation and flexibility required for real-time, personalized data collection. We introduce SmartState, a state-based system designed to act as a personal agent for each participant, continuously managing and tracking their unique interactions. Unlike traditional reporting systems, SmartState enables real-time, automated data collection with minimal oversight. By integrating large language models to distill conversations into structured data, SmartState reduces errors and safeguards data integrity through built-in protocol and participant auditing. We demonstrate its utility in research trials involving time-dependent participant interactions, addressing the increasing need for reliable automation in complex clinical studies.
△ Less
Submitted 22 January, 2025; v1 submitted 7 May, 2023;
originally announced May 2023.
-
Exclusion and Verification of Remote Nuclear Reactors with a 1-Kiloton Gd-Doped Water Detector
Authors:
O. A. Akindele,
A. Bernstein,
M. Bergevin,
S. A. Dazeley,
F. Sutanto,
A. Mullen,
J. Hecla
Abstract:
To date, antineutrino experiments built for the purpose of demonstrating a nonproliferation capability have typically employed organic scintillators, were situated as close to the core as possible -typically a few meters to tens of meters distant and have not exceeded a few tons in size. One problem with this approach is that proximity to the reactor core require accommodation by the host facility…
▽ More
To date, antineutrino experiments built for the purpose of demonstrating a nonproliferation capability have typically employed organic scintillators, were situated as close to the core as possible -typically a few meters to tens of meters distant and have not exceeded a few tons in size. One problem with this approach is that proximity to the reactor core require accommodation by the host facility. Water Cherenkov detectors located offsite, at distances of a few kilometers or greater, may facilitate non-intrusive monitoring and verification of reactor activities over a large area. As the standoff distance increases, the detector target mass must scale accordingly. This article quantifies the degree to which a kiloton-scale gadolinium-doped water-Cherenkov detector can exclude the existence of undeclared reactors within a specified distance, and remotely detect the presence of a hidden reactor in the presence of declared reactors, by verifying the operational power and standoff distance using a Feldman-Cousins based likelihood analysis. A 1-kton scale (fiducial) water Cherenkov detector can exclude gigawatt-scale nuclear reactors up to tens of kilometers within a year. When attempting to identify the specific range and power of a reactor, the detector energy resolution was not sufficient to delineate between the two.
△ Less
Submitted 17 October, 2022;
originally announced October 2022.
-
Improvement in light collection of a photomultiplier tube using a wavelength-shifting plate
Authors:
Austin Mullen,
Oluwatomi Akindele,
Marc Bergevin,
Adam Bernstein,
Steven Dazeley
Abstract:
Large-volume water-Cherenkov neutrino detectors are a light-starved environment, as each interaction produces only $\sim 50-100$ photons per MeV. As such, maximizing the light collection efficiency of the detector is vital to performance. Since Cherenkov emission is heavily weighted towards the near UV, one method to maximize overall detector light collection without increasing the number of photo…
▽ More
Large-volume water-Cherenkov neutrino detectors are a light-starved environment, as each interaction produces only $\sim 50-100$ photons per MeV. As such, maximizing the light collection efficiency of the detector is vital to performance. Since Cherenkov emission is heavily weighted towards the near UV, one method to maximize overall detector light collection without increasing the number of photomultiplier tubes is to couple each tube to a wavelength-shifting plastic plate, thus shifting photon wavelengths to a regime better suited to maximize photomultiplier efficiency and potentially detecting photons that miss the photocathode. To better understand the behavior of such plates, a scan of a rectangular wavelength-shifting plate was performed, and the results were used to calculate the overall percentage improvement in light collection that could be expected for individual PMTs in a large water-Cherenkov detector. Measurements of a 15.1 in. by 11.5 in. wavelength-shifting plate using a 365 nm LED were found to increase overall light collection at the photomultiplier tube by $7.4\pm0.7\%$. A simulation tuned to reproduce these results was used to predict the behavior of a wavelength shifting plate exposed to Cherenkov spectrum light and found increases in light collection that were linear with edge length, assuming square geometries. These results demonstrate the potential of wavelength-shifting plates to increase the overall light collection efficiency in a large detector.
△ Less
Submitted 12 April, 2022;
originally announced April 2022.
-
A Call to Arms Control: Synergies between Nonproliferation Applications of Neutrino Detectors and Large-Scale Fundamental Neutrino Physics Experiments
Authors:
T. Akindele,
T. Anderson,
E. Anderssen,
M. Askins,
M. Bohles,
A. J. Bacon,
Z. Bagdasarian,
A. Baldoni,
A. Barna,
N. Barros,
L. Bartoszek,
A. Bat,
E. W. Beier,
T. Benson,
M. Bergevin,
A. Bernstein,
B. Birrittella,
E. Blucher,
J. Boissevain,
R. Bonventre,
J. Borusinki,
E. Bourret,
D. Brown,
E. J. Callaghan,
J. Caravaca
, et al. (140 additional authors not shown)
Abstract:
The High Energy Physics community can benefit from a natural synergy in research activities into next-generation large-scale water and scintillator neutrino detectors, now being studied for remote reactor monitoring, discovery and exclusion applications in cooperative nonproliferation contexts.
Since approximately 2010, US nonproliferation researchers, supported by the National Nuclear Security…
▽ More
The High Energy Physics community can benefit from a natural synergy in research activities into next-generation large-scale water and scintillator neutrino detectors, now being studied for remote reactor monitoring, discovery and exclusion applications in cooperative nonproliferation contexts.
Since approximately 2010, US nonproliferation researchers, supported by the National Nuclear Security Administration (NNSA), have been studying a range of possible applications of relatively large (100 ton) to very large (hundreds of kiloton) water and scintillator neutrino detectors.
In parallel, the fundamental physics community has been developing detectors at similar scales and with similar design features for a range of high-priority physics topics, primarily in fundamental neutrino physics. These topics include neutrino oscillation studies at beams and reactors, solar, and geological neutrino measurements, supernova studies, and others.
Examples of ongoing synergistic work at U.S. national laboratories and universities include prototype gadolinium-doped water and water-based and opaque scintillator test-beds and demonstrators, extensive testing and industry partnerships related to large area fast position-sensitive photomultiplier tubes, and the development of concepts for a possible underground kiloton-scale water-based detector for reactor monitoring and technology demonstrations.
Some opportunities for engagement between the two communities include bi-annual Applied Antineutrino Physics conferences, collaboration with U.S. National Laboratories engaging in this research, and occasional NNSA funding opportunities supporting a blend of nonproliferation and basic science R&D, directed at the U.S. academic community.
△ Less
Submitted 20 April, 2022; v1 submitted 28 February, 2022;
originally announced March 2022.
-
Development of a low background liquid scintillation counter for a shallow underground laboratory
Authors:
J. L. Erchinger,
C. E. Aalseth,
B. E. Bernacki,
M. Douglas,
E. S. Fuller,
M. E. Keillor,
S. M. Morley,
C. A. Mullen,
J. L. Orrell,
M. E. Panisko,
G. A. Warren,
R. O. Williams,
M. E. Wright
Abstract:
Pacific Northwest National Laboratory has recently opened a shallow underground laboratory intended for measurement of low-concentration levels of radioactive isotopes in samples collected from the environment. The development of a low-background liquid scintillation counter is currently underway to further augment the measurement capabilities within this underground laboratory. Liquid scintillati…
▽ More
Pacific Northwest National Laboratory has recently opened a shallow underground laboratory intended for measurement of low-concentration levels of radioactive isotopes in samples collected from the environment. The development of a low-background liquid scintillation counter is currently underway to further augment the measurement capabilities within this underground laboratory. Liquid scintillation counting is especially useful for measuring charged particle (e.g., $β$, $α$) emitting isotopes with no (or very weak) gamma-ray yields. The combination of high-efficiency detection of charged particle emission in a liquid scintillation cocktail coupled with the low-background environment of an appropriately-designed shield located in a clean underground laboratory provides the opportunity for increased-sensitivity measurements of a range of isotopes. To take advantage of the 35 meters-water-equivalent overburden of the underground laboratory, a series of simulations have evaluated the scintillation counter's shield design requirements to assess the possible background rate achievable. This report presents the design and background evaluation for a shallow underground, low background liquid scintillation counter design for sample measurements.
△ Less
Submitted 20 December, 2015;
originally announced December 2015.
-
PTFE treatment by remote atmospheric Ar/O2 plasmas: a simple reaction scheme model proposal
Authors:
E. A. D. Carbone,
M. W. G. M. Verhoeven,
W. Keuning,
J. J. A. M. van der Mullen
Abstract:
Polytetrafluoroethylene (PTFE) samples were treated by a remote atmospheric pressure microwave plasma torch and analyzed by water contact angle (WCA) and X-ray photoelectron spectroscopy (XPS). In the case of pure argon plasma a decrease of WCA is observed meanwhile an increase of hydrophobicity was observed when some oxygen was added to the discharge. The WCA results are correlated to XPS of refe…
▽ More
Polytetrafluoroethylene (PTFE) samples were treated by a remote atmospheric pressure microwave plasma torch and analyzed by water contact angle (WCA) and X-ray photoelectron spectroscopy (XPS). In the case of pure argon plasma a decrease of WCA is observed meanwhile an increase of hydrophobicity was observed when some oxygen was added to the discharge. The WCA results are correlated to XPS of reference samples and the change of WCA are attributed to changes in roughness of the samples. A simple kinetics scheme for the chemistry on the PTFE surface is proposed to explain the results.
△ Less
Submitted 15 March, 2013;
originally announced March 2013.
-
Deviations from the local field approximation in negative streamer heads
Authors:
Chao Li,
W. J. M. Brok,
Ute Ebert,
J. J. A. M. van der Mullen
Abstract:
Negative streamer ionization fronts in nitrogen under normal conditions are investigated both in a particle model and in a fluid model in local field approximation. The parameter functions for the fluid model are derived from swarm experiments in the particle model. The front structure on the inner scale is investigated in a 1D setting, allowing reasonable run-time and memory consumption and hig…
▽ More
Negative streamer ionization fronts in nitrogen under normal conditions are investigated both in a particle model and in a fluid model in local field approximation. The parameter functions for the fluid model are derived from swarm experiments in the particle model. The front structure on the inner scale is investigated in a 1D setting, allowing reasonable run-time and memory consumption and high numerical accuracy without introducing super-particles. If the reduced electric field immediately before the front is >= 50kV/(cm bar), solutions of fluid and particle model agree very well. If the field increases up to 200kV/(cm bar), the solutions of particle and fluid model deviate, in particular, the ionization level behind the front becomes up to 60% higher in the particle model while the velocity is rather insensitive. Particle and fluid model deviate because electrons with high energies do not yet fully run away from the front, but are somewhat ahead. This leads to increasing ionization rates in the particle model at the very tip of the front. The energy overshoot of electrons in the leading edge of the front actually agrees quantitatively with the energy overshoot in the leading edge of an electron swarm or avalanche in the same electric field.
△ Less
Submitted 15 February, 2007;
originally announced February 2007.