-
ZAPBench: A Benchmark for Whole-Brain Activity Prediction in Zebrafish
Authors:
Jan-Matthis Lueckmann,
Alexander Immer,
Alex Bo-Yuan Chen,
Peter H. Li,
Mariela D. Petkova,
Nirmala A. Iyer,
Luuk Willem Hesselink,
Aparna Dev,
Gudrun Ihrke,
Woohyun Park,
Alyson Petruncio,
Aubrey Weigel,
Wyatt Korff,
Florian Engert,
Jeff W. Lichtman,
Misha B. Ahrens,
Michał Januszewski,
Viren Jain
Abstract:
Data-driven benchmarks have led to significant progress in key scientific modeling domains including weather and structural biology. Here, we introduce the Zebrafish Activity Prediction Benchmark (ZAPBench) to measure progress on the problem of predicting cellular-resolution neural activity throughout an entire vertebrate brain. The benchmark is based on a novel dataset containing 4d light-sheet m…
▽ More
Data-driven benchmarks have led to significant progress in key scientific modeling domains including weather and structural biology. Here, we introduce the Zebrafish Activity Prediction Benchmark (ZAPBench) to measure progress on the problem of predicting cellular-resolution neural activity throughout an entire vertebrate brain. The benchmark is based on a novel dataset containing 4d light-sheet microscopy recordings of over 70,000 neurons in a larval zebrafish brain, along with motion stabilized and voxel-level cell segmentations of these data that facilitate development of a variety of forecasting methods. Initial results from a selection of time series and volumetric video modeling approaches achieve better performance than naive baseline methods, but also show room for further improvement. The specific brain used in the activity recording is also undergoing synaptic-level anatomical mapping, which will enable future integration of detailed structural information into forecasting methods.
△ Less
Submitted 4 March, 2025;
originally announced March 2025.
-
Forecasting Whole-Brain Neuronal Activity from Volumetric Video
Authors:
Alexander Immer,
Jan-Matthis Lueckmann,
Alex Bo-Yuan Chen,
Peter H. Li,
Mariela D. Petkova,
Nirmala A. Iyer,
Aparna Dev,
Gudrun Ihrke,
Woohyun Park,
Alyson Petruncio,
Aubrey Weigel,
Wyatt Korff,
Florian Engert,
Jeff W. Lichtman,
Misha B. Ahrens,
Viren Jain,
Michał Januszewski
Abstract:
Large-scale neuronal activity recordings with fluorescent calcium indicators are increasingly common, yielding high-resolution 2D or 3D videos. Traditional analysis pipelines reduce this data to 1D traces by segmenting regions of interest, leading to inevitable information loss. Inspired by the success of deep learning on minimally processed data in other domains, we investigate the potential of f…
▽ More
Large-scale neuronal activity recordings with fluorescent calcium indicators are increasingly common, yielding high-resolution 2D or 3D videos. Traditional analysis pipelines reduce this data to 1D traces by segmenting regions of interest, leading to inevitable information loss. Inspired by the success of deep learning on minimally processed data in other domains, we investigate the potential of forecasting neuronal activity directly from volumetric videos. To capture long-range dependencies in high-resolution volumetric whole-brain recordings, we design a model with large receptive fields, which allow it to integrate information from distant regions within the brain. We explore the effects of pre-training and perform extensive model selection, analyzing spatio-temporal trade-offs for generating accurate forecasts. Our model outperforms trace-based forecasting approaches on ZAPBench, a recently proposed benchmark on whole-brain activity prediction in zebrafish, demonstrating the advantages of preserving the spatial structure of neuronal activity.
△ Less
Submitted 27 February, 2025;
originally announced March 2025.
-
Generalists vs. Specialists: Evaluating LLMs on Highly-Constrained Biophysical Sequence Optimization Tasks
Authors:
Angelica Chen,
Samuel D. Stanton,
Frances Ding,
Robert G. Alberstein,
Andrew M. Watkins,
Richard Bonneau,
Vladimir Gligorijević,
Kyunghyun Cho,
Nathan C. Frey
Abstract:
Although large language models (LLMs) have shown promise in biomolecule optimization problems, they incur heavy computational costs and struggle to satisfy precise constraints. On the other hand, specialized solvers like LaMBO-2 offer efficiency and fine-grained control but require more domain expertise. Comparing these approaches is challenging due to expensive laboratory validation and inadequat…
▽ More
Although large language models (LLMs) have shown promise in biomolecule optimization problems, they incur heavy computational costs and struggle to satisfy precise constraints. On the other hand, specialized solvers like LaMBO-2 offer efficiency and fine-grained control but require more domain expertise. Comparing these approaches is challenging due to expensive laboratory validation and inadequate synthetic benchmarks. We address this by introducing Ehrlich functions, a synthetic test suite that captures the geometric structure of biophysical sequence optimization problems. With prompting alone, off-the-shelf LLMs struggle to optimize Ehrlich functions. In response, we propose LLOME (Language Model Optimization with Margin Expectation), a bilevel optimization routine for online black-box optimization. When combined with a novel preference learning loss, we find LLOME can not only learn to solve some Ehrlich functions, but can even outperform LaMBO-2 on moderately difficult Ehrlich variants. However, LLOME is comparable to LaMBO-2 on very easy or difficult variants, exhibits some likelihood-reward miscalibration, and struggles without explicit rewards. Our results indicate LLMs can provide significant benefits in some cases, but specialized solvers are still competitive and incur less overhead.
△ Less
Submitted 2 April, 2025; v1 submitted 29 October, 2024;
originally announced October 2024.
-
Focal Loss Analysis of Peripapillary Nerve Fiber Layer Reflectance for Glaucoma Diagnosis
Authors:
Ou Tan,
Dongseok Choi,
Aiyin Chen,
David S. Greenfield,
Brian A. Francis,
Rohit Varma,
Joel S. Schuman,
David Huang,
Advanced Imaging for Glaucoma Study Group
Abstract:
Purpose: To evaluate nerve fiber layer (NFL) reflectance for glaucoma diagnosis using a large dataset. Methods: Participants were imaged with 4.9mm ONH scans using spectral-domain optical coherence tomography (OCT). The NFL reflectance map was reconstructed from 13 concentric rings of optic nerve head(ONH) scan, then processed by an azimuthal filter to reduce directional reflectance bias due to va…
▽ More
Purpose: To evaluate nerve fiber layer (NFL) reflectance for glaucoma diagnosis using a large dataset. Methods: Participants were imaged with 4.9mm ONH scans using spectral-domain optical coherence tomography (OCT). The NFL reflectance map was reconstructed from 13 concentric rings of optic nerve head(ONH) scan, then processed by an azimuthal filter to reduce directional reflectance bias due to variation of beam incidence angle. The peripapillary thickness and reflectance maps were both divided into 96 superpixels. Low-reflectance and low-thickness superpixels were defined as values below the 5th percentile normative reference for that location. Focal reflectance loss was measured by summing loss, relative to the normal reference average, in low-reflectance superpixels. Focal thickness loss was calculated in a similar fashion. The area under receiving characteristic curve (AROC) was used to assess diagnostic accuracy. Results: Fifty-three normal, 196 pre-perimetric, 132 early perimetric, and 59 moderate and advanced perimetric glaucoma participants were included from the Advanced Imaging for Glaucoma Study. Sixty-seven percent of glaucomatous reflectance maps showed characteristic contiguous wedge or diffuse defects. Focal NFL reflectance loss had significantly higher diagnostic accuracy than the best NFL thickness parameters (both map-based and profile-based): AROC 0.80 v. 0.75 (p<0.004) for distinguishing glaucoma eyes from healthy control eyes. The diagnostic sensitivity was also significantly higher at both 99% and 95% specificity operating points. Conclusions: Focal NFL reflectance loss improved glaucoma diagnostic accuracy compared to the standard NFL thickness parameters.
△ Less
Submitted 31 May, 2024;
originally announced June 2024.
-
Feasibility of Identifying Factors Related to Alzheimer's Disease and Related Dementia in Real-World Data
Authors:
Aokun Chen,
Qian Li,
Yu Huang,
Yongqiu Li,
Yu-neng Chuang,
Xia Hu,
Serena Guo,
Yonghui Wu,
Yi Guo,
Jiang Bian
Abstract:
A comprehensive view of factors associated with AD/ADRD will significantly aid in studies to develop new treatments for AD/ADRD and identify high-risk populations and patients for prevention efforts. In our study, we summarized the risk factors for AD/ADRD by reviewing existing meta-analyses and review articles on risk and preventive factors for AD/ADRD. In total, we extracted 477 risk factors in…
▽ More
A comprehensive view of factors associated with AD/ADRD will significantly aid in studies to develop new treatments for AD/ADRD and identify high-risk populations and patients for prevention efforts. In our study, we summarized the risk factors for AD/ADRD by reviewing existing meta-analyses and review articles on risk and preventive factors for AD/ADRD. In total, we extracted 477 risk factors in 10 categories from 537 studies. We constructed an interactive knowledge map to disseminate our study results. Most of the risk factors are accessible from structured Electronic Health Records (EHRs), and clinical narratives show promise as information sources. However, evaluating genomic risk factors using RWD remains a challenge, as genetic testing for AD/ADRD is still not a common practice and is poorly documented in both structured and unstructured EHRs. Considering the constantly evolving research on AD/ADRD risk factors, literature mining via NLP methods offers a solution to automatically update our knowledge map.
△ Less
Submitted 3 February, 2024;
originally announced February 2024.
-
Clinical Applications of Plantar Pressure Measurement
Authors:
Kelsey Detels,
David Shin,
Harrison Wilson,
Shanni Zhou,
Andrew Chen,
Jessica Rosendorf,
Atta Taseh,
Bardiya Akhbari,
Joseph H. Schwab,
Hamid Ghaednia
Abstract:
Plantar pressure measurements can provide valuable insight into various health characteristics in patients. In this study, we describe different plantar pressure devices available on the market and their clinical relevance. Current devices are either platform-based or wearable and consist of a variety of sensor technologies: resistive, capacitive, piezoelectric, and optical. The measurements colle…
▽ More
Plantar pressure measurements can provide valuable insight into various health characteristics in patients. In this study, we describe different plantar pressure devices available on the market and their clinical relevance. Current devices are either platform-based or wearable and consist of a variety of sensor technologies: resistive, capacitive, piezoelectric, and optical. The measurements collected from any of these sensors can be utilized for a range of clinical applications including patients with diabetes, trauma, deformity and cerebral palsy, stroke, cervical myelopathy, ankle instability, sports injuries, and Parkinsons disease. However, the proper technology should be selected based on the clinical need and the type of tests being performed on the device. In this review we provide the reader with a simple overview of the existing technologies their advantages and disadvantages and provide application examples for each. Moreover, we suggest new areas in orthopaedic that plantar pressure mapping technology can be utilized for increased quality of care.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
Microfluidics for Hydrodynamics Investigations of Sand Dollar Larvae
Authors:
Wesley A. Chen,
Bryant A. Lopez,
Haley B. Obenshain,
Moses Villeda,
Brian T. Le,
Brenda AAB. Ametepe,
Ariana Lee,
Douglas A. Pace,
Siavash Ahrar
Abstract:
The life cycle of most marine invertebrates includes a planktonic larval stage before metamorphosis to bottom-dwelling adulthood. During larval stage, ciliary-mediated activity enables feeding (capture unicellular algae) and transport of materials (oxygen) required for the larva's growth, development, and successful metamorphosis. Investigating the underlying hydrodynamics of these behaviors is va…
▽ More
The life cycle of most marine invertebrates includes a planktonic larval stage before metamorphosis to bottom-dwelling adulthood. During larval stage, ciliary-mediated activity enables feeding (capture unicellular algae) and transport of materials (oxygen) required for the larva's growth, development, and successful metamorphosis. Investigating the underlying hydrodynamics of these behaviors is valuable for addressing fundamental biological questions (e.g., phenotypic plasticity) and advancing engineering applications. In this work, we combined microfluidics and fluorescence microscopy as a miniaturized PIV (mPIV) to study ciliary-medicated hydrodynamics during suspension feeding in sand dollar larvae (Dendraster excentricus). First, we confirmed the approach's feasibility by examining the underlying hydrodynamics (vortex patterns) for low- and high-fed larvae. Next, ciliary hydrodynamics were tracked from 11 days post-fertilization (DPF) to 20 DPF for 21 low-fed larvae. Microfluidics enabled the examination of baseline activities (without external flow) and behaviors in the presence of environmental cues (external flow). A library of qualitative vortex patterns and quantitative hydrodynamics was generated and shared as a stand alone repository. Results from mPIV (velocities) were used to examine the role of ciliary activity in transporting materials (oxygen). Given the laminar flow and the viscosity-dominated environments surrounding the larvae, overcoming the diffusive boundary layer is critical for the organism's survival. Peclet number analysis for oxygen transport suggested that ciliary velocities help overcome the diffusion dominated transport (max Pe numbers between 30-60). Microfluidics serving as mPIV provided a scalable and accessible approach for investigating the ciliary hydrodynamics of marine organisms.
△ Less
Submitted 29 December, 2023;
originally announced January 2024.
-
Defining Reference Sequences for Nocardia Species by Similarity and Clustering Analyses of 16S rRNA Gene Sequence Data
Authors:
Manal Helal,
Fanrong Kong,
Sharon C. A. Chen,
Michael Bain,
Richard Christen,
Vitali Sintchenko
Abstract:
The intra- and inter-species genetic diversity of bacteria and the absence of 'reference', or the most representative, sequences of individual species present a significant challenge for sequence-based identification. The aims of this study were to determine the utility, and compare the performance of several clustering and classification algorithms to identify the species of 364 sequences of 16S…
▽ More
The intra- and inter-species genetic diversity of bacteria and the absence of 'reference', or the most representative, sequences of individual species present a significant challenge for sequence-based identification. The aims of this study were to determine the utility, and compare the performance of several clustering and classification algorithms to identify the species of 364 sequences of 16S rRNA gene with a defined species in GenBank, and 110 sequences of 16S rRNA gene with no defined species, all within the genus Nocardia. A total of 364 16S rRNA gene sequences of Nocardia species were studied. In addition, 110 16S rRNA gene sequences assigned only to the Nocardia genus level at the time of submission to GenBank were used for machine learning classification experiments. Different clustering algorithms were compared with a novel algorithm or the linear mapping (LM) of the distance matrix. Principal Components Analysis was used for the dimensionality reduction and visualization. Results: The LM algorithm achieved the highest performance and classified the set of 364 16S rRNA sequences into 80 clusters, the majority of which (83.52%) corresponded with the original species. The most representative 16S rRNA sequences for individual Nocardia species have been identified as 'centroids' in respective clusters from which the distances to all other sequences were minimized; 110 16S rRNA gene sequences with identifications recorded only at the genus level were classified using machine learning methods. Simple kNN machine learning demonstrated the highest performance and classified Nocardia species sequences with an accuracy of 92.7% and a mean frequency of 0.578.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
Deep learning-based Segmentation of Rabbit fetal skull with limited and sub-optimal annotations
Authors:
Rajath Soans,
Alexa Gleason,
Tosha Shah,
Corey Miller,
Barbara Robinson,
Kimberly Brannen,
Antong Chen
Abstract:
In this paper, we propose a deep learning-based method to segment the skeletal structures in the micro-CT images of Dutch-Belted rabbit fetuses which can assist in the assessment of drug-induced skeletal abnormalities as a required study in developmental and reproductive toxicology (DART). Our strategy leverages sub-optimal segmentation labels of 22 skull bones from 26 micro-CT volumes and maps th…
▽ More
In this paper, we propose a deep learning-based method to segment the skeletal structures in the micro-CT images of Dutch-Belted rabbit fetuses which can assist in the assessment of drug-induced skeletal abnormalities as a required study in developmental and reproductive toxicology (DART). Our strategy leverages sub-optimal segmentation labels of 22 skull bones from 26 micro-CT volumes and maps them to 250 unlabeled volumes on which a deep CNN-based segmentation model is trained. In the experiments, our model was able to achieve an average Dice Similarity Coefficient (DSC) of 0.89 across all bones on the testing set, and 14 out of the 26 skull bones reached average DSC >0.93. Our next steps are segmenting the whole body followed by developing a model to classify abnormalities.
△ Less
Submitted 24 May, 2023;
originally announced July 2023.
-
Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery
Authors:
Yulun Wu,
Mikaela Cashman,
Nicholas Choma,
Érica T. Prates,
Verónica G. Melesse Vergara,
Manesh Shah,
Andrew Chen,
Austin Clyde,
Thomas S. Brettin,
Wibe A. de Jong,
Neeraj Kumar,
Martha S. Head,
Rick L. Stevens,
Peter Nugent,
Daniel A. Jacobson,
James B. Brown
Abstract:
We developed Distilled Graph Attention Policy Network (DGAPN), a reinforcement learning model to generate novel graph-structured chemical representations that optimize user-defined objectives by efficiently navigating a physically constrained domain. The framework is examined on the task of generating molecules that are designed to bind, noncovalently, to functional sites of SARS-CoV-2 proteins. W…
▽ More
We developed Distilled Graph Attention Policy Network (DGAPN), a reinforcement learning model to generate novel graph-structured chemical representations that optimize user-defined objectives by efficiently navigating a physically constrained domain. The framework is examined on the task of generating molecules that are designed to bind, noncovalently, to functional sites of SARS-CoV-2 proteins. We present a spatial Graph Attention (sGAT) mechanism that leverages self-attention over both node and edge attributes as well as encoding the spatial structure -- this capability is of considerable interest in synthetic biology and drug discovery. An attentional policy network is introduced to learn the decision rules for a dynamic, fragment-based chemical environment, and state-of-the-art policy gradient techniques are employed to train the network with stability. Exploration is driven by the stochasticity of the action space design and the innovation reward bonuses learned and proposed by random network distillation. In experiments, our framework achieved outstanding results compared to state-of-the-art algorithms, while reducing the complexity of paths to chemical synthesis.
△ Less
Submitted 11 May, 2022; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Towards Understanding the COVID-19 Case Fatality Rate
Authors:
Donghui Yan,
Aiyou Chen,
Buqing Yang
Abstract:
An important parameter for COVID-19 is the case fatality rate (CFR). It has been applied to wide applications, including the measure of the severity of the infection, the estimation of the number of infected cases, risk assessment etc. However, there remains a lack of understanding on several aspects of CFR, including population factors that are important to CFR, the apparent discrepancy of CFRs i…
▽ More
An important parameter for COVID-19 is the case fatality rate (CFR). It has been applied to wide applications, including the measure of the severity of the infection, the estimation of the number of infected cases, risk assessment etc. However, there remains a lack of understanding on several aspects of CFR, including population factors that are important to CFR, the apparent discrepancy of CFRs in different countries, and how the age effect comes into play. We analyze the CFRs at two different time snapshots, July 6 and Dec 28, 2020, with one during the first wave and the other a second wave of the COVID-19 pandemic. We consider two important population covariates, age and GDP as a proxy for the quality and abundance of public health. Extensive exploratory data analysis leads to some interesting findings. First, there is a clear exponential age effect among different age groups, and, more importantly, the exponential index is almost invariant across countries and time in the pandemic. Second, the roles played by the age and GDP are a little surprising: during the first wave, age is a more significant factor than GDP, while their roles have switched during the second wave of the pandemic, which may be partially explained by the delay in time for the quality and abundance of public health and medical research to factor in.
△ Less
Submitted 1 March, 2021;
originally announced March 2021.
-
Adaptive Semi-Supervised Intent Inferral to Control a Powered Hand Orthosis for Stroke
Authors:
Jingxi Xu,
Cassie Meeker,
Ava Chen,
Lauren Winterbottom,
Michaela Fraser,
Sangwoo Park,
Lynne M. Weber,
Mitchell Miya,
Dawn Nilsen,
Joel Stein,
Matei Ciocarlie
Abstract:
In order to provide therapy in a functional context, controls for wearable robotic orthoses need to be robust and intuitive. We have previously introduced an intuitive, user-driven, EMG-based method to operate a robotic hand orthosis, but the process of training a control that is robust to concept drift (changes in the input signal) places a substantial burden on the user. In this paper, we explor…
▽ More
In order to provide therapy in a functional context, controls for wearable robotic orthoses need to be robust and intuitive. We have previously introduced an intuitive, user-driven, EMG-based method to operate a robotic hand orthosis, but the process of training a control that is robust to concept drift (changes in the input signal) places a substantial burden on the user. In this paper, we explore semi-supervised learning as a paradigm for controlling a powered hand orthosis for stroke subjects. To the best of our knowledge, this is the first use of semi-supervised learning for an orthotic application. Specifically, we propose a disagreement-based semi-supervision algorithm for handling intrasession concept drift based on multimodal ipsilateral sensing. We evaluate the performance of our algorithm on data collected from five stroke subjects. Our results show that the proposed algorithm helps the device adapt to intrasession drift using unlabeled data and reduces the training burden placed on the user. We also validate the feasibility of our proposed algorithm with a functional task; in these experiments, two subjects successfully completed multiple instances of a pick-and-handover task.
△ Less
Submitted 1 March, 2022; v1 submitted 30 October, 2020;
originally announced November 2020.
-
PECAIQR: A Model for Infectious Disease Applied to the Covid-19 Epidemic
Authors:
Richard Bao,
August Chen,
Jethin Gowda,
Shiva Mudide
Abstract:
The Covid-19 pandemic has made clear the need to improve modern multivariate time-series forecasting models. Current state of the art predictions of future daily deaths and, especially, hospital resource usage have confidence intervals that are unacceptably wide. Policy makers and hospitals require accurate forecasts to make informed decisions on passing legislation and allocating resources. We us…
▽ More
The Covid-19 pandemic has made clear the need to improve modern multivariate time-series forecasting models. Current state of the art predictions of future daily deaths and, especially, hospital resource usage have confidence intervals that are unacceptably wide. Policy makers and hospitals require accurate forecasts to make informed decisions on passing legislation and allocating resources. We used US county-level data on daily deaths and population statistics to forecast future deaths. We extended the SIR epidemiological model to a novel model we call the PECAIQR model. It adds several new variables and parameters to the naive SIR model by taking into account the ramifications of the partial quarantining implemented in the US. We fitted data to the model parameters with numerical integration. Because of the fit degeneracy in parameter space and non-constant nature of the parameters, we developed several methods to optimize our fit, such as training on the data tail and training on specific policy regimes. We use cross-validation to tune our hyper parameters at the county level and generate a CDF for future daily deaths. For predictions made from training data up to May 25th, we consistently obtained an averaged pinball loss score of 0.096 on a 14 day forecast. We finally present examples of possible avenues for utility from our model. We generate longer-time horizon predictions over various 1-month windows in the past, forecast how many medical resources such as ventilators and ICU beds will be needed in counties, and evaluate the efficacy of our model in other countries.
△ Less
Submitted 17 June, 2020;
originally announced June 2020.
-
Focal Loss Analysis of Nerve Fiber Layer Reflectance for Glaucoma Diagnosis
Authors:
Ou Tan,
Liang Liu,
Qisheng You,
Jie Wang,
Aiyin Chen,
Eliesa Ing,
John C. Morrison,
Yali Jia,
David Huang
Abstract:
Purpose: To evaluate nerve fiber layer (NFL) reflectance for glaucoma diagnosis. Methods: Participants were imaged with 4.5X4.5-mm volumetric disc scans using spectral-domain optical coherence tomography (OCT). The normalized NFL reflectance map was processed by an azimuthal filter to reduce directional reflectance bias due to variation of beam incidence angle. The peripapillary area of the map wa…
▽ More
Purpose: To evaluate nerve fiber layer (NFL) reflectance for glaucoma diagnosis. Methods: Participants were imaged with 4.5X4.5-mm volumetric disc scans using spectral-domain optical coherence tomography (OCT). The normalized NFL reflectance map was processed by an azimuthal filter to reduce directional reflectance bias due to variation of beam incidence angle. The peripapillary area of the map was divided into 160 superpixels. Average reflectance was the mean of superpixel reflectance. Low-reflectance superpixels were identified as those with NFL reflectance below the 5 percentile normative cutoff. Focal reflectance loss was measure by summing loss in low-reflectance superpixels. Results: Thirty-five normal, 30 pre-perimetric and 35 perimetric glaucoma participants were enrolled. Azimuthal filtering improved the repeatability of the normalized NFL reflectance, as measured by the pooled superpixel standard deviation (SD), from 0.73 to 0.57 dB (p<0.001, paired t-test) and reduced the population SD from 2.14 to 1.78 dB (p<0.001, t-test). Most glaucomatous reflectance maps showed characteristic patterns of contiguous wedge or diffuse defects. Focal NFL reflectance loss had significantly higher diagnostic sensitivity than the best NFL thickness parameter (overall, inferior, or focal loss volume): 53% v. 23% (p=0.027) in PPG eyes and 100% v. 80% (p=0.023) in PG eyes, with the specificity fixed at 99%. Conclusions: Azimuthal filtering reduces the variability of NFL reflectance measurements. Focal NFL reflectance loss has excellent glaucoma diagnostic accuracy compared to the standard NFL thickness parameters. The reflectance map may be useful for localizing NFL defects.
△ Less
Submitted 24 June, 2020;
originally announced June 2020.
-
Using Reports of Own and Others' Symptoms and Diagnosis on Social Media to Predict COVID-19 Case Counts: Observational Infoveillance Study in Mainland China
Authors:
Cuihua Shen,
Anfan Chen,
Chen Luo,
Jingwen Zhang,
Bo Feng,
Wang Liao
Abstract:
Can public social media data be harnessed to predict COVID-19 case counts? We analyzed approximately 15 million COVID-19 related posts on Weibo, a popular Twitter-like social media platform in China, from November 1, 2019 to March 31, 2020. We developed a machine learning classifier to identify "sick posts," which are reports of one's own and other people's symptoms and diagnosis related to COVID-…
▽ More
Can public social media data be harnessed to predict COVID-19 case counts? We analyzed approximately 15 million COVID-19 related posts on Weibo, a popular Twitter-like social media platform in China, from November 1, 2019 to March 31, 2020. We developed a machine learning classifier to identify "sick posts," which are reports of one's own and other people's symptoms and diagnosis related to COVID-19. We then modeled the predictive power of sick posts and other COVID-19 posts on daily case counts. We found that reports of symptoms and diagnosis of COVID-19 significantly predicted daily case counts, up to 14 days ahead of official statistics. But other COVID-19 posts did not have similar predictive power. For a subset of geotagged posts (3.10% of all retrieved posts), we found that the predictive pattern held true for both Hubei province and the rest of mainland China, regardless of unequal distribution of healthcare resources and outbreak timeline. Researchers and disease control agencies should pay close attention to the social media infosphere regarding COVID-19. On top of monitoring overall search and posting activities, it is crucial to sift through the contents and efficiently identify true signals from noise.
△ Less
Submitted 4 August, 2020; v1 submitted 13 April, 2020;
originally announced April 2020.
-
Neural codes, decidability, and a new local obstruction to convexity
Authors:
Aaron Chen,
Florian Frick,
Anne Shiu
Abstract:
Given an intersection pattern of arbitrary sets in Euclidean space, is there an arrangement of convex open sets in Euclidean space that exhibits the same intersections? This question is combinatorial and topological in nature, but is motivated by neuroscience. Specifically, we are interested in a type of neuron called a place cell, which fires precisely when an organism is in a certain region, usu…
▽ More
Given an intersection pattern of arbitrary sets in Euclidean space, is there an arrangement of convex open sets in Euclidean space that exhibits the same intersections? This question is combinatorial and topological in nature, but is motivated by neuroscience. Specifically, we are interested in a type of neuron called a place cell, which fires precisely when an organism is in a certain region, usually convex, called a place field. The earlier question, therefore, can be rephrased as follows: Which neural codes, that is, patterns of neural activity, can arise from a collection of convex open sets? To address this question, Giusti and Itskov proved that convex neural codes have no "local obstructions," which are defined via the topology of a code's simplicial complex. Codes without local obstructions are called locally good, because the obstruction precludes the code from encoding the intersections of open sets that form a good cover. In other words, every good-cover code is locally good. Here we prove the converse: Every locally good code is a good-cover code. We also prove that the good-cover decision problem is undecidable. Finally, we reveal a stronger type of local obstruction that prevents a code from being convex, and prove that the corresponding decision problem is NP-hard. Our proofs use combinatorial and topological methods.
△ Less
Submitted 26 September, 2018; v1 submitted 30 March, 2018;
originally announced March 2018.
-
Fast and Accurate Semi-Automatic Segmentation Tool for Brain Tumor MRIs
Authors:
Andrew X. Chen,
Raúl Rabadán
Abstract:
Segmentation, the process of delineating tumor apart from healthy tissue, is a vital part of both the clinical assessment and the quantitative analysis of brain cancers. Here, we provide an open-source algorithm (MITKats), built on the Medical Imaging Interaction Toolkit, to provide user-friendly and expedient tools for semi-automatic segmentation. To evaluate its performance against competing alg…
▽ More
Segmentation, the process of delineating tumor apart from healthy tissue, is a vital part of both the clinical assessment and the quantitative analysis of brain cancers. Here, we provide an open-source algorithm (MITKats), built on the Medical Imaging Interaction Toolkit, to provide user-friendly and expedient tools for semi-automatic segmentation. To evaluate its performance against competing algorithms, we applied MITKats to 38 high-grade glioma cases from publicly available benchmarks. The similarity of the segmentations to expert-delineated ground truths approached the discrepancies among different manual raters, the theoretically maximal precision. The average time spent on each segmentation was 5 minutes, making MITKats between 4 and 11 times faster than competing semi-automatic algorithms, while retaining similar accuracy.
△ Less
Submitted 18 May, 2017;
originally announced May 2017.
-
Genetic drift suppresses bacterial conjugation in spatially structured populations
Authors:
Peter D. Freese,
Kirill S. Korolev,
Jose I. Jimenez,
Irene A. Chen
Abstract:
Conjugation is the primary mechanism of horizontal gene transfer that spreads antibiotic resistance among bacteria. Although conjugation normally occurs in surface-associated growth (e.g., biofilms), it has been traditionally studied in well-mixed liquid cultures lacking spatial structure, which is known to affect many evolutionary and ecological processes. Here we visualize spatial patterns of ge…
▽ More
Conjugation is the primary mechanism of horizontal gene transfer that spreads antibiotic resistance among bacteria. Although conjugation normally occurs in surface-associated growth (e.g., biofilms), it has been traditionally studied in well-mixed liquid cultures lacking spatial structure, which is known to affect many evolutionary and ecological processes. Here we visualize spatial patterns of gene transfer mediated by F plasmid conjugation in a colony of Escherichia coli growing on solid agar, and we develop a quantitative understanding by spatial extension of traditional mass-action models. We found that spatial structure suppresses conjugation in surface-associated growth because strong genetic drift leads to spatial isolation of donor and recipient cells, restricting conjugation to rare boundaries between donor and recipient strains. These results suggest that ecological strategies, such as enforcement of spatial structure and enhancement of genetic drift, could complement molecular strategies in slowing the spread of antibiotic resistance genes.
△ Less
Submitted 24 February, 2014;
originally announced February 2014.
-
Dynamics of a producer-parasite ecosystem on the brink of collapse
Authors:
Andrew Chen,
Alvaro Sanchez,
Lei Dai,
Jeff Gore
Abstract:
Ecosystems can undergo sudden shifts to undesirable states, but recent studies with simple single species ecosystems have demonstrated that advance warning can be provided by the slowing down of population dynamics near a tipping point. However, it is not clear how this effect of critical slowing down will manifest in ecosystems with strong interactions between their components. Here we probe the…
▽ More
Ecosystems can undergo sudden shifts to undesirable states, but recent studies with simple single species ecosystems have demonstrated that advance warning can be provided by the slowing down of population dynamics near a tipping point. However, it is not clear how this effect of critical slowing down will manifest in ecosystems with strong interactions between their components. Here we probe the dynamics of an experimental producer parasite ecosystem as it approaches a catastrophic collapse. Surprisingly, the producer population grows in size as the environment deteriorates, highlighting that population size can be a misleading measure of ecosystem stability. By analyzing the oscillatory producer parasite dynamics for over ~100 generations in multiple environmental conditions, we found that the collective ecosystem dynamics slows down as the tipping point is approached. Analysis of the coupled dynamics of interacting populations may therefore be necessary to provide advance warning of collapse in complex communities.
△ Less
Submitted 14 June, 2013;
originally announced June 2013.