-
Knowledge-guided Contextual Gene Set Analysis Using Large Language Models
Authors:
Zhizheng Wang,
Chi-Ping Day,
Chih-Hsuan Wei,
Qiao Jin,
Robert Leaman,
Yifan Yang,
Shubo Tian,
Aodong Qiu,
Yin Fang,
Qingqing Zhu,
Xinghua Lu,
Zhiyong Lu
Abstract:
Gene set analysis (GSA) is a foundational approach for interpreting genomic data of diseases by linking genes to biological processes. However, conventional GSA methods overlook clinical context of the analyses, often generating long lists of enriched pathways with redundant, nonspecific, or irrelevant results. Interpreting these requires extensive, ad-hoc manual effort, reducing both reliability…
▽ More
Gene set analysis (GSA) is a foundational approach for interpreting genomic data of diseases by linking genes to biological processes. However, conventional GSA methods overlook clinical context of the analyses, often generating long lists of enriched pathways with redundant, nonspecific, or irrelevant results. Interpreting these requires extensive, ad-hoc manual effort, reducing both reliability and reproducibility. To address this limitation, we introduce cGSA, a novel AI-driven framework that enhances GSA by incorporating context-aware pathway prioritization. cGSA integrates gene cluster detection, enrichment analysis, and large language models to identify pathways that are not only statistically significant but also biologically meaningful. Benchmarking on 102 manually curated gene sets across 19 diseases and ten disease-related biological mechanisms shows that cGSA outperforms baseline methods by over 30%, with expert validation confirming its increased precision and interpretability. Two independent case studies in melanoma and breast cancer further demonstrate its potential to uncover context-specific insights and support targeted hypothesis generation.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Aligning Protein Conformation Ensemble Generation with Physical Feedback
Authors:
Jiarui Lu,
Xiaoyin Chen,
Stephen Zhewen Lu,
Aurélie Lozano,
Vijil Chenthamarakshan,
Payel Das,
Jian Tang
Abstract:
Protein dynamics play a crucial role in protein biological functions and properties, and their traditional study typically relies on time-consuming molecular dynamics (MD) simulations conducted in silico. Recent advances in generative modeling, particularly denoising diffusion models, have enabled efficient accurate protein structure prediction and conformation sampling by learning distributions o…
▽ More
Protein dynamics play a crucial role in protein biological functions and properties, and their traditional study typically relies on time-consuming molecular dynamics (MD) simulations conducted in silico. Recent advances in generative modeling, particularly denoising diffusion models, have enabled efficient accurate protein structure prediction and conformation sampling by learning distributions over crystallographic structures. However, effectively integrating physical supervision into these data-driven approaches remains challenging, as standard energy-based objectives often lead to intractable optimization. In this paper, we introduce Energy-based Alignment (EBA), a method that aligns generative models with feedback from physical models, efficiently calibrating them to appropriately balance conformational states based on their energy differences. Experimental results on the MD ensemble benchmark demonstrate that EBA achieves state-of-the-art performance in generating high-quality protein ensembles. By improving the physical plausibility of generated structures, our approach enhances model predictions and holds promise for applications in structural biology and drug discovery.
△ Less
Submitted 30 May, 2025;
originally announced May 2025.
-
ReBaCCA-ss: Relevance-Balanced Continuum Correlation Analysis with Smoothing and Surrogating for Quantifying Similarity Between Population Spiking Activities
Authors:
Xiang Zhang,
Chenlin Xu,
Zhouxiao Lu,
Haonan Wang,
Dong Song
Abstract:
Quantifying similarity between population spike patterns is essential for understanding how neural dynamics encode information. Traditional approaches, which combine kernel smoothing, PCA, and CCA, have limitations: smoothing kernel bandwidths are often empirically chosen, CCA maximizes alignment between patterns without considering the variance explained within patterns, and baseline correlations…
▽ More
Quantifying similarity between population spike patterns is essential for understanding how neural dynamics encode information. Traditional approaches, which combine kernel smoothing, PCA, and CCA, have limitations: smoothing kernel bandwidths are often empirically chosen, CCA maximizes alignment between patterns without considering the variance explained within patterns, and baseline correlations from stochastic spiking are rarely corrected. We introduce ReBaCCA-ss (Relevance-Balanced Continuum Correlation Analysis with smoothing and surrogating), a novel framework that addresses these challenges through three innovations: (1) balancing alignment and variance explanation via continuum canonical correlation; (2) correcting for noise using surrogate spike trains; and (3) selecting the optimal kernel bandwidth by maximizing the difference between true and surrogate correlations. ReBaCCA-ss is validated on both simulated data and hippocampal recordings from rats performing a Delayed Nonmatch-to-Sample task. It reliably identifies spatio-temporal similarities between spike patterns. Combined with Multidimensional Scaling, ReBaCCA-ss reveals structured neural representations across trials, events, sessions, and animals, offering a powerful tool for neural population analysis.
△ Less
Submitted 19 May, 2025;
originally announced May 2025.
-
Category-Selective Neurons in Deep Networks: Comparing Purely Visual and Visual-Language Models
Authors:
Zitong Lu,
Yuxin Wang
Abstract:
Category-selective regions in the human brain, such as the fusiform face area (FFA), extrastriate body area (EBA), parahippocampal place area (PPA), and visual word form area (VWFA), play a crucial role in high-level visual processing. Here, we investigate whether artificial neural networks (ANNs) exhibit similar category-selective neurons and how these neurons vary across model layers and between…
▽ More
Category-selective regions in the human brain, such as the fusiform face area (FFA), extrastriate body area (EBA), parahippocampal place area (PPA), and visual word form area (VWFA), play a crucial role in high-level visual processing. Here, we investigate whether artificial neural networks (ANNs) exhibit similar category-selective neurons and how these neurons vary across model layers and between purely visual and vision-language models. Inspired by fMRI functional localizer experiments, we presented images from different categories (faces, bodies, scenes, words, scrambled scenes, and scrambled words) to deep networks and identified category-selective neurons using statistical criteria. Comparing ResNet and the structurally controlled ResNet-based CLIP model, we found that both models contain category-selective neurons, with their proportion increasing across layers, mirroring category selectivity in higher-level visual brain regions. However, CLIP exhibited a higher proportion but lower specificity of category-selective neurons compared to ResNet. Additionally, CLIP's category-selective neurons were more evenly distributed across feature maps and demonstrated greater representational consistency across layers. These findings suggest that language learning increases the number of category-selective neurons while reducing their selectivity strength, reshaping visual representations in deep networks. Our study provides insights into how ANNs mirror biological vision and how multimodal learning influences category-selective representations.
△ Less
Submitted 23 February, 2025;
originally announced February 2025.
-
MolCap-Arena: A Comprehensive Captioning Benchmark on Language-Enhanced Molecular Property Prediction
Authors:
Carl Edwards,
Ziqing Lu,
Ehsan Hajiramezanali,
Tommaso Biancalani,
Heng Ji,
Gabriele Scalia
Abstract:
Bridging biomolecular modeling with natural language information, particularly through large language models (LLMs), has recently emerged as a promising interdisciplinary research area. LLMs, having been trained on large corpora of scientific documents, demonstrate significant potential in understanding and reasoning about biomolecules by providing enriched contextual and domain knowledge. However…
▽ More
Bridging biomolecular modeling with natural language information, particularly through large language models (LLMs), has recently emerged as a promising interdisciplinary research area. LLMs, having been trained on large corpora of scientific documents, demonstrate significant potential in understanding and reasoning about biomolecules by providing enriched contextual and domain knowledge. However, the extent to which LLM-driven insights can improve performance on complex predictive tasks (e.g., toxicity) remains unclear. Further, the extent to which relevant knowledge can be extracted from LLMs also remains unknown. In this study, we present Molecule Caption Arena: the first comprehensive benchmark of LLM-augmented molecular property prediction. We evaluate over twenty LLMs, including both general-purpose and domain-specific molecule captioners, across diverse prediction tasks. To this goal, we introduce a novel, battle-based rating system. Our findings confirm the ability of LLM-extracted knowledge to enhance state-of-the-art molecular representations, with notable model-, prompt-, and dataset-specific variations. Code, resources, and data are available at github.com/Genentech/molcap-arena.
△ Less
Submitted 1 November, 2024;
originally announced November 2024.
-
Structure Language Models for Protein Conformation Generation
Authors:
Jiarui Lu,
Xiaoyin Chen,
Stephen Zhewen Lu,
Chence Shi,
Hongyu Guo,
Yoshua Bengio,
Jian Tang
Abstract:
Proteins adopt multiple structural conformations to perform their diverse biological functions, and understanding these conformations is crucial for advancing drug discovery. Traditional physics-based simulation methods often struggle with sampling equilibrium conformations and are computationally expensive. Recently, deep generative models have shown promise in generating protein conformations as…
▽ More
Proteins adopt multiple structural conformations to perform their diverse biological functions, and understanding these conformations is crucial for advancing drug discovery. Traditional physics-based simulation methods often struggle with sampling equilibrium conformations and are computationally expensive. Recently, deep generative models have shown promise in generating protein conformations as a more efficient alternative. However, these methods predominantly rely on the diffusion process within a 3D geometric space, which typically centers around the vicinity of metastable states and is often inefficient in terms of runtime. In this paper, we introduce Structure Language Modeling (SLM) as a novel framework for efficient protein conformation generation. Specifically, the protein structures are first encoded into a compact latent space using a discrete variational auto-encoder, followed by conditional language modeling that effectively captures sequence-specific conformation distributions. This enables a more efficient and interpretable exploration of diverse ensemble modes compared to existing methods. Based on this general framework, we instantiate SLM with various popular LM architectures as well as proposing the ESMDiff, a novel BERT-like structure language model fine-tuned from ESM3 with masked diffusion. We verify our approach in various scenarios, including the equilibrium dynamics of BPTI, conformational change pairs, and intrinsically disordered proteins. SLM provides a highly efficient solution, offering a 20-100x speedup than existing methods in generating diverse conformations, shedding light on promising avenues for future research.
△ Less
Submitted 12 March, 2025; v1 submitted 23 October, 2024;
originally announced October 2024.
-
Hierarchy of chaotic dynamics in random modular networks
Authors:
Łukasz Kuśmierz,
Ulises Pereira-Obilinovic,
Zhixin Lu,
Dana Mastrovito,
Stefan Mihalas
Abstract:
We introduce a model of randomly connected neural populations and study its dynamics by means of the dynamical mean-field theory and simulations. Our analysis uncovers a rich phase diagram, featuring high- and low-dimensional chaotic phases, separated by a crossover region characterized by low values of the maximal Lyapunov exponent and participation ratio dimension, but with high values of the Ly…
▽ More
We introduce a model of randomly connected neural populations and study its dynamics by means of the dynamical mean-field theory and simulations. Our analysis uncovers a rich phase diagram, featuring high- and low-dimensional chaotic phases, separated by a crossover region characterized by low values of the maximal Lyapunov exponent and participation ratio dimension, but with high values of the Lyapunov dimension that change significantly across the region. Counterintuitively, chaos can be attenuated by either adding noise to strongly modular connectivity or by introducing modularity into random connectivity. Extending the model to include a multilevel, hierarchical connectivity reveals that a loose balance between activities across levels drives the system towards the edge of chaos.
△ Less
Submitted 26 February, 2025; v1 submitted 8 October, 2024;
originally announced October 2024.
-
Cell Morphology-Guided Small Molecule Generation with GFlowNets
Authors:
Stephen Zhewen Lu,
Ziqing Lu,
Ehsan Hajiramezanali,
Tommaso Biancalani,
Yoshua Bengio,
Gabriele Scalia,
Michał Koziarski
Abstract:
High-content phenotypic screening, including high-content imaging (HCI), has gained popularity in the last few years for its ability to characterize novel therapeutics without prior knowledge of the protein target. When combined with deep learning techniques to predict and represent molecular-phenotype interactions, these advancements hold the potential to significantly accelerate and enhance drug…
▽ More
High-content phenotypic screening, including high-content imaging (HCI), has gained popularity in the last few years for its ability to characterize novel therapeutics without prior knowledge of the protein target. When combined with deep learning techniques to predict and represent molecular-phenotype interactions, these advancements hold the potential to significantly accelerate and enhance drug discovery applications. This work focuses on the novel task of HCI-guided molecular design. Generative models for molecule design could be guided by HCI data, for example with a supervised model that links molecules to phenotypes of interest as a reward function. However, limited labeled data, combined with the high-dimensional readouts, can make training these methods challenging and impractical. We consider an alternative approach in which we leverage an unsupervised multimodal joint embedding to define a latent similarity as a reward for GFlowNets. The proposed model learns to generate new molecules that could produce phenotypic effects similar to those of the given image target, without relying on pre-annotated phenotypic labels. We demonstrate that the proposed method generates molecules with high morphological and structural similarity to the target, increasing the likelihood of similar biological activity, as confirmed by an independent oracle model.
△ Less
Submitted 9 August, 2024;
originally announced August 2024.
-
RNACG: A Universal RNA Sequence Conditional Generation model based on Flow-Matching
Authors:
Letian Gao,
Zhi John Lu
Abstract:
RNA plays a pivotal role in diverse biological processes, ranging from gene regulation to catalysis. Recent advances in RNA design, such as RfamGen, Ribodiffusion and RDesign, have demonstrated promising results, with successful designs of functional sequences. However, RNA design remains challenging due to the inherent flexibility of RNA molecules and the scarcity of experimental data on tertiary…
▽ More
RNA plays a pivotal role in diverse biological processes, ranging from gene regulation to catalysis. Recent advances in RNA design, such as RfamGen, Ribodiffusion and RDesign, have demonstrated promising results, with successful designs of functional sequences. However, RNA design remains challenging due to the inherent flexibility of RNA molecules and the scarcity of experimental data on tertiary and secondary structures compared to proteins. These limitations highlight the need for a more universal and comprehensive approach to RNA design that integrates diverse annotation information at the sequence level. To address these challenges, we propose RNACG (RNA Conditional Generator), a universal framework for RNA sequence design based on flow matching. RNACG supports diverse conditional inputs, including structural, functional, and family-specific annotations, and offers a modular design that allows users to customize the encoding network for specific tasks. By unifying sequence generation under a single framework, RNACG enables the integration of multiple RNA design paradigms, from family-specific generation to tertiary structure inverse folding.
△ Less
Submitted 8 March, 2025; v1 submitted 29 July, 2024;
originally announced July 2024.
-
Teaching CORnet Human fMRI Representations for Enhanced Model-Brain Alignment
Authors:
Zitong Lu,
Yile Wang
Abstract:
Deep convolutional neural networks (DCNNs) have demonstrated excellent performance in object recognition and have been found to share some similarities with brain visual processing. However, the substantial gap between DCNNs and human visual perception still exists. Functional magnetic resonance imaging (fMRI) as a widely used technique in cognitive neuroscience can record neural activation in the…
▽ More
Deep convolutional neural networks (DCNNs) have demonstrated excellent performance in object recognition and have been found to share some similarities with brain visual processing. However, the substantial gap between DCNNs and human visual perception still exists. Functional magnetic resonance imaging (fMRI) as a widely used technique in cognitive neuroscience can record neural activation in the human visual cortex during the process of visual perception. Can we teach DCNNs human fMRI signals to achieve a more brain-like model? To answer this question, this study proposed ReAlnet-fMRI, a model based on the SOTA vision model CORnet but optimized using human fMRI data through a multi-layer encoding-based alignment framework. This framework has been shown to effectively enable the model to learn human brain representations. The fMRI-optimized ReAlnet-fMRI exhibited higher similarity to the human brain than both CORnet and the control model in within-and across-subject as well as within- and across-modality model-brain (fMRI and EEG) alignment evaluations. Additionally, we conducted an in-depth analyses to investigate how the internal representations of ReAlnet-fMRI differ from CORnet in encoding various object dimensions. These findings provide the possibility of enhancing the brain-likeness of visual models by integrating human neural data, helping to bridge the gap between computer vision and visual neuroscience.
△ Less
Submitted 14 July, 2024;
originally announced July 2024.
-
Information Benchmark for Biological Sensors Beyond Steady States -- Mpemba-like sensory withdrawal effect
Authors:
Asawari Pagare,
Zhiyue Lu
Abstract:
Biological sensors rely on the temporal dynamics of ligand concentration for signaling. The sensory performance is bounded by the distinguishability between the sensory state transition dynamics under different environmental protocols. This work presents a comprehensive theory to characterize arbitrary transient sensory dynamics of biological sensors. Here the sensory performance is quantified by…
▽ More
Biological sensors rely on the temporal dynamics of ligand concentration for signaling. The sensory performance is bounded by the distinguishability between the sensory state transition dynamics under different environmental protocols. This work presents a comprehensive theory to characterize arbitrary transient sensory dynamics of biological sensors. Here the sensory performance is quantified by the Kullback-Leibler (KL) divergence between the probability distributions of the sensor's stochastic paths. We introduce a novel benchmark to assess a sensor's transient sensory performance arbitrarily far from equilibrium. We identify a counter-intuitive phenomenon in multi-state sensors: while an initial exposure to high ligand concentration may hinder a sensor's sensitivity towards a future concentration up-shift, certain sensors may show a boost in sensitivity if the initial high concentration exposure is followed by a transient resetting at a low concentration environment. The boosted performance exceeds that of a sensor starting from an initially low concentration environment. This effect, reminiscent of a drug withdrawal effect, can be explained by the Markovian dynamics of the multi-state sensor, similar to the Markovian Mpemba effect. Moreover, an exhaustive machine learning study of 4-state sensors reveals a tight connection between the sensor's performance and the structure of the Markovian graph of its states.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Characterizing visual cortical magnification with topological smoothing and optimal transportation
Authors:
Yujian Xiong,
Yanshuai Tu,
Zhong-Lin Lu,
Yalin Wang
Abstract:
Human vision has different concentration on visual fields. Cortical magnification factor (CMF) is a popular measurement on visual acuity and cortex concentration. In order to achieve thorough measurement of CMF across the whole visual field, we propose a method to measure planar CMF upon retinotopic maps generated by pRF decoding, with help of our proposed methods: optimal transportation and topol…
▽ More
Human vision has different concentration on visual fields. Cortical magnification factor (CMF) is a popular measurement on visual acuity and cortex concentration. In order to achieve thorough measurement of CMF across the whole visual field, we propose a method to measure planar CMF upon retinotopic maps generated by pRF decoding, with help of our proposed methods: optimal transportation and topological smoothing. The optimal transportation re-calculates vertex location in retinotopic mapping, and topological smoothing guarantees topological conditions in retinotopic maps, which allow us to calculate planar CMF with the proposed 1-ring patch method. The pipeline was applied to the HCP 7T dataset, giving new planar results on CMF measurement across all 181 subjects, which illustrate novel concentration behavior on visual fields and their individual difference.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Achieving More Human Brain-Like Vision via Human EEG Representational Alignment
Authors:
Zitong Lu,
Yile Wang,
Julie D. Golomb
Abstract:
Despite advancements in artificial intelligence, object recognition models still lag behind in emulating visual information processing in human brains. Recent studies have highlighted the potential of using neural data to mimic brain processing; however, these often rely on invasive neural recordings from non-human subjects, leaving a critical gap in understanding human visual perception. Addressi…
▽ More
Despite advancements in artificial intelligence, object recognition models still lag behind in emulating visual information processing in human brains. Recent studies have highlighted the potential of using neural data to mimic brain processing; however, these often rely on invasive neural recordings from non-human subjects, leaving a critical gap in understanding human visual perception. Addressing this gap, we present, for the first time, 'Re(presentational)Al(ignment)net', a vision model aligned with human brain activity based on non-invasive EEG, demonstrating a significantly higher similarity to human brain representations. Our innovative image-to-brain multi-layer encoding framework advances human neural alignment by optimizing multiple model layers and enabling the model to efficiently learn and mimic human brain's visual representational patterns across object categories and different modalities. Our findings suggest that ReAlnet represents a breakthrough in bridging the gap between artificial and human vision, and paving the way for more brain-like artificial intelligence systems.
△ Less
Submitted 24 April, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
Stochastic Distinguishability of Markovian Trajectories
Authors:
Asawari Pagare,
Zhongmin Zhang,
Jiming Zheng,
Zhiyue Lu
Abstract:
The ability to distinguish between stochastic systems based on their trajectories is crucial in thermodynamics, chemistry, and biophysics. The Kullback-Leibler (KL) divergence, $D_{\text{KL}}^{AB}(0,τ)$, quantifies the distinguishability between the two ensembles of length-$τ$ trajectories from Markov processes A and B. However, evaluating $D_{\text{KL}}^{AB}(0,τ)$ from histograms of trajectories…
▽ More
The ability to distinguish between stochastic systems based on their trajectories is crucial in thermodynamics, chemistry, and biophysics. The Kullback-Leibler (KL) divergence, $D_{\text{KL}}^{AB}(0,τ)$, quantifies the distinguishability between the two ensembles of length-$τ$ trajectories from Markov processes A and B. However, evaluating $D_{\text{KL}}^{AB}(0,τ)$ from histograms of trajectories faces sufficient sampling difficulties, and no theory explicitly reveals what dynamical features contribute to the distinguishability. This letter provides a general formula that decomposes $D_{\text{KL}}^{AB}(0,τ)$ in space and time for any Markov processes, arbitrarily far from equilibrium or steady state. It circumvents the sampling difficulty of evaluating $D_{\text{KL}}^{AB}(0,τ)$. Furthermore, it explicitly connects trajectory KL divergence with individual transition events and their waiting time statistics. The results provide insights into understanding distinguishability between Markov processes, leading to new theoretical frameworks for designing biological sensors and optimizing signal transduction.
△ Less
Submitted 1 May, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
PubTator 3.0: an AI-powered Literature Resource for Unlocking Biomedical Knowledge
Authors:
Chih-Hsuan Wei,
Alexis Allot,
Po-Ting Lai,
Robert Leaman,
Shubo Tian,
Ling Luo,
Qiao Jin,
Zhizheng Wang,
Qingyu Chen,
Zhiyong Lu
Abstract:
PubTator 3.0 (https://www.ncbi.nlm.nih.gov/research/pubtator3/) is a biomedical literature resource using state-of-the-art AI techniques to offer semantic and relation searches for key concepts like proteins, genetic variants, diseases, and chemicals. It currently provides over one billion entity and relation annotations across approximately 36 million PubMed abstracts and 6 million full-text arti…
▽ More
PubTator 3.0 (https://www.ncbi.nlm.nih.gov/research/pubtator3/) is a biomedical literature resource using state-of-the-art AI techniques to offer semantic and relation searches for key concepts like proteins, genetic variants, diseases, and chemicals. It currently provides over one billion entity and relation annotations across approximately 36 million PubMed abstracts and 6 million full-text articles from the PMC open access subset, updated weekly. PubTator 3.0's online interface and API utilize these precomputed entity relations and synonyms to provide advanced search capabilities and enable large-scale analyses, streamlining many complex information needs. We showcase the retrieval quality of PubTator 3.0 using a series of entity pair queries, demonstrating that PubTator 3.0 retrieves a greater number of articles than either PubMed or Google Scholar, with higher precision in the top 20 results. We further show that integrating ChatGPT (GPT-4) with PubTator APIs dramatically improves the factuality and verifiability of its responses. In summary, PubTator 3.0 offers a comprehensive set of features and tools that allow researchers to navigate the ever-expanding wealth of biomedical literature, expediting research and unlocking valuable insights for scientific discovery.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
DTIAM: A unified framework for predicting drug-target interactions, binding affinities and activation/inhibition mechanisms
Authors:
Zhangli Lu,
Chuqi Lei,
Kaili Wang,
Libo Qin,
Jing Tang,
Min Li
Abstract:
Accurate and robust prediction of drug-target interactions (DTIs) plays a vital role in drug discovery. Despite extensive efforts have been invested in predicting novel DTIs, existing approaches still suffer from insufficient labeled data and cold start problems. More importantly, there is currently a lack of studies focusing on elucidating the mechanism of action (MoA) between drugs and targets.…
▽ More
Accurate and robust prediction of drug-target interactions (DTIs) plays a vital role in drug discovery. Despite extensive efforts have been invested in predicting novel DTIs, existing approaches still suffer from insufficient labeled data and cold start problems. More importantly, there is currently a lack of studies focusing on elucidating the mechanism of action (MoA) between drugs and targets. Distinguishing the activation and inhibition mechanisms is critical and challenging in drug development. Here, we introduce a unified framework called DTIAM, which aims to predict interactions, binding affinities, and activation/inhibition mechanisms between drugs and targets. DTIAM learns drug and target representations from large amounts of label-free data through self-supervised pre-training, which accurately extracts the substructure and contextual information of drugs and targets, and thus benefits the downstream prediction based on these representations. DTIAM achieves substantial performance improvement over other state-of-the-art methods in all tasks, particularly in the cold start scenario. Moreover, independent validation demonstrates the strong generalization ability of DTIAM. All these results suggested that DTIAM can provide a practically useful tool for predicting novel DTIs and further distinguishing the MoA of candidate drugs. DTIAM, for the first time, provides a unified framework for accurate and robust prediction of drug-target interactions, binding affinities, and activation/inhibition mechanisms.
△ Less
Submitted 23 December, 2023;
originally announced December 2023.
-
Full-Atom Protein Pocket Design via Iterative Refinement
Authors:
Zaixi Zhang,
Zepu Lu,
Zhongkai Hao,
Marinka Zitnik,
Qi Liu
Abstract:
The design of \emph{de novo} functional proteins that bind specific ligand molecules is paramount in therapeutics and bio-engineering. A critical yet formidable task in this endeavor is the design of the protein pocket, which is the cavity region of the protein where the ligand binds. Current methods are plagued by inefficient generation, inadequate context modeling of the ligand molecule, and the…
▽ More
The design of \emph{de novo} functional proteins that bind specific ligand molecules is paramount in therapeutics and bio-engineering. A critical yet formidable task in this endeavor is the design of the protein pocket, which is the cavity region of the protein where the ligand binds. Current methods are plagued by inefficient generation, inadequate context modeling of the ligand molecule, and the inability to generate side-chain atoms. Here, we present the Full-Atom Iterative Refinement (FAIR) method, designed to address these challenges by facilitating the co-design of protein pocket sequences, specifically residue types, and their corresponding 3D structures. FAIR operates in two steps, proceeding in a coarse-to-fine manner (transitioning from protein backbone to atoms, including side chains) for a full-atom generation. In each iteration, all residue types and structures are simultaneously updated, a process termed full-shot refinement. In the initial stage, the residue types and backbone coordinates are refined using a hierarchical context encoder, complemented by two structure refinement modules that capture both inter-residue and pocket-ligand interactions. The subsequent stage delves deeper, modeling the side-chain atoms of the pockets and updating residue types to ensure sequence-structure congruence. Concurrently, the structure of the binding ligand is refined across iterations to accommodate its inherent flexibility. Comprehensive experiments show that FAIR surpasses existing methods in designing superior pocket sequences and structures, producing average improvement exceeding 10\% in AAR and RMSD metrics. FAIR is available at \url{https://github.com/zaixizhang/FAIR}.
△ Less
Submitted 19 October, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
-
End-to-end topographic networks as models of cortical map formation and human visual behaviour: moving beyond convolutions
Authors:
Zejin Lu,
Adrien Doerig,
Victoria Bosch,
Bas Krahmer,
Daniel Kaiser,
Radoslaw M Cichy,
Tim C Kietzmann
Abstract:
Computational models are an essential tool for understanding the origin and functions of the topographic organisation of the primate visual system. Yet, vision is most commonly modelled by convolutional neural networks that ignore topography by learning identical features across space. Here, we overcome this limitation by developing All-Topographic Neural Networks (All-TNNs). Trained on visual inp…
▽ More
Computational models are an essential tool for understanding the origin and functions of the topographic organisation of the primate visual system. Yet, vision is most commonly modelled by convolutional neural networks that ignore topography by learning identical features across space. Here, we overcome this limitation by developing All-Topographic Neural Networks (All-TNNs). Trained on visual input, several features of primate topography emerge in All-TNNs: smooth orientation maps and cortical magnification in their first layer, and category-selective areas in their final layer. In addition, we introduce a novel dataset of human spatial biases in object recognition, which enables us to directly link models to behaviour. We demonstrate that All-TNNs significantly better align with human behaviour than previous state-of-the-art convolutional models due to their topographic nature. All-TNNs thereby mark an important step forward in understanding the spatial organisation of the visual brain and how it mediates visual behaviour.
△ Less
Submitted 18 August, 2023;
originally announced August 2023.
-
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval
Authors:
Qiao Jin,
Won Kim,
Qingyu Chen,
Donald C. Comeau,
Lana Yeganova,
W. John Wilbur,
Zhiyong Lu
Abstract:
Information retrieval (IR) is essential in biomedical knowledge acquisition and clinical decision support. While recent progress has shown that language model encoders perform better semantic retrieval, training such models requires abundant query-article annotations that are difficult to obtain in biomedicine. As a result, most biomedical IR systems only conduct lexical matching. In response, we…
▽ More
Information retrieval (IR) is essential in biomedical knowledge acquisition and clinical decision support. While recent progress has shown that language model encoders perform better semantic retrieval, training such models requires abundant query-article annotations that are difficult to obtain in biomedicine. As a result, most biomedical IR systems only conduct lexical matching. In response, we introduce MedCPT, a first-of-its-kind Contrastively Pre-trained Transformer model for zero-shot semantic IR in biomedicine. To train MedCPT, we collected an unprecedented scale of 255 million user click logs from PubMed. With such data, we use contrastive learning to train a pair of closely-integrated retriever and re-ranker. Experimental results show that MedCPT sets new state-of-the-art performance on six biomedical IR tasks, outperforming various baselines including much larger models such as GPT-3-sized cpt-text-XL. In addition, MedCPT also generates better biomedical article and sentence representations for semantic evaluations. As such, MedCPT can be readily applied to various real-world biomedical IR tasks.
△ Less
Submitted 3 October, 2023; v1 submitted 2 July, 2023;
originally announced July 2023.
-
Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health
Authors:
Shubo Tian,
Qiao Jin,
Lana Yeganova,
Po-Ting Lai,
Qingqing Zhu,
Xiuying Chen,
Yifan Yang,
Qingyu Chen,
Won Kim,
Donald C. Comeau,
Rezarta Islamaj,
Aadit Kapoor,
Xin Gao,
Zhiyong Lu
Abstract:
ChatGPT has drawn considerable attention from both the general public and domain experts with its remarkable text generation capabilities. This has subsequently led to the emergence of diverse applications in the field of biomedicine and health. In this work, we examine the diverse applications of large language models (LLMs), such as ChatGPT, in biomedicine and health. Specifically we explore the…
▽ More
ChatGPT has drawn considerable attention from both the general public and domain experts with its remarkable text generation capabilities. This has subsequently led to the emergence of diverse applications in the field of biomedicine and health. In this work, we examine the diverse applications of large language models (LLMs), such as ChatGPT, in biomedicine and health. Specifically we explore the areas of biomedical information retrieval, question answering, medical text summarization, information extraction, and medical education, and investigate whether LLMs possess the transformative power to revolutionize these tasks or whether the distinct complexities of biomedical domain presents unique challenges. Following an extensive literature survey, we find that significant advances have been made in the field of text generation tasks, surpassing the previous state-of-the-art methods. For other applications, the advances have been modest. Overall, LLMs have not yet revolutionized biomedicine, but recent rapid progress indicates that such methods hold great potential to provide valuable means for accelerating discovery and improving health. We also find that the use of LLMs, like ChatGPT, in the fields of biomedicine and health entails various risks and challenges, including fabricated information in its generated responses, as well as legal and privacy concerns associated with sensitive patient data. We believe this survey can provide a comprehensive and timely overview to biomedical researchers and healthcare practitioners on the opportunities and challenges associated with using ChatGPT and other LLMs for transforming biomedicine and health.
△ Less
Submitted 16 October, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Towards Predicting Equilibrium Distributions for Molecular Systems with Deep Learning
Authors:
Shuxin Zheng,
Jiyan He,
Chang Liu,
Yu Shi,
Ziheng Lu,
Weitao Feng,
Fusong Ju,
Jiaxi Wang,
Jianwei Zhu,
Yaosen Min,
He Zhang,
Shidi Tang,
Hongxia Hao,
Peiran Jin,
Chi Chen,
Frank Noé,
Haiguang Liu,
Tie-Yan Liu
Abstract:
Advances in deep learning have greatly improved structure prediction of molecules. However, many macroscopic observations that are important for real-world applications are not functions of a single molecular structure, but rather determined from the equilibrium distribution of structures. Traditional methods for obtaining these distributions, such as molecular dynamics simulation, are computation…
▽ More
Advances in deep learning have greatly improved structure prediction of molecules. However, many macroscopic observations that are important for real-world applications are not functions of a single molecular structure, but rather determined from the equilibrium distribution of structures. Traditional methods for obtaining these distributions, such as molecular dynamics simulation, are computationally expensive and often intractable. In this paper, we introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems. Inspired by the annealing process in thermodynamics, DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system, such as a chemical graph or a protein sequence. This framework enables efficient generation of diverse conformations and provides estimations of state densities. We demonstrate the performance of DiG on several molecular tasks, including protein conformation sampling, ligand structure sampling, catalyst-adsorbate sampling, and property-guided structure generation. DiG presents a significant advancement in methodology for statistically understanding molecular systems, opening up new research opportunities in molecular science.
△ Less
Submitted 8 June, 2023;
originally announced June 2023.
-
Generate your neural signals from mine: individual-to-individual EEG converters
Authors:
Zitong Lu,
Julie D. Golomb
Abstract:
Most models in cognitive and computational neuroscience trained on one subject do not generalize to other subjects due to individual differences. An ideal individual-to-individual neural converter is expected to generate real neural signals of one subject from those of another one, which can overcome the problem of individual differences for cognitive and computational models. In this study, we pr…
▽ More
Most models in cognitive and computational neuroscience trained on one subject do not generalize to other subjects due to individual differences. An ideal individual-to-individual neural converter is expected to generate real neural signals of one subject from those of another one, which can overcome the problem of individual differences for cognitive and computational models. In this study, we propose a novel individual-to-individual EEG converter, called EEG2EEG, inspired by generative models in computer vision. We applied THINGS EEG2 dataset to train and test 72 independent EEG2EEG models corresponding to 72 pairs across 9 subjects. Our results demonstrate that EEG2EEG is able to effectively learn the mapping of neural representations in EEG signals from one subject to another and achieve high conversion performance. Additionally, the generated EEG signals contain clearer representations of visual information than that can be obtained from real data. This method establishes a novel and state-of-the-art framework for neural conversion of EEG signals, which can realize a flexible and high-performance mapping from individual to individual and provide insight for both neural engineering and cognitive neuroscience.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information
Authors:
Qiao Jin,
Yifan Yang,
Qingyu Chen,
Zhiyong Lu
Abstract:
While large language models (LLMs) have been successfully applied to various tasks, they still face challenges with hallucinations. Augmenting LLMs with domain-specific tools such as database utilities can facilitate easier and more precise access to specialized knowledge. In this paper, we present GeneGPT, a novel method for teaching LLMs to use the Web APIs of the National Center for Biotechnolo…
▽ More
While large language models (LLMs) have been successfully applied to various tasks, they still face challenges with hallucinations. Augmenting LLMs with domain-specific tools such as database utilities can facilitate easier and more precise access to specialized knowledge. In this paper, we present GeneGPT, a novel method for teaching LLMs to use the Web APIs of the National Center for Biotechnology Information (NCBI) for answering genomics questions. Specifically, we prompt Codex to solve the GeneTuring tests with NCBI Web APIs by in-context learning and an augmented decoding algorithm that can detect and execute API calls. Experimental results show that GeneGPT achieves state-of-the-art performance on eight tasks in the GeneTuring benchmark with an average score of 0.83, largely surpassing retrieval-augmented LLMs such as the new Bing (0.44), biomedical LLMs such as BioMedLM (0.08) and BioGPT (0.04), as well as GPT-3 (0.16) and ChatGPT (0.12). Our further analyses suggest that: (1) API demonstrations have good cross-task generalizability and are more useful than documentations for in-context learning; (2) GeneGPT can generalize to longer chains of API calls and answer multi-hop questions in GeneHop, a novel dataset introduced in this work; (3) Different types of errors are enriched in different tasks, providing valuable insights for future improvements.
△ Less
Submitted 16 May, 2023; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Transformer Encoder with Multiscale Deep Learning for Pain Classification Using Physiological Signals
Authors:
Zhenyuan Lu,
Burcu Ozek,
Sagar Kamarthi
Abstract:
Pain is a serious worldwide health problem that affects a vast proportion of the population. For efficient pain management and treatment, accurate classification and evaluation of pain severity are necessary. However, this can be challenging as pain is a subjective sensation-driven experience. Traditional techniques for measuring pain intensity, e.g. self-report scales, are susceptible to bias and…
▽ More
Pain is a serious worldwide health problem that affects a vast proportion of the population. For efficient pain management and treatment, accurate classification and evaluation of pain severity are necessary. However, this can be challenging as pain is a subjective sensation-driven experience. Traditional techniques for measuring pain intensity, e.g. self-report scales, are susceptible to bias and unreliable in some instances. Consequently, there is a need for more objective and automatic pain intensity assessment strategies. In this paper, we develop PainAttnNet (PAN), a novel transfomer-encoder deep-learning framework for classifying pain intensities with physiological signals as input. The proposed approach is comprised of three feature extraction architectures: multiscale convolutional networks (MSCN), a squeeze-and-excitation residual network (SEResNet), and a transformer encoder block. On the basis of pain stimuli, MSCN extracts short- and long-window information as well as sequential features. SEResNet highlights relevant extracted features by mapping the interdependencies among features. The third module employs a transformer encoder consisting of three temporal convolutional networks (TCN) with three multi-head attention (MHA) layers to extract temporal dependencies from the features. Using the publicly available BioVid pain dataset, we test the proposed PainAttnNet model and demonstrate that our outcomes outperform state-of-the-art models. These results confirm that our approach can be utilized for automated classification of pain intensity using physiological signals to improve pain management and treatment.
△ Less
Submitted 17 March, 2023; v1 submitted 13 March, 2023;
originally announced March 2023.
-
MineProt: modern application for custom protein curation
Authors:
Yunchi Zhu,
Chengda Tong,
Zuohan Zhao,
Zuhong Lu
Abstract:
AI systems represented by AlphaFold are rapidly expanding the scale of protein structure modelling data, and the MineProt project provides an effective solution for custom curation of these novel high-throughput data. It enables researchers to build their own protein server in simple steps, run almost out-of-the-box scripts to annotate and curate their proteins, visualize, browse and search their…
▽ More
AI systems represented by AlphaFold are rapidly expanding the scale of protein structure modelling data, and the MineProt project provides an effective solution for custom curation of these novel high-throughput data. It enables researchers to build their own protein server in simple steps, run almost out-of-the-box scripts to annotate and curate their proteins, visualize, browse and search their data via a user-friendly online interface, and utilize plugins to extend the functionality of server. It is expected to support researcher productivity and facilitate data sharing in the new era of structural proteomics. MineProt is open-sourced at https://github.com/huiwenke/MineProt.
△ Less
Submitted 14 December, 2022;
originally announced December 2022.
-
A 3D-Shape Similarity-based Contrastive Approach to Molecular Representation Learning
Authors:
Austin Atsango,
Nathaniel L. Diamant,
Ziqing Lu,
Tommaso Biancalani,
Gabriele Scalia,
Kangway V. Chuang
Abstract:
Molecular shape and geometry dictate key biophysical recognition processes, yet many graph neural networks disregard 3D information for molecular property prediction. Here, we propose a new contrastive-learning procedure for graph neural networks, Molecular Contrastive Learning from Shape Similarity (MolCLaSS), that implicitly learns a three-dimensional representation. Rather than directly encodin…
▽ More
Molecular shape and geometry dictate key biophysical recognition processes, yet many graph neural networks disregard 3D information for molecular property prediction. Here, we propose a new contrastive-learning procedure for graph neural networks, Molecular Contrastive Learning from Shape Similarity (MolCLaSS), that implicitly learns a three-dimensional representation. Rather than directly encoding or targeting three-dimensional poses, MolCLaSS matches a similarity objective based on Gaussian overlays to learn a meaningful representation of molecular shape. We demonstrate how this framework naturally captures key aspects of three-dimensionality that two-dimensional representations cannot and provides an inductive framework for scaffold hopping.
△ Less
Submitted 3 November, 2022;
originally announced November 2022.
-
Theoretical upper bound of multiplexing in stochastic sensory receptors
Authors:
Asawari Pagare,
Sa Hoon Min,
Zhiyue Lu
Abstract:
Biological sensory receptors provide excellent examples of microscopic scale information transduction amidst stochastic noise. We argue that stochasticity is not always a hindrance to sensing. Instead, it could allow a single stochastic sensor to perform multiplexing: simultaneously transducing multiple types of environmental information to the downstream sensory network. Through a Langevin dynami…
▽ More
Biological sensory receptors provide excellent examples of microscopic scale information transduction amidst stochastic noise. We argue that stochasticity is not always a hindrance to sensing. Instead, it could allow a single stochastic sensor to perform multiplexing: simultaneously transducing multiple types of environmental information to the downstream sensory network. Through a Langevin dynamics simulation of a ligand-receptor sensor in a bath of ligands, we demonstrate that a binary-state receptor can simultaneously encode multiple independent environmental variables, such as ligand concentration and the speed of media flow. We develop a general theory of stochastic sensory multiplexing and suggest two theoretical upper bounds. Furthermore, we conjecture that randomly generated sensors typically saturate the tighter upper bound. The theoretical framework developed in this study, which involves a rank-deficient maximum likelihood analysis (rd-MLE), provides a systematic approach to comprehensively assess a sensor's sensory ability without any initial assumptions. This theoretical framework can inspire the design of more efficient artificial sensors.
△ Less
Submitted 3 April, 2023; v1 submitted 18 May, 2022;
originally announced May 2022.
-
Assigning Species Information to Corresponding Genes by a Sequence Labeling Framework
Authors:
Ling Luo,
Chih-Hsuan Wei,
Po-Ting Lai,
Qingyu Chen,
Rezarta Islamaj Doğan,
Zhiyong Lu
Abstract:
The automatic assignment of species information to the corresponding genes in a research article is a critically important step in the gene normalization task, whereby a gene mention is normalized and linked to a database record or identifier by a text-mining algorithm. Existing methods typically rely on heuristic rules based on gene and species co-occurrence in the article, but their accuracy is…
▽ More
The automatic assignment of species information to the corresponding genes in a research article is a critically important step in the gene normalization task, whereby a gene mention is normalized and linked to a database record or identifier by a text-mining algorithm. Existing methods typically rely on heuristic rules based on gene and species co-occurrence in the article, but their accuracy is suboptimal. We therefore developed a high-performance method, using a novel deep learning-based framework, to classify whether there is a relation between a gene and a species. Instead of the traditional binary classification framework in which all possible pairs of genes and species in the same article are evaluated, we treat the problem as a sequence-labeling task such that only a fraction of the pairs needs to be considered. Our benchmarking results show that our approach obtains significantly higher performance compared to that of the rule-based baseline method for the species assignment task (from 65.8% to 81.3% in accuracy). The source code and data for species assignment are freely available at https://github.com/ncbi/SpeciesAssignment.
△ Less
Submitted 8 May, 2022;
originally announced May 2022.
-
A Multilayer Network Model of the Coevolution of the Spread of a Disease and Competing Opinions
Authors:
Kaiyan Peng,
Zheng Lu,
Vanessa Lin,
Michael R. Lindstrom,
Christian Parkinson,
Chuntian Wang,
Andrea L. Bertozzi,
Mason A. Porter
Abstract:
During the COVID-19 pandemic, conflicting opinions on physical distancing swept across social media, affecting both human behavior and the spread of COVID-19. Inspired by such phenomena, we construct a two-layer multiplex network for the coupled spread of a disease and conflicting opinions. We model each process as a contagion. On one layer, we consider the concurrent evolution of two opinions --…
▽ More
During the COVID-19 pandemic, conflicting opinions on physical distancing swept across social media, affecting both human behavior and the spread of COVID-19. Inspired by such phenomena, we construct a two-layer multiplex network for the coupled spread of a disease and conflicting opinions. We model each process as a contagion. On one layer, we consider the concurrent evolution of two opinions -- pro-physical-distancing and anti-physical-distancing -- that compete with each other and have mutual immunity to each other. The disease evolves on the other layer, and individuals are less likely (respectively, more likely) to become infected when they adopt the pro-physical-distancing (respectively, anti-physical-distancing) opinion. We develop approximations of mean-field type by generalizing monolayer pair approximations to multilayer networks; these approximations agree well with Monte Carlo simulations for a broad range of parameters and several network structures. Through numerical simulations, we illustrate the influence of opinion dynamics on the spread of the disease from complex interactions both between the two conflicting opinions and between the opinions and the disease. We find that lengthening the duration that individuals hold an opinion may help suppress disease transmission, and we demonstrate that increasing the cross-layer correlations or intra-layer correlations of node degrees may lead to fewer individuals becoming infected with the disease.
△ Less
Submitted 4 July, 2021;
originally announced July 2021.
-
Topological Receptive Field Model for Human Retinotopic Mapping
Authors:
Yanshuai Tu,
Duyan Ta,
Zhong-Lin Lu,
Yalin Wang
Abstract:
The mapping between visual inputs on the retina and neuronal activations in the visual cortex, i.e., retinotopic map, is an essential topic in vision science and neuroscience. Human retinotopic maps can be revealed by analyzing the functional magnetic resonance imaging (fMRI) signal responses to designed visual stimuli in vivo. Neurophysiology studies summarized that visual areas are topological (…
▽ More
The mapping between visual inputs on the retina and neuronal activations in the visual cortex, i.e., retinotopic map, is an essential topic in vision science and neuroscience. Human retinotopic maps can be revealed by analyzing the functional magnetic resonance imaging (fMRI) signal responses to designed visual stimuli in vivo. Neurophysiology studies summarized that visual areas are topological (i.e., nearby neurons have receptive fields at nearby locations in the image). However, conventional fMRI-based analyses frequently generate non-topological results because they process fMRI signals on a voxel-wise basis, without considering the neighbor relations on the surface. Here we propose a topological receptive field (tRF) model which imposes the topological condition when decoding retinotopic fMRI signals. More specifically, we parametrized the cortical surface to a unit disk, characterized the topological condition by tRF, and employed an efficient scheme to solve the tRF model. We tested our framework on both synthetic and human fMRI data. Experimental results showed that the tRF model could remove the topological violations, improve model explaining power, and generate biologically plausible retinotopic maps. The proposed framework is general and can be applied to other sensory maps.
△ Less
Submitted 15 June, 2021; v1 submitted 15 June, 2021;
originally announced June 2021.
-
Optimizing Visual Cortex Parameterization with Error-Tolerant Teichmuller Map in Retinotopic Mapping
Authors:
Yanshuai Tu,
Duyan Ta,
Zhong-Lin Lu,
Yalin Wang
Abstract:
The mapping between the visual input on the retina to the cortical surface, i.e., retinotopic mapping, is an important topic in vision science and neuroscience. Human retinotopic mapping can be revealed by analyzing cortex functional magnetic resonance imaging (fMRI) signals when the subject is under specific visual stimuli. Conventional methods process, smooth, and analyze the retinotopic mapping…
▽ More
The mapping between the visual input on the retina to the cortical surface, i.e., retinotopic mapping, is an important topic in vision science and neuroscience. Human retinotopic mapping can be revealed by analyzing cortex functional magnetic resonance imaging (fMRI) signals when the subject is under specific visual stimuli. Conventional methods process, smooth, and analyze the retinotopic mapping based on the parametrization of the (partial) cortical surface. However, the retinotopic maps generated by this approach frequently contradict neuropsychology results. To address this problem, we propose an integrated approach that parameterizes the cortical surface, such that the parametric coordinates linearly relates the visual coordinate. The proposed method helps the smoothing of noisy retinotopic maps and obtains neurophysiological insights in human vision systems. One key element of the approach is the Error-Tolerant Teichmuller Map, which uniforms the angle distortion and maximizes the alignments to self-contradicting landmarks. We validated our overall approach with synthetic and real retinotopic mapping datasets. The experimental results show the proposed approach is superior in accuracy and compatibility. Although we focus on retinotopic mapping, the proposed framework is general and can be applied to process other human sensory maps.
△ Less
Submitted 24 May, 2020;
originally announced May 2020.
-
Forecast analysis of the epidemics trend of COVID-19 in the United States by a generalized fractional-order SEIR model
Authors:
Conghui Xu,
Yongguang Yu,
QuanChen Yang,
Zhenzhen Lu
Abstract:
In this paper, a generalized fractional-order SEIR model is proposed, denoted by SEIQRP model, which has a basic guiding significance for the prediction of the possible outbreak of infectious diseases like COVID-19 and other insect diseases in the future. Firstly, some qualitative properties of the model are analyzed. The basic reproduction number $R_{0}$ is derived. When $R_{0}<1$, the disease-fr…
▽ More
In this paper, a generalized fractional-order SEIR model is proposed, denoted by SEIQRP model, which has a basic guiding significance for the prediction of the possible outbreak of infectious diseases like COVID-19 and other insect diseases in the future. Firstly, some qualitative properties of the model are analyzed. The basic reproduction number $R_{0}$ is derived. When $R_{0}<1$, the disease-free equilibrium point is unique and locally asymptotically stable. When $R_{0}>1$, the endemic equilibrium point is also unique. Furthermore, some conditions are established to ensure the local asymptotic stability of disease-free and endemic equilibrium points. The trend of COVID-19 spread in the United States is predicted. Considering the influence of the individual behavior and government mitigation measurement, a modified SEIQRP model is proposed, defined as SEIQRPD model. According to the real data of the United States, it is found that our improved model has a better prediction ability for the epidemic trend in the next two weeks. Hence, the epidemic trend of the United States in the next two weeks is investigated, and the peak of isolated cases are predicted. The modified SEIQRP model successfully capture the development process of COVID-19, which provides an important reference for understanding the trend of the outbreak.
△ Less
Submitted 29 April, 2020; v1 submitted 26 April, 2020;
originally announced April 2020.
-
A fractional-order SEIHDR model for COVID-19 with inter-city networked coupling effects
Authors:
Zhenzhen Lu,
Yongguang Yu,
YangQuan Chen,
Guojian Ren,
Conghui Xu,
Shuhui Wang,
Zhe Yin
Abstract:
In this paper, a mathematical model is proposed to analyze the dynamic behavior of COVID-19. Based on inter-city networked coupling effects, a fractional-order SEIHDR system with the real-data from 23 January to 18 March, 2020 of COVID-19 is discussed. Meanwhile, hospitalized individuals and the mortality rates of three types of individuals (exposed, infected and hospitalized) are firstly taken in…
▽ More
In this paper, a mathematical model is proposed to analyze the dynamic behavior of COVID-19. Based on inter-city networked coupling effects, a fractional-order SEIHDR system with the real-data from 23 January to 18 March, 2020 of COVID-19 is discussed. Meanwhile, hospitalized individuals and the mortality rates of three types of individuals (exposed, infected and hospitalized) are firstly taken into account in the proposed model. And infectivity of individuals during incubation is also considered in this paper. By applying least squares method and predictor-correctors scheme, the numerical solutions of the proposed system in the absence of the inter-city network and with the inter-city network are stimulated by using the real-data from 23 January to $18-m$ March, 2020 where $m$ is equal to the number of prediction days. Compared with integer-order system ($α=0$), the fractional-order model without network is validated to have a better fitting of the data on Beijing, Shanghai, Wuhan, Huanggang and other cities. In contrast to the case without network, the results indicate that the inter-city network system may be not a significant case to virus spreading for China because of the lock down and quarantine measures, however, it may have an impact on cities that have not adopted city closure. Meanwhile, the proposed model better fits the data from 24 February to 31, March in Italy, and the peak number of confirmed people is also predicted by this fraction-order model. Furthermore, the existence and uniqueness of a bounded solution under the initial condition are considered in the proposed system. Afterwards, the basic reproduction number $R_0$ is analyzed and it is found to hold a threshold: the disease-free equilibrium point is locally asymptotically stable when $R_0\le 1$, which provides a theoretical basis for whether COVID-19 will become a pandemic in the future.
△ Less
Submitted 30 April, 2020; v1 submitted 26 April, 2020;
originally announced April 2020.
-
Non-equilibrium statistical mechanics of continuous attractors
Authors:
Weishun Zhong,
Zhiyue Lu,
David J Schwab,
Arvind Murugan
Abstract:
Continuous attractors have been used to understand recent neuroscience experiments where persistent activity patterns encode internal representations of external attributes like head direction or spatial location. However, the conditions under which the emergent bump of neural activity in such networks can be manipulated by space and time-dependent external sensory or motor signals are not underst…
▽ More
Continuous attractors have been used to understand recent neuroscience experiments where persistent activity patterns encode internal representations of external attributes like head direction or spatial location. However, the conditions under which the emergent bump of neural activity in such networks can be manipulated by space and time-dependent external sensory or motor signals are not understood. Here, we find fundamental limits on how rapidly internal representations encoded along continuous attractors can be updated by an external signal. We apply these results to place cell networks to derive a velocity-dependent non-equilibrium memory capacity in neural networks.
△ Less
Submitted 30 December, 2018; v1 submitted 28 September, 2018;
originally announced September 2018.
-
Experimental Design Modulates Variance in BOLD Activation: The Variance Design General Linear Model
Authors:
Garren Gaut,
Xiangrui Li,
Zhong-Lin Lu,
Mark Steyvers
Abstract:
Typical fMRI studies have focused on either the mean trend in the blood-oxygen-level-dependent (BOLD) time course or functional connectivity (FC). However, other statistics of the neuroimaging data may contain important information. Despite studies showing links between the variance in the BOLD time series (BV) and age and cognitive performance, a formal framework for testing these effects has not…
▽ More
Typical fMRI studies have focused on either the mean trend in the blood-oxygen-level-dependent (BOLD) time course or functional connectivity (FC). However, other statistics of the neuroimaging data may contain important information. Despite studies showing links between the variance in the BOLD time series (BV) and age and cognitive performance, a formal framework for testing these effects has not yet been developed. We introduce the Variance Design General Linear Model (VDGLM), a novel framework that facilitates the detection of variance effects. We designed the framework for general use in any fMRI study by modeling both mean and variance in BOLD activation as a function of experimental design. The flexibility of this approach allows the VDGLM to i) simultaneously make inferences about a mean or variance effect while controlling for the other and ii) test for variance effects that could be associated with multiple conditions and/or noise regressors. We demonstrate the use of the VDGLM in a working memory application and show that engagement in a working memory task is associated with whole-brain decreases in BOLD variance.
△ Less
Submitted 6 August, 2018;
originally announced August 2018.
-
Invertible generalized synchronization: A putative mechanism for implicit learning in biological and artificial neural systems
Authors:
Zhixin Lu,
Danielle S. Bassett
Abstract:
Regardless of the marked differences between biological and artificial neural systems, one fundamental similarity is that they are essentially dynamical systems that can learn to imitate other dynamical systems, without knowing their governing equations. The brain is able to learn the dynamic nature of the physical world via experience; analogously, artificial neural systems can learn the long-ter…
▽ More
Regardless of the marked differences between biological and artificial neural systems, one fundamental similarity is that they are essentially dynamical systems that can learn to imitate other dynamical systems, without knowing their governing equations. The brain is able to learn the dynamic nature of the physical world via experience; analogously, artificial neural systems can learn the long-term behavior of complex dynamical systems from data. Yet, precisely how this implicit learning occurs remains unknown. Here, we draw inspiration from human neuroscience and from reservoir computing to propose a first-principles framework explicating putative mechanisms of implicit learning. Specifically, we show that an arbitrary dynamical system implicitly learns other dynamical attractors by embedding them into its own phase space through invertible generalized synchronization. By sustaining the embedding through fine-tuned feedback loops, the arbitrary dynamical system can imitate the attractor dynamics it has learned. To evaluate the mechanism's relevance, we construct several distinct neural network models that adaptively learn and imitate multiple attractors. We observe and explain the emergence of 5 distinct phenomena reminiscent of cognitive functions: (i) imitating a dynamical system purely from learning the time series, (ii) learning multiple attractors by a single system, (iii) switching among the imitations of multiple attractors, either spontaneously or driven by external cues, (iv) filling-in missing variables from incomplete observations of a learned dynamical system, and (v) deciphering superimposed input from different dynamical systems. Collectively, our findings support the notion that artificial and biological neural networks can learn the dynamic nature of their environment, and systems within their environment, through the mechanism of invertible generalized synchronization.
△ Less
Submitted 5 November, 2019; v1 submitted 12 July, 2018;
originally announced July 2018.
-
Predicting Task and Subject Differences with Functional Connectivity and BOLD Variability
Authors:
Garren Gaut,
Xiangrui Li,
Brandon Turner,
William A. Cunningham,
Zhong-Lin Lu,
Mark Steyvers
Abstract:
Previous research has found that functional connectivity (FC) can accurately predict the identity of a subject performing a task and the type of task being performed. We replicate these results using a large dataset collected at the OSU Center for Cognitive and Behavioral Brain Imaging. We also introduce a novel perspective on task and subject identity prediction: BOLD Variability (BV). Conceptual…
▽ More
Previous research has found that functional connectivity (FC) can accurately predict the identity of a subject performing a task and the type of task being performed. We replicate these results using a large dataset collected at the OSU Center for Cognitive and Behavioral Brain Imaging. We also introduce a novel perspective on task and subject identity prediction: BOLD Variability (BV). Conceptually, BV is a region-specific measure based on the variance within each brain region. BV is simple to compute, interpret, and visualize. We show that both FC and BV are predictive of task and subject, even across scanning sessions separated by multiple years. Subject differences rather than task differences account for the majority of changes in BV and FC. Similar to results in FC, we show that BV is reduced during cognitive tasks relative to rest.
△ Less
Submitted 12 July, 2018;
originally announced July 2018.
-
Continuous attractor-based clocks are unreliable phase estimators
Authors:
Weerapat Pittayakanchit,
Zhiyue Lu,
Justin Chew,
Michael J. Rust,
Arvind Murugan
Abstract:
Statistical estimation theory determines the optimal way of estimating parameters of a fluctuating noisy signal. However, if the estimation is performed on unreliable hardware, a sub-optimal estimation procedure can outperform the previously optimal procedure. Here, we compare classes of circadian clocks by viewing them as phase estimators for the periodic day-night light signal. We find that cont…
▽ More
Statistical estimation theory determines the optimal way of estimating parameters of a fluctuating noisy signal. However, if the estimation is performed on unreliable hardware, a sub-optimal estimation procedure can outperform the previously optimal procedure. Here, we compare classes of circadian clocks by viewing them as phase estimators for the periodic day-night light signal. We find that continuous attractor-based free running clocks, such as those found in the cyanobacterium Synechococcus elongatus and humans, are nearly optimal phase estimators since their flat attractor directions efficiently project out light intensity fluctuations due to weather patterns (`external noise'). However, such flat directions also make these continuous limit cycle attractors highly vulnerable to diffusive 'internal noise'. Given such unreliable biochemical hardware, we find that point attractor-based damped clocks, such as those found in a smaller cyanobacterium with low protein copy number, Prochlorococcus marinus, outperform continuous attractor-based clocks. By interpolating between the two types of clocks found in these organisms, we demonstrate a family of biochemical phase estimation strategies that are best suited to different relative strengths of external and internal noise.
△ Less
Submitted 27 September, 2017;
originally announced September 2017.
-
Deep learning for extracting protein-protein interactions from biomedical literature
Authors:
Yifan Peng,
Zhiyong Lu
Abstract:
State-of-the-art methods for protein-protein interaction (PPI) extraction are primarily feature-based or kernel-based by leveraging lexical and syntactic information. But how to incorporate such knowledge in the recent deep learning methods remains an open question. In this paper, we propose a multichannel dependency-based convolutional neural network model (McDepCNN). It applies one channel to th…
▽ More
State-of-the-art methods for protein-protein interaction (PPI) extraction are primarily feature-based or kernel-based by leveraging lexical and syntactic information. But how to incorporate such knowledge in the recent deep learning methods remains an open question. In this paper, we propose a multichannel dependency-based convolutional neural network model (McDepCNN). It applies one channel to the embedding vector of each word in the sentence, and another channel to the embedding vector of the head of the corresponding word. Therefore, the model can use richer information obtained from different channels. Experiments on two public benchmarking datasets, AIMed and BioInfer, demonstrate that McDepCNN compares favorably to the state-of-the-art rich-feature and single-kernel based methods. In addition, McDepCNN achieves 24.4% relative improvement in F1-score over the state-of-the-art methods on cross-corpus evaluation and 12% improvement in F1-score over kernel-based methods on "difficult" instances. These results suggest that McDepCNN generalizes more easily over different corpora, and is capable of capturing long distance features in the sentences.
△ Less
Submitted 6 June, 2017; v1 submitted 5 June, 2017;
originally announced June 2017.
-
MDA in Capillary for Whole Genome Amplification
Authors:
Junji Li,
Na Lu,
Xulian Shi,
Yi Qiao,
Liang Chen,
Mengqin Duan,
Yong Hou,
Qinyu Ge,
Yuhan Tao,
Jing Tu,
Zuhong Lu
Abstract:
Whole genome amplification (WGA) plays an important role in sample preparation of low-input templates for high-throughput sequencing. Multiple displacement amplification (MDA), a popular isothermal WGA method, suffers a major hurdle of highly uneven amplification. Optimizations have been made in the past by separating the reagents into numbers of tiny chambers or droplets in microfluidic devices,…
▽ More
Whole genome amplification (WGA) plays an important role in sample preparation of low-input templates for high-throughput sequencing. Multiple displacement amplification (MDA), a popular isothermal WGA method, suffers a major hurdle of highly uneven amplification. Optimizations have been made in the past by separating the reagents into numbers of tiny chambers or droplets in microfluidic devices, which significantly improves the amplification uniformity of MDA. However, skill barrier still exists for biological researchers to handle chip fabrication and droplet manipulation. Here, we present a novel MDA protocol, in-capillary MDA (icMDA), which significantly simplifies the manipulation and improves the uniformity of amplification by dispersing reagents in a long quasi-1D capillary tubing. We demonstrated that icMDA is able to accurately detect SNVs with higher efficiency and sensitivity. Moreover, this straightforward method employs neither customized instruments nor complicated operations, making it a ready-to-use approach for most laboratories.
△ Less
Submitted 30 May, 2017;
originally announced May 2017.
-
Dynamics of glucose-lactose diauxic growth in E. coli
Authors:
Zhao Lu,
Michael A Lee
Abstract:
We present a mathematical model of glucose-lactose diauxic growth in Escherichia coli including both the postive and negative regulation mechanisms of the lactose operon as well as the inducer exclusion. To validate this model, we first calculated the time evolution of beta-galactosidase for only the lactose nutrient and compared the numerical results with experimental data. Second, we compared…
▽ More
We present a mathematical model of glucose-lactose diauxic growth in Escherichia coli including both the postive and negative regulation mechanisms of the lactose operon as well as the inducer exclusion. To validate this model, we first calculated the time evolution of beta-galactosidase for only the lactose nutrient and compared the numerical results with experimental data. Second, we compared the calculated cell biomass of the glucose-lactose diauxic growth with the experimental optical density of the diauxic growth for a particular E. coli MG 1655. For both cases, the numerical calculations from this model are in good agreement with these two experiments' data. The diauxic growth pattern of a wild type E. coli was also investigated.
△ Less
Submitted 28 January, 2011; v1 submitted 15 August, 2007;
originally announced August 2007.
-
Neutral genetic drift can aid functional protein evolution
Authors:
Jesse D Bloom,
Philip A Romero,
Zhongyi Lu,
Frances H Arnold
Abstract:
BACKGROUND: Many of the mutations accumulated by naturally evolving proteins are neutral in the sense that they do not significantly alter a protein's ability to perform its primary biological function. However, new protein functions evolve when selection begins to favor other, "promiscuous" functions that are incidental to a protein's biological role. If mutations that are neutral with respect…
▽ More
BACKGROUND: Many of the mutations accumulated by naturally evolving proteins are neutral in the sense that they do not significantly alter a protein's ability to perform its primary biological function. However, new protein functions evolve when selection begins to favor other, "promiscuous" functions that are incidental to a protein's biological role. If mutations that are neutral with respect to a protein's primary biological function cause substantial changes in promiscuous functions, these mutations could enable future functional evolution.
RESULTS: Here we investigate this possibility experimentally by examining how cytochrome P450 enzymes that have evolved neutrally with respect to activity on a single substrate have changed in their abilities to catalyze reactions on five other substrates. We find that the enzymes have sometimes changed as much as four-fold in the promiscuous activities. The changes in promiscuous activities tend to increase with the number of mutations, and can be largely rationalized in terms of the chemical structures of the substrates. The activities on chemically similar substrates tend to change in a coordinated fashion, potentially providing a route for systematically predicting the change in one function based on the measurement of several others.
CONCLUSIONS: Our work suggests that initially neutral genetic drift can lead to substantial changes in protein functions that are not currently under selection, in effect poising the proteins to more readily undergo functional evolution should selection "ask new questions" in the future.
△ Less
Submitted 2 May, 2007;
originally announced May 2007.
-
Evolution favors protein mutational robustness in sufficiently large populations
Authors:
Jesse D. Bloom,
Zhongyi Lu,
David Chen,
Alpan Raval,
Ophelia S. Venturelli,
Frances H. Arnold
Abstract:
BACKGROUND: An important question is whether evolution favors properties such as mutational robustness or evolvability that do not directly benefit any individual, but can influence the course of future evolution. Functionally similar proteins can differ substantially in their robustness to mutations and capacity to evolve new functions, but it has remained unclear whether any of these differenc…
▽ More
BACKGROUND: An important question is whether evolution favors properties such as mutational robustness or evolvability that do not directly benefit any individual, but can influence the course of future evolution. Functionally similar proteins can differ substantially in their robustness to mutations and capacity to evolve new functions, but it has remained unclear whether any of these differences might be due to evolutionary selection for these properties.
RESULTS: Here we use laboratory experiments to demonstrate that evolution favors protein mutational robustness if the evolving population is sufficiently large. We neutrally evolve cytochrome P450 proteins under identical selection pressures and mutation rates in populations of different sizes, and show that proteins from the larger and thus more polymorphic population tend towards higher mutational robustness. Proteins from the larger population also evolve greater stability, a biophysical property that is known to enhance both mutational robustness and evolvability. The excess mutational robustness and stability is well described by existing mathematical theories, and can be quantitatively related to the way that the proteins occupy their neutral network.
CONCLUSIONS: Our work is the first experimental demonstration of the general tendency of evolution to favor mutational robustness and protein stability in highly polymorphic populations. We suggest that this phenomenon may contribute to the mutational robustness and evolvability of viruses and bacteria that exist in large populations.
△ Less
Submitted 14 April, 2007;
originally announced April 2007.
-
RNA Binding Density on X-chromosome Differing from that on 22 Autosomes in Human
Authors:
Zhanjun Lu,
Ying Lu,
Shuxia Song,
Zhai Yu,
Xiufang Wang
Abstract:
To test whether X-chromosome has unique genomic characteristics, X-chromosome and 22 autosomes were compared for RNA binding density. Nucleotide sequences on the chromosomes were divided into 50kb per segment that was recoded as a set of frequency values of 7-nucleotide (7nt) strings using all possible 7nt strings (47=16384). 120 genes highly expressed in tonsil germinal center B cells were sele…
▽ More
To test whether X-chromosome has unique genomic characteristics, X-chromosome and 22 autosomes were compared for RNA binding density. Nucleotide sequences on the chromosomes were divided into 50kb per segment that was recoded as a set of frequency values of 7-nucleotide (7nt) strings using all possible 7nt strings (47=16384). 120 genes highly expressed in tonsil germinal center B cells were selected for calculating 7nt string frequency values of all introns (RNAs). The binding density of DNA segments and RNAs was determined by the amount of complement sequences. It was shown for the first time that gene-poor and low gene expression X-chromosome had the lowest percentage of the DNA segments that can highly bind RNAs, whereas gene-rich and high gene expression chromosome 19 had the highest percentage of the segments. On the basis of these results, it is proposed that the nonrandom properties of distribution of RNA highly binding DNA segments on the chromosomes provide strong evidence that lack of RNA highly binding segments may be a cause of X-chromosome inactivation
△ Less
Submitted 24 December, 2003;
originally announced December 2003.