-
A Novel Nonlinear IP$_3$R State Transition Model and Calcium Oscillation
Authors:
Zhao-Yu Peng,
Han-Yu Jiang,
Jun He
Abstract:
We present a novel nonlinear state transition model for inositol 1,4,5-trisphosphate receptors (IP$_3$Rs) that incorporates a pre-activated state, as suggested by electron microscopy observations. Our model provides a theoretical framework for the biphasic Ca$^{2+}$ dependence of IP$_3$Rs and accurately reproduces their experimentally observed state distribution under saturating IP$_3$ conditions. By integrating receptor dynamics with cytoplasmic and endoplasmic reticulum (ER) calcium exchange, we simulate IP$_3$R-mediated Ca$^{2+}$ oscillations governed by six key conformational states. A pivotal finding is that IP$_3$ regulates these oscillations in a switch-like manner: once a critical IP$_3$ concentration is reached, the system abruptly transitions to sustained, constant-amplitude oscillations that quickly terminate when the concentration exceeds a secondary threshold. These results underscore the crucial role of the pre-activated state in modulating calcium signaling.
Submitted 18 August, 2025;
originally announced August 2025.
-
Bridging Brains and Machines: A Unified Frontier in Neuroscience, Artificial Intelligence, and Neuromorphic Systems
Authors:
Sohan Shankar,
Yi Pan,
Hanqi Jiang,
Zhengliang Liu,
Mohammad R. Darbandi,
Agustin Lorenzo,
Junhao Chen,
Md Mehedi Hasan,
Arif Hassan Zidan,
Eliana Gelman,
Joshua A. Konfrst,
Jillian Y. Russell,
Katelyn Fernandes,
Tianze Yang,
Yiwei Li,
Huaqin Zhao,
Afrar Jahin,
Triparna Ganguly,
Shair Dinesha,
Yifan Zhou,
Zihao Wu,
Xinliang Li,
Lokesh Adusumilli,
Aziza Hussein,
Sagar Nookarapu
, et al. (20 additional authors not shown)
Abstract:
This position and survey paper identifies the emerging convergence of neuroscience, artificial general intelligence (AGI), and neuromorphic computing toward a unified research paradigm. Using a framework grounded in brain physiology, we highlight how synaptic plasticity, sparse spike-based communication, and multimodal association provide design principles for next-generation AGI systems that potentially combine both human and machine intelligences. The review traces this evolution from early connectionist models to state-of-the-art large language models, demonstrating how key innovations like transformer attention, foundation-model pre-training, and multi-agent architectures mirror neurobiological processes like cortical mechanisms, working memory, and episodic consolidation. We then discuss emerging physical substrates capable of breaking the von Neumann bottleneck to achieve brain-scale efficiency in silicon: memristive crossbars, in-memory compute arrays, and emerging quantum and photonic devices. There are four critical challenges at this intersection: 1) integrating spiking dynamics with foundation models, 2) maintaining lifelong plasticity without catastrophic forgetting, 3) unifying language with sensorimotor learning in embodied agents, and 4) enforcing ethical safeguards in advanced neuromorphic autonomous systems. This combined perspective across neuroscience, computation, and hardware offers an integrative agenda for each of these fields.
Submitted 14 July, 2025;
originally announced July 2025.
-
Formation and Regulation of Calcium Sparks on a Nonlinear Spatial Network of Ryanodine Receptors
Authors:
Tian-Tian Li,
Zhong-Xue Gao,
Zuo-Ming Ding,
Han-Yu Jiang,
Jun He
Abstract:
Accurate regulation of calcium release is essential for cellular signaling, with the spatial distribution of ryanodine receptors (RyRs) playing a critical role. In this study, we present a nonlinear spatial network model that simulates RyR spatial organization to investigate calcium release dynamics by integrating RyR behavior, calcium buffering, and calsequestrin (CSQ) regulation. The model successfully reproduces calcium sparks, shedding light on their initiation, duration, and termination mechanisms under clamped calcium conditions. Our simulations demonstrate that RyR clusters act as on-off switches for calcium release, producing short-lived calcium quarks and longer-lasting calcium sparks based on distinct activation patterns. Spark termination is governed by calcium gradients and stochastic RyR dynamics, with CSQ facilitating RyR closure and spark termination. We also uncover the dual role of CSQ as both a calcium buffer and a regulator of RyRs. Elevated CSQ levels prolong calcium release due to buffering effects, while CSQ-RyR interactions induce excessive refractoriness, a phenomenon linked to pathological conditions such as ventricular arrhythmias. Dysregulated CSQ function disrupts the on-off switching behavior of RyRs, impairing calcium release dynamics. These findings provide new insights into RyR-mediated calcium signaling, highlighting CSQ's pivotal role in maintaining calcium homeostasis and its implications for pathological conditions. This work advances the understanding of calcium spark regulation and underscores its significance for cardiomyocyte function.
Submitted 10 July, 2025;
originally announced July 2025.
-
OmniESI: A unified framework for enzyme-substrate interaction prediction with progressive conditional deep learning
Authors:
Zhiwei Nie,
Hongyu Zhang,
Hao Jiang,
Yutian Liu,
Xiansong Huang,
Fan Xu,
Jie Fu,
Zhixiang Ren,
Yonghong Tian,
Wen-Bin Zhang,
Jie Chen
Abstract:
Understanding and modeling enzyme-substrate interactions is crucial for catalytic mechanism research, enzyme engineering, and metabolic engineering. Although a large number of predictive methods have emerged, they do not incorporate prior knowledge of enzyme catalysis to rationally modulate general protein-molecule features that are misaligned with catalytic patterns. To address this issue, we introduce a two-stage progressive framework, OmniESI, for enzyme-substrate interaction prediction through conditional deep learning. By decomposing the modeling of enzyme-substrate interactions into a two-stage progressive process, OmniESI incorporates two conditional networks that respectively emphasize enzymatic reaction specificity and crucial catalysis-related interactions, facilitating a gradual feature modulation in the latent space from the general protein-molecule domain to the catalysis-aware domain. On top of this unified architecture, OmniESI can adapt to a variety of downstream tasks, including enzyme kinetic parameter prediction, enzyme-substrate pairing prediction, enzyme mutational effect prediction, and enzymatic active site annotation. Under multi-perspective performance evaluation in in-distribution and out-of-distribution settings, OmniESI consistently outperformed state-of-the-art specialized methods across seven benchmarks. More importantly, the proposed conditional networks were shown to internalize the fundamental patterns of catalytic efficiency while significantly improving prediction performance, with only negligible parameter increases (0.16%), as demonstrated by ablation studies on key components. Overall, OmniESI represents a unified predictive approach for enzyme-substrate interactions, providing an effective tool for catalytic mechanism cracking and enzyme engineering with strong generalization and broad applicability.
Submitted 22 June, 2025;
originally announced June 2025.
-
The Study of Human Preference Based on Integrated Analysis of N1 and LPP Components
Authors:
Siyuan Li,
Xiangze Meng,
Yijian Yang,
Yiwen Xu,
Yunfei Wang,
Chenghu Qiu,
Hanyi Jiang,
Pin Wu,
Shegnbo Chen,
Xiao Wei,
Hao Wang,
Lan Ni,
Huiran Zhang
Abstract:
Human preference research is a significant domain in psychology and psychophysiology, with broad applications in psychiatric evaluation and daily life quality enhancement. This study explores the neural mechanisms of human preference judgments through the analysis of event-related potentials (ERPs), specifically focusing on the early N1 component and the late positive potential (LPP). Using a mixed-image dataset covering items such as hats, fruits, snacks, scarves, drinks, and pets, we elicited a range of emotional responses from participants while recording their brain activity via EEG. Our work innovatively combines the N1 and LPP components to reveal distinct patterns across different preference levels. The N1 component, particularly in frontal regions, showed increased amplitude for preferred items, indicating heightened early visual attention. Similarly, the LPP component exhibited larger amplitudes for both preferred and non-preferred items, reflecting deeper emotional engagement and cognitive evaluation. In addition, we introduced a relationship model that integrates these ERP components to assess the intensity and direction of preferences, providing a novel method for interpreting EEG data in the context of emotional responses. These findings offer valuable insights into the cognitive and emotional processes underlying human preferences and present new possibilities for brain-computer interface applications, personalized marketing, and product design.
Submitted 26 May, 2025;
originally announced May 2025.
-
OPTIMUS: Predicting Multivariate Outcomes in Alzheimer's Disease Using Multi-modal Data amidst Missing Values
Authors:
Christelle Schneuwly Diaz,
Duy-Thanh Vu,
Julien Bodelet,
Duy-Cat Can,
Guillaume Blanc,
Haiting Jiang,
Lin Yao,
Guiseppe Pantaleo,
ADNI,
Oliver Y. Chén
Abstract:
Alzheimer's disease, a neurodegenerative disorder, is associated with neural, genetic, and proteomic factors while affecting multiple cognitive and behavioral faculties. Traditional AD prediction largely focuses on univariate disease outcomes, such as disease stages and severity. Multimodal data encode broader disease information than a single modality and may, therefore, improve disease prediction; but they often contain missing values. Recent "deeper" machine learning approaches show promise in improving prediction accuracy, yet the biological relevance of these models needs to be further charted. Integrating missing data analysis, predictive modeling, multimodal data analysis, and explainable AI, we propose OPTIMUS, a predictive, modular, and explainable machine learning framework, to unveil the many-to-many predictive pathways between multimodal input data and multivariate disease outcomes amidst missing values. OPTIMUS first applies modality-specific imputation to uncover data from each modality while optimizing overall prediction accuracy. It then maps multimodal biomarkers to multivariate outcomes using machine-learning and extracts biomarkers respectively predictive of each outcome. Finally, OPTIMUS incorporates XAI to explain the identified multimodal biomarkers. Using data from 346 cognitively normal subjects, 608 persons with mild cognitive impairment, and 251 AD patients, OPTIMUS identifies neural and transcriptomic signatures that jointly but differentially predict multivariate outcomes related to executive function, language, memory, and visuospatial function. Our work demonstrates the potential of building a predictive and biologically explainable machine-learning framework to uncover multimodal biomarkers that capture disease profiles across varying cognitive landscapes. The results improve our understanding of the complex many-to-many pathways in AD.
Submitted 14 March, 2025;
originally announced March 2025.
-
Generalizable Cervical Cancer Screening via Large-scale Pretraining and Test-Time Adaptation
Authors:
Hao Jiang,
Cheng Jin,
Huangjing Lin,
Yanning Zhou,
Xi Wang,
Jiabo Ma,
Li Ding,
Jun Hou,
Runsheng Liu,
Zhizhong Chai,
Luyang Luo,
Huijuan Shi,
Yinling Qian,
Qiong Wang,
Changzhong Li,
Anjia Han,
Ronald Cheong Kin Chan,
Hao Chen
Abstract:
Cervical cancer is a leading malignancy of the female reproductive system. While AI-assisted cytology offers a cost-effective and non-invasive screening solution, current systems struggle with generalizability in complex clinical scenarios. To address this issue, we introduced Smart-CCS, a generalizable Cervical Cancer Screening paradigm based on pretraining and adaptation to create robust and generalizable screening systems. To develop and validate Smart-CCS, we first curated a large-scale, multi-center dataset named CCS-127K, which comprises a total of 127,471 cervical cytology whole-slide images collected from 48 medical centers. By leveraging large-scale self-supervised pretraining, our CCS models are equipped with strong generalization capability, potentially generalizing across diverse scenarios. Then, we incorporated test-time adaptation to specifically optimize the trained CCS model for complex clinical settings, which adapts and refines predictions, improving real-world applicability. We conducted large-scale system evaluation among various cohorts. In retrospective cohorts, Smart-CCS achieved an overall area under the curve (AUC) value of 0.965 and sensitivity of 0.913 for cancer screening on 11 internal test datasets. In external testing, system performance remained high at 0.950 AUC across 6 independent test datasets. In prospective cohorts, our Smart-CCS achieved AUCs of 0.947, 0.924, and 0.986 in three prospective centers, respectively. Moreover, the system demonstrated superior sensitivity in diagnosing cervical cancer, confirming the accuracy of our cancer screening results by using histology findings for validation. Interpretability analysis with cell and slide predictions further indicated that the system's decision-making aligns with clinical practice. Smart-CCS represents a significant advancement in cancer screening across diverse clinical contexts.
Submitted 12 February, 2025;
originally announced February 2025.
-
Large Language Models for Bioinformatics
Authors:
Wei Ruan,
Yanjun Lyu,
Jing Zhang,
Jiazhang Cai,
Peng Shu,
Yang Ge,
Yao Lu,
Shang Gao,
Yue Wang,
Peilong Wang,
Lin Zhao,
Tao Wang,
Yufang Liu,
Luyang Fang,
Ziyu Liu,
Zhengliang Liu,
Yiwei Li,
Zihao Wu,
Junhao Chen,
Hanqi Jiang,
Yi Pan,
Zhenyuan Yang,
Jingyuan Chen,
Shizhe Liang,
Wei Zhang
, et al. (30 additional authors not shown)
Abstract:
With the rapid advancements in large language model (LLM) technology and the emergence of bioinformatics-specific language models (BioLMs), there is a growing need for a comprehensive analysis of the current landscape, computational characteristics, and diverse applications. This survey aims to address this need by providing a thorough review of BioLMs, focusing on their evolution, classification, and distinguishing features, alongside a detailed examination of training methodologies, datasets, and evaluation frameworks. We explore the wide-ranging applications of BioLMs in critical areas such as disease diagnosis, drug discovery, and vaccine development, highlighting their impact and transformative potential in bioinformatics. We identify key challenges and limitations inherent in BioLMs, including data privacy and security concerns, interpretability issues, biases in training data and model outputs, and domain adaptation complexities. Finally, we highlight emerging trends and future directions, offering valuable insights to guide researchers and clinicians toward advancing BioLMs for increasingly sophisticated biological and clinical applications.
Submitted 9 January, 2025;
originally announced January 2025.
-
COMET: Benchmark for Comprehensive Biological Multi-omics Evaluation Tasks and Language Models
Authors:
Yuchen Ren,
Wenwei Han,
Qianyuan Zhang,
Yining Tang,
Weiqiang Bai,
Yuchen Cai,
Lifeng Qiao,
Hao Jiang,
Dong Yuan,
Tao Chen,
Siqi Sun,
Pan Tan,
Wanli Ouyang,
Nanqing Dong,
Xinzhu Ma,
Peng Ye
Abstract:
As key elements within the central dogma, DNA, RNA, and proteins play crucial roles in maintaining life by guaranteeing accurate genetic expression and implementation. Although research on these molecules has profoundly impacted fields like medicine, agriculture, and industry, the diversity of machine learning approaches-from traditional statistical methods to deep learning models and large language models-poses challenges for researchers in choosing the most suitable models for specific tasks, especially for cross-omics and multi-omics tasks due to the lack of comprehensive benchmarks. To address this, we introduce the first comprehensive multi-omics benchmark COMET (Benchmark for Biological COmprehensive Multi-omics Evaluation Tasks and Language Models), designed to evaluate models across single-omics, cross-omics, and multi-omics tasks. First, we curate and develop a diverse collection of downstream tasks and datasets covering key structural and functional aspects in DNA, RNA, and proteins, including tasks that span multiple omics levels. Then, we evaluate existing foundational language models for DNA, RNA, and proteins, as well as the newly proposed multi-omics method, offering valuable insights into their performance in integrating and analyzing data from different biological modalities. This benchmark aims to define critical issues in multi-omics research and guide future directions, ultimately promoting advancements in understanding biological processes through integrated and different omics data analysis.
Submitted 13 December, 2024;
originally announced December 2024.
-
CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding
Authors:
Jiquan Wang,
Sha Zhao,
Zhiling Luo,
Yangxuan Zhou,
Haiteng Jiang,
Shijian Li,
Tao Li,
Gang Pan
Abstract:
Electroencephalography (EEG) is a non-invasive technique to measure and record brain electrical activity, widely used in various BCI and healthcare applications. Early EEG decoding methods rely on supervised learning, limited by specific tasks and datasets, hindering model performance and generalizability. With the success of large language models, there is a growing body of studies focusing on EEG foundation models. However, these studies still leave challenges. Firstly, most existing EEG foundation models employ a full EEG modeling strategy, which models the spatial and temporal dependencies between all EEG patches together but ignores that the spatial and temporal dependencies are heterogeneous due to the unique structural characteristics of EEG signals. Secondly, existing EEG foundation models have limited generalizability across a wide range of downstream BCI tasks due to the varying formats of EEG data, which make the models challenging to adapt. To address these challenges, we propose a novel foundation model called CBraMod. Specifically, we devise a criss-cross transformer as the backbone to thoroughly leverage the structural characteristics of EEG signals, which can model spatial and temporal dependencies separately through two parallel attention mechanisms. We also utilize an asymmetric conditional positional encoding scheme which can encode positional information of EEG patches and be easily adapted to EEG with diverse formats. CBraMod is pre-trained on a very large corpus of EEG through patch-based masked EEG reconstruction. We evaluate CBraMod on up to 10 downstream BCI tasks (12 public datasets). CBraMod achieves state-of-the-art performance across the wide range of tasks, proving its strong capability and generalizability. The source code is publicly available at https://github.com/wjq-learning/CBraMod.
Submitted 13 April, 2025; v1 submitted 10 December, 2024;
originally announced December 2024.
-
Optimize Individualized Energy Delivery for Septic Patients Using Predictive Deep Learning Models: A Real World Study
Authors:
Lu Wang,
Li Chang,
Ruipeng Zhang,
Kexun Li,
Yu Wang,
Wei Chen,
Xuanlin Feng,
Mingwei Sun,
Qi Wang,
Charles Damien Lu,
Jun Zeng,
Hua Jiang
Abstract:
Background and Objectives: We aim to establish deep learning models to optimize the individualized energy delivery for septic patients. Methods and Study Design: We conducted a study of adult septic patients in the intensive care unit (ICU), collecting 47 indicators for 14 days. After data cleaning and preprocessing, we used statistical analysis to explore energy delivery in deceased and surviving patients. We filtered out nutrition-related features and divided the data into three metabolic phases: acute early, acute late, and rehabilitation. Models were built using data before September 2020 and validated on the rest. We then established optimal energy target models for each phase using deep learning. Results: A total of 277 patients and 3115 data were included in this study. The models indicated that the optimal energy targets in the three phases were 900 kcal/d, 2300 kcal/d, and 2000 kcal/d, respectively. Excessive energy intake increased mortality rapidly in the early period of the acute phase. Insufficient energy in the late period of the acute phase significantly raised the mortality of septic patients. For the rehabilitation phase, too much and too little energy delivery were both associated with high mortality. Conclusion: Our study established time-series prediction models for septic patients to optimize energy delivery in the ICU. This approach indicated the feasibility of developing nutritional tools for critically ill patients. We recommend permissive underfeeding only in the early acute phase. Later, increased energy intake may improve survival and settle energy debts caused by underfeeding.
Submitted 3 February, 2024;
originally announced February 2024.
-
Insomnia impairs muscle function via regulating protein degradation and muscle clock
Authors:
Hui Ouyang,
Hong Jiang,
Jin Huang,
Zunjing Liu
Abstract:
Background: Insomnia makes people less physically able to perform daily duties, which results in a lack of strength. However, the effects of insomnia on muscle function have not yet been thoroughly investigated. The objectives of this study were therefore to clarify how insomnia contributes to the decrease of muscular function and to investigate the mechanisms behind this phenomenon. Methods: To understand how insomnia influences muscle function, we analyzed the expression levels of factors associated with muscle protein degradation, muscle protein synthesis, protein synthesis and degradation pathways, and the muscle clock. Results: Lower BMI and grip strength were observed in insomnia patients. The mice in the sleep deprivation (SD) group lost 7.01 g of body mass. The SD group's tibialis anterior and gastrocnemius muscle mass decreased after 96 h of SD. Grip strength was reduced in the SD group. Using RT-PCR, we found a significant increase in the expression of muscle degradation factors in the SD group versus the normal control group. Conclusions: Insomnia can impair muscle function. The mechanism may be associated with the increased expression of muscle degradation-related factors, as well as the abnormal expression of the Clock gene.
Submitted 8 December, 2023;
originally announced December 2023.
-
A large calcium-imaging dataset reveals a systematic V4 organization for natural scenes
Authors:
Tianye Wang,
Haoxuan Yao,
Tai Sing Lee,
Jiayi Hong,
Yang Li,
Hongfei Jiang,
Ian Max Andolina,
Shiming Tang
Abstract:
The visual system evolved to process natural scenes, yet most of our understanding of the topology and function of visual cortex derives from studies using artificial stimuli. To gain deeper insights into visual processing of natural scenes, we utilized widefield calcium-imaging of primate V4 in response to many natural images, generating a large dataset of columnar-scale responses. We used this dataset to build a digital twin of V4 via deep learning, generating a detailed topographical map of natural image preferences at each cortical position. The map revealed clustered functional domains for specific classes of natural image features. These ranged from surface-related attributes like color and texture to shape-related features such as edges, curvature, and facial features. We validated the model-predicted domains with additional widefield calcium-imaging and single-cell resolution two-photon imaging. Our study illuminates the detailed topological organization and neural codes in V4 that represent natural scenes.
Submitted 23 July, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Molecular tug of war reveals adaptive potential of an immune cell repertoire
Authors:
Hongda Jiang,
Shenshen Wang
Abstract:
The adaptive immune system constantly remodels its lymphocyte repertoire for better protection against future pathogens. Its ability to improve antigen recognition on the fly relies on somatic mutation and selective expansion of B lymphocytes expressing high-affinity antigen receptors. However, this Darwinian process inside an individual appears ineffective, hitting a modest ceiling of antigen-binding affinity. Experiments have begun to reveal that evolving B cells physically extract antigens from presenting cells and that the extraction level dictates clonal expansion; this challenges the prevailing assumption that the equilibrium constant of receptor-antigen binding determines selective advantage. We present a theoretical framework to explore whether, and how, such tug-of-war antigen extraction impacts the quality and diversity of an evolved B cell repertoire. We find that the apparent ineffectiveness of clonal selection can be a direct consequence of the non-equilibrium nature of antigen recognition. Our theory predicts that the physical strength of antigen tethering under tugging forces sets the affinity ceiling. Meanwhile, the model shows that, intriguingly, cells can use force variability to diversify binding phenotype without compromising fitness, thus remaining plastic under resource constraint. These results suggest that active probing of receptor quality via a molecular tug of war during antigen recognition limits the potency of response to the current antigen, but confers adaptive benefit for protection against future variants. Importantly, a saddle point in the fitness landscape of B cell phenotype evolution emerges from the tug-of-war setting, which rationalizes multiple key phenomenology and puts forward a role of active physical dynamics in immune adaptation.
Submitted 28 September, 2022;
originally announced September 2022.
-
Immune cells use active tugging forces to distinguish affinity and accelerate evolution
Authors:
Hongda Jiang,
Shenshen Wang
Abstract:
Cells are known to exert forces to sense their physical surroundings for guidance of motion and fate decisions. Here, we propose that cells might do mechanical work to drive their own evolution, taking inspiration from the adaptive immune system. Growing evidence indicates that immune B cells - capable of rapid Darwinian evolution - use cytoskeletal forces to actively extract antigen from other cells' surface. To elucidate the evolutionary significance of force usage, we develop a theory of tug-of-war antigen extraction that maps receptor binding characteristics to clonal reproductive fitness, revealing physical determinants of selection strength. This framework unifies mechanosensing and affinity-discrimination capabilities of evolving cells: pulling against stiff antigen tethers enhances discrimination stringency at the expense of absolute extraction. As a consequence, active force usage can accelerate adaptation but may also cause extinction of cell populations, resulting in an optimal range of pulling strength that matches molecular rupture forces observed in cells. Our work suggests that nonequilibrium, physical extraction of environmental signals can make biological systems more evolvable at a moderate energy cost.
Submitted 28 September, 2022;
originally announced September 2022.
-
Calcium oscillation on homogeneous and heterogeneous networks of ryanodine receptor
Authors:
Zhong-Xue Gao,
Tian-Tian Li,
Han-Yu Jiang,
Jun He
Abstract:
Calcium oscillation is an important part of calcium homeostasis, the imbalance of which is a key mechanism in the initiation and progression of many major diseases. The formation and maintenance of calcium homeostasis are closely related to the spatial distribution of calcium channels. In the current paper, a theoretical framework is established by abstracting the spatial distribution of the calcium channels as a nonlinear biological complex network with calcium channels as nodes and Ca$^{2+}$ as edges. A dynamical model of the ryanodine receptor (RyR) is adopted to investigate the effect of spatial distribution on calcium oscillation. The mean-field model can be well reproduced from the complete graph and the dense Erdős-Rényi network. The synchronization of RyRs is found to be important for generating a global calcium oscillation. The clique graph with a cluster structure cannot produce a global oscillation because synchronization between clusters fails. A more realistic geometric network is constructed in a two-dimensional plane based on experimental information about the arrangement of RyRs in clusters and the frequency distribution of cluster sizes. Unlike the clique graph, the geometric network can generate a global oscillation with reasonable parameters. The simulation also suggests that the existence of small clusters and rogue RyRs plays an important role in maintaining the global calcium oscillation by keeping large clusters synchronized. Such results support a heterogeneous distribution of RyRs in clusters of different sizes, which helps in understanding recent observations made with super-resolution nanoscale imaging techniques. The current theoretical framework can also be extended to investigate other phenomena in calcium signal transduction.
Submitted 1 February, 2023; v1 submitted 24 July, 2022;
originally announced July 2022.
-
Enhanced compound-protein binding affinity prediction by representing protein multimodal information via a coevolutionary strategy
Authors:
Binjie Guo,
Hanyu Zheng,
Haohan Jiang,
Xiaodan Li,
Naiyu Guan,
Yanming Zuo,
Yicheng Zhang,
Hengfu Yang,
Xuhua Wang
Abstract:
Due to the lack of a method to efficiently represent the multimodal information of a protein, including its structure and sequence information, predicting compound-protein binding affinity (CPA) still suffers from low accuracy when applying machine learning methods. To overcome this limitation, in a novel end-to-end architecture (named FeatNN), we develop a coevolutionary strategy to jointly represent the structure and sequence features of proteins and ultimately optimize the mathematical models for predicting CPA. Furthermore, from a data-driven perspective, we propose a rational method that can utilize both high- and low-quality databases to optimize the accuracy and generalization ability of FeatNN in CPA prediction tasks. Notably, we visually interpret the feature interaction process between sequence and structure in the rationally designed architecture. As a result, FeatNN considerably outperforms the state-of-the-art (SOTA) baseline in virtual drug screening tasks, indicating the feasibility of this approach for practical use. FeatNN provides an outstanding method for higher CPA prediction accuracy and better generalization ability by efficiently representing multimodal information of proteins via a coevolutionary strategy.
Submitted 23 November, 2022; v1 submitted 29 March, 2022;
originally announced April 2022.
-
Dynamic Ensemble Bayesian Filter for Robust Control of a Human Brain-machine Interface
Authors:
Yu Qi,
Xinyun Zhu,
Kedi Xu,
Feixiao Ren,
Hongjie Jiang,
Junming Zhu,
Jianmin Zhang,
Gang Pan,
Yueming Wang
Abstract:
Objective: Brain-machine interfaces (BMIs) aim to provide direct brain control of devices such as prostheses and computer cursors, which have demonstrated great potential for mobility restoration. One major limitation of current BMIs lies in the unstable performance in online control due to the variability of neural signals, which seriously hinders the clinical availability of BMIs. Method: To deal with the neural variability in online BMI control, we propose a dynamic ensemble Bayesian filter (DyEnsemble). DyEnsemble extends Bayesian filters with a dynamic measurement model, which adjusts its parameters in time adaptively with neural changes. This is achieved by learning a pool of candidate functions and dynamically weighting and assembling them according to neural signals. In this way, DyEnsemble copes with variability in signals and improves the robustness of online control. Results: Online BMI experiments with a human participant demonstrate that, compared with the velocity Kalman filter, DyEnsemble significantly improves the control accuracy (increases the success rate by 13.9% and reduces the reach time by 13.5% in the random target pursuit task) and robustness (performs more stably over different experiment days). Conclusion: Our results demonstrate the superiority of DyEnsemble in online BMI control. Significance: DyEnsemble frames a novel and flexible framework for robust neural decoding, which is beneficial to different neural decoding applications.
Submitted 22 April, 2022;
originally announced April 2022.
-
A Deep Learning Approach to Predicting Ventilator Parameters for Mechanically Ventilated Septic Patients
Authors:
Zhijun Zeng,
Zhen Hou,
Ting Li,
Lei Deng,
Jianguo Hou,
Xinran Huang,
Jun Li,
Meirou Sun,
Yunhan Wang,
Qiyu Wu,
Wenhao Zheng,
Hua Jiang,
Qi Wang
Abstract:
We develop a deep learning approach to predicting a set of ventilator parameters for a mechanically ventilated septic patient using a long short-term memory (LSTM) recurrent neural network (RNN) model. We focus on short-term predictions of a set of ventilator parameters for the septic patient in the emergency intensive care unit (EICU). The short-term predictability of the model provides attending physicians with early warnings so that they can make timely adjustments to the treatment of the patient in the EICU. The patient-specific deep learning model can be trained on any given critically ill patient, making it an intelligent aide for physicians to use in emergent medical situations.
Submitted 20 February, 2022;
originally announced February 2022.
-
Connectivity Concepts in Neuronal Network Modeling
Authors:
Johanna Senk,
Birgit Kriener,
Mikael Djurfeldt,
Nicole Voges,
Han-Jia Jiang,
Lisa Schüttler,
Gabriele Gramelsberger,
Markus Diesmann,
Hans E. Plesser,
Sacha J. van Albada
Abstract:
Sustainable research on computational models of neuronal networks requires published models to be understandable, reproducible, and extendable. Missing details or ambiguities about mathematical concepts and assumptions, algorithmic implementations, or parameterizations hinder progress. Such flaws are unfortunately frequent and one reason is a lack of readily applicable standards and tools for model description. Our work aims to advance complete and concise descriptions of network connectivity but also to guide the implementation of connection routines in simulation software and neuromorphic hardware systems. We first review models made available by the computational neuroscience community in the repositories ModelDB and Open Source Brain, and investigate the corresponding connectivity structures and their descriptions in both manuscript and code. The review comprises the connectivity of networks with diverse levels of neuroanatomical detail and exposes how connectivity is abstracted in existing description languages and simulator interfaces. We find that a substantial proportion of the published descriptions of connectivity is ambiguous. Based on this review, we derive a set of connectivity concepts for deterministically and probabilistically connected networks and also address networks embedded in metric space. Beside these mathematical and textual guidelines, we propose a unified graphical notation for network diagrams to facilitate an intuitive understanding of network properties. Examples of representative network models demonstrate the practical use of the ideas. We hope that the proposed standardizations will contribute to unambiguous descriptions and reproducible implementations of neuronal network connectivity in computational neuroscience.
Submitted 15 June, 2022; v1 submitted 6 October, 2021;
originally announced October 2021.
-
Nonlinear signal transduction network with multistate
Authors:
Han-Yu Jiang,
Jun He
Abstract:
Signal transduction is an important and basic mechanism in cell life activities. The stochastic state transition of a receptor induces the release of signaling molecules, which trigger the state transitions of other receptors. This constructs a nonlinear signaling network and leads to robust switch-like properties that are critical to biological function. Network architectures and the state transitions of the receptor affect the performance of this biological network. In this work, we study nonlinear signaling on a biological polymorphic network by analyzing the network dynamics of the Ca$^{2+}$-induced Ca$^{2+}$ release mechanism, where fast and slow processes are involved and the receptor has four conformational states. Three types of networks, the Erdős-Rényi network, the Watts-Strogatz network, and the Barabási-Albert network, are considered with different parameters. The dynamics of the biological networks exhibit different patterns at different time scales. At short time scales, the second open state is essential to reproduce the quasi-bistable regime, which emerges at a critical strength of connection for all three states involved in the fast processes and disappears at another critical point. The pattern at short time scales is not sensitive to the network architecture. At long time scales, only a monostable regime is observed, and differences in network architecture affect the results more strongly. Our findings identify features of nonlinear multistate signaling networks that may underlie their biological function.
Submitted 25 October, 2021; v1 submitted 14 June, 2021;
originally announced June 2021.
-
Functional annotation of creeping bentgrass protein sequences based on convolutional neural network
Authors:
Han-Yu Jiang,
Jun He
Abstract:
Background: Creeping bentgrass (Agrostis stolonifera) is a perennial cold-season turfgrass of the Gramineae family with poor disease resistance. Up to now, little is known about the induced systemic resistance (ISR) mechanism, especially the relevant functional proteins, which is important to the disease resistance of turfgrass. Obtaining more information about the proteins of infected creeping bentgrass is helpful for understanding the ISR mechanism. Results: Creeping bentgrass seedlings were grown with BDO treatment, and the ISR response was induced by infection with Rhizoctonia solani. High-quality protein sequences of creeping bentgrass seedlings were obtained. Some of the protein sequences were functionally annotated according to database alignment, while a large part of the obtained protein sequences was left non-annotated. To treat the non-annotated sequences, a prediction model based on a convolutional neural network was established with the dataset from the UniProt database in three domains to acquire good performance, especially the higher false positive control rate. With the established model, the non-annotated protein sequences of creeping bentgrass were analyzed to annotate proteins relevant to the disease-resistance response and signal transduction. Conclusions: The prediction model based on a convolutional neural network was successfully applied to select good candidates for proteins with functions relevant to the ISR mechanism from the protein sequences that cannot be annotated by database alignment. The waste of sequence data can be avoided, and research time and labor will be saved in further molecular biology research on creeping bentgrass proteins. It also provides a reference for other sequence analyses in turfgrass disease-resistance research.
Submitted 24 May, 2022; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Three-dimensional cytoplasmic calcium propagation with boundaries
Authors:
Han-Yu Jiang,
Jun He
Abstract:
Ca$^{2+}$ plays an important role in cell signal transduction. Its intracellular propagation is the most basic process of Ca$^{2+}$ signaling, as in the calcium wave and the double messenger system. In this work, with both numerical simulation and a mean-field ansatz, the three-dimensional probability distribution of Ca$^{2+}$, which is read out by phosphorylation, is studied in two scenarios with boundaries. The coverage of the Ca$^{2+}$ distribution is found to be on the order of $\mu$m, which is consistent with experimentally observed calcium spikes and waves. Our results suggest that the double messenger system may occur in the ER-PM junction to acquire great efficiency. The buffer effect of kinase is also discussed by calculating the average position of phosphorylations and free Ca$^{2+}$. The results are helpful for understanding the mechanism of Ca$^{2+}$ signaling.
Submitted 10 November, 2020; v1 submitted 17 May, 2020;
originally announced May 2020.
-
Cost-effectiveness Analysis of Antiepidemic Policies and Global Situation Assessment of COVID-19
Authors:
Liyan Xu,
Hongmou Zhang,
Yuqiao Deng,
Keli Wang,
Fu Li,
Qing Lu,
Jie Yin,
Qian Di,
Tao Liu,
Hang Yin,
Zijiao Zhang,
Qingyang Du,
Hongbin Yu,
Aihan Liu,
Hezhishi Jiang,
Jing Guo,
Xiumei Yuan,
Yun Zhang,
Liu Liu,
Yu Liu
Abstract:
With a two-layer contact-dispersion model and data in China, we analyze the cost-effectiveness of three types of antiepidemic measures for COVID-19: regular epidemiological control, local social interaction control, and inter-city travel restriction. We find that: 1) intercity travel restriction has minimal or even negative effect compared to the other two at the national level; 2) the time of reaching turning point is independent of the current number of cases, and only related to the enforcement stringency of epidemiological control and social interaction control measures; 3) strong enforcement at the early stage is the only opportunity to maximize both antiepidemic effectiveness and cost-effectiveness; 4) mediocre stringency of social interaction measures is the worst choice. Subsequently, we cluster countries/regions into four groups based on their control measures and provide situation assessment and policy suggestions for each group.
Submitted 23 April, 2020; v1 submitted 16 April, 2020;
originally announced April 2020.
-
Visual Data Analysis and Simulation Prediction for COVID-19
Authors:
Baoquan Chen,
Mingyi Shi,
Xingyu Ni,
Liangwang Ruan,
Hongda Jiang,
Heyuan Yao,
Mengdi Wang,
Zhenhua Song,
Qiang Zhou,
Tong Ge
Abstract:
The COVID-19 (formerly, 2019-nCoV) epidemic has become a global health emergency, and the WHO has accordingly declared it a Public Health Emergency of International Concern (PHEIC). China has been hit hardest since the outbreak of the virus, which some experts date as far back as late November. It was not until January 23rd that the Wuhan government finally recognized the severity of the epidemic and took the drastic measure of closing down all transportation connecting the city to the outside world to curtail the spread of the virus. In this study, we seek to answer a few questions: How did the virus spread from the epicenter, Wuhan, to the rest of the country? To what extent did measures such as city closure and community quarantine help control the situation? More importantly, can we forecast any significant future development of the event if some of the conditions had changed? By collecting and visualizing publicly available data, we first show patterns and characteristics of the epidemic development; we then employ a mathematical model of disease transmission dynamics to evaluate the effectiveness of some epidemic control measures and, more importantly, to offer a few tips on preventive measures.
Submitted 6 March, 2020; v1 submitted 14 February, 2020;
originally announced February 2020.
-
Trait-space patterning and the role of feedback in antigen-immunity coevolution
Authors:
Hongda Jiang,
Shenshen Wang
Abstract:
Coevolutionary arms races form between interacting populations that constitute each other's environment and respond to mutual changes. This inherently far-from-equilibrium process finds striking manifestations in the adaptive immune system, where highly variable antigens and a finite repertoire of immune receptors coevolve on comparable timescales. This unique challenge to the immune system motivates general questions: How do ecological and evolutionary processes interplay to shape diversity? What determines the endurance and fate of coevolution? Here, we take the perspective of responsive environments and develop a phenotypic model of coevolution between receptors and antigens that both exhibit cross-reactivity (one-to-many responses). The theory predicts that the extent of asymmetry in cross-reactivity is a key determinant of repertoire composition: small asymmetry supports persistent large diversity, whereas strong asymmetry yields long-lived transients of quasispecies in both populations. The latter represents a new type of Turing mechanism. More surprisingly, patterning in the trait space feeds back on population dynamics: spatial resonance between the Turing modes breaks the dynamic balance, leading to antigen extinction or unrestrained growth. Model predictions can be tested via combined genomic and phenotypic measurements. Our work identifies cross-reactivity as an important regulator of diversity and coevolutionary outcome, and reveals the remarkable effect of ecological feedback in pattern-forming systems, which drives evolution toward non-steady states different from the Red Queen persistent cycles.
Submitted 29 July, 2019;
originally announced July 2019.
-
What evidence does deep learning model use to classify Skin Lesions?
Authors:
Xiaoxiao Li,
Junyan Wu,
Eric Z. Chen,
Hongda Jiang
Abstract:
Melanoma is a type of skin cancer with the most rapidly increasing incidence. Early detection of melanoma using dermoscopy images significantly increases patients' survival rate. However, accurately classifying skin lesions by eye, especially in the early stage of melanoma, is extremely challenging for dermatologists. Hence, the discovery of reliable biomarkers will be meaningful for melanoma diagnosis. In recent years, the value of deep-learning-empowered computer-assisted diagnosis has been shown in biomedical-imaging-based decision making. However, much research focuses on improving disease detection accuracy rather than exploring the evidence of pathology. In this paper, we propose a method to interpret the deep learning classification findings. Firstly, we propose an accurate neural network architecture to classify skin lesions. Secondly, we utilize a prediction difference analysis method that examines each patch of the image through patch-wise corruption to detect the biomarkers. Lastly, we validate that our biomarker findings correspond to the patterns in the literature. The findings can be significant and useful for guiding clinical diagnosis.
Submitted 13 February, 2019; v1 submitted 2 November, 2018;
originally announced November 2018.
-
ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network
Authors:
Renzhi Cao,
Colton Freitas,
Leong Chan,
Miao Sun,
Haiqing Jiang,
Zhangxin Chen
Abstract:
With the development of next-generation sequencing techniques, it is fast and cheap to determine protein sequences but relatively slow and expensive to extract useful information from them because of the limitations of traditional biological experimental techniques. Protein function prediction has been a long-standing challenge in filling the gap between the huge number of protein sequences and the known functions. In this paper, we propose a novel method that converts the protein function problem into a language translation problem, from the newly proposed protein sequence language "ProLan" to the protein function language "GOLan", and we build a neural machine translation model based on recurrent neural networks to translate the "ProLan" language into the "GOLan" language. We blindly tested our method by participating in the third Critical Assessment of Function Annotation (CAFA 3) in 2016, and we also evaluate the performance of our method on selected proteins whose function was released after the CAFA competition. The good performance on the training and testing datasets demonstrates that our newly proposed method is a promising direction for protein function prediction. In summary, we propose, for the first time, a method that converts the protein function prediction problem into a language translation problem and applies a neural machine translation model to protein function prediction.
Submitted 19 October, 2017;
originally announced October 2017.
-
A Unified Model for Differential Expression Analysis of RNA-seq Data via L1-Penalized Linear Regression
Authors:
Kefei Liu,
Jieping Ye,
Yang Yang,
Li Shen,
Hui Jiang
Abstract:
RNA sequencing (RNA-seq) is becoming increasingly popular for quantifying gene expression levels. Since RNA-seq measurements are relative in nature, between-sample normalization of counts is an essential step in differential expression (DE) analysis. In existing DE detection algorithms, normalization is ad hoc and performed once and for all prior to DE detection, which may be suboptimal since ideally normalization should be based on non-DE genes only and thus coupled with DE detection. We propose a unified statistical model for joint normalization and DE detection of log-transformed RNA-seq data. Sample-specific normalization factors are modeled as unknown parameters in the gene-wise linear models and jointly estimated with the regression coefficients. By imposing a sparsity-inducing L1 penalty (or a mixed L1/L2 norm for multiple treatment conditions) on the regression coefficients, we formulate the problem as a penalized least-squares regression problem and apply the augmented Lagrangian method to solve it. Simulation studies show that the proposed model and algorithms outperform existing methods in terms of detection power and false-positive rate when more than half of the genes are differentially expressed and/or when the numbers of up- and down-regulated genes among the DE genes are unbalanced.
Submitted 11 October, 2016;
originally announced October 2016.
-
Salient Object Detection: A Survey
Authors:
Ali Borji,
Ming-Ming Cheng,
Qibin Hou,
Huaizu Jiang,
Jia Li
Abstract:
Detecting and segmenting salient objects from natural scenes, often referred to as salient object detection, has attracted great interest in computer vision. While many models have been proposed and several applications have emerged, a deep understanding of achievements and issues remains lacking. We aim to provide a comprehensive review of recent progress in salient object detection and situate this field among other closely related areas such as generic scene segmentation, object proposal generation, and saliency for fixation prediction. Covering 228 publications, we survey i) roots, key concepts, and tasks, ii) core techniques and main modeling trends, and iii) datasets and evaluation metrics for salient object detection. We also discuss open problems such as evaluation metrics and dataset bias in model performance, and suggest future research directions.
Submitted 1 July, 2019; v1 submitted 18 November, 2014;
originally announced November 2014.
-
Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species
Authors:
Keith R. Bradnam,
Joseph N. Fass,
Anton Alexandrov,
Paul Baranay,
Michael Bechner,
İnanç Birol,
Sébastien Boisvert,
Jarrod A. Chapman,
Guillaume Chapuis,
Rayan Chikhi,
Hamidreza Chitsaz,
Wen-Chi Chou,
Jacques Corbeil,
Cristian Del Fabbro,
T. Roderick Docking,
Richard Durbin,
Dent Earl,
Scott Emrich,
Pavel Fedotov,
Nuno A. Fonseca,
Ganeshkumar Ganapathy,
Richard A. Gibbs,
Sante Gnerre,
Élénie Godzaridis,
Steve Goldstein
, et al. (66 additional authors not shown)
Abstract:
Background - The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results - In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and a snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions - Many current genome assemblers produced useful assemblies, containing a significant representation of their genes, regulatory sequences, and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another.
Submitted 27 June, 2013; v1 submitted 23 January, 2013;
originally announced January 2013.
-
Flexibility Induced Motion Transition of Active Filament: Rotation without Long-range Hydrodynamic Interaction
Authors:
Huijun Jiang,
Zhonghuai Hou
Abstract:
We investigate the motion of an active semiflexible filament, including shape kinematics and hydrodynamic interaction. Three types of filament motion are found: translation, snaking, and rotation. A change in flexibility induces an instability of the shape kinematics and, further, an asymmetry of the shape kinematics with respect to the motion of the center of mass, which are responsible for a continuous-like transition from translation to snaking and a first-order-like transition from snaking to rotation, respectively. Of particular interest, we find that long-range hydrodynamic interaction is not necessary for filament rotation, but can remarkably enlarge the parameter region in which it appears. This finding may provide evidence that the experimentally observed collective rotation of active filaments is more likely to arise from individual filament properties, even without long-range hydrodynamic interaction.
Submitted 23 October, 2012;
originally announced October 2012.
-
Statistical Modeling of RNA-Seq Data
Authors:
Julia Salzman,
Hui Jiang,
Wing Hung Wong
Abstract:
Recently, ultra high-throughput sequencing of RNA (RNA-Seq) has been developed as an approach for analysis of gene expression. By obtaining tens or even hundreds of millions of reads of transcribed sequences, an RNA-Seq experiment can offer a comprehensive survey of the population of genes (transcripts) in any sample of interest. This paper introduces a statistical model for estimating isoform abundance from RNA-Seq data that is flexible enough to accommodate both single-end and paired-end RNA-Seq data as well as sampling bias along the length of the transcript. Based on the derivation of minimal sufficient statistics for the model, a computationally feasible implementation of the maximum likelihood estimator of the model is provided. Further, it is shown that paired-end RNA-Seq provides more accurate isoform abundance estimates than single-end sequencing at fixed sequencing depth. Simulation studies are also given.
Submitted 16 June, 2011;
originally announced June 2011.
-
Spatiotemporal dynamics on small-world neuronal networks: The roles of two types of time-delayed coupling
Authors:
Hao Wu,
Huijun Jiang,
Zhonghuai Hou
Abstract:
We investigate temporal coherence and spatial synchronization on small-world networks consisting of noisy Terman-Wang (TW) excitable neurons as a function of two types of time-delayed coupling: $\{x_j(t-τ)-x_i (t)\}$ and $\{x_j(t-τ)-x_i(t-τ)\}$. For the former case, we show that time delay in the coupling can dramatically enhance temporal coherence and spatial synchrony of the noise-induced spike trains. In addition, if the delay time $τ$ is tuned to nearly match the intrinsic spike period of the neuronal network, the system dynamics reaches a most ordered state, which is both periodic in time and nearly synchronized in space, demonstrating an interesting resonance phenomenon with delay. For the latter case, however, we cannot achieve a similar spatiotemporally ordered state, but the neuronal dynamics exhibits an interesting synchronization transition with time delay from zigzag fronts of excitations to dynamic clustering anti-phase synchronization (APS), and further to clustered chimera states which have spatially distributed anti-phase coherence separated by incoherence. Furthermore, we also show how these findings are influenced by changes in the noise intensity and the rewiring probability. Finally, a qualitative analysis is given to illustrate the numerical results.
Submitted 18 April, 2011; v1 submitted 18 April, 2011;
originally announced April 2011.
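The two coupling forms named in the abstract above differ only in whether the receiving neuron's own state is also delayed. A minimal sketch of that difference follows, assuming a discretized trajectory stored as `history[step][node]`, a 0/1 adjacency matrix, and a delay expressed in whole time steps. All names are hypothetical, and the Terman-Wang dynamics, noise, and small-world rewiring are deliberately left out.

```python
def delayed_coupling(history, adjacency, i, t, delay, delay_self):
    """Coupling input to node i at step t, summed over its neighbours j.

    delay_self=False: sum_j [x_j(t - delay) - x_i(t)]
    delay_self=True:  sum_j [x_j(t - delay) - x_i(t - delay)]
    """
    if t - delay < 0:
        # Guard against Python's negative indexing silently wrapping around.
        raise ValueError("not enough history for the requested delay")
    past = history[t - delay]
    x_i_ref = past[i] if delay_self else history[t][i]
    return sum(
        past[j] - x_i_ref
        for j, linked in enumerate(adjacency[i])
        if linked
    )


if __name__ == "__main__":
    # Three fully connected nodes, three stored time steps (toy values).
    history = [[0.0, 1.0, 2.0], [0.5, 1.5, 2.5], [1.0, 2.0, 3.0]]
    adjacency = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
    print(delayed_coupling(history, adjacency, i=0, t=2, delay=2, delay_self=False))
    print(delayed_coupling(history, adjacency, i=0, t=2, delay=2, delay_self=True))
```

With zero delay the two forms coincide, which gives a quick sanity check for any simulation built on top of such a helper.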