Search | arXiv e-print repository

arXiv:2503.19300 [pdf, other]

UniMoMo: Unified Generative Modeling of 3D Molecules for De Novo Binder Design

Authors: Xiangzhe Kong, Zishen Zhang, Ziting Zhang, Rui Jiao, Jianzhu Ma, Wenbing Huang, Kai Liu, Yang Liu

Abstract: The design of target-specific molecules such as small molecules, peptides, and antibodies is vital for biological research and drug discovery. Existing generative methods are restricted to single-domain molecules, failing to address versatile therapeutic needs or utilize cross-domain transferability to enhance model performance. In this paper, we introduce Unified generative Modeling of 3D Molecul… ▽ More The design of target-specific molecules such as small molecules, peptides, and antibodies is vital for biological research and drug discovery. Existing generative methods are restricted to single-domain molecules, failing to address versatile therapeutic needs or utilize cross-domain transferability to enhance model performance. In this paper, we introduce Unified generative Modeling of 3D Molecules (UniMoMo), the first framework capable of designing binders of multiple molecular domains using a single model. In particular, UniMoMo unifies the representations of different molecules as graphs of blocks, where each block corresponds to either a standard amino acid or a molecular fragment. Subsequently, UniMoMo utilizes a geometric latent diffusion model for 3D molecular generation, featuring an iterative full-atom autoencoder to compress blocks into latent space points, followed by an E(3)-equivariant diffusion process. Extensive benchmarks across peptides, antibodies, and small molecules demonstrate the superiority of our unified framework over existing domain-specific models, highlighting the benefits of multi-domain training. △ Less

Submitted 12 May, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

Comments: Accepted to ICML 2025

arXiv:2503.03989 [pdf, other]

Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows

Authors: Xiangxin Zhou, Yi Xiao, Haowei Lin, Xinheng He, Jiaqi Guan, Yang Wang, Qiang Liu, Feng Zhou, Liang Wang, Jianzhu Ma

Abstract: The dynamic nature of proteins, influenced by ligand interactions, is essential for comprehending protein function and progressing drug discovery. Traditional structure-based drug design (SBDD) approaches typically target binding sites with rigid structures, limiting their practical application in drug development. While molecular dynamics simulation can theoretically capture all the biologically… ▽ More The dynamic nature of proteins, influenced by ligand interactions, is essential for comprehending protein function and progressing drug discovery. Traditional structure-based drug design (SBDD) approaches typically target binding sites with rigid structures, limiting their practical application in drug development. While molecular dynamics simulation can theoretically capture all the biologically relevant conformations, the transition rate is dictated by the intrinsic energy barrier between them, making the sampling process computationally expensive. To overcome the aforementioned challenges, we propose to use generative modeling for SBDD considering conformational changes of protein pockets. We curate a dataset of apo and multiple holo states of protein-ligand complexes, simulated by molecular dynamics, and propose a full-atom flow model (and a stochastic version), named DynamicFlow, that learns to transform apo pockets and noisy ligands into holo pockets and corresponding 3D ligand molecules. Our method uncovers promising ligand molecules and corresponding holo conformations of pockets. Additionally, the resultant holo-like states provide superior inputs for traditional SBDD approaches, playing a significant role in practical drug discovery. △ Less

Submitted 5 March, 2025; originally announced March 2025.

Comments: Accepted to ICLR 2025

arXiv:2502.09662 [pdf, other]

Generalizable Cervical Cancer Screening via Large-scale Pretraining and Test-Time Adaptation

Authors: Hao Jiang, Cheng Jin, Huangjing Lin, Yanning Zhou, Xi Wang, Jiabo Ma, Li Ding, Jun Hou, Runsheng Liu, Zhizhong Chai, Luyang Luo, Huijuan Shi, Yinling Qian, Qiong Wang, Changzhong Li, Anjia Han, Ronald Cheong Kin Chan, Hao Chen

Abstract: Cervical cancer is a leading malignancy in female reproductive system. While AI-assisted cytology offers a cost-effective and non-invasive screening solution, current systems struggle with generalizability in complex clinical scenarios. To address this issue, we introduced Smart-CCS, a generalizable Cervical Cancer Screening paradigm based on pretraining and adaptation to create robust and general… ▽ More Cervical cancer is a leading malignancy in female reproductive system. While AI-assisted cytology offers a cost-effective and non-invasive screening solution, current systems struggle with generalizability in complex clinical scenarios. To address this issue, we introduced Smart-CCS, a generalizable Cervical Cancer Screening paradigm based on pretraining and adaptation to create robust and generalizable screening systems. To develop and validate Smart-CCS, we first curated a large-scale, multi-center dataset named CCS-127K, which comprises a total of 127,471 cervical cytology whole-slide images collected from 48 medical centers. By leveraging large-scale self-supervised pretraining, our CCS models are equipped with strong generalization capability, potentially generalizing across diverse scenarios. Then, we incorporated test-time adaptation to specifically optimize the trained CCS model for complex clinical settings, which adapts and refines predictions, improving real-world applicability. We conducted large-scale system evaluation among various cohorts. In retrospective cohorts, Smart-CCS achieved an overall area under the curve (AUC) value of 0.965 and sensitivity of 0.913 for cancer screening on 11 internal test datasets. In external testing, system performance maintained high at 0.950 AUC across 6 independent test datasets. In prospective cohorts, our Smart-CCS achieved AUCs of 0.947, 0.924, and 0.986 in three prospective centers, respectively. Moreover, the system demonstrated superior sensitivity in diagnosing cervical cancer, confirming the accuracy of our cancer screening results by using histology findings for validation. Interpretability analysis with cell and slide predictions further indicated that the system's decision-making aligns with clinical practice. Smart-CCS represents a significant advancement in cancer screening across diverse clinical contexts. △ Less

Submitted 12 February, 2025; originally announced February 2025.

arXiv:2501.15055 [pdf, other]

Group Ligands Docking to Protein Pockets

Authors: Jiaqi Guan, Jiahan Li, Xiangxin Zhou, Xingang Peng, Sheng Wang, Yunan Luo, Jian Peng, Jianzhu Ma

Abstract: Molecular docking is a key task in computational biology that has attracted increasing interest from the machine learning community. While existing methods have achieved success, they generally treat each protein-ligand pair in isolation. Inspired by the biochemical observation that ligands binding to the same target protein tend to adopt similar poses, we propose \textsc{GroupBind}, a novel molec… ▽ More Molecular docking is a key task in computational biology that has attracted increasing interest from the machine learning community. While existing methods have achieved success, they generally treat each protein-ligand pair in isolation. Inspired by the biochemical observation that ligands binding to the same target protein tend to adopt similar poses, we propose \textsc{GroupBind}, a novel molecular docking framework that simultaneously considers multiple ligands docking to a protein. This is achieved by introducing an interaction layer for the group of ligands and a triangle attention module for embedding protein-ligand and group-ligand pairs. By integrating our approach with diffusion-based docking model, we set a new S performance on the PDBBind blind docking benchmark, demonstrating the effectiveness of our proposed molecular docking paradigm. △ Less

Submitted 24 January, 2025; originally announced January 2025.

Comments: 18 pages, published in ICLR 2025

arXiv:2501.01768 [pdf, other]

Remodeling Peptide-MHC-TCR Triad Binding as Sequence Fusion for Immunogenicity Prediction

Authors: Jiahao Ma, Hongzong Li, Jian-Dong Huang, Ye-Fan Hu, Yifan Chen

Abstract: The complex nature of tripartite peptide-MHC-TCR interactions is a critical yet underexplored area in immunogenicity prediction. Traditional studies on TCR-antigen binding have not fully addressed the complex dependencies in triad binding. In this paper, we propose new modeling approaches for these tripartite interactions, utilizing sequence information from MHCs, peptides, and TCRs. Our methods a… ▽ More The complex nature of tripartite peptide-MHC-TCR interactions is a critical yet underexplored area in immunogenicity prediction. Traditional studies on TCR-antigen binding have not fully addressed the complex dependencies in triad binding. In this paper, we propose new modeling approaches for these tripartite interactions, utilizing sequence information from MHCs, peptides, and TCRs. Our methods adhere to native sequence forms and align with biological processes to enhance prediction accuracy. By incorporating representation learning techniques, we introduce a fusion mechanism to integrate the three sequences effectively. Empirical experiments show that our models outperform traditional methods, achieving a 2.8 to 13.3 percent improvement in prediction accuracy across existing benchmarks. We further validate our approach with extensive ablation studies, demonstrating the effectiveness of the proposed model components. The model implementation, code, and supplementary materials, including a manuscript with colored hyperlinks and a technical appendix for digital viewing, will be open-sourced upon publication. △ Less

Submitted 3 January, 2025; originally announced January 2025.

Comments: 27 Pages 5 Figures

arXiv:2411.18463 [pdf, other]

Hotspot-Driven Peptide Design via Multi-Fragment Autoregressive Extension

Authors: Jiahan Li, Tong Chen, Shitong Luo, Chaoran Cheng, Jiaqi Guan, Ruihan Guo, Sheng Wang, Ge Liu, Jian Peng, Jianzhu Ma

Abstract: Peptides, short chains of amino acids, interact with target proteins, making them a unique class of protein-based therapeutics for treating human diseases. Recently, deep generative models have shown great promise in peptide generation. However, several challenges remain in designing effective peptide binders. First, not all residues contribute equally to peptide-target interactions. Second, the g… ▽ More Peptides, short chains of amino acids, interact with target proteins, making them a unique class of protein-based therapeutics for treating human diseases. Recently, deep generative models have shown great promise in peptide generation. However, several challenges remain in designing effective peptide binders. First, not all residues contribute equally to peptide-target interactions. Second, the generated peptides must adopt valid geometries due to the constraints of peptide bonds. Third, realistic tasks for peptide drug development are still lacking. To address these challenges, we introduce PepHAR, a hot-spot-driven autoregressive generative model for designing peptides targeting specific proteins. Building on the observation that certain hot spot residues have higher interaction potentials, we first use an energy-based density model to fit and sample these key residues. Next, to ensure proper peptide geometry, we autoregressively extend peptide fragments by estimating dihedral angles between residue frames. Finally, we apply an optimization process to iteratively refine fragment assembly, ensuring correct peptide structures. By combining hot spot sampling with fragment-based extension, our approach enables de novo peptide design tailored to a target protein and allows the incorporation of key hot spot residues into peptide scaffolds. Extensive experiments, including peptide design and peptide scaffold generation, demonstrate the strong potential of PepHAR in computational peptide binder design. Source code will be available at https://github.com/Ced3-han/PepHAR. △ Less

Submitted 20 May, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

Comments: Published as a conference paper at ICLR 2025

arXiv:2411.14743 [pdf, other]

FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification

Authors: Zhengrui Guo, Conghao Xiong, Jiabo Ma, Qichen Sun, Lishuang Feng, Jinzhuo Wang, Hao Chen

Abstract: Few-shot learning presents a critical solution for cancer diagnosis in computational pathology (CPath), addressing fundamental limitations in data availability, particularly the scarcity of expert annotations and patient privacy constraints. A key challenge in this paradigm stems from the inherent disparity between the limited training set of whole slide images (WSIs) and the enormous number of co… ▽ More Few-shot learning presents a critical solution for cancer diagnosis in computational pathology (CPath), addressing fundamental limitations in data availability, particularly the scarcity of expert annotations and patient privacy constraints. A key challenge in this paradigm stems from the inherent disparity between the limited training set of whole slide images (WSIs) and the enormous number of contained patches, where a significant portion of these patches lacks diagnostically relevant information, potentially diluting the model's ability to learn and focus on critical diagnostic features. While recent works attempt to address this by incorporating additional knowledge, several crucial gaps hinder further progress: (1) despite the emergence of powerful pathology foundation models (FMs), their potential remains largely untapped, with most approaches limiting their use to basic feature extraction; (2) current language guidance mechanisms attempt to align text prompts with vast numbers of WSI patches all at once, struggling to leverage rich pathological semantic information. To this end, we introduce the knowledge-enhanced adaptive visual compression framework, dubbed FOCUS, which uniquely combines pathology FMs with language prior knowledge to enable a focused analysis of diagnostically relevant regions by prioritizing discriminative WSI patches. Our approach implements a progressive three-stage compression strategy: we first leverage FMs for global visual redundancy elimination, and integrate compressed features with language prompts for semantic relevance assessment, then perform neighbor-aware visual token filtering while preserving spatial coherence. Extensive experiments on pathological datasets spanning breast, lung, and ovarian cancers demonstrate its superior performance in few-shot pathology diagnosis. Codes are available at https://github.com/dddavid4real/FOCUS. △ Less

Submitted 20 March, 2025; v1 submitted 22 November, 2024; originally announced November 2024.

Comments: Accepted by CVPR'2025

arXiv:2410.20688 [pdf, other]

Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design

Authors: Xiangxin Zhou, Jiaqi Guan, Yijia Zhang, Xingang Peng, Liang Wang, Jianzhu Ma

Abstract: Dual-target therapeutic strategies have become a compelling approach and attracted significant attention due to various benefits, such as their potential in overcoming drug resistance in cancer therapy. Considering the tremendous success that deep generative models have achieved in structure-based drug design in recent years, we formulate dual-target drug design as a generative task and curate a n… ▽ More Dual-target therapeutic strategies have become a compelling approach and attracted significant attention due to various benefits, such as their potential in overcoming drug resistance in cancer therapy. Considering the tremendous success that deep generative models have achieved in structure-based drug design in recent years, we formulate dual-target drug design as a generative task and curate a novel dataset of potential target pairs based on synergistic drug combinations. We propose to design dual-target drugs with diffusion models that are trained on single-target protein-ligand complex pairs. Specifically, we align two pockets in 3D space with protein-ligand binding priors and build two complex graphs with shared ligand nodes for SE(3)-equivariant composed message passing, based on which we derive a composed drift in both 3D and categorical probability space in the generative process. Our algorithm can well transfer the knowledge gained in single-target pretraining to dual-target scenarios in a zero-shot manner. We also repurpose linker design methods as strong baselines for this task. Extensive experiments demonstrate the effectiveness of our method compared with various baselines. △ Less

Submitted 26 November, 2024; v1 submitted 27 October, 2024; originally announced October 2024.

Comments: Accepted to NeurIPS 2024

arXiv:2410.14956 [pdf]

Airborne Biomarker Localization Engine (ABLE) for Open Air Point-of-Care Detection

Authors: Jingcheng Ma, Megan Laune, Pengju Li, Jing Lu, Jiping Yue, Yueyue Yu, Jessica Cleary, Kaitlyn Oliphant, Zachary Kessler, Erika C. Claud, Bozhi Tian

Abstract: Unlike biomarkers in biofluids, airborne biomarkers are dilute and difficult to trace. Detecting diverse airborne biomarkers with sufficient sensitivity typically relies on bulky and expensive equipment like mass spectrometers that remain inaccessible to the general population. Here, we introduce Airborne Biomarker Localization Engine (ABLE), a simple, affordable, and portable platform that can de… ▽ More Unlike biomarkers in biofluids, airborne biomarkers are dilute and difficult to trace. Detecting diverse airborne biomarkers with sufficient sensitivity typically relies on bulky and expensive equipment like mass spectrometers that remain inaccessible to the general population. Here, we introduce Airborne Biomarker Localization Engine (ABLE), a simple, affordable, and portable platform that can detect both volatile, non-volatile, molecular, and particulate biomarkers in about 15 minutes. ABLE significantly improves gas detection limits by converting dilute gases into droplets by water condensation, producing concentrated aqueous samples that are easy to be tested. Fundamental studies of multiphase condensation revealed unexpected stability in condensate-trapped biomarkers, making ABLE a reliable, accessible, and high-performance system for open-air-based biosensing applications such as non-contact infant healthcare, pathogen detection in public space, and food safety. △ Less

Submitted 18 October, 2024; originally announced October 2024.

Comments: 17 pages, 5 figures. An additional 67-page supplementary materials document containing a detailed description of methods, 15 additional discussions, 30 figures, and 3 tables, will be made available after the manuscript is published after peer-review process

arXiv:2408.11884 [pdf, other]

ST-USleepNet: A Spatial-Temporal Coupling Prominence Network for Multi-Channel Sleep Staging

Authors: Jingying Ma, Qika Lin, Ziyu Jia, Mengling Feng

Abstract: Sleep staging is critical to assess sleep quality and diagnose disorders. Despite advancements in artificial intelligence enabling automated sleep staging, significant challenges remain: (1) Simultaneously extracting prominent temporal and spatial sleep features from multi-channel raw signals, including characteristic sleep waveforms and salient spatial brain networks. (2) Capturing the spatial-te… ▽ More Sleep staging is critical to assess sleep quality and diagnose disorders. Despite advancements in artificial intelligence enabling automated sleep staging, significant challenges remain: (1) Simultaneously extracting prominent temporal and spatial sleep features from multi-channel raw signals, including characteristic sleep waveforms and salient spatial brain networks. (2) Capturing the spatial-temporal coupling patterns essential for accurate sleep staging. To address these challenges, we propose a novel framework named ST-USleepNet, comprising a spatial-temporal graph construction module (ST) and a U-shaped sleep network (USleepNet). The ST module converts raw signals into a spatial-temporal graph based on signal similarity, temporal, and spatial relationships to model spatial-temporal coupling patterns. The USleepNet employs a U-shaped structure for both the temporal and spatial streams, mirroring its original use in image segmentation to isolate significant targets. Applied to raw sleep signals and graph data from the ST module, USleepNet effectively segments these inputs, simultaneously extracting prominent temporal and spatial sleep features. Testing on three datasets demonstrates that ST-USleepNet outperforms existing baselines, and model visualizations confirm its efficacy in extracting prominent sleep features and temporal-spatial coupling patterns across various sleep stages. The code is available at: https://github.com/Majy-Yuji/ST-USleepNet.git. △ Less

Submitted 22 January, 2025; v1 submitted 21 August, 2024; originally announced August 2024.

arXiv:2407.11742 [pdf, other]

Revolutionizing MRI Data Processing Using FSL: Preliminary Findings with the Fugaku Supercomputer

Authors: Tianxiang Lyu, Wataru Uchida, Zhe Sun, Christina Andica, Keita Tokuda, Rui Zou, Jie Mao, Keigo Shimoji, Koji Kamagata, Mitsuhisa Sato, Ryutaro Himeno, Shigeki Aoki

Abstract: The amount of Magnetic resonance imaging data has grown tremendously recently, creating an urgent need to accelerate data processing, which requires substantial computational resources and time. In this preliminary study, we applied FMRIB Software Library commands on T1-weighted and diffusion-weighted images of a single young adult using the Fugaku supercomputer. The tensor-based measurements and… ▽ More The amount of Magnetic resonance imaging data has grown tremendously recently, creating an urgent need to accelerate data processing, which requires substantial computational resources and time. In this preliminary study, we applied FMRIB Software Library commands on T1-weighted and diffusion-weighted images of a single young adult using the Fugaku supercomputer. The tensor-based measurements and subcortical structure segmentations performed on Fugaku supercomputer were highly consistent with those from conventional systems, demonstrating its reliability and significantly reduced processing time. △ Less

Submitted 16 July, 2024; originally announced July 2024.

arXiv:2407.01649 [pdf, other]

FAFE: Immune Complex Modeling with Geodesic Distance Loss on Noisy Group Frames

Authors: Ruidong Wu, Ruihan Guo, Rui Wang, Shitong Luo, Yue Xu, Jiahan Li, Jianzhu Ma, Qiang Liu, Yunan Luo, Jian Peng

Abstract: Despite the striking success of general protein folding models such as AlphaFold2(AF2, Jumper et al. (2021)), the accurate computational modeling of antibody-antigen complexes remains a challenging task. In this paper, we first analyze AF2's primary loss function, known as the Frame Aligned Point Error (FAPE), and raise a previously overlooked issue that FAPE tends to face gradient vanishing probl… ▽ More Despite the striking success of general protein folding models such as AlphaFold2(AF2, Jumper et al. (2021)), the accurate computational modeling of antibody-antigen complexes remains a challenging task. In this paper, we first analyze AF2's primary loss function, known as the Frame Aligned Point Error (FAPE), and raise a previously overlooked issue that FAPE tends to face gradient vanishing problem on high-rotational-error targets. To address this fundamental limitation, we propose a novel geodesic loss called Frame Aligned Frame Error (FAFE, denoted as F2E to distinguish from FAPE), which enables the model to better optimize both the rotational and translational errors between two frames. We then prove that F2E can be reformulated as a group-aware geodesic loss, which translates the optimization of the residue-to-residue error to optimizing group-to-group geodesic frame distance. By fine-tuning AF2 with our proposed new loss function, we attain a correct rate of 52.3\% (DockQ $>$ 0.23) on an evaluation set and 43.8\% correct rate on a subset with low homology, with substantial improvement over AF2 by 182\% and 100\% respectively. △ Less

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2406.04628 [pdf, other]

Projecting Molecules into Synthesizable Chemical Spaces

Authors: Shitong Luo, Wenhao Gao, Zuofan Wu, Jian Peng, Connor W. Coley, Jianzhu Ma

Abstract: Discovering new drug molecules is a pivotal yet challenging process due to the near-infinitely large chemical space and notorious demands on time and resources. Numerous generative models have recently been introduced to accelerate the drug discovery process, but their progression to experimental validation remains limited, largely due to a lack of consideration for synthetic accessibility in prac… ▽ More Discovering new drug molecules is a pivotal yet challenging process due to the near-infinitely large chemical space and notorious demands on time and resources. Numerous generative models have recently been introduced to accelerate the drug discovery process, but their progression to experimental validation remains limited, largely due to a lack of consideration for synthetic accessibility in practical settings. In this work, we introduce a novel framework that is capable of generating new chemical structures while ensuring synthetic accessibility. Specifically, we introduce a postfix notation of synthetic pathways to represent molecules in chemical space. Then, we design a transformer-based model to translate molecular graphs into postfix notations of synthesis. We highlight the model's ability to: (a) perform bottom-up synthesis planning more accurately, (b) generate structurally similar, synthesizable analogs for unsynthesizable molecules proposed by generative models with their properties preserved, and (c) explore the local synthesizable chemical space around hit molecules. △ Less

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2406.00735 [pdf, other]

Full-Atom Peptide Design based on Multi-modal Flow Matching

Authors: Jiahan Li, Chaoran Cheng, Zuofan Wu, Ruihan Guo, Shitong Luo, Zhizhou Ren, Jian Peng, Jianzhu Ma

Abstract: Peptides, short chains of amino acid residues, play a vital role in numerous biological processes by interacting with other target molecules, offering substantial potential in drug discovery. In this work, we present PepFlow, the first multi-modal deep generative model grounded in the flow-matching framework for the design of full-atom peptides that target specific protein receptors. Drawing inspi… ▽ More Peptides, short chains of amino acid residues, play a vital role in numerous biological processes by interacting with other target molecules, offering substantial potential in drug discovery. In this work, we present PepFlow, the first multi-modal deep generative model grounded in the flow-matching framework for the design of full-atom peptides that target specific protein receptors. Drawing inspiration from the crucial roles of residue backbone orientations and side-chain dynamics in protein-peptide interactions, we characterize the peptide structure using rigid backbone frames within the $\mathrm{SE}(3)$ manifold and side-chain angles on high-dimensional tori. Furthermore, we represent discrete residue types in the peptide sequence as categorical distributions on the probability simplex. By learning the joint distributions of each modality using derived flows and vector fields on corresponding manifolds, our method excels in the fine-grained design of full-atom peptides. Harnessing the multi-modal paradigm, our approach adeptly tackles various tasks such as fix-backbone sequence design and side-chain packing through partial sampling. Through meticulously crafted experiments, we demonstrate that PepFlow exhibits superior performance in comprehensive benchmarks, highlighting its significant potential in computational peptide design and analysis. △ Less

Submitted 2 June, 2024; originally announced June 2024.

Comments: ICML 2024

arXiv:2403.17615 [pdf, other]

Grad-CAMO: Learning Interpretable Single-Cell Morphological Profiles from 3D Cell Painting Images

Authors: Vivek Gopalakrishnan, Jingzhe Ma, Zhiyong Xie

Abstract: Despite their black-box nature, deep learning models are extensively used in image-based drug discovery to extract feature vectors from single cells in microscopy images. To better understand how these networks perform representation learning, we employ visual explainability techniques (e.g., Grad-CAM). Our analyses reveal several mechanisms by which supervised models cheat, exploiting biologicall… ▽ More Despite their black-box nature, deep learning models are extensively used in image-based drug discovery to extract feature vectors from single cells in microscopy images. To better understand how these networks perform representation learning, we employ visual explainability techniques (e.g., Grad-CAM). Our analyses reveal several mechanisms by which supervised models cheat, exploiting biologically irrelevant pixels when extracting morphological features from images, such as noise in the background. This raises doubts regarding the fidelity of learned single-cell representations and their relevance when investigating downstream biological questions. To address this misalignment between researcher expectations and machine behavior, we introduce Grad-CAMO, a novel single-cell interpretability score for supervised feature extractors. Grad-CAMO measures the proportion of a model's attention that is concentrated on the cell of interest versus the background. This metric can be assessed per-cell or averaged across a validation set, offering a tool to audit individual features vectors or guide the improved design of deep learning architectures. Importantly, Grad-CAMO seamlessly integrates into existing workflows, requiring no dataset or model modifications, and is compatible with both 2D and 3D Cell Painting data. Additional results are available at https://github.com/eigenvivek/Grad-CAMO. △ Less

Submitted 26 March, 2024; originally announced March 2024.

arXiv:2403.14046 [pdf]

Desiderata of evidence for representation in neuroscience

Authors: Stephan Pohl, Edgar Y. Walker, David L. Barack, Jennifer Lee, Rachel N. Denison, Ned Block, Florent Meyniel, Wei Ji Ma

Abstract: This paper develops a systematic framework for the evidence neuroscientists use to establish whether a neural response represents a feature. Researchers try to establish that the neural response is (1) sensitive and (2) specific to the feature, (3) invariant to other features, and (4) functional, which means that it is used downstream in the brain. We formalize these desiderata in information-theo… ▽ More This paper develops a systematic framework for the evidence neuroscientists use to establish whether a neural response represents a feature. Researchers try to establish that the neural response is (1) sensitive and (2) specific to the feature, (3) invariant to other features, and (4) functional, which means that it is used downstream in the brain. We formalize these desiderata in information-theoretic terms. This formalism allows us to precisely state the desiderata while unifying the different analysis methods used in neuroscience under one framework. We discuss how common methods such as correlational analyses, decoding and encoding models, representational similarity analysis, and tests of statistical dependence are used to evaluate the desiderata. In doing so, we provide a common terminology to researchers that helps to clarify disagreements, to compare and integrate results across studies and research groups, and to identify when evidence might be missing and when evidence for some representational conclusion is strong. We illustrate the framework with several canonical examples, including the representation of orientation, numerosity, faces, and spatial location. We end by discussing how the framework can be extended to cover models of the neural code, multi-stage models, and other domains. △ Less

Submitted 20 March, 2024; originally announced March 2024.

Comments: 50 pages, 11 figures

arXiv:2403.07902 [pdf, other]

DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design

Authors: Jiaqi Guan, Xiangxin Zhou, Yuwei Yang, Yu Bao, Jian Peng, Jianzhu Ma, Qiang Liu, Liang Wang, Quanquan Gu

Abstract: Designing 3D ligands within a target binding site is a fundamental task in drug discovery. Existing structured-based drug design methods treat all ligand atoms equally, which ignores different roles of atoms in the ligand for drug design and can be less efficient for exploring the large drug-like molecule space. In this paper, inspired by the convention in pharmaceutical practice, we decompose the… ▽ More Designing 3D ligands within a target binding site is a fundamental task in drug discovery. Existing structured-based drug design methods treat all ligand atoms equally, which ignores different roles of atoms in the ligand for drug design and can be less efficient for exploring the large drug-like molecule space. In this paper, inspired by the convention in pharmaceutical practice, we decompose the ligand molecule into two parts, namely arms and scaffold, and propose a new diffusion model, DecompDiff, with decomposed priors over arms and scaffold. In order to facilitate the decomposed generation and improve the properties of the generated molecules, we incorporate both bond diffusion in the model and additional validity guidance in the sampling phase. Extensive experiments on CrossDocked2020 show that our approach achieves state-of-the-art performance in generating high-affinity molecules while maintaining proper molecular properties and conformational stability, with up to -8.39 Avg. Vina Dock score and 24.5 Success Rate. The code is provided at https://github.com/bytedance/DecompDiff △ Less

Submitted 26 February, 2024; originally announced March 2024.

Comments: Accepted to ICML 2023

arXiv:2401.13022 [pdf]

Harmonizing the Generation and Pre-publication Stewardship of FAIR Image Data

Authors: Nikki Bialy, Frank Alber, Brenda Andrews, Michael Angelo, Brian Beliveau, Lacramioara Bintu, Alistair Boettiger, Ulrike Boehm, Claire M. Brown, Mahmoud Bukar Maina, James J. Chambers, Beth A. Cimini, Kevin Eliceiri, Rachel Errington, Orestis Faklaris, Nathalie Gaudreault, Ronald N. Germain, Wojtek Goscinski, David Grunwald, Michael Halter, Dorit Hanein, John W. Hickey, Judith Lacoste, Alex Laude, Emma Lundberg , et al. (22 additional authors not shown)

Abstract: Together with the molecular knowledge of genes and proteins, biological images promise to significantly enhance the scientific understanding of complex cellular systems and to advance predictive and personalized therapeutic products for human health. For this potential to be realized, quality-assured image data must be shared among labs at a global scale to be compared, pooled, and reanalyzed, thu… ▽ More Together with the molecular knowledge of genes and proteins, biological images promise to significantly enhance the scientific understanding of complex cellular systems and to advance predictive and personalized therapeutic products for human health. For this potential to be realized, quality-assured image data must be shared among labs at a global scale to be compared, pooled, and reanalyzed, thus unleashing untold potential beyond the original purpose for which the data was generated. There are two broad sets of requirements to enable image data sharing in the life sciences. One set of requirements is articulated in the companion White Paper entitled Enabling Global Image Data Sharing in the Life Sciences, which is published in parallel and addresses the need to build the cyberinfrastructure for sharing the digital array data. In this White Paper, we detail a broad set of requirements, which involves collecting, managing, presenting, and propagating contextual information essential to assess the quality, understand the content, interpret the scientific implications, and reuse image data in the context of the experimental details. We start by providing an overview of the main lessons learned to date through international community activities, which have recently made considerable progress toward generating community standard practices for imaging Quality Control (QC) and metadata. We then provide a clear set of recommendations for amplifying this work. The driving goal is to address remaining challenges and democratize access to everyday practices and tools for a spectrum of biomedical researchers, regardless of their expertise, access to resources, and geographical location. △ Less

Submitted 30 August, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

Comments: This manuscript is published with a closely related companion entitled, Enabling Global Image Data Sharing in the Life Sciences, which can be found at the following link, arXiv:2401.13023 [q-bio.OT]

arXiv:2401.08851 [pdf]

Using i-vectors for subject-independent cross-session EEG transfer learning

Authors: Jonathan Lasko, Jeff Ma, Mike Nicoletti, Jonathan Sussman-Fort, Sooyoung Jeong, William Hartmann

Abstract: Cognitive load classification is the task of automatically determining an individual's utilization of working memory resources during performance of a task based on physiologic measures such as electroencephalography (EEG). In this paper, we follow a cross-disciplinary approach, where tools and methodologies from speech processing are used to tackle this problem. The corpus we use was released pub… ▽ More Cognitive load classification is the task of automatically determining an individual's utilization of working memory resources during performance of a task based on physiologic measures such as electroencephalography (EEG). In this paper, we follow a cross-disciplinary approach, where tools and methodologies from speech processing are used to tackle this problem. The corpus we use was released publicly in 2021 as part of the first passive brain-computer interface competition on cross-session workload estimation. We present our approach which used i-vector-based neural network classifiers to accomplish inter-subject cross-session EEG transfer learning, achieving 18% relative improvement over equivalent subject-dependent models. We also report experiments showing how our subject-independent models perform competitively on held-out subjects and improve with additional subject data, suggesting that subject-dependent training is not required for effective cognitive load determination. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Comments: 11 pages

arXiv:2312.00485 [pdf, other]

Backbone-based Dynamic Graph Spatio-Temporal Network for Epidemic Forecasting

Authors: Junkai Mao, Yuexing Han, Gouhei Tanaka, Bing Wang

Abstract: Accurate epidemic forecasting is a critical task in controlling disease transmission. Many deep learning-based models focus only on static or dynamic graphs when constructing spatial information, ignoring their relationship. Additionally, these models often rely on recurrent structures, which can lead to error accumulation and computational time consumption. To address the aforementioned problems,… ▽ More Accurate epidemic forecasting is a critical task in controlling disease transmission. Many deep learning-based models focus only on static or dynamic graphs when constructing spatial information, ignoring their relationship. Additionally, these models often rely on recurrent structures, which can lead to error accumulation and computational time consumption. To address the aforementioned problems, we propose a novel model called Backbone-based Dynamic Graph Spatio-Temporal Network (BDGSTN). Intuitively, the continuous and smooth changes in graph structure, make adjacent graph structures share a basic pattern. To capture this property, we use adaptive methods to generate static backbone graphs containing the primary information and temporal models to generate dynamic temporal graphs of epidemic data, fusing them to generate a backbone-based dynamic graph. To overcome potential limitations associated with recurrent structures, we introduce a linear model DLinear to handle temporal dependencies and combine it with dynamic graph convolution for epidemic forecasting. Extensive experiments on two datasets demonstrate that BDGSTN outperforms baseline models and ablation comparison further verifies the effectiveness of model components. Furthermore, we analyze and measure the significance of backbone and temporal graphs by using information metrics from different aspects. Finally, we compare model parameter volume and training time to confirm the superior complexity and efficiency of BDGSTN. △ Less

Submitted 1 December, 2023; originally announced December 2023.

arXiv:2311.15156 [pdf, other]

xTrimoGene: An Efficient and Scalable Representation Learner for Single-Cell RNA-Seq Data

Authors: Jing Gong, Minsheng Hao, Xingyi Cheng, Xin Zeng, Chiming Liu, Jianzhu Ma, Xuegong Zhang, Taifeng Wang, Le Song

Abstract: Advances in high-throughput sequencing technology have led to significant progress in measuring gene expressions at the single-cell level. The amount of publicly available single-cell RNA-seq (scRNA-seq) data is already surpassing 50M records for humans with each record measuring 20,000 genes. This highlights the need for unsupervised representation learning to fully ingest these data, yet classic… ▽ More Advances in high-throughput sequencing technology have led to significant progress in measuring gene expressions at the single-cell level. The amount of publicly available single-cell RNA-seq (scRNA-seq) data is already surpassing 50M records for humans with each record measuring 20,000 genes. This highlights the need for unsupervised representation learning to fully ingest these data, yet classical transformer architectures are prohibitive to train on such data in terms of both computation and memory. To address this challenge, we propose a novel asymmetric encoder-decoder transformer for scRNA-seq data, called xTrimoGene$^α$ (or xTrimoGene for short), which leverages the sparse characteristic of the data to scale up the pre-training. This scalable design of xTrimoGene reduces FLOPs by one to two orders of magnitude compared to classical transformers while maintaining high accuracy, enabling us to train the largest transformer models over the largest scRNA-seq dataset today. Our experiments also show that the performance of xTrimoGene improves as we scale up the model sizes, and it also leads to SOTA performance over various downstream tasks, such as cell type annotation, perturb-seq effect prediction, and drug combination prediction. xTrimoGene model is now available for use as a service via the following link: https://api.biomap.com/xTrimoGene/apply. △ Less

Submitted 24 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

Comments: Accepted by NeurIPS 2023

arXiv:2309.08478 [pdf, other]

doi 10.1093/bioadv/vbae099

Current and future directions in network biology

Authors: Marinka Zitnik, Michelle M. Li, Aydin Wells, Kimberly Glass, Deisy Morselli Gysi, Arjun Krishnan, T. M. Murali, Predrag Radivojac, Sushmita Roy, Anaïs Baudot, Serdar Bozdag, Danny Z. Chen, Lenore Cowen, Kapil Devkota, Anthony Gitter, Sara Gosline, Pengfei Gu, Pietro H. Guzzi, Heng Huang, Meng Jiang, Ziynet Nesibe Kesimoglu, Mehmet Koyuturk, Jian Ma, Alexander R. Pico, Nataša Pržulj , et al. (12 additional authors not shown)

Abstract: Network biology is an interdisciplinary field bridging computational and biological sciences that has proved pivotal in advancing the understanding of cellular functions and diseases across biological systems and scales. Although the field has been around for two decades, it remains nascent. It has witnessed rapid evolution, accompanied by emerging challenges. These challenges stem from various fa… ▽ More Network biology is an interdisciplinary field bridging computational and biological sciences that has proved pivotal in advancing the understanding of cellular functions and diseases across biological systems and scales. Although the field has been around for two decades, it remains nascent. It has witnessed rapid evolution, accompanied by emerging challenges. These challenges stem from various factors, notably the growing complexity and volume of data together with the increased diversity of data types describing different tiers of biological organization. We discuss prevailing research directions in network biology and highlight areas of inference and comparison of biological networks, multimodal data integration and heterogeneous networks, higher-order network analysis, machine learning on networks, and network-based personalized medicine. Following the overview of recent breakthroughs across these five areas, we offer a perspective on the future directions of network biology. Additionally, we offer insights into scientific communities, educational initiatives, and the importance of fostering diversity within the field. This paper establishes a roadmap for an immediate and long-term vision for network biology. △ Less

Submitted 11 June, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

Comments: 52 pages, 6 figures, 1 table

arXiv:2308.05864 [pdf, other]

doi 10.1038/s41592-024-02233-6

The Multi-modality Cell Segmentation Challenge: Towards Universal Solutions

Authors: Jun Ma, Ronald Xie, Shamini Ayyadhury, Cheng Ge, Anubha Gupta, Ritu Gupta, Song Gu, Yao Zhang, Gihun Lee, Joonkee Kim, Wei Lou, Haofeng Li, Eric Upschulte, Timo Dickscheid, José Guilherme de Almeida, Yixin Wang, Lin Han, Xin Yang, Marco Labagnara, Vojislav Gligorovski, Maxime Scheder, Sahand Jamal Rahi, Carly Kempster, Alice Pollitt, Leon Espinosa , et al. (15 additional authors not shown)

Abstract: Cell segmentation is a critical step for quantitative single-cell analysis in microscopy images. Existing cell segmentation methods are often tailored to specific modalities or require manual interventions to specify hyper-parameters in different experimental settings. Here, we present a multi-modality cell segmentation benchmark, comprising over 1500 labeled images derived from more than 50 diver… ▽ More Cell segmentation is a critical step for quantitative single-cell analysis in microscopy images. Existing cell segmentation methods are often tailored to specific modalities or require manual interventions to specify hyper-parameters in different experimental settings. Here, we present a multi-modality cell segmentation benchmark, comprising over 1500 labeled images derived from more than 50 diverse biological experiments. The top participants developed a Transformer-based deep-learning algorithm that not only exceeds existing methods but can also be applied to diverse microscopy images across imaging platforms and tissue types without manual parameter adjustments. This benchmark and the improved algorithm offer promising avenues for more accurate and versatile cell analysis in microscopy imaging. △ Less

Submitted 1 April, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

Comments: NeurIPS22 Cell Segmentation Challenge: https://neurips22-cellseg.grand-challenge.org/ . Nature Methods (2024)

arXiv:2305.12471 [pdf, other]

Mapping Biological Neuron Dynamics into an Interpretable Two-layer Artificial Neural Network

Authors: Jingyang Ma, Songting Li, Douglas Zhou

Abstract: Dendrites are crucial structures for computation of an individual neuron. It has been shown that the dynamics of a biological neuron with dendrites can be approximated by artificial neural networks (ANN) with deep structure. However, it remains unclear whether a neuron can be further captured by a simple, biologically plausible ANN. In this work, we develop a two-layer ANN, named as dendritic bili… ▽ More Dendrites are crucial structures for computation of an individual neuron. It has been shown that the dynamics of a biological neuron with dendrites can be approximated by artificial neural networks (ANN) with deep structure. However, it remains unclear whether a neuron can be further captured by a simple, biologically plausible ANN. In this work, we develop a two-layer ANN, named as dendritic bilinear neural network (DBNN), to accurately predict both the sub-threshold voltage and spike time at the soma of biological neuron models with dendritic structure. Our DBNN is found to be interpretable and well captures the dendritic integration process of biological neurons including a bilinear rule revealed in previous works. In addition, we show DBNN is capable of performing diverse tasks including direction selectivity, coincidence detection, and image classification. Our work proposes a biologically interpretable ANN that characterizes the computation of biological neurons, which can be potentially implemented in the deep learning framework to improve computational ability. △ Less

Submitted 21 May, 2023; originally announced May 2023.

arXiv:2305.07508 [pdf, other]

MolDiff: Addressing the Atom-Bond Inconsistency Problem in 3D Molecule Diffusion Generation

Authors: Xingang Peng, Jiaqi Guan, Qiang Liu, Jianzhu Ma

Abstract: Deep generative models have recently achieved superior performance in 3D molecule generation. Most of them first generate atoms and then add chemical bonds based on the generated atoms in a post-processing manner. However, there might be no corresponding bond solution for the temporally generated atoms as their locations are generated without considering potential bonds. We define this problem as… ▽ More Deep generative models have recently achieved superior performance in 3D molecule generation. Most of them first generate atoms and then add chemical bonds based on the generated atoms in a post-processing manner. However, there might be no corresponding bond solution for the temporally generated atoms as their locations are generated without considering potential bonds. We define this problem as the atom-bond inconsistency problem and claim it is the main reason for current approaches to generating unrealistic 3D molecules. To overcome this problem, we propose a new diffusion model called MolDiff which can generate atoms and bonds simultaneously while still maintaining their consistency by explicitly modeling the dependence between their relationships. We evaluated the generation ability of our proposed model and the quality of the generated molecules using criteria related to both geometry and chemical properties. The empirical studies showed that our model outperforms previous approaches, achieving a three-fold improvement in success rate and generating molecules with significantly better quality. △ Less

Submitted 11 May, 2023; originally announced May 2023.

arXiv:2305.06769 [pdf]

Comparative Analysis of Machine Learning Algorithms for Predicting On-Target and Off-Target Effects of CRISPR-Cas13d for gene editing

Authors: Jingze Liu, Jiahao Ma

Abstract: CRISPR-Cas13 is a system that utilizes single stranded RNAs for RNA editing. Prediction of on-target and off-target effects for the CRISPR-Cas13d dependency enables us to design specific single guide RNAs (sgRNAs) that help locate the desired RNA target positions. In this study, we compared the performance of multiple machine learning algorithms in predicting these effects using a reported dataset… ▽ More CRISPR-Cas13 is a system that utilizes single stranded RNAs for RNA editing. Prediction of on-target and off-target effects for the CRISPR-Cas13d dependency enables us to design specific single guide RNAs (sgRNAs) that help locate the desired RNA target positions. In this study, we compared the performance of multiple machine learning algorithms in predicting these effects using a reported dataset. Our results show that Catboost is the most accurate model with high sensitivity. This finding represents a significant advancement in our understanding of how to chose modeling methods to deal with RNA sequence feaatures effictivelys. Furthermore, our approach can potentially be applied to other CRISPR systems and genetic engineering techniques. Overall, this work has important implications for developing safer and more effective gene therapies and biotechnological applications. △ Less

Submitted 11 May, 2023; originally announced May 2023.

Comments: code: https://www.kaggle.com/code/markblack370/cas13-pycaret/notebook

MSC Class: 68T05 ACM Class: I.2.6

arXiv:2304.13230 [pdf, other]

UNADON: Transformer-based model to predict genome-wide chromosome spatial position

Authors: Muyu Yang, Jian Ma

Abstract: The spatial positioning of chromosomes relative to functional nuclear bodies is intertwined with genome functions such as transcription. However, the sequence patterns and epigenomic features that collectively influence chromatin spatial positioning in a genome-wide manner are not well understood. Here, we develop a new transformer-based deep learning model called UNADON, which predicts the genome… ▽ More The spatial positioning of chromosomes relative to functional nuclear bodies is intertwined with genome functions such as transcription. However, the sequence patterns and epigenomic features that collectively influence chromatin spatial positioning in a genome-wide manner are not well understood. Here, we develop a new transformer-based deep learning model called UNADON, which predicts the genome-wide cytological distance to a specific type of nuclear body, as measured by TSA-seq, using both sequence features and epigenomic signals. Evaluations of UNADON in four cell lines (K562, H1, HFFc6, HCT116) show high accuracy in predicting chromatin spatial positioning to nuclear bodies when trained on a single cell line. UNADON also performed well in an unseen cell type. Importantly, we reveal potential sequence and epigenomic factors that affect large-scale chromatin compartmentalization to nuclear bodies. Together, UNADON provides new insights into the principles between sequence features and large-scale chromatin spatial localization, which has important implications for understanding nuclear structure and function. △ Less

Submitted 1 July, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

Comments: Published in ISMB 2023

arXiv:2303.03543 [pdf, other]

3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction

Authors: Jiaqi Guan, Wesley Wei Qian, Xingang Peng, Yufeng Su, Jian Peng, Jianzhu Ma

Abstract: Rich data and powerful machine learning models allow us to design drugs for a specific protein target \textit{in silico}. Recently, the inclusion of 3D structures during targeted drug design shows superior performance to other target-free models as the atomic interaction in the 3D space is explicitly modeled. However, current 3D target-aware models either rely on the voxelized atom densities or th… ▽ More Rich data and powerful machine learning models allow us to design drugs for a specific protein target \textit{in silico}. Recently, the inclusion of 3D structures during targeted drug design shows superior performance to other target-free models as the atomic interaction in the 3D space is explicitly modeled. However, current 3D target-aware models either rely on the voxelized atom densities or the autoregressive sampling process, which are not equivariant to rotation or easily violate geometric constraints resulting in unrealistic structures. In this work, we develop a 3D equivariant diffusion model to solve the above challenges. To achieve target-aware molecule design, our method learns a joint generative process of both continuous atom coordinates and categorical atom types with a SE(3)-equivariant network. Moreover, we show that our model can serve as an unsupervised feature extractor to estimate the binding affinity under proper parameterization, which provides an effective way for drug screening. To evaluate our model, we propose a comprehensive framework to evaluate the quality of sampled molecules from different dimensions. Empirical studies show our model could generate molecules with more realistic 3D structures and better affinities towards the protein targets, and improve binding affinity ranking and prediction without retraining. △ Less

Submitted 6 March, 2023; originally announced March 2023.

Comments: Accepted to ICLR 2023

arXiv:2302.00652 [pdf, other]

Breathing cluster in complex neuron-astrocyte networks

Authors: Ya Wang, Liang Wang, Huawei Fan, Jun Ma, Hui Cao, Xingang Wang

Abstract: Brain activities are featured by spatially distributed neural clusters of coherent firings and a spontaneous switching of the clusters between the synchrony and asynchrony states. Evidences from {\it in vivo} experiments suggest that astrocytes, a type of glial cell regarded previously as providing only structural and metabolic supports to neurons, participate actively in brain functions and play… ▽ More Brain activities are featured by spatially distributed neural clusters of coherent firings and a spontaneous switching of the clusters between the synchrony and asynchrony states. Evidences from {\it in vivo} experiments suggest that astrocytes, a type of glial cell regarded previously as providing only structural and metabolic supports to neurons, participate actively in brain functions and play a crucial role in regulating the neural firing activities, yet the mechanism remains unknown. Introducing astrocyte as a reservoir of the glutamate released from neuron synapses, here we propose the model of complex neuron-astrocyte network and employ it to explore the roles of astrocyte in regulating the synchronization behaviors of networked neurons. It is found that a fraction of neurons on the network can be synchronized as a cluster, while the remaining neurons are kept as desynchronized. Moreover, during the course of network evolution, the cluster is switching between the synchrony and asynchrony states intermittently, henceforth the phenomenon of ``breathing cluster". By the method of symmetry-based analysis, we conduct a theoretical investigation on the stability of the cluster and the mechanism generating the breathing activities. It is revealed that the contents of the cluster are determined by the network symmetry and the breathing activities are due to the interplay between the neural network and the astrocyte. The breathing phenomenon is demonstrated in network models of different structures and neural dynamics. The studies give insights into the cellular mechanism of astrocytes in regulating neural activities, and shed lights onto the spontaneous state switching of the neocortex. △ Less

Submitted 26 January, 2023; originally announced February 2023.

Comments: 14 pages, 6 figures

arXiv:2211.08084 [pdf]

Inferring cell-specific lncRNA regulation with single-cell RNA-sequencing data in the developing human neocortex

Authors: Meng Huang, Jiangtao Ma, Changzhou Long, Junpeng Zhang, Xiucai Ye, Tetsuya Sakurai

Abstract: Long non-coding RNAs (lncRNAs) are important regulators to modulate gene expression and cell proliferation in the developing human brain. Previous methods mainly use bulk lncRNA and mRNA expression data to study lncRNA regulation. However, to analyze lncRNA regulation regarding individual cells, we focus on single-cell RNA-sequencing (scRNA-seq) data instead of bulk data. Recent advance in scRNA-s… ▽ More Long non-coding RNAs (lncRNAs) are important regulators to modulate gene expression and cell proliferation in the developing human brain. Previous methods mainly use bulk lncRNA and mRNA expression data to study lncRNA regulation. However, to analyze lncRNA regulation regarding individual cells, we focus on single-cell RNA-sequencing (scRNA-seq) data instead of bulk data. Recent advance in scRNA-seq has provided a way to investigate lncRNA regulation at single-cell level. We will propose a novel computational method, CSlncR (cell-specific lncRNA regulation), which combines putative lncRNA-mRNA binding information with scRNA-seq data including lncRNAs and mRNAs to identify cell-specific lncRNA-mRNA regulation networks at individual cells. To understand lncRNA regulation at different development stages, we apply CSlncR to the scRNA-seq data of human neocortex. Network analysis shows that the lncRNA regulation is unique in each cell from the different human neocortex development stages. The comparison results indicate that CSlncR is also an effective tool for predicting cell-specific lncRNA targets and clustering single cells, which helps understand cell-cell communication. △ Less

Submitted 29 November, 2022; v1 submitted 15 November, 2022; originally announced November 2022.

arXiv:2208.14668 [pdf]

A Resonance Model for Spontaneous Cortical Activity

Authors: Yanjiang Wang, Jichao Ma, Jiebin Luo, Xue Chen, Yue Yuan

Abstract: How human brain function emerges from structure has intrigued researchers for decades and numerous models have been put forward, yet none of them yields a close structure-function relation. Here we present a resonance model based on neuronal spike timing dependent plasticity (STDP) principle to describe the spontaneous cortical activity by incorporating the dynamic interactions between neuronal po… ▽ More How human brain function emerges from structure has intrigued researchers for decades and numerous models have been put forward, yet none of them yields a close structure-function relation. Here we present a resonance model based on neuronal spike timing dependent plasticity (STDP) principle to describe the spontaneous cortical activity by incorporating the dynamic interactions between neuronal populations into a wave equation, which is able to accurately predict the resting brain functional connectivity (FC), including the resting-state networks. Besides, the proposed model provides strong theoretical and experimental evidences that the spontaneous dynamic coupling between brain regions fluctuates with a low frequency. Crucially, it is able to account for how the negative functional correlations emerge during resonance. We test the model with a large cohort of subjects (1038) from the Human Connectome Project (HCP) S1200 release in both time and frequency domain, which exhibits superior performance to existing eigen-decomposition models. △ Less

Submitted 6 October, 2022; v1 submitted 31 August, 2022; originally announced August 2022.

arXiv:2208.02433 [pdf, other]

Simulation and application of COVID-19 compartment model using physics-informed neural network

Authors: Jinhuan Ke, Jiahao Ma, Xiyu Yin, Robin Singh

Abstract: COVID-19 pandemic has had a disruptive and irreversible impact globally, yet traditional epidemiological modeling approaches such as the susceptible-infected-recovered (SIR) model have exhibited limited effectiveness in forecasting of the up-to-date pandemic situation. In this work, susceptible-vaccinated-exposed-infected-dead-recovered (SVEIDR) model and its variants -- aged and vaccination-struc… ▽ More COVID-19 pandemic has had a disruptive and irreversible impact globally, yet traditional epidemiological modeling approaches such as the susceptible-infected-recovered (SIR) model have exhibited limited effectiveness in forecasting of the up-to-date pandemic situation. In this work, susceptible-vaccinated-exposed-infected-dead-recovered (SVEIDR) model and its variants -- aged and vaccination-structured SVEIDR models -- are introduced to encode the effect of social contact for different age groups and vaccination status. Then, we implement the physics-informed neural network (PiNN) on both simulated and real-world data. The PiNN model enables robust analysis of the dynamic spread, prediction, and parameter optimization of the COVID-19 compartmental models. The models exhibit relative root mean square error (RRMSE) of $<4\%$ for all components and provide incubation, death, and recovery rates of $γ= 0.0130$, $λ=0.0001$, and $ρ=0.0037$, respectively, for the first 310 days of the epidemic in the US with RRMSE of $<0.35\%$ for all components. To further improve the model performance, temporally varying parameters can be included, such as vaccination, transmission, and incubation rates. Our implementation highlights PiNN as a reliable candidate approach for forecasting real-world data and can be applied to other compartmental model variants of interest. △ Less

Submitted 12 October, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

arXiv:2207.03523 [pdf, ps, other]

Winning the lottery with neural connectivity constraints: faster learning across cognitive tasks with spatially constrained sparse RNNs

Authors: Mikail Khona, Sarthak Chandra, Joy J. Ma, Ila Fiete

Abstract: Recurrent neural networks (RNNs) are often used to model circuits in the brain, and can solve a variety of difficult computational problems requiring memory, error-correction, or selection [Hopfield, 1982, Maass et al., 2002, Maass, 2011]. However, fully-connected RNNs contrast structurally with their biological counterparts, which are extremely sparse (~0.1%). Motivated by the neocortex, where ne… ▽ More Recurrent neural networks (RNNs) are often used to model circuits in the brain, and can solve a variety of difficult computational problems requiring memory, error-correction, or selection [Hopfield, 1982, Maass et al., 2002, Maass, 2011]. However, fully-connected RNNs contrast structurally with their biological counterparts, which are extremely sparse (~0.1%). Motivated by the neocortex, where neural connectivity is constrained by physical distance along cortical sheets and other synaptic wiring costs, we introduce locality masked RNNs (LM-RNNs) that utilize task-agnostic predetermined graphs with sparsity as low as 4%. We study LM-RNNs in a multitask learning setting relevant to cognitive systems neuroscience with a commonly used set of tasks, 20-Cog-tasks [Yang et al., 2019]. We show through reductio ad absurdum that 20-Cog-tasks can be solved by a small pool of separated autapses that we can mechanistically analyze and understand. Thus, these tasks fall short of the goal of inducing complex recurrent dynamics and modular structure in RNNs. We next contribute a new cognitive multi-task battery, Mod-Cog, consisting of upto 132 tasks that expands by 7-fold the number of tasks and task-complexity of 20-Cog-tasks. Importantly, while autapses can solve the simple 20-Cog-tasks, the expanded task-set requires richer neural architectures and continuous attractor dynamics. On these tasks, we show that LM-RNNs with an optimal sparsity result in faster training and better data-efficiency than fully connected networks. △ Less

Submitted 29 May, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

Comments: 12 pages, 5 main text figures

arXiv:2206.05059 [pdf]

Simulation, Modeling and Prediction of a Pharmacodynamic Animal Tissue Culture Compartment Model by Physical Informed Neural Network

Authors: Jiahao Ma

Abstract: Compartment models of cell culture are widely used in cytology, pharmacology, toxicology and other fields. Numerical simulation, data modeling and prediction of compartment models can be realized by traditional differential equation modeling methods. At the same time, with the development of software and hardware, Physical Informed Neural Network (PINN) is widely used to solve differential equatio… ▽ More Compartment models of cell culture are widely used in cytology, pharmacology, toxicology and other fields. Numerical simulation, data modeling and prediction of compartment models can be realized by traditional differential equation modeling methods. At the same time, with the development of software and hardware, Physical Informed Neural Network (PINN) is widely used to solve differential equation models. This work models, simulates and predicts the cell culture compartment model based on the machine learning framework PyTorch with an 16 hidden layers neural network, including 8 linear layers and 8 feedback active layers. The results showed a loss value of 0.0004853 for three-component four-parameter quantitative pharmacodynamic model predictions in this way, which is evaluated by Mean Square Error (MSE). In summary, Physical Informed Neural Network can serve as an effective tool to deal with cell culture compartment models and may perform better in dealing with big datasets. △ Less

Submitted 9 June, 2022; originally announced June 2022.

Comments: 7 pages, 5 figures

arXiv:2205.14195 [pdf, other]

Unsupervised learning of features and object boundaries from local prediction

Authors: Heiko H. Schütt, Wei Ji Ma

Abstract: A visual system has to learn both which features to extract from images and how to group locations into (proto-)objects. Those two aspects are usually dealt with separately, although predictability is discussed as a cue for both. To incorporate features and boundaries into the same model, we model a layer of feature maps with a pairwise Markov random field model in which each factor is paired with… ▽ More A visual system has to learn both which features to extract from images and how to group locations into (proto-)objects. Those two aspects are usually dealt with separately, although predictability is discussed as a cue for both. To incorporate features and boundaries into the same model, we model a layer of feature maps with a pairwise Markov random field model in which each factor is paired with an additional binary variable, which switches the factor on or off. Using one of two contrastive learning objectives, we can learn both the features and the parameters of the Markov random field factors from images without further supervision signals. The features learned by shallow neural networks based on this loss are local averages, opponent colors, and Gabor-like stripe patterns. Furthermore, we can infer connectivity between locations by inferring the switch variables. Contours inferred from this connectivity perform quite well on the Berkeley segmentation database (BSDS500) without any training on contours. Thus, computing predictions across space aids both segmentation and feature learning, and models trained to optimize these predictions show similarities to the human visual system. We speculate that retinotopic visual cortex might implement such predictions over space through lateral connections. △ Less

Submitted 27 May, 2022; originally announced May 2022.

Comments: Submitted to NeurIPS 2022

arXiv:2205.07309 [pdf, other]

3DLinker: An E(3) Equivariant Variational Autoencoder for Molecular Linker Design

Authors: Yinan Huang, Xingang Peng, Jianzhu Ma, Muhan Zhang

Abstract: Deep learning has achieved tremendous success in designing novel chemical compounds with desirable pharmaceutical properties. In this work, we focus on a new type of drug design problem -- generating a small "linker" to physically attach two independent molecules with their distinct functions. The main computational challenges include: 1) the generation of linkers is conditional on the two given m… ▽ More Deep learning has achieved tremendous success in designing novel chemical compounds with desirable pharmaceutical properties. In this work, we focus on a new type of drug design problem -- generating a small "linker" to physically attach two independent molecules with their distinct functions. The main computational challenges include: 1) the generation of linkers is conditional on the two given molecules, in contrast to generating full molecules from scratch in previous works; 2) linkers heavily depend on the anchor atoms of the two molecules to be connected, which are not known beforehand; 3) 3D structures and orientations of the molecules need to be considered to avoid atom clashes, for which equivariance to E(3) group are necessary. To address these problems, we propose a conditional generative model, named 3DLinker, which is able to predict anchor atoms and jointly generate linker graphs and their 3D structures based on an E(3) equivariant graph variational autoencoder. So far as we know, there are no previous models that could achieve this task. We compare our model with multiple conditional generative models modified from other molecular design tasks and find that our model has a significantly higher rate in recovering molecular graphs, and more importantly, accurately predicting the 3D coordinates of all the atoms. △ Less

Submitted 15 May, 2022; originally announced May 2022.

arXiv:2205.07249 [pdf, other]

Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets

Authors: Xingang Peng, Shitong Luo, Jiaqi Guan, Qi Xie, Jian Peng, Jianzhu Ma

Abstract: Deep generative models have achieved tremendous success in designing novel drug molecules in recent years. A new thread of works have shown the great potential in advancing the specificity and success rate of in silico drug design by considering the structure of protein pockets. This setting posts fundamental computational challenges in sampling new chemical compounds that could satisfy multiple g… ▽ More Deep generative models have achieved tremendous success in designing novel drug molecules in recent years. A new thread of works have shown the great potential in advancing the specificity and success rate of in silico drug design by considering the structure of protein pockets. This setting posts fundamental computational challenges in sampling new chemical compounds that could satisfy multiple geometrical constraints imposed by pockets. Previous sampling algorithms either sample in the graph space or only consider the 3D coordinates of atoms while ignoring other detailed chemical structures such as bond types and functional groups. To address the challenge, we develop Pocket2Mol, an E(3)-equivariant generative network composed of two modules: 1) a new graph neural network capturing both spatial and bonding relationships between atoms of the binding pockets and 2) a new efficient algorithm which samples new drug candidates conditioned on the pocket representations from a tractable distribution without relying on MCMC. Experimental results demonstrate that molecules sampled from Pocket2Mol achieve significantly better binding affinity and other drug properties such as druglikeness and synthetic accessibility. △ Less

Submitted 15 May, 2022; originally announced May 2022.

Comments: ICML 2022 accepted

arXiv:2205.03583 [pdf]

Scanning Electron Microscopy and Metabolite Measurement Revealed the Stress Mechanism of PS-COOH Microplastics on Rhodotorula mucilaginosa AN5

Authors: Jiahao Ma, Xiangfei Meng, Zixin Li, Lexian Li, Jiwen Xu, Guangfeng Kan

Abstract: Microplastics in the marine environment have been paid more and more attention by researchers, and the impact of these substances on marine microorganisms can not be ignored. Studies have shown that PS-COOH Microplastics are harmful to marine molluscs, algae and monads. This study explore the effect and mechanism of microplastics (80 nm PS-COOH) on Antarctic marine yeast, Rhodotorula mucilaginosa… ▽ More Microplastics in the marine environment have been paid more and more attention by researchers, and the impact of these substances on marine microorganisms can not be ignored. Studies have shown that PS-COOH Microplastics are harmful to marine molluscs, algae and monads. This study explore the effect and mechanism of microplastics (80 nm PS-COOH) on Antarctic marine yeast, Rhodotorula mucilaginosa AN5 by bacterial count, Scanning Electron Microscopy (SEM) and metabolite analysis. The results illustrates that a 50 mg/L concentration of PS-COOH could inhibit 36.15% growth of yeast cells and 10 mg/L inhibit 80.20%. Microplastics stress causes changes in the content of some oxidative stress substances, including reactive oxygen species (ROS) 42.86% , malondialdehyde (MDA) 54.06% content and the activities of antioxidant enzymes such as catalase (CAT) 36.00% , peroxidase (POD) 66.67% and superoxide dismutase (SOD) 25.40%. These results revealed the possible stress effect of microplastic pollution on marine yeast and may affect bottom layer of marine ecosystem. △ Less

Submitted 13 September, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

arXiv:2204.11026 [pdf]

Bioinformatic analysis for structure and function of Glutamine synthetase(GS)

Authors: Jiahao Ma, Guotong Xu, Le Ao, Siqi Chen, Jingze Liu

Abstract: Objective: To predict structure and function of Glutamine synthetase (GS) from Pseudoalteromonas sp. by bioinformatics technology, and to provide a theoretical basis for further study. Methods: Open reading frame (ORF) of GS sequence from Pseudoalteromonas sp. was obtained by ORF finder and was translated into amino acid residue. The structure domain was analyzed by Blast. By the method of analysi… ▽ More Objective: To predict structure and function of Glutamine synthetase (GS) from Pseudoalteromonas sp. by bioinformatics technology, and to provide a theoretical basis for further study. Methods: Open reading frame (ORF) of GS sequence from Pseudoalteromonas sp. was obtained by ORF finder and was translated into amino acid residue. The structure domain was analyzed by Blast. By the method of analysis tools: Protparam, ProtScale, SignalP-4.0, TMHMM, SOPMA, SWISS-MODEL, NCBI SMART-BLAST and MAGA 7.0, the structure and function of the protein were predicted and analyzed. Results: The results showed that the sequence was GS with 468 amino acid residues, theoretical molecular weight was 51986.64 Da. The protein has the closest evolutionary status with Shewanella oneidensis. Then it had no signal peptide site and transmembrane domain. Secondary structure of GS contained 35.04% alpha-helix, 16.67% Extended chain, 5.34% beta-turn, 42.95% RandomCoil. Conclusions: This GU was a variety of biological functions of protein that may be used as a molecular samples of microbial nitrogen metabolism in extreme environments. △ Less

Submitted 23 April, 2022; originally announced April 2022.

Comments: 8 pages, 8 figures

arXiv:2203.10446 [pdf, other]

A 3D Generative Model for Structure-Based Drug Design

Authors: Shitong Luo, Jiaqi Guan, Jianzhu Ma, Jian Peng

Abstract: We study a fundamental problem in structure-based drug design -- generating molecules that bind to specific protein binding sites. While we have witnessed the great success of deep generative models in drug design, the existing methods are mostly string-based or graph-based. They are limited by the lack of spatial information and thus unable to be applied to structure-based design tasks. Particula… ▽ More We study a fundamental problem in structure-based drug design -- generating molecules that bind to specific protein binding sites. While we have witnessed the great success of deep generative models in drug design, the existing methods are mostly string-based or graph-based. They are limited by the lack of spatial information and thus unable to be applied to structure-based design tasks. Particularly, such models have no or little knowledge of how molecules interact with their target proteins exactly in 3D space. In this paper, we propose a 3D generative model that generates molecules given a designated 3D protein binding site. Specifically, given a binding site as the 3D context, our model estimates the probability density of atom's occurrences in 3D space -- positions that are more likely to have atoms will be assigned higher probability. To generate 3D molecules, we propose an auto-regressive sampling scheme -- atoms are sampled sequentially from the learned distribution until there is no room for new atoms. Combined with this sampling scheme, our model can generate valid and diverse molecules, which could be applicable to various structure-based molecular design tasks such as molecule sampling and linker design. Experimental results demonstrate that molecules sampled from our model exhibit high binding affinity to specific targets and good drug properties such as drug-likeness even if the model is not explicitly optimized for them. △ Less

Submitted 12 November, 2022; v1 submitted 19 March, 2022; originally announced March 2022.

Comments: Accepted to NeurIPS 2021

arXiv:2202.07921 [pdf, other]

Comparison on gait characteristics between controlled and free-living conditions in old adults

Authors: Jian Ma

Abstract: Gait is an important biomarker of functional conditions and gait characteristics can help us assessing health conditions and managing progression of diseases. Most of the existing research study the gait in controlled condition, such as clinical tests. In this paper, we study the gait characteristics in free-living conditions in old adults and compare them with that in controlled conditions, i.e.,… ▽ More Gait is an important biomarker of functional conditions and gait characteristics can help us assessing health conditions and managing progression of diseases. Most of the existing research study the gait in controlled condition, such as clinical tests. In this paper, we study the gait characteristics in free-living conditions in old adults and compare them with that in controlled conditions, i.e., Timed Up and Go (TUG) test. 65 subjects (12 patients with mobility impairment and 53 healthy controls) are recruited from elderly nursing institutions. The video data are collected from them in TUG test and free-living conditions and the 9 gait characteristics, including gait speed, are extracted from the data. Two-sample tests and independence test based on copula entropy are conducted on the extracted data to compare the characteristics in two conditions. Comparison results show that gait characteristics, such as gait speed, pace, speed variability, etc., in daily life are different from that of in TUG test. In daily life, people tend to have slow gait speed, smaller pace and speed variability, more frequent stride, and smaller acceleration range than in TUG test. We also found that gait speed, pace, and speed variability have stronger dependence with TUG score in the 3 conditions (TUG, daily life, and both) and that other 5 characteristics have stronger dependence with TUG score in both condition than in each condition. The comparison in this study suggests that TUG and daily life conditions are complementary with each other, and that TUG test can be considered as intervention on the movement state of human. △ Less

Submitted 16 February, 2022; originally announced February 2022.

Comments: 16 pages, 7 figures, 4 tables

arXiv:2202.04324 [pdf]

doi 10.1038/s41593-023-01444-y

Studying the neural representations of uncertainty

Authors: Edgar Y Walker, Stephan Pohl, Rachel N Denison, David L Barack, Jennifer Lee, Ned Block, Wei Ji Ma, Florent Meyniel

Abstract: The study of the brain's representations of uncertainty is a central topic in neuroscience. Unlike most quantities of which the neural representation is studied, uncertainty is a property of an observer's beliefs about the world, which poses specific methodological challenges. We analyze how the literature on the neural representations of uncertainty addresses those challenges and distinguish betw… ▽ More The study of the brain's representations of uncertainty is a central topic in neuroscience. Unlike most quantities of which the neural representation is studied, uncertainty is a property of an observer's beliefs about the world, which poses specific methodological challenges. We analyze how the literature on the neural representations of uncertainty addresses those challenges and distinguish between "code-driven" and "correlational" approaches. Code-driven approaches make assumptions about the neural code for representing world states and the associated uncertainty. By contrast, correlational approaches search for relationships between uncertainty and neural activity without constraints on the neural representation of the world state that this uncertainty accompanies. To compare these two approaches, we apply several criteria for neural representations: sensitivity, specificity, invariance, functionality. Our analysis reveals that the two approaches lead to different, but complementary findings, shaping new research questions and guiding future experiments. △ Less

Submitted 11 October, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

Comments: 23 pages, 3 figures. Nature Neuroscience (2023)

arXiv:2201.13299 [pdf, other]

Orientation-Aware Graph Neural Networks for Protein Structure Representation Learning

Authors: Jiahan Li, Shitong Luo, Congyue Deng, Chaoran Cheng, Jiaqi Guan, Leonidas Guibas, Jian Peng, Jianzhu Ma

Abstract: By folding into particular 3D structures, proteins play a key role in living beings. To learn meaningful representation from a protein structure for downstream tasks, not only the global backbone topology but the local fine-grained orientational relations between amino acids should also be considered. In this work, we propose the Orientation-Aware Graph Neural Networks (OAGNNs) to better sense the… ▽ More By folding into particular 3D structures, proteins play a key role in living beings. To learn meaningful representation from a protein structure for downstream tasks, not only the global backbone topology but the local fine-grained orientational relations between amino acids should also be considered. In this work, we propose the Orientation-Aware Graph Neural Networks (OAGNNs) to better sense the geometric characteristics in protein structure (e.g. inner-residue torsion angles, inter-residue orientations). Extending a single weight from a scalar to a 3D vector, we construct a rich set of geometric-meaningful operations to process both the classical and SO(3) representations of a given structure. To plug our designed perceptron unit into existing Graph Neural Networks, we further introduce an equivariant message passing paradigm, showing superior versatility in maintaining SO(3)-equivariance at the global scale. Experiments have shown that our OAGNNs have a remarkable ability to sense geometric orientational features compared to classical networks. OAGNNs have also achieved state-of-the-art performance on various computational biology applications related to protein 3D structures. The code is available at https://github.com/Ced3-han/OAGNN/tree/main. △ Less

Submitted 4 February, 2025; v1 submitted 28 January, 2022; originally announced January 2022.

Comments: Accepetd in RECOMB 2025

arXiv:2201.04697 [pdf, ps, other]

doi 10.1093/comnet/cnac021

Peak fraction of infected in epidemic spreading for multi-community networks

Authors: Jing Ma, Xiangyi Meng, Lidia A. Braunstein

Abstract: One of the most effective strategies to mitigate the global spreading of a pandemic (e.g., COVID-19) is to shut down international airports. From a network theory perspective, this is since international airports and flights, essentially playing the roles of bridge nodes and bridge links between countries as individual communities, dominate the epidemic spreading characteristics in the whole multi… ▽ More One of the most effective strategies to mitigate the global spreading of a pandemic (e.g., COVID-19) is to shut down international airports. From a network theory perspective, this is since international airports and flights, essentially playing the roles of bridge nodes and bridge links between countries as individual communities, dominate the epidemic spreading characteristics in the whole multi-community system. Among all epidemic characteristics, the peak fraction of infected, $I_{\max}$, is a decisive factor in evaluating an epidemic strategy given limited capacity of medical resources, but is seldom considered in multi-community models. In this paper, we study a general two-community system interconnected by a fraction $r$ of bridge nodes and its dynamic properties, especially $I_{\max}$, under the evolution of the Susceptible-Infected-Recovered (SIR) model. Comparing the characteristic time scales of different parts of the system allows us to analytically derive the asymptotic behavior of $I_{\max}$ with $r$, as $r\rightarrow 0$, which follows different power-law relations in each regime of the phase diagram. We also detect crossovers when $I_{\max}$ changes from one power law to another, crossing different power-law regimes as driven by $r$. Our results enable a better prediction of the effectiveness of strategies acting on bridge nodes, denoted by the power-law exponent $ε_I$ as in $I_{\max}\propto r^{1/ε_I}$. △ Less

Submitted 20 June, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

Comments: 19 pages, 6 figures, 3 tables

Journal ref: Journal of Complex Networks 10 (3), cnac021 (2022)

arXiv:2112.03266 [pdf, other]

Contrastive Cycle Adversarial Autoencoders for Single-cell Multi-omics Alignment and Integration

Authors: Xuesong Wang, Zhihang Hu, Tingyang Yu, Ruijie Wang, Yumeng Wei, Juan Shu, Jianzhu Ma, Yu Li

Abstract: Muilti-modality data are ubiquitous in biology, especially that we have entered the multi-omics era, when we can measure the same biological object (cell) from different aspects (omics) to provide a more comprehensive insight into the cellular system. When dealing with such multi-omics data, the first step is to determine the correspondence among different modalities. In other words, we should mat… ▽ More Muilti-modality data are ubiquitous in biology, especially that we have entered the multi-omics era, when we can measure the same biological object (cell) from different aspects (omics) to provide a more comprehensive insight into the cellular system. When dealing with such multi-omics data, the first step is to determine the correspondence among different modalities. In other words, we should match data from different spaces corresponding to the same object. This problem is particularly challenging in the single-cell multi-omics scenario because such data are very sparse with extremely high dimensions. Secondly, matched single-cell multi-omics data are rare and hard to collect. Furthermore, due to the limitations of the experimental environment, the data are usually highly noisy. To promote the single-cell multi-omics research, we overcome the above challenges, proposing a novel framework to align and integrate single-cell RNA-seq data and single-cell ATAC-seq data. Our approach can efficiently map the above data with high sparsity and noise from different spaces to a low-dimensional manifold in a unified space, making the downstream alignment and integration straightforward. Compared with the other state-of-the-art methods, our method performs better in both simulated and real single-cell data. The proposed method is helpful for the single-cell multi-omics research. The improvement for integration on the simulated data is significant. △ Less

Submitted 13 December, 2021; v1 submitted 5 December, 2021; originally announced December 2021.

arXiv:2110.08471 [pdf, other]

Fast Projection onto the Capped Simplex with Applications to Sparse Regression in Bioinformatics

Authors: Andersen Ang, Jianzhu Ma, Nianjun Liu, Kun Huang, Yijie Wang

Abstract: We consider the problem of projecting a vector onto the so-called k-capped simplex, which is a hyper-cube cut by a hyperplane. For an n-dimensional input vector with bounded elements, we found that a simple algorithm based on Newton's method is able to solve the projection problem to high precision with a complexity roughly about O(n), which has a much lower computational cost compared with the ex… ▽ More We consider the problem of projecting a vector onto the so-called k-capped simplex, which is a hyper-cube cut by a hyperplane. For an n-dimensional input vector with bounded elements, we found that a simple algorithm based on Newton's method is able to solve the projection problem to high precision with a complexity roughly about O(n), which has a much lower computational cost compared with the existing sorting-based methods proposed in the literature. We provide a theory for partial explanation and justification of the method. We demonstrate that the proposed algorithm can produce a solution of the projection problem with high precision on large scale datasets, and the algorithm is able to significantly outperform the state-of-the-art methods in terms of runtime (about 6-8 times faster than a commercial software with respect to CPU time for input vector with 1 million variables or more). We further illustrate the effectiveness of the proposed algorithm on solving sparse regression in a bioinformatics problem. Empirical results on the GWAS dataset (with 1,500,000 single-nucleotide polymorphisms) show that, when using the proposed method to accelerate the Projected Quasi-Newton (PQN) method, the accelerated PQN algorithm is able to handle huge-scale regression problem and it is more efficient (about 3-6 times faster) than the current state-of-the-art methods. △ Less

Submitted 25 October, 2021; v1 submitted 16 October, 2021; originally announced October 2021.

Comments: 12 pages, 5 figures

Journal ref: Advances in Neural Information Processing Systems 2021

arXiv:2103.01606 [pdf]

Time-dependent Clearance of Cyclosporine in Adult Renal Transplant Recipients: A Population Pharmacokinetic Perspective

Authors: Junjun Mao, Xiaoyan Qiu, Weiwei Qin, Luyang Xu, Ming Zhang, Mingkang Zhong

Abstract: Aim The pharmacokinetic (PK) properties of cyclosporine (CsA) in renal transplant recipients are patient- and time-dependent. Knowledge of this time-related variability is necessary to maintain or achieve CsA target exposure. Here, we aimed to identify factors explaining variabilities in CsA PK properties and characterise time-dependent clearance (CL/F) by performing a comprehensive analysis of Cs… ▽ More Aim The pharmacokinetic (PK) properties of cyclosporine (CsA) in renal transplant recipients are patient- and time-dependent. Knowledge of this time-related variability is necessary to maintain or achieve CsA target exposure. Here, we aimed to identify factors explaining variabilities in CsA PK properties and characterise time-dependent clearance (CL/F) by performing a comprehensive analysis of CsA PK factors using population PK (popPK) modelling of long-term follow-up data from our institution. Methods In total, 3,674 whole-blood CsA concentrations from 183 patients who underwent initial renal transplantation were analysed using nonlinear mixed-effects modelling. The effects of potential covariates were selected according to a previous report and well-accepted theoretical mechanisms. Model-informed individualised therapeutic regimens were also conducted. Results A two-compartment model adequately described the data and the estimated mean CsA CL/F was 32.6 L h-1 (5%). Allometrically scaled body size, haematocrit (HCT) level, CGC haplotype carrier status, and postoperative time may contribute to CsA PK variability. The CsA bioavailability in patients receiving a prednisolone dose (PD) of 80 mg was 20.6% lower than that in patients receiving 20 mg. A significant decrease (52.6%) in CL/F was observed as the HCT increased from 10.5% to 60.5%. The CL/F of the non-CGC haplotype carrier was 14.4% lower than that of the CGC haplotype carrier at 3 months post operation. CsA dose adjustments should be considered in different postoperative periods. Conclusions By monitoring body size, HCT, PD, and CGC haplotype, changes in CsA CL/F over time could be predicted. Such information could be used to optimise CsA therapy. △ Less

Submitted 2 March, 2021; originally announced March 2021.

arXiv:2012.05038 [pdf]

Cost-efficiency trade-offs of the human brain network revealed by a multiobjective evolutionary algorithm

Authors: Junji Ma, Jinbo Zhang, Ying Lin, Zhengjia Dai

Abstract: It is widely believed that the formation of brain network structure is under the pressure of optimal trade-off between reducing wiring cost and promoting communication efficiency. However, the question of whether this trade-off exists in empirical human brain networks and, if so, how it takes effect is still not well understood. Here, we employed a multiobjective evolutionary algorithm to directly… ▽ More It is widely believed that the formation of brain network structure is under the pressure of optimal trade-off between reducing wiring cost and promoting communication efficiency. However, the question of whether this trade-off exists in empirical human brain networks and, if so, how it takes effect is still not well understood. Here, we employed a multiobjective evolutionary algorithm to directly and quantitatively explore the cost-efficiency trade-off in human brain networks. Using this algorithm, we generated a population of synthetic networks with optimal but diverse cost-efficiency trade-offs. It was found that these synthetic networks could not only reproduce a large portion of connections in the empirical brain networks but also embed a resembling small-world structure. Moreover, the synthetic and empirical brain networks were found similar in terms of the spatial arrangement of hub regions and the modular structure, which are two important topological features widely assumed to be outcomes of cost-efficiency trade-offs. The synthetic networks had high robustness against random attack as the empirical brain networks did. Additionally, we also revealed some differences of the synthetic networks from the empirical brain networks, including lower segregated processing capacity and weaker robustness against targeted attack. These findings provide direct and quantitative evidence that the structure of human brain networks is indeed largely influenced by optimal cost-efficiency trade-offs. We also suggest that some additional factors (e.g., segregated processing capacity) might jointly determine the network organization with cost and efficiency. △ Less

Submitted 9 December, 2020; originally announced December 2020.

arXiv:2011.11396 [pdf]

THCluster: herb supplements categorization for precision traditional Chinese medicine

Authors: Chunyang Ruan, Ye Wang, Yanchun Zhang, Jiangang Ma, Huijuan Chen, Uwe Aickelin, Shanfeng Zhu, Ting Zhang

Abstract: There has been a continuing demand for traditional and complementary medicine worldwide. A fundamental and important topic in Traditional Chinese Medicine (TCM) is to optimize the prescription and to detect herb regularities from TCM data. In this paper, we propose a novel clustering model to solve this general problem of herb categorization, a pivotal task of prescription optimization and herb re… ▽ More There has been a continuing demand for traditional and complementary medicine worldwide. A fundamental and important topic in Traditional Chinese Medicine (TCM) is to optimize the prescription and to detect herb regularities from TCM data. In this paper, we propose a novel clustering model to solve this general problem of herb categorization, a pivotal task of prescription optimization and herb regularities. The model utilizes Random Walks method, Bayesian rules and Expectation Maximization(EM) models to complete a clustering analysis effectively on a heterogeneous information network. We performed extensive experiments on the real-world datasets and compared our method with other algorithms and experts. Experimental results have demonstrated the effectiveness of the proposed model for discovering useful categorization of herbs and its potential clinical manifestations. △ Less

Submitted 19 November, 2020; originally announced November 2020.

Comments: 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Pages 417-424

arXiv:2006.16648 [pdf, other]

Associations between finger tapping, gait and fall risk with application to fall risk assessment

Authors: Jian Ma

Abstract: As the world ages, elderly care becomes a big concern of the society. To address the elderly's issues on dementia and fall risk, we have investigated smart cognitive and fall risk assessment with machine learning methodology based on the data collected from finger tapping test and Timed Up and Go (TUG) test. Meanwhile, we have discovered the associations between cognition and finger motion from fi… ▽ More As the world ages, elderly care becomes a big concern of the society. To address the elderly's issues on dementia and fall risk, we have investigated smart cognitive and fall risk assessment with machine learning methodology based on the data collected from finger tapping test and Timed Up and Go (TUG) test. Meanwhile, we have discovered the associations between cognition and finger motion from finger tapping data and the association between fall risk and gait characteristics from TUG data. In this paper, we jointly analyze the finger tapping and gait characteristics data with copula entropy. We find that the associations between certain finger tapping characteristics ('number of taps', 'average interval of tapping', 'frequency of tapping' of both hands of bimanual inphase and those of left hand of bimanual untiphase) and TUG score are relatively high. According to this finding, we propose to utilize this associations to improve the predictive models of automatic fall risk assessment we developed previously. Experimental results show that using the characteristics of both finger tapping and gait as inputs of the predictive models of predicting TUG score can considerably improve the prediction performance in terms of MAE compared with using only one type of characteristics. △ Less

Submitted 22 February, 2021; v1 submitted 30 June, 2020; originally announced June 2020.

Showing 1–50 of 82 results for author: Ma, J