Showing 1–13 of 13 results for author: Zhou, F

Searching in archive q-bio.
  1. arXiv:2509.12600  [pdf, ps, other]

    cs.LG cs.AI q-bio.QM

    A Multimodal Foundation Model to Enhance Generalizability and Data Efficiency for Pan-cancer Prognosis Prediction

    Authors: Huajun Zhou, Fengtao Zhou, Jiabo Ma, Yingxue Xu, Xi Wang, Xiuming Zhang, Li Liang, Zhenhui Li, Hao Chen

    Abstract: Multimodal data provides heterogeneous information for a holistic understanding of the tumor microenvironment. However, existing AI models often struggle to harness the rich information within multimodal data and extract poorly generalizable representations. Here we present MICE (Multimodal data Integration via Collaborative Experts), a multimodal foundation model that effectively integrates patho…

    Submitted 15 September, 2025; originally announced September 2025.

    Comments: 27 pages, 7 figures

  2. arXiv:2503.03989  [pdf, other]

    q-bio.BM cs.LG

    Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows

    Authors: Xiangxin Zhou, Yi Xiao, Haowei Lin, Xinheng He, Jiaqi Guan, Yang Wang, Qiang Liu, Feng Zhou, Liang Wang, Jianzhu Ma

    Abstract: The dynamic nature of proteins, influenced by ligand interactions, is essential for comprehending protein function and progressing drug discovery. Traditional structure-based drug design (SBDD) approaches typically target binding sites with rigid structures, limiting their practical application in drug development. While molecular dynamics simulation can theoretically capture all the biologically…

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: Accepted to ICLR 2025

  3. arXiv:2410.13872  [pdf, other]

    cs.NE cs.LG q-bio.NC

    BLEND: Behavior-guided Neural Population Dynamics Modeling via Privileged Knowledge Distillation

    Authors: Zhengrui Guo, Fangxu Zhou, Wei Wu, Qichen Sun, Lishuang Feng, Jinzhuo Wang, Hao Chen

    Abstract: Modeling the nonlinear dynamics of neuronal populations represents a key pursuit in computational neuroscience. Recent research has increasingly focused on jointly modeling neural activity and behavior to unravel their interconnections. Despite significant efforts, these approaches often necessitate either intricate model designs or oversimplified assumptions. Given the frequent absence of perfect…

    Submitted 6 February, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: Accepted by ICLR'2025

  4. arXiv:2409.18597  [pdf]

    cs.LG cs.AI q-bio.GN

    TemporalPaD: a reinforcement-learning framework for temporal feature representation and dimension reduction

    Authors: Xuechen Mu, Zhenyu Huang, Kewei Li, Haotian Zhang, Xiuli Wang, Yusi Fan, Kai Zhang, Fengfeng Zhou

    Abstract: Recent advancements in feature representation and dimension reduction have highlighted their crucial role in enhancing the efficacy of predictive modeling. This work introduces TemporalPaD, a novel end-to-end deep learning framework designed for temporal pattern datasets. TemporalPaD integrates reinforcement learning (RL) with neural networks to achieve concurrent feature representation and featur…

    Submitted 27 September, 2024; originally announced September 2024.

  5. arXiv:2407.21298  [pdf, other]

    cs.LG cs.AI q-bio.BM

    A Vectorization Method Induced By Maximal Margin Classification For Persistent Diagrams

    Authors: An Wu, Yu Pan, Fuqi Zhou, Jinghui Yan, Chuanlu Liu

    Abstract: Persistent homology is an effective method for extracting topological information, represented as persistent diagrams, of spatial structure data. Hence it is well-suited for the study of protein structures. Attempts to incorporate persistent homology in machine learning methods of protein function prediction have resulted in several techniques for vectorizing persistent diagrams. However, current…

    Submitted 30 July, 2024; originally announced July 2024.

  6. arXiv:2406.19611  [pdf, other]

    q-bio.QM cs.AI

    Multimodal Data Integration for Precision Oncology: Challenges and Future Directions

    Authors: Huajun Zhou, Fengtao Zhou, Chenyu Zhao, Yingxue Xu, Luyang Luo, Hao Chen

    Abstract: The essence of precision oncology lies in its commitment to tailor targeted treatments and care measures to each patient based on the individual characteristics of the tumor. The inherent heterogeneity of tumors necessitates gathering information from diverse data sources to provide valuable insights from various perspectives, fostering a holistic comprehension of the tumor. Over the past decade,…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 15 pages, 4 figures

  7. arXiv:2404.09738  [pdf]

    q-bio.BM cs.AI q-bio.QM

    AMPCliff: quantitative definition and benchmarking of activity cliffs in antimicrobial peptides

    Authors: Kewei Li, Yuqian Wu, Yinheng Li, Yutong Guo, Yan Wang, Yiyang Liang, Yusi Fan, Lan Huang, Ruochi Zhang, Fengfeng Zhou

    Abstract: Since the mechanism of action of drug molecules in the human body is difficult to reproduce in the in vitro environment, it becomes difficult to reveal the causes of the activity cliff phenomenon of drug molecules. We found out the AC of small molecules has been extensively investigated but limited knowledge is accumulated about the AC phenomenon in peptides with canonical amino acids. Understandi…

    Submitted 3 November, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  8. arXiv:2401.11360  [pdf]

    cs.LG cs.AI cs.CE q-bio.BM

    PepHarmony: A Multi-View Contrastive Learning Framework for Integrated Sequence and Structure-Based Peptide Encoding

    Authors: Ruochi Zhang, Haoran Wu, Chang Liu, Huaping Li, Yuqian Wu, Kewei Li, Yifan Wang, Yifan Deng, Jiahui Chen, Fengfeng Zhou, Xin Gao

    Abstract: Recent advances in protein language models have catalyzed significant progress in peptide sequence representation. Despite extensive exploration in this field, pre-trained models tailored for peptide-specific needs remain largely unaddressed due to the difficulty in capturing the complex and sometimes unstable structures of peptides. This study introduces a novel multi-view contrastive learning fr…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 25 pages, 5 figures, 3 tables

  9. arXiv:2311.17964  [pdf]

    q-bio.GN cs.LG

    Linear normalised hash function for clustering gene sequences and identifying reference sequences from multiple sequence alignments

    Authors: Manal Helal, Fanrong Kong, Sharon C-A Chen, Fei Zhou, Dominic E Dwyer, John Potter, Vitali Sintchenko

    Abstract: The aim of this study was to develop a method that would identify the cluster centroids and the optimal number of clusters for a given sensitivity level and could work equally well for the different sequence datasets. A novel method that combines the linear mapping hash function and multiple sequence alignment (MSA) was developed. This method takes advantage of the already sorted by similarity seq…

    Submitted 29 November, 2023; originally announced November 2023.

    ACM Class: I.2.6

    Journal ref: Microbial Informatics and Experimentation volume 2, Article number: 2 (2012) https://microbialinformaticsj.biomedcentral.com/counter/pdf/10.1186/2042-5783-2-2.pdf

  10. arXiv:2311.04419  [pdf]

    q-bio.BM cs.AI q-bio.QM

    PepLand: a large-scale pre-trained peptide representation model for a comprehensive landscape of both canonical and non-canonical amino acids

    Authors: Ruochi Zhang, Haoran Wu, Yuting Xiu, Kewei Li, Ningning Chen, Yu Wang, Yan Wang, Xin Gao, Fengfeng Zhou

    Abstract: In recent years, the scientific community has become increasingly interested in peptides with non-canonical amino acids due to their superior stability and resistance to proteolytic degradation. These peptides present promising modifications to biological, pharmacological, and physicochemical attributes in both endogenous and engineered peptides. Notwithstanding their considerable advantages, the s…

    Submitted 7 November, 2023; originally announced November 2023.

  11. arXiv:2304.06176  [pdf]

    q-bio.QM eess.IV physics.bio-ph q-bio.CB q-bio.SC

    Surface-guided computing to analyze subcellular morphology and membrane-associated signals in 3D

    Authors: Felix Y. Zhou, Andrew Weems, Gabriel M. Gihana, Bingying Chen, Bo-Jui Chang, Meghan Driscoll, Gaudenz Danuser

    Abstract: Signal transduction and cell function are governed by the spatiotemporal organization of membrane-associated molecules. Despite significant advances in visualizing molecular distributions by 3D light microscopy, cell biologists still have limited quantitative understanding of the processes implicated in the regulation of molecular signals at the whole cell scale. In particular, complex and transie…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: 49 pages, 10 figures

  12. arXiv:2212.10883  [pdf, other]

    q-bio.QM math.AT q-bio.TO

    Detecting Temporal shape changes with the Euler Characteristic Transform

    Authors: Lewis Marsh, Felix Y. Zhou, Xiao Qin, Xin Lu, Helen M. Byrne, Heather A. Harrington

    Abstract: Organoids are multi-cellular structures which are cultured in vitro from stem cells to resemble specific organs (e.g., brain, liver) in their three-dimensional composition. Dynamic changes in the shape and composition of these model systems can be used to understand the effect of mutations and treatments in health and disease. In this paper, we propose a new technique in the field of topological d…

    Submitted 22 December, 2022; v1 submitted 21 December, 2022; originally announced December 2022.

  13. arXiv:q-bio/0409011  [pdf]

    q-bio.GN

    SUMO Substrates and Sites Prediction Combining Pattern Recognition and Phylogenetic Conservation

    Authors: Yu Xue, Fengfeng Zhou, Hualei Lu, Guoliang Chen, Xuebiao Yao

    Abstract: Small Ubiquitin-related modifier (SUMO) proteins are widely expressed in eukaryotic cells, which are reversibly coupled to their substrates by motif recognition, called sumoylation. Two interesting questions are 1) how many potential SUMO substrates may be included in mammalian proteomes, such as human and mouse, 2) and given a SUMO substrate, can we recognize its sumoylation sites? To answer th…

    Submitted 9 September, 2004; originally announced September 2004.

    Comments: 15 pages (including 1 figure and 2 tables)