Showing 1–27 of 27 results for author: Hu, W

  1. arXiv:2509.05309

    q-bio.QM cs.AI cs.CL

    ProtSAE: Disentangling and Interpreting Protein Language Models via Semantically-Guided Sparse Autoencoders

    Authors: Xiangyu Liu, Haodi Lei, Yi Liu, Yang Liu, Wei Hu

    Abstract: Sparse Autoencoder (SAE) has emerged as a powerful tool for mechanistic interpretability of large language models. Recent works apply SAE to protein language models (PLMs), aiming to extract and analyze biologically meaningful features from their latent spaces. However, SAE suffers from semantic entanglement, where individual neurons often mix multiple nonlinear concepts, making it difficult to re…

    Submitted 26 August, 2025; originally announced September 2025.

  2. arXiv:2507.20925

    cs.LG q-bio.QM

    Zero-Shot Learning with Subsequence Reordering Pretraining for Compound-Protein Interaction

    Authors: Hongzhi Zhang, Zhonglie Liu, Kun Meng, Jiameng Chen, Jia Wu, Bo Du, Di Lin, Yan Che, Wenbin Hu

    Abstract: Given the vastness of chemical space and the ongoing emergence of previously uncharacterized proteins, zero-shot compound-protein interaction (CPI) prediction better reflects the practical challenges and requirements of real-world drug development. Although existing methods perform adequately during certain CPI tasks, they still face the following challenges: (1) Representation learning from local…

    Submitted 28 July, 2025; originally announced July 2025.

  3. arXiv:2506.00854

    cs.CL cs.AI cs.LG cs.MM q-bio.NC

    EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG

    Authors: Jacky Tai-Yu Lu, Jung Chiang, Chi-Sheng Chen, Anna Nai-Yun Tung, Hsiang Wei Hu, Yuan Chiao Cheng

    Abstract: We propose EEG2TEXT-CN, which, to the best of our knowledge, represents one of the earliest open-vocabulary EEG-to-text generation frameworks tailored for Chinese. Built on a biologically grounded EEG encoder (NICE-EEG) and a compact pretrained language model (MiniLM), our architecture aligns multichannel brain signals with natural language representations via masked pretraining and contrastive le…

    Submitted 8 July, 2025; v1 submitted 1 June, 2025; originally announced June 2025.

  4. arXiv:2505.13940

    cs.AI q-bio.BM

    DrugPilot: LLM-based Parameterized Reasoning Agent for Drug Discovery

    Authors: Kun Li, Zhennan Wu, Shoupeng Wang, Jia Wu, Shirui Pan, Wenbin Hu

    Abstract: Large language models (LLMs) integrated with autonomous agents hold significant potential for advancing scientific discovery through automated reasoning and task execution. However, applying LLM agents to drug discovery is still constrained by challenges such as large-scale multimodal data processing, limited task automation, and poor support for domain-specific tools. To overcome these limitation…

    Submitted 28 July, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: 29 pages, 8 figures, 2 tables

  5. arXiv:2505.12068

    q-bio.NC

    Learning High-Order Relationships with Hypergraph Attention-based Spatio-Temporal Aggregation for Brain Disease Analysis

    Authors: Wenqi Hu, Xuerui Su, Guanliang Li, Yidi Pan, Aijing Lin

    Abstract: Traditional functional connectivity based on functional magnetic resonance imaging (fMRI) can only capture pairwise interactions between brain regions. Hypergraphs, which reveal high-order relationships among multiple brain regions, have been widely used for disease analysis. However, existing methods often rely on predefined hypergraph structures, limiting their ability to model complex patterns.…

    Submitted 17 May, 2025; originally announced May 2025.

  6. arXiv:2502.08975

    cs.LG q-bio.BM

    Graph-structured Small Molecule Drug Discovery Through Deep Learning: Progress, Challenges, and Opportunities

    Authors: Kun Li, Yida Xiong, Hongzhi Zhang, Xiantao Cai, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Due to their excellent drug-like and pharmacokinetic properties, small molecule drugs are widely used to treat various diseases, making them a critical component of drug discovery. In recent years, with the rapid development of deep learning (DL) techniques, DL-based small molecule drug discovery methods have achieved excellent performance in prediction accuracy, speed, and complex molecular relat…

    Submitted 14 May, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    Comments: 10 pages, 1 figure, 8 tables

  7. arXiv:2501.15799

    q-bio.BM cs.LG cs.NE

    Can Molecular Evolution Mechanism Enhance Molecular Representation?

    Authors: Kun Li, Longtao Hu, Xiantao Cai, Jia Wu, Wenbin Hu

    Abstract: Molecular evolution is the process of simulating the natural evolution of molecules in chemical space to explore potential molecular structures and properties. The relationships between similar molecules are often described through transformations such as adding, deleting, and modifying atoms and chemical bonds, reflecting specific evolutionary paths. Existing molecular representation methods main…

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: 9 pages, 6 figures, 5 tables

  8. arXiv:2501.15007

    cs.AI cs.CE q-bio.QM

    Controllable Protein Sequence Generation with LLM Preference Optimization

    Authors: Xiangyu Liu, Yi Liu, Silei Chen, Wei Hu

    Abstract: Designing proteins with specific attributes offers an important solution to address biomedical challenges. Pre-trained protein large language models (LLMs) have shown promising results on protein sequence generation. However, to control sequence generation for specific attributes, existing work still exhibits poor functionality and structural stability. In this paper, we propose a novel controllab…

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: Accepted at the 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025)

  9. arXiv:2408.09106

    q-bio.BM cs.AI

    Fragment-Masked Diffusion for Molecular Optimization

    Authors: Kun Li, Xiantao Cai, Jia Wu, Shirui Pan, Huiting Xu, Bo Du, Wenbin Hu

    Abstract: Molecular optimization is a crucial aspect of drug discovery, aimed at refining molecular structures to enhance drug efficacy and minimize side effects, ultimately accelerating the overall drug development process. Many molecular optimization methods have been proposed, significantly advancing drug discovery. These methods primarily rely on understanding the specific drug target structures or their hyp…

    Submitted 14 May, 2025; v1 submitted 17 August, 2024; originally announced August 2024.

    Comments: 12 pages, 9 figures, 4 tables

  10. arXiv:2405.14545

    q-bio.BM cs.LG

    A Cross-Field Fusion Strategy for Drug-Target Interaction Prediction

    Authors: Hongzhi Zhang, Xiuwen Gong, Shirui Pan, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Drug-target interaction (DTI) prediction is a critical component of the drug discovery process. In the drug development engineering field, predicting novel drug-target interactions is extremely crucial. However, although existing methods have achieved high accuracy levels in predicting known drugs and drug targets, they fail to utilize global protein information during DTI prediction. This leads to…

    Submitted 23 May, 2024; originally announced May 2024.

  11. arXiv:2405.14536

    q-bio.MN cs.AI cs.LG

    Regressor-free Molecule Generation to Support Drug Response Prediction

    Authors: Kun Li, Xiuwen Gong, Shirui Pan, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Drug response prediction (DRP) is a crucial phase in drug discovery, and the most important metric for its evaluation is the IC50 score. DRP results are heavily dependent on the quality of the generated molecules. Existing molecule generation methods typically employ classifier-based guidance, enabling sampling within the IC50 classification range. However, these methods fail to ensure the samplin…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 22 pages, 7 figures, 9 tables

  12. arXiv:2312.10707

    q-bio.BM cs.AI cs.LG q-bio.MN

    CLDR: Contrastive Learning Drug Response Models from Natural Language Supervision

    Authors: Kun Li, Wenbin Hu

    Abstract: Deep learning-based drug response prediction (DRP) methods can accelerate the drug discovery process and reduce R&D costs. Although the mainstream methods achieve high accuracy in predicting response regression values, the regression-aware representations of these methods are fragmented and fail to capture the continuity of the sample order. This phenomenon leads to models optimized to sub-optima…

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 9 pages, 4 figures, 3 tables

  13. arXiv:2311.07624

    q-bio.PE stat.AP

    Disordered hyperuniformity signals functioning and resilience of self-organized vegetation patterns

    Authors: Wensi Hu, Quan-Xing Liu, Bo Wang, Nuo Xu, Lijuan Cui, Chi Xu

    Abstract: In harsh environments, organisms may self-organize into spatially patterned systems in various ways. So far, studies of ecosystem spatial self-organization have primarily focused on apparent orders reflected by regular patterns. However, self-organized ecosystems may also have cryptic orders that can be unveiled only through certain quantitative analyses. Here we show that disordered hyperuniformi…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 34 pages, 6 figures; Supplementary Materials, 19 pages, 10 figures, 2 tables

  14. arXiv:2310.12996

    q-bio.BM cs.AI cs.LG q-bio.CB q-bio.GN

    Zero-shot Learning of Drug Response Prediction for Preclinical Drug Screening

    Authors: Kun Li, Yong Luo, Xiantao Cai, Wenbin Hu, Bo Du

    Abstract: Conventional deep learning methods typically employ supervised learning for drug response prediction (DRP). This entails dependence on labeled response data from drugs for model training. However, practical applications in the preclinical drug screening phase demand that DRP models predict responses for novel compounds, often with unknown drug responses. This presents a challenge, rendering superv…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 16 pages, 3 figures, 3 tables

  15. arXiv:2303.00313

    cs.LG q-bio.BM

    Deep Learning Methods for Small Molecule Drug Discovery: A Survey

    Authors: Wenhao Hu, Yingying Liu, Xuanyu Chen, Wenhao Chai, Hangyue Chen, Hongwei Wang, Gaoang Wang

    Abstract: With the development of computer-assisted techniques, research communities including biochemistry and deep learning have been devoted to the drug discovery field for over a decade. Various applications of deep learning have drawn great attention in drug discovery, such as molecule generation, molecular property prediction, retrosynthesis prediction, and reaction prediction. While most existing s…

    Submitted 5 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  16. arXiv:2210.17401

    q-bio.BM cs.AI cs.LG

    Towards a Better Model with Dual Transformer for Drug Response Prediction

    Authors: Kun Li, Jia Wu, Bo Du, Sergey V. Petoukhov, Huiting Xu, Zheman Xiao, Wenbin Hu

    Abstract: As a mainstream approach, GNN-based methods have achieved excellent results in drug response prediction tasks in recent years. Traditional GNN methods use only the atoms in a drug molecule as nodes to obtain the representation of the molecular graph through node information passing, whereas the method using the transformer can only extract information about the nodes. However, the covalent bonding and…

    Submitted 10 December, 2024; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 28 pages, 4 figures, 5 tables

  17. arXiv:2008.11622

    physics.app-ph q-bio.QM

    Effectiveness of Common Fabrics to Block Aqueous Aerosols of COVID Virus-like Nanoparticles

    Authors: Steven R. Lustig, John J. S. Biswakarma, Devyesh Rana, Susan H. Tilford, Weike Hu, Ming Su, Michael S. Rosenblatt

    Abstract: Layered systems of commonly available fabric materials can be used by the public and healthcare providers in face masks to reduce the risk of inhaling viruses with protection about equivalent to or better than the filtration and adsorption offered by 5-layer N95 respirators. Over 70 different common fabric combinations and masks were evaluated under steady state, forced convection air flux with pulse…

    Submitted 26 August, 2020; originally announced August 2020.

  18. arXiv:2006.09928

    q-bio.NC eess.IV

    Functional connectome fingerprinting: Identifying individuals and predicting cognitive function via deep learning

    Authors: Biao Cai, Gemeng Zhang, Aiying Zhang, Li Xiao, Wenxing Hu, Julia M. Stephen, Tony W. Wilson, Vince D. Calhoun, Yu-Ping Wang

    Abstract: The dynamic characteristics of functional network connectivity have been widely acknowledged and studied. Both shared and unique information has been shown to be present in the connectomes. However, very little is known about whether and how this common pattern can predict the individual variability of the brain, i.e. "brain fingerprinting", which attempts to reliably identify a particular i…

    Submitted 17 June, 2020; originally announced June 2020.

  19. arXiv:2006.09454

    q-bio.NC cs.CV cs.LG eess.IV

    Interpretable multimodal fusion networks reveal mechanisms of brain cognition

    Authors: Wenxing Hu, Xianghe Meng, Yuntong Bai, Aiying Zhang, Biao Cai, Gemeng Zhang, Tony W. Wilson, Julia M. Stephen, Vince D. Calhoun, Yu-Ping Wang

    Abstract: Multimodal fusion benefits disease diagnosis by providing a more comprehensive perspective. Developing algorithms is challenging due to data heterogeneity and the complex within- and between-modality associations. Deep-network-based data-fusion models have been developed to capture the complex associations and the performance in diagnosis has been improved accordingly. Moving beyond diagnosis pred…

    Submitted 16 June, 2020; originally announced June 2020.

  20. arXiv:2005.01200

    q-bio.PE

    Evolution of chemotactic hitchhiking

    Authors: Gurdip Uppal, Weiyi Hu, Dervis Can Vural

    Abstract: Bacteria typically reside in heterogeneous environments with various chemogradients where motile cells can gain an advantage over non-motile cells. Since motility is energetically costly, cells must optimize their swimming speed and behavior to maximize their fitness. Here we investigate how cheating strategies might evolve where slow or non-motile microbes exploit faster ones by sticking together…

    Submitted 3 May, 2020; originally announced May 2020.

    Comments: 10 pages, 5 figures

  21. arXiv:1901.11418

    q-bio.NC cs.LG eess.SP stat.ML

    Sequential Bayesian Detection of Spike Activities from Fluorescence Observations

    Authors: Zhuangkun Wei, Bin Li, Weisi Guo, Wenxiu Hu, Chenglin Zhao

    Abstract: Extracting and detecting spike activities from fluorescence observations is an important step in understanding how neuron systems work. The main challenge is that the combination of ambient noise with dynamic baseline fluctuation often contaminates the observations, thereby deteriorating the reliability of spike detection. This may be even worse in the face of the nonlinear biologica…

    Submitted 31 January, 2019; originally announced January 2019.

  22. arXiv:1604.06131

    physics.bio-ph cond-mat.soft physics.comp-ph physics.flu-dyn q-bio.CB

    Amoeboid swimming in a channel

    Authors: Hao Wu, A. Farutin, W.-F. Hu, M. Thiébaud, S. Rafaï, P. Peyla, M.-C. Lai, C. Misbah

    Abstract: Several micro-organisms, such as bacteria, algae, or spermatozoa, use flagella or cilia to swim in a fluid, while many other micro-organisms instead use ample shape deformation, described as amoeboid, to propel themselves by either crawling on a substrate or swimming. Many eukaryotic cells were believed to require an underlying substratum to migrate (crawl) by using membrane deformation (like bleb…

    Submitted 28 August, 2016; v1 submitted 20 April, 2016; originally announced April 2016.

    Comments: Advance Article, Soft Matter (2016), 16 pages, 18 figures

    Journal ref: Soft Matter, 12, 7470-7484 (2016)

  23. arXiv:1502.03975

    physics.bio-ph cond-mat.soft physics.comp-ph physics.flu-dyn q-bio.CB

    Amoeboid motion in confined geometry

    Authors: Hao Wu, M. Thiébaud, W.-F. Hu, A. Farutin, S. Rafaï, M.-C. Lai, P. Peyla, C. Misbah

    Abstract: Many eukaryotic cells undergo frequent shape changes (described as amoeboid motion) that enable them to move forward. We investigate the effect of confinement on a minimal model of an amoeboid swimmer. Complex pictures emerge: (i) The swimmer's nature (i.e., either pusher or puller) can be modified by confinement, thus suggesting that this is not an intrinsic property of the swimmer. This swimming na…

    Submitted 4 November, 2015; v1 submitted 13 February, 2015; originally announced February 2015.

    Comments: 5 pages, 7 figures

    Journal ref: Phys. Rev. E 92, 050701 (2015)

  24. arXiv:1403.3256

    q-bio.OT

    Parkinson disease is a TH17 dominant autoimmune disorder against accumulated alpha-synuclein

    Authors: Wan-Chung Hu

    Abstract: Parkinson disease is a very common neurodegenerative disorder. Patients usually undergo destruction of substantia nigra to develop typical symptoms such as resting tremor, hypokinesia, and rigidity. However, the exact mechanism of Parkinson disease is still unknown, so it is called idiopathic Parkinsonism. According to my microarray analysis of peripheral blood leukocytes and substantia nigra brai…

    Submitted 21 November, 2013; originally announced March 2014.

  25. arXiv:1311.4968

    q-bio.CB q-bio.GN

    Unstable Angina is a syndrome correlated to mixed Th17 and Th1 immune disorder

    Authors: Wan-Chung Hu

    Abstract: Unstable angina is a common clinical manifestation of atherosclerosis. However, the detailed pathogenesis of unstable angina is still not known. Here, I propose that unstable angina is a mixed TH17 and TH1 immune disorder. By using microarray analysis, I find that TH1 and TH17 related cytokine, cytokine receptor, chemokines, complement, immune-related transcription factors, anti-bacterial genes,…

    Submitted 20 November, 2013; originally announced November 2013.

  26. arXiv:1311.4747

    q-bio.GN q-bio.CB

    Sepsis is a syndrome with hyperactivity of TH17-like innate immunity and hypoactivity of adaptive immunity

    Authors: Wan-Chung Hu

    Abstract: Currently, there are two major theories for the pathogenesis of sepsis: hyperimmune and hypoimmune. Hyperimmune theory suggests that cytokine storm causes the symptoms of sepsis. On the contrary, hypoimmune theory suggests that immunosuppression causes the manifestations of sepsis. By using a microarray study, this study implies that hyperactivity of TH17-like innate immunity and failure of adaptive…

    Submitted 19 November, 2013; originally announced November 2013.

  27. arXiv:1311.4384

    q-bio.TO q-bio.GN

    Acute Respiratory Distress Syndrome is a TH17-like and Treg immune disease

    Authors: Wan-Chung Hu

    Abstract: Acute Respiratory Distress Syndrome (ARDS) is a very severe syndrome leading to respiratory failure and subsequent mortality. Sepsis is one of the leading causes of ARDS. Thus, extracellular bacteria play an important role in the pathophysiology of ARDS. Overactivated neutrophils are the major effector cells in ARDS. Thus, extracellular bacteria triggered TH17-like innate immunity with neutrophil…

    Submitted 18 November, 2013; originally announced November 2013.