Skip to main content

Showing 1–8 of 8 results for author: Ma, B

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2506.16084  [pdf

    q-bio.BM

    Aptamer-protein interaction prediction model based on transformer

    Authors: Zhichao Yan, Yue Kang, Buyong Ma

    Abstract: Aptamers are single-stranded DNA/RNAs or short peptides with unique tertiary structures that selectively bind to specific targets. They have great potential in the detection and medical fields. Here, we present SelfTrans-Ensemble, a deep learning model that integrates sequence information models and structural information models to extract multi-scale features for predicting aptamer-protein intera… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  2. arXiv:2506.02052  [pdf, ps, other

    q-bio.BM cs.AI cs.LG q-bio.QM

    Protap: A Benchmark for Protein Modeling on Realistic Downstream Applications

    Authors: Shuo Yan, Yuliang Yan, Bin Ma, Chenao Li, Haochun Tang, Jiahua Lu, Minhua Lin, Yuyuan Feng, Hui Xiong, Enyan Dai

    Abstract: Recently, extensive deep learning architectures and pretraining strategies have been explored to support downstream protein applications. Additionally, domain-specific models incorporating biological knowledge have been developed to enhance performance in specialized tasks. In this work, we introduce $\textbf{Protap}$, a comprehensive benchmark that systematically compares backbone architectures,… ▽ More

    Submitted 7 June, 2025; v1 submitted 1 June, 2025; originally announced June 2025.

  3. arXiv:2506.01857  [pdf

    q-bio.BM q-bio.QM

    Protein folding classes -- High-dimensional geometry of amino acid composition space revisited

    Authors: Boryeu Mao

    Abstract: In this study, the distributions of protein structure classes (or folding types) of experimentally determined structures from a legacy dataset and a comprehensive database SCOP are modeled precisely with geometric constructs such as convex polytopes in high-dimensional amino acid composition space. This is a follow-up of a previous non-statistical, geometry-motivated modeling of protein classes wi… ▽ More

    Submitted 15 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    Comments: 48 pages, 6 figures, 4 tables

  4. arXiv:2505.22563  [pdf, ps, other

    cs.CL q-bio.NC

    Do Large Language Models Think Like the Brain? Sentence-Level Evidence from fMRI and Hierarchical Embeddings

    Authors: Yu Lei, Xingyang Ge, Yi Zhang, Yiming Yang, Bolei Ma

    Abstract: Understanding whether large language models (LLMs) and the human brain converge on similar computational principles remains a fundamental and important question in cognitive neuroscience and AI. Do the brain-like patterns observed in LLMs emerge simply from scaling, or do they reflect deeper alignment with the architecture of human language processing? This study focuses on the sentence-level neur… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  5. arXiv:2210.12991  [pdf, other

    q-bio.QM

    What cleaves? Is proteasomal cleavage prediction reaching a ceiling?

    Authors: Ingo Ziegler, Bolei Ma, Ercong Nie, Bernd Bischl, David Rügamer, Benjamin Schubert, Emilio Dorigatti

    Abstract: Epitope vaccines are a promising direction to enable precision treatment for cancer, autoimmune diseases, and allergies. Effectively designing such vaccines requires accurate prediction of proteasomal cleavage in order to ensure that the epitopes in the vaccine are presented to T cells by the major histocompatibility complex (MHC). While direct identification of proteasomal cleavage \emph{in vitro… ▽ More

    Submitted 25 October, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: 15 pages, 1 figure

  6. arXiv:2208.00935  [pdf, other

    q-bio.QM eess.AS

    Amino Acid Classification in 2D NMR Spectra via Acoustic Signal Embeddings

    Authors: Jia Qi Yip, Dianwen Ng, Bin Ma, Konstantin Pervushin, Eng Siong Chng

    Abstract: Nuclear Magnetic Resonance (NMR) is used in structural biology to experimentally determine the structure of proteins, which is used in many areas of biology and is an important part of drug development. Unfortunately, NMR data can cost thousands of dollars per sample to collect and it can take a specialist weeks to assign the observed resonances to specific chemical groups. There has thus been gro… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  7. arXiv:1402.6260  [pdf, other

    math.NA math.AP q-bio.QM

    Finite difference approximations for a size-structured population model with distributed states in the recruitment

    Authors: A. S. Ackleh, J. Z. Farkas, X. Li, B. Ma

    Abstract: In this paper we consider a size-structured population model where individuals may be recruited into the population at different sizes. First and second order finite difference schemes are developed to approximate the solution of the mathematical model. The convergence of the approximations to a unique weak solution with bounded total variation is proved. We then show that as the distribution of t… ▽ More

    Submitted 25 February, 2014; originally announced February 2014.

    Journal ref: Journal of Biological Dynamics, 9, (2015) Supp.1, 2-31

  8. arXiv:cs/0111054  [pdf, ps, other

    cs.CC cond-mat.stat-mech cs.CE cs.CV math.CO math.MG math.ST physics.data-an q-bio.GN

    The similarity metric

    Authors: Ming Li, Xin Chen, Xin Li, Bin Ma, Paul Vitanyi

    Abstract: A new class of distances appropriate for measuring similarity relations between sequences, say one type of similarity per distance, is studied. We propose a new ``normalized information distance'', based on the noncomputable notion of Kolmogorov complexity, and show that it is in this class and it minorizes every computable distance in the class (that is, it is universal in that it discovers all… ▽ More

    Submitted 5 August, 2004; v1 submitted 20 November, 2001; originally announced November 2001.

    Comments: 13 pages, LaTex, 5 figures, Part of this work appeared in Proc. 14th ACM-SIAM Symp. Discrete Algorithms, 2003. This is the final, corrected, version to appear in IEEE Trans Inform. Th

    ACM Class: J.3, E.4