
Showing 1–22 of 22 results for author: Wong, W

Searching in archive q-bio.
  1. arXiv:2412.03005  [pdf]

    q-bio.GN

    gghic: A Versatile R Package for Exploring and Visualizing 3D Genome Organization

    Authors: Minghao Jiang, Duohui Jing, Jason W. H. Wong

    Abstract: Motivation: The three-dimensional (3D) organization of the genome plays a critical role in regulating gene expression and maintaining cellular homeostasis. Disruptions in this spatial organization can result in abnormal chromatin interactions, contributing to the development of various diseases including cancer. Advances in chromosome conformation capture technologies, such as Hi-C, have enabled r…

    Submitted 3 December, 2024; originally announced December 2024.

  2. arXiv:2411.10548  [pdf, ps, other]

    cs.LG q-bio.BM

    BioNeMo Framework: a modular, high-performance library for AI model development in drug discovery

    Authors: Peter St. John, Dejun Lin, Polina Binder, Malcolm Greaves, Vega Shah, John St. John, Adrian Lange, Patrick Hsu, Rajesh Illango, Arvind Ramanathan, Anima Anandkumar, David H Brookes, Akosua Busia, Abhishaike Mahajan, Stephen Malina, Neha Prasad, Sam Sinai, Lindsay Edwards, Thomas Gaudelet, Cristian Regep, Martin Steinegger, Burkhard Rost, Alexander Brace, Kyle Hippe, Luca Naef, et al. (63 additional authors not shown)

    Abstract: Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational bio…

    Submitted 15 November, 2024; originally announced November 2024.

  3. arXiv:2407.09089  [pdf]

    q-bio.MN

    Lomics: Generation of Pathways and Gene Sets using Large Language Models for Transcriptomic Analysis

    Authors: Chun-Ka Wong, Ali Choo, Eugene C. C. Cheng, Wing-Chun San, Kelvin Chak-Kong Cheng, Yee-Man Lau, Minqing Lin, Fei Li, Wei-Hao Liang, Song-Yan Liao, Kwong-Man Ng, Ivan Fan-Ngai Hung, Hung-Fat Tse, Jason Wing-Hon Wong

    Abstract: Interrogation of biological pathways is an integral part of omics data analysis. Large language models (LLMs) enable the generation of custom pathways and gene sets tailored to specific scientific questions. These targeted sets are significantly smaller than traditional pathway enrichment analysis libraries, reducing multiple hypothesis testing and potentially enhancing statistical power. Lomics (…

    Submitted 12 July, 2024; originally announced July 2024.

  4. arXiv:2402.13297  [pdf, other]

    q-bio.QM cs.AI

    Integrating Deep Learning and Synthetic Biology: A Co-Design Approach for Enhancing Gene Expression via N-terminal Coding Sequences

    Authors: Zhanglu Yan, Weiran Chu, Yuhua Sheng, Kaiwen Tang, Shida Wang, Yanfeng Liu, Weng-Fai Wong

    Abstract: N-terminal coding sequence (NCS) influences gene expression by impacting the translation initiation rate. The NCS optimization problem is to find an NCS that maximizes gene expression. The problem is important in genetic engineering. However, current methods for NCS optimization such as rational design and statistics-guided approaches are labor-intensive and yield only relatively small improvements. T…

    Submitted 20 February, 2024; originally announced February 2024.

  5. arXiv:2302.11669  [pdf, other]

    q-bio.BM cs.IT

    RNA secondary structures: from ab initio prediction to better compression, and back

    Authors: Evarista Onokpasa, Sebastian Wild, Prudence W. H. Wong

    Abstract: In this paper, we use the biological domain knowledge incorporated into stochastic models for ab initio RNA secondary-structure prediction to improve the state of the art in joint compression of RNA sequence and structure data (Liu et al., BMC Bioinformatics, 2008). Moreover, we show that, conversely, compression ratio can serve as a cheap and robust proxy for comparing the prediction quality of d…

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: paper at Data Compression Conference 2023

  6. arXiv:2212.10653  [pdf, other]

    q-bio.MN

    Estimating and Assessing Differential Equation Models with Time-Course Data

    Authors: Samuel W. K. Wong, Shihao Yang, S. C. Kou

    Abstract: Ordinary differential equation (ODE) models are widely used to describe chemical or biological processes. This article considers the estimation and assessment of such models on the basis of time-course data. Due to experimental limitations, time-course data are often noisy and some components of the system may not be observed. Furthermore, the computational demands of numerical integration have hi…

    Submitted 13 February, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 26 pages, 8 figures, with code supplement

  7. arXiv:2210.13323  [pdf, other]

    q-bio.PE stat.AP

    A Comparative Study of Compartmental Models for COVID-19 Transmission in Ontario, Canada

    Authors: Yuxuan Zhao, Samuel W. K. Wong

    Abstract: The number of confirmed COVID-19 cases reached over 1.3 million in Ontario, Canada by June 4, 2022. The continued spread of the virus underlying COVID-19 has been spurred by the emergence of variants since the initial outbreak in December, 2019. Much attention has thus been devoted to tracking and modelling the transmission of COVID-19. Compartmental models are commonly used to mimic epidemic tran…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 26 pages, 8 figures

  8. arXiv:2207.03105  [pdf]

    q-bio.TO cs.CV eess.IV physics.med-ph

    Uncertainty-Aware Self-supervised Neural Network for Liver $T_{1ρ}$ Mapping with Relaxation Constraint

    Authors: Chaoxing Huang, Yurui Qian, Simon Chun Ho Yu, Jian Hou, Baiyan Jiang, Queenie Chan, Vincent Wai-Sun Wong, Winnie Chiu-Wing Chu, Weitian Chen

    Abstract: $T_{1ρ}$ mapping is a promising quantitative MRI technique for the non-invasive assessment of tissue properties. Learning-based approaches can map $T_{1ρ}$ from a reduced number of $T_{1ρ}$ weighted images, but require significant amounts of high quality training data. Moreover, existing methods do not provide the confidence level of the $T_{1ρ}…

    Submitted 25 October, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Provisionally accepted by Physics in Medicine and Biology

  9. arXiv:2206.06159  [pdf]

    q-bio.PE

    Moving towards FAIR practices in epidemiological research

    Authors: Montserrat Garcia-Closas, Thomas U. Ahearn, Mia M. Gaudet, Amber N. Hurson, Jeya Balaji Balasubramanian, Parichoy Pal Choudhury, Nicole M. Gerlanc, Bhaumik Patel, Daniel Russ, Mustapha Abubakar, Neal D. Freedman, Wendy S. W. Wong, Stephen J. Chanock, Amy Berrington de Gonzalez, Jonas S Almeida

    Abstract: Reproducibility and replicability of research findings are central to the scientific integrity of epidemiology. In addition, many research questions require combining data from multiple sources to achieve adequate statistical power. However, barriers related to confidentiality, costs, and incentives often limit the extent and speed of sharing resources, both data and code. Epidemiological practices…

    Submitted 13 June, 2022; originally announced June 2022.

  10. arXiv:2203.11299  [pdf, other]

    q-bio.NC

    A Fundamental Inequality Governing the Rate Coding Response of Sensory Neurons

    Authors: Willy Wong

    Abstract: A fundamental inequality governing the spike activity of peripheral neurons is derived and tested against auditory data. This inequality states that the steady-state firing rate must lie between the arithmetic and geometric means of the spontaneous and peak activities during adaptation. Implications towards the development of auditory mechanistic models are explored.

    Submitted 1 August, 2023; v1 submitted 21 March, 2022; originally announced March 2022.

  11. arXiv:2201.07775  [pdf, other]

    stat.AP q-bio.BM

    Monte Carlo sampling of flexible protein structures: an application to the SARS-CoV-2 omicron variant

    Authors: Samuel W. K. Wong

    Abstract: Proteins can exhibit dynamic structural flexibility as they carry out their functions, especially in binding regions that interact with other molecules. For the key SARS-CoV-2 spike protein that facilitates COVID-19 infection, studies have previously identified several such highly flexible regions with therapeutic importance. However, protein structures available from the Protein Data Bank are pre…

    Submitted 4 February, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: 20 pages, 4 figures

  12. arXiv:2105.08835  [pdf, ps, other]

    q-bio.BM stat.AP

    Conformational variability of loops in the SARS-CoV-2 spike protein

    Authors: Samuel W. K. Wong, Zongjun Liu

    Abstract: The SARS-CoV-2 spike (S) protein facilitates viral infection, and has been the focus of many structure determination efforts. Its flexible loop regions are known to be involved in protein binding and may adopt multiple conformations. This paper identifies the S protein loops and studies their conformational variability based on the available Protein Data Bank (PDB) structures. While most loops had…

    Submitted 13 October, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: 24 pages

  13. arXiv:2104.10878  [pdf, other]

    stat.AP q-bio.PE

    Comparing regional and provincial-wide COVID-19 models with physical distancing in British Columbia

    Authors: Geoffrey McGregor, Jennifer Tippett, Andy T. S. Wan, Mengxiao Wang, Samuel W. K. Wong

    Abstract: We study the effects of physical distancing measures for the spread of COVID-19 in regional areas within British Columbia, using the reported cases of the five provincial Health Authorities. Building on the Bayesian epidemiological model of Anderson et al. (2020), we propose a hierarchical regional Bayesian model with time-varying regional parameters from March to December of 2020. In the absen…

    Submitted 13 November, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: 35 pages, 16 figures

    Journal ref: AIMS Mathematics, 2022, 7(4): 6743-6778

  14. arXiv:2101.07494  [pdf, other]

    physics.soc-ph q-bio.PE

    SIR Simulation of COVID-19 Pandemic in Malaysia: Will the Vaccination Program be Effective?

    Authors: W. K. Wong, Filbert H. Juwono, Tock H. Chua

    Abstract: Since the end of 2019, COVID-19 has significantly affected the lives of people around the world. Towards the end of 2020, several COVID-19 vaccine candidates with relatively high efficacy have been reported in the final phase of clinical trials. Vaccines have been considered as critical tools for opening up social and economic activities, thereby lessening the impact of this disease on the society…

    Submitted 19 January, 2021; originally announced January 2021.

  15. arXiv:2101.02304  [pdf, other]

    stat.AP q-bio.BM

    Statistical challenges in the analysis of sequence and structure data for the COVID-19 spike protein

    Authors: Shiyu He, Samuel W. K. Wong

    Abstract: As the major target of many vaccines and neutralizing antibodies against SARS-CoV-2, the spike (S) protein is observed to mutate over time. In this paper, we present statistical approaches to tackle some challenges associated with the analysis of S-protein data. We build a Bayesian hierarchical model to study the temporal and spatial evolution of S-protein sequences, after grouping the sequences i…

    Submitted 30 January, 2021; v1 submitted 6 January, 2021; originally announced January 2021.

    Comments: 21 pages, 5 figures

  16. Assessing the impacts of mutations to the structure of COVID-19 spike protein via sequential Monte Carlo

    Authors: Samuel W. K. Wong

    Abstract: Proteins play a key role in facilitating the infectiousness of the 2019 novel coronavirus. A specific spike protein enables this virus to bind to human cells, and a thorough understanding of its 3-dimensional structure is therefore critical for developing effective therapeutic interventions. However, its structure may continue to evolve over time as a result of mutations. In this paper, we use a d…

    Submitted 11 June, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: 15 pages, 4 figures

    Journal ref: Journal of Data Science, 2020, 18(3): 511-525

  17. arXiv:1610.07213  [pdf, other]

    stat.ME q-bio.MN q-bio.QM

    Stochastic Modeling and Statistical Inference of Intrinsic Noise in Gene Regulation System via Chemical Master Equation

    Authors: Chao Du, Wing Hong Wong

    Abstract: Intrinsic noise, the stochastic cell-to-cell fluctuations in mRNAs and proteins, has been observed and proved to play important roles in cellular systems. Due to the recent development in single-cell-level measurement technology, the studies on intrinsic noise are becoming increasingly popular among scholars. The chemical master equation (CME) has been used to model the evolutions of complex chemi…

    Submitted 11 November, 2017; v1 submitted 23 October, 2016; originally announced October 2016.

    Comments: 64 pages, 5 figures

  18. arXiv:1307.6445  [pdf, other]

    q-bio.NC cond-mat.stat-mech physics.bio-ph

    On the Rate Coding Response of Peripheral Sensory Neurons

    Authors: Willy Wong

    Abstract: The rate coding response of a single peripheral sensory neuron in the asymptotic, near-equilibrium limit can be derived using information theory, asymptotic Bayesian statistics and a theory of complex systems. Almost no biological knowledge is required. The theoretical expression shows good agreement with spike-frequency adaptation data across different sensory modalities and animal species. The a…

    Submitted 10 December, 2020; v1 submitted 24 July, 2013; originally announced July 2013.

  19. arXiv:1207.3137  [pdf, ps, other]

    q-bio.MN stat.AP

    Learning a nonlinear dynamical system model of gene regulation: A perturbed steady-state approach

    Authors: Arwen Vanice Bradley, Ye Henry Li, Bokyung Choi, Wing Hung Wong

    Abstract: Biological structure and function depend on complex regulatory interactions between many genes. A wealth of gene expression data is available from high-throughput genome-wide measurement technologies, but effective gene regulatory network inference methods are still needed. Model-based methods founded on quantitative descriptions of gene regulation are among the most promising, but many such metho…

    Submitted 25 March, 2016; v1 submitted 12 July, 2012; originally announced July 2012.

    Comments: Published at http://dx.doi.org/10.1214/13-AOAS645 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS645

    Journal ref: Annals of Applied Statistics 2013, Vol. 7, No. 3, 1311-1333

  20. arXiv:1106.3211  [pdf, ps, other]

    stat.ME q-bio.GN

    Statistical Modeling of RNA-Seq Data

    Authors: Julia Salzman, Hui Jiang, Wing Hung Wong

    Abstract: Recently, ultra high-throughput sequencing of RNA (RNA-Seq) has been developed as an approach for analysis of gene expression. By obtaining tens or even hundreds of millions of reads of transcribed sequences, an RNA-Seq experiment can offer a comprehensive survey of the population of genes (transcripts) in any sample of interest. This paper introduces a statistical model for estimating isoform abu…

    Submitted 16 June, 2011; originally announced June 2011.

    Comments: Published at http://dx.doi.org/10.1214/10-STS343 in Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS343

    Journal ref: Statistical Science 2011, Vol. 26, No. 1, 62-83

  21. arXiv:1105.0126  [pdf]

    q-bio.BM

    The surface accessibility of α-bungarotoxin monitored by a novel paramagnetic probe

    Authors: Andrea Bernini, Vincenzo Venditti, Ottavia Spiga, Filippo Prischi, Mauro Botta, Angela Pui-Ling Tong, Wing-Tak Wong, Neri Niccolai

    Abstract: The surface accessibility of α-bungarotoxin has been investigated by using Gd2L7, a newly designed paramagnetic NMR probe. Signal attenuations induced by Gd2L7 on α-bungarotoxin CαH peaks of 1H-13C HSQC spectra have been analyzed and compared with the ones previously obtained in the presence of GdDTPA-BMA. In spite of the different molecular size and shape, for the two probes a common pathway of a…

    Submitted 30 April, 2011; originally announced May 2011.

    Comments: 13 pages, 4 figures, preliminary report

  22. The use of oscillatory signals in the study of genetic networks

    Authors: Ovidiu Lipan, Wing H. Wong

    Abstract: The structure of a genetic network is uncovered by studying its response to external stimuli (input signals). We present a theory of propagation of an input signal through a linear stochastic genetic network. It is found that there are important advantages in using oscillatory signals over step or impulse signals, and that the system may enter into a pure fluctuation resonance for a specific inp…

    Submitted 23 February, 2005; originally announced February 2005.

    Comments: 46 pages, 5 figures. Submitted to PNAS on May 27th 2004. The paper is under consideration