Skip to main content

Showing 1–31 of 31 results for author: Song, J

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2506.01116  [pdf, ps, other

    cs.AI q-bio.QM

    ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation

    Authors: Xinyi Liu, Lipeng Ma, Yixuan Li, Weidong Yang, Qingyuan Zhou, Jiayi Song, Shuhao Li, Ben Fei

    Abstract: Large Language Models (LLMs) are widely used across various scenarios due to their exceptional reasoning capabilities and natural language understanding. While LLMs demonstrate strong performance in tasks involving mathematics and coding, their effectiveness diminishes significantly when applied to chemistry-related problems. Chemistry problems typically involve long and complex reasoning steps, w… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  2. Association in Facial Phenotype, Gene, Disease: A Dataset for Explainable Rare Genetic Diseases Diagnosis

    Authors: Jie Song, Mengqiao He, Shumin Ren, Bairong Shen

    Abstract: Many rare genetic diseases exhibit recognizable facial phenotypes, which are often used as diagnostic clues. However, current facial phenotype diagnostic models, which are trained on image datasets, have high accuracy but often suffer from an inability to explain their predictions, which reduces physicians' confidence in the model output.In this paper, we constructed a dataset, called FGDD, which… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Journal ref: Scientific Data 12, 634 (2025)

  3. arXiv:2503.21450  [pdf, other

    cs.CE q-bio.BM

    CMADiff: Cross-Modal Aligned Diffusion for Controllable Protein Generation

    Authors: Changjian Zhou, Yuexi Qiu, Tongtong Ling, Jiafeng Li, Shuanghe Liu, Xiangjing Wang, Jia Song, Wensheng Xiang

    Abstract: AI-assisted protein design has emerged as a critical tool for advancing biotechnology, as deep generative models have demonstrated their reliability in this domain. However, most existing models primarily utilize protein sequence or structural data for training, neglecting the physicochemical properties of proteins.Moreover, they are deficient to control the generation of proteins in intuitive con… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  4. arXiv:2503.13522  [pdf, ps, other

    q-bio.BM cs.AI cs.LG

    Advanced Deep Learning Methods for Protein Structure Prediction and Design

    Authors: Yichao Zhang, Ningyuan Deng, Xinyuan Song, Ziqian Bi, Tianyang Wang, Zheyu Yao, Keyu Chen, Ming Li, Qian Niu, Junyu Liu, Benji Peng, Sen Zhang, Ming Liu, Li Zhang, Xuanhe Pan, Jinlang Wang, Pohsun Feng, Yizhu Wen, Lawrence KQ Yan, Hongming Tseng, Yan Zhong, Yunze Wang, Ziyuan Qin, Bowen Jing, Junjie Yang , et al. (3 additional authors not shown)

    Abstract: After AlphaFold won the Nobel Prize, protein prediction with deep learning once again became a hot topic. We comprehensively explore advanced deep learning methods applied to protein structure prediction and design. It begins by examining recent innovations in prediction architectures, with detailed discussions on improvements such as diffusion based frameworks and novel pairwise attention modules… ▽ More

    Submitted 29 March, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  5. arXiv:2502.15867  [pdf

    q-bio.OT cs.AI

    Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence

    Authors: Yingying Sun, Jun A, Zhiwei Liu, Rui Sun, Liujia Qian, Samuel H. Payne, Wout Bittremieux, Markus Ralser, Chen Li, Yi Chen, Zhen Dong, Yasset Perez-Riverol, Asif Khan, Chris Sander, Ruedi Aebersold, Juan Antonio Vizcaíno, Jonathan R Krieger, Jianhua Yao, Han Wen, Linfeng Zhang, Yunping Zhu, Yue Xuan, Benjamin Boyang Sun, Liang Qiao, Henning Hermjakob , et al. (37 additional authors not shown)

    Abstract: Artificial intelligence (AI) is transforming scientific research, including proteomics. Advances in mass spectrometry (MS)-based proteomics data quality, diversity, and scale, combined with groundbreaking AI techniques, are unlocking new challenges and opportunities in biological discovery. Here, we highlight key areas where AI is driving innovation, from data analysis to new biological insights.… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 28 pages, 2 figures, perspective in AI proteomics

  6. arXiv:2411.11875  [pdf, other

    cs.IR cs.AI cs.CL q-bio.BM

    Exploring Optimal Transport-Based Multi-Grained Alignments for Text-Molecule Retrieval

    Authors: Zijun Min, Bingshuai Liu, Liang Zhang, Jia Song, Jinsong Su, Song He, Xiaochen Bo

    Abstract: The field of bioinformatics has seen significant progress, making the cross-modal text-molecule retrieval task increasingly vital. This task focuses on accurately retrieving molecule structures based on textual descriptions, by effectively aligning textual descriptions and molecules to assist researchers in identifying suitable molecular candidates. However, many existing approaches overlook the d… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: BIBM 2024 Regular Paper

  7. arXiv:2409.03773  [pdf, other

    q-bio.BM cs.LG

    CoPRA: Bridging Cross-domain Pretrained Sequence Models with Complex Structures for Protein-RNA Binding Affinity Prediction

    Authors: Rong Han, Xiaohong Liu, Tong Pan, Jing Xu, Xiaoyu Wang, Wuyang Lan, Zhenyu Li, Zixuan Wang, Jiangning Song, Guangyu Wang, Ting Chen

    Abstract: Accurately measuring protein-RNA binding affinity is crucial in many biological processes and drug design. Previous computational methods for protein-RNA binding affinity prediction rely on either sequence or structure features, unable to capture the binding mechanisms comprehensively. The recent emerging pre-trained language models trained on massive unsupervised sequences of protein and RNA have… ▽ More

    Submitted 3 January, 2025; v1 submitted 21 August, 2024; originally announced September 2024.

  8. arXiv:2408.15310  [pdf, other

    q-bio.MN cs.CE cs.LG

    RGDA-DDI: Residual graph attention network and dual-attention based framework for drug-drug interaction prediction

    Authors: Changjian Zhou, Xin Zhang, Jiafeng Li, Jia Song, Wensheng Xiang

    Abstract: Recent studies suggest that drug-drug interaction (DDI) prediction via computational approaches has significant importance for understanding the functions and co-prescriptions of multiple drugs. However, the existing silico DDI prediction methods either ignore the potential interactions among drug-drug pairs (DDPs), or fail to explicitly model and fuse the multi-scale drug feature representations… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

  9. arXiv:2406.06985  [pdf

    q-bio.GN

    pVACview: an interactive visualization tool for efficient neoantigen prioritization and selection

    Authors: Huiming Xia, My Hoang, Evelyn Schmidt, Susanna Kiwala, Joshua McMichael, Zachary L. Skidmore, Bryan Fisk, Jonathan J. Song, Jasreet Hundal, Thomas Mooney, Jason R. Walker, S. Peter Goedegebuure, Christopher A. Miller, William E. Gillanders, Obi L. Griffith, Malachi Griffith

    Abstract: Neoantigen targeting therapies including personalized vaccines have shown promise in the treatment of cancers. Accurate identification/prioritization of neoantigens is highly relevant to designing clinical trials, predicting treatment response, and understanding mechanisms of resistance. With the advent of massively parallel sequencing technologies, it is now possible to predict neoantigens based… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Supplemental tables available at 10.5281/zenodo.11534338

  10. arXiv:2405.16248  [pdf

    eess.IV cs.CV cs.LG q-bio.QM

    Combining Radiomics and Machine Learning Approaches for Objective ASD Diagnosis: Verifying White Matter Associations with ASD

    Authors: Junlin Song, Yuzhuo Chen, Yuan Yao, Zetong Chen, Renhao Guo, Lida Yang, Xinyi Sui, Qihang Wang, Xijiao Li, Aihua Cao, Wei Li

    Abstract: Autism Spectrum Disorder is a condition characterized by a typical brain development leading to impairments in social skills, communication abilities, repetitive behaviors, and sensory processing. There have been many studies combining brain MRI images with machine learning algorithms to achieve objective diagnosis of autism, but the correlation between white matter and autism has not been fully u… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  11. arXiv:2404.10573  [pdf, other

    cs.AI cs.CE q-bio.BM

    AAVDiff: Experimental Validation of Enhanced Viability and Diversity in Recombinant Adeno-Associated Virus (AAV) Capsids through Diffusion Generation

    Authors: Lijun Liu, Jiali Yang, Jianfei Song, Xinglin Yang, Lele Niu, Zeqi Cai, Hui Shi, Tingjun Hou, Chang-yu Hsieh, Weiran Shen, Yafeng Deng

    Abstract: Recombinant adeno-associated virus (rAAV) vectors have revolutionized gene therapy, but their broad tropism and suboptimal transduction efficiency limit their clinical applications. To overcome these limitations, researchers have focused on designing and screening capsid libraries to identify improved vectors. However, the large sequence space and limited resources present challenges in identifyin… ▽ More

    Submitted 17 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  12. arXiv:2402.01481  [pdf, other

    cs.LG cs.AI q-bio.BM

    Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains

    Authors: Jiale Zhao, Wanru Zhuang, Jia Song, Yaqi Li, Shuqi Lu

    Abstract: In recent years, there has been a surge in the development of 3D structure-based pre-trained protein models, representing a significant advancement over pre-trained protein language models in various downstream tasks. However, most existing structure-based pre-trained models primarily focus on the residue level, i.e., alpha carbon atoms, while ignoring other atoms like side chain atoms. We argue t… ▽ More

    Submitted 2 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  13. arXiv:2307.12682  [pdf

    q-bio.BM

    Pro-PRIME: A general Temperature-Guided Language model to engineer enhanced Stability and Activity in Proteins

    Authors: Fan Jiang, Mingchen Li, Jiajun Dong, Yuanxi Yu, Xinyu Sun, Banghao Wu, Jin Huang, Liqi Kang, Yufeng Pei, Liang Zhang, Shaojie Wang, Wenxue Xu, Jingyao Xin, Wanli Ouyang, Guisheng Fan, Lirong Zheng, Yang Tan, Zhiqiang Hu, Yi Xiong, Yan Feng, Guangyu Yang, Qian Liu, Jie Song, Jia Liu, Liang Hong , et al. (1 additional authors not shown)

    Abstract: Designing protein mutants of both high stability and activity is a critical yet challenging task in protein engineering. Here, we introduce PRIME, a deep learning model, which can suggest protein mutants of improved stability and activity without any prior experimental mutagenesis data of the specified protein. Leveraging temperature-aware language modeling, PRIME demonstrated superior predictive… ▽ More

    Submitted 27 October, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2304.03780

  14. arXiv:2301.05864  [pdf, other

    cs.LG physics.chem-ph q-bio.BM

    Recent advances in artificial intelligence for retrosynthesis

    Authors: Zipeng Zhong, Jie Song, Zunlei Feng, Tiantao Liu, Lingxiang Jia, Shaolun Yao, Tingjun Hou, Mingli Song

    Abstract: Retrosynthesis is the cornerstone of organic chemistry, providing chemists in material and drug manufacturing access to poorly available and brand-new molecules. Conventional rule-based or expert-based computer-aided synthesis has obvious limitations, such as high labor costs and limited search space. In recent years, dramatic breakthroughs driven by artificial intelligence have revolutionized ret… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 27 pages, 6 figurs, 4 tables

  15. arXiv:2203.11444  [pdf, other

    cs.LG physics.chem-ph q-bio.BM

    Root-aligned SMILES: A Tight Representation for Chemical Reaction Prediction

    Authors: Zipeng Zhong, Jie Song, Zunlei Feng, Tiantao Liu, Lingxiang Jia, Shaolun Yao, Min Wu, Tingjun Hou, Mingli Song

    Abstract: Chemical reaction prediction, involving forward synthesis and retrosynthesis prediction, is a fundamental problem in organic synthesis. A popular computational paradigm formulates synthesis prediction as a sequence-to-sequence translation problem, where the typical SMILES is adopted for molecule representations. However, the general-purpose SMILES neglects the characteristics of chemical reactions… ▽ More

    Submitted 12 August, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: Chemical Science 2022. Main paper: 16 pages, 5 figures, and 6 tables; supplementary information: 8 pages, 5 figures and 3 tables. Code repository: https://github.com/otori-bird/retrosynthesis

  16. arXiv:2107.02146  [pdf, other

    stat.ME math.ST q-bio.NC stat.AP stat.ML

    Multivariate functional group sparse regression: functional predictor selection

    Authors: Ali Mahzarnia, Jun Song

    Abstract: In this paper, we propose methods for functional predictor selection and the estimation of smooth functional coefficients simultaneously in a scalar-on-function regression problem under high-dimensional multivariate functional data setting. In particular, we develop two methods for functional group-sparse regression under a generic Hilbert space of infinite dimension. We show the convergence of al… ▽ More

    Submitted 8 July, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: The R package that is developed for this paper is available at GitHub. See https://github.com/Ali-Mahzarnia/MFSGrp

  17. Transition behavior of the seizure dynamics modulated by the astrocyte inositol triphosphate noise

    Authors: JiaJia Li, Peihua Feng, Liang Zhao, Junying Chen, Mengmeng Du, Yangyang Yu, Jian Song, Ying Wu

    Abstract: Epilepsy is a neurological disorder with recurrent seizures of complexity and randomness. Until now, the mechanism of epileptic randomness has not been fully elucidated. Inspired by the recent finding that astrocyte GTPase-activating protein (G-protein)-coupled receptors could be involved in stochastic epileptic seizures, we proposed a neuron-astrocyte network model, incorporating the noise of the… ▽ More

    Submitted 31 October, 2022; v1 submitted 26 May, 2021; originally announced June 2021.

    Comments: 26 pages, 8 figures

  18. arXiv:2105.13427  [pdf, other

    q-bio.BM cond-mat.soft

    Small-Angle X-Ray Scattering Signatures of Conformational Heterogeneity and Homogeneity of Disordered Protein Ensembles

    Authors: Jianhui Song, Jichen Li, Hue Sun Chan

    Abstract: Physically, disordered ensembles of non-homopolymeric polypeptides are expected to be heterogeneous; i.e., they should differ from those homogeneous ensembles of homopolymers that harbor an essentially unique relationship between average values of end-to-end distance $R_{\rm EE}$ and radius of gyration $R_{\rm g}$. It was posited recently, however, that small-angle X-ray scattering (SAXS) data on… ▽ More

    Submitted 9 June, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: 66 pages, 21 main-text figures, supporting text, 7 supporting figures, and 145 references; with references updated and typographical errors in the previous version corrected. Accepted for publication in the Journal of Physical Chemistry B

    Journal ref: J. Phys. Chem. B 125, 6451-6478 (2021)

  19. arXiv:2010.13478  [pdf, other

    q-bio.QM q-bio.GN

    Pairwise heuristic sequence alignment algorithm based on deep reinforcement learning

    Authors: Yong Joon Song, Dong Jin Ji, Hye In Seo, Gyu Bum Han, Dong Ho Cho

    Abstract: Various methods have been developed to analyze the association between organisms and their genomic sequences. Among them, sequence alignment is the most frequently used for comparative analysis of biological genomes. However, the traditional sequence alignment method is considerably complicated in proportion to the sequences' length, and it is significantly challenging to align long sequences such… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 20pages, 9figures

  20. Robust Nucleus Detection with Partially Labeled Exemplars

    Authors: Linqing Feng, Jun Ho Song, Jiwon Kim, Soomin Jeong, Jin Sung Park, Jinhyun Kim

    Abstract: Quantitative analysis of cell nuclei in microscopic images is an essential yet challenging source of biological and pathological information. The major challenge is accurate detection and segmentation of densely packed nuclei in images acquired under a variety of conditions. Mask R-CNN-based methods have achieved state-of-the-art nucleus segmentation. However, the current pipeline requires fully a… ▽ More

    Submitted 13 November, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Journal ref: IEEE Access, vol. 7, pp. 162169-162178, 2019

  21. arXiv:1904.08648  [pdf

    q-bio.BM cond-mat.soft physics.bio-ph

    The lubricity of mucin solutions is robust toward changes in physiological conditions

    Authors: Jian Song, Benjamin Winkeljann, Oliver Lieleg

    Abstract: Solutions of manually purified gastric mucins have been shown to be promising lubricants for biomedical purposes, where they can efficiently reduce friction and wear. However, so far, such mucin solutions have been mostly tested in specific settings, and variations in the composition of the lubricating fluid have not been systematically explored. We here fill this gap and determine the viscosity,… ▽ More

    Submitted 18 July, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Journal ref: ACS Appl. Bio Mater. 2019, 2, 8, 3448-3457

  22. arXiv:1803.01974  [pdf

    q-bio.TO physics.bio-ph

    Pro-arrhythmogenic effects of heterogeneous tissue curvature: A suggestion for role of left atrial appendage in atrial fibrillation

    Authors: Jun-Seop Song, Jaehyeok Kim, Byounghyun Lim, Young-Seon Lee, Minki Hwang, Boyoung Joung, Eun Bo Shim, Hui-Nam Pak

    Abstract: Background: The arrhythmogenic role of atrial complex morphology has not yet been clearly elucidated. We hypothesized that bumpy tissue geometry can induce action potential duration (APD) dispersion and wavebreak in atrial fibrillation (AF). Methods and Results: We simulated 2D-bumpy atrial model by varying the degree of bumpiness, and 3D-left atrial (LA) models integrated by LA computed tomogra… ▽ More

    Submitted 14 September, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: Accepted for publication in Circulation Journal

    Journal ref: Circulation Journal 83.1 (2019) 32-40 [PMID: 30429429]

  23. arXiv:1711.04979  [pdf, other

    quant-ph cond-mat.other cs.DS q-bio.QM stat.ML

    Quantum transport senses community structure in networks

    Authors: Chenchao Zhao, Jun S. Song

    Abstract: Quantum time evolution exhibits rich physics, attributable to the interplay between the density and phase of a wave function. However, unlike classical heat diffusion, the wave nature of quantum mechanics has not yet been extensively explored in modern data analysis. We propose that the Laplace transform of quantum transport (QT) can be used to construct an ensemble of maps from a given complex ne… ▽ More

    Submitted 12 January, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

    Journal ref: Phys. Rev. E 98, 022301 (2018)

  24. arXiv:1705.06384  [pdf, other

    physics.data-an q-bio.QM

    Causality inference in stochastic systems from neurons to currencies: Profiting from small sample size

    Authors: Danh-Tai Hoang, Juyong Song, Vipul Periwal, Junghyo Jo

    Abstract: Success in modeling complex phenomena such as human perception hinges critically on the availability of data and computational power. Significant progress has been made in modeling such phenomena using probabilistic methods, particularly in image analysis and speech recognition. Maximum Likelihood Estimation (MLE) combined with Bayesian model selection is the basis of much of this progress, as MLE… ▽ More

    Submitted 8 May, 2018; v1 submitted 17 May, 2017; originally announced May 2017.

    Comments: 6 pages, 5 figures

    Journal ref: Phys. Rev. E 99, 023311 (2019)

  25. Conformational Heterogeneity and FRET Data Interpretation for Dimensions of Unfolded Proteins

    Authors: Jianhui Song, Gregory-Neal Gomes, Tongfei Shi, Claudiu C. Gradinaru, Hue Sun Chan

    Abstract: A mathematico-physically valid formulation is required to infer properties of disordered protein conformations from single-molecule Förster resonance energy transfer (smFRET). Conformational dimensions inferred by conventional approaches that presume a homogeneous conformational ensemble can be unphysical. When all possible---heterogeneous as well as homogeneous---conformational distributions are… ▽ More

    Submitted 31 July, 2017; v1 submitted 17 May, 2017; originally announced May 2017.

    Comments: 33 pages, 7 figures; 4 supporting figures. Accepted for publication in Biophysical Journal (content same as v2)

    Journal ref: Biophysical Journal 113:1012-1024 (2017)

  26. arXiv:1702.01373  [pdf, other

    stat.ML q-bio.QM stat.CO

    Exact heat kernel on a hypersphere and its applications in kernel SVM

    Authors: Chenchao Zhao, Jun S. Song

    Abstract: Many contemporary statistical learning methods assume a Euclidean feature space. This paper presents a method for defining similarity based on hyperspherical geometry and shows that it often improves the performance of support vector machine compared to other competing similarity measures. Specifically, the idea of using heat diffusion on a hypersphere to measure similarity has been previously pro… ▽ More

    Submitted 19 November, 2017; v1 submitted 4 February, 2017; originally announced February 2017.

  27. arXiv:1609.08205  [pdf, ps, other

    q-bio.BM cond-mat.soft

    Random-phase-approximation theory for sequence-dependent, biologically functional liquid-liquid phase separation of intrinsically disordered proteins

    Authors: Yi-Hsuan Lin, Jianhui Song, Julie D. Forman-Kay, Hue Sun Chan

    Abstract: Intrinsically disordered proteins (IDPs) are typically low in nonpolar/hydrophobic but relatively high in polar, charged, and aromatic amino acid compositions. Some IDPs undergo liquid-liquid phase separation in the aqueous milieu of the living cell. The resulting phase with enhanced IDP concentration can function as a major component of membraneless organelles that, by creating their own IDP-rich… ▽ More

    Submitted 26 September, 2016; originally announced September 2016.

    Comments: 21 pages, 12 figures; accepted for publication in Jol. Mol. Liquids

    Journal ref: J. Mol. Liq. 228, 176-193 (2017)

  28. Minimal Perceptrons for Memorizing Complex Patterns

    Authors: Marissa Pastor, Juyong Song, Danh-Tai Hoang, Junghyo Jo

    Abstract: Feedforward neural networks have been investigated to understand learning and memory, as well as applied to numerous practical problems in pattern classification. It is a rule of thumb that more complex tasks require larger networks. However, the design of optimal network architectures for specific tasks is still an unsolved fundamental problem. In this study, we consider three-layered neural netw… ▽ More

    Submitted 11 December, 2015; originally announced December 2015.

    Comments: 14 pages, 5 figures

    Journal ref: Physica A 462:31-37 (2016)

  29. arXiv:1506.01744  [pdf, other

    stat.ML cs.LG math.ST q-bio.GN

    Spectral Learning of Large Structured HMMs for Comparative Epigenomics

    Authors: Chicheng Zhang, Jimin Song, Kevin C Chen, Kamalika Chaudhuri

    Abstract: We develop a latent variable model and an efficient spectral algorithm motivated by the recent emergence of very large data sets of chromatin marks from multiple human cell types. A natural model for chromatin data in one cell type is a Hidden Markov Model (HMM); we model the relationship between multiple cell types by connecting their hidden states by a fixed tree of known structure. The main cha… ▽ More

    Submitted 4 June, 2015; originally announced June 2015.

    Comments: 27 pages, 3 figures

  30. arXiv:1312.3115  [pdf, other

    q-bio.GN q-bio.QM

    BayMeth: Improved DNA methylation quantification for affinity capture sequencing data using a flexible Bayesian approach

    Authors: Andrea Riebler, Mirco Menigatti, Jenny Z. Song, Aaron L. Statham, Clare Stirzaker, Nadiya Mahmud, Charles A. Mein, Susan J. Clark, Mark D. Robinson

    Abstract: DNA methylation (DNAme) is a critical component of the epigenetic regulatory machinery and aberrations in DNAme patterns occur in many diseases, such as cancer. Mapping and understanding DNAme profiles offers considerable promise for reversing the aberrant states. There are several approaches to analyze DNAme, which vary widely in cost, resolution and coverage. Affinity capture and high-throughput… ▽ More

    Submitted 11 December, 2013; originally announced December 2013.

    Comments: 58 pages (main text contains 33 pages), 20 figures (10 figures for the main text, 6 supplementary figures, 4 figures in supplementary text)

  31. Improved annotation of 3-prime untranslated regions and complex loci by combination of strand-specific Direct RNA Sequencing, RNA-seq and ESTs

    Authors: Nick Schurch, Christian Cole, Alexander Sherstnev, Junfang Song, Céline Duc, Kate G. Storey, W. H. Irwin McLean, Sara J. Brown, Gordon G. Simpson, Geoffrey J. Barton

    Abstract: The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct annotation is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental sy… ▽ More

    Submitted 11 November, 2013; originally announced November 2013.

    Comments: 44 pages, 9 figures

    Journal ref: PLoS ONE 9(4) (2014): e94270