Search | arXiv e-print repository

arXiv:2011.02103 [pdf]

COMO: A Pipeline for Multi-Omics Data Integration in Metabolic Modeling and Drug Discovery

Authors: Brandt Bessell, Josh Loecker, Zhongyuan Zhao, Sara Sadat Aghamiri, Sabyasachi Mohanty, Rada Amin, Tomáš Helikar, Bhanwar Lal Puniya

Abstract: Identifying potential drug targets using metabolic modeling requires integrating multiple modeling methods and heterogenous biological datasets, which can be challenging without sophisticated tools. We developed COMO, a user-friendly pipeline that integrates multi-omics data processing, context-specific metabolic model development, simulations, drug databases, and disease data to aid drug discover… ▽ More Identifying potential drug targets using metabolic modeling requires integrating multiple modeling methods and heterogenous biological datasets, which can be challenging without sophisticated tools. We developed COMO, a user-friendly pipeline that integrates multi-omics data processing, context-specific metabolic model development, simulations, drug databases, and disease data to aid drug discovery. COMO can be installed as a Docker image and includes intuitive instructions within a Jupyter Lab environment. It provides a comprehensive solution for multi-omics integration of bulk and single-cell RNA-seq, microarrays, and proteomics to develop context-specific metabolic models. Using public databases, open-source solutions for model construction, and a streamlined approach for predicting repurposable drugs, COMO empowers researchers to investigate low-cost alternatives and novel disease treatments. As a case study, we used the pipeline to construct metabolic models of B cells, which simulate and analyze them to predict 25 and 23 metabolic drug targets for rheumatoid arthritis and systemic lupus erythematosus, respectively. COMO can be used to construct models for any cell or tissue type and identify drugs for any human disease. The pipeline has the potential to improve the health of the global community cost-effectively by providing high-confidence targets to pursue in preclinical and clinical studies. △ Less

Submitted 9 May, 2023; v1 submitted 3 November, 2020; originally announced November 2020.

arXiv:2007.10458 [pdf, other]

doi 10.1101/2020.07.08.194308

i6mA-CNN: a convolution based computational approach towards identification of DNA N6-methyladenine sites in rice genome

Authors: Ruhul Amin, Chowdhury Rafeed Rahman, Md. Sadrul Islam Toaha, Swakkhar Shatabda

Abstract: DNA N6-methylation (6mA) in Adenine nucleotide is a post replication modification and is responsible for many biological functions. Experimental methods for genome wide 6mA site detection is an expensive and manual labour intensive process. Automated and accurate computational methods can help to identify 6mA sites in long genomes saving significant time and money. Our study develops a convolution… ▽ More DNA N6-methylation (6mA) in Adenine nucleotide is a post replication modification and is responsible for many biological functions. Experimental methods for genome wide 6mA site detection is an expensive and manual labour intensive process. Automated and accurate computational methods can help to identify 6mA sites in long genomes saving significant time and money. Our study develops a convolutional neural network based tool i6mA-CNN capable of identifying 6mA sites in the rice genome. Our model coordinates among multiple types of features such as PseAAC inspired customized feature vector, multiple one hot representations and dinucleotide physicochemical properties. It achieves area under the receiver operating characteristic curve of 0.98 with an overall accuracy of 0.94 using 5 fold cross validation on benchmark dataset. Finally, we evaluate our model on two other plant genome 6mA site identification datasets besides rice. Results suggest that our proposed tool is able to generalize its ability of 6mA site identification on plant genomes irrespective of plant species. Web tool for this research can be found at: https://cutt.ly/Co6KuWG. Supplementary data (benchmark dataset, independent test dataset, comparison purpose dataset, trained model, physicochemical property values, attention mechanism details for motif finding) are available at https://cutt.ly/PpDdeDH. △ Less

Submitted 11 August, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

arXiv:1912.10251 [pdf, other]

doi 10.1093/bioinformatics/btaa609

iPromoter-BnCNN: a Novel Branched CNN Based Predictor for Identifying and Classifying Sigma Promoters

Authors: Ruhul Amin, Chowdhury Rafeed Rahman, Md. Habibur Rahman Sifat, Md Nazmul Khan Liton, Md. Moshiur Rahman, Swakkhar Shatabda, Sajid Ahmed

Abstract: Promoter is a short region of DNA which is responsible for initiating transcription of specific genes. Development of computational tools for automatic identification of promoters is in high demand. According to the difference of functions, promoters can be of different types. Promoters may have both intra and inter class variation and similarity in terms of consensus sequences. Accurate classific… ▽ More Promoter is a short region of DNA which is responsible for initiating transcription of specific genes. Development of computational tools for automatic identification of promoters is in high demand. According to the difference of functions, promoters can be of different types. Promoters may have both intra and inter class variation and similarity in terms of consensus sequences. Accurate classification of various types of sigma promoters still remains a challenge. We present iPromoter-BnCNN for identification and accurate classification of six types of promoters - sigma24, sigma28, sigma32, sigma38, sigma54, sigma70. It is a Convolutional Neural Network (CNN) based classifier which combines local features related to monomer nucleotide sequence, trimer nucleotide sequence, dimer structural properties and trimer structural properties through the use of parallel branching. We conducted experiments on a benchmark dataset and compared with two state-of-the-art tools to show our supremacy on 5-fold cross-validation. Moreover, we tested our classifier on an independent test dataset. Our proposed tool iPromoter-BnCNN web server is freely available at http://103.109.52.8/iPromoter-BnCNN. The runnable source code can be found at https://colab.research.google.com/drive/1yWWh7BXhsm8U4PODgPqlQRy23QGjF2DZ. △ Less

Submitted 16 June, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

Showing 1–3 of 3 results for author: Amin, R