-
COMO: A Pipeline for Multi-Omics Data Integration in Metabolic Modeling and Drug Discovery
Authors:
Brandt Bessell,
Josh Loecker,
Zhongyuan Zhao,
Sara Sadat Aghamiri,
Sabyasachi Mohanty,
Rada Amin,
Tomáš Helikar,
Bhanwar Lal Puniya
Abstract:
Identifying potential drug targets using metabolic modeling requires integrating multiple modeling methods and heterogenous biological datasets, which can be challenging without sophisticated tools. We developed COMO, a user-friendly pipeline that integrates multi-omics data processing, context-specific metabolic model development, simulations, drug databases, and disease data to aid drug discover…
▽ More
Identifying potential drug targets using metabolic modeling requires integrating multiple modeling methods and heterogenous biological datasets, which can be challenging without sophisticated tools. We developed COMO, a user-friendly pipeline that integrates multi-omics data processing, context-specific metabolic model development, simulations, drug databases, and disease data to aid drug discovery. COMO can be installed as a Docker image and includes intuitive instructions within a Jupyter Lab environment. It provides a comprehensive solution for multi-omics integration of bulk and single-cell RNA-seq, microarrays, and proteomics to develop context-specific metabolic models. Using public databases, open-source solutions for model construction, and a streamlined approach for predicting repurposable drugs, COMO empowers researchers to investigate low-cost alternatives and novel disease treatments. As a case study, we used the pipeline to construct metabolic models of B cells, which simulate and analyze them to predict 25 and 23 metabolic drug targets for rheumatoid arthritis and systemic lupus erythematosus, respectively. COMO can be used to construct models for any cell or tissue type and identify drugs for any human disease. The pipeline has the potential to improve the health of the global community cost-effectively by providing high-confidence targets to pursue in preclinical and clinical studies.
△ Less
Submitted 9 May, 2023; v1 submitted 3 November, 2020;
originally announced November 2020.
-
i6mA-CNN: a convolution based computational approach towards identification of DNA N6-methyladenine sites in rice genome
Authors:
Ruhul Amin,
Chowdhury Rafeed Rahman,
Md. Sadrul Islam Toaha,
Swakkhar Shatabda
Abstract:
DNA N6-methylation (6mA) in Adenine nucleotide is a post replication modification and is responsible for many biological functions. Experimental methods for genome wide 6mA site detection is an expensive and manual labour intensive process. Automated and accurate computational methods can help to identify 6mA sites in long genomes saving significant time and money. Our study develops a convolution…
▽ More
DNA N6-methylation (6mA) in Adenine nucleotide is a post replication modification and is responsible for many biological functions. Experimental methods for genome wide 6mA site detection is an expensive and manual labour intensive process. Automated and accurate computational methods can help to identify 6mA sites in long genomes saving significant time and money. Our study develops a convolutional neural network based tool i6mA-CNN capable of identifying 6mA sites in the rice genome. Our model coordinates among multiple types of features such as PseAAC inspired customized feature vector, multiple one hot representations and dinucleotide physicochemical properties. It achieves area under the receiver operating characteristic curve of 0.98 with an overall accuracy of 0.94 using 5 fold cross validation on benchmark dataset. Finally, we evaluate our model on two other plant genome 6mA site identification datasets besides rice. Results suggest that our proposed tool is able to generalize its ability of 6mA site identification on plant genomes irrespective of plant species. Web tool for this research can be found at: https://cutt.ly/Co6KuWG. Supplementary data (benchmark dataset, independent test dataset, comparison purpose dataset, trained model, physicochemical property values, attention mechanism details for motif finding) are available at https://cutt.ly/PpDdeDH.
△ Less
Submitted 11 August, 2020; v1 submitted 20 July, 2020;
originally announced July 2020.
-
iPromoter-BnCNN: a Novel Branched CNN Based Predictor for Identifying and Classifying Sigma Promoters
Authors:
Ruhul Amin,
Chowdhury Rafeed Rahman,
Md. Habibur Rahman Sifat,
Md Nazmul Khan Liton,
Md. Moshiur Rahman,
Swakkhar Shatabda,
Sajid Ahmed
Abstract:
Promoter is a short region of DNA which is responsible for initiating transcription of specific genes. Development of computational tools for automatic identification of promoters is in high demand. According to the difference of functions, promoters can be of different types. Promoters may have both intra and inter class variation and similarity in terms of consensus sequences. Accurate classific…
▽ More
Promoter is a short region of DNA which is responsible for initiating transcription of specific genes. Development of computational tools for automatic identification of promoters is in high demand. According to the difference of functions, promoters can be of different types. Promoters may have both intra and inter class variation and similarity in terms of consensus sequences. Accurate classification of various types of sigma promoters still remains a challenge. We present iPromoter-BnCNN for identification and accurate classification of six types of promoters - sigma24, sigma28, sigma32, sigma38, sigma54, sigma70. It is a Convolutional Neural Network (CNN) based classifier which combines local features related to monomer nucleotide sequence, trimer nucleotide sequence, dimer structural properties and trimer structural properties through the use of parallel branching. We conducted experiments on a benchmark dataset and compared with two state-of-the-art tools to show our supremacy on 5-fold cross-validation. Moreover, we tested our classifier on an independent test dataset. Our proposed tool iPromoter-BnCNN web server is freely available at http://103.109.52.8/iPromoter-BnCNN. The runnable source code can be found at https://colab.research.google.com/drive/1yWWh7BXhsm8U4PODgPqlQRy23QGjF2DZ.
△ Less
Submitted 16 June, 2020; v1 submitted 21 December, 2019;
originally announced December 2019.