-
Towards quantum-enabled cell-centric therapeutics
Authors:
Saugata Basu,
Jannis Born,
Aritra Bose,
Sara Capponi,
Dimitra Chalkia,
Timothy A Chan,
Hakan Doga,
Frederik F. Flother,
Gad Getz,
Mark Goldsmith,
Tanvi Gujarati,
Aldo Guzman-Saenz,
Dimitrios Iliopoulos,
Gavin O. Jones,
Stefan Knecht,
Dhiraj Madan,
Sabrina Maniscalco,
Nicola Mariella,
Joseph A. Morrone,
Khadijeh Najafi,
Pushpak Pati,
Daniel Platt,
Maria Anna Rapsomaniki,
Anupama Ray,
Kahn Rhrissorrakrai
, et al. (8 additional authors not shown)
Abstract:
In recent years, there has been tremendous progress in the development of quantum computing hardware, algorithms and services leading to the expectation that in the near future quantum computers will be capable of performing simulations for natural science applications, operations research, and machine learning at scales mostly inaccessible to classical computers. Whereas the impact of quantum com…
▽ More
In recent years, there has been tremendous progress in the development of quantum computing hardware, algorithms and services leading to the expectation that in the near future quantum computers will be capable of performing simulations for natural science applications, operations research, and machine learning at scales mostly inaccessible to classical computers. Whereas the impact of quantum computing has already started to be recognized in fields such as cryptanalysis, natural science simulations, and optimization among others, very little is known about the full potential of quantum computing simulations and machine learning in the realm of healthcare and life science (HCLS). Herein, we discuss the transformational changes we expect from the use of quantum computation for HCLS research, more specifically in the field of cell-centric therapeutics. Moreover, we identify and elaborate open problems in cell engineering, tissue modeling, perturbation modeling, and bio-topology while discussing candidate quantum algorithms for research on these topics and their potential advantages over classical computational approaches.
△ Less
Submitted 1 August, 2023; v1 submitted 11 July, 2023;
originally announced July 2023.
-
An exact method to compute a $p$-value for the beyond-pairwise correlations among cancer gene mutations
Authors:
Jaegil Kim,
Atanas Kamburov,
Michal Lawrence,
Yosef Maruvka,
Gad Getz
Abstract:
The increasing observation of mutual exclusivity correlations among cancer gene mutations is a key component for identifying driver events or pathways in cancer genome analysis. Here we report a rigorous statistical method to compute an exact $p$-value for the beyond-pairwise mutual exclusivity or co-occurrence relationships among cancer gene mutations by enumerating a null distribution of overlap…
▽ More
The increasing observation of mutual exclusivity correlations among cancer gene mutations is a key component for identifying driver events or pathways in cancer genome analysis. Here we report a rigorous statistical method to compute an exact $p$-value for the beyond-pairwise mutual exclusivity or co-occurrence relationships among cancer gene mutations by enumerating a null distribution of overlapping mutations across more than two genes. The validity and the advantage of our method is explicitly demonstrated in both cancer gene mutations and simulation data through the comparison to the permutation test.
△ Less
Submitted 18 May, 2015;
originally announced May 2015.
-
High-order chromatin architecture determines the landscape of chromosomal alterations in cancer
Authors:
Geoff Fudenberg,
Gad Getz,
Matthew Meyerson,
Leonid Mirny
Abstract:
The rapid growth of cancer genome structural information provides an opportunity for a better understanding of the mutational mechanisms of genomic alterations in cancer and the forces of selection that act upon them. Here we test the evidence for two major forces, spatial chromosome structure and purifying (or negative) selection, that shape the landscape of somatic copy-number alterations (SCNAs…
▽ More
The rapid growth of cancer genome structural information provides an opportunity for a better understanding of the mutational mechanisms of genomic alterations in cancer and the forces of selection that act upon them. Here we test the evidence for two major forces, spatial chromosome structure and purifying (or negative) selection, that shape the landscape of somatic copy-number alterations (SCNAs) in cancer1. Using a maximum likelihood framework we compare SCNA maps and three-dimensional genome architecture as determined by genome-wide chromosome conformation capture (HiC) and described by the proposed fractal-globule (FG) model2. This analysis provides evidence that the distribution of chromosomal alterations in cancer is spatially related to three-dimensional genomic architecture and additionally suggests that purifying selection as well as positive selection shapes the landscape of SCNAs during somatic evolution of cancer cells.
△ Less
Submitted 6 September, 2011;
originally announced September 2011.
-
Gene expression analysis reveals a strong signature of an interferon induced pathway in childhood lymphoblastic leukemia as well as in breast and ovarian cancer
Authors:
Uri Einav,
Yuval Tabach,
Gad Getz,
Assif Yitzhaky,
Ugur Ozbek,
Ninette Amariglio,
Shai Izraeli,
Gideon Rechavi,
Eytan Domany
Abstract:
On the basis of epidemiological studies, infection was suggested to play a role in the etiology of human cancer. While for some cancers such a role was indeed demonstrated, there is no direct biological support for the role of viral pathogens in the pathogenesis of childhood leukemia. Using a novel bioinformatic tool, that alternates between clustering and standard statistical methods of analysi…
▽ More
On the basis of epidemiological studies, infection was suggested to play a role in the etiology of human cancer. While for some cancers such a role was indeed demonstrated, there is no direct biological support for the role of viral pathogens in the pathogenesis of childhood leukemia. Using a novel bioinformatic tool, that alternates between clustering and standard statistical methods of analysis, we performed a "double blind" search of published gene expression data of subjects with different childhood ALL subtypes, looking for unanticipated partitions of patients, induced by unexpected groups of genes with correlated expression. We discovered a group of about thirty genes, related to the interferon response pathway, whose expression levels divide the ALL samples into two subgroups; high in 50, low in 285 patients. Leukemic subclasses prevalent in early childhood (the age most susceptible to infection) are over-represented in the high expression subgroup. Similar partitions, induced by the same genes, were found also in breast and ovarian cancer but not in lung cancer, prostate cancer and lymphoma. About 40% of breast cancer samples expressed the "interferon- related" signature. It is of interested that several studies demonstrated MMTV-like sequences in about 40% of breast cancer samples. Our discovery of an unanticipated strong signature of an interferon induced pathway provides molecular support for a role for either inflammation or viral infection in the pathogenesis of childhood leukemia as well as breast and ovarian cancer.
△ Less
Submitted 14 November, 2005;
originally announced November 2005.
-
Expression profiles of acute lymphoblastic and myeloblastic leukemias with ALL-1 rearrangements
Authors:
T. Rozovskaia,
O. Ravid-Amir,
S. Tillib,
G. Getz,
E. Feinstein,
H. Agrawal,
A. Nagler,
E. Rappeport,
I. Issaeva,
Y. Matsuo,
U. R. Kees,
T. Lapidot,
F. Lo Coco,
R. Foa,
A. Mazo,
T. Nakamura,
C. M. Croce,
G. Cimino,
E. Domany,
E. Canaani
Abstract:
The ALL-1 gene is directly involved in 5-10% of ALLs and AMLs by fusion to other genes or through internal rearrangements. DNA microarrays were utilized to determine expression profiles of ALLs and AMLs with ALL-1 rearrangements. These profiles distinguish those tumors from other ALLs and AMLs. The expression patterns of ALL-1-associated tumors, in particular ALLs, involve oncogenes, tumor suppr…
▽ More
The ALL-1 gene is directly involved in 5-10% of ALLs and AMLs by fusion to other genes or through internal rearrangements. DNA microarrays were utilized to determine expression profiles of ALLs and AMLs with ALL-1 rearrangements. These profiles distinguish those tumors from other ALLs and AMLs. The expression patterns of ALL-1-associated tumors, in particular ALLs, involve oncogenes, tumor suppressors, anti apoptotic genes, drug resistance genes etc., and correlate with the aggressive nature of the tumors. The genes whose expression differentiates between ALLs with and without ALL-1 rearrangement were further divided into several groups enabling separation of ALL-1- associated ALLs into two subclasses. Further, AMLs with partial duplication of ALL-1 vary in their expression pattern from AMLs in which ALL-1 had undergone fusion to other genes. The extensive analysis described here draws attention to genes which might have a direct role in pathogenesis.
△ Less
Submitted 14 November, 2005;
originally announced November 2005.
-
FSSP to SCOP and CATH (F2CS) Prediction Server
Authors:
Gad Getz,
Alina Starovolsky,
Eytan Domany
Abstract:
Summary: The F2CS server provides access to the software, F2CS2.00, that implements an automated prediction method of SCOP and CATH classifications of proteins, based on their FSSP Z-scores (Getz et al., 2002), Availability: Free, at http://www.weizmann.ac.il/physics/complex/compphys/f2cs/. Contact: [email protected] Supplementary information: The site contains links to additional figu…
▽ More
Summary: The F2CS server provides access to the software, F2CS2.00, that implements an automated prediction method of SCOP and CATH classifications of proteins, based on their FSSP Z-scores (Getz et al., 2002), Availability: Free, at http://www.weizmann.ac.il/physics/complex/compphys/f2cs/. Contact: [email protected] Supplementary information: The site contains links to additional figures and tables.
△ Less
Submitted 15 September, 2004;
originally announced September 2004.
-
Outcome signature genes in breast cancer: is there a unique set?
Authors:
Liat Ein-Dor,
Itai Kela,
Gad Getz,
David Givol,
Eytan Domany
Abstract:
Motivation: Predicting the metastatic potential of primary malignant tissues has direct bearing on choice of therapy. Several microarray studies yielded gene sets whose expression profiles successfully predicted survival (Ramaswamy et al 2003; Sorlie et al 2001; van't Veer et al 2003). Nevertheless, the overlap between these gene sets is almost zero. Such small overlaps were observed also in oth…
▽ More
Motivation: Predicting the metastatic potential of primary malignant tissues has direct bearing on choice of therapy. Several microarray studies yielded gene sets whose expression profiles successfully predicted survival (Ramaswamy et al 2003; Sorlie et al 2001; van't Veer et al 2003). Nevertheless, the overlap between these gene sets is almost zero. Such small overlaps were observed also in other complex diseases (Lossos et al 2003; Miklos and Maleszka 2004), and the variables that could account for the differences had evoked a wide interest. One of the main open questions in this context is whether the disparity can be attributed only to trivial reasons such as different technologies, different patients and different types of analysis. Results: To answer this question we concentrated on one single breast cancer dataset, and analyzed it by one single method, the one which was used by van't Veer et al to produce a set of outcome predictive genes. We showed that in fact the resulting set of genes is not unique; it is strongly influenced by the subset of patients used for gene selection. Many equally predictive lists could have been produced from the same analysis. Three main properties of the data explain this sensitivity: (a) many genes are correlated with survival; (b) the differences between these correlations are small; (c) the correlations fluctuate strongly when measured over different subsets of patients. A possible biological explanation for these properties is discussed.
△ Less
Submitted 14 September, 2004;
originally announced September 2004.
-
Design Principle of Gene Expression Used by Human Stem Cells; Implication for Pluripotency
Authors:
Michal Golan-Mashiach,
Jean-Eudes Dazard,
Sharon Gerecht-Nir,
Ninette Amariglio,
Tamar Fisher,
Jasmine Jacob-Hirsch,
Bella Bielorai,
Sivan Osenberg,
Omer Barad,
Gad Getz,
Amos Toren,
Gideon Rechavi,
Joseph Eldor-Itskovitz,
Eytan Domany,
David Givol
Abstract:
Human embryonic stem cells (ESC) are undifferentiated and are endowed with the capacities of self renewal and pluripotential differentiation. Adult stem cells renew their own tissue, but whether they can trans-differentiate to other tissues is still controversial. To understand the genetic program that underlies the pluripotency of stem cells, we compared the transcription profile of ESC with th…
▽ More
Human embryonic stem cells (ESC) are undifferentiated and are endowed with the capacities of self renewal and pluripotential differentiation. Adult stem cells renew their own tissue, but whether they can trans-differentiate to other tissues is still controversial. To understand the genetic program that underlies the pluripotency of stem cells, we compared the transcription profile of ESC with that of progenitor/stem cells of human hematopoietic and keratinocytic origins, along with their mature cells to be viewed as snapshots along tissue differentiation. ESC gene profile show higher complexity with significantly more highly expressed genes than adult cells. We hypothesize that ESC use a strategy of expressing genes that represent various differentiation pathways and selection of only a few for continuous expression upon differentiation to a particular target. Such a strategy may be necessary for the pluripotency of ESC. The progenitors of either hematopoietic or keratinocytic cells also follow the same design principle. Using advanced clustering, we show that many of the ESC expressed genes are turned off in the progenitors/stem cells followed by a further downregulation in adult tissues. Concomitantly, genes specific to the target tissue are upregulated towards matured cells of skin or blood.
△ Less
Submitted 15 September, 2004;
originally announced September 2004.
-
Coupled Two-Way Clustering Analysis of Breast Cancer and Colon Cancer Gene Expression Data
Authors:
Gad Getz,
Hilah Gal,
Itai Kela,
Eytan Domany,
Dan A. Notterman
Abstract:
We present and review Coupled Two Way Clustering, a method designed to mine gene expression data. The method identifies submatrices of the total expression matrix, whose clustering analysis reveals partitions of samples (and genes) into biologically relevant classes. We demonstrate, on data from colon and breast cancer, that we are able to identify partitions that elude standard clustering analy…
▽ More
We present and review Coupled Two Way Clustering, a method designed to mine gene expression data. The method identifies submatrices of the total expression matrix, whose clustering analysis reveals partitions of samples (and genes) into biologically relevant classes. We demonstrate, on data from colon and breast cancer, that we are able to identify partitions that elude standard clustering analysis.
△ Less
Submitted 17 June, 2002;
originally announced June 2002.
-
Automated assignment of SCOP and CATH protein structure classification from FSSP scores
Authors:
Gad Getz,
Michele Vendruscolo,
David Sachs,
Eytan Domany
Abstract:
We present an automated procedure to assign CATH and SCOP classifications to proteins whose FSSP score is available. CATH classification is assigned down to the topology level and SCOP classification to the fold level. As the FSSP database is updated weekly, this method makes it possible to update also CATH and SCOP with the same frequency. Our predictions have a nearly perfect success rate when…
▽ More
We present an automated procedure to assign CATH and SCOP classifications to proteins whose FSSP score is available. CATH classification is assigned down to the topology level and SCOP classification to the fold level. As the FSSP database is updated weekly, this method makes it possible to update also CATH and SCOP with the same frequency. Our predictions have a nearly perfect success rate when ambiguous cases are discarded. These ambiguous cases are intrinsic in any protein structure classification, which relies on structural information alone. Hence, we introduce the notion of ``twilight zone for structure classification''. We further suggest that in order to resolve these ambiguous cases other criteria of classification, based also on information about sequence and function, must be used.
△ Less
Submitted 3 September, 2001; v1 submitted 15 February, 2001;
originally announced February 2001.
-
Coupled Two-Way Clustering Analysis of Gene Microarray Data
Authors:
G. Getz,
E. Levine,
E. Domany
Abstract:
We present a novel coupled two-way clustering approach to gene microarray data analysis. The main idea is to identify subsets of the genes and samples, such that when one of these is used to cluster the other, stable and significant partitions emerge. The search for such subsets is a computationally complex task: we present an algorithm, based on iterative clustering, which performs such a searc…
▽ More
We present a novel coupled two-way clustering approach to gene microarray data analysis. The main idea is to identify subsets of the genes and samples, such that when one of these is used to cluster the other, stable and significant partitions emerge. The search for such subsets is a computationally complex task: we present an algorithm, based on iterative clustering, which performs such a search. This analysis is especially suitable for gene microarray data, where the contributions of a variety of biological mechanisms to the gene expression levels are entangled in a large body of experimental data. The method was applied to two gene microarray data sets, on colon cancer and leukemia. By identifying relevant subsets of the data and focusing on them we were able to discover partitions and correlations that were masked and hidden when the full dataset was used in the analysis. Some of these partitions have clear biological interpretation; others can serve to identify possible directions for future research.
△ Less
Submitted 4 April, 2000;
originally announced April 2000.
-
Super-paramagnetic clustering of yeast gene expression profiles
Authors:
G. Getz,
E. Levine,
E. Domany,
M. Q. Zhang
Abstract:
High-density DNA arrays, used to monitor gene expression at a genomic scale, have produced vast amounts of information which require the development of efficient computational methods to analyze them. The important first step is to extract the fundamental patterns of gene expression inherent in the data. This paper describes the application of a novel clustering algorithm, Super-Paramagnetic Clu…
▽ More
High-density DNA arrays, used to monitor gene expression at a genomic scale, have produced vast amounts of information which require the development of efficient computational methods to analyze them. The important first step is to extract the fundamental patterns of gene expression inherent in the data. This paper describes the application of a novel clustering algorithm, Super-Paramagnetic Clustering (SPC) to analysis of gene expression profiles that were generated recently during a study of the yeast cell cycle. SPC was used to organize genes into biologically relevant clusters that are suggestive for their co-regulation. Some of the advantages of SPC are its robustness against noise and initialization, a clear signature of cluster formation and splitting, and an unsupervised self-organized determination of the number of clusters at each resolution. Our analysis revealed interesting correlated behavior of several groups of genes which has not been previously identified.
△ Less
Submitted 17 November, 1999;
originally announced November 1999.