-
Translocatome: a novel resource for the analysis of protein translocation between cellular organelles
Authors:
Peter Mendik,
Levente Dobronyi,
Ferenc Hari,
Csaba Kerepesi,
Leonardo Maia-Moco,
Donat Buszlai,
Peter Csermely,
Daniel V. Veres
Abstract:
Here we present Translocatome, the first dedicated database of human translocating proteins. The core of the Translocatome database is the manually curated data set of 213 human translocating proteins listing the source of their experimental validation, several details of their translocation mechanism, their local compartmentalized interactome, as well as their involvement in signalling pathways a…
▽ More
Here we present Translocatome, the first dedicated database of human translocating proteins. The core of the Translocatome database is the manually curated data set of 213 human translocating proteins listing the source of their experimental validation, several details of their translocation mechanism, their local compartmentalized interactome, as well as their involvement in signalling pathways and disease development. In addition, using the well-established and widely used gradient boosting machine learning tool, XGBoost, Translocatome provides translocation probability values for 13,066 human proteins identifying 1133 and 3268 high- and low-confidence translocating proteins, respectively. The database has user-friendly search options with a UniProt autocomplete quick search and advanced search for proteins filtered by their localization, UniProt identifiers, translocation likelihood or data complexity. Download options of search results, manually curated and predicted translocating protein sets are available on its website. The update of the database is helped by its manual curation framework and connection to the previously published ComPPI compartmentalized protein-protein interaction database. As shown by the application examples of merlin (NF2) and tumor protein 63 (TP63) Translocatome allows a better comprehension of protein translocation as a systems biology phenomenon and can be used as a discovery-tool in the protein translocation field. The database is available here: http://translocatome.linkgroup.hu
△ Less
Submitted 15 January, 2019; v1 submitted 15 October, 2018;
originally announced October 2018.
-
The braingraph.org Database of High Resolution Structural Connectomes and the Brain Graph Tools
Authors:
Csaba Kerepesi,
Balazs Szalkai,
Balint Varga,
Vince Grolmusz
Abstract:
Based on the data of the NIH-funded Human Connectome Project, we have computed structural connectomes of 426 human subjects in five different resolutions of 83, 129, 234, 463 and 1015 nodes and several edge weights. The graphs are given in anatomically annotated GraphML format that facilitates better further processing and visualization. For 96 subjects, the anatomically classified sub-graphs can…
▽ More
Based on the data of the NIH-funded Human Connectome Project, we have computed structural connectomes of 426 human subjects in five different resolutions of 83, 129, 234, 463 and 1015 nodes and several edge weights. The graphs are given in anatomically annotated GraphML format that facilitates better further processing and visualization. For 96 subjects, the anatomically classified sub-graphs can also be accessed, formed from the vertices corresponding to distinct lobes or even smaller regions of interests of the brain. For example, one can easily download and study the connectomes, restricted to the frontal lobes or just to the left precuneus of 96 subjects using the data. Partially directed connectomes of 423 subjects are also available for download. We also present a GitHub-deposited set of tools, called the Brain Graph Tools, for several processing tasks of the connectomes on the site \url{http://braingraph.org}.
△ Less
Submitted 6 October, 2016;
originally announced October 2016.
-
High-Resolution Directed Human Connectomes and the Consensus Connectome Dynamics
Authors:
Balázs Szalkai,
Csaba Kerepesi,
Bálint Varga,
Vince Grolmusz
Abstract:
Here we show a method of directing the edges of the connectomes, prepared from diffusion tensor imaging (DTI) datasets from the human brain. Before the present work, no high-definition directed braingraphs (or connectomes) were published, because the tractography methods in use are not capable of assigning directions to the neural tracts discovered. Previous work on the functional connectomes appl…
▽ More
Here we show a method of directing the edges of the connectomes, prepared from diffusion tensor imaging (DTI) datasets from the human brain. Before the present work, no high-definition directed braingraphs (or connectomes) were published, because the tractography methods in use are not capable of assigning directions to the neural tracts discovered. Previous work on the functional connectomes applied low-resolution functional MRI-detected statistical causality for the assignment of directions of connectomes of typically several dozens of vertices. Our method is based on the phenomenon of the "Consensus Connectome Dynamics" (CCD), described earlier by our research group. In this contribution, we apply the method to the 423 braingraphs, each with 1015 vertices, computed from the public release of the Human Connectome Project, and we also made the directed connectomes publicly available at the site \url{http://braingraph.org}. We also show the robustness of our edge directing method in four independently chosen connectome datasets: we have found that 86\% of the edges, which were present in all four datasets, get the very same directions in all datasets; therefore the direction method is robust, it does not depend on the particular choice of the dataset. We think that our present contribution opens up new possibilities in the analysis of the high-definition human connectome: from now on we can work with a robust assignment of directions of the connections of the human brain.
△ Less
Submitted 28 September, 2016;
originally announced September 2016.
-
The Dorsal Striatum and the Dynamics of the Consensus Connectomes in the Frontal Lobe of the Human Brain
Authors:
Csaba Kerepesi,
Balint Varga,
Balazs Szalkai,
Vince Grolmusz
Abstract:
In the applications of the graph theory it is unusual that one considers numerous, pairwise different graphs on the very same set of vertices. In the case of human braingraphs or connectomes, however, this is the standard situation: the nodes correspond to anatomically identified cerebral regions, and two vertices are connected by an edge if a diffusion MRI-based workflow identifies a fiber of axo…
▽ More
In the applications of the graph theory it is unusual that one considers numerous, pairwise different graphs on the very same set of vertices. In the case of human braingraphs or connectomes, however, this is the standard situation: the nodes correspond to anatomically identified cerebral regions, and two vertices are connected by an edge if a diffusion MRI-based workflow identifies a fiber of axons, running between the two regions, corresponding to the two vertices. Therefore, if we examine the braingraphs of $n$ subjects, then we have $n$ graphs on the very same, anatomically identified vertex set. It is a natural idea to describe the $k$-frequently appearing edges in these graphs: the edges that are present between the same two vertices in at least $k$ out of the $n$ graphs. Based on the NIH-funded large Human Connectome Project's public data release, we have reported the construction of the Budapest Reference Connectome Server \url{http://connectome.pitgroup.org} that generates and visualizes these $k$-frequently appearing edges. We call the graphs of the $k$-frequently appearing edges "$k$-consensus connectomes" since an edge could be included only if it is present in at least $k$ graphs out of $n$. Considering the whole human brain, we have reported a surprising property of these consensus connectomes earlier. In the present work we are focusing on the frontal lobe of the brain, and we report here a similarly surprising dynamical property of the consensus connectomes when $k$ is gradually changed from $k=n$ to $k=1$: the connections between the nodes of the frontal lobe are seemingly emanating from those nodes that were connected to sub-cortical structures of the dorsal striatum: the caudate nucleus, and the putamen. We hypothesize that this dynamic behavior copies the axonal fiber development of the frontal lobe.
△ Less
Submitted 4 May, 2016;
originally announced May 2016.
-
Parameterizable Consensus Connectomes from the Human Connectome Project: The Budapest Reference Connectome Server v3.0
Authors:
Balázs Szalkai,
Csaba Kerepesi,
Bálint Varga,
Vince Grolmusz
Abstract:
Connections of the living human brain, on a macroscopic scale, can be mapped by a diffusion MR imaging based workflow. Since the same anatomic regions can be corresponded between distinct brains, one can compare the presence or the absence of the edges, connecting the very same two anatomic regions, among multiple cortices. Previously, we have constructed the consensus braingraphs on 1015 vertices…
▽ More
Connections of the living human brain, on a macroscopic scale, can be mapped by a diffusion MR imaging based workflow. Since the same anatomic regions can be corresponded between distinct brains, one can compare the presence or the absence of the edges, connecting the very same two anatomic regions, among multiple cortices. Previously, we have constructed the consensus braingraphs on 1015 vertices first in five, then in 96 subjects in the Budapest Reference Connectome Server v1.0 and v2.0, respectively. Here we report the construction of the version 3.0 of the server, generating the common edges of the connectomes of variously parameterizable subsets of the 1015-vertex connectomes of 477 subjects of the Human Connectome Project's 500-subject release. The consensus connectomes are downloadable in csv and GraphML formats, and they are also visualized on the server's page. The consensus connectomes of the server can be considered as the "average, healthy" human connectome since all of their connections are present in at least $k$ subjects, where the default value of $k=209$, but it can also be modified freely at the web server. The webserver is available at \url{http://connectome.pitgroup.org}.
△ Less
Submitted 15 February, 2016;
originally announced February 2016.
-
The fallacy of tumor immunology: Evolutionary pressures, viruses as nature's genetic engineering tools and T cell surveillance emergence for purging nascent selfish cells
Authors:
Tibor Bakacs,
Katalin Kristof,
Jitendra Mehrishi,
Tamas Szabados,
Csaba Kerepesi,
Enikoe Regoes,
Gabor Tusnady
Abstract:
The US and Hungarian statistical records of the years 1900 and 1896, respectively, before the dramatic medical advances, show 32% and 27% deaths attributable to infections, whereas only 5% and 2% due to cancer. These data can be interpreted to mean that (i) the immune system evolved for purging nascent selfish cells, which establish natural chimerism littering the soma and the germline by conspeci…
▽ More
The US and Hungarian statistical records of the years 1900 and 1896, respectively, before the dramatic medical advances, show 32% and 27% deaths attributable to infections, whereas only 5% and 2% due to cancer. These data can be interpreted to mean that (i) the immune system evolved for purging nascent selfish cells, which establish natural chimerism littering the soma and the germline by conspecific alien cells and (ii) defense against pathogens that represent xenogeneic aliens appeared later in evolution.
`Liberating' T cells from the semantic trap of immunity and the shackles of the `two-signal' model of T cell activation, we point out theoretical grounds that the immune response to cancer is conceptually different from the immune response to infection. We argue for a one-signal model (with stochastic influences) as the explanation for T cell activation in preference to the widely accepted two-signal model of co-stimulation. Convincing evidence for our one-signal model emerged from the widespread autoimmune adverse events in 64.2% of advanced melanoma patients treated with the anti-CTLA-4 antibody (ipilimumab) that blocks an immune checkpoint. Harnessing the unleashed autoimmune power of T cells could be rewarding to defeat cancer. Assuming that immunization against isogeneic tumors also would be effective is a fallacy.
△ Less
Submitted 2 March, 2016; v1 submitted 11 January, 2016;
originally announced January 2016.
-
How to Direct the Edges of the Connectomes: Dynamics of the Consensus Connectomes and the Development of the Connections in the Human Brain
Authors:
Csaba Kerepesi,
Balázs Szalkai,
Bálint Varga,
Vince Grolmusz
Abstract:
The human connectome is the object of an intensive research today. In these graphs, the vertices correspond to the small areas of the gray matter, and two vertices are connected by an edge, if a diffusion-MRI based workflow finds connections between those areas. One main question of the field is discovering the directions of the edges. In a previous work we have reported the construction of the Bu…
▽ More
The human connectome is the object of an intensive research today. In these graphs, the vertices correspond to the small areas of the gray matter, and two vertices are connected by an edge, if a diffusion-MRI based workflow finds connections between those areas. One main question of the field is discovering the directions of the edges. In a previous work we have reported the construction of the Budapest Reference Connectome Server http://connectome.pitgroup.org from the data recorded in the Human Connectome Project of the NIH. After the server had been published, we recognized a surprising and unforeseen property of it: The server can generate the braingraph of connections that are present in at least $k$ graphs out of the 418, for any value of $k=1,2,...,418$. When the value of $k$ is changed from $k=418$ through 1 by moving a slider at the webserver from right to left, more and more edges appear in the consensus graph. The astonishing observation is that the appearance of the new edges is not random: it is similar to a growing tree. We hypothesize that this movement of the slider in the webserver may copy the development of the connections in the human brain in the following sense: the connections that are present in all subjects are the oldest ones, and those that are present in a decreasing fraction of subjects are gradually the newer connections in the individual brain development. An animation on the phenomenon is available at https://youtu.be/EnWwIf_HNjw. Based on this hypothesis, we can assign directions to the edges of the connectome as follows: Let $G_i$ denote the consensus connectome where each edge is present in at least $i$ graphs. Suppose that vertex $v$ is isolated in $G_{k+1}$, and becomes connected to a vertex $u$ in $G_k$, where $u$ was connected to other vertices already in $G_{k+1}$. Then we direct this $(v,u)$ edge from $v$ to $u$.
△ Less
Submitted 13 March, 2016; v1 submitted 18 September, 2015;
originally announced September 2015.
-
Life without dUTPase
Authors:
Csaba Kerepesi,
Judit E. Szabó,
Vince Grolmusz,
Beáta G. Vértessy
Abstract:
Fine-tuned regulation of the cellular nucleotide pools is indispensable for faithful replication of DNA. The genetic information is also safeguarded by DNA damage recognition and repair processes. Uracil is one of the most frequently occurring erroneous base in DNA; it can arise from cytosine deamination or thymine-replacing incorporation. Two enzyme families are primarily involved in keeping DNA…
▽ More
Fine-tuned regulation of the cellular nucleotide pools is indispensable for faithful replication of DNA. The genetic information is also safeguarded by DNA damage recognition and repair processes. Uracil is one of the most frequently occurring erroneous base in DNA; it can arise from cytosine deamination or thymine-replacing incorporation. Two enzyme families are primarily involved in keeping DNA uracil-free: dUTPases that prevent thymine-replacing incorporation and uracil-DNA glycosylases that excise uracil from DNA and initiate uracil-excision repair. Both dUTPase and the most efficient uracil-DNA glycosylase UNG is thought to be ubiquitous in free-living organisms. In the present work, we have systematically investigated the genotype of deposited fully sequenced bacterial and Archaeal genomes. Surprisingly, we have found that in contrast to the generally held opinion, a wide number of bacterial and Archaeal species lack the dUTPase gene(s). The dut- genotype is present in diverse bacterial phyla indicating that loss of this (or these) gene(s) has occurred multiple times during evolution. We have identified several survival strategies in lack of dUTPases: i) simultaneous lack or inhibition of UNG, ii) acquisition of a less dUTP-specific sanitizing nucleotide pyrophosphatase, and iii) supply of dUTPase from bacteriophages. Our data indicate that several unicellular microorganisms may efficiently cope with a dut- genotype potentially leading to an unusual uracil-enrichment in their genomic DNA.
△ Less
Submitted 16 September, 2015;
originally announced September 2015.
-
MiStImm: a simulation tool to compare classical nonsef-centered immune models with a novel self-centered model
Authors:
Tamás Szabados,
Csaba Kerepesi,
Tibor Bakács
Abstract:
Our main purpose is to compare classical nonself-centered, two-signal theoretical models of the adaptive immune system with a novel, self-centered, one-signal model developed by our research group. Our model hypothesizes that the immune system of a fetus is capable learning the limited set of self antigens but unable to prepare itself for the unlimited variety of nonself antigens. We have built a…
▽ More
Our main purpose is to compare classical nonself-centered, two-signal theoretical models of the adaptive immune system with a novel, self-centered, one-signal model developed by our research group. Our model hypothesizes that the immune system of a fetus is capable learning the limited set of self antigens but unable to prepare itself for the unlimited variety of nonself antigens. We have built a computational model that simulates the development of the adaptive immune system. For simplicity, we concentrated on humoral immunity and its major components: T cells, B cells, antibodies, interleukins, non-immune self cells, and foreign antigens. Our model is a microscopic one, similar to the interacting particle models of statistical physics and agent-based models in immunology. Furthermore, our model is stochastic: events are considered random and modeled by a continuous time, finite state Markov process, that is, they are controlled by finitely many independent exponential clocks.
We investigate under what conditions can an immune memory be created that results in a more effective immune response to a repeated infection. The simulations show that our self-centered model is realistic. Moreover, in case of a primary adaptive immune reaction, it can destroy infections more efficiently than a classical nonself-centered model.
Predictions of our theoretical model were clinically supported by autoimmune-related adverse events in high-dose immune checkpoint inhibitor immunotherapy trials and also by safe and successful low-dose immune checkpoint inhibitor combination treatment of heavily pretreated stage IV cancer patients who had exhausted all conventional treatments.
The MiStImm simulation tool and source codes are available at the address https://github.com/kerepesi/MiStImm.
△ Less
Submitted 31 March, 2018; v1 submitted 3 July, 2015;
originally announced July 2015.
-
Comparative Connectomics: Mapping the Inter-Individual Variability of Connections within the Regions of the Human Brain
Authors:
Csaba Kerepesi,
Balázs Szalkai,
Bálint Varga,
Vince Grolmusz
Abstract:
The human braingraph, or connectome is a description of the connections of the brain: the nodes of the graph correspond to small areas of the gray matter, and two nodes are connected by an edge if a diffusion MRI-based workflow finds fibers between those brain areas. We have constructed 1015-vertex graphs from the diffusion MRI brain images of 395 human subjects and compared the individual graphs…
▽ More
The human braingraph, or connectome is a description of the connections of the brain: the nodes of the graph correspond to small areas of the gray matter, and two nodes are connected by an edge if a diffusion MRI-based workflow finds fibers between those brain areas. We have constructed 1015-vertex graphs from the diffusion MRI brain images of 395 human subjects and compared the individual graphs with respect to several different areas of the brain. The inter-individual variability of the graphs within different brain regions was discovered and described. We have found that the frontal and the limbic lobes are more conservative, while the edges in the temporal and occipital lobes are more diverse. Interestingly, a "hybrid" conservative and diverse distribution was found in the paracentral lobule and the fusiform gyrus. Smaller cortical areas were also evaluated: precentral gyri were found to be more conservative, and the postcentral and the superior temporal gyri to be very diverse.
△ Less
Submitted 1 July, 2015;
originally announced July 2015.
-
The "Giant Virus Finder" Discovers an Abundance of Giant Viruses in the Antarctic Dry Valleys
Authors:
Csaba Kerepesi,
Vince Grolmusz
Abstract:
The first giant virus was identified in 2003 from a biofilm of an industrial water-cooling tower in England. Later, numerous new giant viruses were found in oceans and freshwater habitats, some of them having even 2,500 genes. We have demonstrated their very likely presence in four soil samples taken from the Kutch Desert (Gujarat, India). Here we describe a bioinformatics work-flow, called the "G…
▽ More
The first giant virus was identified in 2003 from a biofilm of an industrial water-cooling tower in England. Later, numerous new giant viruses were found in oceans and freshwater habitats, some of them having even 2,500 genes. We have demonstrated their very likely presence in four soil samples taken from the Kutch Desert (Gujarat, India). Here we describe a bioinformatics work-flow, called the "Giant Virus Finder" that is capable to discover the very likely presence of the genomes of giant viruses in metagenomic shotgun-sequenced datasets. The new tool is applied to numerous hot and cold desert soil samples as well as some tundra- and forest soils. We show that most of these samples contain giant viruses, and especially many were found in the Antarctic dry valleys. The results imply that giant viruses could be frequent not only in aqueous habitats, but in a wide spectrum of soils on our planet.
△ Less
Submitted 24 November, 2015; v1 submitted 18 March, 2015;
originally announced March 2015.
-
The Budapest Reference Connectome Server v2.0
Authors:
Balazs Szalkai,
Csaba Kerepesi,
Balint Varga,
Vince Grolmusz
Abstract:
The connectomes of different human brains are pairwise distinct: we cannot talk about an abstract "graph of the brain". Two typical connectomes, however, have quite a few common graph edges that may describe the same connections between the same cortical areas. The Budapest Reference Connectome Server Ver. 2.0 (http://connectome.pitgroup.org) generates the common edges of the connectomes of 96 dis…
▽ More
The connectomes of different human brains are pairwise distinct: we cannot talk about an abstract "graph of the brain". Two typical connectomes, however, have quite a few common graph edges that may describe the same connections between the same cortical areas. The Budapest Reference Connectome Server Ver. 2.0 (http://connectome.pitgroup.org) generates the common edges of the connectomes of 96 distinct cortexes, each with 1015 vertices, computed from 96 MRI data sets of the Human Connectome Project. The user may set numerous parameters for the identification and filtering of common edges, and the graphs are downloadable in both csv and GraphML formats; both formats carry the anatomical annotations of the vertices, generated by the Freesurfer program. The resulting consensus graph is also automatically visualized in a 3D rotating brain model on the website.
The consensus graphs, generated with various parameter settings, can be used as reference connectomes based on different, independent MRI images, therefore they may serve as reduced-error, low-noise, robust graph representations of the human brain.
△ Less
Submitted 6 January, 2015; v1 submitted 9 December, 2014;
originally announced December 2014.
-
Giant Viruses of the Kutch Desert
Authors:
Csaba Kerepesi,
Vince Grolmusz
Abstract:
The Kutch desert (Great Rann of Kutch, Gujarat, India) is a unique ecosystem: in the larger part of the year it is a hot, salty desert that is flooded regularly in the Indian monsoon season. In the dry season, the crystallized salt deposits form the "white desert" in large regions. The first metagenomic analysis of the soil samples of Kutch was published in 2013, and the data was deposited in the…
▽ More
The Kutch desert (Great Rann of Kutch, Gujarat, India) is a unique ecosystem: in the larger part of the year it is a hot, salty desert that is flooded regularly in the Indian monsoon season. In the dry season, the crystallized salt deposits form the "white desert" in large regions. The first metagenomic analysis of the soil samples of Kutch was published in 2013, and the data was deposited in the NCBI Sequence Read Archive. The sequences were analyzed at the same time phylogenetically for prokaryotes, especially for bacterial taxa.
In the present work, we are searching for the DNA sequences of the recently discovered giant viruses in the soil samples of the Kutch desert. Since most giant viruses were discovered in biofilms in industrial cooling towers, ocean water and freshwater ponds, we were surprised to find their DNA sequences in the soil samples of a seasonally very hot and arid, salty environment.
△ Less
Submitted 7 October, 2014; v1 submitted 6 October, 2014;
originally announced October 2014.