-
Predicting Transcription Factor Specificity with All-Atom Models
Authors:
Sahand Jamal Rahi,
Peter Virnau,
Leonid A. Mirny,
Mehran Kardar
Abstract:
The binding of a transcription factor (TF) to a DNA operator site can initiate or repress the expression of a gene. Computational prediction of sites recognized by a TF has traditionally relied upon knowledge of several cognate sites, rather than an ab initio approach. Here, we examine the possibility of using structure-based energy calculations that require no knowledge of bound sites but rathe…
▽ More
The binding of a transcription factor (TF) to a DNA operator site can initiate or repress the expression of a gene. Computational prediction of sites recognized by a TF has traditionally relied upon knowledge of several cognate sites, rather than an ab initio approach. Here, we examine the possibility of using structure-based energy calculations that require no knowledge of bound sites but rather start with the structure of a protein-DNA complex. We study the PurR E. coli TF, and explore to which extent atomistic models of protein-DNA complexes can be used to distinguish between cognate and non-cognate DNA sites. Particular emphasis is placed on systematic evaluation of this approach by comparing its performance with bioinformatic methods, by testing it against random decoys and sites of homologous TFs. We also examine a set of experimental mutations in both DNA and the protein. Using our explicit estimates of energy, we show that the specificity for PurR is dominated by direct protein-DNA interactions, and weakly influenced by bending of DNA.
△ Less
Submitted 24 September, 2008;
originally announced September 2008.
-
Intricate Knots in Proteins: Function and Evolution
Authors:
Peter Virnau,
Leonid A. Mirny,
Mehran Kardar
Abstract:
A number of recently discovered protein structures incorporate a rather unexpected structural feature: a knot in the polypeptide backbone. These knots are extremely rare, but their occurrence is likely connected to protein function in as yet unexplored fashion. Our analysis of the complete Protein Data Bank reveals several new knots which, along with previously discovered ones, can shed light on…
▽ More
A number of recently discovered protein structures incorporate a rather unexpected structural feature: a knot in the polypeptide backbone. These knots are extremely rare, but their occurrence is likely connected to protein function in as yet unexplored fashion. Our analysis of the complete Protein Data Bank reveals several new knots which, along with previously discovered ones, can shed light on such connections. In particular, we identify the most complex knot discovered to date in human ubiquitin hydrolase, and suggest that its entangled topology protects it against unfolding and degradation by the proteasome. Knots in proteins are typically preserved across species and sometimes even across kingdoms. However, we also identify a knot which only appears in some transcarbamylases while being absent in homologous proteins of similar structure. The emergence of the knot is accompanied by a shift in the enzymatic function of the protein. We suggest that the simple insertion of a short DNA fragment into the gene may suffice to turn an unknotted into a knotted structure in this protein.
△ Less
Submitted 2 April, 2007;
originally announced April 2007.
-
Kinetics of protein-DNA interaction: facilitated target location in sequence-dependent potential
Authors:
Michael Slutsky,
Leonid A. Mirny
Abstract:
Recognition and binding of specific sites on DNA by proteins is central for many cellular functions such as transcription, replication, and recombination. In the process of recognition, a protein rapidly searches for its specific site on a long DNA molecule and then strongly binds this site. Here we aim to find a mechanism that can provide both a fast search (1-10 sec) and high stability of the…
▽ More
Recognition and binding of specific sites on DNA by proteins is central for many cellular functions such as transcription, replication, and recombination. In the process of recognition, a protein rapidly searches for its specific site on a long DNA molecule and then strongly binds this site. Here we aim to find a mechanism that can provide both a fast search (1-10 sec) and high stability of the specific protein-DNA complex ($K_d=10^{-15}-10^{-8}$ M).
Earlier studies have suggested that rapid search involves the sliding of a protein along the DNA. Here we consider sliding as a one-dimensional (1D) diffusion in a sequence-dependent rough energy landscape. We demonstrate that, in spite of the landscape's roughness, rapid search can be achieved if 1D sliding is accompanied by 3D diffusion. We estimate the range of the specific and non-specific DNA-binding energy required for rapid search and suggest experiments that can test our mechanism. We show that optimal search requires a protein to spend half of time sliding along the DNA and half diffusing in 3D. We also establish that, paradoxically, realistic energy functions cannot provide both rapid search and strong binding of a rigid protein. To reconcile these two fundamental requirements we propose a search-and-fold mechanism that involves the coupling of protein binding and partial protein folding.
Proposed mechanism has several important biological implications for search in the presence of other proteins and nucleosomes, simultaneous search by several proteins etc. Proposed mechanism also provides a new framework for interpretation of experimental and structural data on protein-DNA interactions.
△ Less
Submitted 23 September, 2004; v1 submitted 3 February, 2004;
originally announced February 2004.
-
The long reach of DNA sequence heterogeneity in diffusive processes
Authors:
Michael Slutsky,
Mehran Kardar,
Leonid A. Mirny
Abstract:
Many biological processes involve one dimensional diffusion over a correlated inhomogeneous energy landscape with a correlation length $ξ_c$. Typical examples are specific protein target location on DNA, nucleosome repositioning, or DNA translocation through a nanopore, in all cases with $ξ_c\approx$ 10 nm. We investigate such transport processes by the mean first passage time (MFPT) formalism,…
▽ More
Many biological processes involve one dimensional diffusion over a correlated inhomogeneous energy landscape with a correlation length $ξ_c$. Typical examples are specific protein target location on DNA, nucleosome repositioning, or DNA translocation through a nanopore, in all cases with $ξ_c\approx$ 10 nm. We investigate such transport processes by the mean first passage time (MFPT) formalism, and find diffusion times which exhibit strong sample to sample fluctuations. For a a displacement $N$, the average MFPT is diffusive, while its standard deviation over the ensemble of energy profiles scales as $N^{3/2}$ with a large prefactor. Fluctuations are thus dominant for displacements smaller than a characteristic $N_c \gg ξ_c$: typical values are much less than the mean, and governed by an anomalous diffusion rule. Potential biological consequences of such random walks, composed of rapid scans in the vicinity of favorable energy valleys and occasional jumps to further valleys, is discussed.
△ Less
Submitted 22 October, 2003; v1 submitted 9 October, 2003;
originally announced October 2003.