1 |
A phage display study of interacting peptide binding partners of malarial S-Adenosylmethionine decarboxylase/Ornithine decarboxylaseNiemand, Jandeli 24 April 2008 (has links)
Due to the increasing resistance against the currently used antimalarial drugs, novel chemotherapeutic agents that target new metabolic pathways for the treatment of malarial infections are urgently needed. One approach to the drug discovery process is to use interaction analysis to find proteins that are involved in a specific metabolic pathway that has been identified as a drug target. Protein-protein interactions in such a pathway can be preferential targets since a) there is often greater structural variability in protein-protein interfaces, which can lead to more effective differentiation between the parasite and host proteins; and b) the important amino acids in a protein-protein interface are often conserved and even one amino acid mutation can lead to the dissociation of the complex, implying that resistance should be slower to appear. Since polyamines and their biosynthetic enzymes occur in increased concentrations in rapidly proliferating cells, the inhibition of polyamine metabolism is a rational approach for the development of antiparasitic drugs. Polyamine synthesis in P. falciparum is uniquely facilitated by a single open reading frame that encodes both rate-limiting enzymes in the pathway, namely ornithine decarboxylase (ODC) and S-adenosylmethionine decarboxylase (AdoMetDC). The AdoMetDC/ODC domains are assembled in a heterotetrameric bifunctional protein complex of ~330 kDa. Inhibition of both decarboxylase activities is curative of murine malaria and indicates the viability of such strategies in malaria control. It was hypothesized that protein ligands to this enzyme can be utilized in targeting the polyamine biosynthetic pathway in a novel approach. The bifunctional PfAdoMetDC/ODC was recombinantly expressed with a C-terminal Strep-tag-II to allow affinity purification. Subsequent gel electrophoresis analysis showed the presence of 3 contaminating proteins (~60 kDa, ~70 kDa and ~112 kDa) that co-elute with the ~330 kDa AdoMetDC/ODC. Efforts to purify the bifunctional protein to homogeneity included subcloning into a double-tagged vector for tandem affinity purification as well as size-exclusion HPLC. SDS-PAGE analysis of these indicated that separation of the four proteins was not successful, implicating the presence of strong protein-protein interactions. Western blot analysis showed that the ~112 kDa and ~70 kDa peptides were recombinantly produced with a C-terminal Strep-tag, indicating their heterologous origin. The ~60 kDa fragment was however not recognised by the tag-specific antibodies. This implies that this fragment is of E. coli origin. MS-analysis of the contaminating bands showed that the ~112 kDa peptide is an N-terminally truncated form of the full-length protein, the ~70 kDa peptide is a mixture of N-terminally truncated recombinant protein and E. coli DnaK and the ~60 kDa peptide is E. coli GroEL. A P. falciparum cDNA phage display library was used to identify peptide ligands to PfAdoMetDC/ODC. Of the peptides isolated through the biopanning process, only one was shown to occur in vivo. It could however not be conclusively shown that the isolated peptides bind to PfAdoMetDC/ODC and not to the co-eluting E. coli proteins. It is thought that while it is extremely likely that interacting protein partners to PfAdoMetDC/DOC exist, the available technologies are not sufficient to lead to the identification of such partners. / Dissertation (MSc (Biochemistry))--University of Pretoria, 2008. / Biochemistry / unrestricted
|
2 |
Computational Modeling of Peptide-Protein BindingJanuary 2010 (has links)
abstract: Peptides offer great promise as targeted affinity ligands, but the space of possible peptide sequences is vast, making experimental identification of lead candidates expensive, difficult, and uncertain. Computational modeling can narrow the search by estimating the affinity and specificity of a given peptide in relation to a predetermined protein target. The predictive performance of computational models of interactions of intermediate-length peptides with proteins can be improved by taking into account the stochastic nature of the encounter and binding dynamics. A theoretical case is made for the hypothesis that, because of the flexibility of the peptide and the structural complexity of the target protein, interactions are best characterized by an ensemble of possible bound configurations rather than a single “lock and key” fit. A model incorporating these factors is proposed and evaluated. A comprehensive dataset of 3,924 peptide-protein interface structures was extracted from the Protein Data Bank (PDB) and descriptors were computed characterizing the geometry and energetics of each interface. The characteristics of these interfaces are shown to be generally consistent with the proposed model, and heuristics for design and selection of peptide ligands are derived. The curated and energy-minimized interface structure dataset and a relational database containing the detailed results of analysis and energy modeling are made publicly available via a web repository. A novel analytical technique based on the proposed theoretical model, Virtual Scanning Probe Mapping (VSPM), is implemented in software to analyze the interaction between a target protein of known structure and a peptide of specified sequence, producing a spatial map indicating the most likely peptide binding regions on the protein target. The resulting predictions are shown to be superior to those of two other published methods, and support the validity of the stochastic binding model. / Dissertation/Thesis / Ph.D. Bioengineering 2010
|
3 |
Rational Structure-Based Rescaffolding Approach to De Novo Design of Interleukin 10 (IL-10) Receptor-1 MimeticsRuiz-Gómez, Gloria, Hawkins, John C., Philipp, Jenny, Künze, Georg, Wodtke, Robert, Löser, Reik, Fahmy, Karim, Pisabarro, M. Teresa 06 January 2017 (has links) (PDF)
Tackling protein interfaces with small molecules capable of modulating protein-protein interactions remains a challenge in structure-based ligand design. Particularly arduous are cases in which the epitopes involved in molecular recognition have a non-structured and discontinuous nature. Here, the basic strategy of translating continuous binding epitopes into mimetic scaffolds cannot be applied, and other innovative approaches are therefore required. We present a structure-based rational approach involving the use of a regular expression syntax inspired in the well established PROSITE to define minimal descriptors of geometric and functional constraints signifying relevant functionalities for recognition in protein interfaces of non-continuous and unstructured nature. These descriptors feed a search engine that explores the currently available three-dimensional chemical space of the Protein Data Bank (PDB) in order to identify in a straightforward manner regular architectures containing the desired functionalities, which could be used as templates to guide the rational design of small natural-like scaffolds mimicking the targeted recognition site. The application of this rescaffolding strategy to the discovery of natural scaffolds incorporating a selection of functionalities of interleukin-10 receptor-1 (IL-10R1), which are relevant for its interaction with interleukin-10 (IL-10) has resulted in the de novo design of a new class of potent IL-10 peptidomimetic ligands.
|
4 |
Rational Structure-Based Rescaffolding Approach to De Novo Design of Interleukin 10 (IL-10) Receptor-1 MimeticsRuiz-Gómez, Gloria, Hawkins, John C., Philipp, Jenny, Künze, Georg, Wodtke, Robert, Löser, Reik, Fahmy, Karim, Pisabarro, M. Teresa 06 January 2017 (has links)
Tackling protein interfaces with small molecules capable of modulating protein-protein interactions remains a challenge in structure-based ligand design. Particularly arduous are cases in which the epitopes involved in molecular recognition have a non-structured and discontinuous nature. Here, the basic strategy of translating continuous binding epitopes into mimetic scaffolds cannot be applied, and other innovative approaches are therefore required. We present a structure-based rational approach involving the use of a regular expression syntax inspired in the well established PROSITE to define minimal descriptors of geometric and functional constraints signifying relevant functionalities for recognition in protein interfaces of non-continuous and unstructured nature. These descriptors feed a search engine that explores the currently available three-dimensional chemical space of the Protein Data Bank (PDB) in order to identify in a straightforward manner regular architectures containing the desired functionalities, which could be used as templates to guide the rational design of small natural-like scaffolds mimicking the targeted recognition site. The application of this rescaffolding strategy to the discovery of natural scaffolds incorporating a selection of functionalities of interleukin-10 receptor-1 (IL-10R1), which are relevant for its interaction with interleukin-10 (IL-10) has resulted in the de novo design of a new class of potent IL-10 peptidomimetic ligands.
|
5 |
Analysis Of Structural And Functional Types Of Protein-Protein InteractionsNambudiry Rekha, * 02 1900 (has links) (PDF)
No description available.
|
6 |
Protein Structure Networks : Implications To Protein Stabiltiy And Protein-Protein InteractionsBrinda, K V 08 1900 (has links) (PDF)
No description available.
|
7 |
Structural And Evolutionary Studies On Protein-Protein InteractionsSwapna, L S 02 1900 (has links) (PDF)
The last few decades have witnessed an upsurge in the availability of large-scale data on genomes and genome-scale information. The development of methods to understand the trends and patterns from large scale data promised potentially to unravel the mechanisms responsible for the enormous diversity observed in biological systems. Of the many mechanisms adopted, protein-protein interactions represent one of the commonly adopted mechanisms to achieve functional diversity using a limited genetic repertoire. Protein-proteins interactions bring about several fundamental cellular processes and also modulate regulation at the cellular level. Different types of protein-protein interactions have evolved to carry out myriad functions in a cell. Mainly, interactions can be permanent or transient in nature, depending on the duration of interaction. In terms of affinity ,they are classified as obligate or non-obligate interactions. Structural studies on the various kinds of complexes have enabled the identification of the distinctive features characterizing the different types of complexes. Further, identifying the mechanisms involved in the evolution of protein-protein interactions are important in understanding the forces involved in their maintenance. Such studies also provide clues for the development of methods to predict protein-protein interactions from the large repertoire of sequence and structural data available.
In spite of significant understanding of various aspects of protein-protein interactions, several questions still remain unanswered. The work embodied in this thesis studies two main aspects of protein-protein interactions for various types of complexes: structural and evolutionary features. The first part of the thesis(comprising of chapters 2,3,4 and 5) involves structural studies on the following features of protein-protein interactions: structural change, flexibility, symmetry, and residue conservation. The second part of the thesis(comprising of chapters 6,7,8 and 9) involves study of evolutionary aspects of protein-protein interactions, based on both large-scale data as well as case studies.
Chapter1 provides a background and literature survey of the area of protein-protein interactions. The different classification schemes commonly used for describing the various protein-protein interactions are outlined. The key small-scale and large-scale experimental methods used for the identification of protein-protein interactions are described along with the details of the databases storing such experimentally derived information. Further, a comprehensive account of structural and evolutionary studies performed so far using the available data is provided. The computational(prediction)methods developed to address various aspects of protein-protein interactions are also outlined. In addition, the importance of protein-protein interactions in the context of diseases and the development of methods used to inhibit these interactions are discussed. Finally, the efforts towards design of protein-protein interactions based on the understanding of the principles governing their formation are outlined.
Chapter 2 and chapter 3 describe different aspects of transient protein-protein interactions, which form an important subset of interactions, and are mainly involved in the regulation of Cellular processes. In chapter 2, the structural changes occurring upon complex formation are described. In chapter 3, roles of interface residues in the unbound form are described.
In chapter 2, the nature, extent and location of structural changes upon binding is analyzed using a non-redundant and curated dataset of 77 structures of protein-protein complexes available in both bound and unbound forms. Structural change has been captured using two metrics: protein blocks and root mean square deviation of Cα positions. The relevance of the structural changes observed in protein-protein complexes is established by comparison with a control dataset of proteins not bound to any small or macromolecule. Results indicate that the observed changes are much larger than those observed due to random fluctuations. Given this background, the following observations were made on the nature, extent and location of structural changes in protein-protein complexes.(i) The nature of structural changes occurring at the interface is largely conformational with few rigid-body movements.(ii)The interfaces in the dataset are segregated into three types based on the extent of structural changes at the interface. A significant fraction of the interfaces are ‘pre-made’(almost in variant interface) or‘ induced-fit’(interface with large structural changes), while the rest are interfaces with moderate structural changes(‘others’). Analysis of structural changes using protein blocks reveals that pre-made interfaces are not completely invariant and are characterized by conformational changes of small magnitude. Pre-made interfaces are also observed to bind preferentially to‘ induced fit ’or‘ other’ interfaces. These observations implicate that non-obligate interactions possess in-built regulatory mechanisms in terms of conformational features to control the timely association-dissociation of transiently interacting proteins. (iii) Interestingly, significant structural changes away from the interface were observed in almost one-half of the complexes in the dataset. The analysis of these changes forms a major focus of this chapter. Crystallographic temperature factors, crystal packing, and normal mode analysis of these regions were studied to analyze the structural changes in these regions. Normal mode analysis along with literature survey indicates that most of the structural changes observed in non-interacting surface regions may be functionally relevant, with many of them corresponding to allosteric transitions. The majority of these changes occur in signaling proteins. The chapter summarizes that these observations suggest a much higher prevalence of allostery caused due to protein binding than appreciated before. The data set generated in this chapter can serve as a starting point to uncover potentially new allosteric modulators in signaling systems.
In chapter 3, the question‘ Do residues at the interface of transient protein-protein interactions have any role in the unbound form?’ has been investigated. A high resolution, non-redundant and functionally diverse dataset of 67 proteins with known structures available both in the form of protein-protein complex and unbound forms has been used. Significantly low B-factors at the core of the interface in the unbound form are observed in these structures, indicating high rigidity. Many of these residues also show B-factors comparable to those of buried residues in a protein, which formed the basis for classifying interface residues as ‘rigid’ and‘ non-rigid’. These two types of residues have differential contribution towards the energetics of complex formation. It is also observed that rigid interfacial residues are conserved better in evolution than non-rigid interfacial residues. Their stronger selection is highlighted by substantial conservation of microenvironment (rigidness), sequence(amino acid identity/similarity) and structure(specific side-chain orientation) in homologous proteins. These observations coupled with the absence of any specific type of amino acid to occur preferentially at a rigid site indicates that rigidness is a property of the topological location of the residue at the interface and not the type of the amino acid present at that site. Analysis of the energetic parameters of these residues indicates that the y contribute significantly to the molecular recognition process by reducing the entropic cost on complexation by virtue of their pre-ordered conformation.
This chapter also explores the contribution of interface residues towards the stability of the self-protein vis-à-vis that of the complex. It was seen that most interface residues contribute towards stabilizing the bound form. Interestingly, some of the interfacial residues predominantly stabilize the self-protein(the protein in which they are situated) and have a negligible contribution towards stabilization of the bound form. Thus, though these residues are located in the protein-protein interface their main role seems to be in the stabilization of the self-protein both in the unbound and bound forms. These residues are classified as Self-protein Stabilizing Residues(SSR -6.93%) and the rest as Neutrally Stabilizing Residues(NR -42.60%) and Complex Stabilizing Residues(CSR -50.46%). In addition, it was noted that the proportion of rigid residues is more in SSR(73.33%) than in NR(58.13%) and CSR(48.90%)sites. Apart from the favorable energetic contribution by rigid residues to the free energy of the unbound form than non-rigid residues, their predominance in SSRs suggests that rigid residues play an important role in the stabilization of the unbound form of the protein.The analyses performed in this chapter suggest that not all the protein-protein interfacial residues have the major role of stabilizing the complex; some of these residues seem to have more significant role in the unbound form than the bound form.
Chapter4 provides a discussion on the prevalence and relevance of a symmetry in homodimeric proteins. One of the main features characterizing homodimers is the symmetric arrangement of subunits in the three-dimensional structures.Typically, asymmetric arrangements of subunits are associated with disease states; however, they are also observed in normal homodimers performing specialized functions. Two measures to quantify structural asymmetry in homodimers (global asymmetry and interface asymmetry)have been used on an on-redundant dataset of 223 biologically relevant homodimers. The survey for globally asymmetric homodimers in the dataset indicates that they are very rare(n=11).The chapter discusses cases where a globally asymmetric arrangement of homodimeric proteins has been utilized by the nature to perform certain specialized functions, such as linking of a dimeric system with a monomeric system(half-of-sites reactivity) and the transmission of signals emanating from asymmetric DNA repeats. Analysis of the 3-D structures of homologues reveals that there is no clear conservation of asymmetry. Specifically, the function of the homologous protein appears to dictate the pattern of structural organization. This chapter also describes the structural and evolutionary analyses of the 11 globally asymmetric complexes, which suggest possible mechanisms adopted by nature for preventing infinite array formation. The postulated mechanisms are:(i) In case of homodimers associating via non-topologically equivalent surfaces in their tertiary structures, ligand-dependent mechanisms are used.(ii) In case of homodimers associating via partially topologically equivalent surfaces, steric hindrance serves as the preventive mechanism of infinite array.
Since most of the biologically relevant homodimers exhibit gross structural symmetry, this chapter explores further the extent of interface asymmetry in symmetric homodimers. It was observed that homodimers exhibiting grossly symmetric organization rarely exhibit either perfect local symmetry or high local asymmetry. Further, binding of small ligands at the interface does not cause any significant variation in interface asymmetry.The chapter provides new insights regarding accommodation of structural asymmetry in homodimers.
Chapter 5 describes the ability of residue conservation of interface residues vis-à-vis surface residues near interface residues to identify fitting errors caused due to mis-orientation in cryo-electron microscopy maps. Cryo-electron microscopy is the most popular technique for solving structures of large assemblies in physiological conditions. However, the structures are usually solved at low resolution and atomic resolution is desired to get insights at the molecular-level. Although several methods have been developed for the fitting of atomic structures or models in to low-resolution cryo-electron microscopic maps, inaccurate fitting is observed in several cases. Using a non-redundant and high-resolution dataset of 125 permanent interactions and 95 transient interactions, it was observed that interface residues are significantly conserved better than residues near to the interface. The chapter describes the ability of this differential conservation to identify probable mis-fittings in cryo-EM maps for three case-studies: ribosomal complex from Escherichia coli, transferring-transferrin receptor complex from Homosapiens, and glutamate synthase complex from Azospirillum brasilense. For these cases, the use of conservation information resulted in the identification of a few residues in the vicinity of the interface with significantly higher conservation, implying their probable occurrence in the interface. These findings were verified against the high-resolution structures for two of these complexes (ribosomal assembly and transferring-transferrin receptor complex).These analyses suggest that residue conservation information can be useful in the fitting process to arrive at the fitted structure with an improved accuracy. Further, the discriminative power of the simplistic measure of residue conservation coupled with residue surface accessibility in identifying interacting residues on protein structures is also analyzed in this chapter. Testing on a set of signaling and scaffolding molecules indicates that this simplistic measure can identify interface residues in protein structures, indicating that conservation contains a distinct, although weak, signal for functional regions.
Chapters 6 to 9 discuss studies involving evolutionary aspects of protein-protein interactions. Chapter 6 describes the usage of phylogenetic tree construction using maximum likelihood method to understand the origin of the signal captured by the mirror tree approach. Mirror tree is one of the most popular approaches for identifying interacting proteins based on co-evolution. This method uses the similarity in phylogenetic trees as an indicator of protein-protein interaction. The origin of the evolutionary signal detected by the mirror tree method is a subject of some controversy. Two broad hypotheses have been postulated in the literature to explain the origin of the signal(i)site-specific co-evolution alone and(ii)correlation induced by external factors with only minor, if any, contribution from site-specific co-evolution. In the typical mirror tree protocol, inferences from phylogenetic tree are optional and only genetic distances are analysed. Even if the tree is constructed, usually the Neighbor-Joining approach is used. However ,with Neighbor-Joining method the inferred tree topology and genetic distances are directly linked. With maximum likelihood the tree topology is not derived directly from the genetic distances and therefore the contributions of the signals arising from tree topology and genetic distances can be studied separately. Tree topologies can be considered to serve as indicators of compensatory substitutions(implicated in site-specific co-evolution)as well as shared evolutionary history. Genetic distances correspond to evolutionary rates(implicated in correlation induced by external factors).Using this method, phylogenetic trees for a range of datasets of interacting and non-interacting proteins corresponding to yeast(S.cerevisiae) have been derived. The analysis performed in this chapter reveals no strong correlation between phylogenetic tree topologies, and significant correlation between genetic distance matrices for interacting proteins. The chapter discusses the implications of these findings and attempts to understand the origin of the signal captured by mirror tree protocol using the following points.(i) The near lack of correlation in tree topologies is not surprising since compensatory substitutions accounts for only a minority of the sites in a protein.(ii) The influence of shared evolutionary history has also been tested in the chapter by comparison of tree topologies of interacting proteins and non-interacting with 18S rRNA tree. Tree topologies of both interacting and non-interacting proteins do not mirror the 18S rRNA tree, ruling out shared evolutionary history as the signal of correlated evolution.(iii) By contrast, the significant correlation observed between branch lengths(genetic distances) of interacting proteins in all the variant datasets demonstrates correlation between evolutionary rates, independent of evolutionary divergence. In summary, the chapter concludes by providing support for the theory of correlation induced by external factors with only minor contribution from site-specific co-evolution.
Chapter 7 explores the homology based transfer of interactions by quantifying the extent of retention/variation of interaction partnerships amongst a set of homologous proteins related at SCOP family level(which indicates clear evolutionary relationship).A large dataset of domain-domain interacting pairs(n=20,254)culled from SCOP1.73 was used for this analysis. Study involving this dataset shows that in around~80% of the cases, interacting partners are completely retained(evaluated as proteins belonging to the same SCOP family).If‘common’ partnership is evaluated at the level of SCOP folds, which are known to be structurally similaral though not necessarily evolutionarily related, the percentage of homologous domains with complete retention of partnership increases only by~5%. This indicates that only the presence of a common structural scaffold is not a sufficient feature for interaction. Further, the chapter also describes the retention/variation in partnerships analyzed as a function of sequence divergence between the homologous proteins. It is observed that there is a higher tendency to vary interacting partners as the evolutionary divergence between the homologues increases. In spite of this, interaction partnerships appear to be retained for homologous domains irrespective of their sequence divergence if the function mandates the presence of the interaction. However, all these observations could be influenced by the incomplete nature of information on the interactions available in the structural space. This problem has been addressed in this chapter by studying variation in interaction partnerships for Saccharomyces cerevisiae proteins. Yeast was chosen since it is extensively studied and interactions are available for~87% of proteins yielding a comprehensive list of interactions. To study this aspect, the SCOP dataset of interacting proteins(which represents a generic dataset) was compared with interactions of homologous proteins from yeast. The dataset of interacting proteins for yeast collated from all sources and documented in BIOGRID v50 was used. In this analysis, the proportion of homologous domains showing complete retention of interacting partners was only ~12%. This observation is the reverse of the trend observed for the dataset of homologous SCOP domains. Further analysis of homologous pairs of yeast-SCOP domains, containing only those pairs whose interacting protein families are found both in yeast and SCOP dataset, was performed to ascertain the extent of contribution of organism-specific proteins to the variation in interaction partnership for homologous domains. The proportion of homologous domains showing complete retention of interaction partners increases to~50% for these cases. These observations indicate that organism-specific proteins contribute significantly to the variation of interaction partnerships in homologous proteins.
The next two chapters(8 and 9) discuss two contrasting scenarios of interaction partnerships. Chapter 8 describes the study of two protein families showing variation in interaction partnerships/interface structure and analyzes the drift in protein-protein interaction surfaces in each of the cases. The analysis in this chapter is facilitated by the large number of sequences available for the case studies. The first case study involves members of the glutamine amido transferase (GAT) superfamily of enzymes. Three remote homologues in this superfamily could also be related by sequence: intracellular protease(DJ-1/PfpIfamily),C-terminal domain of the small subunit of carbomoyl phosphate synthetase (ClassI glutamine amidotransferase-like family), and C-terminal domain of catalase (Catalase ,C-terminal domain family).In two cases, it is seen that domain recruitment influences the interacting surface(catalase, carbamoyl phosphate synthetase). The tethered domains, which are involved in interaction with the GAT domain, are from different SCOP folds, indicating that partnerships are not retained at extreme divergence. However, members of the DJ-1/PfpIfamily form homodimers with differing quaternary structures i.e. different orientations of the dimers. Four members have been studied in detail in this chapter (intracellular protease–two distinct interfaces–forming hexamer, stress-induced protein -dimer, DJ-1protein -dimer, sigma cross-reacting protein -dimer). Since the members are sequentially less divergent(as they are within the same family), it is possible to trace the drift in interfaces among these members based on the multiple sequence alignments of members with the differing quaternary structures and the sequences bridging them. The second case study involves analysis of the family of legume lectins, which corresponds to another set of proteins exhibiting differing quaternary structures for remarkably well conserved tertiary structures and sequences.
Analysis of variations in protein-protein interaction surfaces when they show only slight differences between homologous members indicates that the drift is gradual, as seen when tracing the dynamics of DJ-1 family members and legume lectin family members. There exist sequences containing many different intermediate combinations of the interacting residues involved in both the sets of proteins. Comparisons of homologues where an entire interface seems to be lost show a different trend(intracellular protease and DJ-1).The most prominent interacting residues show an abrupt shift between the two different subfamilies. However, inspection of the other interacting residues reveals that there is a gradual change occurring generally, although a drastic change in the important(although quantitatively smaller) residues would have led to loss of interface. In summary, analysis of the evolutionary dynamics of the consensus interface residues of different quaternary structure types of DJ-1/PfpI family of enzymes and legume lectins shows that nature employs only the most important mutations to Prevent a specific interface and form a new interface and the rest of the positions drift and accumulate changes in the course of evolution.
Chapter 9 describes the opposite scenario i.e. conservation of an interface even at high sequence divergence, using the RNA polymerase assembly as a case study. The multi-molecular assembly consists of four core subunits–alpha (I and II), beta, betaprime, and omega. These four subunits are common to RNA polymerase complexes of eubacteria, eukaryota and archaea. The sigma subunit aids in initiation of transcription in eubacteria (cor eenzyme +sigma = holoenzyme). Remarkably, prokaryotic and eukaryotic structures exhibit high degree of structural similarity, although their sequence similarity is low(19-28% sequence identity).However, this is expected as the obligatory interaction between the various subunits is essential to successfully carry out transcription. This chapter investigates the structural accommodation of diverse sequences at the interface of RNA polymerase machinery of eubacteria, using sequence analysis and homology modelling. Analysis of domain composition and order of domains for the core subunits of the RNA polymerase assembly in>85 eubacterial species indicates complete conservation. However, conservation analysis of the various core subunits indicates that the interface residues are more divergent for alpha and omega subunits. Although beta and beta prime are generally well-conserved, the residues involved in interaction with the divergent subunits(i.e.alpha, omega) are not conserved. Insertions/deletions are also observed near the interacting surfaces even in the cases of most conserved subunits(beta and betaprime). The chapter describes the homology modeling of three divergent RNApolymerase complexes from Helicobacter pylori, Mycoplasma pulmonis and Onion yellows phytoplasma, highlighting that insertions/deletions can be accommodated near the interface as they generally occur at the periphery. The development of a generalized matrix capturing preferences of interface environment is documented, along with results comparing the similarity of the modeled interfaces to that of the template interface. It is observed that the modeled interfaces are physico-chemically similar to that of the template interfaces in Thermus thermophilus, indicating that nature accommodates substantial substitutions and insertions/deletions at and near the interface in order to retain the structure of the obligate complex, which is in dispensable for the process of transcription.
The main conclusions of the entire thesis work are summarized in chapter10, which also places the work in the context of the field of protein-protein interactions. The new insights obtained for transient interactions and homodimers from structural studies are highlighted. The application of evolutionary conservation to improve fitting of atomic structures in cryo-electron microscopic maps is discussed. The understanding gained from study of different evolutionary aspects of protein-protein interactions, ranging from correlated evolution to evolutionary dynamics of variations in interactions is also highlighted.
Appendix 1 of this thesis describes the homology modeling of the hexameric form of AAA ATPase domain of spastin along with associated structural analysis.
|
8 |
Protein-protein interactions: impact of solvent and effects of fluorinationSamsonov, Sergey 10 December 2009 (has links) (PDF)
Proteins have an indispensable role in the cell. They carry out a wide variety of structural, catalytic and signaling functions in all known biological systems. To perform their biological functions, proteins establish interactions with other bioorganic molecules including other proteins. Therefore, protein-protein interactions is one of the central topics in molecular biology. My thesis is devoted to three different topics in the field of protein-protein interactions. The first one focuses on solvent contribution to protein interfaces as it is an important component of protein complexes. The second topic discloses the structural and functional potential of fluorine's unique properties, which are attractive for protein design and engineering not feasible within the scope of canonical amino acids. The last part of this thesis is a study of the impact of charged amino acid residues within the hydrophobic interface of a coiled-coil system, which is one of the well-established model systems for protein-protein interactions studies.
I. The majority of proteins interact in vivo in solution, thus studies of solvent impact on protein-protein interactions could be crucial for understanding many processes in the cell. However, though solvent is known to be very important for protein-protein interactions in terms of structure, dynamics and energetics, its effects are often disregarded in computational studies because a detailed solvent description requires complex and computationally demanding approaches. As a consequence, many protein residues, which establish water-mediated interactions, are neither considered in an interface definition. In the previous work carried out in our group the protein interfaces database (SCOWLP) has been developed. This database takes into account interfacial solvent and based on this classifies all interfacial protein residues of the PDB into three classes based on their interacting properties: dry (direct interaction), dual (direct and water-mediated interactions), and wet spots (residues interacting only through one water molecule). To define an interaction SCOWLP considers a donor–acceptor distance for hydrogen bonds of 3.2 Å, for salt bridges of 4 Å, and for van der Waals contacts the sum of the van der Waals radii of the interacting atoms. In previous studies of the group, statistical analysis of a non-redundant protein structure dataset showed that 40.1% of the interfacial residues participate in water-mediated interactions, and that 14.5% of the total residues in interfaces are wet spots. Moreover, wet spots have been shown to display similar characteristics to residues contacting water molecules in cores or cavities of proteins.
The goals of this part of the thesis were:
1. to characterize the impact of solvent in protein-protein interactions
2. to elucidate possible effects of solvent inclusion into the correlated mutations approach for protein contacts prediction
To study solvent impact on protein interfaces a molecular dynamics (MD) approach has been used. This part of the work is elaborated in section 2.1 of this thesis. We have characterized properties of water-mediated protein interactions at residue and solvent level. For this purpose, an MD analysis of 17 representative complexes from SH3 and immunoglobulin protein families has been performed. We have shown that the interfacial residues interacting through a single water molecule (wet spots) are energetically and dynamically very similar to other interfacial residues. At the same time, water molecules mediating protein interactions have been found to be significantly less mobile than surface solvent in terms of residence time. Calculated free energies indicate that these water molecules should significantly affect formation and stability of a protein-protein complex. The results obtained in this part of the work also suggest that water molecules in protein interfaces contribute to the conservation of protein interactions by allowing more sequence variability in the interacting partners, which has important implications for the use of the correlated mutations concept in protein interactions studies. This concept is based on the assumption that interacting protein residues co-evolve, so that a mutation in one of the interacting counterparts is compensated by a mutation in the other. The study presented in section 2.2 has been carried out to prove that an explicit introduction of solvent into the correlated mutations concept indeed yields qualitative improvement of existing approaches. For this, we have used the data on interfacial solvent obtained from the SCOWLP database (the whole PDB) to construct a “wet” similarity matrix. This matrix has been used for prediction of protein contacts together with a well-established “dry” matrix. We have analyzed two datasets containing 50 domains and 10 domain pairs, and have compared the results obtained by using several combinations of both “dry” and “wet” matrices. We have found that for predictions for both intra- and interdomain contacts the introduction of a combination of a “dry” and a “wet” similarity matrix improves the predictions in comparison to the “dry” one alone. Our analysis opens up the idea that the consideration of water may have an impact on the improvement of the contact predictions obtained by correlated mutations approaches. There are two principally novel aspects in this study in the context of the used correlated mutations methodology :
i) the first introduction of solvent explicitly into the correlated mutations approach; ii) the use of the definition of protein-protein interfaces, which is essentially different from many other works in the field because of taking into account physico-chemical properties of amino acids and not being exclusively based on distance cut-offs.
II. The second part of the thesis is focused on properties of fluorinated amino acids in protein environments. In general, non-canonical amino acids with newly designed side-chain functionalities are powerful tools that can be used to improve structural, catalytic, kinetic and thermodynamic properties of peptides and proteins, which otherwise are not feasible within the use of canonical amino acids. In this context fluorinated amino acids have increasingly gained in importance in protein chemistry because of fluorine's unique properties: high electronegativity and a small atomic size. Despite the wide use of fluorine in drug design, properties of fluorine in protein environments have not been yet extensively studied. The aims of this part of the dissertation were:
1. to analyze the basic properties of fluorinated amino acids such as electrostatic and geometric characteristics, hydrogen bonding abilities, hydration properties and conformational preferences (section 3.1)
2. to describe the behavior of fluorinated amino acids in systems emulating protein environments (section 3.2, section 3.3)
First, to characterize fluorinated amino acids side chains we have used fluorinated ethane derivatives as their simplified models and applied a quantum mechanics approach. Properties such as charge distribution, dipole moments, volumes and size of the fluoromethylated groups within the model have been characterized. Hydrogen bonding properties of these groups have been compared with the groups typically presented in natural protein environments. We have shown that hydrogen and fluorine atoms within these fluoromethylated groups are weak hydrogen bond donors and acceptors. Nevertheless they should not be disregarded for applications in protein engineering. Then, we have implemented four fluorinated L-amino acids for the AMBER force field and characterized their conformational and hydration properties at the MD level. We have found that hydrophobicity of fluorinated side chains grows with the number of fluorine atoms and could be explained in terms of high electronegativity of fluorine atoms and spacial demand of fluorinated side-chains. These data on hydration agrees with the results obtained in the experimental work performed by our collaborators.
We have rationally engineered systems that allow us to study fluorine properties and extract results that could be extrapolated to proteins. For this, we have emulated protein environments by introducing fluorinated amino acids into a parallel coiled-coil and enzyme-ligand chymotrypsin systems. The results on fluorination effect on coiled-coil dimerization and substrate affinities in the chymotrypsin active site obtained by MD, molecular docking and free energy calculations are in strong agreement with experimental data obtained by our collaborators. In particular, we have shown that fluorine content and position of fluorination can considerably change the polarity and steric properties of an amino acid side chain and, thus, can influence the properties that a fluorinated amino acid reveals within a native protein environment.
III. Coiled-coils typically consist of two to five right-handed α-helices that wrap around each other to form a left-handed superhelix. The interface of two α-helices is usually represented by hydrophobic residues. However, the analysis of protein databases revealed that in natural occurring proteins up to 20% of these positions are populated by polar and charged residues. The impact of these residues on stability of coiled-coil system is not clear. MD simulations together with free energy calculations have been utilized to estimate favourable interaction partners for uncommon amino acids within the hydrophobic core of coiled-coils (Chapter 4). Based on these data, the best hits among binding partners for one strand of a coiled-coil bearing a charged amino acid in a central hydrophobic core position have been selected. Computational data have been in agreement with the results obtained by our collaborators, who applied phage display technology and CD spectroscopy. This combination of theoretical and experimental approaches allowed to get a deeper insight into the stability of the coiled-coil system.
To conclude, this thesis widens existing concepts of protein structural biology in three areas of its current importance. We expand on the role of solvent in protein interfaces, which contributes to the knowledge of physico-chemical properties underlying protein-protein interactions. We develop a deeper insight into the understanding of the fluorine's impact upon its introduction into protein environments, which may assist in exploiting the full potential of fluorine's unique properties for applications in the field of protein engineering and drug design. Finally we investigate the mechanisms underlying coiled-coil system folding. The results presented in the thesis are of definite importance for possible applications (e.g. introduction of solvent explicitly into the scoring function) into protein folding, docking and rational design methods.
The dissertation consists of four chapters:
● Chapter 1 contains an introduction to the topic of protein-protein interactions including basic concepts and an overview of the present state of research in the field.
● Chapter 2 focuses on the studies of the role of solvent in protein interfaces.
● Chapter 3 is devoted to the work on fluorinated amino acids in protein environments.
● Chapter 4 describes the study of coiled-coils folding properties.
The experimental parts presented in Chapters 3 and 4 of this thesis have been performed by our collaborators at FU Berlin.
Sections 2.1, 2.2, 3.1, 3.2 and Chapter 4 have been submitted/published in peer-reviewed international journals. Their organization follows a standard research article structure: Abstract, Introduction, Methodology, Results and discussion, and Conclusions. Section 3.3, though not published yet, is also organized in the same way. The literature references are summed up together at the end of the thesis to avoid redundancy within different chapters.
|
9 |
Protein-protein interactions: impact of solvent and effects of fluorinationSamsonov, Sergey 16 November 2009 (has links)
Proteins have an indispensable role in the cell. They carry out a wide variety of structural, catalytic and signaling functions in all known biological systems. To perform their biological functions, proteins establish interactions with other bioorganic molecules including other proteins. Therefore, protein-protein interactions is one of the central topics in molecular biology. My thesis is devoted to three different topics in the field of protein-protein interactions. The first one focuses on solvent contribution to protein interfaces as it is an important component of protein complexes. The second topic discloses the structural and functional potential of fluorine's unique properties, which are attractive for protein design and engineering not feasible within the scope of canonical amino acids. The last part of this thesis is a study of the impact of charged amino acid residues within the hydrophobic interface of a coiled-coil system, which is one of the well-established model systems for protein-protein interactions studies.
I. The majority of proteins interact in vivo in solution, thus studies of solvent impact on protein-protein interactions could be crucial for understanding many processes in the cell. However, though solvent is known to be very important for protein-protein interactions in terms of structure, dynamics and energetics, its effects are often disregarded in computational studies because a detailed solvent description requires complex and computationally demanding approaches. As a consequence, many protein residues, which establish water-mediated interactions, are neither considered in an interface definition. In the previous work carried out in our group the protein interfaces database (SCOWLP) has been developed. This database takes into account interfacial solvent and based on this classifies all interfacial protein residues of the PDB into three classes based on their interacting properties: dry (direct interaction), dual (direct and water-mediated interactions), and wet spots (residues interacting only through one water molecule). To define an interaction SCOWLP considers a donor–acceptor distance for hydrogen bonds of 3.2 Å, for salt bridges of 4 Å, and for van der Waals contacts the sum of the van der Waals radii of the interacting atoms. In previous studies of the group, statistical analysis of a non-redundant protein structure dataset showed that 40.1% of the interfacial residues participate in water-mediated interactions, and that 14.5% of the total residues in interfaces are wet spots. Moreover, wet spots have been shown to display similar characteristics to residues contacting water molecules in cores or cavities of proteins.
The goals of this part of the thesis were:
1. to characterize the impact of solvent in protein-protein interactions
2. to elucidate possible effects of solvent inclusion into the correlated mutations approach for protein contacts prediction
To study solvent impact on protein interfaces a molecular dynamics (MD) approach has been used. This part of the work is elaborated in section 2.1 of this thesis. We have characterized properties of water-mediated protein interactions at residue and solvent level. For this purpose, an MD analysis of 17 representative complexes from SH3 and immunoglobulin protein families has been performed. We have shown that the interfacial residues interacting through a single water molecule (wet spots) are energetically and dynamically very similar to other interfacial residues. At the same time, water molecules mediating protein interactions have been found to be significantly less mobile than surface solvent in terms of residence time. Calculated free energies indicate that these water molecules should significantly affect formation and stability of a protein-protein complex. The results obtained in this part of the work also suggest that water molecules in protein interfaces contribute to the conservation of protein interactions by allowing more sequence variability in the interacting partners, which has important implications for the use of the correlated mutations concept in protein interactions studies. This concept is based on the assumption that interacting protein residues co-evolve, so that a mutation in one of the interacting counterparts is compensated by a mutation in the other. The study presented in section 2.2 has been carried out to prove that an explicit introduction of solvent into the correlated mutations concept indeed yields qualitative improvement of existing approaches. For this, we have used the data on interfacial solvent obtained from the SCOWLP database (the whole PDB) to construct a “wet” similarity matrix. This matrix has been used for prediction of protein contacts together with a well-established “dry” matrix. We have analyzed two datasets containing 50 domains and 10 domain pairs, and have compared the results obtained by using several combinations of both “dry” and “wet” matrices. We have found that for predictions for both intra- and interdomain contacts the introduction of a combination of a “dry” and a “wet” similarity matrix improves the predictions in comparison to the “dry” one alone. Our analysis opens up the idea that the consideration of water may have an impact on the improvement of the contact predictions obtained by correlated mutations approaches. There are two principally novel aspects in this study in the context of the used correlated mutations methodology :
i) the first introduction of solvent explicitly into the correlated mutations approach; ii) the use of the definition of protein-protein interfaces, which is essentially different from many other works in the field because of taking into account physico-chemical properties of amino acids and not being exclusively based on distance cut-offs.
II. The second part of the thesis is focused on properties of fluorinated amino acids in protein environments. In general, non-canonical amino acids with newly designed side-chain functionalities are powerful tools that can be used to improve structural, catalytic, kinetic and thermodynamic properties of peptides and proteins, which otherwise are not feasible within the use of canonical amino acids. In this context fluorinated amino acids have increasingly gained in importance in protein chemistry because of fluorine's unique properties: high electronegativity and a small atomic size. Despite the wide use of fluorine in drug design, properties of fluorine in protein environments have not been yet extensively studied. The aims of this part of the dissertation were:
1. to analyze the basic properties of fluorinated amino acids such as electrostatic and geometric characteristics, hydrogen bonding abilities, hydration properties and conformational preferences (section 3.1)
2. to describe the behavior of fluorinated amino acids in systems emulating protein environments (section 3.2, section 3.3)
First, to characterize fluorinated amino acids side chains we have used fluorinated ethane derivatives as their simplified models and applied a quantum mechanics approach. Properties such as charge distribution, dipole moments, volumes and size of the fluoromethylated groups within the model have been characterized. Hydrogen bonding properties of these groups have been compared with the groups typically presented in natural protein environments. We have shown that hydrogen and fluorine atoms within these fluoromethylated groups are weak hydrogen bond donors and acceptors. Nevertheless they should not be disregarded for applications in protein engineering. Then, we have implemented four fluorinated L-amino acids for the AMBER force field and characterized their conformational and hydration properties at the MD level. We have found that hydrophobicity of fluorinated side chains grows with the number of fluorine atoms and could be explained in terms of high electronegativity of fluorine atoms and spacial demand of fluorinated side-chains. These data on hydration agrees with the results obtained in the experimental work performed by our collaborators.
We have rationally engineered systems that allow us to study fluorine properties and extract results that could be extrapolated to proteins. For this, we have emulated protein environments by introducing fluorinated amino acids into a parallel coiled-coil and enzyme-ligand chymotrypsin systems. The results on fluorination effect on coiled-coil dimerization and substrate affinities in the chymotrypsin active site obtained by MD, molecular docking and free energy calculations are in strong agreement with experimental data obtained by our collaborators. In particular, we have shown that fluorine content and position of fluorination can considerably change the polarity and steric properties of an amino acid side chain and, thus, can influence the properties that a fluorinated amino acid reveals within a native protein environment.
III. Coiled-coils typically consist of two to five right-handed α-helices that wrap around each other to form a left-handed superhelix. The interface of two α-helices is usually represented by hydrophobic residues. However, the analysis of protein databases revealed that in natural occurring proteins up to 20% of these positions are populated by polar and charged residues. The impact of these residues on stability of coiled-coil system is not clear. MD simulations together with free energy calculations have been utilized to estimate favourable interaction partners for uncommon amino acids within the hydrophobic core of coiled-coils (Chapter 4). Based on these data, the best hits among binding partners for one strand of a coiled-coil bearing a charged amino acid in a central hydrophobic core position have been selected. Computational data have been in agreement with the results obtained by our collaborators, who applied phage display technology and CD spectroscopy. This combination of theoretical and experimental approaches allowed to get a deeper insight into the stability of the coiled-coil system.
To conclude, this thesis widens existing concepts of protein structural biology in three areas of its current importance. We expand on the role of solvent in protein interfaces, which contributes to the knowledge of physico-chemical properties underlying protein-protein interactions. We develop a deeper insight into the understanding of the fluorine's impact upon its introduction into protein environments, which may assist in exploiting the full potential of fluorine's unique properties for applications in the field of protein engineering and drug design. Finally we investigate the mechanisms underlying coiled-coil system folding. The results presented in the thesis are of definite importance for possible applications (e.g. introduction of solvent explicitly into the scoring function) into protein folding, docking and rational design methods.
The dissertation consists of four chapters:
● Chapter 1 contains an introduction to the topic of protein-protein interactions including basic concepts and an overview of the present state of research in the field.
● Chapter 2 focuses on the studies of the role of solvent in protein interfaces.
● Chapter 3 is devoted to the work on fluorinated amino acids in protein environments.
● Chapter 4 describes the study of coiled-coils folding properties.
The experimental parts presented in Chapters 3 and 4 of this thesis have been performed by our collaborators at FU Berlin.
Sections 2.1, 2.2, 3.1, 3.2 and Chapter 4 have been submitted/published in peer-reviewed international journals. Their organization follows a standard research article structure: Abstract, Introduction, Methodology, Results and discussion, and Conclusions. Section 3.3, though not published yet, is also organized in the same way. The literature references are summed up together at the end of the thesis to avoid redundancy within different chapters.
|
Page generated in 0.1019 seconds