• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 2
  • Tagged with
  • 2
  • 2
  • 2
  • 2
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Dynamics of Protein Kinases : Its Relationship to Functional Sites and States

Kalaivani, R January 2017 (has links) (PDF)
A cell is a highly complex, ordered, and above all, a robust system. It copes with in-trennel and external uncertainties like heterogeneous stimuli, errors in processing and execution, and changes within and outside the cell. Maintenance of such a system critically depends on a large body of signalling networks and associated regulatory mechanisms. Of the recurrent manoeuvres in cell signalling, protein phosphorylation is the most prominent, and is used as a switch to transmit information and effect-ate various outcomes. It is estimated that 30% of the entire proteome of a typical eukaryotic cell is phosphorylated at one time or another, almost exclusively at the hydroxyl groups of one or more Seer(S)/Thru(T)/Tyr(Y) residues. This phosphorylation is accomplished through the transfer of g-phosphate of ATP in the presence of cations by a superfamily of enzymes called protein kinases, or STY kinases. In accordance with widespread phosphorylation events, STY kinases form a large and diverse superfamily, constituting 2% of the proteins encoded in an eukaryotic genome and about 500 proteins in the human proteome. Distantly related STY kinases share less than 20% sequence identity, phosphorylate specific substrates, bind to dis-tint interaction partners, localise in different cellular compartments and are regulated by different mechanisms. Despite flexibly accommodating these specific attributes, all STY kinases share a conserved 3-dimensional fold and retain the catalytic function. Moreover, all STY kinases can be manipulated by the signalling machinery to be in the “on” (functionally active) or “off” (inactive) state, thereby adding another layer of regulatory control. Such versatility of the STY kinase domain in harbouring specific substrate recognition motifs, binding interfaces, domain architectures and functional states makes it one of the most influential players in cell signalling and a desirable drug target. Despite decades of studies, a comprehensive understanding of the kinase domain, and the features that dictate its catalytic activity and specificity is lacking. This is reflected by the fact that whereas kinases specifically bind and phosphorylate their cognate substrates, most drugs targeted at them are non-specific and beget cross-reactivity. This gap in understanding potentially ensues from an awry outlook of STY kinases from the viewpoint of sequences and structures alone. It is now well established that the function and regulation of a protein molecule, along with its stability and evolution, is closely related to its dynamics. In this premise, this thesis explores the mechanistic and dynamics underpinnings of STY kinases, and interprets them in the light of their multitude of functional responsibilities and specificities In Chapter 1 of the thesis, we broadly discuss the complexity of cell signalling and the pivotal role of STY kinases in it. After a brief introduction to cell signalling in eukaryotes, several signal cascades mediated by different secondary messengers (camp, cGMP, DAG, IP3) are described. In these signal pathways, modularity is identified as a recurrent theme at all levels of hierarchy: within domain, within protein, within signalling pathway and across signalling pathways. One such modular regulation, protein phosphorylation, is discussed in detail and its catalytic enzyme STY kinase is introduced. An overview and historical perspective of the STY kinase superfamily is presented along with the review of literature pertaining to their sequence, structure and catalytic function characteristics. We note that in the active state, all STY kinases adopt a specific spatial conformation characterised by precise positioning of crucial structural motifs, while the inactive state is usually a case of some deviation from these structural constraints. Chapter 2 addresses a fundamental question in the protein dynamics and function paradigm. If mobility and dynamics of a protein is intimately coupled to its function, how does it manifest in STY kinases? Is there a discernible inter-relationship be-tween the mobility of an STY kinase and its functional competence? To answer these questions, we collated 55 crystal structures of 14 STY kinases from diverse groups and families, and subjected their kinase catalytic domains to Gaussian network model (GNM) based normal mode analysis (NMA). GNM models the kinase structure as a 3-dimensional mass-spring system in a coarse-grained fashion, with masses/nodes at Ca atom positions. Proximate Ca nodes, within a 7 A˚ distance cut-off, are con-nested by identical virtual springs, resulting in a simplified network of Ca-Ca bonded and non-bonded interactions modelled as harmonic potentials. Based purely on the topology of mechanical constraints imposed by the springs, GNM analytically deter-mines the isotropic vibrational normal modes available to the kinase structure. This method approximates the energy of the protein structure harmonically, and thus any micro-motion of the kinase can be theoretically described by a linear cSombination of the calculated normal modes. It is known from previous studies that the modes of low frequencies correspond to biologically feasible and meaningful motions like hinge movements, protein folding and catalysis. We note that the multiple crystal structures analysed in each of the 14 STY kinases are identical in sequence and gross structural fold, and vary only in local backbone conformations corresponding to the functional state of the kinase (active/inactive). Upon examining the fluctuations of kinases in the normal mode of the least frequency (or, global mode), we found systematically higher structural fluctuations in the inactive states when compared to the corresponding active states. This observation held true within individual kinases and across all the 14 kinases. Taken together, a more number of residues have higher fluctuations in the inactive states (n = 1095), than those with higher fluctuations in the active states (n = 525; Chi-square test, p value < 0.05). This skewed fluctuation distribution is in corroboration with higher B-factors and con-formational energies of the inactive state crystal structures. Moreover, high fluctuation is observed across the different inactive forms, except a small fraction of DFG-“in” in-active conformations. Interestingly, the regions of differential fluctuation localised to activation loop, catalytic loop, aC-helix and aG-helix, which are implied in kinase function and regulation. Further investigation of 476 crystal structures of kinase com-plexes with other proteins revealed a remarkable correspondence of these regions of differential fluctuation to contact interfaces. We further verified that this differential fluctuation is not a trivial consequence of bound small molecules or mutations, but an inherent attribute of the kinase catalytic domains. In Chapter 3, we verified the accuracy of differential fluctuation observed between the active and inactive STY kinases, as perceived from GNM based NMA, using the more rigorous method of molecular dynamics (MD) simulations. GNM is minimal-is tic in that the STY kinase catalytic domain is coarse-grained and reduced to a 3-dimensional mechanical network of Ca atom nodes. Thus, the role of side chains and their biophysical character, intra-protein interactions, mutations and bound factors are grossly overlooked. In this premise, we conducted all-atom MD simulations using AMBER ff14SB force-field of 6 structural variants of cAMP-dependent protein kinase (PKA) for 1 ms each. We chose 2 crystal structures of active and inactive PKA (PDB IDs 3FJQ and 1SYK respectively) whose kinase domains shared high structural similarity (gRMSD = 2.6 A)˚. They were modified in silico to obtain 6 starting structures for MD simulations: phosphorylated kinase domain in active and inactive states, kinase do-main along with its C-terminal tail in active and inactive states, active kinase domain bound to ATP/2Mg2+, and unphosphorylated inactive kinase domain. In the absence of external domains, the inactive kinase domain conformation elicits higher mobility in terms of Ca RMSD and Ca RMSF than the active kinase domain. Of the 255 residues in PKA, remarkable 198 residues have higher Ca RMSF in the inactive state, with predominant contributions from ATP binding loop, catalytic loop and aG-helix. In the presence of C-terminal tail, the differential mobility of the kinase domain is exaggerated, with 241 out of 255 residues showing higher Ca RMSF in the inactive state. Upon close investigation, we found that in the presence of C-terminal tail, al-though the mobility of residues is generally suppressed in both the functional states, a few functional regions like activation loop and hinge residues experience higher Ca RMSF in the inactive state. This sheds light on the role of C-terminal tail in the dynam-ics of the activation loop, potentially operating through the hinge residues. Absence of phosphorylation in the inactive kinase domain increases the mobility of residues in general, except of those in the aG-helix. When bound to ATP/2Mg2+, active ki-nase domain (active-holo) showed higher mobility than the active-apo and inactive structures, contrary to the previous results and studies. Intrigued, we examined the simulation closely and found a transition of the active-holo structure to another con-formation, named s2, at 450 ns. Upon analysis of the trajectory before the transition, the active-holo form was indeed found to be more stable and less mobile than the inac-tive state(s). Thus, all the inactive variants are found to be consistently more agile and mobile than their active counterparts, in agreement with the results obtained using NMA. Chapter 4 discusses the transition of the active-holo simulation to a new state, named s2, characterises its structural features and explores the possibility of its func-tional relevance. In the previous chapter, while attempting to verify the presence of differential mobility between various active and inactive forms of PKA through MD simulations, we chanced upon the transition of an active PKA state bound to ATP/2Mg2+ (active-holo) to s2 conformation. The s2 state has a Ca RMSD of up to 4.1 A˚ from the initial starting conformations, mainly contributed by the ATP binding loop, abs-helix, act-helix and age-helix, which are implicated in catalysis and substrate recognition. Once formed, s2 was stable and did not revert back to the active-hole starting structure or any other conformation. We calculated all-vs.-all Ca RMSDs of the conformations sampled during the simulation and identified 3 time periods: 0 - 200 ns of initial conformations similar to the starting structure, 201 - 500 ns of transition, and 501 - 1000 ns of s2 conformations. Principle component analysis (PCA) of the Ca spatial positions during the entire trajectory also categorically exposed two energy wells corresponding to the initial and s2 conformations in the first and second PCs (variance = 56%). Upon systematically comparing the conformers sampled in MD with every known kinase structure, no structure hit with Ca RMSD 2 A˚ was found for conformers sampled after 500 ns, deeming s2 as a novel and hitherto unknown conformation. Investigation of persistent intra-protein interactions unique to the s2 state revealed two stabilising interactions: a salt bridge between K73 and E106 in the b-sheet behind the ATP binding cleft and a network of hydrophobic interactions anchoring act-helix to the age-helix. Aside from these defining interactions, s2 is also characterised by a higher density of intra-protein hydrogen bond network, which stabilises it further. PCA of the MD trajectory indicates the transition of active-hole to s2 to be a process with at least 2 steps, the first being the salt bridge formation. Evolutionary conservation analysis shows that the crucial residues involved in the s2-specific interactions are not reliably conserved across PKAs of other organisms. However, convergence to s2 may still be feasible through other courses of stabilising interactions. From a functional perspective, the s2 conformation opens up the age-helix away from the kinase core and mildly rearranges the catalytic cleft, thereby unmasking a hotspot for sub-strata binding that was absent in the initial structure. In an attempt to replicate the s2 conformation, we performed 4 repeat simulations of the same active-hole starting structure for 1 ms each. Although two of these independent simulations achieved the K73-E106 salt bridge, none of them cloned the complete extent of transition and con-mergence to s2. Instead, we sampled another set of novel conformations, s3, in one of the repeat simulations indicating a disposition for the ATP bound PKA to sample different conformations. Comparative analysis suggests a potential role of C-terminal tail in stabilising the active-hole conformation in physiological conditions. Chapter 5 characterises the extent of conservation of structural fluctuations in ho-mologous STY kinases and interprets the observations in the light of their regulatory diversity. Upon establishing that structural fluctuations of STY kinases carry activity-specific information (Chapter 2) and affirming the same using MD simulations (Chap-ter 3), we hypothesised that the mobility of STY kinases is an important consider-action to understand the basis of their regulatory features as well. In that case, one would expect the structural fluctuations to be better conserved in closely related STY kinases than distantly related ones. To test our hypothesis, we collated 73 crystal structures containing an STY kinase domain in the active conformation and subjected them to GNM based NMA as described above. The global mode structural fluctuations of pairs of STY kinases of varying evolutionary divergence (same-protein, within-subfamily, within-family, within-group and across-groups) were analysed. We found that the closely related STY kinase pairs (of same-protein and within-subfamily cate-goriest) have more conserved and better correlated structural fluctuations than those that were distantly related (of within-group and across-group categories). This con-serration of flexibility did not trivially follow from sequence/structure conservation, since a substantial 65.4% of variation in fluctuations was not accounted by variations in sequences and/or structures. Across the 73 active STY kinases belonging to different groups, we identified a conserved flexibility signature defined by low magnitude fluctuations of residues in and around the catalytic loop. Interestingly, we also identified sub-structural residue-specific fluctuation profiles characteristic of kinases of different categories. Specifically, fluctuation patterns that are statistically unique to kinase groups (AGC, TK) and families (PKA, CDK) were recognised. These fluctuation signatures localise in sites known to participate in protein-protein interactions typical of the kinase group and family concerned. Thus, we report for the first time that residues characterised by fluctuations that are differentially conserved within a group/family are involved in interactions specific to the group/family. Upon the validation of structural fluctuation as an indicative tool to understand kinase-specific interactions, we elucidate an application of this understanding. In SC kinase, we identified local regions around the age-helix to be exhibiting conserved differential fluctuations in comparison to its close relatives EGFR and Abl. Following from the learning that specific fluctuations are correlated with specific binding, we propose this as an attractive target for drug design, with minimal cross-reactivity. Overall, this chapter demonstrates the conservation of fluctuation in STY kinases and underscores the importance of consideration of fluctuations, over and above sequence and structural features, in understanding the roles of sites characteristic of kinases. Chapter 6 documents the frequency of substitution of structural fluctuations in STY kinases over the course of divergent evolution. So far, we had established that structural fluctuations are evidently distinct in the varied functional states assumed by a single STY kinase (Chapter 2-3). In addition, fluctuations are differentially conserved within closely related kinases, but systematically vary across families (Chapter 5). In this chapter, we quantify the structural fluctuation variations in all residues of STY kinases put together. In a sense, this is the fluctuation space available for STY kinases across their functional states, binding modes, and regulatory mechanisms. To accomplish this, we systematically compiled all known eukaryotic kinase domain structures solved at resolutions better than 3 A˚. These structures were then divided into wild-type (harbouring no mutations and having typical amino acids at critical functional sites), pseudo-kinase (harbouring no mutations, but having unconventional amino acids at critical functional sites), disease mutant (harbouring mutations that have imp-plications in diseases) and mutant of unknown effect (harbouring mutations whose physiological manifestation is unknown) categories. Global mode structural fluctuations were determined for every kinase catalytic domain structure in each of the 4 enlisted categories. Similar to Benioff and Benioff’s BLOSUM that summarised the probability of all possible amino acid substitutions in homologous proteins, we documented a ma tricks of fluctuation substitution frequency in the conserved regions of wild-type kinases (named FLOSUM). We observe a positive correlation between the mean and variance of structural fluctuations at equivalent residue positions in wild-type kinase structures (Spearman rank order correlation, r = 0.69, p value < 1e 139). This implies that the residues with low flexibility, like catalytic loop, do not adopt diverse fluctuations in different functional states or across kinases. Substitution with any other fluctuation is heavily disfavoured at the lower range of flexibility than at the higher range. While we did not detect apparent differences in the FLOSUMs of wild-type, disease mutant and mutants of unknown effect structures, there is a remarkable distinction in the FLOSUM of pseudo-kinases. Fluctuation substitutions that are traditionally unfavourable in wild-type kinases are freely allowed in pseudo-kinases, thus exhibit-in poor conservative substitution. Over and above the lack of conventional amino acids, poor conservation of structural fluctuations and favourable substitution of de-viand fluctuations could render auxiliary functional character to the kinase domain in pseudo-kinases, despite their structural similarity to STY kinases. Taken together, this study summarises the structural fluctuation landscape of STY kinases in the form of a substitution matrix, which can serve as a model of flexibility substitution during protein evolution. Encouraged by structural fluctuations being differentially conserved in closely re-lasted kinases (Chapter 5) and conservatively substituted across kinases (Chapter 6), we extended this principle to the sequences of STY kinases in Chapter 7. This chapter reports the development of a method to predict the sites of functional specialisation in kinases, which differentiate one kinase from another, and applies it to all known STY kinase families. These are correlates of kinase-specific functional and regulatory attributes like specific protein-protein interactions, cognate substrate recognition and response to specific signals. Two cardinal properties of family-specific functional sites, viz., differential conservation and discriminatory ability, were used to identify them. We systematically compiled a data set of 5488 kinase catalytic domain sequences be-longing to 107 families. After aligning them into a single multiple sequence alignment, we comparatively analysed the amino acid distributions in topologically equivalent positions of different families. Based on 3 different analytical measures, physicochemical property, Shannon’s entropy and random probability, we scored the differential conservation of every alignment position in each family. By maximising the disc rim-inability between the kinase families, we integrated the results of the three measures and devised a single unified scoring scheme called ID score. This integrated scoring method could distinguish the 107 families from one another with an accuracy of 99.2%. Each site in every STY kinase family was given a score in the range 0 to 1, with 0 indicating no functional specialisation and 1 indicating maximum functional spa-canalisation, by the ID score. Several validations of the method were carried out to assess its competence. First, we selected those residue positions which have consistently high ID scores across most families. Using these hotspot alignment positions that render specificity to the kinase, we clustered the kinase sequences into groups and families. We found that the ID score predicted sites clustered the kinases better than the traditional clustering using the entire alignment. Despite reduction in information, the increase in accuracy of clustering is feasible because of efficient filtering of non-discriminatory sites by ID score. Second, a linear discriminant classifier was observed to predict the kinase family, based on the ID score predicted sites, better than traditional methods. Third, family-specific protein-protein interaction sites in CDK and substrate recognising distal sites in MAPK were scored significantly higher than other residues by ID score (Two-tailed unpaired t-test, p value < 0.05). Fourth, family-specific oncogenic driver mutation sites in 8 different kinase families were identified confidently by the ID score. Finally, we demonstrate one feasible application of the ID score method in the prediction of specific protein-protein interaction sites. In summary, we developed an integrated discriminatory method to identify regions of functional specialisation in all known kinases, validated the results for known cases and elucidate a potential application of the method. The learning from the entire thesis work is summarised in Chapter 8, which positions the work in the larger context of functioning of the kinase domain and the use of dynamics to interpret protein functions. The validity of the simple, yet use-full, NMA of proteins and complementary MD simulations to understand basic mechanistic and dynamic properties of proteins is highlighted. Similar to sequence and structure, dynamics is now recognised as a crucial feature holding information about protein function. The main learning of the thesis that the flexibility and mobility of STY kinases is conserved and conservatively substituted at different levels of hierarchy (different functional forms within a kinase, across kinase families and across the entire STY kinase superfamily) is discussed. The contributions of the work in fur-the ring the knowledge of specificity determinants in kinases, which dictate precise regulatory and control mechanisms, are presented. Supplementary information helpful in understanding of the results of individual chapters, but could not be printed in the thesis due to its length, are provided in an optical disk attached to the thesis. The material in the optical disk is referred to in appropriate places in the individual chapters
2

Genome-Wide And Structural Analyses Of Protein Kinase Superfamily

Anamika, * 01 1900 (has links)
A signal transduction process refers to chain of highly regulated biochemical steps which results in the transfer of signal in response to a stimulus in the extracellular environment to the intracellular compartments such as nucleus. Variety of biomolecules such as proteins and lipids participate in such processes. One of the superfamilies of proteins which actively participate in signaling processes is protein kinase which transfers γ-phosphate from Adenosine Triphosphate (ATP) to the specific hydroxyl group(s) in the protein substrates. Phosphorylation and dephosphorylation events are critical in many signal transduction pathways affecting biological system as a whole. Protein phosphorylation carried out by protein kinases has emerged as pre-eminent mechanism for the regulation of variety of cellular processes such as cell growth, development, differentiation, homeostasis, apoptosis, metabolism, transcription and translation. The current thesis encompasses the investigations carried out by the author, using various bioinformatics tools and methods, to comprehend the structural and functional roles of diverse set of protein kinase subfamilies in various eukaryotic and prokaryotic organisms. The present thesis has been divided into various chapters. Chapter 1 of the thesis provides introduction to the superfamily of protein kinases and covers the relevant literature. The database of Kinases in Genomes (KinG) set-up in the author’s group a few years ago (Krupa et al, 2004a), comprises of a collection of Serine/Threonine/Tyrosine protein kinases recognized using bioinformatics approaches, from the genomic data of various eukaryotes, prokaryotes and viruses (Krupa et al, 2004a). KinG database also provides classification of protein kinases into various groups and subfamilies (Hanks et al, 1988). Information on non-kinase domains which are tethered to the catalytic kinase domains is also available for every kinase in the KinG database. KinG is periodically (annually) updated with rise in the number of genome sequence datasets of various organisms, increase in the number of known protein domain families and refinement or reannotation of genomic datasets (Anamika et al, 2008c). Author describes the work on annual update of KinG database in Chapter 2 of the thesis. Availability of an improved version of the human genomic data has provided an opportunity to re-investigate protein kinase complement of the human genome and enabled an analysis of the splice variants. This analysis is also described in Chapter 2. Chapter 3, Chapter 4 and Chapter 5 report recognition and analysis of repertoire of protein kinases in Chimpanzee, two Plasmodium species (Plasmodium falciparum and Plasmodium yoelii yoelli) and Entamoeba histolytica respectively. A detailed analysis of the non-kinase domains which are tethered to catalytic protein kinase domains in eukaryotic organisms is presented in Chapter 6. Chapter 7 discusses a systematic classification framework developed by the author to classify Serine/Threonine protein kinases in prokaryotic organisms. Investigation carried out on 3-D structural aspects of protein kinase-substrate interactions is described in Chapter 8. While identifying protein kinases from genomic data occurrence of protein kinase-like non-kinases (PKLNK), which lack aspartate in a specific position in the amino acid sequence (and hence are unlikely to function as a kinase), has also been observed. Chapter 9 presents an analysis of PKLNKs with an objective of obtaining clues to their functions. Chapter 10 summarizes the main conclusions of the thesis and provides an outlook of the current study. Chapter 1: Chapter 1 provides an introduction to cell signaling and the involvement of protein kinases in various signaling pathways compiled from author’s literature survey. This chapter provides a description of molecular events in cell signaling in prokaryotic and eukaryotic organisms. The diversity, specificity and cellular roles of protein kinases are discussed in detail. Chapter 2: Chapter 2 describes KinG (Kinases in Genomes) database which was first established by Krupa et al (2004a). The KinG database is an on-line compilation of the putative Serine/Threonine/Tyrosine protein kinases encoded in the completely sequenced genomes of archaea, eubacteria, viruses and eukaryotes. Surge in the datasets of genomes, improvements in the quality of the genomic data for various organisms and growing number of protein domain families necessitates periodic update of KinG database. The updated version of KinG holds information on protein kinases for 483 organisms (Anamika et al, 2008c). Availability of draft version of the human genome data in 2001 enabled recognition of repertoire of human protein kinases (Krupa and Srinivasan, 2002a; Manning et al, 2002; Kostich et al, 2002). Over the last 7 years human genomic data is being refined and at present the quality of the human genomic data available is much superior to the one available in 2001. By gleaning the latest version of human genome data, 46 new human protein kinase splice variants have been identified which were not recognized in the earlier studies on human kinome. Improper regulation or mutant forms of many of these newly identified protein kinase splice variants are directly involved in various diseases such as different kinds of cancer, Severe Combined Immunodeficiency Disease (SCID) and Huntington disease. In addition, abnormal forms of mouse orthologues of some of the newly identified human kinase splice variants are known to cause various diseases in mice. This raises the possibility of the human orthologues playing similar roles in the disease processes. Such observations and detailed analysis of these protein kinase splice variants would have a profound influence on drug design and development against various diseases. Chapter 3: Investigations on the identification and analysis of protein kinases encoded in the genome of chimpanzee (chimp) has been discussed in Chapter 3. Further, the kinome complement has been compared between chimp and its evolutionary close relative, human (Anamika et al, 2008b). The shared core biology between chimp and human is characterized by many orthologous protein kinases which are involved in conserved pathways. Domain architectures specific to chimp/human kinases have been observed. Chimp kinases with unique domain architectures are characterized by deletion of one or more non-kinase domains present in the human kinases. Interestingly, counterparts of some of the multi-domain human kinases in chimp are characterized by identical domain architectures but with kinase-like non-kinase domain (PKLNK). Remarkably, for 160 out of 587 chimpanzee kinases no human orthologue with sequence identity greater than 95% could be identified. Variations in chimpanzee kinases compared to human kinases are brought about also by differences in functions of domains tethered to the catalytic kinase domain. For example, the heterodimer forming PB1 domain related to the fold of ubiquitin / Ras-binding domain is seen uniquely tethered to PKC-like chimpanzee kinase. Though chimpanzee and human have close evolutionary relationship, there are chimpanzee kinases with no close counterpart in the human suggesting differences in their functions. This chapter provides a direction for experimental analysis of human and chimpanzee protein kinases in order to enhance our understanding on their specific biological roles. Chapter 4: Chapter 4 describes genome-wide comparative analysis for protein kinases encoded in the two apicomplexa namely Plasmodium falciparum (P. falciparum) (3D7 strain) and Plasmodium yoelii yoelii (P. yoelii yoelii) (17XNL strain) genomes which are causative agents of malaria in human and rodent respectively (Anamika and Srinivasan, 2007). Sensitive bioinformatics techniques enable identification of 82 and 60 putative protein kinases in P. falciparum and P. yoelii yoelii respectively. These protein kinases have been classified further into subfamilies based on the extent of sequence similarity of their catalytic domains (Hanks et al, 1988). The most populated kinase subfamilies in both the Plasmodium species correspond to CAMK and CMGC groups. Analysis of domain architectures enables detection of uncommon domain organisation in kinases of both the organisms such as kinase domain tethered to EF hands as well as pleckstrin homology domain. Components of MAPK signaling pathway are not well conserved in Plasmodium species. Such observations suggest that Plasmodium protein kinases are highly divergent from other eukaryotes. A trans-membrane kinase with 6 membrane spanning segments in P. falciparum seems to have no orthologue in P. yoelii yoelii. 19 P. falciparum kinases (Anamika et al, 2005; Anamika and Srinivasan, 2007) have been found to cluster separately from P. yoelii yoelii kinases and hence these kinases are unique to P. falciparum genome. Only 28 orthologous pairs of kinases could be identified between these two Plasmodium species. Comparative kinome analysis of the two Plasmodium species has thus provided clues to the function of many protein kinases based upon their classification and domain organisation and also implicate marked differences even between two Plasmodium species. Chapter 5: Identification and analysis of the repertoire of protein kinases in the intracellular parasite Entamoeba histolytica (E. histolytica) using sensitive sequence and profile search methods forms the basis of Chapter 5. A systematic analysis of a set of 307 protein kinases in E. histolytica genome has been carried out by classifying them into different subfamilies originally defined by Hanks and Hunter (Hanks et al, 1988) and by examining the functional domains which are tethered to the catalytic kinase domains (Anamika et al, 2008a). Compared to other eukaryotic organisms, protein kinases from E. histolytica vary in terms of their domain organisation and displays features that may have a bearing in the unusual biology of this organism. Some of the parasitic kinases show high sequence similarity in the catalytic domain region with calmodulin/calcium dependent protein kinase subfamily. However, they are unlikely to act like calcium/calmodulin dependent kinases as they lack non-catalytic domains characteristic of such kinases in other organisms. Such kinases form the largest subfamily of protein kinases in E. histolytica. Interestingly a Protein Kinase A/Protein Kinase G-like hybrid kinase subfamily member is tethered to pleckstrin homology domain. Although potential cyclins and cyclin-dependent kinases could be identified in the genome the likely absence of other cell cycle proteins suggests unusual nature of cell cycle in E. histolytica. Some of the unusual kinases recognized in the analysis include the absence of Mitogen activated protein kinase kinase (MEK) as a part of the Mitogen Activated Kinase signaling pathway and identification of trans-membraneous kinases with catalytic kinase region showing a good sequence similarity to Src kinases which are usually cytosolic. Sequences which could not be classified into known subfamilies of protein kinases have unusual domain architectures. Many such unclassified protein kinases are tethered to domains which are cysteine-rich and to domains known to be involved in protein-protein interactions. The current chapter on kinome analysis of E. histolytica suggests that the organism possesses a complex protein phosphorylation network that involves many unusual protein kinases. Chapter 6: Protein kinases phosphorylating Serine/Threonine/Tyrosine residues in several cellular proteins exert tight control over their biological functions. They constitute the largest protein family in most eukaryotic species. Classification based on sequence similarity of their catalytic domains, results in clustering of protein kinases sharing gross functional properties into various subfamilies. Many protein kinases are associated or tethered covalently to domains that serve as adapter or regulatory modules, aiding substrate recruitment, specificity, and also serve as scaffolds. Hence the modular organisation of the protein kinases serves as guidelines to their molecular interaction which has been discussed in Chapter 6. Recent studies on repertoires of protein kinases in eukaryotes have revealed wide spectrum of domain organisation in model organisms across various subfamilies. Occurrence of organism-specific novel domain combinations suggests functional diversity achieved by the protein kinase in order to regulate variety of biological processes. In addition, domain architectures of protein kinases revealed existence of hybrid protein kinase subfamilies and their emerging roles in the signaling of eukaryotic organisms. The repertoire of non-kinase domains tethered to multi-domain kinases in the higher eukaryotes is discussed in Chapter 6. Similarities and differences in the domain architectures of protein kinases in these organisms indicate conserved and unique features that are critical to functional specialization. Chapter 7: Chapter 7 describes systematic classification of Serine/Threonine protein kinases encoded in archaeal and eubacterial genomes. Majority of the Serine/Threonine protein kinases which have been identified in archaeal and eubacterial genomes could not be classified into any of the well known subfamilies (Hanks et al, 1988) of protein kinases suggesting their diversity from kinases in eukaryotes. The extensive prokaryotic Serine/Threonine protein kinase dataset obtained from KinG (Krupa et al, 2004a, Anamika et al, 2008c) has given an opportunity to classify these prokaryotic Serine/Threonine protein kinases mainly into three categories based upon sequence identity based clustering: 1) Species/Genus-specific clusters: Species/Genus-specific Serine/Threonine protein kinases contain members from a particular species or genus of the eubacteria or archaea suggesting requirement of these Serine/Threonine protein kinases for certain lineage specific function. 2) Organism-specific clusters: Organism specific clusters has members from certain specific types of organisms which suggests role of these Serine/Threonine protein kinases in some specific function being carried out by limited sets of prokaryotes. 3) Organism-diverse clusters: Organism diverse clusters suggest common function performed by such kinases in wide variety of organisms. Interestingly, occurrence of several species/genus or organism specific subfamilies of prokaryotic kinases contrasts with classification of eukaryotic protein kinases in which most of the popular subfamilies of eukaryotic protein kinases occur diversely in several eukaryotes. Function-based classification has also been proposed which shows that members of each cluster has specific function to perform. In this analysis, almost 50% of the “clusters” obtained have only one member suggesting their sequence and probably functional divergence. Many prokaryotic Serine/Threonine protein kinases exhibit a wide variety of modular organisation which indicates a degree of complexity and protein-protein interactions in the signaling pathways in these microbes. Chapter 8: A wide spectrum of protein kinases belonging to different Hanks and Hunter groups of kinases and subfamilies has been identified in various eukaryotes. However, specific biological targets (substrates) of many protein kinase subfamilies are still unknown and this is one of the active areas of research. In the current analysis reported in Chapter 8, an attempt has been made to understand protein kinase-substrate interaction and substrate consensus prediction by analyzing known 3-D structures of complexes of kinases and peptide substrates/pseudosubstrates. Considering protein kinase ternary complex structures in their active states, it has been observed that protein kinase residues which are interacting with the substrate residues having constraint are at topologically equivalent positions despite belonging to different Hanks and Hunter protein kinase subfamilies. In this analysis, it has also been observed that the residues in a given kinase subfamily interacting with consensus substrate residues are usually conserved across homologues. Interestingly, in Protein Kinase B and Phosphorylase Kinase subfamily homologues, residues interacting with substrate residue/s having no constraint are not well conserved even within the kinase subfamily suggesting different evolutionary rate of substrate interacting residues. This result is anticipated to be helpful in furthering our understanding of protein kinase-substrate relationship which is likely to be helpful in drug design. Chapter 9: Protein Kinase-Like Non-kinases (PKLNKs) are closely related to protein kinases but they lack the crucial catalytic aspartate in the catalytic loop and hence cannot function as a protein kinase. PKLNKs have been analyzed (Chapter 9) with an objective of obtaining clues about their functions. Using various sensitive sequence analysis methods, 82 PKLNKs from four higher eukaryotic organisms namely, Homo sapiens, Mus Musculus, Rattus norvegicus and Drosophila melanogaster have been recognized. On the basis of their domain combinations and functions of tethered domains, PKLNKs have been classified mainly into four categories: 1) Ligand binding PKLNKs 2) PKLNKs having extracellular protein-protein interaction domain 3) PKLNKs involved in dimerization 4) PKLNKs with cytoplasmic protein-protein interaction module. While members of the first two classes of PKLNKs have transmembrane domain tethered to the PKLNK domain, members of the other two classes of PKLNKs are entirely cytoplasmic in nature. The current classification scheme hopes to provide a convenient framework to classify the PKLNKs from other eukaryotes and it should be helpful in deciphering their roles in cellular processes. Chapter 10: This is a chapter on conclusions of the entire thesis work. Summary of the major outcomes of this thesis work is provided and implications of the work in the area of signal transduction are discussed. In addition to above mentioned work, studies on repertoire of protein kinases from two plant organisms have been carried out and the kinomes have been comparatively analyzed (Krupa et al, 2006) (Appendix 1). Comparison of plant protein kinases with other eukaryotes revealed remarkable differences. Trans-genomic comparison of the protein kinase repertoires of Arabidopsis thaliana and Oryza sativa has enabled identification of members that are uniquely conserved within the two species. Analysis on the domain organisation of plant protein kinases has also been carried out. Appendix 2 presents the work done on Entamoeba histolytica (E. histolytica) ornithine decarboxylase (ODC)-like protein which regulates the polyamine biosynthesis. DFMO (Difluoromethylornithine) is unable to inhibit the E. histolytica ODC-like protein while it inhibits the homologues of ODC in other organisms. Modelling study has suggested substitution of three amino acids in the E. histolytica ODC-like protein because of which DFMO is unable to inhibit the activity of ODC-like protein (Jhingran et al, 2008). All the computational modeling work reported in Appendix 2 was performed by the author while all the laboratory experiments were performed in the laboratory of the collaborator Prof. Madhubala of JNU, New Delhi. The supplementary data pertaining to this thesis is presented in an accompanying CD. The supplementary data in this CD is organized into different folders corresponding to various chapters.

Page generated in 0.081 seconds