• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 2
  • 1
  • Tagged with
  • 3
  • 3
  • 2
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Random and rational evolution of tautomerase superfamily members : analysis and implications

Darty, Joseph Edward 10 April 2014 (has links)
P[Kappa]a is not responsible for the improved activity. Hence, stabilization of an enediolate intermediate may be important for catalysis. In the second part of this work, the Chloroflexus aurantiacus J-10-fl heterohexameric 4-OT tautomerase was employed in random and rational directed evolution studies to introduce a CaaD activity. Genetic selection and a high throughput screening assay were used to identify mutants. Genetic selection was unsuccessful due to plasmid instability in the host strain. A small mutant library in the screening assay precluded the identification of any mutants with CaaD activity. Finally, rational design using structure-function relationships was investigated and a single mutant was discovered for hh4-OT that incorporated CaaD activity into the enzyme, the [alpha]L9R hh4-OT, this mutant has been characterized kinetically and the evolutionary implications for the tautomerase superfamily are described. / text
2

Statistical Analysis of Biological Interactions from Homologous Proteins

Xu, Qifang January 2008 (has links)
Information fusion aims to develop intelligent approaches of integrating information from complementary sources, such that a more comprehensive basis is obtained for data analysis and knowledge discovery. Our Protein Biological Unit (ProtBuD) database is the first database that integrated the biological unit information from the Protein Data Bank (PDB), Protein Quaternary Server (PQS) and Protein Interfaces, Surfaces and Assemblies (PISA) server, and compared the three biological units side-by-side. The statistical analyses show that the inconsistency within these databases and between them is significant. In order to improve the inconsistency, we studied interfaces across different PDB entries in a protein family using an assumption that interfaces shared by different crystal forms are likely to be biologically relevant. A novel computational method is proposed to achieve this goal. First, redundant data were removed by clustering similar crystal structures, and a representative entry was used for each cluster. Then a modified k-d tree algorithm was applied to facilitate the computation of identifying interfaces from crystals. The interface similarity functions were derived from Gaussian distributions fit to the data. Hierarchical clustering was used to cluster interfaces to define the likely biological interfaces by the number of crystal forms in a cluster. Benchmark data sets were used to determine whether the existence or lack of existence of interfaces across multiple crystal forms can be used to predict whether a protein is an oligomer or not. The probability that a common interface is biological is given. An interface shared in two different crystal forms by divergent proteins is very likely to be biologically important. The interface data not only provide new interaction templates for computational modeling, but also provide more accurate data for training sets and testing sets in data-mining research to predict protein-protein interactions. In summary, we developed a framework which is based on databases where different biological unit information is integrated and new interface data are stored. In order for users from the biology community to use the data, a stand-alone software program, a web site with a user-friendly graphical interface, and a web service are provided. / Computer and Information Science
3

Analysis Of Protein Evolution And Its Implications In Remote Homology Detection And Function Recognition

Gowri, V S 10 1900 (has links)
One of the major outcomes of a genome sequencing project is the availability of amino acid sequences of all the proteins encoded in the genome of the organism concerned. However, most commonly, for a substantial proportion of the proteins encoded in the genome no information in function is available either from experimental studies or by inference on the basis of homology with a protein of known function. Even if the general function of a protein is known, the region of the protein corresponding to the function might be a domain and there may be additional regions of considerable length in the protein with no known function. In such cases the information on function is incomplete. Lack of understanding of the repertoire of functions of proteins encoded in the genome limits the utility of the genomic data. While there are many experimental approaches available for deciphering functions of proteins at the genomic scale, bioinformatics approaches form a good early step in obtaining clues about functions of proteins at the genomic scale (Koonin et al, 1998). One of the common bioinformatics approaches is recognition of function by homology (Bork et al, 1994). If the evolutionary relationship between two proteins, one with known function and the other with unknown function, could be established it raises the possibility of common function and 3-D structure for these proteins(Bork and Gibson, 1996). While this approach is effective its utility is limited by the ability of the bioinformatics approach to identify related proteins when their evolutionary divergence is high leading to low amino acid sequence similarity which is typical of two unrelated proteins (Bork and Koonin, 1998). Use of 3-D structural information, obtained by predictive methods such as fold recognition, has offered approaches towards increasing the sensitivity of remote homology detection 9e.g., Kelley et al, 2000; Shi et al, 2001; Gough et al, 2001). The work embodied in this thesis has the general objective of analysis of evolution of structural features and functions of families of proteins and design of new bioinformatics approaches for recognizing distantly related proteins and their applications. After an introductory chapter, a few chapters report analysis of functional and structural features of homologous protein domains. Further chapters report development and assessment of new remote homology detection approaches and applications to the proteins encoded in two protozoan organisms. A further chapter is presented on the analysis of proteins involved in methylglyoxal detoxification pathways in kinetoplastid organisms. Chapter I of the thesis presents a brief introduction, based on the information available in the literature, to protein structures, classification, methods for structure comparison, popular methods for remote homology detection and homology-based methods for function annotation. Chapter 2 describes the steps involved in the update and improvements made in this database. In addition to the update, the domain structural families are integrated with the homologous sequences from the sequence databases. Thus, every family in PALI is enriched with a substantial volume of sequence information from proteins with no known structural information. Chapter 3 reports investigations on the inter-relationships between sequence, structure and functions of closely-related homologous enzyme domain families. Chapter 4 describes the investigations on the unusual differences in the lengths of closely-related homologous protein domains, accommodation of additional lengths in protein 3-D structures and their functional implications. Chapter 5 reports the development and assessment of a new approach for remote homology detection using dynamic multiple profiles of homologous protein domain families. Chapter 6 describes development of another remote homology detection approach which are multiple, static profiles generated using the bonafide members of the family. A rigorous assessment of the approach and strategies for improving the detection of distant homologues using the multiple profile approach are discussed in this chapter. Chapter 7 describes results of searches made in the database of multiple family profiles (MulPSSM database) in order to recognize the functions of hypothetical proteins encoded in two parasitic protozoa. Chapter 8 describes the sequence and structural analyses of two glyoxalase pathway proteins from the kinetoplastid organism Leishmania donovani which causes Leishmaniases. An alternate enzyme, which would probably substitute the glyoxalase pathway enzymes in certain kinetoplastid organisms which lack the glyoxalase enzymes are also discussed. Chapter 9 summarises the important findings from the various analyses discussed in this thesis. Appendix describes an analysis on the correlation between a measure of hydrophobicity of amino acid residues aligned in a multiple sequence alignment and residue depth in 3-D structures of proteins.

Page generated in 0.0729 seconds