Global ETD Search

91	Optimizing hydropathy scale to improve IDP prediction and characterizing IDPs' functions Huang, Fei January 2014 (has links) Indiana University-Purdue University Indianapolis (IUPUI) / Intrinsically disordered proteins (IDPs) are flexible proteins without defined 3D structures. Studies show that IDPs are abundant in nature and actively involved in numerous biological processes. Two crucial subjects in the study of IDPs lie in analyzing IDPs’ functions and identifying them. We thus carried out three projects to better understand IDPs. In the 1st project, we propose a method that separates IDPs into different function groups. We used the approach of CH-CDF plot, which is based the combined use of two predictors and subclassifies proteins into 4 groups: structured, mixed, disordered, and rare. Studies show different structural biases for each group. The mixed class has more order-promoting residues and more ordered regions than the disordered class. In addition, the disordered class is highly active in mitosis-related processes among others. Meanwhile, the mixed class is highly associated with signaling pathways, where having both ordered and disordered regions could possibly be important. The 2nd project is about identifying if an unknown protein is entirely disordered. One of the earliest predictors for this purpose, the charge-hydropathy plot (C-H plot), exploited the charge and hydropathy features of the protein. Not only is this algorithm simple yet powerful, its input parameters, charge and hydropathy, are informative and readily interpretable. We found that using different hydropathy scales significantly affects the prediction accuracy. Therefore, we sought to identify a new hydropathy scale that optimizes the prediction. This new scale achieves an accuracy of 91%, a significant improvement over the original 79%. In our 3rd project, we developed a per-residue C-H IDP predictor, in which three hydropathy scales are optimized individually. This is to account for the amino acid composition differences in three regions of a protein sequence (N, C terminus and internal). We then combined them into a single per-residue predictor that achieves an accuracy of 74% for per-residue predictions for proteins containing long IDP regions. Intrinsically disordered proteins Support vector machine Clustering Proteins -- Conformation -- Research Proteins -- Denaturation Protein folding -- Research Support vector machines Aggregation (Chemistry) Amino acids -- Analysis Cellular signal transduction Molecular biology -- Mathematics Algorithms
92	Characterization of a fatty acid elongase condensing enzyme by site-directed mutagenesis and biochemical analysis Hernandez-Buquer, Selene January 2014 (has links) Indiana University-Purdue University Indianapolis (IUPUI) / Fatty acid elongation is the extension of de novo synthesized fatty acids through a series of four reactions analogous to those of fatty acid synthase. ELOs catalyze the first reaction in the elongation pathway through the condensation of an acyl group with a two carbon unit derived from malonyl-CoA. This study uses the condensing enzyme, EloA, from the cellular slime mold, Dictyostelium discoideum as a model for the family of ELOs. EloA has substrate specificity for monounsaturated and saturated C16 fatty acids and catalyzes the elongation of 16:1Δ9 to 18:1Δ11. Site-directed mutagenesis was used to change residues highly conserved among the ELO family to examine their potential role in the condensation reaction. Mutant EloAs were expressed in yeast and fatty acid methyl esters prepared from total cellular lipids were analyzed by gas chromatography/mass spectrometry. Sixteen out of twenty mutants had a decrease in 18:1Δ11 production when compared to the wild-type EloA with little to no activity observed in ten mutants, four mutants had within 20% of wild-type activity, and six mutants had 10-60% of wild-type activity. Immunoblot studies using anti-EloA serum were used to determine if the differences in elongation activity were related to changes in protein expression for each mutant. Analysis of immunoblots indicated that those mutants with little to no activity, with the exception of T130A and Q203A, had x comparable protein expression to the wild-type. Further research included the solubilization of the His6-ELoA fusion protein and preliminary work toward the isolation of the tagged protein and the use of a radiolabeled condensation assay to determine the activity of the eluted protein. Preliminary results indicated that the protein was solubilized but the eluted protein showed no activity when examined by a condensation assay. The work presented here contributes to a better understanding of the role of certain amino acid residues in the activity of EloA and serves as a stepping-stone for future EloA isolation work. ELONGASE, CONDENSING ENZYME Lipids -- Metabolism -- Research Enzymes -- Analysis Biochemistry -- Technique Mutagenesis Dictyostelium discoideum Proteins -- Structure Saccharomyces cerevisiae Oligonucleotides Microsomes Yeast fungi Biosynthesis -- Research Gas chromatography Mass spectrometry Polymerase chain reaction
93	Mechanisms of binding diversity in protein disorder : molecular recognition features mediating protein interaction networks Hsu, Wei-Lun 25 February 2014 (has links) Indiana University-Purdue University Indianapolis (IUPUI) / Intrinsically disordered proteins are proteins characterized by lack of stable tertiary structures under physiological conditions. Evidence shows that disordered proteins are not only highly involved in protein interactions, but also have the capability to associate with more than one partner. Short disordered protein fragments, called “molecular recognition features” (MoRFs), were hypothesized to facilitate the binding diversity of highly-connected proteins termed “hubs”. MoRFs often couple folding with binding while forming interaction complexes. Two protein disorder mechanisms were proposed to facilitate multiple partner binding and enable hub proteins to bind to multiple partners: 1. One region of disorder could bind to many different partners (one-to-many binding), so the hub protein itself uses disorder for multiple partner binding; and 2. Many different regions of disorder could bind to a single partner (many-to-one binding), so the hub protein is structured but binds to many disordered partners via interaction with disorder. Thousands of MoRF-partner protein complexes were collected from Protein Data Bank in this study, including 321 one-to-many binding examples and 514 many-to-one binding examples. The conformational flexibility of MoRFs was observed at atomic resolution to help the MoRFs to adapt themselves to various binding surfaces of partners or to enable different MoRFs with non-identical sequences to associate with one specific binding pocket. Strikingly, in one-to-many binding, post-translational modification, alternative splicing and partner topology were revealed to play key roles for partner selection of these fuzzy complexes. On the other hand, three distinct binding profiles were identified in the collected many-to-one dataset: similar, intersecting and independent. For the similar binding profile, the distinct MoRFs interact with almost identical binding sites on the same partner. The MoRFs can also interact with a partially the same but partially different binding site, giving the intersecting binding profile. Finally, the MoRFs can interact with completely different binding sites, thus giving the independent binding profile. In conclusion, we suggest that protein disorder with post-translational modifications and alternative splicing are all working together to rewire the protein interaction networks. Disordered binding site Molecular recognition feature Intrinsically disordered protein Binding diversity Protein interaction networks One-to-many binding Many-to-one binding Partner selection MoRF Proteins -- Conformation Proteins -- Structure -- Databases Nucleic acids -- Research -- Technique Protein binding Molecular association -- Research Proteins -- Analysis Proteins -- Physiological transport Peptides
94	Structural analysis of the potential therapeutic targets from specific genes in Methicillin-resistant Staphylococcus aureus (MRSA) Yan, Xuan January 2011 (has links) The thesis describes over-expression, purification and crystallization of three proteins from Staphylococcus aureus (S. aureus). S. aureus is an important human pathogen and methicillin-resistant S. aureus (MRSA) is a serious problem in hospitals nowadays. The crystal structure of 3-Methyladenine DNA glycosylase I (TAG) was determined by single-wavelength anomalous diffraction (SAD) method. TAG is responsible for DNA repair and is an essential gene for both MRSA and methicilin-susceptible S. aureus (MSSA). The structure was also determined in complex with 3-methyladenine (3-MeA) and was solved using molecular replacement (MR) method. An assay was carried out and the molecular basis of discrimination between 3-MeA and adenosine was determined. The native crystal structure of fructose 1-phosphate kinase (PFK) from S. aureus was determined to 2.30 Å and solved using molecular replacement method. PFK is an essential enzyme involved in the central metabolism of MRSA. Despite extensive efforts no co-complex was determined, although crystals were obtained they diffracted poorly. An assay which can be used to test for inhibitors has been developed. Mevalonate Kinase (MK) is another essential enzyme in MRSA and is a key drug target in the mevalonate pathway. Native data diffracting to 2.2 Å was collected. The structure was solved using multiple isomorphorus replacement (MIR) method. A citrate molecule was bound at the MK active site, arising from the crystallization condition. The citrate molecule indicates how substrate might bind. The protein was kinetically characterized. A thermodynamic analysis using fluorescence-based method was carried out on each protein to investigate binding interactions of potential fragments and thus a drug design starting point. 579
95	Critical assessment of predicted interactions at atomic resolution Mendez Giraldez, Raul 21 September 2007 (has links) Molecular Biology has allowed the characterization and manipulation of the molecules of life in the wet lab. Also the structures of those macromolecules are being continuously elucidated. During the last decades of the past century, there was an increasing interest to study how the different genes are organized into different organisms (‘genomes’) and how those genes are expressed into proteins to achieve their functions. Currently the sequences for many genes over several genomes have been determined. In parallel, the efforts to have the structure of the proteins coded by those genes go on. However it is experimentally much harder to obtain the structure of a protein, rather than just its sequence. For this reason, the number of protein structures available in databases is an order of magnitude or so lower than protein sequences. Furthermore, in order to understand how living organisms work at molecular level we need the information about the interaction of those proteins. Elucidating the structure of protein macromolecular assemblies is still more difficult. To that end, the use of computers to predict the structure of these complexes has gained interest over the last decades.<p>The main subject of this thesis is the evaluation of current available computational methods to predict protein – protein interactions and build an atomic model of the complex. The core of the thesis is the evaluation protocol I have developed at Service de Conformation des Macromolécules Biologiques et de Bioinformatique, Université Libre de Bruxelles, and its computer implementation. This method has been massively used to evaluate the results on blind protein – protein interaction prediction in the context of the world-wide experiment CAPRI, which have been thoroughly reviewed in several publications [1-3]. In this experiment the structure of a protein complex (‘the target’) had to be modeled starting from the coordinates of the isolated molecules, prior to the release of the structure of the complex (this is commonly referred as ‘docking’).<p>The assessment protocol let us compute some parameters to rank docking models according to their quality, into 3 main categories: ‘Highly Accurate’, ‘Medium Accurate’, ‘Acceptable’ and ‘Incorrect’. The efficiency of our evaluation and ranking is clearly shown, even for borderline cases between categories. The correlation of the ranking parameters is analyzed further. In the same section where the evaluation protocol is presented, the ranking participants give to their predictions is also studied, since often, good solutions are not easily recognized among the pool of computer generated decoys.<p>An overview of the CAPRI results made per target structure and per participant regarding the computational method they used and the difficulty of the complex. Also in CAPRI there is a new ongoing experiment about scoring previously and anonymously generated models by other participants (the ‘Scoring’ experiment). Its promising results are also analyzed, in respect of the original CAPRI experiment. The Scoring experiment was a step towards the use of combine methods to predict the structure of protein – protein complexes. We discuss here its possible application to predict the structure of protein complexes, from a clustering study on the different results.<p>In the last chapter of the thesis, I present the preliminary results of an ongoing study on the conformational changes in protein structures upon complexation, as those rearrangements pose serious limitations to current computational methods predicting the structure protein complexes. Protein structures are classified according to the magnitude of its conformational re-arrangement and the involvement of interfaces and particular secondary structure elements is discussed. At the end of the chapter, some guidelines and future work is proposed to complete the survey. / Doctorat en Sciences / info:eu-repo/semantics/nonPublished Sciences exactes et naturelles Chimie Structural bioinformatics Molecular structure Proteins -- Conformation Proteins -- Structure Amino acids Bio-informatique structurale Structure moléculaire Protéines -- Conformation Protéines -- Structure Acides aminés protein - protein complex protein - protein interaction root mean square deviation docking solvent accessible area conformational change rmsd
96	Développement de potentiels statistiques pour l'étude in silico de protéines et analyse de structurations alternatives / Development of statistical potentials for the [study] in silico study of proteins and analysis of alternative structuring. Dehouck, Yves 20 May 2005 (has links) Cette thèse se place dans le cadre de l'étude in silico, c'est-à-dire assistée par ordinateur, des liens qui unissent la séquence d'une protéine à la (ou aux) structure(s) tri-dimensionnelle(s) qu'elle adopte. Le décryptage de ces liens présente de nombreuses applications dans divers domaines et constitue sans doute l'une des problématiques les plus fascinantes de la recherche en biologie moléculaire.<p><p>Le premier aspect de notre travail concerne le développement de potentiels statistiques dérivés de bases de données de protéines dont les structures sont connues. Ces potentiels présentent plusieurs avantages: ils peuvent être aisément adaptés à des représentations structurales simplifiées, et permettent de définir un nombre limité de fonctions énergétiques qui incarnent l'ensemble complexe d'interactions gouvernant la structure et la stabilité des protéines, et qui incluent également certaines contributions entropiques. Cependant, leur signification physique reste assez nébuleuse, car l'impact des diverses hypothèses nécessaires à leur dérivation est loin d'être clairement établi. Nous nous sommes attachés à l'étude de certaines limitations des ces potentiels: leur dépendance en la taille des protéines incluses dans la base de données, la non-additivité des termes de potentiels, et l'importance souvent négligée de l'environnement protéique spécifique ressenti par chaque résidu. Nous avons ainsi mis en évidence que l'influence de la taille des protéines de la base de données sur les potentiels de distance entre résidus est spécifique à chaque paire d'acides aminés, peut être relativement importante, et résulte essentiellement de la répartition inhomogène des résidus hydrophobes et hydrophiles entre le coeur et la surface des protéines. Ces résultats ont guidé la mise au point de fonctions correctives qui permettent de tenir compte de cette influence lors de la dérivation des potentiels. Par ailleurs, la définition d'une procédure générale de dérivation de potentiels et de termes de couplage a rendu possible la création d'une fonction énergétique qui tient compte simultanément de plusieurs descripteurs de séquence et de structure (la nature des résidus, leurs conformations, leurs accessibilités au solvant, ainsi que les distances qui les séparent dans l'espace et le long de la séquence). Cette fonction énergétique présente des performances nettement améliorées par rapport aux potentiels originaux, et par rapport à d'autres potentiels décrits dans la littérature.<p><p>Le deuxième aspect de notre travail concerne l'application de programmes basés sur des potentiels statistiques à l'étude de protéines qui adoptent des structures alternatives. La permutation de domaines est un phénomène qui affecte diverses protéines et qui implique la génération d'un oligomère suite à l'échange de fragments structuraux entre monomères identiques. Nos résultats suggèrent que la présence de "faiblesses structurales", c'est-à-dire de régions qui ne sont pas optimales vis-à-vis de la stabilité de la structure native ou qui présentent une préférence marquée pour une conformation non-native en absence d'interactions tertiaires, est intimement liée aux mécanismes de permutation. Nous avons également mis en évidence l'importance des interactions de type cation-{pi}, qui sont fréquemment observées dans certaines zones clés de la permutation. Finalement, nous avons sélectionné un ensemble de mutations susceptibles de modifier sensiblement la propension de diverses protéines à permuter. L'étude expérimentale de ces mutations devrait permettre de valider, ou de raffiner, les hypothèses que nous avons proposées quant au rôle joué par les faiblesses structurales et les interactions de type cation-{pi}. Nous avons également analysé une autre protéine soumise à d'importants réarrangements conformationnels: l'{alpha}1-antitrypsine. Dans le cas de cette protéine, les modifications structurales sont indispensables à l'exécution de l'activité biologique normale, mais peuvent sous certaines conditions mener à la formation de polymères insolubles et au développement de maladies. Afin de contribuer à une meilleure compréhension des mécanismes responsables de la polymérisation, nous avons cherché à concevoir rationnellement des protéines mutantes qui présentent une propension à polymériser contrôlée. Des tests expérimentaux ont été réalisés par le groupe australien du Professeur S.P. Bottomley, et ont permis de valider nos prédictions de manière assez remarquable.<p><p><p><p>The work presented in this thesis concerns the computational study of the relationships between the sequence of a protein and its three-dimensional structure(s). The unravelling of these relationships has many applications in different domains and is probably one of the most fascinating issues in molecular biology.<p><p>The first part of our work is devoted to the development of statistical potentials derived from databases of known protein structures. These potentials allow to define a limited number of energetic functions embodying the complex ensemble of interactions that rule protein folding and stability (including some entropic contributions), and can be easily adapted to simplified representations of protein structures. However, their physical meaning remains unclear since several hypotheses and approximations are necessary, whose impact is far from clearly understood. We studied some of the limitations of these potentials: their dependence on the size of the proteins included in the database, the non-additivity of the different potential terms, and the importance of the specific environment of each residue. Our results show that residue-based distance potentials are affected by the size of the database proteins, and that this effect can be quite strong, is residue-specific, and seems to result mostly from the inhomogeneous partition of hydrophobic and hydrophilic residues between the surface and the core of proteins. On the basis of these observations, we defined a set of corrective functions in order to take protein size into account while deriving the potentials. On the other hand, we developed a general procedure of derivation of potentials and coupling terms and consequently created an energetic function describing the correlations between several sequence and structure descriptors (the nature of each residue, the conformation of its main chain, its solvent accessibility, and the distances that separate it from other residues, in space and along the sequence). This energetic function presents a strongly improved predictive power, in comparison with the original potentials and with other potentials described in the literature.<p><p>The second part describes the application of different programs, based on statistical potentials, to the study of proteins that adopt alternative structures. Domain swapping involves the exchange of a structural element between identical proteins, and leads to the generation of an oligomeric unit. We showed that the presence of “structural weaknesses”, regions that are not optimal with respect to the folding mechanisms or to the stability of the native structure, seems to be intimately linked with the swapping mechanisms. In addition, cation-{pi} interactions were frequently detected in some key locations and might also play an important role. Finally, we designed a set of mutations that are likely to affect the swapping propensities of different proteins. The experimental study of these mutations should allow to validate, or refine, our hypotheses concerning the importance of structural weaknesses and cation-{pi} interactions. We also analysed another protein that undergoes large conformational changes: {alpha}1-antitrypsin. In this case, the structural modifications are necessary to the proper execution of the biological activity. However, under certain circumstances, they lead to the formation of insoluble polymers and the development of diseases. With the aim of reaching a better understanding of the mechanisms that are responsible for this polymerisation, we tried to design mutant proteins that display a controlled polymerisation propensity. An experimental study of these mutants was conducted by the group of Prof. S.P. Bottomley, and remarkably confirmed our predictions.<p> / Doctorat en sciences appliquées / info:eu-repo/semantics/nonPublished Sciences de l'ingénieur Chimie Molecular biology -- Methodology Proteins -- Biotechnology Proteins -- Analysis Proteins -- Structure Biologie moléculaire -- Méthodologie Protéines -- Biotechnologie Protéines -- Analyse Protéines -- Structure statistical potentials protein structure structural modifications modifications structurales stabilité des protéines potentiels de force moyenne structure des protéines permutation de domaines domain swapping protein stability mean force potentials
97	A Computational Study of the Mechanism for F1-ATPase Inhibition by the Epsilon Subunit Thomson, Karen J. January 2013 (has links) Indiana University-Purdue University Indianapolis (IUPUI) / The multi-protein complex of F0F1 ATP synthase has been of great interest in the fields of microbiology and biochemistry, due to the ubiquitous use of ATP as a biological energy source. Efforts to better understand this complex have been made through structural determination of segments based on NMR and crystallographic data. Some experiments have provided useful data, while others have brought up more questions, especially when structures and functions are compared between bacteria and species with chloroplasts or mitochondria. The epsilon subunit is thought to play a signi cant role in the regulation of ATP synthesis and hydrolysis, yet the exact pathway is unknown due to the experimental difficulty in obtaining data along the transition pathway. Given starting and end point protein crystal structures, the transition pathway of the epsilon subunit was examined through computer simulation.The purpose of this investigation is to determine the likelihood of one such proposed mechanism for the involvement of the epsilon subunit in ATP regulation in bacterial species such as E. coli. ATP synthase molecular dynamics computer simulation CHARMM Adenosine triphosphatase -- Regulation Computational biology Escherichia coli -- Research Proteins -- Structure Molecular biology -- Mathematical models Chemistry -- Data processing Proteins -- Conformation Crystallography -- Technique Molecular dynamics Bioenergetics Computer simulation
98	Protein function prediction by integrating sequence, structure and binding affinity information Zhao, Huiying 03 February 2014 (has links) Indiana University-Purdue University Indianapolis (IUPUI) / Proteins are nano-machines that work inside every living organism. Functional disruption of one or several proteins is the cause for many diseases. However, the functions for most proteins are yet to be annotated because inexpensive sequencing techniques dramatically speed up discovery of new protein sequences (265 million and counting) and experimental examinations of every protein in all its possible functional categories are simply impractical. Thus, it is necessary to develop computational function-prediction tools that complement and guide experimental studies. In this study, we developed a series of predictors for highly accurate prediction of proteins with DNA-binding, RNA-binding and carbohydrate-binding capability. These predictors are a template-based technique that combines sequence and structural information with predicted binding affinity. Both sequence and structure-based approaches were developed. Results indicate the importance of binding affinity prediction for improving sensitivity and precision of function prediction. Application of these methods to the human genome and structure genome targets demonstrated its usefulness in annotating proteins of unknown functions and discovering moon-lighting proteins with DNA,RNA, or carbohydrate binding function. In addition, we also investigated disruption of protein functions by naturally occurring genetic variations due to insertions and deletions (INDELS). We found that protein structures are the most critical features in recognising disease-causing non-frame shifting INDELs. The predictors for function predictions are available at http://sparks-lab.org/spot, and the predictor for classification of non-frame shifting INDELs is available at http://sparks-lab.org/ddig. protein function Artificial intelligence Algorithms Protein-protein interactions Genomics -- Data processing Proteins -- Analysis Expert systems (Computer science) Data mining Bioinformatics -- Research DNA-protein interactions RNA-protein interactions Carbohydrates

Search results