Global ETD Search

Return to search

Optimizing hydropathy scale to improve IDP prediction and characterizing IDPs' functions

Indiana University-Purdue University Indianapolis (IUPUI) / Intrinsically disordered proteins (IDPs) are flexible proteins without defined 3D structures. Studies show that IDPs are abundant in nature and actively involved in numerous biological processes. Two crucial subjects in the study of IDPs lie in analyzing IDPs’ functions and identifying them. We thus carried out three projects to better understand IDPs. In the 1st project, we propose a method that separates IDPs into different function groups. We used the approach of CH-CDF plot, which is based the combined use of two predictors and subclassifies proteins into 4 groups: structured, mixed, disordered, and rare. Studies show different structural biases for each group. The mixed class has more order-promoting residues and more ordered regions than the disordered class. In addition, the disordered class is highly active in mitosis-related processes among others. Meanwhile, the mixed class is highly associated with signaling pathways, where having both ordered and disordered regions could possibly be important. The 2nd project is about identifying if an unknown protein is entirely disordered. One of the earliest predictors for this purpose, the charge-hydropathy plot (C-H plot), exploited the charge and hydropathy features of the protein. Not only is this algorithm simple yet powerful, its input parameters, charge and hydropathy, are informative and readily interpretable. We found that using different hydropathy scales significantly affects the prediction accuracy. Therefore, we sought to identify a new hydropathy scale that optimizes the prediction. This new scale achieves an accuracy of 91%, a significant improvement over the original 79%. In our 3rd project, we developed a per-residue C-H IDP predictor, in which three hydropathy scales are optimized individually. This is to account for the amino acid composition differences in three regions of a protein sequence (N, C terminus and internal). We then combined them into a single per-residue predictor that achieves an accuracy of 74% for per-residue predictions for proteins containing long IDP regions.

Intrinsically disordered proteins

Support vector machine

Clustering

Proteins -- Conformation -- Research

Proteins -- Denaturation

Protein folding -- Research

Support vector machines

Aggregation (Chemistry)

Amino acids -- Analysis

Cellular signal transduction

Molecular biology -- Mathematics

Algorithms

Identifer	oai:union.ndltd.org:IUPUI/oai:scholarworks.iupui.edu:1805/5191
Date	January 2014
Creators	Huang, Fei
Contributors	Dunker, A. Keith, Chen, Jake, Hurley, Thomas D., 1961-, Shen, Li
Source Sets	Indiana University-Purdue University Indianapolis
Language	en_US
Detected Language	English
Type	Thesis

Page generated in 0.002 seconds

Optimizing hydropathy scale to improve IDP prediction and characterizing IDPs' functions

Description

Links & Downloads

Tags

Additional Fields