Global ETD Search

Prediction of Protein Secondary Structure by Incorporating Biophysical Information into Artificial Neural Networks

<p>This project applied artificial neural networks to protein secondary structure prediction. A NETtalk architecture with a window size of 13 was used. Over-fitting was avoided by representing each amino acid with 3 real numbers, which reduced the number of adjustable weights to 840. Two alternative representations of amino acids that incorporated biophysical data were created and tested. They were tested both separately and in combination on a standard 7-fold cross-validation set of 126 proteins. The best performance was achieved by averaging the results of two predictions and then filtering the averaged output. Accuracy levels for core structures were: a total Q3 accuracy of 61.3%, consisting of Q3 accuracies of 54.0%, 38.1% and 77.0% for helix, strand and coil respectively, with Matthews correlation coefficients Ca = 0.34, Cb = 0.26 and Cc = 0.31. The average lengths of the predicted structures were 9.8, 4.9 and 11.0 for helix, sheet and coil respectively. These results are lower than those of other methods using single sequences and localist representations. The most likely reason for this is over-generalisation caused by using a small number of units.</p>
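
The accuracy figures quoted in the abstract (total Q3, per-state Q3 and Matthews correlation coefficients) can be illustrated with a minimal Python sketch. It is not taken from the thesis. It assumes three-state labels written as the letters H, E and C (helix, strand, coil), assumes the per-state accuracy is measured relative to the observed residues of that state, and uses the standard Matthews formula. All function names and the toy sequences are hypothetical.

```python
from math import sqrt

# Assumed label alphabet: H = helix, E = strand, C = coil.
STATES = ("H", "E", "C")


def q3_total(observed, predicted):
    """Fraction of residues whose predicted state equals the observed state."""
    correct = sum(o == p for o, p in zip(observed, predicted))
    return correct / len(observed)


def q_state(observed, predicted, state):
    """Fraction of residues observed in `state` that were also predicted as `state`."""
    n_observed = sum(o == state for o in observed)
    hits = sum(o == state and p == state for o, p in zip(observed, predicted))
    return hits / n_observed if n_observed else 0.0


def matthews(observed, predicted, state):
    """Matthews correlation coefficient for one state against the other two."""
    tp = fp = tn = fn = 0
    for o, p in zip(observed, predicted):
        if p == state and o == state:
            tp += 1
        elif p == state:
            fp += 1
        elif o == state:
            fn += 1
        else:
            tn += 1
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Return 0.0 when the coefficient is undefined (a zero marginal count).
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy example (made-up sequences, not data from the thesis).
observed = "HHHHEECCCC"
predicted = "HHHCEECCCH"

print("Q3 total:", q3_total(observed, predicted))
for s in STATES:
    print(s, q_state(observed, predicted, s), matthews(observed, predicted, s))
```

In this toy case 8 of 10 residues match, so the total Q3 is 0.8, while the per-state values differ because each state has a different number of observed residues. The same effect explains how a 61.3% total can coexist with per-state accuracies ranging from 38.1% to 77.0%.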

http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-235

Computer and systems science

Data- och systemvetenskap

Identifier	oai:union.ndltd.org:UPSALLA/oai:DiVA.org:his-235
Date	January 1998
Creators	Alistair, Chalk
Publisher	University of Skövde, Department of Computer Science, Skövde : Institutionen för datavetenskap
Source Sets	DiVA Archive at Uppsala University
Language	English
Detected Language	English
Type	Student thesis, text
