Global ETD Search

Return to search

The Relative Importance of Input Encoding and Learning Methodology on Protein Secondary Structure Prediction

In this thesis the relative importance of input encoding and learning algorithm on protein secondary structure prediction is explored. A novel input encoding, based on multidimensional scaling applied to a recently published amino acid substitution matrix, is developed and shown to be superior to an arbitrary input encoding. Both decimal valued and binary input encodings are compared. Two neural network learning algorithms, Resilient Propagation and Learning Vector Quantization, which have not previously been applied to the problem of protein secondary structure prediction, are examined. Input encoding is shown to have a greater impact on prediction accuracy than learning methodology with a binary input encoding providing the highest training and test set prediction accuracy.

Neural Networks

Learning Vector Quantization

Protein Secondary Structure Prediction

Resilient Propagation

Computer Sciences

Identifer	oai:union.ndltd.org:GEORGIA/oai:digitalarchive.gsu.edu:cs_theses-1018
Date	09 June 2006
Creators	Clayton, Arnshea
Publisher	Digital Archive @ GSU
Source Sets	Georgia State University
Detected Language	English
Type	text
Format	application/pdf
Source	Computer Science Theses

Page generated in 0.0019 seconds

The Relative Importance of Input Encoding and Learning Methodology on Protein Secondary Structure Prediction

Description

Links & Downloads

Tags

Additional Fields