Global ETD Search

Return to search

PELICAN : a PipELIne, including a novel redundancy-eliminating algorithm, to Create and maintain a topicAl family-specific Non-redundant protein database

The increasing number of biological databases today requires that users are able to search more efficiently among as well as in individual databases. One of the most widespread problems is redundancy, i.e. the problem of duplicated information in sets of data. This thesis aims at implementing an algorithm that distinguishes from other related attempts by using the genomic positions of sequences, instead of similarity based sequence comparisons, when making a sequence data set non-redundant. In an automatic updating procedure the algorithm drastically increases the possibility to update and to maintain the topicality of a non-redundant database. The procedure creates a biologically sound non-redundant data set with accuracy comparable to other algorithms focusing on making data sets non-redundant

http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-960

redundancy

BLAT

genomic positions

profile hidden Markov models

G-protein coupled receptors

Bioinformatics

Bioinformatik

Identifer	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:his-960
Date	January 2005
Creators	Andersson, Christoffer
Publisher	Högskolan i Skövde, Institutionen för kommunikation och information, Skövde : Institutionen för kommunikation och information
Source Sets	DiVA Archive at Upsalla University
Language	English
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/postscript
Rights	info:eu-repo/semantics/openAccess

Page generated in 0.0019 seconds

PELICAN : a PipELIne, including a novel redundancy-eliminating algorithm, to Create and maintain a topicAl family-specific Non-redundant protein database

Description

Links & Downloads

Tags

Additional Fields