Return to search

A method for identification of putatively co-regulated genes

The genomes of several organisms have been sequenced and the need for methods to analyse the data is growing. In this project a method is described that tries to identify co-regulated genes. The method identifies transcription factor binding sites, documented in TRANSFAC, in the non-coding regions of genes. The algorithm counts the number of common binding sites and the number of unique binding sites for each pair of genes and decides if the genes are co-regulated. The result of the method is compared with the correlation between the gene expression patterns of the genes. The method is tested on 21 gene pairs from the genome of Saccharomyces cerevisiae. The algorithm first identified binding sites from all organisms. The accuracy of the program was very low in this case. When the algorithm was modified to only identify binding sites found in plants the accuracy was much improved, from 52% to 76% correct predictions.

Identiferoai:union.ndltd.org:UPSALLA1/oai:DiVA.org:his-705
Date January 2002
CreatorsAndersson, Malin
PublisherHögskolan i Skövde, Institutionen för datavetenskap, Skövde : Institutionen för datavetenskap
Source SetsDiVA Archive at Upsalla University
LanguageEnglish
Detected LanguageEnglish
TypeStudent thesis, info:eu-repo/semantics/bachelorThesis, text
Formatapplication/postscript, application/pdf
Rightsinfo:eu-repo/semantics/openAccess, info:eu-repo/semantics/openAccess

Page generated in 0.002 seconds