Main motive of this master thesis was the need of good bioinformatics tools for genome comparison and improvement of one of the existing tools - RepeatExplorer. This work offers an overview of transposable elements in DNA, existing tools for identification and analysis of repetitions in sequenced genomes, summary of currently used genome sequencing methods. This work describes shortcomings of RepeatExplorer tool with focus on comparative analysis of genomes. Two solutions to remove these problems were designed and implemented. The first solution is designed for comparing pairs of genomes. The principle of this solution is based on comparison of similarity of distribution of contigs coverages using Kolmogorov-Smirnov test, thanks to which we are able to determine different parts in the genomes.The second solution, which is used to compare multiple genomes, is based on the method of mapping reads from compared genomes to the reference genome contigs and provides contigs coverage graphs, by which we are able to determine the variability of the repeats.Their functionality was verified on real NGS data of organism Silene latifolia.
Identifer | oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:264943 |
Date | January 2015 |
Creators | Puterová, Janka |
Contributors | Vogel, Ivan, Martínek, Tomáš |
Publisher | Vysoké učení technické v Brně. Fakulta informačních technologií |
Source Sets | Czech ETDs |
Language | Czech |
Detected Language | English |
Type | info:eu-repo/semantics/masterThesis |
Rights | info:eu-repo/semantics/restrictedAccess |
Page generated in 0.0016 seconds