Return to search

Short sequence tags reveal global transcription of repetitive elements in mammalian genomes

Retrotransposon mobilization is a major source of genome evolution. However, the functional consequences of these events, and particularly their influence upon transcriptional activity, are poorly defined. The extent of retrotransposon transcription, as well as that of other repetitive elements, has eluded systematic study due to difficulties in discriminating elements copied in multiple genomic loci. Moreover, the potential regulatory effects of retrotransposon transcription upon the expression of neighbouring protein-coding genes are also largely unknown. This thesis develops methods to survey repetitive element expression and assess the functions of retrotransposons in the mouse and human genomes. Chapter 1 summarises the complex transcriptional output of the mammalian genome, the functional annotation of this expression and the genomic and bioinformatic tools available for its detection. Chapter 2 explores the capacity of short sequence tags to discern transcription from individual repetitive elements, as well as from protein-coding genes. It is based upon a publication that critiqued the bioinformatics associated with Cap Analysis Gene Expression (CAGE) and developed novel methodologies to resolve repetitive element transcription. Chapter 3 describes the development of an updated CAGE mapping pipeline for the fourth stage of the international Functional Annotation of Mouse (FANTOM) project, which lead to the generation of a research article and a book chapter. These works demonstrated the enhanced utility of CAGE when coupled with next-generation sequencing, highlighted the benefits of CAGE when applied to systems biology and profiled the temporal expression of human repetitive elements. Chapter 4 presents an in-depth analysis of repetitive element transcription in the mouse and human genomes. Using CAGE, approximately 250,000 retrotransposon associated transcription start sites were defined, many of which were tissue-specific. Retrotransposons were found to frequently function as alternative promoters for protein-coding genes and/or express non-coding RNAs. Furthermore, when retrotransposons were found within the 3’UTR of protein-coding genes, there was strong evidence for the reduced expression of the corresponding transcripts. A genome-wide screen for strong expression correlation between repetitive elements and neighbouring protein-coding genes identified approximately 23,000 candidate regulatory regions derived from retrotransposons, including several hundred putative boundary elements. These were in addition to more than two thousand examples of bidirectional transcription found in retrotransposons, which are known to be a source of double stranded RNAs involved in RNA interference. Chapter 5 explores the proportion of the mouse embryonic stem cell transcriptome comprised of repeat-derived transcripts, using next-generation RNA sequencing. This study defined the dynamic expression of repetitive elements at the greatest resolution achieved to date and demonstrated that repetitive elements are an intrinsic part of the mammalian transcriptional landscape.

Identiferoai:union.ndltd.org:ADTP/253991
CreatorsGeoffrey Faulkner
Source SetsAustraliasian Digital Theses Program
Detected LanguageEnglish

Page generated in 0.0017 seconds