Global ETD Search

1	Short-read Chromosome Level Genome Assembly of Digitaria exilis Gapa, Liubov 11 1900 (has links) Genomics has become an important tool in agriculture. Many modern crop breeding approaches such as genomic selection and genome editing require detailed information of the genomic composition of a crop species. However, the assembly of high-quality genome sequences is prone to technical artifacts that arise from inaccuracies in the sequencing technology and assembly algorithms. This is particularly true for the genomes of cereal crops, which are often very large, repeat-rich, and polyploid. Until recently, the highly continuous assembly of such cereal crop genomes from short-read data was mainly possible with proprietary assembly tools. In this work, we combined data generated with several short-read sequencing protocols and genomics technologies, including paired-end and mate-pair reads with multiple insert sizes, 10X linked reads, Hi-C contacts, and optical maps to assemble a chromosome level reference genome of Digitaria exilis (fonio millet) with open-source tools. Fonio millet is a semi-domesticated cereal orphan crop native to West Africa that has a high potential for desert agriculture. We implemented the TRITEX pipeline - a recently developed open-source pipeline for the assembly of large Triticeae genomes. We modified the pipeline to include 10X and Hi-C reads into the assembly process independently. We then compared the TRITEX assembly to the fonio reference genome, which had previously been assembled from the same input data but using proprietary algorithms. We found the two assemblies highly similar in content with high concordance in the local order (0.91 Pearson coefficient for alignments). However, we detected many small putative discrepancies between the two assemblies. While the TRITEX assembly was able to produce a highly continuous genome assembly, further work is needed to characterize the putative discrepancies in more detail. Genome assembly Plant-genomics Open-source
2	Draft Assembly and Baseline Annotation of the Ziziphus spina-christi Genome Shuwaikan, Raghad H. 07 1900 (has links) Third generation sequencing has revolutionized our understanding of genomics, and enabled the in-depth discovery of complex plant genomes. In this project I aimed to assemble and annotate the genome of Z. spina-christi, a native plant to Saudi Arabia, as part of the the Kingdom of Saudi Arabia Native Genome Project established at the Center for Desert Agriculture at KAUST. Initially, a voucher plant was selected from the Al Lith region of Western Saudi Arabia. Fresh leaf tissue was collected for high-molecular weight (HMW) DNA extraction, as well as seed for greenhouse propagation. After HMW DNA extraction, library construction and PacBio HiFi sequencing, I generated a de novo assembly of the Z. spina-christi genome using the Hifiasm assembler, which yielded a 1.9 Gbp long assembly with high levels of duplication. The assembled contigs were scaffolded using an in-house script based on the software RagTag, that yielded a 406 Mbp long scaffold with 331 gaps (85.45% of estimated genome size). A preliminary analysis of the assembly for transposable elements revealed a TE content of 32.36%, with Long Terminal Repeats retrotransposons (LTR-RTs) being the major contributor to the total TE content. Basline annotation was completed using Omicsbox revealing 18,330 functional genes. This work describes the first genomic resource for the desert plant Z. spina-christi. To improve the assembly, I suggest the use of scaffolding using optical mapping, long Nanopore reads and Hi-C data to capture the spatial organization of the genome. Further experimental, genetic and TEs analysis is needed to explore the plant’s resilience to abiotic stresses in extreme environments. sequencing genome assembly plant genomics plant genomes third-generation sequencing
3	Construire une économie de la recherche sur projets. L’installation de l’Agence Nationale de la Recherche en France et ses conséquences dans les domaines de la génomique végétale et de la chimie durable / Build up an economy of projectified research. The establishement of the National Research Agency (ANR) in France and its consequences in the area of plant genomics and green chemistry Schultz, Emilien 20 June 2016 (has links) L'Agence Nationale de la Recherche, créée en 2005, contribue à rapprocher le système de recherche français du modèle anglo-saxon, dans lequel les agences de financement jouent un rôle déterminant dans la conduite des politiques scientifiques. À partir d'une enquête sur l'agence et de l’étude des programmes de financement en génomique végétale et en chimie durable, cette thèse analyse le rôle de l'ANR dans la transformation du mode d'allocation des financements. Elle s’appuie sur des données issues d'entretiens auprès de chercheurs, d’un traitement statistique des projets déposés et de l'analyse de sources primaires et secondaires sur l’activité de l'ANR. Nous montrons notamment que l'ANR constitue une organisation intermédiaire partiellement autonome dont le métier a considérablement évolué depuis sa création en 2005. La carrière de l'agence témoigne des luttes de définition dont elle fait l'objet. Le gouvernement par projet pratiqué par l'ANR a contribué à généraliser la pratique des appels à projets compétitifs en France, qui favorise la différenciation des situations locales, la fragmentation des communautés de recherche et le changement du rapport des chercheurs aux financements. Plus généralement, nous montrons que la multiplication des appels à projets compétitifs conduit à établir ce que nous qualifions une économie de la recherche sur projets. Dans une telle économie, l'enjeu central pour les chercheurs devient le contrôle de la traduction des activités de recherche locales sous la forme générique de « projet » afin de mener leur activité. / The National Research Agency (ANR) created in 2005 brings the French research system closer to Anglo-saxon ones, where funding agencies play an important part in research policies. Based on an investigation of the ANR and two funding programs in plant genomics and green chemistry, this dissertation shows the significant role played by the ANR in the transformation of funding allocation in France. It contributes both to sociology of science policies and to sociology of organisations. We use data from interviews with researchers, statistical analysis of submitted projects and various reports and media contents. We show that the ANR is an intermediate and semi-autonomous organisation whose aims have significantly evolved since its creation in 2005. The definition of its missions has been a continuous object of struggles which can be seen in the « career » of the agency. Because the ANR mainly « governs through projects », its activities led to the multiplication of competitive calls for projects in France. Some of the consequences are the singularisation of local situations, the scattering of research communities and a change in how researchers behave regarding fundings. More broadly, we show that the multiplication of competitive project funding opportunities leads to an « economy of projectified research ». In this type of economy, the main issue for researchers is to control the way their research is converted into « project » in order to be funded. Sociologie des sciences Politiques scientifiques Financement sur projets Agence Nationale de la Recherche (ANR) Chimie verte Génomique végétale Sociology of science Science policy French National Research Agency (ANR) Green chemistry Plant genomics
4	Diversity and Evolution of Short Interspersed Nuclear Elements (SINEs) in Angiosperm and Gymnosperm Species and their Application as molecular Markers for Genotyping Kögler, Anja 08 September 2020 (has links) Short interspersed nuclear elements (SINEs) are small non-autonomous and heterogeneous retrotransposons, widespread in animals and plants and usually differentially propagated in related species resulting in genome-specific copy numbers. Within the monocots, the Poaceae (sweet grasses) is the largest and economically most important plant family. The distribution of 24 Poaceae SINE (PoaS) families, five of which showing a subfamily structure, was analyzed in five important cereals (Oryza sativa, Triticum aestivum, Hordeum vulgare, Sorghum bicolor, Zea mays), the energy crop Panicum virgatum and the model grass Brachypodium distachyon. The comparative investigation of SINE abundance and sequence diversity within Poaceae species provides insights into their species‐specific diversification and amplification. The PoaS families and subfamilies fall into two length and structural categories: simple SINEs of up to 180 bp and dimeric SINEs larger than 240 bp. Of 24 PoaS families, 20 are structurally related across species, in particular either in their 5′ or 3′ regions. Hence, reshuffling between SINEs, likely caused by nested insertions of full-lengh and truncated copies, is an important evolutionary mechanism of SINE formation. Most striking, the recently evolved homodimeric SINE family PoaS‐XIV occurs exclusively in wheat (T. aestivum) and consists of two tandemly arranged PoaS‐X.1 copies. Exemplary for deciduous tree species, the evolutionary history of SINE populations was examined in six Salicaceae genomes (Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, Salix purpurea). Four of eleven Salicaceae SINE (SaliS) families exhibit a subfamily organization. The SaliS families consist of two groups, differing in their phylogenetic distribution pattern, sequence similarity and 3’ end structure. These groups probably emerged at different evolutionary periods of time: during the ‘salicoid duplication’ (~ 65 million years ago) in the Salix-Populus progenitor, and during the separation of the genus Salix (~ 45 - 65 million years ago), respectively. Similar to the PoaS families, the majority of the 20 SaliS families and subfamilies share regions of sequence similarity, providing evidence for SINE emergence by reshuffling. Furthermore, they also contain an evolutionarily young dimeric SINE family (SaliS-V), amplified only in two poplar genomes. The special feature of the Salicaceae SINEs is the contrast of the conservation of 5’ start motifs across species and SINE families compared to the high variability of 3’ ends within the SINE families, differing in sequence and length, presumably resulting from mutations in the poly(A) tail as a possible route for SINE elongation. Periods of increased transpositional activity promote the dissemination of novel 3’ ends. Thereby, evolutionarily older motifs are displaced leading to various 3’ end subpopulations within the SaliS families. Opposed to the PoaS families with a largely equal ratio of poly(A) to poly(T) tail SINEs, the SaliS families are exclusively terminated by adenine stretches. Among retrotransposon-based markers, SINEs are highly suitable for the development of molecular markers due to their unidirectional insertion and random distribution mainly in euchromatic genome regions, together with an easy and fast detection of the heterogeneous SINE families. As a prerequisite for the development of SINE-derived inter-SINE amplified polymorphism (ISAP) markers, 13 novel Theaceae SINE families (TheaS-I - TheaS-VII, TheaS-VIII.1 and TheaS-VIII.2, TheaS-IX - TheaS-XIII) were identified in the angiosperm tree species Camellia japonica. Moreover, six Pinaceae SINE families (PinS-I.1 and PinS-I.2, PinS-II – PinS-VI) were detected in the gymnosperm species Larix decidua. Compared to the SaliS and PoaS families, structural relationships are less frequent within the TheaS families and absent in the PinS families. The ISAP analysis revealed the genetic identity of Europe’s oldest historical camellia (C. japonica) trees indicating their vegetative propagation from the same ancestor specimen, which was probably the first living camellia on European ground introduced to England within the 18th century. Historical sources locate the native origin of this ancestral camellia specimen either in the Chinese province Yunnan or at the Japanese Gotō Islands. Comparative ISAPs showed no accordance to the Gotō camellia sample pool and appropriate Chinese reference samples were not available. However, the initial experiments demonstrated the potential of ISAP to resolve variations among natural populations. The ISAP application on angiosperm trees also concerned fast growing Populus clones grown in short rotation coppice plantations for energy production. The species-specific P. tremula ISAP primers might also be applied for the discrimination of hybrid poplar clones involving P. tremuloides genome portions, since SINEs of these two species are highly related. However, due to lineage-specific SINE evolution during speciation, cross-species applications are generally only successful to limited extent. The analysis of poplar hybrids composed of P. maximowiczii with either P. trichocarpa or P. nigra based on P. tremula ISAP primers showed a strongly reduced resolution. In forestry, hybrid larch (e.g. Larix × eurolepis) genotypes have to be selected from the offspring of Japanese (Larix kaempferi) and European larch (Larix decidua) crosses, as they exhibit superior growth rates compared to the parental species. Initial ISAP-based examinations of European larch genotypes provided less polymorphic banding patterns, probably resulting from general high levels of synteny and collinearities reported for gymnosperm species. Hence, the ISAP was combined with the AFLP technique to the novel marker system inter-SINE-restriction site amplified polymorphism (ISRAP). The amplicons originating from genomic regions between SINEs and EcoRI cleavage sites were visualized with the sensitive capillary gel electrophoresis. The ISRAP assays, based on EcoRI adapter primers combined with two different SINE-derived primers, resulted in a sufficient number of polymorphic peaks to distinguish the L. decidua genotypes investigated. Compared to ISAPs, the ISRAP approach provides the required resolution to differentiate highly similar larch genotypes. info:eu-repo/classification/ddc/570 ddc:570
5	The retrotransposon landscape of the Beta vulgaris genome: Evolutionary conservation and diversity Heitkam, Tony 08 March 2019 (has links) Retrotransposons are major components of plant genomes influencing their genome size, organization and evolution. In the frame of this work, retrotransposons of the Beta vulgaris genome have been identified by molecular methods and whole genome bioinformatics approaches. Neither belonging to the rosids nor asterids, B. vulgaris (cultivated beet including sugar beet, beet root and mangold) is taxonomically placed at a key position at the root of the core eudicots, and considerably different from traditional plant model species such as thale cress or rice. Its genome has been sequenced, and annotation is under way. In order to compare different evolutionary lineages of B. vulgaris retrotransposons, long terminal repeat (LTR) and non-LTR retrotransposon family have been analyzed in detail. Full-length members have been isolated and characterized by bioinformatics, Southern and fluorescent in situ hybridization. Hallmarks of the LTR retrotransposon family Cotzilla are an additional env-like open reading frame (ORF), homogeneity of the members and the very high abundance. Most family members are evolutionarily young, and have most likely been created during recent bursts of amplification during species radiation. In contrast, the non-LTR retrotransposon family BNR has fewer copies and is much more diverged. Although the BNR ORF2 resembles previously analyzed long interspersed nuclear elements (LINEs) of the L1 clade, its ORF1 sequence differs strongly. It lacks the zinc finger domain described for plant LINEs, but contains instead an RNA recognition motif (RRM) likely to have an RNA-binding function. Database searches revealed the presence of similar LINE families in higher plant genomes such as poplar, lotus and soybean. Comparing their reverse transcriptase regions with other retrotransposons, these BNR-like LINEs form a separate group of L1 LINEs designated as BNR subclade. Availability of the B. vulgaris genome sequence allowed retrotransposon analyses on a genome-wide scale. A Hidden Markov Model-based detection algorithm has been developed in order to retrieve retrotransposon information directly from the database. Nearly 6000 B. vulgaris reverse transcriptase sequences have been isolated and classified into LTR retrotransposons of the Ty3-gypsy and Ty1-copia type, and non-LTR retrotransposons of the LINE type. As a result, a comprehensive overview of the retrotransposon spectrum of the B. vulgaris genome has been generated. Since plant LINEs have been only rarely investigated, the B. vulgaris LINE composition was studied in detail. Out of 28 described LINE clades, only members of the L1 and RTE clades have been identified. Based on a minimal shared sequence identity of 60 %, they form at least 17 L1 families and one RTE family. Full-length members of all investigated L1 families have been analyzed regarding their sequence, structure and diversity. In order to transfer the algorithm tested in B. vulgaris to other angiosperm genomes, twelve additional plant genomes have been queried for LINE reverse transcriptases. Key finding is the presence of only two LINE clades (L1 and RTE) in the analyzed genomes of higher plants. Whereas plant L1 LINEs are highly diverse and form at least seven subclades with members across species borders, RTE LINEs are extremely homogenized and constitute most likely only a single family per genome. In summary, this work’s results help to gain an understanding of the different strategies of retrotransposon evolution in plants, whereas the generated data directly contributes to the B. vulgaris genome annotation project. / Retrotransposons sind eine wesentliche Komponente von Pflanzengenomen, die sowohl die Größe und Organisation als auch die Evolution dieser Genome wesentlich beeinflussen können. Im Rahmen dieser Arbeit wurden verschiedene Gruppen von Retrotransposons des Beta vulgaris Genoms mittels molekularer und bioinformatischer Methoden identifiziert. Innerhalb der dikotyledonen Blütenpflanzen gehört B. vulgaris (kultivierte Rübe einschließlich Zuckerrübe, Roter Beete und Mangold) weder zu den Rosiden noch zu den Asteriden, sondern nimmt eine Schlüsselposition innerhalb der Kerneudikotyledonen ein. Somit zeigt das Rübengenom wesentliche Unterschiede zu traditionellen Modellpflanzen wie Arabidopsis thaliana oder Oryza sativa. Das Genom ist bereits sequenziert, die Annotation jedoch noch nicht abgeschlossen. Um verschiedene evolutionäre Linien von B. vulgaris Retrotransposons vergleichend zu untersuchen wurden insbesondere Long Terminal Repeat (LTR)- und Non-LTR-Retrotransposon-Familien detailliert analysiert. Vollständige Mitglieder wurden isoliert und mittels bioinformatischer Methoden, Southern- und Fluoreszenz-in situ-Hybridisierung untersucht. Die LTR-Retrotransposon-Familie Cotzilla ist durch einen zusätzlichen env-ähnlichen offenen Leserahmen (ORF), Homogenität ihrer Mitglieder und eine hohe Abundanz gekennzeichnet. Die meisten Cotzilla-Kopien sind evolutionär jung und wurden wahrscheinlich innerhalb eines kurzen Zeitraumes während der Artentstehung stark amplifiziert. Im Gegensatz zur Cotzilla-Familie besitzt die Non-LTR-Retrotransposon-Familie BNR weniger Kopien und ist wesentlich divergierter. Während der BNR-spezifische ORF2 starke Ähnlichkeiten zu anderen pflanzlichen Long Interspersed Nuclear Elements (LINEs) der L1-Klade aufweist, unterscheidet sich der BNR ORF1 von diesen sehr stark. Im Gegensatz zu bereits beschrieben pflanzlichen LINEs kodiert er kein Zinkfingermotiv, sondern substituiert dieses durch ein RNA-Erkennungsmotiv (RRM). Durch Datenbanksuche konnten BNR-ähnliche LINEs in den Genomen höherer Pflanzen wie Soja, Lotus und Pappel identifiziert werden. Ein Vergleich der entsprechenden Reversen Transkriptasen (RT) mit den RTs anderer Retrotransposons zeigt, dass die BNR-ähnlichen LINEs eine separate Gruppe innerhalb der L1 LINEs bilden. Diese wurde daher als BNR-Subklade definiert. Die Untersuchung von Retrotransposons auf Genomebene wurde durch die B. vulgaris Genomsequenz ermöglicht. Um Retrotransposon-Informationen direkt aus dem Genom zu extrahieren, wurde ein Hidden Markov Modell (HMM)-basierter Detektions-algorithmus entwickelt. Annähernd 6000 B. vulgaris Reverse Transkriptase-Sequenzen konnten identifiziert und in LTR-Retrotransposons des Ty3-gypsy- beziehungsweise des Ty1-copia-Typs und in Non-LTR-Retrotransposons des LINE-Typs klassifiziert werden. Somit wurde ein umfassender Überblick über die Bandbreite der B. vulgaris Retrotransposons arhalten. Da pflanzliche LINEs bisher nur wenig erforscht sind, wurde die B. vulgaris LINE Zusammensetzung genauer untersucht. Von 28 beschriebenen LINE-Kladen konnten nur Mitglieder der L1- und der RTE-Klade identifiziert werden. Basierend auf einer Identität von mindestens 60 % bilden die Sequenzen 17 L1 Familien und eine RTE Familie. Vollständige Mitglieder aller L1 Familien wurden hinsichtlich ihrer Sequenz, Struktur und Diversität analysiert. Um den in B. vulgaris getesteten HMM-basierten Algorithmus auf andere Angiospermengenome zu übertragen, wurden zwölf weitere Pflanzengenome auf das Vorhandensein von LINE-spezifischen Reversen Transkriptasen untersucht. Wesentlichstes Ergebnis ist der Nachweis von nur zwei LINE-Kladen (L1 und RTE) in höheren Pflanzen. Während pflanzliche L1 LINEs hochgradig divers sind und über Artgrenzen hinaus mindestens sieben Subkladen mit Vertretern verschiedener Pflanzen bilden, sind RTE LINEs extrem homogenisiert und stellen höchstwahrscheinlich nur eine einzelne Familie pro Genom einer Art dar. Zusammenfassend ermöglichen die Ergebnisse dieser Arbeit eine Erweiterung des Verständnisses der unterschiedlichen Evolutionsstrategien von Retrotransposons in Pflanzen. Zusätzlich tragen die gewonnen Daten zur Annotation des B. vulgaris Genoms bei. info:eu-repo/classification/ddc/570 ddc:570

1

Page generated in 0.0824 seconds