Global ETD Search

81	Identification et caractérisation de virus aviaires par des approches de séquençage à haut débit / Identification and characterisation of avian viruses using high throughput sequencing Liais, Etienne 02 December 2014 (has links) En médecine humaine et vétérinaire, les agents pathogènes représentent la cause de mortalité principale à travers la planète. Les méthodes de diagnostic de ces pathogènes ont considérablement changé et évolué particulièrement depuis l’apparition du séquençage haut débit. Les nouvelles méthodes de séquençage massif ont considérablement diminué le prix d’une séquence permettant de rendre accessible cette technologie révolutionnaire. Dans le cadre de mes travaux de thèse, nous avons mis en place un protocole pour l’utilisation du séquençage Illumina® (avec le séquenceur MiSeq) comme méthode de diagnostic lors de différents cas pathologiques aviaires. L’utilisation de cette méthode nous a permis dans un premier temps d’identifier l’agent étiologique de la maladie foudroyante de la pintade. Cette étude nous a permis de valider l’utilisation de ce genre de méthode pour des cas ciblés, ici lors d’un épisode clinique particulier n’impliquant vraisemblablement qu’un seul candidat pathogène. Ce nouveau coronavirus a fait l’objet d’études complémentaires afin de le caractériser. Nous avons élargis les cibles recherchées en analysant dans un deuxième temps l’ensemble des virus ARN chez le canard lors d’épisodes cliniques respiratoires et/ou de chute de ponte. L’analyse des données a mis en évidence une importante diversité virale et a permis d’identifier des candidats responsables potentiels. L’ensemble des résultats obtenus nous permet de valider l’utilisation du séquençage à haut débit comme un outil puissant de diagnostic. / Infectious diseases are considered the most prevalent cause of mortality in humans as well as other animals worldwide. Since the advent of high throughput sequencing technologies, diagnostic methods for these conditions have quickly changed and evolved, as the continuously decreasing cost of mass sequencing is making this tool available to larger numbers of people. As part of my thesis project, an Illumina®-based sequencing method (on a MiSeq machine) was designed for diagnostic purposes in clinical cases in poultry. We first used this method to identify the causative agent of the fulminating disease of guinea fowl. This validated the use of our protocol to identify the pathogenic infectious agent behind a specific condition. This newly identified Coronavirus was further analysed and characterised. In a second study we used an unbiased mass sequencing approach to describe the RNA virus populations present in the duck respiratory tract during clinical episodes (respiratory illness or egg drops). Data showed an important viral diversity and we identified some candidate pathogens. Taken together, these results validate the use of high throughput sequencing as a powerful diagnostic tool. Séquençage à haut débit Outil de diagnostic Pathologie aviaire Coronavirus Métapneumovirus Bio-Informatique High throughput sequencing Diagnostic tools Avian pathology Coronavirus Metapneumovirus Bioinformatics
82	Environmental DNA Detection and Population Genetic Patterns of Native and Invasive Great Lakes Fishes Snyder, Matthew Robert January 2019 (has links) No description available. Bioinformatics Biology Ecology Genetics Organismal Biology eDNA assays metabarcoding HTS high-throughput sequencing invasive species native species community diversity and structure population structure retail vector bait and pond trade bioinformatics
83	Identification de nouvelles bases moléculaires des cancers précoces par séquençage à haut débit. / Identification of new molecular basis of early-onset cancers by means of high-throughput sequencing Fermey, Pierre 13 December 2017 (has links) Une des plus grandes avancées en cancérologie et en génétique au cours des vingt dernières années fût l'identification des formes héréditaires de cancer et des gènes deprédisposition impliqués. Chez une majorité de patients soupçonnés de présenter une formehéréditaire de cancer, les analyses centrées sur les gènes connus pour être impliqués dansles prédispositions mendéliennes au cancer restent bien souvent négatives. Aujourd'hui,grâce à l'émergence du séquençage à haut-débit (NGS), il est possible de séquencerl'ensemble des exons (exome) d'un individu ou plusieurs centaines de gènes dans un lapsde temps court et à des coûts raisonnables. Dans ce contexte, nous avons appliqué plusieurs stratégies basées sur ces nouveaux outils, avec l'objectif d'identifier de nouvellesbases moléculaires des cancers héréditaires à survenue précoce. Tout d’abord, nous avons employé une stratégie d'analyse exomique intrafamiliale dans une famille atypique présentant des chondrosarcomes de localisation thoracique pour lesquels aucune base moléculaire n'avait pu être mise en évidence. Grâce à cette stratégie, nous avons pu identifier une altération tronquante du gène EXT2 (NM_000401.3; c.237G>A; p.Trp79). Les altérations perte de fonction documentées pour ce gène sont impliquées dans la maladie des ostéochondromes multiples (OM), des tumeurs bénignes. Or, dans cette famille, aucun signe clinique d'OM n'était présent. Ces travaux nous ont donc permis d'étendre le spectre phénotypique des mutations EXT2 et de modifier la prise en charge clinique de cette famille. Nous avons ensuite employé une stratégie d'analyse exomique soustractive de trio enfant malade / parents sains dans le but d’identifier des mutations de novo potentiellement responsables de la prédisposition génétique au cancer observée chez un jeune patient ayant développé un médulloblastome du cervelet à l’âge de 8 ans, suivi d’un méningiome à 22 ans. L’analyse exomique du trio a révélé l’existence chez ce patient d'une mutation de novo faux-sens affectant un acide aminé très conservé de la protéine HID-1. Cette dernière est particulièrement exprimée dans les cellules neuronales et sécrétrices, et semble fonctionner autour de l’appareil de Golgi pour réguler le tri des vésiculesnouvellement formées. Ainsi, notre hypothèse est qu’un défaut de la protéine HID-1, lié à une mutation du gène HID-1, perturberait la voie de sécrétion et participerait à la genèse du médulloblastome. Ces travaux, toujours en cours, démontrent à la fois la force de la stratégie exomique de trio pour identifier rapidement des mutations de novo et illustre toute la difficultéd'interprétation des variants détectés dans des gènes non impliqués dans le cancer. Par ailleurs, nous avons appliqué une stratégie exomique soustractive et interfamiliale à une cohorte de dix patients ayant développé un corticosurrénalome à un âge très précoce et pour lesquels aucune base moléculaire n'a pu être mise en évidence. Malheureusement, nous n'avons pas pu identifier de nouvelles bases moléculaires du corticosurrénalome de l'enfant par ces techniques. Enfin, sous l'hypothèse que des mutations rares ou privées dans un nombre limité de gènes impliqués dans le cancer contribueraient à des formes héréditaires de cancer, nous avons entrepris un projet visant à séquencer à haut débit 201 gènes fortement impliqués dans le cancer chez des patients ayant développé des tumeurs à un âge pédiatrique. Les premiers résultats de ce projet toujours en cours ont permis de confirmer la robustesse de cette technique et suggèrent une extension phénotypique du spectre des mutations DICER1 ainsi qu'une contribution oligogénique des gènes de réparation de l'ADN dans les tumeurs pédiatriques. L'ensemble de ces résultats seront bientôt compilés au sein d'une base de données et bénéficieront d'une analyse statistique fine avec l'objectif d'identifier des enrichissements en variants rares dans des gènes ou voies biologiques. / One of the greatest advances in oncology and genetics over the past 20 years has been the identification of hereditary forms of cancer and of the cancer genes. Nevertheless, in a majority of patients suspected to present an inherited form of cancer, analyses of the genes known to be involved in the Mendelian predispositions to cancer often remain negative. Today, thanks to the emergence of high-throughput sequencing (NGS), it is now possible to sequence all exons of an individual (exome) or several hundred genes in a short period of time and for a reasonable cost. In this context, we have applied several strategiesbased on these new tools in order to identify new molecular basis of early-onset cancers. First, we applied an intra-familial exome analysis strategy to an atypical family with chondrosarcomas of the chest, for which no molecular basis could be identified. Using this strategy, we were able to identify a truncating alteration of the EXT2 gene NM_000401.3; c.237G> A; p.Trp79 ). The documented loss of function alterations of this gene are implicated in a disease called multiple osteochondromas (OM), associated with benign lesions. Interestingly, these patients showed no clinical signs of OM indicating a potential phenotypic extension of EXT2 mutations. Plus, this work allowed us to change the clinical management of this family. We then used a strategy of subtractive exomic analysis of trio sick child/healthy parents in order to identify de novo mutations in a young patient who developed a medulloblastoma of the cerebellum at 8 years-old followed by a meningioma at 22 years-old. The analysis of the trio revealed the existence of a de novo mutation affecting a highly conserved amino acid of the HID-1 protein. HID-1 is specifically expressed in neuronal and secretory cells, and seems to function around the Golgi apparatus to regulate the sorting of newly formed vesicles. Our hypothesis is that a defect of the HID-1 protein linked to a mutation of the HID-1 gene, could alter the secretory pathway therefore contributing to the development of the tumor. This work, which is still ongoing, demonstrates both the strength of the trio strategy for the rapid identification of de novo mutations and illustrates all the difficulty of interpreting variants detected in genes not yet involved in cancer. Then, thanks to the recruitment of the Laboratory of Molecular Genetics of the CHU of Rouen, we have collected a cohort of 10 patients who developed an adrenocortical carcinoma (ACC) at a very early age and for which no molecular basis could be identified. Despite subtractive and inter-familial exomic analyses, we were unable to highlight new molecular bases for these cases of pediatric ACC. Finally, under the assumption that rare or private mutations in a limited number of genes involved in cancer could contribute to inherited forms of cancer, we undertook a project to sequence 201 genes involved in cancer in patients who developed tumors at a pediatric age. The first results of this project confirmed the robustness of this technique and suggested a phenotypic extension of the DICER1 mutation spectrum as well as an oligogenic contribution of DNA repair genes in pediatric tumors. Soon, these results will be compiled in a database and will benefit from a statistical analysis with the objective to identify enrichment of rare variants in specific genes or biological pathways in these patients compared to control individuals. Séquençage à haut-débit Cancer héréditaire Tumeurs pédiatriques Séquençage de l'exome Séquençage de panels de gènes High-throughput sequencing Hereditary cancer Pediatric tumors Exome sequencing Gene panel sequencing 616.994
84	Correction de données de séquençage de troisième génération / Error correction of third-generation sequencing data Morisse, Pierre 26 September 2019 (has links) Les objectifs de cette thèse s’inscrivent dans la large problématique du traitement des données issues de séquenceurs à très haut débit, et plus particulièrement des reads longs, issus de séquenceurs de troisième génération.Les aspects abordés dans cette problématiques se concentrent principalement sur la correction des erreurs de séquençage, et sur l’impact de la correction sur la qualité des analyses sous-jacentes, plus particulièrement sur l’assemblage. Dans un premier temps, l’un des objectifs de cette thèse est de permettre d’évaluer et de comparer la qualité de la correction fournie par les différentes méthodes de correction hybride (utilisant des reads courts en complément) et d’auto-correction (se basant uniquement sur l’information contenue dans les reads longs) de l’état de l’art. Une telle évaluation permet d’identifier aisément quelle méthode de correction est la mieux adaptée à un cas donné, notamment en fonction de la complexité du génome étudié, de la profondeur de séquençage, ou du taux d’erreurs des reads. De plus, les développeurs peuvent ainsi identifier les limitations des méthodes existantes, afin de guider leurs travaux et de proposer de nouvelles solutions visant à pallier ces limitations. Un nouvel outil d’évaluation, proposant de nombreuses métriques supplémentaires par rapport au seul outil disponible jusqu’alors, a ainsi été développé. Cet outil, combinant une approche par alignement multiple à une stratégie de segmentation, permet également une réduction considérable du temps nécessaire à l’évaluation. À l’aide de cet outil, un benchmark de l’ensemble des méthodes de correction disponibles est présenté, sur une large variété de jeux de données, de profondeur de séquençage, de taux d’erreurs et de complexité variable, de la bactérie A. baylyi à l’humain. Ce benchmark a notamment permis d’identifier deux importantes limitations des outils existants : les reads affichant des taux d’erreurs supérieurs à 30%, et les reads de longueur supérieure à 50 000 paires de bases. Le deuxième objectif de cette thèse est alors la correction des reads extrêmement bruités. Pour cela, un outil de correction hybride, combinant différentes approches de l’état de l’art, a été développé afin de surmonter les limitations des méthodes existantes. En particulier, cet outil combine une stratégie d’alignement des reads courts sur les reads longs à l’utilisation d’un graphe de de Bruijn, ayant la particularité d’être d’ordre variable. Le graphe est ainsi utilisé afin de relier les reads alignés, et donc de corriger les régions non couvertes des reads longs. Cette méthode permet ainsi de corriger des reads affichant des taux d’erreurs atteignant jusqu’à 44%, tout en permettant un meilleur passage à l’échelle sur de larges génomes et une diminution du temps de traitement, par rapport aux méthodes de l’état de l’art les plus efficaces. Enfin, le troisième objectif de cette thèse est la correction des reads extrêmement longs. Pour cela, un outil utilisant cette fois une approche par auto-correction a été développé, en combinant, de nouveau, différentes méthodologies de l’état de l’art. Plus précisément, une stratégie de calcul des chevauchements entre les reads, puis une double étape de correction, par alignement multiple puis par utilisation de graphes de de Bruijn locaux, sont utilisées ici. Afin de permettre à cette méthode de passer efficacement à l’échelle sur les reads extrêmement longs, la stratégie de segmentation mentionnée précédemment a été généralisée. Cette méthode d’auto-correction permet ainsi de corriger des reads atteignant jusqu’à 340 000 paires de bases, tout en permettant un excellent passage à l’échelle sur des génomes plus complexes, tels que celui de l’humain. / The aims of this thesis are part of the vast problematic of high-throughput sequencing data analysis. More specifically, this thesis deals with long reads from third-generation sequencing technologies. The aspects tackled in this topic mainly focus on error correction, and on its impact on downstream analyses such a de novo assembly. As a first step, one of the objectives of this thesis is to evaluate and compare the quality of the error correction provided by the state-of-the-art tools, whether they employ a hybrid (using complementary short reads) or a self-correction (relying only on the information contained in the long reads sequences) strategy. Such an evaluation allows to easily identify which method is best tailored for a given case, according to the genome complexity, the sequencing depth, or the error rate of the reads. Moreover, developpers can thus identify the limiting factors of the existing methods, in order to guide their work and propose new solutions allowing to overcome these limitations. A new evaluation tool, providing a wide variety of metrics, compared to the only tool previously available, was thus developped. This tool combines a multiple sequence alignment approach and a segmentation strategy, thus allowing to drastically reduce the evaluation runtime. With the help of this tool, we present a benchmark of all the state-of-the-art error correction methods, on various datasets from several organisms, spanning from the A. baylyi bacteria to the human. This benchmark allowed to spot two major limiting factors of the existing tools: the reads displaying error rates above 30%, and the reads reaching more than 50 000 base pairs. The second objective of this thesis is thus the error correction of highly noisy long reads. To this aim, a hybrid error correction tool, combining different strategies from the state-of-the-art, was developped, in order to overcome the limiting factors of existing methods. More precisely, this tool combines a short reads alignmentstrategy to the use of a variable-order de Bruijn graph. This graph is used in order to link the aligned short reads, and thus correct the uncovered regions of the long reads. This method allows to process reads displaying error rates as high as 44%, and scales better to larger genomes, while allowing to reduce the runtime of the error correction, compared to the most efficient state-of-the-art tools.Finally, the third objectif of this thesis is the error correction of extremely long reads. To this aim, aself-correction tool was developed, by combining, once again, different methologies from the state-of-the-art. More precisely, an overlapping strategy, and a two phases error correction process, using multiple sequence alignement and local de Bruijn graphs, are used. In order to allow this method to scale to extremely long reads, the aforementioned segmentation strategy was generalized. This self-correction methods allows to process reads reaching up to 340 000 base pairs, and manages to scale very well to complex organisms such as the human genome. Séquençage à haut débit Correction d'erreurs Assemblage Graphe de de Bruijn Alignement multiple High-throughput sequencing Error correction Assembly De Bruijn graphs Multiple Sequence Alignment 005.6
85	Apport du séquençage haut débit dans l'analyse bioinformatique du génome du virus de l'hépatite C / High-throughput sequencing contribution in bioinformatics analysis of hepatitis C virus genome Caporossi, Alban 26 November 2019 (has links) Le séquençage haut débit a été utilisé dans ce travail pour reconstruire avec des méthodes adaptées le génomeviral entier du virus de l’hépatite C (VHC) notamment pour le typer avec précision. Une étude a ainsi permisde mettre en évidence la présence d’une forme recombinante du VHC chez un patient. Une autre a permisde typer et détecter les mutations de résistance de plusieurs souches de VHC de génotypes différents. Enfin,une dernière étude basée sur cette approche a permis de découvrir une souche VHC appartenant à un nouveausous-type. Le séquençage haut débit a aussi été utilisé dans ce travail pour détecter des infections multiples etanalyser l’évolution virale en ciblant des gènes du VHC et en mettant en œuvre des méthodes non spécifiquespour 2 patients VHC sous traitement. Cette étude rétrospective a permis de définir la composition de chaqueéchantillon temporel, estimer leur diversité nucléotidique, explorer la structure génétique de la population viraleet son évolution temporelle et dater les infections secondaires. Les résultats obtenus supportent l’hypothèse d’unmécanisme d’apparition de résistance au traitement (selective sweeps). / High-throughput sequencing has been used in this work to reconstruct with adapted methods the whole genomeof the hepatitis C virus (HCV) particularly for accurately typing the virus. Thus, we managed to detect in a studya recombinant form of HCV circulating within a patient. We typed and detected in another study resistancemutations of several HCV strains of different genotypes. Finally, a last study based on this approach enabled touncover a HCV strain belonging to a new subtype. High-throughput sequencing has also been used in this workto detect multiple infections and analyze viral evolution with targeted HCV genes and non-specific methods for2 HCV patients under treatment. This retrospective study enabled to define the composition of each temporalsample, assess their nucleotide diversity, investigate viral population genetic structure and temporal evolutionand date secondary infections. Results of this analysis support the hypothesis of onset mechanism of treatmentresistance (selective sweeps). Génome entier Génotypage Infection multiple Evolution virale High-Throughput sequencing Hepatitis C Virus Whole genome Genotyping Multiple infection Viral evolution 570
86	Impact des anomalies moléculaires dans l'histoire naturelle de la leucémie lymphoïde chronique / Impact of molecular abnormalities in chronic lymphocytic leukemia natural history Chauzeix, Jasmine 11 December 2018 La leucémie lymphoïde chronique (LLC) est le lymphome avec phase circulante le plus fréquent chez l’adulte dans les pays occidentaux. Elle est caractérisée par une grand hétérogénéité dans son évolution naturelle avec des formes indolentes ne nécessitant jamais de traitement spécifique et des formes agressives requérant une chimiothérapie rapidement après le diagnostic. Dans ce travail de thèse, nous avons posé la question du rôle des anomalies moléculaires et en particulier des gènes des immunoglobulines dans l’histoire naturelle de la maladie, tant au plan mécanistique que pour le pronostic. Nous avons étudié trois remaniements atypiques impliquant un gène des immunoglobulines et un partenaire inconnu dans la LLC/lymphome lymphocytique. Les points de cassure ont pu être identifiés et nous ont permis de mettre en évidence l’implication dans 2 cas d’ARN longs non codants en 17q25 et 8q24. De plus, deux cas ont des points de cassure dans une région chromosomique restreinte (espacés de 200 kb en 17q25). Il pourrait s’agir d’un locus important dans la lymphomagénèse, de même que pour la région 8q24 contenant MYC et de nombreux gènes non codants jusqu’ici peu explorés. Par ailleurs, dans l’ère du séquençage haut débit, de nombreux marqueurs pronostiques moléculaires sont décrits dans la LLC. Nous avons démontré que l’électrophorèse des protéines sériques normale (immunoglobulines polyclonales sans hypogammaglobulinémie) au diagnostic, un marqueur simple et peu couteux, reste dans l’ère du NGS, un marqueur indépendant de bon pronostic dans la LLC. Sa combinaison avec un statut IGHV muté identifie un groupe de patients qui n’auront probablement jamais besoin de traitement spécifique. Ce travail nous a conduit à mettre au point un outil performant de détection des anomalies de nombre de copies par séquençage haut débit. Celui-ci permet la mise en évidence de disomies uniparentales dont la signification pronostique n’est actuellement pas établie dans la LLC. Ces anomalies pourraient être le reflet d’une instabilité chromosomique et il serait intéressant d’étudier leur impact pronostique dans la LLC. / Chronic lymphocytic leukemia (CLL) is the most frequent lymphoma with leukemic phase in the elderly in Western countries. It is characterized by a great heterogeneity in its natural history with indolent forms never requiring any specific treatment and aggressive forms needing chemotherapy rapidly after diagnosis. In this work of thesis, we asked the question of the role of molecular abnormalities, and in particular of the immunoglobulin genes in the natural history of the disease, at mechanistic level and for prognosis. We studied three atypical rearrangements implicating an immunoglobulin gene and an unknown partner in CLL/lymphocytic lymphoma. The breakpoints have been identified and the implication of long non coding RNA was highlighted in two cases in 17q25 and 8q24. Moreover, two cases harboured breakpoints in a restricted chromosomic region (200 kb spaced in 17q25). It could be an important locus in lymphomagenesis, as is 8q24 region containing MYC and numerous other non coding genes poorly characterized by now. Furthermore, in high throughput sequencing (HTS) era, many molecular prognosis markers have been described in CLL. We demonstrated that normal serum protein electrophoresis (polyclonal immunoglobulin without hypogammaglobulinemia) at diagnosis, a simple and unexpensive marker, stays in HTS era an independent good prognosis marker in CLL. Its combination with unmutated IGHV genes status identifies a group of patients who will probably never require any specific treatment. This work led us to develop an efficient tool to detect copy number variations by THS. This tool allows to highlight uniparental disomy whose prognosis signification is not established in CLL. These abnormalities could reflect chromosomal instability and it could be interesting to study their prognosis impact in CLL. Leucémie lymphoide chronique Translocation Immunoglobuline IncRNA Séquençage haut débit Chronic lymphocytic leukemia Translocation Immunoglobulin IncRNA High throughput sequencing Serum protein electrophoresis 616.994
87	Understanding and improving high-throughput sequencing data production and analysis Kircher, Martin 07 November 2011 (has links) Advances in DNA sequencing revolutionized the field of genomics over the last 5 years. New sequencing instruments make it possible to rapidly generate large amounts of sequence data at substantially lower cost. These high-throughput sequencing technologies (e.g. Roche 454 FLX, Life Technology SOLiD, Dover Polonator, Helicos HeliScope and Illumina Genome Analyzer) make whole genome sequencing and resequencing, transcript sequencing as well as quantification of gene expression, DNA-protein interactions and DNA methylation feasible at an unanticipated scale. In the field of evolutionary genomics, high-throughput sequencing permitted studies of whole genomes from ancient specimens of different hominin groups. Further, it allowed large-scale population genetics studies of present-day humans as well as different types of sequence-based comparative genomics studies in primates. Such comparisons of humans with closely related apes and hominins are important not only to better understand human origins and the biological background of what sets humans apart from other organisms, but also for understanding the molecular basis for diseases and disorders, particularly those that affect uniquely human traits, such as speech disorders, autism or schizophrenia. However, while the cost and time required to create comparative data sets have been greatly reduced, the error profiles and limitations of the new platforms differ significantly from those of previous approaches. This requires a specific experimental design in order to circumvent these issues, or to handle them during data analysis. During the course of my PhD, I analyzed and improved current protocols and algorithms for next generation sequencing data, taking into account the specific characteristics of these new sequencing technologies. The presented approaches and algorithms were applied in different projects and are widely used within the department of Evolutionary Genetics at the Max Planck Institute of Evolutionary Anthropology. In this thesis, I will present selected analyses from the whole genome shotgun sequencing of two ancient hominins and the quantification of gene expression from short-sequence tags in five tissues from three primates. info:eu-repo/classification/ddc/000 ddc:000
88	Analysis options for high-throughput sequencing in miRNA expression profiling Stokowy, Tomasz, Eszlinger, Markus, Świerniak, Michał, Fujarewicz, Krzysztof, Jarząb, Barbara, Paschke, Ralf, Krohn, Kurt 30 May 2014 (has links) Background: Recently high-throughput sequencing (HTS) using next generation sequencing techniques became useful in digital gene expression profiling. Our study introduces analysis options for HTS data based on mapping to miRBase or counting and grouping of identical sequence reads. Those approaches allow a hypothesis free detection of miRNA differential expression. Methods: We compare our results to microarray and qPCR data from one set of RNA samples. We use Illumina platforms for microarray analysis and miRNA sequencing of 20 samples from benign follicular thyroid adenoma and malignant follicular thyroid carcinoma. Furthermore, we use three strategies for HTS data analysis to evaluate miRNA biomarkers for malignant versus benign follicular thyroid tumors. Results: High correlation of qPCR and HTS data was observed for the proposed analysis methods. However, qPCR is limited in the differential detection of miRNA isoforms. Moreover, we illustrate a much broader dynamic range of HTS compared to microarrays for small RNA studies. Finally, our data confirm hsa-miR-197-3p, hsa-miR-221-3p, hsa-miR-222-3p and both hsa-miR-144-3p and hsa-miR-144-5p as potential follicular thyroid cancer biomarkers. Conclusions: Compared to microarrays HTS provides a global profile of miRNA expression with higher specificity and in more detail. Summarizing of HTS reads as isoform groups (analysis pipeline B) or according to functional criteria (seed analysis pipeline C), which better correlates to results of qPCR are promising new options for HTS analysis. Finally, data opens future miRNA research perspectives for HTS and indicates that qPCR might be limited in validating HTS data in detail.:Background; Methods; Results; Discussion; Conclusions info:eu-repo/classification/ddc/610 ddc:610
89	DNA metabarcoding for the identification of species within vegetarian food samples De Jager, Megan Dawn January 2021 (has links) >Magister Scientiae - MSc / Aims DNA metabarcoding has recently emerged as a valuable supplementary tool to ensure food authenticity within the global food market. However, it is widely known that highly processed food samples are one of DNA metabarcoding’s greatest shortfalls due to high DNA degradation, presence of PCR inhibitors and the incomplete removal of several undesirable compounds (such as polysaccharides) that makes the amplification of desired DNA challenging. This project has two main aims, the first of which was to determine and develop a cost and time effective DNA metabarcoding system that could successfully describe to species level the ingredient composition of highly processed vegetarian food products. The DNA metabarcoding system was thoroughly evaluated and tested by combining well-researched primers with varying concentrations into a multiplex reaction. The combination of plant and animal primers selected that yielded the best results were used to determine the species composition in the samples. The second aim is to determine the possible presence of meat contaminants within the highly processed vegetarian food samples. Numerous studies have shown that food adulteration is a wide-spread phenomenon throughout the world due to the economic gains it can provide. Animal primers were introduced into the multiplex reaction to aid in the identification of any meat products that could have been inserted into the vegetarian products to lower the overall cost to company. Methodology Thirty-two highly processed vegetarian food samples were collected in the Cape Town area from local and franchised supermarkets. DNA was extracted using the Chloroform/Isoamyl alcohol method best suited for plant-based samples followed by amplification of the following mini-barcoding regions: the mitochondrial 16S ribosomal rRNA, cytochrome B, tRNALeu – trnL – UAA intron and the ribosomal internal transcribed spacer region – ITS2 for plant and fungi identification. The PCR products were purified using the Qiaquick kit and library preparation and building was conducted using the TruSeq DNA PCR-free Library kit. Final purification was completed using AMPure XP kit and the pooled libraries were sequenced on an Illumina Miseq using 300bp paired-end run. Statistical and bioinformatic analysis on the NGS raw sequence reads was performed in R version 3.6.3. Results The results of the data analysis showed that the cytochrome B primer couldn’t detect any animal DNA in the vegetarian samples, however animal-derived sequences were detected in the positives present, validating the efficacy of the multiplex reaction. Mitochondrial 16S ribosomal rRNA was only able to detect plant-based DNA due to the structural homology between chloroplast and mitochondrial DNA. The fungal ribosomal internal transcribed spacer region – ITS2 detected sequences deriving from “Viridiplantae”. This result could have been due to the fungal and plant ribosomal internal transcribed spacer region – ITS2 sharing a reverse primer during amplification. The trnL region was able to detect the presence of undeclared coriander, mustard and wheat in 8 (29%), 6 (21%) and 5 (18%) samples respectively. Additionally, trnL was able to detect the presence of tobacco in 11 (35%) samples. This could have been due to cross-contamination between samples being co-extracted and amplified at the same time for separate studies. The PITS2 region was able to detect the presence of undeclared barley, mustard and wheat in 8 (25%), 4 (14%) and 4 (14%) samples respectively. Our results show the possibility of DNA metabarcoding for the authentication of a wide range of species present in highly processed vegetarian samples using a single assay. However, further optimization of the technique for the identification of both plant and animal species within vegetarian samples needs to be performed before the wide-spread implementation of this technology would be both feasible and viable. Eliminating primer biases, decreasing the risk of homology between different primers in the same assay as well as preventing the amplification of sequencing of undesirable DNA need to be further explored and ultimately mitigated before DNA metabarcoding can be widely seen as an effective and cost-effective method for authentication and food control. Food Authentication DNA Metabarcoding DNA Extraction Polymerase Chain Reaction Multiplex reaction Next Generation Sequencing High-throughput Sequencing Library building Illumina sequencing by synthesis Bioinformatics Species identification
90	Découverte de nouveaux transcrits de fusion dans des tumeurs pédiatriques en rechute et caractérisation fonctionnelle d’un nouvel oncogène LMO3-BORCS5 / Discovery of new fusion transcripts in pediatric tumors at relapse and functional characterization of a new oncogene LMO3-BORCS5 Dupain jourda, Célia 16 October 2018 (has links) Les cancers pédiatriques représentent la première cause de décès par maladie chez les enfants puisque 20% des patients présentent des résistances et rechutent. Les traitements actuels sont à leurs limites d'efficacité mais aussi de toxicité. L'amélioration des connaissances de la biologie des tumeurs résistantes et l'identification de nouvelles cibles moléculaires sont nécessaires pour ces pathologies rares, extrêmement diverses et mal comprises. Les oncogènes de fusion sont responsables de l’oncogenèse de ~ 20% des cancers. En raison de la biologie particulière des cancers pédiatriques, nous avons émis l'hypothèse qu’ils seraient dus à des oncogènes de fusion. Leur exploration, particulièrement à la rechute où les données actuelles sont limitées, permettrait de mieux comprendre ces pathologies et d’introduire de nouvelles cibles thérapeutiques. Notre premier objectif était de détecter de nouveaux transcrits de fusion parmi 48 patients pédiatriques en rechute inclus dans l'essai de médecine de précision MOSCATO-01 mené à Gustave Roussy. A partir de données de RNA-seq, nous avons détecté l’ensemble des fusions retrouvées dans ces tumeurs et classé les plus pertinentes selon leur fonction et leur potentiel thérapeutique. Un nouvel oncogène de fusion nommé LMO3-BORCS5 retrouvé chez un patient atteint de sarcome d'Ewing dans les biopsies tumorales à la rechute et au diagnostic, et dans diverses lignées cellulaires a particulièrement retenu notre attention. Nos études fonctionnelles montrent que LMO3-BORCS5 a un rôle important dans la tumorigenèse et la sensibilité au traitement et que BORCS5 fonctionnerait, in vivo, comme un gène suppresseur de tumeur. Ces résultats montrent le rôle critique des transcrits de fusion dans l'initiation et la progression tumorale ainsi que dans l’acquisition de résistance au traitement et que les fusions isolées ne doivent pas être ignorées. A l'avenir, la prise en charge des cancers réside en une médecine moléculaire de précision, les oncogènes de fusions représentant l'un des outils les plus remarquables en clinique. / Pediatric cancers represent the first cause of death by disease in children as 20% of patients harbor resistances and die from recurrence of the malignancy. Nowadays treatment are at their limits of efficacy but also toxicity. Improving the knowledge on the biology of the relapsed/resistant tumors and identifying new molecular targets are needed for these rare, extremely diverse and misunderstood pathologies. Fusion oncogenes are known to be major drivers of oncogenesis, responsible for ~20% of cancers. Due to their particular biology, we hypothesized that pediatric tumors would be more likely to harbor fusions and that their exploration, especially at relapse where very limited data are available, could help to understand the particularity of these malignancies and consequently to provide new therapeutic targets. Our first aim was to detect new fusion transcripts in a cohort of 48 pediatric patients at relapse included in the precision medicine trial MOSCATO-01, previously conducted in Gustave Roussy. From the RNA-seq data, we described the landscape of fusions in these tumors and classified the most confident candidates according to their biological function and druggability predictions. A new fusion named LMO3-BORCS5 found in biopsies of a patient with Ewing's sarcoma, at diagnosis and at relapse as well as in various cell lines particularly caught our attention. Functional studies showed that LMO3-BORCS5 has a high impact on tumorigenesis and treatment sensitivity and that BORCS5 would act, in vivo, as a tumor suppressor gene. Our results strengthened the critical role of fusion transcripts in tumor initiation, progression and resistance and reinforces the idea that a private fusion oncogene should not be ignored. Indeed, the future of cancer care lays in precision medicine, with fusion transcripts being one of the most remarkable tool for treatment decision. Transcrits de fusion Tumeurs pédiatriques solides en rechute Séquençage haut débit Oncogène de fusion LMO3-BORCS Fusion transcripts Pediatric solid cancers at relapse High throughput sequencing Fusion oncogene LMO3-BORCS5

Search results