• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 21
  • 5
  • Tagged with
  • 34
  • 34
  • 11
  • 10
  • 9
  • 7
  • 7
  • 6
  • 6
  • 5
  • 5
  • 4
  • 4
  • 4
  • 4
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
21

Similarity and difference between systemic lupus erythematosus and NZB/W F1 mice by multi-omics analysis / マルチオミクス解析による全身性エリテマトーデスとNZB/W F1 マウスの類似性と相違性

大熊, 賢司 23 May 2024 (has links)
京都大学 / 新制・論文博士 / 博士(医学) / 乙第13630号 / 論医博第2321号 / 新制||医||1074(附属図書館) / (主査)教授 伊藤 能永, 教授 浅野 雅秀, 教授 濵﨑 洋子 / 学位規則第4条第2項該当 / Doctor of Medical Science / Kyoto University / DFAM
22

Étude des gènes de réponse aux terres rares chez des organismes modèles / Study of rare earth element responsive genes in model organisms

Grosjean, Nicolas 26 June 2019 (has links)
Les terres rares (TRs) sont des métaux stratégiques du XXIe siècle dont la demande croissante résulte de leurs propriétés essentielles, notamment dans les domaines des énergies renouvelables, de la médecine et des hautes technologies. Ils sont classés en TR lourdes (HTRs), terres légères légères (LTRs) et non-lanthanides. Leur dissémination dans l'environnement, associée à une faible recyclabilité, fait des TRs des contaminants émergents pour lesquels les études de toxicité sont jusqu'à présent très disparates. Afin d’établir une base générale de la réponse cellulaire et moléculaire à un stress TRs, nous avons premièrement utilisé des stratégies complémentaires et à haut débit pour étudier la réponse, ainsi que l’absorption des TRs chez le modèle eucaryote Saccharomyces cerevisiae. Les deletome, transcriptome, protéome et ionome de la levure ont été analysés et approfondis par des analyses physiologiques ciblées. Bien que des réponses communes aux TRs et à d'autres métaux aient été mises en évidence, les réponses spécifiques aux TRs étaient prédominantes. La composition de la paroi cellulaire, la biosynthèse des sphingolipides, la voie ESCRT et l'endocytose sont des éléments clés de la réponse aux TRs. Deuxièmement, nous avons exploré les effets des TRs sur le transcriptome et le ionome du modèle végétal Arabidopsis thaliana. L'exposition des TRs a négativement impacté l'architecture racinaire, comme l'a révélé la modulation de gènes liés à l'auxine. De plus, le ionome a été modifié et les gènes liés à une carence en Fe largement représentés parmi les gènes les plus différentiellement exprimés. Afin d'identifier de nouvelles plantes modèles accumulant des TRs, des espèces de Phytolacca et de fougères ont été criblées. Malgré un trait d’accumulation des TRs conservé chez quelques genres de fougères et Phytolacca, un enrichissement en HTRs chez Phytolacca et en LTRs chez les fougères a été observé. Cependant, plusieurs espèces de Dryopteris présentent des teneurs contrastées en TRs dans les frondes et représentent de nouveaux modèles pertinents pour décrypter les mécanismes d’accumulation et de tolérance aux TRs. Globalement, des divergences ont été mises en évidence dans la réponse aux différentes TRs, en fonction de leur rayon ionique. La composition de la paroi cellulaire, la détoxification vacuolaire, mais aussi l’accumulation et le fractionnement des TRs ont souligné ces différences. Nous avons confirmé que les LTRs empruntaient les canaux calciques, tandis que de nouvelles preuves ont été données sur le rôle des transporteurs de Fe dans l'accumulation de HTRs. En conclusion, nous apportons ici de nouveaux éléments sur la toxicité et les spécificités des TRs, ainsi que des explications moléculaires pour certains effets déjà connus. Ce travail constitue un premier travail de base complet et multi-approches pour de futures études afin d’approfondir la compréhension de la toxicité des TRs chez les organismes vivants. / Rare earth elements (REEs) are strategic metals whose demand in the 21st century is increasing as a result of their essential properties useful to the fields of renewable energies, medicine, and high-technologies. They are classified as heavy REEs (HREEs), light REEs (LREEs) and non-lanthanides. Their dissemination in the environment, together with poor recyclability, leads REEs to be considered emerging contaminants, for which toxicity studies are currently very fragmented. To build a strong general foundation on the cellular and molecular response to REEs, we first adopted high-throughput and complementary strategies to study the REE stress response and their uptake in the unicellular eukaryotic model Saccharomyces cerevisiae. The deletome, transcriptome, proteome and ionome of yeast were analysed together with in-depth physiological experiments. Although common responses between REEs and other metals were highlighted, REE-specific responses were predominant. Cell wall composition, sphingolipid biosynthesis, the ESCRT pathway and endocytosis were emphasized as key elements in the cellular response to REEs. Second, we explored how REEs affect the transcriptome and ionome of the plant model Arabidopsis thaliana. REE exposure negatively affected the root architecture, as revealed by the modulation of auxin-related genes. REEs impaired the ionome, and Fe deficiency-related genes were largely represented among the most differentially expressed genes in both roots and leaves. Additionally, to identify new REE-accumulating plant models, collections of ferns and Phytolacca species were screened. Despite a conserved REE accumulation trait for Phytolacca and a few fern genera, HREE enrichment was observed in Phytolacca, while LREEs were preferentially transferred into the fronds of all fern species. However, several Dryopteris species harboured contrasting REE contents in the fronds. The latter species will be of great importance in deciphering the mechanisms of REE accumulation and tolerance. Overall, the response towards REEs differed according to their ionic radius. The cell wall composition, vacuolar detoxification, and the accumulation and fractionation of REEs notably accounted for these differences. Our findings support LREE-mediated entry through calcium channels, while new evidence was provided for the role of Fe transporters in the accumulation of HREEs. In conclusion, we have provided new insights into REE toxicity and specificities, together with the molecular elucidation of REE effects that have not previously been mechanistically explained. This work is a first multi-approach comprehensive groundwork that will be used for future studies to deepen the understanding and assessment of REE toxicity in organisms.
23

Inclusion of Kinetic Proteomics in Multi-Omics Methods to Analyze Calorie Restriction Effects on Aging

Carson, Richard Hajime 06 December 2019 (has links)
One of the greatest risk factors for disease is advanced age. As the human lifespan has increased, so too have the burdens of caring for an increasingly older population suffering from rising rates of cardiovascular disease, kidney disease, diabetes, and dementia. The need for improving medical technology and developing new therapies for age-related diseases is manifest. Yet our understanding of the processes of aging and how to attenuate the effects of aging remains incomplete. Various studies have established calorie restriction as a robust method for extending lifespan in laboratory organisms; however the mechanism is a topic of much debate. Advancing our understanding of calorie restriction holds promise for illuminating biochemical processes involved in the aging process. One of the best explanations for the lifespan extension benefits of calorie restriction is that it improves cellular protein homeostasis (proteostasis), but because proteostasis is dynamic, it can be difficult to measure. We developed a novel combined omics methodology integrating kinetic proteomics, and applied it to a mouse model placed on calorie restriction. Our unbiased approach integrating just three measurements (kinetic proteomics, quantitative proteomics, and transcriptomics) enabled us to characterize the synthesis and degradation of thousands of proteins, and determine that calorie restriction largely alters proteostasis by slowing global protein synthesis post-transcriptionally. Validating our omics approach, we were able to replicate many previous results found in the literature, demonstrating the differential regulation of various protein ontologies in response to the nutrient stress of calorie restriction. Moreover, we were able to detect differential degradation of the large and small ribosomal subunits under calorie restriction, and proposed a model in which the rate of protein synthesis could be attenuated by the depletion of the large ribosomal subunit relative to the small subunit. The flexibility of our dynamic combined omics approach was demonstrated by the expansion of measurements to include nucleic acids and lipids. Flux measurements of DNA, ribosomal RNA, and lipids yielded cellular division rates, ribosome turnover, and lipid metabolism insights, respectively. We also adapted this approach to two-dimensional tissue imaging by DESI-MS in a proof-of-concept study to demonstrate its utility for studying regional differences in metabolism. The future integration of metabolomics and lipidomics into our combined omics approach would be facile, and add unprecedented depth to systems-wide studies involving cellular metabolism. Applied to the regulation of cellular homeostasis in humans, this has the potential to open new avenues for elucidating the etiology of aging, understanding the pathology of age-related diseases, and identifying novel targets for therapeutics.
24

Mass Spectrometry-Based Metabolomics and Protein Native Structure Characterization to Improve Intervention in Salmonellosis and Proteomics-based Biomarker Characterization in Invasive Aspergillosis

Wu, Jikang, Dr. January 2018 (has links)
No description available.
25

AI for Omics and Imaging Models in Precision Medicine and Toxicology

Bussola, Nicole 01 July 2022 (has links)
This thesis develops an Artificial Intelligence (AI) approach intended for accurate patient stratification and precise diagnostics/prognostics in clinical and preclinical applications. The rapid advance in high throughput technologies and bioinformatics tools is still far from linking precisely the genome-phenotype interactions with the biological mechanisms that underlie pathophysiological conditions. In practice, the incomplete knowledge on individual heterogeneity in complex diseases keeps forcing clinicians to settle for surrogate endpoints and therapies based on a generic one-size-fits-all approach. The working hypothesis is that AI can add new tools to elaborate and integrate together in new features or structures the rich information now available from high-throughput omics and bioimaging data, and that such re- structured information can be applied through predictive models for the precision medicine paradigm, thus favoring the creation of safer tailored treatments for specific patient subgroups. The computational techniques in this thesis are based on the combination of dimensionality reduction methods with Deep Learning (DL) architectures to learn meaningful transformations between the input and the predictive endpoint space. The rationale is that such transformations can introduce intermediate spaces offering more succinct representations, where data from different sources are summarized. The research goal was attacked at increasing levels of complexity, starting from single input modalities (omics and bioimaging of different types and scales), to their multimodal integration. The approach also deals with the key challenges for machine learning (ML) on biomedical data, i.e. reproducibility, stability, and interpretability of the models. Along this path, the thesis contribution is thus the development of a set of specialized AI models and a core framework of three tools of general applicability: i. A Data Analysis Plan (DAP) for model selection and evaluation of classifiers on omics and imaging data to avoid selection bias. ii. The histolab Python package that standardizes the reproducible pre-processing of Whole Slide Images (WSIs), supported by automated testing and easily integrable in DL pipelines for Digital Pathology. iii. Unsupervised and dimensionality reduction techniques based on the UMAP and TDA frameworks for patient subtyping. The framework has been successfully applied on public as well as original data in precision oncology and predictive toxicology. In the clinical setting, this thesis has developed1: 1. (DAPPER) A deep learning framework for evaluation of predictive models in Digital Pathology that controls for selection bias through properly designed data partitioning schemes. 2. (RADLER) A unified deep learning framework that combines radiomics fea- tures and imaging on PET-CT images for prognostic biomarker development in head and neck squamous cell carcinoma. The mixed deep learning/radiomics approach is more accurate than using only one feature type. 3. An ML framework for automated quantification tumor infiltrating lymphocytes (TILs) in onco-immunology, validated on original pathology Neuroblastoma data of the Bambino Gesu’ Children’s Hospital, with high agreement with trained pathologists. The network-based INF pipeline, which applies machine learning models over the combination of multiple omics layers, also providing compact biomarker signatures. INF was validated on three TCGA oncogenomic datasets. In the preclinical setting the framework has been applied for: 1. Deep and machine learning algorithms to predict DILI status from gene expression (GE) data derived from cancer cell lines on the CMap Drug Safety dataset. 2. (ML4TOX) Deep Learning and Support Vector Machine models to predict potential endocrine disruption of environmental chemicals on the CERAPP dataset. 3. (PathologAI) A deep learning pipeline combining generative and convolutional models for preclinical digital pathology. Developed as an internal project within the FDA/NCTR AIRForce initiative and applied to predict necrosis on images from the TG-GATEs project, PathologAI aims to improve accuracy and reduce labor in the identification of lesions in predictive toxicology. Furthermore, GE microarray data were integrated with histology features in a unified multi-modal scheme combining imaging and omics data. The solutions were developed in collaboration with domain experts and considered promising for application.
26

<b>Systems Modeling of host microbiome interactions in Inflammatory Bowel Diseases</b>

Javier E Munoz (18431688) 24 April 2024 (has links)
<p dir="ltr">Crohn’s disease and ulcerative colitis are chronic inflammatory bowel diseases (IBD) with a rising global prevalence, influenced by clinical and demographics factors. The pathogenesis of IBD involves complex interactions between gut microbiome dysbiosis, epithelial cell barrier disruption, and immune hyperactivity, which are poorly understood. This necessitates the development of novel approaches to integrate and model multiple clinical and molecular data modalities from patients, animal models, and <i>in-vitro</i> systems to discover effective biomarkers for disease progression and drug response. As sequencing technologies advance, the amount of molecular and compositional data from paired measurements of host and microbiome systems is exploding. While it is become routine to generate such rich, deep datasets, tools for their interpretation lag behind. Here, I present a computational framework for integrative modeling of microbiome multi-omics data titled: Latent Interacting Variable Effects (LIVE) modeling. LIVE combines various types of microbiome multi-omics data using single-omic latent variables (LV) into a structured meta-model to determine the most predictive combinations of multi-omics features predicting an outcome, patient group, or phenotype. I implemented and tested LIVE using publicly available metagenomic and metabolomics data set from Crohn’s Disease (CD) and ulcerative colitis (UC) status patients in the PRISM and LLDeep cohorts. The findings show that LIVE reduced the number of features interactions from the original datasets for CD to tractable numbers and facilitated prioritization of biological associations between microbes, metabolites, enzymes, clinical variables, and a disease status outcome. LIVE modeling makes a distinct and complementary contribution to the current methods to integrate microbiome data to predict IBD status because of its flexibility to adapt to different types of microbiome multi-omics data, scalability for large and small cohort studies via reliance on latent variables and dimensionality reduction, and the intuitive interpretability of the meta-model integrating -omic data types.</p><p dir="ltr">A novel application of LIVE modeling framework was associated with sex-based differences in UC. Men are 20% more likely to develop this condition and 60% more likely to progress to colitis-associated cancer compared to women. A possible explanation for this observation is differences in estrogen signaling among men and women in which estrogen signaling may be protective against UC. Extracting causal insights into how gut microbes and metabolites regulate host estrogen receptor β (ERβ) signaling can facilitate the study of the gut microbiome’s effects on ERβ’s protective role against UC. Supervised LIVE models<b> </b>ERβ signaling using high-dimensional gut microbiome data by controlling clinical covariates such as: sex and disease status. LIVE models predicted an inhibitory effect on ER-UP and ER-DOWN signaling activities by pairs of gut microbiome features, generating a novel of catalog of metabolites, microbial species and their interactions, capable of modulating ER. Two strongly positively correlated gut microbiome features: <i>Ruminoccocus gnavus</i><i> </i>with acesulfame and <i>Eubacterium rectale</i><i> </i>with 4-Methylcatechol were prioritized as suppressors ER-UP and ER-DOWN signaling activities. An <i>in-vitro</i> experimental validation roadmap is proposed to study the synergistic relationships between metabolites and microbiota suppressors of ERβ signaling in the context of UC. Two i<i>n-vitro</i> systems, HT-29 female colon cancer cell and female epithelial gut organoids are described to evaluate the effect of gut microbiome on ERβ signaling. A detailed experimentation is described per each system including the selection of doses, treatments, metrics, potential interpretations and limitations. This experimental roadmap attempts to compare experimental conditions to study the inhibitory effects of gut microbiome on ERβ signaling and how it could elevate or reduce the risk of developing UC. The intuitive interpretability of the meta-model integrating -omic data types in conjunction with the presented experimental validation roadmap aim to transform an artificial intelligence-generated big data hypothesis into testable experimental predictions.</p>
27

Network-Based Multi-Omics Approaches for Precision Cardio-Oncology: Pathobiology, Drug Repurposing and Functional Testing

Lal, Jessica Castrillon 26 May 2023 (has links)
No description available.
28

Epigenetic Responses of Arabidopsis to Abiotic Stress

Laliberte, Suzanne Rae 17 March 2023 (has links)
Weed resistance to control measures, particularly herbicides, is a growing problem in agriculture. In the case of herbicides, resistance is sometimes connected to genetic changes that directly affect the target site of the herbicide. Other cases are less straightforward where resistance arises without such a clear-cut mechanism. Understanding the genetic and gene regulatory mechanisms that may lead to the rapid evolution of resistance in weedy species is critical to securing our food supply. To study this phenomenon, we exposed young Arabidopsis plants to sublethal levels of one of four weed management stressors, glyphosate herbicide, trifloxysulfuron herbicide, mechanical clipping, and shading. To evaluate responses to these stressors we collected data on gene expression and regulation via epigenetic modification (methylation) and small RNA (sRNA). For all of the treatments except shade, the stress was limited in duration, and the plants were allowed to recover until flowering, to identify changes that persist to reproduction. At flowering, DNA for methylation bisulfite sequencing, RNA, and sRNA were extracted from newly formed rosette leaf tissue. Analyzing the individual datasets revealed many differential responses when compared to the untreated control for gene expression, methylation, and sRNA expression. All three measures showed increases in differential abundance that were unique to each stressor, with very little overlap between stressors. Herbicide treatments tended to exhibit the largest number of significant differential responses, with glyphosate treatment most often associated with the greatest differences and contributing to overlap. To evaluate how large datasets from methylation, gene expression, and sRNA analyses could be connected and mined to link regulatory information with changes in gene expression, the information from each dataset and for each gene was united in a single large matrix and mined with classification algorithms. Although our models were able to differentiate patterns in a set of simulated data, the raw datasets were too noisy for the models to consistently identify differentially expressed genes. However, by focusing on responses at a local level, we identified several genes with differential expression, differential sRNA, and differential methylation. While further studies will be needed to determine whether these epigenetic changes truly influence gene expression at these sites, the changes detected at the treatment level could prime the plants for future incidents of stress, including herbicides. / Doctor of Philosophy / Growing resistance to herbicides, particularly glyphosate, is one of the many problems facing agriculture. The rapid rise of resistance across herbicide classes has caused some to wonder if there is a mechanism of adaptation that does not involve mutations. Epigenetics is the study of changes in the phenotype that cannot be attributed to changes in the genotype. Typically, studies revolve around two features of the chromosomes: cytosine methylation and histone modifications. The former can influence how proteins interact with DNA, and the latter can influence protein access to DNA. Both can affect each other in self-reinforcing loops. They can affect gene expression, and DNA methylation can be directed by small RNA (sRNA), which can also influence gene expression through other pathways. To study these processes and their role in abiotic stress response, we aimed to analyze sRNA, RNA, and DNA from Arabidopsis thaliana plants under stress. The stresses applied were sublethal doses of the herbicides, glyphosate and trifloxysulfuron, as well as mechanical clipping and shade to represent other weed management stressors. The focus of the project was to analyze these responses individually and together to find epigenetic responses to stresses routinely encountered by weeds. We tested RNA for gene expression changes under our stress conditions and identified many, including some pertaining to DNA methylation regulation. The herbicide treatments were associated with upregulated defense genes and downregulated growth genes. Shade treated plants had many downregulated defense and other stress response genes. We also detected differential methylation and sRNA responses when compared to the control plants. Changes to methylation and sRNA only accounted for about 20% of the variation in gene expression. While attempting to link the epigenetic process of methylation to gene expression, we connected all the data sets and developed computer programs to try to make correlations. While these methods worked on a simulated dataset, we did not detect broad patterns of changes to epigenetic pathways that correlated strongly with gene expression in our experiment's data. There are many factors that can influence gene expression that could create noise that would hinder the algorithms' abilities to detect differentially expressed genes. This does not, however, rule out the possibility of epigenetic influence on gene expression in local contexts. Through scoring the traits of individual genes, we found several that interest us for future studies.
29

Computational Modeling of Planktonic and Biofilm Metabolism

Guo, Weihua 16 October 2017 (has links)
Most of microorganisms are ubiquitously able to live in both planktonic and biofilm states, which can be applied to dissolve the energy and environmental issues (e.g., producing biofuels and purifying waste water), but can also lead to serious public health problems. To better harness microorganisms, plenty of studies have been implemented to investigate the metabolism of planktonic and/or biofilm cells via multi-omics approaches (e.g., transcriptomics and proteomics analysis). However, these approaches are limited to provide the direct description of intracellular metabolism (e.g., metabolic fluxes) of microorganisms. Therefore, in this study, I have applied computational modeling approaches (i.e., 13C assisted pathway and flux analysis, flux balance analysis, and machine learning) to both planktonic and biofilm cells for better understanding intracellular metabolisms and providing valuable biological insights. First, I have summarized recent advances in synergizing 13C assisted pathway and flux analysis and metabolic engineering. Second, I have applied 13C assisted pathway and flux analysis to investigate the intracellular metabolisms of planktonic and biofilm cells. Various biological insights have been elucidated, including the metabolic responses under mixed stresses in the planktonic states, the metabolic rewiring in homogenous and heterologous chemical biosynthesis, key pathways of biofilm cells for electricity generation, and mechanisms behind the electricity generation. Third, I have developed a novel platform (i.e., omFBA) to integrate multi-omics data with flux balance analysis for accurate prediction of biological insights (e.g., key flux ratios) of both planktonic and biofilm cells. Fourth, I have designed a computational tool (i.e., CRISTINES) for the advanced genome editing tool (i.e., CRISPR-dCas9 system) to facilitate the sequence designs of guide RNA for programmable control of metabolic fluxes. Lastly, I have also accomplished several outreaches in metabolic engineering. In summary, during my Ph.D. training, I have systematically applied computational modeling approaches to investigate the microbial metabolisms in both planktonic and biofilm states. The biological findings and computational tools can be utilized to guide the scientists and engineers to derive more productive microorganisms via metabolic engineering and synthetic biology. In the future, I will apply 13C assisted pathway analysis to investigate the metabolism of pathogenic biofilm cells for reducing their antibiotic resistance. / Ph. D. / Most of microorganisms are ubiquitously able to live in both planktonic and biofilm states (i.e., floating in a flow and anchoring on a surface, respectively), which can be applied to dissolve the energy and environmental issues (e.g., producing biofuels and purifying waste water), but can also lead to serious public health problems (e.g., chronic infections). Therefore, deciphering the metabolism of both planktonic and biofilm cells are of great importance to better harness microorganism. Plenty of studies have been implemented to investigate the metabolism of planktonic and/or biofilm cells by measuring the abundances of single type of biological components (e.g., gene expression and proteins). However, these approaches are limited to provide the direct description of intracellular metabolism (e.g., enzyme activities) of microorganisms. Therefore, in this study, I have applied computational modeling approaches to both planktonic and biofilm cells for providing valuable biological insights (e.g., enzyme activities). The biological insights include 1) how planktonic cells response to mixed stresses (e.g., acids and organics) 2) how planktonic cells produce various chemicals, and 3) how biofilm cells generate electricity by rewiring the intracellular metabolic pathways. I also developed a novel platform to utilize multiple types of biological data for improving the prediction accuracy of biological insights of both planktonic and biofilm cells. In addition, I designed a computational tool to facilitate the sequence designs of an advanced genome editing tool for precisely controlling the corresponding enzyme activities. Lastly, I have also accomplished several outreaches in metabolic engineering. In summary, during my Ph.D. training, I have systematically applied computational modeling approaches to investigate the microbial metabolisms in both planktonic and biofilm states. The biological findings and computational tools can be utilized to guide the metabolic engineered to derive more productive microorganisms via metabolic engineering and synthetic biology. In the future, I plan to investigate how the pathogenic biofilm cells improve their antibiotic resistance and attempt to reduce such strong resistance.
30

Integrative analysis of data from multiple experiments

Ronen, Jonathan 22 July 2020 (has links)
Auf die Entwicklung der Hochdurchsatz-Sequenzierung (HTS) folgte eine Reihe von speziellen Erweiterungen, die erlauben verschiedene zellbiologischer Aspekte wie Genexpression, DNA-Methylierung, etc. zu messen. Die Analyse dieser Daten erfordert die Entwicklung von Algorithmen, die einzelne Experimenteberücksichtigen oder mehrere Datenquellen gleichzeitig in betracht nehmen. Der letztere Ansatz bietet besondere Vorteile bei Analyse von einzelligen RNA-Sequenzierung (scRNA-seq) Experimenten welche von besonders hohem technischen Rauschen, etwa durch den Verlust an Molekülen durch die Behandlung geringer Ausgangsmengen, gekennzeichnet sind. Um diese experimentellen Defizite auszugleichen, habe ich eine Methode namens netSmooth entwickelt, welche die scRNA-seq-Daten entrascht und fehlende Werte mittels Netzwerkdiffusion über ein Gennetzwerk imputiert. Das Gennetzwerk reflektiert dabei erwartete Koexpressionsmuster von Genen. Unter Verwendung eines Gennetzwerks, das aus Protein-Protein-Interaktionen aufgebaut ist, zeige ich, dass netSmooth anderen hochmodernen scRNA-Seq-Imputationsmethoden bei der Identifizierung von Blutzelltypen in der Hämatopoese, zur Aufklärung von Zeitreihendaten unter Verwendung eines embryonalen Entwicklungsdatensatzes und für die Identifizierung von Tumoren der Herkunft für scRNA-Seq von Glioblastomen überlegen ist. netSmooth hat einen freien Parameter, die Diffusionsdistanz, welche durch datengesteuerte Metriken optimiert werden kann. So kann netSmooth auch dann eingesetzt werden, wenn der optimale Diffusionsabstand nicht explizit mit Hilfe von externen Referenzdaten optimiert werden kann. Eine integrierte Analyse ist auch relevant wenn multi-omics Daten von mehrerer Omics-Protokolle auf den gleichen biologischen Proben erhoben wurden. Hierbei erklärt jeder einzelne dieser Datensätze nur einen Teil des zellulären Systems, während die gemeinsame Analyse ein vollständigeres Bild ergibt. Ich entwickelte eine Methode namens maui, um eine latente Faktordarstellungen von multiomics Daten zu finden. / The development of high throughput sequencing (HTS) was followed by a swarm of protocols utilizing HTS to measure different molecular aspects such as gene expression (transcriptome), DNA methylation (methylome) and more. This opened opportunities for developments of data analysis algorithms and procedures that consider data produced by different experiments. Considering data from seemingly unrelated experiments is particularly beneficial for Single cell RNA sequencing (scRNA-seq). scRNA-seq produces particularly noisy data, due to loss of nucleic acids when handling the small amounts in single cells, and various technical biases. To address these challenges, I developed a method called netSmooth, which de-noises and imputes scRNA-seq data by applying network diffusion over a gene network which encodes expectations of co-expression patterns. The gene network is constructed from other experimental data. Using a gene network constructed from protein-protein interactions, I show that netSmooth outperforms other state-of-the-art scRNA-seq imputation methods at the identification of blood cell types in hematopoiesis, as well as elucidation of time series data in an embryonic development dataset, and identification of tumor of origin for scRNA-seq of glioblastomas. netSmooth has a free parameter, the diffusion distance, which I show can be selected using data-driven metrics. Thus, netSmooth may be used even in cases when the diffusion distance cannot be optimized explicitly using ground-truth labels. Another task which requires in-tandem analysis of data from different experiments arises when different omics protocols are applied to the same biological samples. Analyzing such multiomics data in an integrated fashion, rather than each data type (RNA-seq, DNA-seq, etc.) on its own, is benefitial, as each omics experiment only elucidates part of an integrated cellular system. The simultaneous analysis may reveal a comprehensive view.

Page generated in 0.0467 seconds