• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 2
  • 2
  • Tagged with
  • 4
  • 4
  • 3
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Accurate and Reliable Cancer Classi cation Based on Pathway-Markers and Subnetwork-Markers

Su, Junjie 2010 December 1900 (has links)
Finding reliable gene markers for accurate disease classification is very challenging due to a number of reasons, including the small sample size of typical clinical data, high noise in gene expression measurements, and the heterogeneity across patients. In fact, gene markers identified in independent studies often do not coincide with each other, suggesting that many of the predicted markers may have no biological significance and may be simply artifacts of the analyzed dataset. To nd more reliable and reproducible diagnostic markers, several studies proposed to analyze the gene expression data at the level of groups of functionally related genes, such as pathways. Given a set of known pathways, these methods estimate the activity level of each pathway by summarizing the expression values of its member genes and using the pathway activities for classification. One practical problem of the pathway-based approach is the limited coverage of genes by currently known pathways. As a result, potentially important genes that play critical roles in cancer development may be excluded. In this thesis, we first propose a probabilistic model to infer pathway/subnetwork activities. After that, we developed a novel method for identifying reliable subnetwork markers in a human protein-protein interaction (PPI) network based on probabilistic inference of subnetwork activities. We tested the proposed methods based on two independent breast cancer datasets. The proposed method can efficiently find reliable subnetwork markers that outperform the gene-based and pathway-based markers in terms of discriminative power, reproducibility and classification performance. The identified subnetwork markers are highly enriched in common GO terms, and they can more accurately classify breast cancer metastasis compared to markers found by a previous method.
2

Dimensionality Reduction, Feature Selection and Visualization of Biological Data

Ha, Sook Shin 14 September 2012 (has links)
Due to the high dimensionality of most biological data, it is a difficult task to directly analyze, model and visualize the data to gain biological insight. Thus, dimensionality reduction becomes an imperative pre-processing step in analyzing and visualizing high-dimensional biological data. Two major approaches to dimensionality reduction in genomic analysis and biomarker identification studies are: Feature extraction, creating new features by combining existing ones based on a mapping technique; and feature selection, choosing an optimal subset of all features based on an objective function. In this dissertation, we show how our innovative reduction schemes effectively reduce the dimensionality of DNA gene expression data to extract biologically interpretable and relevant features which result in enhancing the biomarker identification process. To construct biologically interpretable features and facilitate Muscular Dystrophy (MD) subtypes classification, we extract molecular features from MD microarray data by constructing sub-networks using a novel integrative scheme which utilizes protein-protein interaction (PPI) network, functional gene sets information and mRNA profiling data. The workflow includes three major steps: First, by combining PPI network structure and gene-gene co-expression relationship into a new distance metric, we apply affinity propagation clustering (APC) to build gene sub-networks; secondly, we further incorporate functional gene sets knowledge to complement the physical interaction information; finally, based on the constructed sub-network and gene set features, we apply multi-class support vector machine (MSVM) for MD sub-type classification and highlight the biomarkers contributing to the sub-type prediction. The experimental results show that our scheme could construct sub-networks that are more relevant to MD than those constructed by the conventional approach. Furthermore, our integrative strategy substantially improved the prediction accuracy, especially for those ‘hard-to-classify' sub-types. Conventionally, pathway-based analysis assumes that genes in a pathway equally contribute to a biological function, thus assigning uniform weight to genes. However, this assumption has been proven incorrect and applying uniform weight in the pathway analysis may not be an adequate approach for tasks like molecular classification of diseases, as genes in a functional group may have different differential power. Hence, we propose to use different weights for the pathway analysis which resulted in the development of four weighting schemes. We applied them in two existing pathway analysis methods using both real and simulated gene expression data for pathways. Weighting changes pathway scoring and brings up some new significant pathways, leading to the detection of disease-related genes that are missed under uniform weight. To help us understand our MD expression data better and derive scientific insight from it, we have explored a suite of visualization tools. Particularly, for selected top performing MD sub-networks, we displayed the network view using Cytoscape; functional annotations using IPA and DAVID functional analysis tools; expression pattern using heat-map and parallel coordinates plot; and MD associated pathways using KEGG pathway diagrams. We also performed weighted MD pathway analysis, and identified overlapping sub-networks across different weight schemes and different MD subtypes using Venn Diagrams, which resulted in the identification of a new sub-network significantly associated with MD. All those graphically displayed data and information helped us understand our MD data and the MD subtypes better, resulting in the identification of several potentially MD associated biomarker pathways and genes. / Ph. D.
3

Characterization of phosphorylation-dependent interactions involving neurofibromin 2 (NF2, merlin) isoforms and the Parkinson protein 7 (PARK7, DJ1)

Worseck, Josephine Maria 19 June 2012 (has links)
Veränderungen in phosphorylierungsabhängigen Signalwegen, Akkumulation von Proteinaggregaten im Gehirn und neuronaler Zelltod sind Neurodegenerationskennzeichen und Indikatoren für überlappende molekulare Mechanismen. Um Einblicke in die involvierten Signalwege zu erhalten, wurde mit Hilfe eines modifizierten Hefe-Zwei-Hybrid (Y2H)-Systems für 71 Proteine, die mit neurologischen Erkrankungen assoziiert sind, proteomweit nach Protein-Protein Interaktionen (PPIs) gesucht. Für 21 dieser Proteine wurden PPIs identifiziert. Das Gesamtnetzwerk besteht aus 79 Proteinen und 90 PPIs von denen 5 phosphorylierungsabhängig sind. Ein Teil dieser PPIs wurde in unabhängigen Interaktionsassays mit einer Validierungsrate von 66 % getestet. Der netzwerkbasierte Versuch verbindet erfolgreich neurologische Erkrankungen untereinander aber auch mit zellulären Prozessen. Ser/Thr-Kinase abhängige PPIs verknüpfen zum Beispiel das Parkinson Protein 7 (PARK7, DJ1) mit den E3 Ligase Komponenten ASB3 und RNF31 (HOIP). Die Funktion dieser Proteine bekräftigt den Zusammenhang zwischen dem Ubiquitin-Proteasom-System und der Parkinson Krankheit (PD). Neurofibromin 2 (NF2, merlin) Isoformen und PARK7 interagieren mit der regulatorischen PI3K Untereinheit p55-gamma (PIK3R3). Diese PPIs basieren auf Tyr-Kinase Aktivität im modifizierten Y2H System und funktionellen PIK3R3 pTyr-Erkennungsmodulen (SH2 Domänen) in co-IP und Venus PCA Versuchen. Dies verknüpft den PI3K/AKT Überlebenssignalweg mit zwei unterschiedlichen neurologischen Erkrankungsphenotypen: dem PD assoziierten neuronalen Zelltod und der Neurofibromatose Typ 2-assoziierten Tumorentstehung. Die vergleichende Beobachtung von PIK3R3, AOF2 (KDM1A, LSD1) Interaktionen auf NF2 Isoformlevel offenbart eine Bevorzugung von Isoform 7 bei zytoplasmatischer Lokalisation, wohingegen Isoform 1 PPIs an der Membran lokalisiert sind. Das modifizierungsabhängige und isoformspezifische PPI Netzwerk ermöglichte neue Hypothesen zu molekularen Pathomechanismen. / Alterations in phosphorylation-dependent signalling pathways, accumulation of aggregated proteins in the brain and neuronal apoptosis are common to neurodegeneration and implicate overlapping molecular mechanism. To gain insight into involved pathways, a modified yeast-two hybrid (Y2H) system was applied to screen 71 proteins associated with neurological disorders in a proteome-wide manner. For 21 of these proteins interactions were identified including 5 phosphorylation-dependent ones. In total, the network connected 79 proteins through 90 protein-protein interactions (PPIs). A fraction of these Y2H PPIs was tested in secondary interaction assays with a validation rate of 66 %. The described network-based approach successfully identified proteins associated with more than one disorder and cellular functions connected to specific disorders. In particular, the network revealed Ser/Thr kinase-dependent PPIs between the Parkinson protein 7 (PARK7, DJ1) and the E3 ligase components ASB3 and RNF31 (HOIP). The function of these proteins further substantiates the established connection between Parkinson’s disease (PD) and ubiquitination-mediated proteasome (dis)functions. Neurofibromin 2 (NF2, merlin) isoforms and PARK7 were identified as PI3K regulatory subunit p55-gamma (PIK3R3) interactors. These PPIs required Tyr kinase coexpression in the modified Y2H system and functional PIK3R3 pTyr-recognition modules (SH2 domains) in co-IP and Venus PCA experiments. This finding implicates the PI3K/AKT survival pathway in PD-associated neuronal apoptosis and Neurofibromatosis type 2-associated tumour formation. Investigation of PIK3R3, AOF2 (KDM1A, LSD1) and EMILIN1 PPIs on NF2 isoform level revealed preferential isoform 7 binding and cytoplasmic or membrane localisation of these PPIs for isoform 7 or 1, respectively. The generated modification-dependent and isoform-specific PPI network triggered many hypotheses on the molecular mechanisms implicated in neurological disorders.
4

A proteome-wide screen utilizing second generation sequencing for the identification of lysine and arginine methyltransferase protein interactions

Weimann, Mareike 13 September 2012 (has links)
Proteinmethylierung spielt eine immer größere Rolle in der Regulierung zellulärer Prozesse. Die Entwicklung effizienter proteomweiter Methoden zur Detektion von Methylierung auf Proteinen ist limitiert und technisch schwierig. In dieser Arbeit haben wir einen neuen Hefe-Zwei-Hybrid-Ansatz (Y2H) entwickelt, der Proteine, die miteinander wechselwirken, mit Hilfe von Sequenzierungen der zweiten Generation identifiziert (Y2H-Seq). Der neue Y2H-Seq-Ansatz wurde systematisch mit dem Y2H-Seq-Ansatz verglichen. Dafür wurde ein Bait-Set von 8 Protein-Arginin-Methyltransferasen, 17 Protein-Lysin-Methyltransferasen und 10 Demethylasen gegen 14,268 Prey-Proteine getestet. Der Y2H-Seq-Ansatz ist weniger arbeitsintensiv, hat eine höhere Sensitivität als der Standard Y2H-Matrix-Ansatz und ist deshalb besonders geeignet, um schwache Interaktionen zwischen Substraten und Protein-Methyltransferasen zu detektieren. Insgesamt wurden 523 Wechselwirkungen zwischen 22 Bait-Proteinen und 324 Prey-Pr oteinen etabliert, darunter 11 bekannte Methyltransferasen-Substrate. Netzwerkanalysen zeigen, dass Methyltransferasen bevorzugt mit Transkriptionsregulatoren, DNA- und RNA-Bindeproteinen wechselwirken. Diese Daten repräsentieren das erste proteomweite Wechselwirkungsnetzwerk über Protein-Methyltransferasen und dienen als Ressource für neue potentielle Methylierungssubstrate. In einem in vitro Methylierungsassay wurden exemplarisch mit Hilfe massenspektrometrischer Analysen die methylierten Aminosäurereste einiger Kandidatenproteine bestimmt. Von neun getesteten Proteinen waren sieben methyliert, zu denen gehören SPIN2B, DNAJA3, QKI, SAMD3, OFCC1, SYNCRIP und WDR42A. Wahrscheinlich sind viele Methylierungssubstrate im Netzwerk vorhanden. Das vorgestellte Protein-Protein-Wechselwirkungsnetzwerk zeigt, dass Proteinmethylierung sehr unterschiedliche zelluläre Prozesse beeinflusst und ermöglicht die Aufstellung neuer Hypothesen über die Regulierung Molekularer Mechanismen durch Methylierung. / Protein methylation on arginine and lysine residues is a largely unexplored posttranslational modification which regulates diverse cellular processes. The development of efficient proteome-wide approaches for detecting protein methylation is limited and technically challenging. We developed a novel workload reduced yeast-two hybrid (Y2H) approach to detect protein-protein interactions utilizing second generation sequencing. The novel Y2H-seq approach was systematically evaluated against our state of the art Y2H-matrix screening approach and used to screen 8 protein arginine methyltransferases, 17 protein lysine methyltransferases and 10 demethylases against a set of 14,268 proteins. Comparison of the two approaches revealed a higher sensitivity of the new Y2H-seq approach. The increased sampling rate of the Y2H-seq approach is advantageous when assaying transient interactions between substrates and methyltransferases. Overall 523 interactions between 22 bait proteins and 324 prey proteins were identified including 11 proteins known to be methylated. Network analysis revealed enrichment of transcription regulator activity, DNA- and RNA-binding function of proteins interacting with protein methyltransferases. The dataset represents the first proteome-wide interaction network of enzymes involved in methylation and provides a comprehensively annotated resource of potential new methylation substrates. An in vitro methylation assay coupled to mass spectrometry revealed amino acid methylation of candidate proteins. Seven of nine proteins tested were methylated including SPIN2B, DNAJA3, QKI, SAMD3, OFCC1, SYNCRIP and WDR42A indicating that the interaction network is likely to contain many putative methyltransferase substrate pairs. The presented protein-protein interaction network demonstrates that protein methylation is involved in diverse cellular processes and can inform hypothesis driven investigation into molecular mechanisms regulated through methylation.

Page generated in 0.0931 seconds