• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 165
  • 8
  • Tagged with
  • 173
  • 173
  • 173
  • 173
  • 173
  • 34
  • 33
  • 19
  • 18
  • 17
  • 17
  • 17
  • 16
  • 12
  • 12
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
51

Computational discovery of DNA methylation patterns as biomarkers of ageing, cancer, and mental disorders : Algorithms and Tools

Torabi Moghadam, Behrooz January 2017 (has links)
Epigenetics refers to the mitotically heritable modifications in gene expression without a change in the genetic code. A combination of molecular, chemical and environmental factors constituting the epigenome is involved, together with the genome, in setting up the unique functionality of each cell type. DNA methylation is the most studied epigenetic mark in mammals, where a methyl group is added to the cytosine in a cytosine-phosphate-guanine dinucleotides or a CpG site. It has been shown to have a major role in various biological phenomena such as chromosome X inactivation, regulation of gene expression, cell differentiation, genomic imprinting. Furthermore, aberrant patterns of DNA methylation have been observed in various diseases including cancer. In this thesis, we have utilized machine learning methods and developed new methods and tools to analyze DNA methylation patterns as a biomarker of ageing, cancer subtyping and mental disorders. In Paper I, we introduced a pipeline of Monte Carlo Feature Selection and rule-base modeling using ROSETTA in order to identify combinations of CpG sites that classify samples in different age intervals based on the DNA methylation levels. The combination of genes that showed up to be acting together, motivated us to develop an interactive pathway browser, named PiiL, to check the methylation status of multiple genes in a pathway. The tool enhances detecting differential patterns of DNA methylation and/or gene expression by quickly assessing large data sets. In Paper III, we developed a novel unsupervised clustering method, methylSaguaro, for analyzing various types of cancers, to detect cancer subtypes based on their DNA methylation patterns. Using this method we confirmed the previously reported findings that challenge the histological grouping of the patients, and proposed new subtypes based on DNA methylation patterns. In Paper IV, we investigated the DNA methylation patterns in a cohort of schizophrenic and healthy samples, using all the methods that were introduced and developed in the first three papers.
52

Detection of artefacts in FFPE-sample sequence data

Swenson, Hugo January 2019 (has links)
Next generation sequencing is increasingly used as a diagnostic tool in the clinical setting. This is driven by the vast increase in molecular targeted therapy, which requires detailed information on what genetic variants are present in patient samples. In the hospital setting, most cancer diagnostics are based on Formalin Fixed Paraffin Embedded (FFPE) samples. The FFPE routine is very beneficial for logistical purposes and for some histopathological analyses, but creates problems for molecular diagnostics based on DNA. These problems derive from sample immersion informalin, which results in DNA fragmentation, interstrand DNA crosslinking and sequence artefacts due to hydrolytic deamination. Distinguishing such artefacts from true somatic variants can be challenging, thus affecting both research and clinical analyses. In order to identify FFPE-artefacts from true variants in next generation sequencing data from FFPE samples, I developed the novelprogram FUSAC (FFPE tissue UMI based Sequence Artefact Classifier) for the facility Clinical Genomics in Uppsala. FUSAC utilizes UniqueMolecular Identifiers (UMI's) to identify and group sequencing reads based on their molecule of origin. By using UMI's to collapse duplicate paired reads into consensus reads, FFPE-artefacts are classified through comparative analysis of the positive and negative strand sequences. My findings indicate that FUSAC can succesfully classify UMI-tagged next generation sequencing reads with FFPE-artefacts, from sequencing reads with true variants. FUSAC thus presents a novel approach in bioinformatic pipelines for studying FFPE-artefacts.
53

Evaluation and visualization of complexity in parameter setting in automotive industry

Lunev, Alexey January 2018 (has links)
Parameter setting is a process primary used to specify in what kind of vehicle an electronic control unit of each type is used. This thesis is targeted to investigate whether the current strategy to measure complexity gives user satisfactory results. The strategy consists of structure-based algorithms that are an essential part of the Complexity Analyzer - a prototype application used to evaluate the complexity.     The results described in this work suggest that the currently implemented algorithms have to be properly defined and adapted to be used in terms of parameter setting. Moreover, the measurements that the algorithms output has been analyzed in more detail making the results easier to interpret.     It has been shown that a typical parameter setting file can be regarded as a tree structure. To measure variation in this structure a new concept, called Path entropy has been formulated, tested and implemented.     The main disadvantage of the original version of the Complexity Analyzer application is its lack of user-friendliness. Therefore, a web version of the application based on Model-View-Controller technique has been developed. Different to the original version it has user interface included and it takes just a couple of seconds to see the visualization of data, compared to the original version where it took several minutes to run the application.
54

Development of a hierarchical k-selecting clustering algorithm – application to allergy.

Malm, Patrik January 2007 (has links)
The objective with this Master’s thesis was to develop, implement and evaluate an iterative procedure for hierarchical clustering with good overall performance which also merges features of certain already described algorithms into a single integrated package. An accordingly built tool was then applied to an allergen IgE-reactivity data set. The finally implemented algorithm uses a hierarchical approach which illustrates the emergence of patterns in the data. At each level of the hierarchical tree a partitional clustering method is used to divide data into k groups, where the number k is decided through application of cluster validation techniques. The cross-reactivity analysis, by means of the new algorithm, largely arrives at anticipated cluster formations in the allergen data, which strengthen results obtained through previous studies on the subject. Notably, though, certain unexpected findings presented in the former analysis where aggregated differently, and more in line with phylogenetic and protein family relationships, by the novel clustering package.
55

Evaluation and Development of Methods for Identification of Biochemical Networks / Evaluering och utveckling av metoder för identifiering av biokemiska nätverk

Jauhiainen, Alexandra January 2005 (has links)
Systems biology is an area concerned with understanding biology on a systems level, where structure and dynamics of the system is in focus. Knowledge about structure and dynamics of biological systems is fundamental information about cells and interactions within cells and also play an increasingly important role in medical applications. System identification deals with the problem of constructing a model of a system from data and an extensive theory of particularly identification of linear systems exists. This is a master thesis in systems biology treating identification of biochemical systems. Methods based on both local parameter perturbation data and time series data have been tested and evaluated in silico. The advantage of local parameter perturbation data methods proved to be that they demand less complex data, but the drawbacks are the reduced information content of this data and sensitivity to noise. Methods employing time series data are generally more robust to noise but the lack of available data limits the use of these methods. The work has been conducted at the Fraunhofer-Chalmers Research Centre for Industrial Mathematics in Göteborg, and at the division of Computational Biology at the Department of Physics and Measurement Technology, Biology, and Chemistry at Linköping University during the autumn of 2004.
56

Building graph models of oncogenesis by using microRNA expression data

Zichner, Thomas January 2008 (has links)
MicroRNAs (miRNAs) are a class of small non-coding RNAs that control gene expression by targeting mRNAs and triggering either translation repression or RNA degradation. Several groups pointed out that miRNAs play a major role in several diseases, including cancer. This is assumed since the expression level of several miRNAs differs between normal and cancerous cells. Further, it has been shown that miRNAs are involved in cell proliferation and cell death. Because of this role it is suspected that miRNAs could serve as biomarkers to improve tumor classification, therapy selection, or prediction of survival. In this context, it is questioned, among other things, whether miRNA deregulations in cancer cells occur according to some pattern or in a rather random order. With this work we contribute to answering this question by adapting two approaches (Beerenwinkel et al. (J Comput Biol, 2005) and Höglund et al. (Gene Chromosome Canc, 2001)), developed to derive graph models of oncogenesis for chromosomal imbalances, to miRNA expression data and applying them to a breast cancer data set. Further, we evaluated the results by comparing them to results derived from randomly altered versions of the used data set. We could show that miRNA deregulations most likely follow a rough temporal order, i.e. some deregulations occur early and some occur late in cancer progression. Thus, it seems to be possible that the expression level of some miRNAs can be used as indicator for the stage of a tumor. Further, our results suggest that the over expression of mir-21 as well as mir-102 are initial events in breast cancer oncogenesis. Additionally, we identified a set of miRNAs showing a cluster-like behavior, i.e. their deregulations often occur together in a tumor, but other deregulations are less frequently present. These miRNAs are let-7d, mir-10b, mir-125a, mir-125b, mir-145, mir-206, and mir-210. Further, we could confirm the strong relationship between the expression of mir-125a and mir-125b.
57

Identifying gene regulatory interactions using functional genomics data

Johansson, Annelie January 2014 (has links)
Previously studies used correlation of DNase I hypersensitivity sites sequencing (DNase-seq) experiments to predict interactions between enhancers and its target promoter gene. We investigate the correlation methods Pearson’s correlation and Mutual Information, using DNase-seq data for 100 cell-types in regions on chromosome one. To assess the performances, we compared our results of correlation scores to Hi-C data from Jin et al. 2013. We showed that the performances are low when comparing it to the Hi-C data, and there is a need of improved correlation metrics. We also demonstrate that the use of Hi-C data as a gold standard is limited, because of its low resolution, and we suggest using another gold standard in further studies.
58

Haplotype Inference as a caseof Maximum Satisfiability : A strategy for identifying multi-individualinversion points in computational phasing

Bergman, Ebba January 2017 (has links)
Phasing genotypes from sequence data is an important step betweendata gathering and downstream analysis in population genetics,disease studies, and multiple other fields. This determination ofthe sequences of markers corresponding to the individualchromosomes can be done on data where the markers are in lowdensity across the chromosome, such as from single nucleotidepolymorphism (SNP) microarrays, or on data with a higher localdensity of markers like in next generation sequencing (NGS). Thesorted markers may then be used for many different analyses anddata processing such as linkage analysis, or inference of missinggenotypes in the process of imputation cnF2freq is a haplotype phasing program that uses an uncommonapproach allowing it to divide big groups of related individualsinto smaller ones. It sets an initial haplotype phase and theniteratively changes it using estimations from Hidden MarkovModels. If a marker is judged to have been placed in the wronghaplotype, a switch needs to be made so that it belongs to thecorrect phase. The objective of this project was to go fromallowing only one individual within a group to be switched in aniteration to allowing multiple switches that are dependent on eachother. The result of this project is a theoretical solution for allowingmultiple dependent switches in cnF2freq, and an implementedsolution using the max-SAT solver toulbar2.
59

Local adaptation of Grauer's gorilla gut microbiome

Bebris, Kristaps January 2017 (has links)
The availability of high-throughput sequencing technologies has enabled metagenomicinvestigations into complex bacterial communities with unprecedented resolution andthroughput. The production of dedicated data sets for metagenomic analyses is, however, acostly process and, frequently, the first research questions focus on the study species itself. Ifthe source material is represented by fecal samples, target capture of host-specific sequencesis applied to enrich the complex DNA mixtures contained within a typical fecal DNA extract.Yet, even after this enrichment, the samples still contain a large amount of environmentalDNA that is usually left unanalysed. In my study I investigate the possibility of using shotgunsequencing data that has been subjected to target enrichment for mtDNA from the hostspecies, Grauer’s gorilla (Gorilla beringei graueri), for further analysis of the microbialcommunity present in these samples. The purpose of these analyses is to study the differencesin the bacterial communities present within a high-altitude Grauer’s gorilla, low-altitudeGrauer’s gorilla, and a sympatric chimpanzee population. Additionally, I explore the adaptivepotential of the gut microbiota within these great ape populations.I evaluated the impact that the enrichment process had on the microbial community by usingpre- and post-capture museum preserved samples. In addition to this, I also analysed the effectof two different extraction methods on the bacterial communities.My results show that the relative abundances of the bacterial taxa remain relatively unaffectedby the enrichment process and the extraction methods. The overall number of taxa is,however, reduced by each additional capture round and is not consistent between theextraction methods. This means that both the enrichment and extraction processes introducebiases that require the usage of abundance-based distance measures for biological inferences.Additionally, even if the data cannot be used to study the bacterial communities in anunbiased manner, it provides useful comparative insights for samples that were treated in thesame fashion.With this background, I used museum and fecal samples to perform cluster analysis to explorethe relationships between the gut microbiota of the three great ape populations. I found thatpopulations cluster by species first, and only then group according to habitat. I further foundthat a bacterial taxon that degrades plant matter is enriched in the gut microbiota of all threegreat ape species, where it could help with the digestion of vegetative foods. Another bacterialtaxon that consumes glucose is enriched in the gut microbiota of the low-altitude gorilla andchimpanzee populations, where it could help with the modulation of the host’s mucosalimmune system, and could point to the availability of fruit in the animals diet. In addition, Ifound a bacterial taxon that is linked with diarrhea in humans to be part of the gut microbiotaof the habituated high-altitude gorilla population, which could indicate that this pathogen hasbeen transmitted to the gorillas from their interaction with humans, or it could be indicative ofthe presence of a contaminated water source.
60

Diffusion in fractal globules / På spaning efter onormal diffusion av biomolekyler i DNA med hjälp av stokastisk simulering

Hariz, Jakob January 2016 (has links)
Recent experiments suggest that the human genome (all of our DNA) is organised as a so-called fractal globule. The fractal globule is a knot--free dense polymer that easily folds and unfolds any genomic locus, for example a group of nearby genes. Proteins often need to locate specific target sites on the DNA, for instance to activate a gene. To understand how proteins move through the DNA polymer, we simulate diffusion of particles through a fractal globule. The fractal globule was generated on a cubic lattice as spheres connected by cylinders. With the structure in place, we simulate particle diffusion and measure how their mean squared displacement ($\langle R^2(t)\rangle$) grows as function of time $t$ for different particle radii. This quantity allows us to better understand how the three dimensional structure of DNA affects the protein's motion. From our simulations we found that $\langle R^2(t)/t\rangle$ is a decaying function when the particle is sufficiently large. This means that the particles diffuse slower than if they were free. Assuming that $\langle R^2(t) \rangle \propto t^\alpha$ for long times, we calculated the growth exponent $\alpha$ as a function of particle radius $r_p$. When $r_p$ is small compared to the average distance between two polymer segments $d$, we find that $\alpha \approx 1$. This means the polymer network does not affect the particle's motion. However, in the opposite limit $r_p\sim d$ we find that $\alpha<1$ which means that the polymer strongly slows down the particle's motion. This behaviour is indicative of sub-diffusive dynamics and has potentially far reaching consequences for target finding processes and biochemical reactions in the cell.

Page generated in 0.0909 seconds