Spelling suggestions: "subject:"1expression studies"" "subject:"dexpression studies""
1 |
Statistical Methods for Normalization and Analysis of High-Throughput Genomic DataGuennel, Tobias 20 January 2012 (has links)
High-throughput genomic datasets obtained from microarray or sequencing studies have revolutionized the field of molecular biology over the last decade. The complexity of these new technologies also poses new challenges to statisticians to separate biological relevant information from technical noise. Two methods are introduced that address important issues with normalization of array comparative genomic hybridization (aCGH) microarrays and the analysis of RNA sequencing (RNA-Seq) studies. Many studies investigating copy number aberrations at the DNA level for cancer and genetic studies use comparative genomic hybridization (CGH) on oligo arrays. However, aCGH data often suffer from low signal to noise ratios resulting in poor resolution of fine features. Bilke et al. showed that the commonly used running average noise reduction strategy performs poorly when errors are dominated by systematic components. A method called pcaCGH is proposed that significantly reduces noise using a non-parametric regression on technical covariates of probes to estimate systematic bias. Then a robust principal components analysis (PCA) estimates any remaining systematic bias not explained by technical covariates used in the preceding regression. The proposed algorithm is demonstrated on two CGH datasets measuring the NCI-60 cell lines utilizing NimbleGen and Agilent microarrays. The method achieves a nominal error variance reduction of 60%-65% as well as an 2-fold increase in signal to noise ratio on average, resulting in more detailed copy number estimates. Furthermore, correlations of signal intensity ratios of NimbleGen and Agilent arrays are increased by 40% on average, indicating a significant improvement in agreement between the technologies. A second algorithm called gamSeq is introduced to test for differential gene expression in RNA sequencing studies. Limitations of existing methods are outlined and the proposed algorithm is compared to these existing algorithms. Simulation studies and real data are used to show that gamSeq improves upon existing methods with regards to type I error control while maintaining similar or better power for a range of sample sizes for RNA-Seq studies. Furthermore, the proposed method is applied to detect differential 3' UTR usage.
|
2 |
Identificação de genes e vias associadas aos transtornos do espectro autista / Identification of genes and pathways associated to autism spectrum disordersOliveira, Karina Griesi 28 June 2011 (has links)
Os transtornos do espectro autista (TEA) são um grupo de doenças neuropsiquiátricas caracterizadas por um prejuízo na capacidade de comunicação e de interação social e por padrões comportamentais estereotipados. Os TEA são geneticamente heterogêneos o que dificulta a identificação das alterações genéticas que estão contribuindo para estes transtornos. No presente estudo, selecionamos como uma primeira abordagem o estudo de translocações cromossômicas, buscando encontrar genes candidatos para posteriores estudos funcionais. No primeiro caso, uma translocação de novo balanceada envolvendo os cromossomos 2q11 e Xq24, não identificamos nenhum candidato funcional rompido pelos pontos de quebra. Detectamos ainda a presença de uma isodissomia materna do cromossomo 5 nesta paciente. Este resultado sugere que, possivelmente, tanto a translocação cromossômica quanto a isodissomia devem estar contribuindo para a etiologia do TEA nesta paciente, caracterizando este como um caso de efeito poligênico. Já o estudo da translocação de novo balanceada (3,11)(p21,q22) revelou que o gene TRPC6, um canal de cálcio envolvido no desenvolvimento de dendritos e sinapses excitatórias, encontrava-se rompido no cromossomo 11 deste paciente. As análises dos neurônios e células progenitoras neurais deste paciente obtidas através da técnica de reprogramação celular e o estudo global de expressão gênica sugerem fortemente que o rompimento do gene TRPC6 é o fator etiológico do TEA neste caso. Por fim, nós também realizamos um estudo de expressão gênica global de pacientes autistas idiopáticos e verificamos que os genes diferencialmente expressos nestes pacientes estão principalmente envolvidos na regulação da dinâmica do citoesqueleto, indicando que este pode ser o processo biológico comumente afetado nos pacientes autistas. Nosso trabalho mostra que os estudos citogenéticos são importantes para a identificação de genes candidatos para os TEA e reforça a hipótese de que estes transtornos são causados por diferentes variantes genéticas mas que levam ao comprometimento de um processo biológico comum. Acreditamos que o modelo de reprogramação celular contribuirá para o entendimento da implicação de tais processos na etiologia dos TEA. / Autism spectrum disorders (ASD) are a group of neurodevelopmental diseases characterized by impairments in social and communicative skills and repetitive behaviors. The investigation of ASD causes is hampered by the genetic heterogeneity of these neurodevelopmental diseases. In the present study, we mapped the breakpoints associated to chromosomal translocations found in two autistic patients as a first screening approach, trying to identify single candidate genes that could be further investigated by functional analysis. In the first case, a de novo balanced translocation involving the chromosomes 2q11 and Xq24, we did not find any functionally known relevant gene disrupted by the breakpoints but, surprisingly, SNP-array data showed that the patient also presents a maternally inherited isodisomy on chromosome 5. In this case, is possible that ASD is caused by the combination of the molecular results caused by the translocation and the UPD on chromosome 5, which would characterize this case as an example of polygenic effects on ASD etiology. On the other hand, the study of a second case, a boy with a de novo balanced translocation (3;11)(p21;q22), revealed that TRPC6, a calcium channel involved in dendritic spine and excitatory synapse formation, was disrupted by the translocation on chromosome 11. Making use of cellular reprogramming to generate neurons and neuronal progenitor cells from this patient and expression analysis, we demonstrated that TRPC6 disruption can respond for the phenotype seen in this patient. Finally, we also performed a genome-wide expression analysis to investigate idiopathic autistic patients and we verified that ASD DEGs are mainly implicated in cytoskeleton dynamics, suggesting that the regulation of this cellular structure can be one of the common mechanisms of ASD etiology. Our work shows that cytogenetic studies are important for the identification of ASD candidate genes and reinforces the hypothesis that these disorders are caused by different genetic variants that are implicated in a common biological process. We believe that cellular reprogramming will contribute for the understanding of the implication of such biological processes in the etiology of ASD.
|
3 |
Identificação de genes e vias associadas aos transtornos do espectro autista / Identification of genes and pathways associated to autism spectrum disordersKarina Griesi Oliveira 28 June 2011 (has links)
Os transtornos do espectro autista (TEA) são um grupo de doenças neuropsiquiátricas caracterizadas por um prejuízo na capacidade de comunicação e de interação social e por padrões comportamentais estereotipados. Os TEA são geneticamente heterogêneos o que dificulta a identificação das alterações genéticas que estão contribuindo para estes transtornos. No presente estudo, selecionamos como uma primeira abordagem o estudo de translocações cromossômicas, buscando encontrar genes candidatos para posteriores estudos funcionais. No primeiro caso, uma translocação de novo balanceada envolvendo os cromossomos 2q11 e Xq24, não identificamos nenhum candidato funcional rompido pelos pontos de quebra. Detectamos ainda a presença de uma isodissomia materna do cromossomo 5 nesta paciente. Este resultado sugere que, possivelmente, tanto a translocação cromossômica quanto a isodissomia devem estar contribuindo para a etiologia do TEA nesta paciente, caracterizando este como um caso de efeito poligênico. Já o estudo da translocação de novo balanceada (3,11)(p21,q22) revelou que o gene TRPC6, um canal de cálcio envolvido no desenvolvimento de dendritos e sinapses excitatórias, encontrava-se rompido no cromossomo 11 deste paciente. As análises dos neurônios e células progenitoras neurais deste paciente obtidas através da técnica de reprogramação celular e o estudo global de expressão gênica sugerem fortemente que o rompimento do gene TRPC6 é o fator etiológico do TEA neste caso. Por fim, nós também realizamos um estudo de expressão gênica global de pacientes autistas idiopáticos e verificamos que os genes diferencialmente expressos nestes pacientes estão principalmente envolvidos na regulação da dinâmica do citoesqueleto, indicando que este pode ser o processo biológico comumente afetado nos pacientes autistas. Nosso trabalho mostra que os estudos citogenéticos são importantes para a identificação de genes candidatos para os TEA e reforça a hipótese de que estes transtornos são causados por diferentes variantes genéticas mas que levam ao comprometimento de um processo biológico comum. Acreditamos que o modelo de reprogramação celular contribuirá para o entendimento da implicação de tais processos na etiologia dos TEA. / Autism spectrum disorders (ASD) are a group of neurodevelopmental diseases characterized by impairments in social and communicative skills and repetitive behaviors. The investigation of ASD causes is hampered by the genetic heterogeneity of these neurodevelopmental diseases. In the present study, we mapped the breakpoints associated to chromosomal translocations found in two autistic patients as a first screening approach, trying to identify single candidate genes that could be further investigated by functional analysis. In the first case, a de novo balanced translocation involving the chromosomes 2q11 and Xq24, we did not find any functionally known relevant gene disrupted by the breakpoints but, surprisingly, SNP-array data showed that the patient also presents a maternally inherited isodisomy on chromosome 5. In this case, is possible that ASD is caused by the combination of the molecular results caused by the translocation and the UPD on chromosome 5, which would characterize this case as an example of polygenic effects on ASD etiology. On the other hand, the study of a second case, a boy with a de novo balanced translocation (3;11)(p21;q22), revealed that TRPC6, a calcium channel involved in dendritic spine and excitatory synapse formation, was disrupted by the translocation on chromosome 11. Making use of cellular reprogramming to generate neurons and neuronal progenitor cells from this patient and expression analysis, we demonstrated that TRPC6 disruption can respond for the phenotype seen in this patient. Finally, we also performed a genome-wide expression analysis to investigate idiopathic autistic patients and we verified that ASD DEGs are mainly implicated in cytoskeleton dynamics, suggesting that the regulation of this cellular structure can be one of the common mechanisms of ASD etiology. Our work shows that cytogenetic studies are important for the identification of ASD candidate genes and reinforces the hypothesis that these disorders are caused by different genetic variants that are implicated in a common biological process. We believe that cellular reprogramming will contribute for the understanding of the implication of such biological processes in the etiology of ASD.
|
4 |
Protein Interaction networks and their applications to protein characterization and cancer genes predictionAragüés Peleato, Ramón 13 July 2007 (has links)
La importancia de comprender los procesos biológicos ha estimulado el desarrollo de métodos para la detección de interacciones proteína-proteína. Esta tesis presenta PIANA (Protein Interactions And Network Analysis), un programa informático para la integración y el análisis de redes de interacción proteicas. Además, describimos un método que identifica motivos de interacción basándose en que las proteínas con parejas de interacción comunes tienden a interaccionar con esas parejas a través del mismo motivo de interacción. Encontramos que las proteínas altamente conectadas (i.e., hubs) con múltiples motivos tienen mayor probabilidad de ser esenciales para la viabilidad de la célula que los hubs con uno o dos motivos. Finalmente, presentamos un método que predice genes relacionados con cáncer mediante la integración de redes de interacción proteicas, datos de expresión diferenciada y propiedades estructurales, funcionales y evolutivas. El valor de predicción positiva es 71% con sensitividad del 1%, superando a otros métodos usados independientemente. / The importance of understanding cellular processes prompted the development of experimental approaches that detect protein-protein interactions. Here, we describe a software platform called PIANA (Protein Interactions And Network Analysis) that integrates interaction data from multiple sources and automates the analysis of protein interaction networks. Moreover, we describe a method that delineates interacting motifs by relying on the observation that proteins with common interaction partners tend to interact with these partners through the same interacting motif. We find that highly connected proteins (i.e., hubs) with multiple interacting motifs are more likely to be essential for cellular viability than hubs with one or two interacting motifs. Furthermore, we present a method that predicts cancer genes by integrating protein interaction networks, differential expression studies and structural, functional and evolutionary properties. For a sensitivity of 1%, the positive predictive value is 71%, which outperforms the use of any of the methods independently.
|
Page generated in 0.0967 seconds