Statistical Analysis of PAR-CLIP data

From creation to its degradation, the RNA molecule is the action field of many binding proteins with different roles in regulation and RNA metabolism. Since these proteins are involved in a large number of processes, a variety of diseases are related to abnormalities occurring within the binding mechanisms. One of the experimental methods for detecting the binding sites of these proteins is PAR-CLIP built on the next generation sequencing technology. Due to its size and intrinsic noise, PAR-CLIP data analysis requires appropriate pre-processing and thorough statistical analysis. The present work has two main goals. First, to develop a modular pipeline for preprocessing PAR-CLIP data and extracting necessary signals for further analysis. Second, to devise a novel statistical model in order to carry out inference about presence of protein binding sites based on the signals extracted in the pre-processing step.

Identiferoai:union.ndltd.org:UPSALLA1/oai:DiVA.org:kth-124347
Date January 2013
CreatorsGolumbeanu, Monica
PublisherKTH, Beräkningsbiologi, CB
Source SetsDiVA Archive at Upsalla University
LanguageEnglish
Detected LanguageEnglish
TypeStudent thesis, info:eu-repo/semantics/bachelorThesis, text
Formatapplication/pdf
Rightsinfo:eu-repo/semantics/openAccess

Page generated in 0.0019 seconds