Return to search

Enabling the processing of bioinformatics workflows where data is located through the use of cloud and container technologies

>Magister Scientiae - MSc / The growing size of raw data and the lack of internet communication technology to
keep up with that growth is introducing unique challenges to academic researchers.
This is especially true for those residing in rural areas or countries with sub-par
telecommunication infrastructure. In this project I investigate the usefulness of cloud
computing technology, data analysis workflow languages and portable computation
for institutions that generate data. I introduce the concept of a software solution
that could be used to simplify the way that researchers execute their analysis on
data sets at remote sources, rather than having to move the data. The scope of this
project involved conceptualising and designing a software system to simplify the
use of a cloud environment as well as implementing a working prototype of said
software for the OpenStack cloud computing platform. I conclude that it is possible
to improve the performance of research pipelines by removing the need for
researchers to have operating system or cloud computing knowledge and that utilising
technologies such as this can ease the burden of moving data.

Identiferoai:union.ndltd.org:netd.ac.za/oai:union.ndltd.org:uwc/oai:etd.uwc.ac.za:11394/6767
Date January 2019
Creatorsde Beste, Eugene
ContributorsChristoffels, Alan
PublisherUniversity of the Western Cape
Source SetsSouth African National ETD Portal
LanguageEnglish
Detected LanguageEnglish
RightsUniversity of the Western Cape

Page generated in 0.0019 seconds