Global ETD Search

Return to search

Code duplication and reuse in Jupyter notebooks

Reusing code can expedite software creation, analysis and exploration of data. Expediency can be particularly valuable for users of computational notebooks, where duplication allows them to quickly test hypotheses and iterate over data, without creating code from scratch. In this thesis, I’ll explore the topic of code duplication and the behaviour of code reuse for Jupyter notebooks; quantifying and describing snippets of code and explore potential barriers for reuse. As part of this thesis I conducted two studies into Jupyter notebooks use. In my first study, I mined GitHub repositories, quantifying and describing code duplicates contained within repositories that contained at least one Jupyter notebook. For my second study, I conducted an observational user study using a contextual inquiry, where my participants solved specific tasks using notebooks, while I observed and took notes. The work in this thesis can be categorized as exploratory, since both my studies were aimed at generating hypotheses for which further studies can build upon. My contributions with this thesis is two-fold: a thorough description of code duplicates contained within GitHub repositories and an exploration of the behaviour behind code reuse in Jupyter notebooks. It is my desire that others can build upon this work to provide new tools, addressing some of the issues outlined in this thesis. / Graduate

http://hdl.handle.net/1828/12137

Jupyter

computational notebooks

exploratory programming

Identifer	oai:union.ndltd.org:uvic.ca/oai:dspace.library.uvic.ca:1828/12137
Date	21 September 2020
Creators	Koenzen, Andreas Peter
Contributors	Storey, Margaret-Anne, Ernst, Neil A.
Source Sets	University of Victoria
Language	English, English
Detected Language	English
Type	Thesis
Format	application/pdf
Rights	Available to the World Wide Web

Page generated in 0.002 seconds

Code duplication and reuse in Jupyter notebooks

Description

Links & Downloads

Tags

Additional Fields