• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 2
  • Tagged with
  • 3
  • 3
  • 2
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Data preservation and reproducibility at the LHCb experiment at CERN

Trisovic, Ana January 2018 (has links)
This dissertation presents the first study of data preservation and research reproducibility in data science at the Large Hadron Collider at CERN. In particular, provenance capture of the experimental data and the reproducibility of physics analyses at the LHCb experiment were studied. First, the preservation of the software and hardware dependencies of the LHCb experimental data and simulations was investigated. It was found that the links between the data processing information and the datasets themselves were obscure. In order to document these dependencies, a graph database was designed and implemented. The nodes in the graph represent the data with their processing information, software and computational environment, whilst the edges represent their dependence on the other nodes. The database provides a central place to preserve information that was previously scattered across the LHCb computing infrastructure. Using the developed database, a methodology to recreate the LHCb computational environment and to execute the data processing on the cloud was implemented with the use of virtual containers. It was found that the produced physics events were identical to the official LHCb data, meaning that the system can aid in data preservation. Furthermore, the developed method can be used for outreach purposes, providing a streamlined way for a person external to CERN to process and analyse the LHCb data. Following this, the reproducibility of data analyses was studied. A data provenance tracking service was implemented within the LHCb software framework \textsc{Gaudi}. The service allows analysts to capture their data processing configurations that can be used to reproduce a dataset within the dataset itself. Furthermore, to assess the current status of the reproducibility of LHCb physics analyses, the major parts of an analysis were reproduced by following methods described in publicly and internally available documentation. This study allowed the identification of barriers to reproducibility and specific points where documentation is lacking. With this knowledge, one can specifically target areas that need improvement and encourage practices that would improve reproducibility in the future. Finally, contributions were made to the CERN Analysis Preservation portal, which is a general knowledge preservation framework developed at CERN to be used across all the LHC experiments. In particular, the functionality to preserve source code from git repositories and Docker images in one central location was implemented.
2

Establishing “The Fossil Record”: A Database of Vertebrate Paleontological Sites Across the State of Tennessee

Mclaurine, Sarah 01 May 2024 (has links) (PDF)
Fossil localities across the state of Tennessee and the data related to those sites were compiled from Tennessee Division of Geology Bulletin 84, titled “Tennessee’s Prehistoric Vertebrates,” and stored in a Microsoft Access geodatabase housed by the Department of Collections at the East Tennessee State University Museum of Natural History located at the Gray Fossil Site. Included in the database are forms to enter new site localities, view information about those already entered, view and add data to a master faunal list for the state, view sites repository information and store and add documents that are key-word searchable from the main menu. This database was compiled to give researchers a straightforward and easy to use means of analyzing known information about paleontological sites across the state, with the potential to be expanded worldwide. Conservation of data is crucial and can be lost over time unless data preservation efforts are made.
3

Data Curation Perspectives and Practices of Researchers at Kent State University’s Liquid Crystal Institute: A Case Study

Shakeri, Shadi 27 November 2013 (has links)
No description available.

Page generated in 0.13 seconds