Global ETD Search

Return to search

Improve Data Quality By Using Dependencies And Regular Expressions

The objective of this study has been to answer the question of finding ways to improve the quality of database. There exists a lot of problems of the data stored in the database, like missing or spelling errors. To deal with the dirty data in the database, this study adopts the conditional functional dependencies and regular expressions to detect and correct data. Based on the former studies of data cleaning methods, this study considers the more complex conditions of database and combines the efficient algorithms to deal with the data. The study shows that by using these methods, the database’s quality can be improved and considering the complexity of time and space, there still has a lot of things to do to make the data cleaning process more efficiency.

data cleaning

data quality

condition functional dependency

regular expression

Computer Systems

Datorsystem

Identifer	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:miun-35620
Date	January 2018
Creators	Feng, Yuan
Publisher	Mittuniversitetet, Avdelningen för informationssystem och -teknologi
Source Sets	DiVA Archive at Upsalla University
Language	English
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/pdf
Rights	info:eu-repo/semantics/openAccess

Page generated in 0.002 seconds

Improve Data Quality By Using Dependencies And Regular Expressions

Description

Links & Downloads

Tags

Additional Fields