Global ETD Search


Efektivní metody detekce plagiátů v rozsáhlých dokumentových skladech / Effective methods of plagiarism detection in large document repositories

This work addresses plagiarism detection in large document repositories. It takes into account the real situation that currently needs to be addressed in the university environment in the Czech Republic, and proposes a system that can carry out this analysis in real time while still capturing the widest possible range of plagiarism methods. The main contribution of this work is the definition of so-called unordered n-grams ({n}-grams), which can be used to detect some advanced forms of plagiarism. All cited recommendations relating to the various components of a plagiarism detection system (preprocessing of a document before its insertion into the corpus, the representation of documents in the document store, the identification of potential plagiarism sources, the calculation of similarity rates, and the visualization of the plagiarism analysis) are discussed and appropriately quantified. The result is a set of design parameters for a system that can detect plagiarism in Czech-language documents quickly, accurately, and in most of its forms.
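The abstract does not give the formal definition of {n}-grams, so the following is only a minimal sketch under an assumption: an unordered n-gram is taken here to be a window of n consecutive tokens with the token order discarded. The function names (`ordered_ngrams`, `unordered_ngrams`, `jaccard`) and the use of Jaccard overlap as the similarity rate are illustrative choices, not taken from the thesis. The sketch shows why ignoring order can help against plagiarism that shuffles neighbouring words.

```python
def ordered_ngrams(tokens, n):
    """Classic n-grams: every window of n consecutive tokens, order preserved."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def unordered_ngrams(tokens, n):
    """Assumed {n}-gram: the same windows, but with token order discarded.

    Sorting the window gives a canonical form, so ("b", "a", "c") and
    ("a", "b", "c") map to the same key. This is an assumption about the
    thesis's definition, not a quotation of it.
    """
    return {tuple(sorted(tokens[i:i + n])) for i in range(len(tokens) - n + 1)}


def jaccard(a, b):
    """Jaccard overlap of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


# Hypothetical example: the suspect text swaps the first two words.
original = "a b c d".split()
reordered = "b a c d".split()

ordered_score = jaccard(ordered_ngrams(original, 3), ordered_ngrams(reordered, 3))
unordered_score = jaccard(unordered_ngrams(original, 3), unordered_ngrams(reordered, 3))
print(ordered_score, unordered_score)
```

In this toy case the ordered 3-grams of the two texts share nothing, while the unordered ones still share a window, which is the kind of word-reordering disguise the abstract suggests {n}-grams are meant to catch.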

http://www.nusl.cz/ntk/nusl-19025

Identifier	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:19025
Date	January 2009
Creators	Přibil, Jiří
Contributors	Jiroušek, Radim; Strossa, Petr; Snášel, Václav
Publisher	Vysoká škola ekonomická v Praze
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/doctoralThesis
Rights	info:eu-repo/semantics/restrictedAccess

