  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Cross-Entropy Approaches To Software Forensics: Source Code Authorship Identification

Stinson, James Thomas 09 December 2011
Identification of source code authorship can be a useful tool in security and forensic investigation, helping to create corroborating evidence that may send a suspected cyber terrorist, hacker, or malicious code writer to jail. In academia, it can also be useful to professors who suspect students of academic dishonesty, plagiarism, or modification of source code in programming assignments. The purpose of this dissertation is to determine whether cross-entropy approaches to source code authorship analysis succeed in predicting the correct author of a given piece of source code. If so, this work will try to identify the factors that affect the accuracy of the algorithm, how programmer experience affects accuracy, and whether a cross-entropy approach performs better than some known source code authorship approaches. The research constructs a corpus of source code written by various authors, based on both identical and varying system descriptions, against which the different approaches can be benchmarked.
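The cross-entropy idea in the abstract above can be illustrated with a minimal Python sketch. It is not the dissertation's method: it assumes a character trigram model with add-one smoothing, and the function names (`train_char_ngram_model`, `cross_entropy`, `predict_author`) and the sample snippets are hypothetical. Each candidate author's known code trains a model, and the disputed code is attributed to the author whose model gives the lowest cross-entropy in bits per character.

```python
import math
from collections import Counter


def train_char_ngram_model(text, n=3):
    """Count character n-grams and their (n-1)-character contexts."""
    ngrams = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    contexts = Counter(text[i:i + n - 1] for i in range(len(text) - n + 1))
    return ngrams, contexts, set(text)


def cross_entropy(model, text, n=3):
    """Average bits per n-gram of `text` under `model` (add-one smoothing)."""
    ngrams, contexts, vocab = model
    v = len(vocab) + 1  # extra slot so unseen characters keep non-zero probability
    total_bits = 0.0
    count = 0
    for i in range(len(text) - n + 1):
        gram = text[i:i + n]
        p = (ngrams[gram] + 1) / (contexts[gram[:-1]] + v)
        total_bits -= math.log2(p)
        count += 1
    return total_bits / count if count else float("inf")


def predict_author(samples_by_author, disputed, n=3):
    """Return the author whose model scores the disputed text lowest, plus all scores."""
    scores = {
        author: cross_entropy(train_char_ngram_model(sample, n), disputed, n)
        for author, sample in samples_by_author.items()
    }
    return min(scores, key=scores.get), scores


# Hypothetical samples: two authors with visibly different coding habits.
samples = {
    "alice": "for (int i = 0; i < n; i++) { total += values[i]; }\n" * 5,
    "bob": "i = 0\nwhile i < n:\n    total = total + values[i]\n    i = i + 1\n" * 5,
}
disputed = "for (int i = 0; i < k; i++) { total += items[i]; }\n"
best, scores = predict_author(samples, disputed)
print(best, scores)
```

A real study would need far larger samples per author and a held-out evaluation; the smoothing and n-gram order here are illustrative choices only.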
2

Joan Hambidge se idiolek oor die grense van genres : 'n korpuslinguistiese ondersoek [Joan Hambidge's idiolect across the boundaries of genres: a corpus-linguistic investigation] / Mariska Nel

Nel, Mariska January 2014
Idiolect refers to an individual’s unique use of language; the author of a text can therefore be identified by his or her use of language. This study focuses on Joan Hambidge’s recognisable idiolect across the boundaries of genres. It is expected that Hambidge will have a unique and recognisable idiolect, regardless of the genre in which she writes. Forensic linguistic principles, methods and applications have shown that it is possible to determine an individual’s idiolect. Even though forensic principles are specifically focused on identifying an author, the methodology of that field can be applied in a corpus linguistic study to determine how clearly an individual’s idiolect features across the boundaries of genres. The necessary literary background was created by researching the subject, explaining her oeuvre, and discussing the literary approaches that Hambidge uses in her respective genres and what she writes about; this contributes to a complete image of Hambidge and her influences and makes it possible to determine which external factors influence her idiolect. Linguistic research was done to determine the origin and background of sociolinguistics, as well as the factors that can influence an individual’s idiolect. The background of forensic linguistics was provided, as well as the various corpus linguistic methods that can be used in a study such as this one. The empirical analysis was then carried out: both stylistic and stylometric analyses were performed using inter- and intra-corpus linguistic research, and Hambidge’s idiolect was identified on that basis.
To identify Hambidge’s idiolect, the Taalkommissie corpus was used as a reference corpus to determine whether the idiosyncratic characteristics found in the Hambidge corpus are truly unique or can also be found in the Taalkommissie corpus. Applying these methods made it possible to determine to what extent, if at all, Hambidge has a unique idiolect, and how this idiolect features across the boundaries of genres. The research determined that Joan Hambidge has a unique idiolect and that it is especially clear when her corpus is examined in its entirety. When Hambidge’s separate genres were compared with each other, it was clear that genre influences idiolect, but also that Hambidge did not follow the prescribed genre conventions. Even though the two novels that were compared did not match as expected, the various other genres did agree. Various categories were identified, from which it is clear that distinguishing characteristics can be found in Hambidge’s corpus. It can therefore be said without a doubt that Hambidge has a unique idiolect across the boundaries of genres. / MA (Afrikaans and Dutch), North-West University, Potchefstroom Campus, 2014
4

Investigating the use of forensic stylistic and stylometric techniques in the analyses of authorship on a publicly accessible social networking site (Facebook)

Michell, Colin Simon 2013 July 1900
This research study examines the forensic application of a selection of stylistic and stylometric techniques in a simulated authorship attribution case involving texts on the social networking site Facebook. Eight participants each submitted 2,000 words of self-authored text from their personal Facebook messages, and one of them submitted an extra 2,000 words to act as the ‘disputed text’. The texts were analysed first at the 1,000-word level (the first 1,000 words received) and then at the 2,000-word level, to determine what effect text length has on the effectiveness of the chosen style markers (keywords, function words, most frequently occurring words, punctuation, use of digitally mediated communication features and spelling). Although the author of the disputed text was accurately identified at the 1,000-word level, the results were not entirely conclusive; at the 2,000-word level the results were more promising, with certain style markers being particularly effective. / Linguistics / MA (Linguistics)
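One of the style markers named above, function-word frequency, can be sketched in Python to show how a comparison at two text lengths might look. This is an illustrative assumption, not the study's procedure: the function-word list, the distance measure (sum of absolute differences in relative frequency), and the names `function_word_profile`, `profile_distance` and `rank_candidates` are all hypothetical, and the sample texts are invented.

```python
import re
from collections import Counter

# Illustrative list only; the abstract does not state which function words were used.
FUNCTION_WORDS = ("the", "and", "to", "of", "a", "in", "i", "it", "is", "that")


def function_word_profile(text, max_words):
    """Relative frequency of each function word in the first `max_words` words."""
    words = re.findall(r"[a-z']+", text.lower())[:max_words]
    counts = Counter(words)
    total = len(words) or 1
    return [counts[w] / total for w in FUNCTION_WORDS]


def profile_distance(p, q):
    """Sum of absolute differences between two frequency profiles."""
    return sum(abs(a - b) for a, b in zip(p, q))


def rank_candidates(known_texts, disputed, max_words):
    """Candidates sorted from most to least similar to the disputed text."""
    target = function_word_profile(disputed, max_words)
    distances = {
        name: profile_distance(function_word_profile(text, max_words), target)
        for name, text in known_texts.items()
    }
    return sorted(distances.items(), key=lambda item: item[1])


# Invented texts standing in for participants' messages.
known = {
    "p1": "i think that it is the best " * 400,
    "p2": "we went to a shop in town and bought lots of bread " * 400,
}
disputed_text = "i know that it is the worst " * 400

for length in (1000, 2000):
    print(length, rank_candidates(known, disputed_text, length))
```

Running the ranking at both lengths mirrors the 1,000-word and 2,000-word comparison described in the abstract, although a single marker and distance measure like this would be only one part of such an analysis.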
5

Investigating the use of forensic stylistic and stylometric techniques in the analyses of authorship on a publicly accessible social networking site (Facebook)

Michell, Colin Simon 07 1900
This research study examines the forensic application of a selection of stylistic and stylometric techniques in a simulated authorship attribution case involving texts on the social networking site Facebook. Eight participants each submitted 2,000 words of self-authored text from their personal Facebook messages, and one of them submitted an extra 2,000 words to act as the ‘disputed text’. The texts were analysed first at the 1,000-word level (the first 1,000 words received) and then at the 2,000-word level, to determine what effect text length has on the effectiveness of the chosen style markers (keywords, function words, most frequently occurring words, punctuation, use of digitally mediated communication features and spelling). Although the author of the disputed text was accurately identified at the 1,000-word level, the results were not entirely conclusive; at the 2,000-word level the results were more promising, with certain style markers being particularly effective. / Linguistics and Modern Languages / M.A. (Linguistics)
