Return to search

The Game Walkthrough Corpus (GWTC)

We present the Game Walkthrough Corpus (GWTC), which contains 12,295 unique
walkthrough documents covering 6,117 games. For each game walkthrough, we
provide frequencies of unigrams and bigrams, treating the walkthrough document
as a Bag of Words. In addition, we provide word frequencies at the sentence level.
Furthermore, the GWTC contains a number of game-related metadata, including title, publisher, developer, year, and genre. All the language statistics and metadata are stored in separate plain text files and can be referenced through uniform resource names (URN). These URNs can also be used to derive any combination of statistics and metadata. Researchers, for instance, can investigate the most frequent unigrams for games in the “Adventure” genre. This way, the GWTC can be reused for different kinds of research questions on gaming language.

Identiferoai:union.ndltd.org:DRESDEN/oai:qucosa:de:qucosa:91724
Date30 May 2024
CreatorsBurghardt, Manuel, Tiepmar, Jochen
PublisherUbiquity Press
Source SetsHochschulschriftenserver (HSSS) der SLUB Dresden
LanguageEnglish
Detected LanguageEnglish
Typeinfo:eu-repo/semantics/publishedVersion, doc-type:article, info:eu-repo/semantics/article, doc-type:Text
Rightsinfo:eu-repo/semantics/openAccess
Relation2059-481X

Page generated in 0.0021 seconds