Yes / The analysis of potentially large volumes of crowd-sourced and social media data is central to meeting the requirements of the ATHENA project. Here, we discuss the various stages of the pipeline process we have developed, including acquisition of the data, analysis, aggregation, filtering, and structuring. We highlight the challenges involved when working with unstructured, noisy data from sources such as Twitter, and describe the crisis taxonomies that have been developed to support the tasks and enable concept extraction. State-of-the-art techniques such as formal concept analysis and machine learning are used to create a range of capabilities including concept drill down, sentiment analysis, credibility assessment, and assignment of priority. We ground many of these techniques using results obtained from a set of tweets which emerged from the Colorado wildfires of 2012 in order to demonstrate the applicability of our work to real crisis scenarios.
Identifer | oai:union.ndltd.org:BRADFORD/oai:bradscholars.brad.ac.uk:10454/17662 |
Date | 28 February 2020 |
Creators | Andrews, S., Day, T., Domdouzis, K., Hirsch, L., Lefticaru, Raluca, Orphanides, C. |
Publisher | Springer International Publishing |
Source Sets | Bradford Scholars |
Language | English |
Detected Language | English |
Type | Book chapter, Accepted manuscript |
Rights | © 2017 Springer. Reproduced in accordance with the publisher's self-archiving policy. The final publication is available at Springer via https://doi.org/10.1007/978-3-319-52419-1_6, Unspecified |
Page generated in 0.0022 seconds