
A Statistical and Machine Learning Approach to Air Pollution Forecasts

In today’s world, where air pollution has become a ubiquitous problem, city air is routinely monitored. Such monitoring can produce large amounts of data, which enables the development of statistical and machine learning techniques for modeling and forecasting air quality. However, the complex nature of air pollution makes such data a challenge to fully utilize. To this end, machine learning methods, especially deep neural networks, have in recent years emerged as a promising technology for more accurate predictions of air pollution levels. The research problem in this work is to capture and model the complex dynamics of air pollution with machine learning methods, with an emphasis on deep neural networks. Connected to the research problem is the research question: How can machine learning, in particular deep neural networks, be used to forecast air pollution levels and pollution peaks? An emphasis is put on pollution peaks, as these are the episodes when existing forecasting models tend to give the largest prediction errors. In this work, historical data from air monitoring sensors were used to train several neural network architectures, as well as a simpler multiple linear regression model, to forecast background levels of nitrogen dioxide in the center of Stockholm. Several evaluation metrics showed that the neural network models outperformed the multiple linear regression model. However, none of the models had the desired structure of the forecast errors, and all models failed to capture sudden pollution peaks. Nevertheless, the results point to an advantage for the more complex neural network models. Further advances in the field of machine learning, together with higher-resolution data, have the potential to improve air quality forecasts further and to surpass conventional forecasting limits.

Identifier oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:su-219600
Date January 2022
Creators Carlén, Simon
Publisher Stockholms universitet, Institutionen för data- och systemvetenskap
Source Sets DiVA Archive at Uppsala University
Language English
Detected Language English
Type Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format application/pdf
Rights info:eu-repo/semantics/openAccess
