Real-time Audio Classification onan Edge Device : Using YAMNet and TensorFlow Lite

Edge computing is the idea of moving computations away from the cloud andinstead perform them at the edge of the network. The benefits of edge computing arereduced latency, increased integrity, and less strain on networks. Edge AI is the practiceof deploying machine learning algorithms to perform computations on the edge.In this project, a pre-trained model YAMNet is retrained and used to perform audioclassification in real-time to detect gunshots, glass shattering, and speech. The modelis deployed onto the edge device both as a full TensorFlow model and as TensorFlowLite models. Comparing results of accuracy, inference time, and memory allocationfor full TensorFlow and TensorFlow Lite models with and without optimization. Resultsfrom this research were that it was a valid option to use both TensorFlow andTensorFlow Lite but there was a lot of performance to gain by using TensorFlow Litewith little downside.

Identiferoai:union.ndltd.org:UPSALLA1/oai:DiVA.org:lnu-107633
Date January 2021
CreatorsMalmberg, Christoffer
PublisherLinnéuniversitetet, Institutionen för datavetenskap och medieteknik (DM)
Source SetsDiVA Archive at Upsalla University
LanguageEnglish
Detected LanguageEnglish
TypeStudent thesis, info:eu-repo/semantics/bachelorThesis, text
Formatapplication/pdf
Rightsinfo:eu-repo/semantics/openAccess

Page generated in 0.0017 seconds