Return to search

Ablation Study on Deeplabv3+ for Semantic Segmentation

Semantic segmentation is a fundamental task in computer vision that aims to classify every pixel in an image into different categories. Deep convolutional neural networks (CNNs) have achieved state-of-the-art results in semantic segmentation. Deeplabv3+ is a deep CNN-based model that uses atrous convolution and a decoder network to improve the accuracy of semantic segmentation. In this research, we conduct an ablation study on Deeplabv3+ to analyze the importance of its different components and their impact on the performance of the model, which provides valuable insights for developing more efficient and accurate semantic segmentation models. Our study encompasses a comprehensive examination of Deeplabv3+. We explore its constituent elements, including the backbone network, the Atrous Spatial Pyramid Pooling (ASPP) module, and the decoder network. Our investigation delves into the reasons underlying performance changes resulting from the removal of these architectural components. This analysis provides a deeper understanding of their intrinsic roles in shaping the model’s segmentation efficacy. Notably, we identify that the backbone exerts a substantial impact. Changes to other components yield relatively minor effects, while modifications to the backbone wield a remarkable influence. The Encoder-decoder structure also bears significant weight, playing a pivotal role in the upsampling process. This structure significantly impacts precision, enhancing boundary clarity and positional accuracy. Moreover, we recognize the vital role of feature integration. Features aid in establishing pixel position information, enhancing boundary definition, and positioning accuracy. Furthermore, the ASPP module emerges as a critical factor. ASPP leverages multi-scale information to differentiate complex object boundaries, further enriching the model’s semantic understanding.

Identiferoai:union.ndltd.org:UMASS/oai:scholarworks.umass.edu:masters_theses_2-2451
Date01 September 2023
CreatorsLei, Bowen
PublisherScholarWorks@UMass Amherst
Source SetsUniversity of Massachusetts, Amherst
Detected LanguageEnglish
Typetext
Formatapplication/pdf
SourceMasters Theses
Rightshttp://creativecommons.org/licenses/by/4.0/

Page generated in 0.0022 seconds