Global ETD Search

Return to search

Ablation Study on Deeplabv3+ for Semantic Segmentation

Semantic segmentation is a fundamental task in computer vision that aims to classify every pixel in an image into different categories. Deep convolutional neural networks (CNNs) have achieved state-of-the-art results in semantic segmentation. Deeplabv3+ is a deep CNN-based model that uses atrous convolution and a decoder network to improve the accuracy of semantic segmentation. In this research, we conduct an ablation study on Deeplabv3+ to analyze the importance of its different components and their impact on the performance of the model, which provides valuable insights for developing more efficient and accurate semantic segmentation models. Our study encompasses a comprehensive examination of Deeplabv3+. We explore its constituent elements, including the backbone network, the Atrous Spatial Pyramid Pooling (ASPP) module, and the decoder network. Our investigation delves into the reasons underlying performance changes resulting from the removal of these architectural components. This analysis provides a deeper understanding of their intrinsic roles in shaping the model’s segmentation efficacy. Notably, we identify that the backbone exerts a substantial impact. Changes to other components yield relatively minor effects, while modifications to the backbone wield a remarkable influence. The Encoder-decoder structure also bears significant weight, playing a pivotal role in the upsampling process. This structure significantly impacts precision, enhancing boundary clarity and positional accuracy. Moreover, we recognize the vital role of feature integration. Features aid in establishing pixel position information, enhancing boundary definition, and positioning accuracy. Furthermore, the ASPP module emerges as a critical factor. ASPP leverages multi-scale information to differentiate complex object boundaries, further enriching the model’s semantic understanding.

Electrical and Computer Engineering

Identifer	oai:union.ndltd.org:UMASS/oai:scholarworks.umass.edu:masters_theses_2-2451
Date	01 September 2023
Creators	Lei, Bowen
Publisher	ScholarWorks@UMass Amherst
Source Sets	University of Massachusetts, Amherst
Detected Language	English
Type	text
Format	application/pdf
Source	Masters Theses
Rights	http://creativecommons.org/licenses/by/4.0/

Page generated in 0.0022 seconds

Ablation Study on Deeplabv3+ for Semantic Segmentation

Description

Links & Downloads

Tags

Additional Fields