Global ETD Search

Return to search

Object Detection and Semantic Segmentation Using Self-Supervised Learning

In this thesis, three well known self-supervised methods have been implemented and trained on road scene images. The three so called pretext tasks RotNet, MoCov2, and DeepCluster were used to train a neural network self-supervised. The self-supervised trained networks where then evaluated on different amount of labeled data on two downstream tasks, object detection and semantic segmentation. The performance of the self-supervised methods are compared to networks trained from scratch on the respective downstream task. The results show that it is possible to achieve a performance increase using self-supervision on a dataset containing road scene images only. When only a small amount of labeled data is available, the performance increase can be substantial, e.g., a mIoU from 33 to 39 when training semantic segmentation on 1750 images with a RotNet pre-trained backbone compared to training from scratch. However, it seems that when a large amount of labeled images are available (>70000 images), the self-supervised pretraining does not increase the performance as much or at all.

http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-180815

Self-supervised learning

Computer vision

Identifer	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:liu-180815
Date	January 2021
Creators	Gustavsson, Simon
Publisher	Linköpings universitet, Datorseende
Source Sets	DiVA Archive at Upsalla University
Language	English
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/pdf
Rights	info:eu-repo/semantics/openAccess

Page generated in 0.0017 seconds

Object Detection and Semantic Segmentation Using Self-Supervised Learning

Description

Links & Downloads

Tags

Additional Fields