Global ETD Search

Return to search

Self-supervised Representation Learning in Computer Vision and Reinforcement Learning

This work is devoted to self-supervised representation learning (SSL). We consider both contrastive and non-contrastive methods and present a new loss function for SSL based on feature whitening. Our solution is conceptually simple and competitive with other methods. Self-supervised representations are beneficial for most areas of deep learning, and reinforcement learning is of particular interest because SSL can compensate for the sparsity of the training signal.
We present two methods from this area. The first tackles the partial observability providing the agent with a history, represented with temporal alignment, and improves performance in most Atari environments. The second addresses the exploration problem. The method employs a world model of the SSL latent space, and the prediction error of this model indicates novel states required to explore. It shows strong performance on exploration-hard benchmarks, especially on the notorious Montezuma's Revenge.
Finally, we consider the metric learning problem, which has much in common with SSL approaches. We present a new method based on hyperbolic embeddings, vision transformers and contrastive loss. We demonstrate the advantage of hyperbolic space over the widely used Euclidean space for metric learning. The method outperforms the current state-of-the-art by a significant margin.

Identifer	oai:union.ndltd.org:unitn.it/oai:iris.unitn.it:11572/360781
Date	06 December 2022
Creators	Ermolov, Aleksandr
Contributors	Ermolov, Aleksandr, Sebe, Niculae
Publisher	Università degli studi di Trento
Source Sets	Università di Trento
Language	English
Detected Language	English
Type	info:eu-repo/semantics/doctoralThesis
Rights	info:eu-repo/semantics/openAccess
Relation	firstpage:1, lastpage:160, numberofpages:160

Page generated in 0.002 seconds

Self-supervised Representation Learning in Computer Vision and Reinforcement Learning

Description

Links & Downloads

Tags

Additional Fields