Stochastic gradient descent (SGD) is arguably the most important algorithm used in optimization problems for large-scale machine learning. Its behaviour has been studied extensively from the viewpoint of mathematical analysis and probability theory; it is widely held that, in the limit where the learning rate tends to zero, a specific stochastic differential equation becomes an adequate model of the dynamics of the algorithm. This thesis presents some of the research in this field by analyzing the application of a recently proven theorem to the problem of tensor principal component analysis. The results, originally obtained by Gérard Ben Arous, Reza Gheissari and Aukosh Jagannath in a 2022 article, illustrate how the phase diagram of functions of SGD in the high-dimensional regime differs from that of the classical fixed-dimensional setting.
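As a rough illustration of the setting, the sketch below runs online SGD for the order-3 spiked tensor model: each step draws a fresh noisy sample Y = λ v^{⊗3} + W and takes a gradient step on the per-sample objective ⟨Y, x^{⊗3}⟩, retracted to the unit sphere. All names and parameter values (n, lam, delta, steps) are illustrative assumptions, not taken from the thesis or the Ben Arous, Gheissari and Jagannath article; the intent is only to show the shape of the algorithm whose small-learning-rate and high-dimensional behaviour the cited theory describes.

```python
import numpy as np

# Minimal sketch of online SGD for spiked tensor PCA (order p = 3).
# Assumptions: fresh sample each step, spherical retraction, gradient
# ascent on <Y, x^{(x)3}>. Parameter values are illustrative only.
rng = np.random.default_rng(0)

n, lam, delta, steps = 50, 8.0, 0.01, 2000   # dimension, signal strength, step size, iterations
v = rng.standard_normal(n); v /= np.linalg.norm(v)   # hidden unit signal vector
x = rng.standard_normal(n); x /= np.linalg.norm(x)   # random start on the unit sphere

for _ in range(steps):
    W = rng.standard_normal((n, n, n))               # fresh noise tensor for this sample
    Y = lam * np.einsum('i,j,k->ijk', v, v, v) + W   # spiked sample Y = lam * v^{(x)3} + W
    # Gradient of f(x) = sum_{ijk} Y_ijk x_i x_j x_k (Y is not symmetric,
    # so all three contractions are needed).
    grad = (np.einsum('ijk,j,k->i', Y, x, x)
            + np.einsum('ijk,i,k->j', Y, x, x)
            + np.einsum('ijk,i,j->k', Y, x, x))
    x = x + delta * grad                             # ascent step
    x /= np.linalg.norm(x)                           # retract to the sphere

# Overlap near 1 indicates recovery of the signal; in high dimension this
# only happens when lam is large enough relative to n, which is the kind of
# threshold the phase diagram in the abstract refers to.
print("overlap |<x, v>| =", abs(x @ v))
```

Whether the overlap grows or stagnates depends on how lam scales with n, so varying these two parameters in the sketch gives a crude picture of the phase transition discussed in the thesis.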
Identifier | oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:uu-508599 |
Date | January 2023 |
Creators | Leino, Martin |
Publisher | Uppsala universitet, Analys och partiella differentialekvationer |
Source Sets | DiVA Archive at Uppsala University |
Language | English |
Detected Language | English |
Type | Student thesis, info:eu-repo/semantics/bachelorThesis, text |
Format | application/pdf |
Rights | info:eu-repo/semantics/openAccess |
Relation | U.U.D.M. project report ; 2023:30 |