Return to search

A weight initialization method based on neural network with asymmetric activation function

Yes / Weight initialization of neural networks has an important influence on the learning process, and the selection of initial weights is related to the activation interval of the activation function. It is proposed that an improved and extended weight initialization method for neural network with asymmetric activation function as an extension of the linear interval tolerance method (LIT), called ‘GLIT’ (generalized LIT), which is more suitable for higher-dimensional inputs. The purpose is to expand the selection range of the activation function so that the input falls in the unsaturated region, so as to improve the performance of the network. Then, a tolerance solution theorem based upon neural network system is given and proved. Furthermore, the algorithm is given about determining the initial weight interval. The validity of the theorem
and algorithm is verified by numerical experiments. The input could fall into any preset interval in the sense of probability under the GLIT method. In another sense, the GLIT method could provide a theoretical basis for the further study of neural networks. / The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This paper is partly supported by National Science Foundation of China under Grants (62073226, 61603262), Liaoning Province Natural Science Foundation (2020-KF-11-09, 2021-KF-11-05), Shen-Fu Demonstration Zone Science and Technology Plan Project (2020JH13, 2021JH07), Central Government Guides Local Science and Technology Development Funds of Liaoning Province (2021JH6).

Identiferoai:union.ndltd.org:BRADFORD/oai:bradscholars.brad.ac.uk:10454/18746
Date14 February 2022
CreatorsLiu, J., Liu, Y., Zhang, Qichun
Source SetsBradford Scholars
LanguageEnglish
Detected LanguageEnglish
TypeArticle, Accepted manuscript
Rights© 2022 Elsevier. Reproduced in accordance with the publisher's self-archiving policy. This manuscript version is made available under the CC-BY-NC-ND 4.0 license.

Page generated in 0.0049 seconds