Global ETD Search

Return to search

The statistical mechanics of Bayesian model selection

In this thesis we examine the question of model selection in systems which learn input-output mappings from a data set of examples. The models we consider are inspired by feed-forward architectures used within the artificial neural networks community. The approach taken here is to elucidate the properties of various model selection criteria by calculation of relevant quantities derived in a Bayesian framework. These calculations make the assumption that examples are generated from some underlying rule or teacher by randomly sampling the input space and are performed using techniques borrowed from statistical mechanics. Such an approach allows for the comparison of different approaches on the basis of the resultant ability of the system to generalize to novel examples. Broadly stated, the model selection problem is the following. Given only a limited set of examples, which model, or student, should one choose from a set of candidates in order to achieve the highest level of generalization? We consider four model selection criteria. A penalty based method utilising a quantity derived from Bayesian statistics termed the evidence, and two methods based on estimates of the generalization performance namely, the test error and the cross validation error. The fourth method, less widely used, is based on the noise sensitivity of he models. In a simple scenario we demonstrate that model selection based on the evidence is susceptible to misspecification of the student. Our analysis is conducted in the thermodynamic limit where the system size is taken to be arbitrarily large. In particular we examine the evidence procedure assignments of the hyperparameters which control the learning algorithm. We find that, where the student is not sufficiently powerful to fully model the teacher, despite being sub-optimal this procedure is remarkably robust towards such misspecifications. In a scenario in which the student is more than able to represent the teacher we find the evidence procedure is optimal.

http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.657315

519

Identifer	oai:union.ndltd.org:bl.uk/oai:ethos.bl.uk:657315
Date	January 1996
Creators	Marion, Glenn
Publisher	University of Edinburgh
Source Sets	Ethos UK
Detected Language	English
Type	Electronic Thesis or Dissertation
Source	http://hdl.handle.net/1842/15264

Page generated in 0.0021 seconds

The statistical mechanics of Bayesian model selection

Description

Links & Downloads

Tags

Additional Fields