Spelling suggestions: "subject:"exponentialfamilie"" "subject:"exponentialfamilien""
1 |
Finding the Maximizers of the Information Divergence from an Exponential Family / Das Auffinden der Maximierer der Informationsdivergenz von einer ExponentialfamilieRauh, Johannes 19 October 2011 (has links) (PDF)
The subject of this thesis is the maximization of the information divergence from an exponential family on a finite set, a problem first formulated by Nihat Ay. A special case is the maximization of the mutual information or the multiinformation between different parts of a composite system.
My thesis contributes mainly to the mathematical aspects of the optimization problem. A reformulation is found that relates the maximization of the information divergence with the maximization of an entropic quantity, defined on the normal space of the exponential family. This reformulation simplifies calculations in concrete cases and gives theoretical insight about the general problem.
A second emphasis of the thesis is on examples that demonstrate how the theoretical results can be applied in particular cases. Third, my thesis contain first results on the characterization of exponential families with a small maximum value of the information divergence.
|
2 |
Finding the Maximizers of the Information Divergence from an Exponential Family: Finding the Maximizersof the Information Divergencefrom an Exponential FamilyRauh, Johannes 09 January 2011 (has links)
The subject of this thesis is the maximization of the information divergence from an exponential family on a finite set, a problem first formulated by Nihat Ay. A special case is the maximization of the mutual information or the multiinformation between different parts of a composite system.
My thesis contributes mainly to the mathematical aspects of the optimization problem. A reformulation is found that relates the maximization of the information divergence with the maximization of an entropic quantity, defined on the normal space of the exponential family. This reformulation simplifies calculations in concrete cases and gives theoretical insight about the general problem.
A second emphasis of the thesis is on examples that demonstrate how the theoretical results can be applied in particular cases. Third, my thesis contain first results on the characterization of exponential families with a small maximum value of the information divergence.:1. Introduction
2. Exponential families
2.1. Exponential families, the convex support and the moment map
2.2. The closure of an exponential family
2.3. Algebraic exponential families
2.4. Hierarchical models
3. Maximizing the information divergence from an exponential family
3.1. The directional derivatives of D(*|E )
3.2. Projection points and kernel distributions
3.3. The function DE
3.4. The first order optimality conditions of DE
3.5. The relation between D(*|E) and DE
3.6. Computing the critical points
3.7. Computing the projection points
4. Examples
4.1. Low-dimensional exponential families
4.1.1. Zero-dimensional exponential families
4.1.2. One-dimensional exponential families
4.1.3. One-dimensional exponential families on four states
4.1.4. Other low-dimensional exponential families
4.2. Partition models
4.3. Exponential families with max D(*|E ) = log(2)
4.4. Binary i.i.d. models and binomial models
5. Applications and Outlook
5.1. Principles of learning, complexity measures and constraints
5.2. Optimally approximating exponential families
5.3. Asymptotic behaviour of the empirical information divergence
A. Polytopes and oriented matroids
A.1. Polytopes
A.2. Oriented matroids
Bibliography
Index
Glossary of notations
|
3 |
On Boundaries of Statistical Models / Randeigenschaften statistischer ModelleKahle, Thomas 24 June 2010 (has links) (PDF)
In the thesis "On Boundaries of Statistical Models" problems related to a description of probability
distributions with zeros, lying in the boundary of a statistical model, are treated. The
distributions considered are joint distributions of finite collections of finite discrete random
variables. Owing to this restriction, statistical models are subsets of finite dimensional real
vector spaces. The support set problem for exponential families, the main class of models considered
in the thesis, is to characterize the possible supports of distributions in the boundaries of these
statistical models. It is shown that this problem is equivalent to a characterization of the face
lattice of a convex polytope, called the convex support. The main tool for treating questions
related to the boundary are implicit representations. Exponential families are shown to be sets of
solutions of binomial equations, connected to an underlying combinatorial structure, called oriented
matroid. Under an additional assumption these equations are polynomial and one is placed in the
setting of commutative algebra and algebraic geometry. In this case one recovers results from
algebraic statistics. The combinatorial theory of exponential families using oriented matroids makes
the established connection between an exponential family and its convex support completely natural:
Both are derived from the same oriented matroid.
The second part of the thesis deals with hierarchical models, which are a special class of
exponential families constructed from simplicial complexes. The main technical tool for their
treatment in this thesis are so called elementary circuits. After their introduction, they are used
to derive properties of the implicit representations of hierarchical models. Each elementary circuit
gives an equation holding on the hierarchical model, and these equations are shown to be the
"simplest", in the sense that the smallest degree among the equations corresponding to elementary
circuits gives a lower bound on the degree of all equations characterizing the model. Translating
this result back to polyhedral geometry yields a neighborliness property of marginal polytopes, the
convex supports of hierarchical models. Elementary circuits of small support are related to
independence statements holding between the random variables whose joint distributions the
hierarchical model describes. Models for which the complete set of circuits consists of elementary
circuits are shown to be described by totally unimodular matrices. The thesis also contains an
analysis of the case of binary random variables. In this special situation, marginal polytopes can
be represented as the convex hulls of linear codes. Among the results here is a classification of
full-dimensional linear code polytopes in terms of their subgroups.
If represented by polynomial equations, exponential families are the varieties of binomial prime
ideals. The third part of the thesis describes tools to treat models defined by not necessarily
prime binomial ideals. It follows from Eisenbud and Sturmfels' results on binomial ideals that these
models are unions of exponential families, and apart from solving the support set problem for each
of these, one is faced with finding the decomposition. The thesis discusses algorithms for
specialized treatment of binomial ideals, exploiting their combinatorial nature. The provided
software package Binomials.m2 is shown to be able to compute very large primary decompositions,
yielding a counterexample to a recent conjecture in algebraic statistics.
|
4 |
On Boundaries of Statistical ModelsKahle, Thomas 26 May 2010 (has links)
In the thesis "On Boundaries of Statistical Models" problems related to a description of probability
distributions with zeros, lying in the boundary of a statistical model, are treated. The
distributions considered are joint distributions of finite collections of finite discrete random
variables. Owing to this restriction, statistical models are subsets of finite dimensional real
vector spaces. The support set problem for exponential families, the main class of models considered
in the thesis, is to characterize the possible supports of distributions in the boundaries of these
statistical models. It is shown that this problem is equivalent to a characterization of the face
lattice of a convex polytope, called the convex support. The main tool for treating questions
related to the boundary are implicit representations. Exponential families are shown to be sets of
solutions of binomial equations, connected to an underlying combinatorial structure, called oriented
matroid. Under an additional assumption these equations are polynomial and one is placed in the
setting of commutative algebra and algebraic geometry. In this case one recovers results from
algebraic statistics. The combinatorial theory of exponential families using oriented matroids makes
the established connection between an exponential family and its convex support completely natural:
Both are derived from the same oriented matroid.
The second part of the thesis deals with hierarchical models, which are a special class of
exponential families constructed from simplicial complexes. The main technical tool for their
treatment in this thesis are so called elementary circuits. After their introduction, they are used
to derive properties of the implicit representations of hierarchical models. Each elementary circuit
gives an equation holding on the hierarchical model, and these equations are shown to be the
"simplest", in the sense that the smallest degree among the equations corresponding to elementary
circuits gives a lower bound on the degree of all equations characterizing the model. Translating
this result back to polyhedral geometry yields a neighborliness property of marginal polytopes, the
convex supports of hierarchical models. Elementary circuits of small support are related to
independence statements holding between the random variables whose joint distributions the
hierarchical model describes. Models for which the complete set of circuits consists of elementary
circuits are shown to be described by totally unimodular matrices. The thesis also contains an
analysis of the case of binary random variables. In this special situation, marginal polytopes can
be represented as the convex hulls of linear codes. Among the results here is a classification of
full-dimensional linear code polytopes in terms of their subgroups.
If represented by polynomial equations, exponential families are the varieties of binomial prime
ideals. The third part of the thesis describes tools to treat models defined by not necessarily
prime binomial ideals. It follows from Eisenbud and Sturmfels'' results on binomial ideals that these
models are unions of exponential families, and apart from solving the support set problem for each
of these, one is faced with finding the decomposition. The thesis discusses algorithms for
specialized treatment of binomial ideals, exploiting their combinatorial nature. The provided
software package Binomials.m2 is shown to be able to compute very large primary decompositions,
yielding a counterexample to a recent conjecture in algebraic statistics.
|
Page generated in 0.0554 seconds