Return to search

Algebraic dynamic programming over general data structures

Background: Dynamic programming algorithms provide exact solutions to many problems in computational biology, such as sequence alignment, RNA folding, hidden Markov models (HMMs), and scoring of phylogenetic trees. Structurally analogous algorithms compute optimal solutions, evaluate score distributions, and perform stochastic sampling. This is explained in the theory of Algebraic Dynamic Programming (ADP) by a strict separation of state space traversal (usually represented by a context free grammar), scoring (encoded as an algebra), and choice rule. A key ingredient in this theory is the use of yield parsers that operate on the ordered input data structure, usually strings or ordered trees. The computation of ensemble properties, such as a posteriori probabilities of HMMs or partition functions in RNA folding, requires the combination of two distinct, but intimately related algorithms, known as the inside and the outside recursion. Only the inside recursions are covered by the classical ADP theory. Results: The ideas of ADP are generalized to a much wider scope of data structures by relaxing the concept of parsing. This allows us to formalize the conceptual complementarity of inside and outside variables in a natural way. We demonstrate that outside recursions are generically derivable from inside decomposition schemes. In addition to rephrasing the well-known algorithms for HMMs, pairwise sequence alignment, and RNA folding we show how the TSP and the shortest Hamiltonian path problem can be implemented efficiently in the extended ADP framework. As a showcase application we investigate the ancient evolution of HOX gene clusters in terms of shortest Hamiltonian paths. Conclusions: The generalized ADP framework presented here greatly facilitates the development and implementation of dynamic programming algorithms for a wide spectrum of applications.

Identiferoai:union.ndltd.org:DRESDEN/oai:qucosa.de:bsz:15-qucosa-206280
Date29 June 2016
CreatorsHöner zu Siederdissen, Christian, Prohaska, Sonja J., Stadler, Peter F.
ContributorsUniversität Leipzig, Fakultät für Mathematik und Informatik, Universität Wien, Fakultät für Chemie, Universität Leipzig, Fakultät für Mathematik und Informatik, Universität Leipzig, Interdisziplinäres Zentrum für Bioinformatik, Max-Planck-Institut für Mathematik in den Naturwissenschaften,, Fraunhofer-Institut für Zelltherapie und Immunologie,, Center for non-coding RNA in Technology and Health,, Santa Fe Institute,, BioMed Central,
PublisherUniversitätsbibliothek Leipzig
Source SetsHochschulschriftenserver (HSSS) der SLUB Dresden
LanguageEnglish
Detected LanguageEnglish
Typedoc-type:article
Formatapplication/pdf
SourceBMC Bioinformatics 2015, 16(Suppl 19):S2 doi: 10.1186/1471-2105-16-S19-S2

Page generated in 0.0024 seconds