Global ETD Search

1	Bridges of Markov counting processes : reciprocal classes and duality formulas Conforti, Giovanni, Léonard, Christian, Murr, Rüdiger, Roelly, Sylvie January 2014 (has links) Processes having the same bridges are said to belong to the same reciprocal class. In this article we analyze reciprocal classes of Markov counting processes by identifying their reciprocal invariants and we characterize them as the set of counting processes satisfying some duality formula. counting process bridge reciprocal class duality formula Mathematics
2	A journey across football modelling with application to algorithmic trading Kharrat, Tarak January 2016 (has links) In this thesis we study the problem of forecasting the final score of a football match before the game kicks off (pre-match) and show how the derived models can be used to make profit in an algorithmic trading (betting) strategy. The thesis consists of two main parts. The first part discusses the database and a new class of counting processes. The second part describes the football forecasting models. The data part discusses the details of the design, specification and data collection of a comprehensive database containing extensive information on match results and events, players' skills and attributes and betting market prices. The database was created using state of the art web-scraping, text-processing and data-mining techniques. At the time of writing, we have collected data on all games played in the five major European leagues since the 2009-2010 season and on more than 7000 players. The statistical modelling part discusses forecasting models based on a new generation of counting process with flexible inter-arrival time distributions. Several different methods for fast computation of the associated probabilities are derived and compared. The proposed algorithms are implemented in a contributed R package Countr available from the Comprehensive R Archive Network. One of these flexible count distributions, the Weibull count distribution, was used to derive our first forecasting model. Its predictive ability is compared to the models previously suggested in the literature and tested in an algorithmic trading (betting) strategy. The model developed has been shown to perform rather well compared to its competitors. Our second forecasting model uses the same statistical distribution but models the attack and defence strengths of each team at the players level rather than at a team level, as is systematically done in the literature. For this model we make heavy use of the data on the players' attributes discussed in the data part of the thesis. Not only does this model turn out to have a higher predictive power but it also allows us to answer important questions about the 'nature of the game' such as the contribution of the full-backs to the attacking efforts or where would a new team finish in the Premier League. 510
3	NONPARAMETRIC INFERENCES FOR THE HAZARD FUNCTION WITH RIGHT TRUNCATION Akcin, Haci Mustafa 03 May 2013 (has links) Incompleteness is a major feature of time-to-event data. As one type of incompleteness, truncation refers to the unobservability of the time-to-event variable because it is smaller (or greater) than the truncation variable. A truncated sample always involves left and right truncation. Left truncation has been studied extensively while right truncation has not received the same level of attention. In one of the earliest studies on right truncation, Lagakos et al. (1988) proposed to transform a right truncated variable to a left truncated variable and then apply existing methods to the transformed variable. The reverse-time hazard function is introduced through transformation. However, this quantity does not have a natural interpretation. There exist gaps in the inferences for the regular forward-time hazard function with right truncated data. This dissertation discusses variance estimation of the cumulative hazard estimator, one-sample log-rank test, and comparison of hazard rate functions among finite independent samples under the context of right truncation. First, the relation between the reverse- and forward-time cumulative hazard functions is clarified. This relation leads to the nonparametric inference for the cumulative hazard function. Jiang (2010) recently conducted a research on this direction and proposed two variance estimators of the cumulative hazard estimator. Some revision to the variance estimators is suggested in this dissertation and evaluated in a Monte-Carlo study. Second, this dissertation studies the hypothesis testing for right truncated data. A series of tests is developed with the hazard rate function as the target quantity. A one-sample log-rank test is first discussed, followed by a family of weighted tests for comparison between finite $K$-samples. Particular weight functions lead to log-rank, Gehan, Tarone-Ware tests and these three tests are evaluated in a Monte-Carlo study. Finally, this dissertation studies the nonparametric inference for the hazard rate function for the right truncated data. The kernel smoothing technique is utilized in estimating the hazard rate function. A Monte-Carlo study investigates the uniform kernel smoothed estimator and its variance estimator. The uniform, Epanechnikov and biweight kernel estimators are implemented in the example of blood transfusion infected AIDS data. Right truncation Reverse-time hazard Kernel function Hypothesis testing Counting process
4	Reciprocal classes of Markov processes : an approach with duality formulae Murr, Rüdiger January 2012 (has links) In this work we are concerned with the characterization of certain classes of stochastic processes via duality formulae. First, we introduce a new formulation of a characterization of processes with independent increments, which is based on an integration by parts formula satisfied by infinitely divisible random vectors. Then we focus on the study of the reciprocal classes of Markov processes. These classes contain all stochastic processes having the same bridges, and thus similar dynamics, as a reference Markov process. We start with a resume of some existing results concerning the reciprocal classes of Brownian diffusions as solutions of duality formulae. As a new contribution, we show that the duality formula satisfied by elements of the reciprocal class of a Brownian diffusion has a physical interpretation as a stochastic Newton equation of motion. In the context of pure jump processes we derive the following new results. We will analyze the reciprocal classes of Markov counting processes and characterize them as a group of stochastic processes satisfying a duality formula. This result is applied to time-reversal of counting processes. We are able to extend some of these results to pure jump processes with different jump-sizes, in particular we are able to compare the reciprocal classes of Markov pure jump processes through a functional equation between the jump-intensities. Duality formula reciprocal class Levy process infinite divisibility counting process Malliavin calculus Mathematics
5	Nonparametric Inferences for the Hazard Function with Right Truncation Akcin, Haci Mustafa 03 May 2013 (has links) Incompleteness is a major feature of time-to-event data. As one type of incompleteness, truncation refers to the unobservability of the time-to-event variable because it is smaller (or greater) than the truncation variable. A truncated sample always involves left and right truncation. Left truncation has been studied extensively while right truncation has not received the same level of attention. In one of the earliest studies on right truncation, Lagakos et al. (1988) proposed to transform a right truncated variable to a left truncated variable and then apply existing methods to the transformed variable. The reverse-time hazard function is introduced through transformation. However, this quantity does not have a natural interpretation. There exist gaps in the inferences for the regular forward-time hazard function with right truncated data. This dissertation discusses variance estimation of the cumulative hazard estimator, one-sample log-rank test, and comparison of hazard rate functions among finite independent samples under the context of right truncation. First, the relation between the reverse- and forward-time cumulative hazard functions is clarified. This relation leads to the nonparametric inference for the cumulative hazard function. Jiang (2010) recently conducted a research on this direction and proposed two variance estimators of the cumulative hazard estimator. Some revision to the variance estimators is suggested in this dissertation and evaluated in a Monte-Carlo study. Second, this dissertation studies the hypothesis testing for right truncated data. A series of tests is developed with the hazard rate function as the target quantity. A one-sample log-rank test is first discussed, followed by a family of weighted tests for comparison between finite $K$-samples. Particular weight functions lead to log-rank, Gehan, Tarone-Ware tests and these three tests are evaluated in a Monte-Carlo study. Finally, this dissertation studies the nonparametric inference for the hazard rate function for the right truncated data. The kernel smoothing technique is utilized in estimating the hazard rate function. A Monte-Carlo study investigates the uniform kernel smoothed estimator and its variance estimator. The uniform, Epanechnikov and biweight kernel estimators are implemented in the example of blood transfusion infected AIDS data. Right truncation Reverse-time hazard Kernel function Hypothesis testing Counting process
6	Spline-based sieve semiparametric generalized estimating equation for panel count data Hua, Lei 01 May 2010 (has links) In this thesis, we propose to analyze panel count data using a spline-based sieve generalized estimating equation method with a semiparametric proportional mean model E(N(t)\|Z) = Λ0(t) eβT0Z. The natural log of the baseline mean function, logΛ0(t), is approximated by a monotone cubic B-spline function. The estimates of regression parameters and spline coefficients are the roots of the spline based sieve generalized estimating equations (sieve GEE). The proposed method avoids assumingany parametric structure of the baseline mean function and the underlying counting process. Selection of an appropriate covariance matrix that represents the true correlation between the cumulative counts improves estimating efficiency. In addition to the parameters existing in the proportional mean function, the estimation that accounts for the over-dispersion and autocorrelation involves an extra nuisance parameter σ2, which could be estimated using a method of moment proposed by Zeger (1988). The parameters in the mean function are then estimated by solving the pseudo generalized estimating equation with σ2 replaced by its estimate, σ2n. We show that the estimate of (β0,Λ0) based on this two-stage approach is still consistent and could converge at the optimal convergence rate in the nonparametric/semiparametric regression setting. The asymptotic normality of the estimate of β0 is also established. We further propose a spline-based projection variance estimating method and show its consistency. Simulation studies are conducted to investigate finite sample performance of the sieve semiparametric GEE estimates, as well as different variance estimating methods with different sample sizes. The covariance matrix that accounts for the overdispersion generally increases estimating efficiency when overdispersion is present in the data. Finally, the proposed method with different covariance matrices is applied to a real data from a bladder tumor clinical trial. Counting process Generalized Estimating Equation Monotone polynomial splines Over-dispersion Semiparametric model Biostatistics
7	Temporal Event Modeling of Social Harm with High Dimensional and Latent Covariates Liu, Xueying 08 1900 (has links) Indiana University-Purdue University Indianapolis (IUPUI) / The counting process is the fundamental of many real-world problems with event data. Poisson process, used as the background intensity of Hawkes process, is the most commonly used point process. The Hawkes process, a self-exciting point process fits to temporal event data, spatial-temporal event data, and event data with covariates. We study the Hawkes process that fits to heterogeneous drug overdose data via a novel semi-parametric approach. The counting process is also related to survival data based on the fact that they both study the occurrences of events over time. We fit a Cox model to temporal event data with a large corpus that is processed into high dimensional covariates. We study the significant features that influence the intensity of events. Temporal event sequence Hawkes process Counting Process Social harm Cox proportional hazard model Heterogeneous data
8	Reciprocal classes of Markov processes : an approach with duality formulae Murr, Rüdiger January 2012 (has links) This work is concerned with the characterization of certain classes of stochastic processes via duality formulae. In particular we consider reciprocal processes with jumps, a subject up to now neglected in the literature. In the first part we introduce a new formulation of a characterization of processes with independent increments. This characterization is based on a duality formula satisfied by processes with infinitely divisible increments, in particular Lévy processes, which is well known in Malliavin calculus. We obtain two new methods to prove this duality formula, which are not based on the chaos decomposition of the space of square-integrable function- als. One of these methods uses a formula of partial integration that characterizes infinitely divisible random vectors. In this context, our characterization is a generalization of Stein’s lemma for Gaussian random variables and Chen’s lemma for Poisson random variables. The generality of our approach permits us to derive a characterization of infinitely divisible random measures. The second part of this work focuses on the study of the reciprocal classes of Markov processes with and without jumps and their characterization. We start with a resume of already existing results concerning the reciprocal classes of Brownian diffusions as solutions of duality formulae. As a new contribution, we show that the duality formula satisfied by elements of the reciprocal class of a Brownian diffusion has a physical interpretation as a stochastic Newton equation of motion. Thus we are able to connect the results of characterizations via duality formulae with the theory of stochastic mechanics by our interpretation, and to stochastic optimal control theory by the mathematical approach. As an application we are able to prove an invariance property of the reciprocal class of a Brownian diffusion under time reversal. In the context of pure jump processes we derive the following new results. We describe the reciprocal classes of Markov counting processes, also called unit jump processes, and obtain a characterization of the associated reciprocal class via a duality formula. This formula contains as key terms a stochastic derivative, a compensated stochastic integral and an invariant of the reciprocal class. Moreover we present an interpretation of the characterization of a reciprocal class in the context of stochastic optimal control of unit jump processes. As a further application we show that the reciprocal class of a Markov counting process has an invariance property under time reversal. Some of these results are extendable to the setting of pure jump processes, that is, we admit different jump-sizes. In particular, we show that the reciprocal classes of Markov jump processes can be compared using reciprocal invariants. A characterization of the reciprocal class of compound Poisson processes via a duality formula is possible under the assumption that the jump-sizes of the process are incommensurable. / Diese Arbeit befasst sich mit der Charakterisierung von Klassen stochastischer Prozesse durch Dualitätsformeln. Es wird insbesondere der in der Literatur bisher unbehandelte Fall reziproker Klassen stochastischer Prozesse mit Sprungen untersucht. Im ersten Teil stellen wir eine neue Formulierung einer Charakterisierung von Prozessen mit unabhängigen Zuwächsen vor. Diese basiert auf der aus dem Malliavinkalkül bekannten Dualitätsformel für Prozesse mit unendlich oft teilbaren Zuwächsen. Wir präsentieren zusätzlich zwei neue Beweismethoden dieser Dualitätsformel, die nicht auf der Chaoszerlegung des Raumes quadratintegrabler Funktionale beruhen. Eine dieser Methoden basiert auf einer partiellen Integrationsformel fur unendlich oft teilbare Zufallsvektoren. In diesem Rahmen ist unsere Charakterisierung eine Verallgemeinerung des Lemma fur Gaußsche Zufallsvariablen von Stein und des Lemma fur Zufallsvariablen mit Poissonverteilung von Chen. Die Allgemeinheit dieser Methode erlaubt uns durch einen ähnlichen Zugang die Charakterisierung unendlich oft teilbarer Zufallsmaße. Im zweiten Teil der Arbeit konzentrieren wir uns auf die Charakterisierung reziproker Klassen ausgewählter Markovprozesse durch Dualitätsformeln. Wir beginnen mit einer Zusammenfassung bereits existierender Ergebnisse zu den reziproken Klassen Brownscher Bewegungen mit Drift. Es ist uns möglich die Charakterisierung solcher reziproken Klassen durch eine Dualitätsformel physikalisch umzudeuten in eine Newtonsche Gleichung. Damit gelingt uns ein Brückenschlag zwischen derartigen Charakterisierungsergebnissen und der Theorie stochastischer Mechanik durch den Interpretationsansatz, sowie der Theorie stochastischer optimaler Steuerung durch den mathematischen Ansatz. Unter Verwendung der Charakterisierung reziproker Klassen durch Dualitätsformeln beweisen wir weiterhin eine Invarianzeigenschaft der reziproken Klasse Browscher Bewegungen mit Drift unter Zeitumkehrung. Es gelingt uns weiterhin neue Resultate im Rahmen reiner Sprungprozesse zu beweisen. Wir beschreiben reziproke Klassen Markovscher Zählprozesse, d.h. Sprungprozesse mit Sprunghöhe eins, und erhalten eine Charakterisierung der reziproken Klasse vermöge einer Dualitätsformel. Diese beinhaltet als Schlüsselterme eine stochastische Ableitung nach den Sprungzeiten, ein kompensiertes stochastisches Integral und eine Invariante der reziproken Klasse. Wir präsentieren außerdem eine Interpretation der Charakterisierung einer reziproken Klasse im Rahmen der stochastischen Steuerungstheorie. Als weitere Anwendung beweisen wir eine Invarianzeigenschaft der reziproken Klasse Markovscher Zählprozesse unter Zeitumkehrung. Einige dieser Ergebnisse werden fur reine Sprungprozesse mit unterschiedlichen Sprunghöhen verallgemeinert. Insbesondere zeigen wir, dass die reziproken Klassen Markovscher Sprungprozesse vermöge reziproker Invarianten unterschieden werden können. Eine Charakterisierung der reziproken Klasse zusammengesetzter Poissonprozesse durch eine Dualitätsformel gelingt unter der Annahme inkommensurabler Sprunghöhen. unendliche Teilbarkeit Dualitätsformeln reziproke Klassen Zählprozesse stochastische Mechanik infinite divisibility duality formulae reciprocal class counting process stochastic mechanics Mathematics
9	Treatment Comparison in Biomedical Studies Using Survival Function Zhao, Meng 03 May 2011 (has links) In the dissertation, we study the statistical evaluation of treatment comparisons by evaluating the relative comparison of survival experiences between two treatment groups. We construct confidence interval and simultaneous confidence bands for the ratio and odds ratio of two survival functions through both parametric and nonparametric approaches.We first construct empirical likelihood confidence interval and simultaneous confidence bands for the odds ratio of two survival functions to address small sample efficacy and sufficiency. The empirical log-likelihood ratio is developed, and the corresponding asymptotic distribution is derived. Simulation studies show that the proposed empirical likelihood band has outperformed the normal approximation band in small sample size cases in the sense that it yields closer coverage probabilities to chosen nominal levels.Furthermore, in order to incorporate prognostic factors for the adjustment of survival functions in the comparison, we construct simultaneous confidence bands for the ratio and odds ratio of survival functions based on both the Cox model and the additive risk model. We develop simultaneous confidence bands by approximating the limiting distribution of cumulative hazard functions by zero-mean Gaussian processes whose distributions can be generated through Monte Carlo simulations. Simulation studies are conducted to evaluate the performance for proposed models. Real applications on published clinical trial data sets are also studied for further illustration purposes.In the end, the population attributable fraction function is studied to measure the impact of risk factors on disease incidence in the population. We develop semiparametric estimation of attributable fraction functions for cohort studies with potentially censored event time under the additive risk model. Additive risk model Attributive fraction function Censoring Confidence band Counting Process Cox Regression Empirical likelihood Martingale Odds ratio Ratio Right censored data Survival function Mathematics
10	Treatment Comparison in Biomedical Studies Using Survival Function Zhao, Meng 03 May 2011 (has links) In the dissertation, we study the statistical evaluation of treatment comparisons by evaluating the relative comparison of survival experiences between two treatment groups. We construct confidence interval and simultaneous confidence bands for the ratio and odds ratio of two survival functions through both parametric and nonparametric approaches.We first construct empirical likelihood confidence interval and simultaneous confidence bands for the odds ratio of two survival functions to address small sample efficacy and sufficiency. The empirical log-likelihood ratio is developed, and the corresponding asymptotic distribution is derived. Simulation studies show that the proposed empirical likelihood band has outperformed the normal approximation band in small sample size cases in the sense that it yields closer coverage probabilities to chosen nominal levels.Furthermore, in order to incorporate prognostic factors for the adjustment of survival functions in the comparison, we construct simultaneous confidence bands for the ratio and odds ratio of survival functions based on both the Cox model and the additive risk model. We develop simultaneous confidence bands by approximating the limiting distribution of cumulative hazard functions by zero-mean Gaussian processes whose distributions can be generated through Monte Carlo simulations. Simulation studies are conducted to evaluate the performance for proposed models. Real applications on published clinical trial data sets are also studied for further illustration purposes.In the end, the population attributable fraction function is studied to measure the impact of risk factors on disease incidence in the population. We develop semiparametric estimation of attributable fraction functions for cohort studies with potentially censored event time under the additive risk model. Additive risk model Attributive fraction function Censoring Confidence band Counting Process Cox Regression Empirical likelihood Martingale Odds ratio Ratio Right censored data Survival function

Search results