1 |
Nonparametric methods in financial time series analysisHong, Seok Young January 2018 (has links)
The fundamental objective of the analysis of financial time series is to unveil the random mechanism, i.e. the probability law, underlying financial data. The effort to identify the truth that governs the observations involves proposing and estimating reasonable statistical models that well explain the empirical features of data. This thesis develops some new nonparametric tools that can be exploited in this context; the efficacy and validity of their use are supported by computational advancements and surging availability of large/complex (`big') data sets. Chapter 1 investigates the conditional first moment properties of financial returns. We propose multivariate extensions of the popular Variance Ratio (VR) statistic, aiming to test linear predictability of returns and weak-form market efficiency. We construct asymptotic distribution theories for the statistics and scalar functions thereof under the null hypothesis of no predictability. The imposed assumptions are weaker than those widely adopted in the literature, and in our view more credible with regard to the underlying data generating process we expect for stock returns. It is also shown that the limit theories can be extended to the long horizon and large dimension cases, and also to allow for a time varying risk premium. Our methods are applied to CRSP weekly returns from 1962 to 2013; the joint tests of the multivariate hypothesis reject the null at the 1% level for all horizons considered. Chapter 2 is about nonparametric estimation of conditional moments. We propose a local constant type estimator that operates with an infinite number of conditioning variables; this enables a direct estimation of many objects of econometric interest that have dependence upon the infinite past. We show pointwise and uniform consistency of the estimator and establish its asymptotic nomality in various static and dynamic regressions context. The optimal rate of estimation turns out to be of logarithmic order, and the precise rate depends on the Lambert W function, the smoothness of the regression operator and the dependence of the data in a non-trivial way. The theories are applied to investigate the intertemporal risk-return relation for the aggregate stock market. We report an overall positive risk-return relation on the S&P 500 daily data from 1950-2017, and find evidence of strong time variation and counter-cyclical behaviour in risk aversion. Lastly, Chapter 3 concerns nonparametric volatility estimation with high frequency time series. While data observed at finer time scale than daily provide rich information, their distinctive empirical properties bring new challenges in their analysis. We propose a Fourier domain based estimator for multivariate ex-post volatility that is robust to two major hurdles in high frequency finance: asynchronicity in observations and the presence of microstructure noise. Asymptotic properties are derived under some mild conditions. Simulation studies show our method outperforms time domain estimators when two assets with different liquidity are traded asynchronously.
|
2 |
Ranking-Based Methods for Gene Selection in Microarray DataChen, Li 21 March 2006 (has links)
DNA microarrays have been used for the purpose of monitoring expression levels of thousands of genes simultaneously and identifying those genes that are differentially expressed. One of the major goals of microarray data analysis is the detection of differentially expressed genes across two kinds of tissue samples or samples obtained under two experimental conditions. A large number of gene detection methods have been developed and most of them are based on statistical analysis. However the statistical analysis methods have the limitations due to the small sample size and unknown distribution and error structure of microarray data. In this thesis, a study of ranking-based gene selection methods which have weak assumption about the data was done. Three approaches are proposed to integrate the individual ranks to select differentially expressed genes in microarray data. The experiments are implemented on the simulated and biological microarray data, and the results show that ranking-based methods outperform the t-test and SAM in selecting differentially expressed genes, especially when the sample size is small.
|
3 |
Vážená hloubka dat a diskriminace založená na hloubce dat / Weighted Data Depth and Depth Based DiscriminationVencálek, Ondřej January 2011 (has links)
The concept of data depth provides a powerful nonparametric tool for multivariate data analysis. We propose a generalization of the well-known halfspace depth called weighted data depth. The weighted data depth is not affine invariant in general, but it has some useful properties as possible nonconvex central areas. We further discuss application of data depth methodology to solve discrimination problem. Several classifiers based on data depth are reviewed and one new classifier is proposed. The new classifier is a modification of k-nearest- neighbour classifier. Classifiers are compared in a short simulation study. Advantage gained from use of the weighted data depth for discrimination purposes is shown.
|
4 |
Vážené poloprostorové hloubky a jejich vlastnosti / Weighted Halfspace Depths and Their PropertiesKotík, Lukáš January 2015 (has links)
Statistical depth functions became well known nonparametric tool of multivariate data analyses. The most known depth functions include the halfspace depth. Although the halfspace depth has many desirable properties, some of its properties may lead to biased and misleading results especially when data are not elliptically symmetric. The thesis introduces 2 new classes of the depth functions. Both classes generalize the halfspace depth. They keep some of its properties and since they more respect the geometric structure of data they usually lead to better results when we deal with non-elliptically symmetric, multimodal or mixed distributions. The idea presented in the thesis is based on replacing the indicator of a halfspace by more general weight function. This provides us with a continuum, especially if conic-section weight functions are used, between a local view of data (e.g. kernel density estimate) and a global view of data as is e.g. provided by the halfspace depth. The rate of localization is determined by the choice of the weight functions and theirs parameters. Properties including the uniform strong consistency of the proposed depth functions are proved in the thesis. Limit distribution is also discussed together with some other data depth related topics (regression depth, functional data depth)...
|
5 |
Specification Tests in Econometrics and Their Application / 計量経済学における特定化検定の理論とその応用Iwasawa, Masamune 23 March 2016 (has links)
京都大学 / 0048 / 新制・課程博士 / 博士(経済学) / 甲第19459号 / 経博第528号 / 新制||経||276(附属図書館) / 32495 / 京都大学大学院経済学研究科経済学専攻 / (主査)教授 西山 慶彦, 准教授 奥井 亮, 准教授 高野 久紀 / 学位規則第4条第1項該当 / Doctor of Economics / Kyoto University / DGAM
|
6 |
Bayesian Nonparametric Methods with Applications in Longitudinal, Heterogeneous and Spatiotemporal DataDuan, Li 19 October 2015 (has links)
No description available.
|
7 |
Efficient Bayesian Nonparametric Methods for Model-Free Reinforcement Learning in Centralized and Decentralized Sequential EnvironmentsLiu, Miao January 2014 (has links)
<p>As a growing number of agents are deployed in complex environments for scientific research and human well-being, there are increasing demands for designing efficient learning algorithms for these agents to improve their control polices. Such policies must account for uncertainties, including those caused by environmental stochasticity, sensor noise and communication restrictions. These challenges exist in missions such as planetary navigation, forest firefighting, and underwater exploration. Ideally, good control policies should allow the agents to deal with all the situations in an environment and enable them to accomplish their mission within the budgeted time and resources. However, a correct model of the environment is not typically available in advance, requiring the policy to be learned from data. Model-free reinforcement learning (RL) is a promising candidate for agents to learn control policies while engaged in complex tasks, because it allows the control policies to be learned directly from a subset of experiences and with time efficiency. Moreover, to ensure persistent performance improvement for RL, it is important that the control policies be concisely represented based on existing knowledge, and have the flexibility to accommodate new experience. Bayesian nonparametric methods (BNPMs) both allow the complexity of models to be adaptive to data, and provide a principled way for discovering and representing new knowledge.</p><p>In this thesis, we investigate approaches for RL in centralized and decentralized sequential decision-making problems using BNPMs. We show how the control policies can be learned efficiently under model-free RL schemes with BNPMs. Specifically, for centralized sequential decision-making, we study Q learning with Gaussian processes to solve Markov decision processes, and we also employ hierarchical Dirichlet processes as the prior for the control policy parameters to solve partially observable Markov decision processes. For decentralized partially observable Markov decision processes, we use stick-breaking processes as the prior for the controller of each agent. We develop efficient inference algorithms for learning the corresponding control policies. We demonstrate that by combining model-free RL and BNPMs with efficient algorithm design, we are able to scale up RL methods for complex problems that cannot be solved due to the lack of model knowledge. We adaptively learn control policies with concise structure and high value, from a relatively small amount of data.</p> / Dissertation
|
8 |
Modelos semiparamétricos com resposta binomial negativa / Semiparametric models with negative binomial responseOki, Fabio Hideto 14 May 2015 (has links)
O objetivo principal deste trabalho é discutir estimação e diagnóstico em modelos semiparamétricos com resposta binomial negativa, mais especificamente, modelos de regressão com resposta binomial negativa em que uma das variáveis explicativas contínuas é modelada de forma não paramétrica. Iniciamos o trabalho com um exemplo ilustrativo e fazemos uma breve revisão dos modelos paramétricos com resposta binomial negativa. Em seguida, introduzimos os modelos semiparamétricos com resposta binomial negativa e discutimos alguns aspectos de estimação, inferência e seleção de modelos. Dedicamos um capítulo a procedimentos de diagnóstico, tais como desenvolvimento de medidas de alavanca e de influência sob os aspectos de deleção de pontos e influência local, além de abordar a análise de resíduos. Reanalizamos o exemplo ilustrativo sob o enfoque semiparamétrico e apresentamos algumas conclusões. / The aim of this work is to discuss some aspects on estimation and diagnostics in negative binomial regression models which an explanatory continuous variable is modeled nonparametrically. First, an illustrative example is presented and analyzed under parametric negative binomial regression models. The proposed models are then introduced and some aspects on estimations, inference and model selection are presented. Particular emphasis is given on the development of diagnostic procedures, such as leverage measures, Cook distances, local influence approach and residuals. The motivated example is reanalyzed under the semiparametric viewpoint and some conclusions are given.
|
9 |
Modelos semiparamétricos com resposta binomial negativa / Semiparametric models with negative binomial responseFabio Hideto Oki 14 May 2015 (has links)
O objetivo principal deste trabalho é discutir estimação e diagnóstico em modelos semiparamétricos com resposta binomial negativa, mais especificamente, modelos de regressão com resposta binomial negativa em que uma das variáveis explicativas contínuas é modelada de forma não paramétrica. Iniciamos o trabalho com um exemplo ilustrativo e fazemos uma breve revisão dos modelos paramétricos com resposta binomial negativa. Em seguida, introduzimos os modelos semiparamétricos com resposta binomial negativa e discutimos alguns aspectos de estimação, inferência e seleção de modelos. Dedicamos um capítulo a procedimentos de diagnóstico, tais como desenvolvimento de medidas de alavanca e de influência sob os aspectos de deleção de pontos e influência local, além de abordar a análise de resíduos. Reanalizamos o exemplo ilustrativo sob o enfoque semiparamétrico e apresentamos algumas conclusões. / The aim of this work is to discuss some aspects on estimation and diagnostics in negative binomial regression models which an explanatory continuous variable is modeled nonparametrically. First, an illustrative example is presented and analyzed under parametric negative binomial regression models. The proposed models are then introduced and some aspects on estimations, inference and model selection are presented. Particular emphasis is given on the development of diagnostic procedures, such as leverage measures, Cook distances, local influence approach and residuals. The motivated example is reanalyzed under the semiparametric viewpoint and some conclusions are given.
|
10 |
Nonparametric Learning in High DimensionsLiu, Han 01 December 2010 (has links)
This thesis develops flexible and principled nonparametric learning algorithms to explore, understand, and predict high dimensional and complex datasets. Such data appear frequently in modern scientific domains and lead to numerous important applications. For example, exploring high dimensional functional magnetic resonance imaging data helps us to better understand brain functionalities; inferring large-scale gene regulatory network is crucial for new drug design and development; detecting anomalies in high dimensional transaction databases is vital for corporate and government security.
Our main results include a rigorous theoretical framework and efficient nonparametric learning algorithms that exploit hidden structures to overcome the curse of dimensionality when analyzing massive high dimensional datasets. These algorithms have strong theoretical guarantees and provide high dimensional nonparametric recipes for many important learning tasks, ranging from unsupervised exploratory data analysis to supervised predictive modeling. In this thesis, we address three aspects:
1 Understanding the statistical theories of high dimensional nonparametric inference, including risk, estimation, and model selection consistency;
2 Designing new methods for different data-analysis tasks, including regression, classification, density estimation, graphical model learning, multi-task learning, spatial-temporal adaptive learning;
3 Demonstrating the usefulness of these methods in scientific applications, including functional genomics, cognitive neuroscience, and meteorology.
In the last part of this thesis, we also present the future vision of high dimensional and large-scale nonparametric inference.
|
Page generated in 0.101 seconds