Global ETD Search

1	Modelling of Spatial Data Using Semivariograms of Stationary Spatial Processes / Erdvinių duomenų modeliavimas naudojant stacionarių erdvinių procesų semivariogramas Borisenko, Ingrida 03 March 2010 (has links) Spatial statistics is one of the youngest trends in the science of statistics. First, it has been applied in mining, during the fifth decade of the last century. In fifty years after this trend of science had been discovered, the circle of the scientists involved in it has grown drastically as well as areas of application. Also, a wide range of theoretical and practical material has been issued. Nowadays, spatial statistics methods are used in: ecology, quantity geology, image processing and analysis, epidemiology, studying global climate change and even cosmology. However, in Lithuania, the methodology of spatial data analysis has been studied only from the beginning of this Millennium. Since only few scientists (Dumbrauskas, A.; Kumetaitis, A.; Kumetaitienė, A. and others) are involved, it is very important to expand this area and develop the existing methods. Also it is essential to study the spatial dada modelling methods throughly and provide general spatial data modelling methodology. In order to apply the methods of spatial statistics, it is necessary to know the location of data in space, which is usually expressed in geographic coordinates. Thus, one of the main distinctions of spatial statistics which makes it different from the classical is the ability to model both spatial trend and spatial autocorrelation. One of the main objectives of spatial statistics is creating a mathematical model of spatial data, which can be used for interpolation (extrapolation) or for... [to full text] / Disertacijoje nagrinėjama erdvinių duomenų su stacionariomis klaidomis modeliavimo per semivariogramas ir tiesinio prognozavimo metodika. Erdvinių duomenų skiriamasis bruožas – jų išsidėstymas erdvėje, kuris dažniausiai aprašomas geografinėmis koordinatėmis. Tokių duomenų modeliavimas semivariogramomis, ir prognozavimas krigingu yra vienas iš svarbių geostatistikos mokslo uždavinių. Krigingas yra stochastinis prognozavimo metodas, kuris prie tam tikrų salygų pateikia geriausią tiesinę nepaslinktą prognozę. Krigingo rezultatų paklaidos priklauso nuo to kaip tiksliai erdvinių duomenų sklaida aprašoma kovariacine funkcija arba semivariograma. Darbe dėmesys skiriamas semivariogramoms, nes jos aprašo platesnę erdvinių procesų klasę. Pagrindinis disertacijos tikslas yra apibendrinti ir realizuoti vieningą erdvinių duomenų su stacionariomis klaidomis modeliavimo metodiką, pagrįstą semivariogramomis. Darbo objektai yra semivariogramos, jų modeliai, įvairūs erdvinių duomenų prognozavimo metodai bei erdvinių duomenų modeliavimo, prognozavimo etapai. Šių objektų analizė bei interpretacija prie tam tikrų sąlygų leidžia gauti geriausius erdvinių duomenų modeliavimo bei prognozavimo rezultatus. Taip pat disertaciniame darbe empiriniam Materon‘o semivariogramų įvertiniui MoM pateikta dispersijų-kovariacijų matricos išraiška per teorines semivariogramas stacionaraus Gauso duomenų modelio atvejui. Tiriami erdvinių duomenų vidurkio modelio parametrų bei semivariogramų vertinimo metodai... [toliau žr. visą tekstą] Mathematics Stationary spatial process Semivariograms Asymptotic Kriging Stacionarus erdvinis procesas Semivariogramos Asimptotika Krigingas
2	Sur l'estimation non paramétrique des modèles conditionnels pour variables fonctionnelles spatialement dépendantes / On the nonparametric estimation of certain conditional models in functional spatial data Kaid, Zoulikha 09 December 2012 (has links) Dans cette thèse, nous nous intéressons au problème de la prévision spatiale en considérant des modèles non paramétriques conditionnels dont la variable explicative est fonctionnelle. Plus précisément, les points étudiés pour décrire la co-variation spatiale entre une variable réponse réelle et une variable fonctionnelle sont le mode conditionnel et les quantiles conditionnels.En ce qui concerne le mode conditionnel, nous établissons la convergence presque complète, la convergence en norme Lp et la normalité asymptotique d'un estimateur à noyau. Ces propriétés asymptotiques sont obtenues sous des conditions assez générales telles, l'hypothèse de mélange forte et l'hypothèse de concentration de la mesure de probabilité de la variable explicative fonctionnelle. L'implémentation de l'estimateur construit en pratique est illustrée par une application sur des données météorologiques.Le modèle des quantiles conditionnels est abordé dans la deuxième partie de la thèse. Il est traité comme fonction inverse de la fonction de répartition conditionnelle qui est estimée par un estimateur à double noyaux. Sous les mêmes conditions que celles du modèle précédent, nous donnons l'expression de la vitesse de convergence en norme Lp et nous démontrons la normalité asymptotique de l'estimateur construit.Notre étude généralise au cas spatial de nombreux résultats déjà existant en série chronologique fonctionnelle. De plus, l'estimation de nos modèles repose sur une estimation préalable de la densité et de la fonction de répartition conditionnelles et permet de construire des régions prédictives, montrant ainsi l'apport de ce genre de modèles par rapport à la régression classique. / The main purpose of this thesis concerns the problem of spatial prediction using some nonparametric conditional models where the covariate variable is a functional one. More precisely, we treat the nonparametric estimation of the conditional mode and that of the conditional quantiles as spatial prediction tools alternative to the classical spatial regression of real response variable given a functional variable.Concerning the first model, that is the conditional mode, it is estimated by maximizing the spatial version of the kernel estimate of the conditional density. Under a general mixing condition and the concentration properties of the probability measure of the functional variable, we establish the almost complete convergence (with rate), the Lp consistency (with rate) and the asymptotic normality of the considered estimator. The usefulness of this estimation is illustrated by an application on real meteorological data.The model of the conditional quantiles is considered in the second part of this thesis and is treated as the inverse function of the conditional cumulative distribution function which is estimated by a double kernel estimator. Under the same general conditions as in the first model, we give the convergence rate in the Lp- norm and we show the asymptotic normality of the constructed estimator. These asymptotic results are closely related to the concentration properties on small balls of the probability measure of the underlying explanatory variable and the regularity of the conditional cumulative distribution function.Our study generalizes to spatial case some existing results in functional times series case. Finally, we highlight what our models brings compared to classical regression, discussing the use of our results as preliminary works to construct predictive regions. Processus spatial Mode conditionnel Non-paramétrique Spatial process Conditional mode estimate Nonparametric
3	Dimension Reduction and Variable Selection Moradi Rekabdarkolaee, Hossein 01 January 2016 (has links) High-dimensional data are becoming increasingly available as data collection technology advances. Over the last decade, significant developments have been taking place in high-dimensional data analysis, driven primarily by a wide range of applications in many fields such as genomics, signal processing, and environmental studies. Statistical techniques such as dimension reduction and variable selection play important roles in high dimensional data analysis. Sufficient dimension reduction provides a way to find the reduced space of the original space without a parametric model. This method has been widely applied in many scientific fields such as genetics, brain imaging analysis, econometrics, environmental sciences, etc. in recent years. In this dissertation, we worked on three projects. The first one combines local modal regression and Minimum Average Variance Estimation (MAVE) to introduce a robust dimension reduction approach. In addition to being robust to outliers or heavy-tailed distribution, our proposed method has the same convergence rate as the original MAVE. Furthermore, we combine local modal base MAVE with a $L_1$ penalty to select informative covariates in a regression setting. This new approach can exhaustively estimate directions in the regression mean function and select informative covariates simultaneously, while being robust to the existence of possible outliers in the dependent variable. The second project develops sparse adaptive MAVE (saMAVE). SaMAVE has advantages over adaptive LASSO because it extends adaptive LASSO to multi-dimensional and nonlinear settings, without any model assumption, and has advantages over sparse inverse dimension reduction methods in that it does not require any particular probability distribution on \textbf{X}. In addition, saMAVE can exhaustively estimate the dimensions in the conditional mean function. The third project extends the envelope method to multivariate spatial data. The envelope technique is a new version of the classical multivariate linear model. The estimator from envelope asymptotically has less variation compare to the Maximum Likelihood Estimator (MLE). The current envelope methodology is for independent observations. While the assumption of independence is convenient, this does not address the additional complication associated with a spatial correlation. This work extends the idea of the envelope method to cases where independence is an unreasonable assumption, specifically multivariate data from spatially correlated process. This novel approach provides estimates for the parameters of interest with smaller variance compared to maximum likelihood estimator while still being able to capture the spatial structure in the data. Envelope Local Modal MAVE Robust Estimation Shrinkage Estimation Spatial Process Sufficient Dimension Reduction Variable Selection. Applied Statistics Multivariate Analysis Statistical Methodology Statistical Models Statistical Theory
4	Spatial sampling and prediction Schelin, Lina January 2012 (has links) This thesis discusses two aspects of spatial statistics: sampling and prediction. In spatial statistics, we observe some phenomena in space. Space is typically of two or three dimensions, but can be of higher dimension. Questions in mind could be; What is the total amount of gold in a gold-mine? How much precipitation could we expect in a specific unobserved location? What is the total tree volume in a forest area? In spatial sampling the aim is to estimate global quantities, such as population totals, based on samples of locations (papers III and IV). In spatial prediction the aim is to estimate local quantities, such as the value at a single unobserved location, with a measure of uncertainty (papers I, II and V). In papers III and IV, we propose sampling designs for selecting representative probability samples in presence of auxiliary variables. If the phenomena under study have clear trends in the auxiliary space, estimation of population quantities can be improved by using representative samples. Such samples also enable estimation of population quantities in subspaces and are especially needed for multi-purpose surveys, when several target variables are of interest. In papers I and II, the objective is to construct valid prediction intervals for the value at a new location, given observed data. Prediction intervals typically rely on the kriging predictor having a Gaussian distribution. In paper I, we show that the distribution of the kriging predictor can be far from Gaussian, even asymptotically. This motivated us to propose a semiparametric method that does not require distributional assumptions. Prediction intervals are constructed from the plug-in ordinary kriging predictor. In paper V, we consider prediction in the presence of left-censoring, where observations falling below a minimum detection limit are not fully recorded. We review existing methods and propose a semi-naive method. The semi-naive method is compared to one model-based method and two naive methods, all based on variants of the kriging predictor. Auxiliary variables Censoring Inclusion probabilities Kriging Local pivotal method Minimum detection limit Prediction intervals Representative sample Spatial process Spatial sampling Semiparametric bootstrap
5	Contribution à la statistique spatiale et l'analyse de données fonctionnelles / Contribution to spatial statistics and functional data analysis Ahmed, Mohamed Salem 12 December 2017 (has links) Ce mémoire de thèse porte sur la statistique inférentielle des données spatiales et/ou fonctionnelles. En effet, nous nous sommes intéressés à l’estimation de paramètres inconnus de certains modèles à partir d’échantillons obtenus par un processus d’échantillonnage aléatoire ou non (stratifié), composés de variables indépendantes ou spatialement dépendantes.La spécificité des méthodes proposées réside dans le fait qu’elles tiennent compte de la nature de l’échantillon étudié (échantillon stratifié ou composé de données spatiales dépendantes).Tout d’abord, nous étudions des données à valeurs dans un espace de dimension infinie ou dites ”données fonctionnelles”. Dans un premier temps, nous étudions les modèles de choix binaires fonctionnels dans un contexte d’échantillonnage par stratification endogène (échantillonnage Cas-Témoin ou échantillonnage basé sur le choix). La spécificité de cette étude réside sur le fait que la méthode proposée prend en considération le schéma d’échantillonnage. Nous décrivons une fonction de vraisemblance conditionnelle sous l’échantillonnage considérée et une stratégie de réduction de dimension afin d’introduire une estimation du modèle par vraisemblance conditionnelle. Nous étudions les propriétés asymptotiques des estimateurs proposées ainsi que leurs applications à des données simulées et réelles. Nous nous sommes ensuite intéressés à un modèle linéaire fonctionnel spatial auto-régressif. La particularité du modèle réside dans la nature fonctionnelle de la variable explicative et la structure de la dépendance spatiale des variables de l’échantillon considéré. La procédure d’estimation que nous proposons consiste à réduire la dimension infinie de la variable explicative fonctionnelle et à maximiser une quasi-vraisemblance associée au modèle. Nous établissons la consistance, la normalité asymptotique et les performances numériques des estimateurs proposés.Dans la deuxième partie du mémoire, nous abordons des problèmes de régression et prédiction de variables dépendantes à valeurs réelles. Nous commençons par généraliser la méthode de k-plus proches voisins (k-nearest neighbors; k-NN) afin de prédire un processus spatial en des sites non-observés, en présence de co-variables spatiaux. La spécificité du prédicteur proposé est qu’il tient compte d’une hétérogénéité au niveau de la co-variable utilisée. Nous établissons la convergence presque complète avec vitesse du prédicteur et donnons des résultats numériques à l’aide de données simulées et environnementales.Nous généralisons ensuite le modèle probit partiellement linéaire pour données indépendantes à des données spatiales. Nous utilisons un processus spatial linéaire pour modéliser les perturbations du processus considéré, permettant ainsi plus de flexibilité et d’englober plusieurs types de dépendances spatiales. Nous proposons une approche d’estimation semi paramétrique basée sur une vraisemblance pondérée et la méthode des moments généralisées et en étudions les propriétés asymptotiques et performances numériques. Une étude sur la détection des facteurs de risque de cancer VADS (voies aéro-digestives supérieures)dans la région Nord de France à l’aide de modèles spatiaux à choix binaire termine notre contribution. / This thesis is about statistical inference for spatial and/or functional data. Indeed, weare interested in estimation of unknown parameters of some models from random or nonrandom(stratified) samples composed of independent or spatially dependent variables.The specificity of the proposed methods lies in the fact that they take into considerationthe considered sample nature (stratified or spatial sample).We begin by studying data valued in a space of infinite dimension or so-called ”functionaldata”. First, we study a functional binary choice model explored in a case-controlor choice-based sample design context. The specificity of this study is that the proposedmethod takes into account the sampling scheme. We describe a conditional likelihoodfunction under the sampling distribution and a reduction of dimension strategy to definea feasible conditional maximum likelihood estimator of the model. Asymptotic propertiesof the proposed estimates as well as their application to simulated and real data are given.Secondly, we explore a functional linear autoregressive spatial model whose particularityis on the functional nature of the explanatory variable and the structure of the spatialdependence. The estimation procedure consists of reducing the infinite dimension of thefunctional variable and maximizing a quasi-likelihood function. We establish the consistencyand asymptotic normality of the estimator. The usefulness of the methodology isillustrated via simulations and an application to some real data.In the second part of the thesis, we address some estimation and prediction problemsof real random spatial variables. We start by generalizing the k-nearest neighbors method,namely k-NN, to predict a spatial process at non-observed locations using some covariates.The specificity of the proposed k-NN predictor lies in the fact that it is flexible and allowsa number of heterogeneity in the covariate. We establish the almost complete convergencewith rates of the spatial predictor whose performance is ensured by an application oversimulated and environmental data. In addition, we generalize the partially linear probitmodel of independent data to the spatial case. We use a linear process for disturbancesallowing various spatial dependencies and propose a semiparametric estimation approachbased on weighted likelihood and generalized method of moments methods. We establishthe consistency and asymptotic distribution of the proposed estimators and investigate thefinite sample performance of the estimators on simulated data. We end by an applicationof spatial binary choice models to identify UADT (Upper aerodigestive tract) cancer riskfactors in the north region of France which displays the highest rates of such cancerincidence and mortality of the country. Modèle à choix binaire Analyses de données fonctionnelles ´Echantillonnage basé sur le choix ´Echantillonnage Cas-Témoin Modèle linéaire fonctionnel Processus auto-régressif spatial Quasi-maximum de vraisemblance Statistique Non-paramétrique Régression, Prédiction K-plus proches voisins Estimateur à Noyau Processus spatial Econométrie spatiale Estimation Semi-paramétrique Méthodes des moments généralisées Binary choice model Functional data analysis Choice-based sampling Case-control Functional Linear Model Spatial Autoregressive Process Quasi-maximum likelihood estimator Nonparametric statistics Regression Prediction K-nearest neighbors Kernel estimate Spatial process Spatial econometrics Semi-parametric estimation Generalized method of moments

1

Page generated in 0.0833 seconds