Spelling suggestions: "subject:"local linear"" "subject:"focal linear""
1 |
Methods for Quantitatively Describing Tree Crown Profiles of Loblolly pine (<I>Pinus taeda</I> L.)Doruska, Paul F. 17 July 1998 (has links)
Physiological process models, productivity studies, and wildlife abundance studies all require accurate representations of tree crowns. In the past, geometric shapes or flexible mathematical equations approximating geometric shapes were used to represent crown profiles. Crown profile of loblolly pine (<I>Pinus taeda</I> L.) was described using single-regressor, nonparametric regression analysis in an effort to improve crown representations. The resulting profiles were compared to more traditional representations. Nonparametric regression may be applicable when an underlying parametric model cannot be identified. The modeler does not specify a functional form. Rather, a data-driven technique is used to determine the shape a curve. The modeler determines the amount of local curvature to be depicted in the curve. A class of local-polynomial estimators which contains the popular kernel estimator as a special case was investigated. Kernel regression appears to fit closely to the interior data points but often possesses bias problems at the boundaries of the data, a feature less exhibited by local linear or local quadratic regression. When using nonparametric regression, decisions must be made regarding polynomial order and bandwidth. Such decisions depend on the presence of local curvature, desired degree of smoothing, and, for bandwidth in particular, the minimization of some global error criterion. In the present study, a penalized PRESS criterion (PRESS*) was selected as the global error criterion. When individual- tree, crown profile data are available, the technique of nonparametric regression appears capable of capturing more of the tree to tree variation in crown shape than multiple linear regression and other published functional forms. Thus, modelers should consider the use of nonparametric regression when describing crown profiles as well as in any regression situation where traditional techniques perform unsatisfactorily or fail. / Ph. D.
|
2 |
Asymptotic properties of Non-parametric Regression with Beta KernelsNatarajan, Balasubramaniam January 1900 (has links)
Doctor of Philosophy / Department of Statistics / Weixing Song / Kernel based non-parametric regression is a popular statistical tool to identify the relationship between response and predictor variables when standard parametric regression models are not appropriate. The efficacy of kernel based methods depend both on the kernel choice and the smoothing parameter. With insufficient smoothing, the resulting regression estimate is too rough and with excessive smoothing, important features of the underlying relationship is lost. While the choice of the kernel has been shown to have less of an effect on the quality of regression estimate, it is important to choose kernels to best match the support set of the underlying predictor variables. In the past few decades, there have been multiple efforts to quantify the properties of asymmetric kernel density and regression estimators. Unlike classic symmetric kernel based estimators, asymmetric kernels do not suffer from boundary problems. For example, Beta kernel estimates are especially suitable for investigating the distribution structure of predictor variables with compact support. In this dissertation, two types of Beta kernel based non parametric regression estimators are proposed and analyzed. First, a Nadaraya-Watson type Beta kernel estimator is introduced within the regression setup followed by a local linear regression estimator based on Beta kernels. For both these regression estimators, a comprehensive analysis of its large sample properties is presented. Specifically, for the first time, the asymptotic normality and the uniform almost sure convergence results for the new estimators are established. Additionally, general guidelines for bandwidth selection is provided. The finite sample performance of the proposed estimator is evaluated via both a simulation study and a real data application. The results presented and validated in this dissertation help advance the understanding and use of Beta kernel based methods in other non-parametric regression applications.
|
3 |
Simulation and characterisation of a concentrated solar power plant / Coenraad Josephus NelNel, Coenraad Josephus January 2015 (has links)
Concentrated solar power (CSP) is an efficient means of renewable energy that makes use of solar
radiation to produce electricity instead of making use of conventional fossil fuel techniques such as
burning coal. The aim of this study is the simulation and characterisation of a CSP plant in order to
gain a better understanding of the dominant plant dynamics. Due to the nature of the study, the
dissertation is divided into two main parts namely the simulation of a CSP plant model and the
characterisation of the plant model.
Modelling the CSP plant takes the form of developing an accurate Flownex® model of a 40 MW
combined cycle CSP plant. The model includes thermal energy storage as well as making use of a
duct burner. The Flownex® model is based on an existing TRNSYS model of the same plant. The
Flownex® model is verified and validated, by making use of a bottom-up approach, to ensure that
the developed model is in fact correct.
The characterisation part of this dissertation involves evaluating the dynamic responses unique to
that of a CSP plant as stated in the literature. This involves evaluating the dominant dynamic
behaviour, the presence of resonant and anti-resonant modes found within the control bandwidth,
and the change in the dynamics of the plant as the plants’ operating points change throughout the
day.
Once the developed model is validated, characterisation in the form of evaluating the open loop
local linear models of the plant is implemented. In order to do so, these models are developed
based on model identification processes, which include the use of system identification software
such as Matlab® SID Toolbox®.
The dominant dynamic behaviour of the plant model, obtained from the developed local linear
models, represents that of an over damped second order system that changes as the operating
points of the plant change; with the models’ time responses and the bandwidth decreasing and
increasing respectively as the thermal energy inputs to the plant increases. The frequency
response of the developed local linear models also illustrates the presence of resonant and antiresonant
modes found within the control bandwidth of the solar collector field’s temperature
response. These modes however are not found to be present in the mechanical power output
response of the plant.
The use of adaptive control, such as feedforward and gain-scheduled controllers, for the plant
should be developed to compensate for the dynamic behaviours associated with that of a CSP
plant. / MIng (Computer and Electronic Engineering), North-West University, Potchefstroom Campus, 2015
|
4 |
Simulation and characterisation of a concentrated solar power plant / Coenraad Josephus NelNel, Coenraad Josephus January 2015 (has links)
Concentrated solar power (CSP) is an efficient means of renewable energy that makes use of solar
radiation to produce electricity instead of making use of conventional fossil fuel techniques such as
burning coal. The aim of this study is the simulation and characterisation of a CSP plant in order to
gain a better understanding of the dominant plant dynamics. Due to the nature of the study, the
dissertation is divided into two main parts namely the simulation of a CSP plant model and the
characterisation of the plant model.
Modelling the CSP plant takes the form of developing an accurate Flownex® model of a 40 MW
combined cycle CSP plant. The model includes thermal energy storage as well as making use of a
duct burner. The Flownex® model is based on an existing TRNSYS model of the same plant. The
Flownex® model is verified and validated, by making use of a bottom-up approach, to ensure that
the developed model is in fact correct.
The characterisation part of this dissertation involves evaluating the dynamic responses unique to
that of a CSP plant as stated in the literature. This involves evaluating the dominant dynamic
behaviour, the presence of resonant and anti-resonant modes found within the control bandwidth,
and the change in the dynamics of the plant as the plants’ operating points change throughout the
day.
Once the developed model is validated, characterisation in the form of evaluating the open loop
local linear models of the plant is implemented. In order to do so, these models are developed
based on model identification processes, which include the use of system identification software
such as Matlab® SID Toolbox®.
The dominant dynamic behaviour of the plant model, obtained from the developed local linear
models, represents that of an over damped second order system that changes as the operating
points of the plant change; with the models’ time responses and the bandwidth decreasing and
increasing respectively as the thermal energy inputs to the plant increases. The frequency
response of the developed local linear models also illustrates the presence of resonant and antiresonant
modes found within the control bandwidth of the solar collector field’s temperature
response. These modes however are not found to be present in the mechanical power output
response of the plant.
The use of adaptive control, such as feedforward and gain-scheduled controllers, for the plant
should be developed to compensate for the dynamic behaviours associated with that of a CSP
plant. / MIng (Computer and Electronic Engineering), North-West University, Potchefstroom Campus, 2015
|
5 |
Sequential Procedures for Nonparametric Kernel RegressionDharmasena, Tibbotuwa Deniye Kankanamge Lasitha Sandamali, Sandamali.dharmasena@rmit.edu.au January 2008 (has links)
In a nonparametric setting, the functional form of the relationship between the response variable and the associated predictor variables is unspecified; however it is assumed to be a smooth function. The main aim of nonparametric regression is to highlight an important structure in data without any assumptions about the shape of an underlying regression function. In regression, the random and fixed design models should be distinguished. Among the variety of nonparametric regression estimators currently in use, kernel type estimators are most popular. Kernel type estimators provide a flexible class of nonparametric procedures by estimating unknown function as a weighted average using a kernel function. The bandwidth which determines the influence of the kernel has to be adapted to any kernel type estimator. Our focus is on Nadaraya-Watson estimator and Local Linear estimator which belong to a class of kernel type regression estimators called local polynomial kerne l estimators. A closely related problem is the determination of an appropriate sample size that would be required to achieve a desired confidence level of accuracy for the nonparametric regression estimators. Since sequential procedures allow an experimenter to make decisions based on the smallest number of observations without compromising accuracy, application of sequential procedures to a nonparametric regression model at a given point or series of points is considered. The motivation for using such procedures is: in many applications the quality of estimating an underlying regression function in a controlled experiment is paramount; thus, it is reasonable to invoke a sequential procedure of estimation that chooses a sample size based on recorded observations that guarantees a preassigned accuracy. We have employed sequential techniques to develop a procedure for constructing a fixed-width confidence interval for the predicted value at a specific point of the independent variable. These fixed-width confidence intervals are developed using asymptotic properties of both Nadaraya-Watson and local linear kernel estimators of nonparametric kernel regression with data-driven bandwidths and studied for both fixed and random design contexts. The sample sizes for a preset confidence coefficient are optimized using sequential procedures, namely two-stage procedure, modified two-stage procedure and purely sequential procedure. The proposed methodology is first tested by employing a large-scale simulation study. The performance of each kernel estimation method is assessed by comparing their coverage accuracy with corresponding preset confidence coefficients, proximity of computed sample sizes match up to optimal sample sizes and contrasting the estimated values obtained from the two nonparametric methods with act ual values at given series of design points of interest. We also employed the symmetric bootstrap method which is considered as an alternative method of estimating properties of unknown distributions. Resampling is done from a suitably estimated residual distribution and utilizes the percentiles of the approximate distribution to construct confidence intervals for the curve at a set of given design points. A methodology is developed for determining whether it is advantageous to use the symmetric bootstrap method to reduce the extent of oversampling that is normally known to plague Stein's two-stage sequential procedure. The procedure developed is validated using an extensive simulation study and we also explore the asymptotic properties of the relevant estimators. Finally, application of our proposed sequential nonparametric kernel regression methods are made to some problems in software reliability and finance.
|
6 |
Some Aspects of Propensity Score-based Estimators for Causal InferencePingel, Ronnie January 2014 (has links)
This thesis consists of four papers that are related to commonly used propensity score-based estimators for average causal effects. The first paper starts with the observation that researchers often have access to data containing lots of covariates that are correlated. We therefore study the effect of correlation on the asymptotic variance of an inverse probability weighting and a matching estimator. Under the assumptions of normally distributed covariates, constant causal effect, and potential outcomes and a logit that are linear in the parameters we show that the correlation influences the asymptotic efficiency of the estimators differently, both with regard to direction and magnitude. Further, the strength of the confounding towards the outcome and the treatment plays an important role. The second paper extends the first paper in that the estimators are studied under the more realistic setting of using the estimated propensity score. We also relax several assumptions made in the first paper, and include the doubly robust estimator. Again, the results show that the correlation may increase or decrease the variances of the estimators, but we also observe that several aspects influence how correlation affects the variance of the estimators, such as the choice of estimator, the strength of the confounding towards the outcome and the treatment, and whether constant or non-constant causal effect is present. The third paper concerns estimation of the asymptotic variance of a propensity score matching estimator. Simulations show that large gains can be made for the mean squared error by properly selecting smoothing parameters of the variance estimator and that a residual-based local linear estimator may be a more efficient estimator for the asymptotic variance. The specification of the variance estimator is shown to be crucial when evaluating the effect of right heart catheterisation, i.e. we show either a negative effect on survival or no significant effect depending on the choice of smoothing parameters. In the fourth paper, we provide an analytic expression for the covariance matrix of logistic regression with normally distributed regressors. This paper is related to the other papers in that logistic regression is commonly used to estimate the propensity score.
|
7 |
Neparametrické regresní odhady / Nonparametric regression estimatorsMěsíček, Martin January 2017 (has links)
This thesis is focused on local polynomial smoothers of the conditional vari- ance function in a heteroscedastic nonparametric regression model. Both mean and variance functions are assumed to be smooth, but neither is assumed to be in a parametric family. The basic idea is to apply a local linear regression to squa- red residuals. This method, as we have shown, has high minimax efficiency and it is fully adaptive to the unknown conditional mean function. However, the local linear estimator may give negative values in finite samples which makes variance estimation impossible. Hence Xu and Phillips proposed a new variance estimator that is asymptotically equivalent to the local linear estimator for interior points but is guaranteed to be non-negative. We also established asymptotic results of both estimators for boundary points and proved better asymptotic behavior of the local linear estimator. That motivated us to propose a modification of the local li- near estimator that guarantees non-negativity. Finally, simulations are conducted to evaluate the finite sample performances of the mentioned estimators.
|
8 |
Model robust regression: combining parametric, nonparametric, and semiparametric methodsMays, James Edward January 1995 (has links)
In obtaining a regression fit to a set of data, ordinary least squares regression depends directly on the parametric model formulated by the researcher. If this model is incorrect, a least squares analysis may be misleading. Alternatively, nonparametric regression (kernel or local polynomial regression, for example) has no dependence on an underlying parametric model, but instead depends entirely on the distances between regressor coordinates and the prediction point of interest. This procedure avoids the necessity of a reliable model, but in using no information from the researcher, may fit to irregular patterns in the data. The proper combination of these two regression procedures can overcome their respective problems. Considered is the situation where the researcher has an idea of which model should explain the behavior of the data, but this model is not adequate throughout the entire range of the data. An extension of partial linear regression and two methods of model robust regression are developed and compared in this context. These methods involve parametric fits to the data and nonparametric fits to either the data or residuals. The two fits are then combined in the most efficient proportions via a mixing parameter. Performance is based on bias and variance considerations. / Ph. D. / incomplete_metadata
|
9 |
Bayesian Structural Time Series in Marketing Mix Modelling / Bayesianska Strukturella Tidsseriemodeller inom Marketing Mix ModelleringKarlsson, Jessika January 2022 (has links)
Marketing Mix Modelling has been used since the 1950s, leveraging statistical inference to attribute media investments to sales. Typically, regression models have been used to model the relationship between the two. However, the media landscape evolves at an increasingly rapid pace, driving the need for more refined models which are able to accurately capture these changes. One class of such models are Bayesian structural time series, which are the focal point in this thesis. This class of models retains the relationship between media investments and sales, while also allowing for model parameters to vary over time. The effectiveness of these models is evaluated with respect to prediction accuracy and certainty, both in and out-of-sample. A total of four different models of varying degrees of complexity were investigated. It was concluded that the in-sample performance was similar across models, yet when it came to out-of-sample performance models with time-varying performance outperformed their static counterparts, with respect to uncertainty. Furthermore, the functional form of the intercept influenced the uncertainty of the forecasts on extended time horizons. / Marketing mix modellering har använts sedan 1950-talet för att dra slutsatser om hur mediainvesteringar påverkar försäljning, med hjälp av statistisk inferens. Vanligtvis har regressionmodeller använts för att modellera relationen mellan de två. Men medielandskapet utvecklas allt snabbare, vilket kräver mer sofistikerade modeller som kan fånga upp dessa förändringar på ett mer precist sätt. En klass av sådana modeller är Bayesianska strukturella tidsseriemodeller, som är fokus för detta arbete. Denna klass av modeller bibehåller den strukturella relationen mellan mediainvesteringar och försäljning, samtidigt som de också tillåter modellparametrarna att variera över tid. Effektiviteten hos modellerna bedöms med avseende på noggrannhet och säkerhet, både tränings- och testdata. Totalt fyra olika modeller med varierande komplexitet undersöktes. Det konstaterades att prestandan på träningsdata var likvärdig mellan modellerna, men när det gällde testdata presterade modeller med tidsvarierande parametrar bättre än sina statiska motsvarigheter, med avseende på osäkerhet. Dessutom påverkade den funktionella formen av interceptet osäkerheten hos prognoserna över längre tidshorisonter.
|
10 |
Nonparametric statistical inference for dependent censored dataEl Ghouch, Anouar 05 October 2007 (has links)
A frequent problem that appears in practical survival data analysis is censoring. A censored observation occurs when the observation of the event time (duration or survival time) may be prevented by the occurrence of an earlier competing event (censoring time). Censoring may be due to different causes. For example, the loss of some subjects under study, the end of the follow-up period, drop out or the termination of the study and the limitation in the sensitivity of a measurement instrument. The literature about censored data focuses on the i.i.d. case. However in many real applications the data are collected sequentially in time or space and so the assumption of independence in such case does not hold. Here we only give some typical examples from the literature involving correlated data which are subject to censoring. In the clinical trials domain it frequently happens that the patients from the same hospital have correlated survival times due to unmeasured variables like the quality of the hospital equipment. Censored correlated data are also a common problem in the domain of environmental and spatial (geographical or ecological) statistics. In fact, due to the process being used in the data sampling procedure, e.g. the analytical equipment, only the measurements which exceed some thresholds, for example the method detection limits or the instrumental detection limits, can be included in the data analysis. Many other examples can also be found in other fields like econometrics and financial statistics. Observations on duration of unemployment e.g., may be right censored and are typically correlated. When the data are not independent and are subject to censoring, estimation and inference become more challenging mathematical problems with a wide area of applications. In this context, we propose here some new and flexible tools based on a nonparametric approach. More precisely, allowing dependence between individuals, our main contribution to this domain concerns the following aspects. First, we are interested in developing more suitable confidence intervals for a general class of functionals of a survival distribution via the empirical likelihood method. Secondly, we study the problem of conditional mean estimation using the local linear technique. Thirdly, we develop and study a new estimator of the conditional quantile function also based on the local linear method. In this dissertation, for each proposed method, asymptotic results like consistency and asymptotic normality are derived and the finite sample performance is evaluated in a simulation study.
|
Page generated in 0.0483 seconds