301

Estimation of Pareto Distribution Functions from Samples Contaminated by Measurement Errors

Kondlo, Lwando Orbet January 2010 (has links)
Magister Scientiae - MSc / Estimation of population distributions, from samples that are contaminated by measurement errors, is a common problem. This study considers the problem of estimating the population distribution of independent random variables Xi from error-contaminated samples Yi (i = 1, ..., n) such that Yi = Xi + εi, where ε is the measurement error, which is assumed independent of X. The measurement error ε is also assumed to be normally distributed. Since the observed distribution function is a convolution of the error distribution with the true underlying distribution, estimation of the latter is often referred to as a deconvolution problem. A thorough study of the relevant deconvolution literature in statistics is reported. We also deal with the specific case when X is assumed to follow a truncated Pareto form. If observations are subject to Gaussian errors, then the observed Y is distributed as the convolution of the finite-support Pareto and Gaussian error distributions. The convolved probability density function (PDF) and cumulative distribution function (CDF) of the finite-support Pareto and Gaussian distributions are derived. The intention is to draw more specific connections between certain deconvolution methods and also to demonstrate the application of the statistical theory of estimation in the presence of measurement error. A parametric methodology for deconvolution when the underlying distribution is of the Pareto form is developed. Maximum likelihood estimation (MLE) of the parameters of the convolved distributions is considered. Standard errors of the estimated parameters are calculated from the inverse Fisher information matrix and a jackknife method. Probability-probability (P-P) plots and Kolmogorov-Smirnov (K-S) goodness-of-fit tests are used to evaluate the fit of the posited distribution. A bootstrapping method is used to calculate the critical values of the K-S test statistic, which are not available. Simulated data are used to validate the methodology. A real-life application of the methodology is illustrated by fitting convolved distributions to astronomical data.
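As a rough illustration of the convolution structure described in this abstract (not the thesis's own closed-form derivation), the sketch below numerically convolves a finite-support Pareto density with a Gaussian error density on a grid and recovers the parameters by maximum likelihood from simulated contaminated data; all parameter values and the grid-based integration are illustrative assumptions.

```python
import numpy as np
from scipy import stats, optimize
from scipy.integrate import trapezoid

def truncated_pareto_pdf(x, alpha, L, U):
    """Density of a Pareto(alpha) law restricted to the finite support [L, U]."""
    c = alpha * L**alpha / (1.0 - (L / U)**alpha)   # normalising constant
    x = np.asarray(x, dtype=float)
    return np.where((x >= L) & (x <= U), c * x**(-alpha - 1.0), 0.0)

def convolved_pdf(y, alpha, L, U, sigma, n_grid=400):
    """f_Y(y) = integral of f_X(x) * N(y - x; 0, sigma^2) dx, evaluated on a grid."""
    xs = np.linspace(L, U, n_grid)
    fx = truncated_pareto_pdf(xs, alpha, L, U)
    kern = stats.norm.pdf(np.atleast_1d(y)[:, None], loc=xs[None, :], scale=sigma)
    return trapezoid(fx[None, :] * kern, xs, axis=1)

def neg_log_lik(params, y):
    alpha, L, U, sigma = params
    if not (alpha > 0 and 0 < L < U and sigma > 0):
        return np.inf
    f = convolved_pdf(y, alpha, L, U, sigma)
    return np.inf if np.any(f <= 0) else -np.sum(np.log(f))

# Simulate contaminated data Y = X + eps with X ~ truncated Pareto, eps ~ N(0, sigma^2).
rng = np.random.default_rng(0)
alpha0, L0, U0, sigma0 = 2.0, 1.0, 10.0, 0.5
u = rng.uniform(size=2000)
x = (L0**-alpha0 - u * (L0**-alpha0 - U0**-alpha0)) ** (-1.0 / alpha0)   # inverse-CDF draw
y = x + rng.normal(0.0, sigma0, size=x.size)

res = optimize.minimize(neg_log_lik, x0=[1.5, 0.9, 12.0, 0.4], args=(y,),
                        method="Nelder-Mead")
print("MLE of (alpha, L, U, sigma):", np.round(res.x, 3))
```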
302

Modeling and Simulation of Spatial Extremes Based on Max-Infinitely Divisible and Related Processes

Zhong, Peng 17 April 2022 (has links)
The statistical modeling of extreme natural hazards is becoming increasingly important due to climate change, whose effects have been increasingly visible throughout the last decades. It is thus crucial to understand the dependence structure of rare, high-impact events over space and time for realistic risk assessment. For spatial extremes, max-stable processes have played a central role in modeling block maxima. However, the spatial tail dependence strength is persistent across quantile levels in those models, which is often not realistic in practice. This lack of flexibility implies that max-stable processes cannot capture weakening dependence at increasingly extreme levels, resulting in a drastic overestimation of joint tail risk. To address this, we develop new dependence models in this thesis from the class of max-infinitely divisible (max-id) processes, which contain max-stable processes as a subclass and are flexible enough to capture different types of dependence structures. Furthermore, exact simulation algorithms for general max-id processes are typically not straightforward due to their complex formulations. Both simulation and inference can be computationally prohibitive in high dimensions. Fast and exact simulation algorithms to simulate max-id processes are provided, together with methods to implement our models in high dimensions based on the Vecchia approximation method. These proposed methodologies are illustrated through various environmental datasets, including air temperature data in South-Eastern Europe in an attempt to assess the effect of climate change on heatwave hazards, and sea surface temperature data for the entire Red Sea. In another application focused on assessing how the spatial extent of extreme precipitation has changed over time, we develop new time-varying $r$-Pareto processes, which are the counterparts of max-stable processes for high threshold exceedances.
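The weakening tail dependence that motivates moving beyond max-stable models can be illustrated with the conditional exceedance probability chi(u) = P(U2 > u | U1 > u). In the hypothetical sketch below (not the thesis's data or models), a Student-t copula stands in for an asymptotically dependent, max-stable-like structure whose chi(u) stabilises at a positive value, while a Gaussian copula shows chi(u) decaying towards zero at extreme levels.

```python
import numpy as np
from scipy import stats

def chi_u(u_levels, v1, v2):
    """Empirical chi(u) = P(V2 > u | V1 > u) from pseudo-uniform scores v1, v2."""
    return np.array([np.mean(v2[v1 > u] > u) for u in u_levels])

rng = np.random.default_rng(1)
n, rho, nu = 200_000, 0.7, 3

# Gaussian copula sample (asymptotically independent: chi(u) -> 0).
L = np.linalg.cholesky([[1.0, rho], [rho, 1.0]])
z = rng.standard_normal((n, 2)) @ L.T
g1, g2 = stats.norm.cdf(z[:, 0]), stats.norm.cdf(z[:, 1])

# Student-t copula sample (asymptotically dependent: chi(u) -> chi > 0), used here
# as a stand-in for the persistent tail dependence of max-stable models.
w = rng.chisquare(nu, size=n) / nu
t = z / np.sqrt(w)[:, None]
t1, t2 = stats.t.cdf(t[:, 0], df=nu), stats.t.cdf(t[:, 1], df=nu)

u_levels = np.array([0.90, 0.95, 0.99, 0.995])
print("chi(u), Gaussian copula: ", np.round(chi_u(u_levels, g1, g2), 3))
print("chi(u), Student-t copula:", np.round(chi_u(u_levels, t1, t2), 3))
```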
303

Risk–constrained stochastic economic dispatch and demand response with maximal renewable penetration under renewable obligation

Hlalele, Thabo Gregory January 2020 (has links)
In recent years, a great deal of attention has been paid to optimal demand- and supply-side strategies. The increase in renewable energy sources and the expansion of demand response programmes have shown the need for a robust power system. These changes require controlling uncertain generation and load at the same time. It is therefore important to provide an optimal scheduling strategy that can meet an adequate energy mix under demand response without affecting system reliability and economic performance. This thesis addresses these changes in the following four aspects. First, a renewable obligation model is proposed to maintain an adequate energy mix in the economic dispatch model while minimising the operational costs of the allocated spinning reserves. This method considers a minimum renewable penetration that must be achieved daily in the energy mix. If the renewable quota is not achieved, the generation companies are penalised by the system operator. The uncertainty of renewable energy sources is modelled using probability density functions, which are used for scheduling the output power of these generators. The overall problem is formulated as a security-constrained economic dispatch problem. Second, a combined economic and demand response optimisation model under a renewable obligation is presented. Real data from a large-scale demand response programme are used in the model. The model finds an optimal power dispatch strategy that takes advantage of demand response to minimise generation cost and maximise renewable penetration. The optimisation model is applied to a South African large-scale demand response programme in which the system operator can directly control the participation of electrical water heaters at substation level. Actual load profiles before and after demand reduction are used to assist the system operator in making optimal decisions on whether a substation should participate in the demand response programme. The use of these real demand response data avoids traditional approaches that assume arbitrary controllability of flexible loads. Third, a stochastic multi-objective economic dispatch model under a renewable obligation is presented. This approach minimises the total operating costs of generators and spinning reserves under the renewable obligation while maximising renewable penetration. The intermittent nature of the renewable energy sources is modelled using dynamic scenarios, and the proposed model shows the effectiveness of the renewable obligation policy framework. Due to the computational complexity of considering all possible scenarios, a scenario reduction method is applied to reduce the number of scenarios and solve the model. A Pareto optimal solution is presented for the renewable obligation, and further decision making is conducted to assess the trade-offs associated with the Pareto front. Fourth, a combined risk-constrained stochastic economic dispatch and demand response model under a renewable obligation is presented. An incentive-based optimal power dispatch strategy is implemented to minimise generation costs and maximise renewable penetration. In addition, a risk-constrained approach is used to control the financial risks of the generation company under the demand response programme. A coordination strategy is presented for the generation companies to dispatch power using thermal generators and renewable energy sources while maintaining an adequate spinning reserve.
The proposed model is robust and can achieve significant demand reduction while increasing renewable penetration and decreasing the financial risks for generation companies. / Thesis (PhD (Electrical Engineering))--University of Pretoria, 2020. / Electrical, Electronic and Computer Engineering / PhD (Electrical Engineering) / Unrestricted
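As a toy illustration of the renewable-obligation mechanism described in this abstract (all figures, bounds and the penalty value are hypothetical and not taken from the thesis), a single-period dispatch can be written as a linear programme in which any shortfall below the renewable quota is penalised:

```python
import numpy as np
from scipy.optimize import linprog

# Single-period dispatch with a renewable-obligation penalty (all numbers hypothetical).
demand  = 900.0                               # MW
quota   = 0.30                                # required renewable share of the energy mix
penalty = 80.0                                # $/MWh charged on any shortfall below the quota
cost    = np.array([30.0, 45.0, 5.0])         # $/MWh: thermal G1, thermal G2, renewable
p_min   = np.array([100.0, 50.0, 0.0])
p_max   = np.array([600.0, 500.0, 200.0])     # renewable forecast caps its dispatch

# Decision vector: [p_G1, p_G2, p_renewable, shortfall]
c = np.append(cost, penalty)
A_eq = [[1.0, 1.0, 1.0, 0.0]]; b_eq = [demand]      # power balance
A_ub = [[0.0, 0.0, -1.0, -1.0]]                      # p_renewable + shortfall >= quota * demand
b_ub = [-quota * demand]
bounds = list(zip(p_min, p_max)) + [(0.0, None)]

res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq,
              bounds=bounds, method="highs")
p = res.x
print("dispatch [G1, G2, RES] =", np.round(p[:3], 1), "MW")
print("renewable share = %.1f%%, quota shortfall = %.1f MW, cost = %.0f $/h"
      % (100 * p[2] / demand, p[3], res.fun))
```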
304

Aplicación de la teoría de los sistemas complejos y la autoorganización al estudio de la distribución del tamaño de las empresas [Application of complex systems theory and self-organization to the study of the firm size distribution]

Llorca Ponce, Alicia 01 February 2020 (has links)
[EN] The research work presented below seeks to advance the explanation of an empirically established phenomenon: the asymmetric behaviour of the firm size distribution. The empirical evidence shows that, in most cases, economies are supplied by companies of all sizes. The asymmetry of the distribution indicates that markets are generally made up of very few large companies alongside a large number of small ones. Far from being exclusive to the firm size distribution, this behaviour is present in other phenomena, both economic (for instance, the distribution of a population's income or the location of activities in space) and from very different fields. In 1949, the linguist George Kingsley Zipf published a work describing various asymmetrically distributed phenomena in which a mathematical relation could be observed between the size of an event and the frequency of its occurrence. This relation, today known as Zipf's law, states that the frequency of occurrence of an event is inversely related to its size or intensity. Applied to the firm size distribution, compliance with the law implies that the frequency of occurrence of companies of a given size is inversely proportional to a power of that size. This behaviour had already been discovered by Pareto in 1896 in a controversial setting: the distribution of income in a population. Since Zipf's work was published, many others have found power laws in the distribution of diverse phenomena: the intensity of earthquakes, the frequency of word occurrence, species-extinction avalanches, or visits to web pages, among others. The ubiquity of this behaviour, known as power-law distributions, is now widely recognised. Despite the substantial empirical evidence, theoretical explanations for the abundance of phenomena distributed as power laws have not been very successful. This research, centred on fitting the firm size distribution to Zipf's law, extends the empirical evidence by verifying its compliance for Spanish companies. Beyond the empirical work, however, the aim of the research is to advance possible theoretical explanations of the phenomenon. In this sense, the research carried out considers the paradigm of complexity and self-organization to be the most suitable approach to the question. The conclusion is that the power laws observed in complex systems are a characteristic of the architecture of self-organized systems. Specifically, the Zipf law observed for the firm size distribution is a sign of the self-organization of the system, in our case the market. Power laws are regarded as a macro-behaviour that emerges spontaneously in systems and derives from the multiple interactions between the agents involved; from these interactions a statistical pattern arises that can only be observed at the level of the system. The research recognizes the relation between the appearance of power laws and self-organization processes; from here on, the challenge is to determine which type of processes give rise to the emergence of these laws. Although some theoretical explanations and models exist, they do not seem sufficiently satisfactory; much remains to be done in the search for the underlying mechanisms that generate the appearance of power laws in the firm size distribution.
Llorca Ponce, A. (2007). Aplicación de la teoría de los sistemas complejos y la autoorganización al estudio de la distribución del tamaño de las empresas [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/136194 / TESIS
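The rank-size form of Zipf's law discussed above can be checked with a short simulation (an illustrative sketch on synthetic data, not the thesis's Spanish firm data): firm sizes are drawn from a Pareto law and the exponent is recovered from the slope of log(rank) against log(size), with a Hill estimate as a cross-check.

```python
import numpy as np

rng = np.random.default_rng(2)

# Draw hypothetical "firm sizes" from a Pareto law with tail index alpha.
alpha, x_min, n = 1.1, 1.0, 50_000
sizes = x_min * (1.0 - rng.uniform(size=n)) ** (-1.0 / alpha)   # inverse-CDF sampling

# Zipf / rank-size check: log(rank) should fall roughly on a line in log(size),
# with slope approximately -alpha.
sizes_sorted = np.sort(sizes)[::-1]
ranks = np.arange(1, n + 1)
slope, intercept = np.polyfit(np.log(sizes_sorted), np.log(ranks), 1)
print(f"tail exponent from rank-size regression: {-slope:.2f} (true value {alpha})")

# Hill estimator over the k largest observations, as a cross-check.
k = 2000
tail = sizes_sorted[:k]
hill = 1.0 / np.mean(np.log(tail[:-1] / tail[-1]))
print(f"Hill estimate of alpha: {hill:.2f}")
```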
305

Characteristics of Electricity Storage Technologies for Maintaining Reliability of Grid with High Amounts of Intermittent Energy

Sundararagavan, Sandhya 01 January 2010 (has links) (PDF)
For the grid to be stable, the supply of power must equal the demands of the consumer at every moment during the day. The unpredictable, intermittent nature of wind results in inconsistent power generation. Energy storage technologies coupled with a wind farm can not only provide power during fluctuations but also help maintain a stable and reliable grid. The objective of the thesis is to perform a comprehensive analysis of different types of energy storage technologies that can be coupled with a wind farm. The analysis is performed on the basis of multiple characteristics which affect their viability. We identified key characteristics for a range of storage technologies, including lead-acid, sodium-sulphur, nickel-cadmium, lithium-ion, superconducting magnetic energy storage, electrochemical capacitors, flywheels, flow batteries, pumped hydro and compressed air energy storage systems. We performed a comparison study to analyze trade-offs and assessed potential improvement areas that will make them more competitive in the electric power industry. Finally, we suggested viable energy storage systems that are well suited to different applications in an electric grid integrated with a wind farm.
306

Bayesian Modeling of Sub-Asymptotic Spatial Extremes

Yadav, Rishikesh 04 1900 (has links)
In many environmental and climate applications, extreme data are spatial by nature, and hence statistics of spatial extremes is currently an important and active area of research dedicated to developing innovative and flexible statistical models that determine the location, intensity, and magnitude of extreme events. In particular, flexible sub-asymptotic models are increasingly popular because they can model spatial high-threshold exceedances in larger spatial dimensions with little or no sensitivity to the choice of threshold, something that is difficult to achieve with classical extreme-value processes such as Pareto processes. In this thesis, we develop new flexible sub-asymptotic extreme value models for modeling spatial and spatio-temporal extremes that are combined with carefully designed gradient-based Markov chain Monte Carlo (MCMC) sampling schemes and that can be exploited to address important scientific questions related to risk assessment in a wide range of environmental applications. The methodological developments are centered around two distinct themes, namely (i) sub-asymptotic Bayesian models for extremes; and (ii) flexible marked point process models with sub-asymptotic marks. In the first part, we develop several types of new flexible models for light-tailed and heavy-tailed data, which extend a hierarchical representation of the classical generalized Pareto (GP) limit for threshold exceedances. Spatial dependence is modeled through latent processes. We study the theoretical properties of our new methodology and demonstrate it by simulation and applications to precipitation extremes in both Germany and Spain. In the second part, we construct new marked point process models, where interest mostly lies in the extremes of the mark distribution. Our proposed joint models exploit intrinsic CAR priors to capture the spatial effects in landslide counts and sizes, while the mark distribution is assumed to take various parametric forms. We demonstrate that having a sub-asymptotic distribution for landslide sizes provides extra flexibility to accurately capture small to large and especially extreme, devastating landslides.
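As background to the generalized Pareto (GP) building block mentioned above (a generic peaks-over-threshold illustration on synthetic data, not the thesis's hierarchical Bayesian model), exceedances over a high threshold can be fitted with a GP distribution and used for tail estimation:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)

# Hypothetical daily precipitation-like data with a heavy right tail.
data = rng.gamma(shape=0.8, scale=8.0, size=20_000)

# Peaks-over-threshold: keep exceedances above a high empirical quantile.
u = np.quantile(data, 0.95)
exceed = data[data > u] - u

# Fit a generalized Pareto distribution to the exceedances (location fixed at 0).
shape, loc, scale = stats.genpareto.fit(exceed, floc=0.0)
print(f"threshold u = {u:.2f}, GP shape (xi) = {shape:.3f}, scale = {scale:.3f}")

# Tail estimate: P(X > x) is approximately p_u * GP survival function at x - u.
p_u = np.mean(data > u)
x = u + 30.0
tail_prob = p_u * stats.genpareto.sf(x - u, shape, loc=0.0, scale=scale)
print(f"estimated P(X > {x:.1f}) = {tail_prob:.2e}")
```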
307

A Pareto-Frontier Analysis of Performance Trends for Small Regional Coverage LEO Constellation Systems

Hinds, Christopher Alan 01 December 2014 (has links) (PDF)
As satellites become smaller, cheaper, and quicker to manufacture, constellation systems will be an increasingly attractive means of meeting mission objectives. Optimizing satellite constellation geometries is therefore a topic of considerable interest. As constellation systems become more achievable, providing coverage to specific regions of the Earth will become more commonplace. Small countries or companies that are currently unable to afford large and expensive constellation systems will, now or in the near future, be able to afford their own constellation systems to meet their individual requirements for small coverage regions. The focus of this thesis was to optimize constellation geometries for small coverage regions, with the constellation design limited to 1-6 satellites in a Walker-delta configuration at an altitude of 200-1500 km, providing remote sensing coverage with a minimum ground elevation angle of 60 degrees. Few Pareto-frontiers have been developed and analyzed to show the trade-offs among various performance metrics, especially for this type of constellation system. The performance metrics focus on geometric coverage and include revisit time, daily visibility time, constellation altitude, ground elevation angle, and the number of satellites. The objective space containing these performance metrics was characterized for five different regions at latitudes of 0, 22.5, 45, 67.5, and 90 degrees. In addition, the effect of the minimum ground elevation angle on the achievable performance of this type of constellation system was studied. Finally, the traditional Walker-delta pattern constraint was relaxed to allow for asymmetrical designs, which were compared against the Walker-delta designs to assess how the symmetric pattern performs relative to a more relaxed design space. The goal of this thesis was to provide a framework as well as to obtain and analyze Pareto-frontiers for constellation performance relating to small regional coverage LEO constellation systems. This work provided an in-depth analysis of the trends in both the design and objective space of the obtained Pareto-frontiers. A variation on the εNSGA-II algorithm, an evolutionary algorithm developed by Kalyanmoy Deb to solve complex multi-objective optimization problems, was utilized along with a MATLAB/STK interface to produce these Pareto-frontiers. The algorithm used in this study proved to be very efficient at obtaining the various Pareto-frontiers. This study was also successful in characterizing the design and solution space of small LEO remote sensing constellation systems providing small regional coverage.
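A Pareto frontier of the kind analysed in this thesis can be extracted from any finite set of candidate designs with a simple non-dominated filter. The sketch below assumes both objectives are to be minimised (for example, mean revisit time and number of satellites); the candidate values are hypothetical.

```python
import numpy as np

def pareto_front(points):
    """Return a boolean mask of non-dominated rows, assuming every column is minimised."""
    points = np.asarray(points, dtype=float)
    mask = np.ones(points.shape[0], dtype=bool)
    for i in range(points.shape[0]):
        if not mask[i]:
            continue
        # j dominates i if it is <= in every objective and strictly < in at least one.
        dominates_i = (np.all(points <= points[i], axis=1)
                       & np.any(points < points[i], axis=1))
        if np.any(dominates_i):
            mask[i] = False
    return mask

# Hypothetical candidate constellations: (mean revisit time [min], number of satellites).
designs = np.array([[95, 2], [60, 3], [48, 4], [50, 4], [35, 5], [34, 6], [120, 1]])
front = pareto_front(designs)
print("Pareto-optimal designs:\n", designs[front])
```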
308

Using Pareto points for model identification in predictive toxicology

Palczewska, Anna Maria, Neagu, Daniel, Ridley, Mick J. January 2013 (has links)
no / Predictive toxicology is concerned with the development of models that are able to predict the toxicity of chemicals. A reliable prediction of toxic effects of chemicals in living systems is highly desirable in cosmetics, drug design or food protection to speed up the process of chemical compound discovery while reducing the need for lab tests. There is an extensive literature associated with the best practice of model generation and data integration but management and automated identification of relevant models from available collections of models is still an open problem. Currently, the decision on which model should be used for a new chemical compound is left to users. This paper intends to initiate the discussion on automated model identification. We present an algorithm, based on Pareto optimality, which mines model collections and identifies a model that offers a reliable prediction for a new chemical compound. The performance of this new approach is verified for two endpoints: IGC50 and LogP. The results show a great potential for automated model identification methods in predictive toxicology.
309

Multi-objective day-ahead scheduling of microgrids using modified grey wolf optimizer algorithm

Javidsharifi, M., Niknam, T., Aghaei, J., Mokryani, Geev, Papadopoulos, P. 10 August 2018 (has links)
Yes / Investigation of the environmental/economic optimal operation management of a microgrid (MG), as a case study for applying a novel modified multi-objective grey wolf optimizer (MMOGWO) algorithm, is presented in this paper. MGs can be considered a fundamental solution for the management of distributed generators (DGs) in future smart grids. In multi-objective problems, since the objective functions conflict, the best compromise solution should be extracted through an efficient approach; accordingly, a suitable method is applied for exploring the best compromise solution. Additionally, a novel distance-based method is proposed to control the size of the repository within a target limit, which leads to fast and precise convergence along with a well-distributed Pareto optimal front. The proposed method is implemented in a typical grid-connected MG with non-dispatchable units including renewable energy sources (RESs), along with a hybrid power source (micro-turbine, fuel cell and battery) as dispatchable units, to accumulate excess energy or to equalize power mismatch, by optimally scheduling the DGs and the power exchange between the utility grid and the storage system. The efficiency of the suggested algorithm in satisfying the load and optimizing the objective functions is validated through comparison with different methods, including PSO and the original GWO. / Supported in part by Royal Academy of Engineering Distinguished Visiting Fellowship under Grant DVF1617\6\45
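A distance-based repository control of the sort mentioned above can be sketched as a crowding-style pruning rule (a generic illustration under our own assumptions, not the authors' exact method): while the archive exceeds its size limit, the member whose nearest neighbour in objective space is closest is discarded, which tends to keep the retained front well spread out.

```python
import numpy as np

def prune_repository(objectives, max_size):
    """Drop the most crowded members (smallest nearest-neighbour distance in
    objective space) until the archive is no larger than max_size."""
    objs = np.asarray(objectives, dtype=float)
    keep = list(range(len(objs)))
    while len(keep) > max_size:
        pts = objs[keep]
        d = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=2)
        np.fill_diagonal(d, np.inf)
        nearest = d.min(axis=1)               # distance to each member's closest neighbour
        keep.pop(int(np.argmin(nearest)))     # remove the most crowded member
    return keep

# Hypothetical archive of non-dominated (cost, emission) pairs.
archive = np.array([[0.0, 1.0], [0.1, 0.9], [0.12, 0.88],
                    [0.5, 0.5], [0.9, 0.1], [1.0, 0.0]])
kept = prune_repository(archive, max_size=4)
print("kept indices:", kept)
print("pruned archive:\n", archive[kept])
```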
310

Interpretation, Identification and Reuse of Models. Theory and algorithms with applications in predictive toxicology.

Palczewska, Anna Maria January 2014 (has links)
This thesis is concerned with developing methodologies that enable existing models to be effectively reused. The results of this thesis are presented in the framework of Quantitative Structure-Activity Relationship (QSAR) models, but their application is much more general. QSAR models relate chemical structures to their biological, chemical or environmental activity. There are many applications that offer an environment to build and store predictive models. Unfortunately, they do not provide advanced functionalities that allow for efficient model selection and for interpretation of model predictions for new data. This thesis aims to address these issues and proposes methodologies for dealing with three research problems: model governance (management), model identification (selection), and interpretation of model predictions. The combination of these methodologies can be employed to build more efficient systems for model reuse in QSAR modelling and other areas. The first part of this study investigates toxicity data and model formats and reviews some of the existing toxicity systems in the context of model development and reuse. Based on the findings of this review and the principles of data governance, a novel concept of model governance is defined. Model governance comprises model representation and model governance processes. These processes are designed and presented in the context of model management. As an application, minimum information requirements and an XML representation for QSAR models are proposed. Once a collection of validated, accepted and well annotated models is available within a model governance framework, they can be applied to new data. It may happen that there is more than one model available for the same endpoint; which one to choose? The second part of this thesis proposes a theoretical framework and algorithms that enable automated identification of the most reliable model for new data from a collection of existing models. The main idea is based on partitioning the search space into groups and assigning a single model to each group. The construction of this partitioning is difficult because it is a bi-criteria problem. The main contribution in this part is the application of Pareto points to the search space partition. The proposed methodology is applied to three endpoints in chemoinformatics and predictive toxicology. After having identified a model for the new data, we would like to know how the model obtained its prediction and how trustworthy it is. The interpretation of model predictions is straightforward for linear models thanks to the availability of model parameters and their statistical significance; for non-linear models this information can be hidden inside the model structure. This thesis proposes an approach for the interpretation of random forest classification models, which allows the influence (called the feature contribution) of each variable on the model prediction for an individual data point to be determined. In this part, three methods are proposed that allow analysis of feature contributions. Such analysis might lead to the discovery of new patterns that represent the standard behaviour of the model and allow additional assessment of the model's reliability for new data. The application of these methods to two standard benchmark datasets from the UCI machine learning repository shows the great potential of this methodology.
The algorithm for calculating feature contributions has been implemented and is available as an R package called rfFC. / BBSRC and Syngenta (International Research Centre at Jealott’s Hill, Bracknell, UK).
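A comparable feature-contribution decomposition can be reproduced in Python with the third-party treeinterpreter package, which splits each random-forest prediction into a bias term plus one additive contribution per feature (the thesis's own implementation is the R package rfFC; the sketch below is an assumed Python analogue, not that package):

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from treeinterpreter import treeinterpreter as ti   # third-party package

X, y = load_iris(return_X_y=True)
rf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

# Decompose the predictions for a few instances: prediction = bias + sum(contributions).
instances = X[:3]
prediction, bias, contributions = ti.predict(rf, instances)

for i in range(len(instances)):
    print(f"instance {i}: predicted class probabilities = {np.round(prediction[i], 3)}")
    # contributions[i] has shape (n_features, n_classes); the column sums plus the
    # bias term reproduce the predicted probabilities.
    recon = bias[i] + contributions[i].sum(axis=0)
    print("  bias + summed feature contributions =", np.round(recon, 3))
```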
