Global ETD Search

1	Comparison of Schema Matching Evaluations Do, Hong-Hai, Melnik, Sergey, Rahm, Erhard 12 December 2018 (has links) Recently, schema matching has found considerable interest in both research and practice. Determining matching components of database or XML schemas is needed in many applications, e.g. for E-business and data integration. Various schema matching systems have been developed to solve the problem semi-automatically. While there have been some evaluations, the overall effectiveness of currently available automatic schema matching systems is largely unclear. This is because the evaluations were conducted in diverse ways making it difficult to assess the effectiveness of each single system, let alone to compare their effectiveness. In this paper we survey recently published schema matching evaluations. For this purpose, we introduce the major criteria that influence the effectiveness of a schema matching approach and use these criteria to compare the various systems. Based on our observations, we discuss the requirements for future match implementations and evaluations.
2	Jackknife Empirical Likelihood-Based Confidence Intervals for Low Income Proportions with Missing Data YIN, YANAN 18 December 2013 (has links) The estimation of low income proportions plays an important role in comparisons of poverty in different countries. In most countries, the stability of the society and the development of economics depend on the estimation of low income proportions. An accurate estimation of a low income proportion has a crucial role for the development of the natural economy and the improvement of people's living standards. In this thesis, the Jackknife empirical likelihood method is employed to construct confidence intervals for a low income proportion when the observed data had missing values. Comprehensive simulation studies are conducted to compare the relative performances of two Jackknife empirical likelihood based confidence intervals for low income proportions in terms of coverage probability. A real data example is used to illustrate the application of the proposed methods. Jackknife Empirical Likelihood Missing Data Auxiliary information Coverage Probability Low Income Proportion Income distribution
3	Statistical estimators of the finite population parameters in the case of sample rotation / Baigtinės populiacijos parametrų statistiniai įvertiniai esant imties rotacijai Chadyšas, Viktoras 03 March 2010 (has links) The dissertation analyzes how to incorporate auxiliary information into the estimation of the finite population total, distribution function and quantile in the case of sample rotation. First of all estimation of the finite population total in the case of sample rotation are considered. We focus on construction of the total estimator for rotated sampling design. Successive sampling procedure using multi-phase sampling design have been developed. The composite ratio type estimator of the total using auxiliary information and its approximate variance is constructed. A simulation study, based on the real population data, is performed and the proposed estimators are compared by a traditional estima-tor for a total. The composite estimators of finite population distribution function, constructed under sampling on two occasions, are considered. Composite regression and ratio type estimators are constructed, using values of the study variable as auxiliary information obtained on the first occasion. The optimal estimators, in the sense of minimal variance, is also obtained. A simulation study, based on the real population data, is performed and the proposed estimators are compared by a traditional estimator for a distribution function. Several quantile estimators by deriving the distribution function estimators with the use of auxiliary information are proposed. Some procedures that may be used to obtain estimates of confidence intervals for quantiles in a finite population (most of... [to full text] / Disertacijoje sudaromi baigtinės populiacijos tyrimo kintamojo sumos, pasiskirstymo funkcijos, kvantilio įvertiniai esant imties rotacijai. Pirmiausia darbe nagrinėjamas baigtinės populiacijos tyrimo kintamojo sumos vertinimas esant imties rotacijai. Sudarytas sudėtinis santykinis sumos įvertinys naudojantis papildomą informaciją žinomą iš ankstesnių imties rinkimų. Modeliavimo rezultatai rodo, kad papildomos informacijos panaudojimas iš jau išrinktos imties gali pagerinti įvertinių tikslumą. Kelių ėmimų schema gali būti taikoma siekiant pagerinti baigtinės populiacijos nepaslinktojo imties planu pagristo sumos įvertinio tikslumą. Taip pat nagrinėjami baigtinės populiacijos tyrimo kintamojo pasiskirstymo funkcijos įvertiniai esant imties rotacijai. Sudaryti keli tyrimo kintamojo baigtinėje populiacijoje pasiskirstymo funkcijos sudėtiniai įvertiniai (regresinis ir santykinis) naudojant dviejų ėmimų schemą. Pasiūlyti optimalūs sudėtiniai pasiskirstymo funkcijos įvertiniai su mažiausia dispersija. Įvertiniai lyginami tarpusavyje atliekant modeliavimą su realiais duomenimis. Baigtinės populiacijos kvantilio ivertiniai sudaromi imant sudėtiniu pasiskirstymo funkcijų įvertinių atvirkštinių funkcijų įvertinius. Taip pat sudaromi pasikliautinojo intervalo įvertiniai kvantiliams, taikant kartotinių imčių metodus. Modeliuojant duomenis lyginamas jų tikslumas, daromos išvados apie pasikliautinojo intervalo įvertinių, sudarytų skirtingais būdais, kvantiliui efektyvumą. Mathematics Sample rotation Auxiliary information Composite estimators Resampling methods Imties rotacija Papildoma informacija Sudėtiniai įvertiniai Imčių perrinkimo metodai
4	Modelos baseados no planejamento para análise de populações finitas / Design-based models for the analysis of finite populations González Garcia, Luz Mery 23 April 2008 (has links) Estudamos o problema de obtenção de estimadores/preditores ótimos para combinações lineares de respostas coletadas de uma população finita por meio de amostragem aleatória simples. Nesse contexto, estendemos o modelo misto para populações finitas proposto por Stanek, Singer & Lencina (2004, Journal of Statistical Planning and Inference) para casos em que se incluem erros de medida (endógenos e exógenos) e informação auxiliar. Admitindo que as variâncias são conhecidas, mostramos que os estimadores/preditores propostos têm erro quadrático médio menor dentro da classe dos estimadores lineares não viciados. Por meio de estudos de simulação, comparamos o desempenho desses estimadores/preditores empíricos, i.e., obtidos com a substituição das componentes de variância por estimativas, com aquele de competidores tradicionais. Também, estendemos esses modelos para análise de estudos com estrutura do tipo pré-teste/pós-teste. Também por intermédio de simulação, comparamos o desempenho dos estimadores empíricos com o desempenho do estimador obtido por meio de técnicas clássicas de análise de medidas repetidas e com o desempenho do estimador obtido via análise de covariância por meio de mínimos quadrados, concluindo que os estimadores/ preditores empíricos apresentaram um menor erro quadrático médio e menor vício. Em geral, sugerimos o emprego dos estimadores/preditores empíricos propostos para dados com distribuição assimétrica ou amostras pequenas. / We consider optimal estimation of finite population parameters with data obtained via simple random samples. In this context, we extend a finite population mixed model proposed by Stanek, Singer & Lencina (2004, Journal of Statistical Planning and Inference) by including measurement errors (endogenous or exogenous) and auxiliary information. Assuming that variance components are known, we show that the proposed estimators/predictors have the smallest mean squared error in the class of unbiased estimators. Using simulation studies, we compare the performance of the empirical estimators/predictors obtained by replacing variance components with estimates with the performance of a traditional estimator. We also extend the finite population mixed model to data obtained via pretest-posttest designs. Through simulation studies, we compare the performance of the empirical estimator of the difference in gain between groups with the performance of the usual repeated measures estimator and with the performance of the usual analysis of covariance estimator obtained via ordinary least squares. The empirical estimator has smaller mean squared error and bias than the alternative estimators under consideration. In general, we recommend the use of the proposed estimators/ predictors for either asymmetric response distributions or small samples. auxiliary information BLUP coeficiente de regressão erros de medida estimador ótimo estudos pré-teste/pós-teste informação auxiliar measure of association measurement error medida de associação modelo de permutação aleatória. optimal estimator pretest-posttest designs random permutation model regression coefficient response error.
5	Modelos baseados no planejamento para análise de populações finitas / Design-based models for the analysis of finite populations Luz Mery González Garcia 23 April 2008 (has links) Estudamos o problema de obtenção de estimadores/preditores ótimos para combinações lineares de respostas coletadas de uma população finita por meio de amostragem aleatória simples. Nesse contexto, estendemos o modelo misto para populações finitas proposto por Stanek, Singer & Lencina (2004, Journal of Statistical Planning and Inference) para casos em que se incluem erros de medida (endógenos e exógenos) e informação auxiliar. Admitindo que as variâncias são conhecidas, mostramos que os estimadores/preditores propostos têm erro quadrático médio menor dentro da classe dos estimadores lineares não viciados. Por meio de estudos de simulação, comparamos o desempenho desses estimadores/preditores empíricos, i.e., obtidos com a substituição das componentes de variância por estimativas, com aquele de competidores tradicionais. Também, estendemos esses modelos para análise de estudos com estrutura do tipo pré-teste/pós-teste. Também por intermédio de simulação, comparamos o desempenho dos estimadores empíricos com o desempenho do estimador obtido por meio de técnicas clássicas de análise de medidas repetidas e com o desempenho do estimador obtido via análise de covariância por meio de mínimos quadrados, concluindo que os estimadores/ preditores empíricos apresentaram um menor erro quadrático médio e menor vício. Em geral, sugerimos o emprego dos estimadores/preditores empíricos propostos para dados com distribuição assimétrica ou amostras pequenas. / We consider optimal estimation of finite population parameters with data obtained via simple random samples. In this context, we extend a finite population mixed model proposed by Stanek, Singer & Lencina (2004, Journal of Statistical Planning and Inference) by including measurement errors (endogenous or exogenous) and auxiliary information. Assuming that variance components are known, we show that the proposed estimators/predictors have the smallest mean squared error in the class of unbiased estimators. Using simulation studies, we compare the performance of the empirical estimators/predictors obtained by replacing variance components with estimates with the performance of a traditional estimator. We also extend the finite population mixed model to data obtained via pretest-posttest designs. Through simulation studies, we compare the performance of the empirical estimator of the difference in gain between groups with the performance of the usual repeated measures estimator and with the performance of the usual analysis of covariance estimator obtained via ordinary least squares. The empirical estimator has smaller mean squared error and bias than the alternative estimators under consideration. In general, we recommend the use of the proposed estimators/ predictors for either asymmetric response distributions or small samples. coeficiente de regressão erros de medida estimador ótimo estudos pré-teste/pós-teste informação auxiliar medida de associação modelo de permutação aleatória. auxiliary information BLUP measure of association measurement error optimal estimator pretest-posttest designs random permutation model regression coefficient response error.
6	Estimation efficace en présence de non-réponse dans les enquêtes Gao, Yimeng 03 1900 (has links) No description available. Calage Calage assisté par un modèle Information auxiliaire Non-réponse Repondération Auxiliary information Calibration Model calibration Non-response Propensity score adjusted estimator Re-weighting
7	Baigtinės populiacijos parametrų statistiniai įvertiniai, gauti naudojant papildomą informaciją / Statistical estimators of the finite population parameters in the presence of auxiliary information Pumputis, Dalius 09 March 2009 (has links) Disertacijoje nagrinėjamos papildomos informacijos panaudojimo galimybės konstruojant baigtinės populiacijos sumos, dispersijos ir kovariacijos įvertinius, bei sluoksniuojant baigtines populiacijas. Pirmiausia darbe sprendžiamas populiacijų sluoksniavimo uždavinys, kai tyrimo kintamojo skirstinys yra asimetrinis. Pasiūlomas naujas - pataisytasis geometrinis - sluoksniavimo metodas. Šis metodas modeliuojant lyginamas su trimis kitais žinomais metodais: kvadratinės šaknies iš skirstinio dažnio, geometriniu ir laipsninio sluoksniavimo metodu. Modeliavimo rezultatai rodo, kad vidutiniškai asimetrinėms populiacijoms geriausiai tinka laipsninio sluoksniavimo metodas, o ypač asimetrinėms populiacijoms geriausias yra pataisytasis geometrinis sluoksniavimas. Toliau nagrinėjami baigtinės populiacijos sumos kalibruotieji įvertiniai, sukonstruoti taikant skirtingas atstumo funkcijas. Modeliuojant tiriama šių įvertinių kokybė. Sukonstruoti nauji populiacijos kovariacijos kalibruotieji įvertiniai, naudojantys vieną, dvi ir tris svorių sistemas. Šie įvertiniai konstruojami pasirenkant skirtingas kalibravimo lygtis. Remiantis modeliais pagrįstų įvertinių teorija, čia taip pat sukonstruojamas pataisytasis tiesiniu regresiniu modeliu pagrįstas kalibruotasis populiacijos kovariacijos įvertinys. Modeliuojant įvertiniai lyginami tarpusavyje ir su standartiniais atitinkamų parametrų įvertiniais. Kalibruotieji įvertiniai yra kur kas tikslesni, jei tyrimo ir papildomų kintamųjų koreliacija yra... [toliau žr. visą tekstą] / The dissertation analyzes how to incorporate auxiliary information into the estimation of the finite population total, variance, covariance, and how to use it for the stratification of finite populations. First of all, the problem of efficient stratification in the case of skewed population is considered. A new adjusted geometric stratification method is introduced. This method is compared by simulation with the cumulative root frequency method, the geometric method, and the power method. The simulation results show that in most cases considered the power method is the most efficient one, but the adjusted geometric stratification method outperforms all the methods in the case of highly skewed populations. The calibrated estimators of finite population total, constructed using different distance functions, are considered. The quality of such estimators is analyzed by simulation. The new calibrated estimators of the finite population covariance (variance) are derived, using one or more weighting systems. Applying the model calibration theory, we construct here an adjusted linear regression model-assisted and calibrated estimator of the population covariance. The estimators derived are compared by simulation with the standard estimators of the respective parameters. The calibrated estimators of the population covariance are more efficient compared to the straight estimators provided the auxiliary variables are well correlated with the study variables. The problem of estimation... [to full text] Mathematics Kalibravimas Baigtinės populiacijos sluoksniavimas Papildoma informacija Calibration Stratification of the finite population Auxiliary information
8	Statistical estimators of the finite population parameters in the presence of auxiliary information / Baigtinės populiacijos parametrų statistiniai įvertiniai, gauti naudojant papildomą informaciją Pumputis, Dalius 09 March 2009 (has links) The dissertation analyzes how to incorporate auxiliary information into the estimation of the finite population total, variance, covariance, and how to use it for the stratification of finite populations. First of all, the problem of efficient stratification in the case of skewed population is considered. A new adjusted geometric stratification method is introduced. This method is compared by simulation with the cumulative root frequency method, the geometric method, and the power method. The simulation results show that in most cases considered the power method is the most efficient one, but the adjusted geometric stratification method outperforms all the methods in the case of highly skewed populations. The calibrated estimators of finite population total, constructed using different distance functions, are considered. The quality of such estimators is analyzed by simulation. The new calibrated estimators of the finite population covariance (variance) are derived, using one or more weighting systems. Applying the model calibration theory, we construct here an adjusted linear regression model-assisted and calibrated estimator of the population covariance. The estimators derived are compared by simulation with the standard estimators of the respective parameters. The calibrated estimators of the population covariance are more efficient compared to the straight estimators provided the auxiliary variables are well correlated with the study variables. The problem of estimation... [to full text] / Disertacijoje nagrinėjamos papildomos informacijos panaudojimo galimybės konstruojant baigtinės populiacijos sumos, dispersijos ir kovariacijos įvertinius, bei sluoksniuojant baigtines populiacijas. Pirmiausia darbe sprendžiamas populiacijų sluoksniavimo uždavinys, kai tyrimo kintamojo skirstinys yra asimetrinis. Pasiūlomas naujas - pataisytasis geometrinis - sluoksniavimo metodas. Šis metodas modeliuojant lyginamas su trimis kitais žinomais metodais: kvadratinės šaknies iš skirstinio dažnio, geometriniu ir laipsninio sluoksniavimo metodu. Modeliavimo rezultatai rodo, kad vidutiniškai asimetrinėms populiacijoms geriausiai tinka laipsninio sluoksniavimo metodas, o ypač asimetrinėms populiacijoms geriausias yra pataisytasis geometrinis sluoksniavimas. Toliau nagrinėjami baigtinės populiacijos sumos kalibruotieji įvertiniai, sukonstruoti taikant skirtingas atstumo funkcijas. Modeliuojant tiriama šių įvertinių kokybė. Sukonstruoti nauji populiacijos kovariacijos kalibruotieji įvertiniai, naudojantys vieną, dvi ir tris svorių sistemas. Šie įvertiniai konstruojami pasirenkant skirtingas kalibravimo lygtis. Remiantis modeliais pagrįstų įvertinių teorija, čia taip pat sukonstruojamas pataisytasis tiesiniu regresiniu modeliu pagrįstas kalibruotasis populiacijos kovariacijos įvertinys. Modeliuojant įvertiniai lyginami tarpusavyje ir su standartiniais atitinkamų parametrų įvertiniais. Kalibruotieji įvertiniai yra kur kas tikslesni, jei tyrimo ir papildomų kintamųjų koreliacija yra... [toliau žr. visą tekstą] Mathematics Calibration Stratification of the finite population Auxiliary information Kalibravimas Baigtinės populiacijos sluoksniavimas Papildoma informacija

Search results