Spelling suggestions: "subject:"auxiliar information""
1 |
Comparison of Schema Matching EvaluationsDo, Hong-Hai, Melnik, Sergey, Rahm, Erhard 12 December 2018 (has links)
Recently, schema matching has found considerable interest in both research and practice. Determining matching components of database or XML schemas is needed in many applications, e.g. for E-business and data integration. Various schema matching systems have been developed to solve the problem semi-automatically. While there have been some evaluations, the overall effectiveness of currently available automatic schema matching systems is largely unclear. This is because the evaluations were conducted in diverse ways making it difficult to assess the effectiveness of each single system, let alone to compare their effectiveness. In this paper we survey recently published schema matching evaluations. For this purpose, we introduce the major criteria that influence the effectiveness of a schema matching approach and use these criteria to compare the various systems. Based on our observations, we discuss the requirements for future match implementations and evaluations.
|
2 |
Jackknife Empirical Likelihood-Based Confidence Intervals for Low Income Proportions with Missing DataYIN, YANAN 18 December 2013 (has links)
The estimation of low income proportions plays an important role in comparisons of poverty in different countries. In most countries, the stability of the society and the development of economics depend on the estimation of low income proportions. An accurate estimation of a low income proportion has a crucial role for the development of the natural economy and the improvement of people's living standards. In this thesis, the Jackknife empirical likelihood method is employed to construct confidence intervals for a low income proportion when the observed data had missing values. Comprehensive simulation studies are conducted to compare the relative performances of two Jackknife empirical likelihood based confidence intervals for low income proportions in terms of coverage probability. A real data example is used to illustrate the application of the proposed methods.
|
3 |
Statistical estimators of the finite population parameters in the case of sample rotation / Baigtinės populiacijos parametrų statistiniai įvertiniai esant imties rotacijaiChadyšas, Viktoras 03 March 2010 (has links)
The dissertation analyzes how to incorporate auxiliary information into the estimation of the finite population total, distribution function and quantile in the case of sample rotation. First of all estimation of the finite population total in the case of sample rotation are considered. We focus on construction of the total estimator for rotated sampling design. Successive sampling procedure using multi-phase sampling design have been developed. The composite ratio type estimator of the total using auxiliary information and its approximate variance is constructed. A simulation study, based on the real population data, is performed and the proposed estimators are compared by a traditional estima-tor for a total. The composite estimators of finite population distribution function, constructed under sampling on two occasions, are considered. Composite regression and ratio type estimators are constructed, using values of the study variable as auxiliary information obtained on the first occasion. The optimal estimators, in the sense of minimal variance, is also obtained. A simulation study, based on the real population data, is performed and the proposed estimators are compared by a traditional estimator for a distribution function. Several quantile estimators by deriving the distribution function estimators with the use of auxiliary information are proposed. Some procedures that may be used to obtain estimates of confidence intervals for quantiles in a finite population (most of... [to full text] / Disertacijoje sudaromi baigtinės populiacijos tyrimo kintamojo sumos, pasiskirstymo funkcijos, kvantilio įvertiniai esant imties rotacijai. Pirmiausia darbe nagrinėjamas baigtinės populiacijos tyrimo kintamojo sumos vertinimas esant imties rotacijai. Sudarytas sudėtinis santykinis sumos įvertinys naudojantis papildomą informaciją žinomą iš ankstesnių imties rinkimų. Modeliavimo rezultatai rodo, kad papildomos informacijos panaudojimas iš jau išrinktos imties gali pagerinti įvertinių tikslumą. Kelių ėmimų schema gali būti taikoma siekiant pagerinti baigtinės populiacijos nepaslinktojo imties planu pagristo sumos įvertinio tikslumą. Taip pat nagrinėjami baigtinės populiacijos tyrimo kintamojo pasiskirstymo funkcijos įvertiniai esant imties rotacijai. Sudaryti keli tyrimo kintamojo baigtinėje populiacijoje pasiskirstymo funkcijos sudėtiniai įvertiniai (regresinis ir santykinis) naudojant dviejų ėmimų schemą. Pasiūlyti optimalūs sudėtiniai pasiskirstymo funkcijos įvertiniai su mažiausia dispersija. Įvertiniai lyginami tarpusavyje atliekant modeliavimą su realiais duomenimis. Baigtinės populiacijos kvantilio ivertiniai sudaromi imant sudėtiniu pasiskirstymo funkcijų įvertinių atvirkštinių funkcijų įvertinius. Taip pat sudaromi pasikliautinojo intervalo įvertiniai kvantiliams, taikant kartotinių imčių metodus. Modeliuojant duomenis lyginamas jų tikslumas, daromos išvados apie pasikliautinojo intervalo įvertinių, sudarytų skirtingais būdais, kvantiliui efektyvumą.
|
4 |
Modelos baseados no planejamento para análise de populações finitas / Design-based models for the analysis of finite populationsGonzález Garcia, Luz Mery 23 April 2008 (has links)
Estudamos o problema de obtenção de estimadores/preditores ótimos para combinações lineares de respostas coletadas de uma população finita por meio de amostragem aleatória simples. Nesse contexto, estendemos o modelo misto para populações finitas proposto por Stanek, Singer & Lencina (2004, Journal of Statistical Planning and Inference) para casos em que se incluem erros de medida (endógenos e exógenos) e informação auxiliar. Admitindo que as variâncias são conhecidas, mostramos que os estimadores/preditores propostos têm erro quadrático médio menor dentro da classe dos estimadores lineares não viciados. Por meio de estudos de simulação, comparamos o desempenho desses estimadores/preditores empíricos, i.e., obtidos com a substituição das componentes de variância por estimativas, com aquele de competidores tradicionais. Também, estendemos esses modelos para análise de estudos com estrutura do tipo pré-teste/pós-teste. Também por intermédio de simulação, comparamos o desempenho dos estimadores empíricos com o desempenho do estimador obtido por meio de técnicas clássicas de análise de medidas repetidas e com o desempenho do estimador obtido via análise de covariância por meio de mínimos quadrados, concluindo que os estimadores/ preditores empíricos apresentaram um menor erro quadrático médio e menor vício. Em geral, sugerimos o emprego dos estimadores/preditores empíricos propostos para dados com distribuição assimétrica ou amostras pequenas. / We consider optimal estimation of finite population parameters with data obtained via simple random samples. In this context, we extend a finite population mixed model proposed by Stanek, Singer & Lencina (2004, Journal of Statistical Planning and Inference) by including measurement errors (endogenous or exogenous) and auxiliary information. Assuming that variance components are known, we show that the proposed estimators/predictors have the smallest mean squared error in the class of unbiased estimators. Using simulation studies, we compare the performance of the empirical estimators/predictors obtained by replacing variance components with estimates with the performance of a traditional estimator. We also extend the finite population mixed model to data obtained via pretest-posttest designs. Through simulation studies, we compare the performance of the empirical estimator of the difference in gain between groups with the performance of the usual repeated measures estimator and with the performance of the usual analysis of covariance estimator obtained via ordinary least squares. The empirical estimator has smaller mean squared error and bias than the alternative estimators under consideration. In general, we recommend the use of the proposed estimators/ predictors for either asymmetric response distributions or small samples.
|
5 |
Modelos baseados no planejamento para análise de populações finitas / Design-based models for the analysis of finite populationsLuz Mery González Garcia 23 April 2008 (has links)
Estudamos o problema de obtenção de estimadores/preditores ótimos para combinações lineares de respostas coletadas de uma população finita por meio de amostragem aleatória simples. Nesse contexto, estendemos o modelo misto para populações finitas proposto por Stanek, Singer & Lencina (2004, Journal of Statistical Planning and Inference) para casos em que se incluem erros de medida (endógenos e exógenos) e informação auxiliar. Admitindo que as variâncias são conhecidas, mostramos que os estimadores/preditores propostos têm erro quadrático médio menor dentro da classe dos estimadores lineares não viciados. Por meio de estudos de simulação, comparamos o desempenho desses estimadores/preditores empíricos, i.e., obtidos com a substituição das componentes de variância por estimativas, com aquele de competidores tradicionais. Também, estendemos esses modelos para análise de estudos com estrutura do tipo pré-teste/pós-teste. Também por intermédio de simulação, comparamos o desempenho dos estimadores empíricos com o desempenho do estimador obtido por meio de técnicas clássicas de análise de medidas repetidas e com o desempenho do estimador obtido via análise de covariância por meio de mínimos quadrados, concluindo que os estimadores/ preditores empíricos apresentaram um menor erro quadrático médio e menor vício. Em geral, sugerimos o emprego dos estimadores/preditores empíricos propostos para dados com distribuição assimétrica ou amostras pequenas. / We consider optimal estimation of finite population parameters with data obtained via simple random samples. In this context, we extend a finite population mixed model proposed by Stanek, Singer & Lencina (2004, Journal of Statistical Planning and Inference) by including measurement errors (endogenous or exogenous) and auxiliary information. Assuming that variance components are known, we show that the proposed estimators/predictors have the smallest mean squared error in the class of unbiased estimators. Using simulation studies, we compare the performance of the empirical estimators/predictors obtained by replacing variance components with estimates with the performance of a traditional estimator. We also extend the finite population mixed model to data obtained via pretest-posttest designs. Through simulation studies, we compare the performance of the empirical estimator of the difference in gain between groups with the performance of the usual repeated measures estimator and with the performance of the usual analysis of covariance estimator obtained via ordinary least squares. The empirical estimator has smaller mean squared error and bias than the alternative estimators under consideration. In general, we recommend the use of the proposed estimators/ predictors for either asymmetric response distributions or small samples.
|
6 |
Estimation efficace en présence de non-réponse dans les enquêtesGao, Yimeng 03 1900 (has links)
No description available.
|
7 |
Baigtinės populiacijos parametrų statistiniai įvertiniai, gauti naudojant papildomą informaciją / Statistical estimators of the finite population parameters in the presence of auxiliary informationPumputis, Dalius 09 March 2009 (has links)
Disertacijoje nagrinėjamos papildomos informacijos panaudojimo
galimybės konstruojant baigtinės populiacijos sumos, dispersijos
ir kovariacijos įvertinius, bei sluoksniuojant baigtines
populiacijas.
Pirmiausia darbe sprendžiamas populiacijų sluoksniavimo uždavinys,
kai tyrimo kintamojo skirstinys yra asimetrinis. Pasiūlomas naujas
- pataisytasis geometrinis - sluoksniavimo metodas. Šis metodas
modeliuojant lyginamas su trimis kitais žinomais metodais:
kvadratinės šaknies iš skirstinio dažnio, geometriniu ir
laipsninio sluoksniavimo metodu. Modeliavimo rezultatai rodo, kad
vidutiniškai asimetrinėms populiacijoms geriausiai tinka
laipsninio sluoksniavimo metodas, o ypač asimetrinėms
populiacijoms geriausias yra pataisytasis geometrinis
sluoksniavimas.
Toliau nagrinėjami baigtinės populiacijos sumos kalibruotieji
įvertiniai, sukonstruoti taikant skirtingas atstumo funkcijas.
Modeliuojant tiriama šių įvertinių kokybė. Sukonstruoti nauji
populiacijos kovariacijos kalibruotieji įvertiniai, naudojantys
vieną, dvi ir tris svorių sistemas. Šie įvertiniai konstruojami
pasirenkant skirtingas kalibravimo lygtis. Remiantis modeliais
pagrįstų įvertinių teorija, čia taip pat sukonstruojamas
pataisytasis tiesiniu regresiniu modeliu pagrįstas kalibruotasis
populiacijos kovariacijos įvertinys. Modeliuojant įvertiniai
lyginami tarpusavyje ir su standartiniais atitinkamų parametrų
įvertiniais. Kalibruotieji įvertiniai yra kur kas tikslesni, jei
tyrimo ir papildomų kintamųjų koreliacija yra... [toliau žr. visą tekstą] / The dissertation analyzes how to incorporate auxiliary information
into the estimation of the finite population total, variance,
covariance, and how to use it for the stratification of finite
populations.
First of all, the problem of efficient stratification in the case
of skewed population is considered. A new adjusted geometric
stratification method is introduced. This method is compared by
simulation with the cumulative root frequency method, the
geometric method, and the power method. The simulation results
show that in most cases considered the power method is the most
efficient one, but the adjusted geometric stratification method
outperforms all the methods in the case of highly skewed
populations.
The calibrated estimators of finite population total, constructed
using different distance functions, are considered. The quality of
such estimators is analyzed by simulation. The new calibrated
estimators of the finite population covariance (variance) are
derived, using one or more weighting systems. Applying the model
calibration theory, we construct here an adjusted linear
regression model-assisted and calibrated estimator of the
population covariance. The estimators derived are compared by
simulation with the standard estimators of the respective
parameters. The calibrated estimators of the population covariance
are more efficient compared to the straight estimators provided
the auxiliary variables are well correlated with the study
variables. The problem of estimation... [to full text]
|
8 |
Statistical estimators of the finite population parameters in the presence of auxiliary information / Baigtinės populiacijos parametrų statistiniai įvertiniai, gauti naudojant papildomą informacijąPumputis, Dalius 09 March 2009 (has links)
The dissertation analyzes how to incorporate auxiliary information
into the estimation of the finite population total, variance,
covariance, and how to use it for the stratification of finite
populations.
First of all, the problem of efficient stratification in the case
of skewed population is considered. A new adjusted geometric
stratification method is introduced. This method is compared by
simulation with the cumulative root frequency method, the
geometric method, and the power method. The simulation results
show that in most cases considered the power method is the most
efficient one, but the adjusted geometric stratification method
outperforms all the methods in the case of highly skewed
populations.
The calibrated estimators of finite population total, constructed
using different distance functions, are considered. The quality of
such estimators is analyzed by simulation. The new calibrated
estimators of the finite population covariance (variance) are
derived, using one or more weighting systems. Applying the model
calibration theory, we construct here an adjusted linear
regression model-assisted and calibrated estimator of the
population covariance. The estimators derived are compared by
simulation with the standard estimators of the respective
parameters. The calibrated estimators of the population covariance
are more efficient compared to the straight estimators provided
the auxiliary variables are well correlated with the study
variables. The problem of estimation... [to full text] / Disertacijoje nagrinėjamos papildomos informacijos panaudojimo
galimybės konstruojant baigtinės populiacijos sumos, dispersijos
ir kovariacijos įvertinius, bei sluoksniuojant baigtines
populiacijas.
Pirmiausia darbe sprendžiamas populiacijų sluoksniavimo uždavinys,
kai tyrimo kintamojo skirstinys yra asimetrinis. Pasiūlomas naujas
- pataisytasis geometrinis - sluoksniavimo metodas. Šis metodas
modeliuojant lyginamas su trimis kitais žinomais metodais:
kvadratinės šaknies iš skirstinio dažnio, geometriniu ir
laipsninio sluoksniavimo metodu. Modeliavimo rezultatai rodo, kad
vidutiniškai asimetrinėms populiacijoms geriausiai tinka
laipsninio sluoksniavimo metodas, o ypač asimetrinėms
populiacijoms geriausias yra pataisytasis geometrinis
sluoksniavimas.
Toliau nagrinėjami baigtinės populiacijos sumos kalibruotieji
įvertiniai, sukonstruoti taikant skirtingas atstumo funkcijas.
Modeliuojant tiriama šių įvertinių kokybė. Sukonstruoti nauji
populiacijos kovariacijos kalibruotieji įvertiniai, naudojantys
vieną, dvi ir tris svorių sistemas. Šie įvertiniai konstruojami
pasirenkant skirtingas kalibravimo lygtis. Remiantis modeliais
pagrįstų įvertinių teorija, čia taip pat sukonstruojamas
pataisytasis tiesiniu regresiniu modeliu pagrįstas kalibruotasis
populiacijos kovariacijos įvertinys. Modeliuojant įvertiniai
lyginami tarpusavyje ir su standartiniais atitinkamų parametrų
įvertiniais. Kalibruotieji įvertiniai yra kur kas tikslesni, jei
tyrimo ir papildomų kintamųjų koreliacija yra... [toliau žr. visą tekstą]
|
Page generated in 0.1235 seconds