Spelling suggestions: "subject:"regression"" "subject:"regresiones""
1 |
Bayesian Logistic Regression with Jaro-Winkler String Comparator Scores Provides Sizable Improvement in Probabilistic Record MatchingJann, Dominic 1983- 14 March 2013 (has links)
Record matching is a fundamental and ubiquitous part of today?s society. Anything from typing in a password in order to access your email to connecting existing health records in California with new health records in New York requires matching records together. In general, there are two types of record matching algorithms: deterministic, a more rules-based approach, and probabilistic, a model-based approach. Both types have their advantages and disadvantages. If the amount of data is relatively small, deterministic algorithms yield very high success rates. However, the number of common mistakes, and subsequent rules, becomes astronomically large as the sizes of the datasets increase. This leads to a highly labor-intensive process updating and maintaining the matching algorithm. On the other hand, probabilistic record matching implements a mathematical model that can take into account keying mistakes, does not require as much maintenance and over- head, and provides a probability that two particular entities should be linked. At the same time, as a model, assumptions need to be met, fitness has to be assessed, and predictions can be incorrect. Regardless of the type of algorithm, nearly all utilize a 0/1 field-matching structure, including the Fellegi-Sunter algorithm from 1969. That is to say that either the fields match entirely, or they do not match at all. As a result, typographical errors can get lost and false negatives can result. My research has yielded that using Jaro-Winkler string comparator scores as predictors to a Bayesian logistic regression model in lieu of a restrictive binary structure yields marginal improvement over current methodologies.
|
2 |
Preventative Counselling for Nova Scotia Adolescents: Examining Predictors of its Provision in Several CommunitiesCorbett, Erica L. 12 February 2010 (has links)
This project examined the extent to which Nova Scotian adolescents’ counselling needs are being met with respect to physical, sexual, substance use, and psychosocial health by their family physicians. This was accomplished by assessing how well Nova Scotian physicians provide preventative advice consistent with the Guidelines for Adolescent Preventative Services (GAPS). Analyses were performed using pooled data from surveys carried out in 2003 and 2006. Descriptive analyses, Poisson and logistic regression were used to examine associations of sociodemographic characteristics, need, and the presence of school based health centres (SBHCs) with the provision of advice. Advice was not well provided and appeared to be need-driven. Females were significantly more likely to be provided advice and respondent access to a SBHC increased the likelihood of advice being provided. These results have implications for policy and practice, specifically, ways to refine preventative healthcare services for the province’s adolescents to ensure optimal care.
|
3 |
Critérios de avaliação das exigências em treonina, triptofano, valina e isoleucina para frangos de corte de 22 a 42 dias de idadeDuarte, Karina Ferreira [UNESP] 03 July 2009 (has links) (PDF)
Made available in DSpace on 2014-06-11T19:33:33Z (GMT). No. of bitstreams: 0
Previous issue date: 2009-07-03Bitstream added on 2014-06-13T19:04:39Z : No. of bitstreams: 1
duarte_kf_dr_jabo.pdf: 413050 bytes, checksum: 2d6d9cfb8e49274f4b726645266116bb (MD5) / Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) / Quatro experimentos foram conduzidos no Setor de Avicultura da Faculdade de Ciências Agrárias e Veterinárias – Campus de Jaboticabal- SP, com o objetivo de estabelecer diferentes critérios de avaliação das exigências dos aminoácidos digestíveis treonina, triptofano, valina e isoleucina para frangos de corte de 22 a 42 dias de idade. Em cada experimento foram utilizados 1.920 frangos de corte machos com 22 dias de idade da linhagem “Cobb”, distribuídos em um delineamento inteiramente ao acaso, com seis tratamentos e oito repetições de 40 aves cada. Os tratamentos consistiram no fornecimento de dietas formuladas com base em aminoácidos digestíveis contendo seis diferentes níveis do aminoácido em estudo. Foram avaliados os dados de desempenho (ganho de peso, consumo de ração, conversão alimentar e viabilidade criatória) e as características de carcaça (rendimento de carcaça, de peito, de coxas+sobrecoxas, de dorso e de asas). Para a determinação das exigências do aminoácido estudado, foram utilizados três modelos de regressão: o modelo quadrático, o modelo exponencial e o de retas segmentadas ou broken line (“linha quebrada”), com 90% do quadrado máximo, estabelecendo equações principalmente para ganho de peso e conversão alimentar. Em caso de significância estatística, foi adotado também o procedimento de comparação das médias pelo Teste de Duncan a 5% de probabilidade. Com base no comportamento dos dados os níveis de 0,7642 e 0,7514% de treonina digestível (treonina:lisina digestíveis de 71,19% e 70,00%), determinados pelo modelo broken line e pelo teste de Duncan a 5% de probabilidade respectivamente, promoveram os melhores resultados de conversão alimentar. O triptofano digestível, quando no nível de 0,2255% (triptofano:lisina digestíveis de 21%) indicou melhora na conversão alimentar e os níveis de 0,1825 e 0,1919%... / Four experiments were conducted to establish different criteria for evaluation the requirements in threonine, tryptophan, valine and isoleucine for broilers from 22 to 42 days of age. In each experiment it was used 1,920 male broilers (Cobb) in a completely randomized design, with six treatments and eight replications of 40 birds each. The treatments consisted in supply diets formulated according to the ideal protein concept and digestible amino acids, with six different levels of the related amino acid. The data from performance and characteristics of carcass had been evaluated. It was used three regression models: quadratic, exponential and broken line, stablishing equations mainly to body weight and feed conversion. In case of significant statistics, it was used the procedure for means comparison using Duncan test (0.05%). The threonine levels of 0.7642 and 0.7515% (threonine:lysine of 71.19% and 70.00%) determined by broken line and Duncan test respectively, showed better results for feed conversion. For digestible tryptophan at the level of 0.2255% (tryptophan:lysine of 21%) indicated a increase in feed conversion and the levels of 0.1825 and 0,1919 (tryptophan:lysine of 17.88% and 17.00%) determined by Duncan test and quadratic equation respectively, indicated the best results for carcass yield. Digestible valine at the level of 0.7729% (valine:lysine of 72.00) determined by Duncan test, showed the best results for feed intake and viability. At the level of 0.8802% (valine:lysine of 82.00%) the best results for body weight and feed conversion and at 0.8265% and 0.9339% (valine:lysine of 77.00% and 87.00%) the best back yield according to Duncan test. The isoleucine at 0.7729% (isoleucine:lysine of 72.00%) showed a improvement in feed conversion according to Duncan test (0.05%).
|
4 |
Individualaus namo optimizacijos modelis / Optimization model for a single family housesBacius, Haroldas 20 June 2013 (has links)
Baigiamojo darbo tikslas – apžvelgti individualių gyvenamųjų namų statybos technologijas, koncepcijas bei saulės energijos panaudojimą pastatuose, išanalizuoti keleto pastatytų ar projektuojamų namų rodiklius, investicijas į atskirus pastato elementus bei sukurti optimizacinį modelį, kurį naudojant būtų galima prognozuoti pastato gyvavimo ciklo išlaidas. Darbą sudaro: įvadas, kuriame trumpai aprašoma problematika, darbo tikslas ir keliami uždaviniai šiam tikslui pasiekti; analitinė dalis – individualių namų technologijos, pagrindiniai privalumai, trūkumai ir skirtumai, naujausios pastatų koncepcijos, istorija, reikalavimai, pagrindiniai aspektai, pasyvios ir aktyvios saulės energijos panaudojimo galimybės; metodinė - tiriamoji dalis – analizuojami individualūs gyvenamieji pastatai, investicijos į konstrukcijas ir šildymo, vėdinimo ir oro kondicionavimo sistemas bei pastato eksploatacijos išlaidos, kuriamas optimizacijos modelis naudojant regresinės analizės metodą; išvados ir pasiūlymai; literatūros sąrašas. Darbo apimtis: 76 p. teksto be priedų, 25 iliustr., 11 lent., 71 bibliografiniai šaltiniai. / The objective of this master thesis is to review different single family house building technologies, conceptions and opportunities of using solar power, analyze few different detached houses, investments to different building parts and create optimization model which would allow to forecast lifecycle cost of a project. Thesis contains: introduction, where issues of the topic , purpose and tasks are discussed; analytic part – building technologies of single family houses, main advantages, disadvantages and differences, building conceptions, history, requirements and aspects of different conceptions are presented, opportunities of passive and active solar power usage in detached houses; methodical - research part – evaluation of few different single family houses, analysis of investments to building envelope constructions, heating, ventilation and air conditioning systems and building operation cost, creation of optimization model by using regresion analysis method; conclusions and suggestions; references. Final thesis consists of: 76 p. of text without appendixes, 25 pictures, 11 tables, 71 references.
|
5 |
Critérios de avaliação das exigências em treonina, triptofano, valina e isoleucina para frangos de corte de 22 a 42 dias de idade /Duarte, Karina Ferreira. January 2009 (has links)
Orientador: Otto Mack Junqueira / Banca: Edivaldo Antônio Garcia / Banca: Douglas Emygdio de Faria / Banca: Rosemeire da Silva Filardi / Banca: Silvana Martinez Baraldi Artoni / Resumo: Quatro experimentos foram conduzidos no Setor de Avicultura da Faculdade de Ciências Agrárias e Veterinárias - Campus de Jaboticabal- SP, com o objetivo de estabelecer diferentes critérios de avaliação das exigências dos aminoácidos digestíveis treonina, triptofano, valina e isoleucina para frangos de corte de 22 a 42 dias de idade. Em cada experimento foram utilizados 1.920 frangos de corte machos com 22 dias de idade da linhagem "Cobb", distribuídos em um delineamento inteiramente ao acaso, com seis tratamentos e oito repetições de 40 aves cada. Os tratamentos consistiram no fornecimento de dietas formuladas com base em aminoácidos digestíveis contendo seis diferentes níveis do aminoácido em estudo. Foram avaliados os dados de desempenho (ganho de peso, consumo de ração, conversão alimentar e viabilidade criatória) e as características de carcaça (rendimento de carcaça, de peito, de coxas+sobrecoxas, de dorso e de asas). Para a determinação das exigências do aminoácido estudado, foram utilizados três modelos de regressão: o modelo quadrático, o modelo exponencial e o de retas segmentadas ou broken line ("linha quebrada"), com 90% do quadrado máximo, estabelecendo equações principalmente para ganho de peso e conversão alimentar. Em caso de significância estatística, foi adotado também o procedimento de comparação das médias pelo Teste de Duncan a 5% de probabilidade. Com base no comportamento dos dados os níveis de 0,7642 e 0,7514% de treonina digestível (treonina:lisina digestíveis de 71,19% e 70,00%), determinados pelo modelo broken line e pelo teste de Duncan a 5% de probabilidade respectivamente, promoveram os melhores resultados de conversão alimentar. O triptofano digestível, quando no nível de 0,2255% (triptofano:lisina digestíveis de 21%) indicou melhora na conversão alimentar e os níveis de 0,1825 e 0,1919%... (Resumo completo, clicar acesso eletrônico abaixo) / Abstract: Four experiments were conducted to establish different criteria for evaluation the requirements in threonine, tryptophan, valine and isoleucine for broilers from 22 to 42 days of age. In each experiment it was used 1,920 male broilers (Cobb) in a completely randomized design, with six treatments and eight replications of 40 birds each. The treatments consisted in supply diets formulated according to the ideal protein concept and digestible amino acids, with six different levels of the related amino acid. The data from performance and characteristics of carcass had been evaluated. It was used three regression models: quadratic, exponential and broken line, stablishing equations mainly to body weight and feed conversion. In case of significant statistics, it was used the procedure for means comparison using Duncan test (0.05%). The threonine levels of 0.7642 and 0.7515% (threonine:lysine of 71.19% and 70.00%) determined by broken line and Duncan test respectively, showed better results for feed conversion. For digestible tryptophan at the level of 0.2255% (tryptophan:lysine of 21%) indicated a increase in feed conversion and the levels of 0.1825 and 0,1919 (tryptophan:lysine of 17.88% and 17.00%) determined by Duncan test and quadratic equation respectively, indicated the best results for carcass yield. Digestible valine at the level of 0.7729% (valine:lysine of 72.00) determined by Duncan test, showed the best results for feed intake and viability. At the level of 0.8802% (valine:lysine of 82.00%) the best results for body weight and feed conversion and at 0.8265% and 0.9339% (valine:lysine of 77.00% and 87.00%) the best back yield according to Duncan test. The isoleucine at 0.7729% (isoleucine:lysine of 72.00%) showed a improvement in feed conversion according to Duncan test (0.05%). / Doutor
|
6 |
Reporting - ERP systém / Reporting - ERP SystemPála, Milan January 2013 (has links)
This work deals with creating a module for existing ERP system. Module should be able to produce dataprogress of production, monitor productivity of production and warn if some issue will happen. This work evaluates a processing of a large amount of data and it shows different possibilities how to precalculate data. It also deals with a draft how to predict information from known data.
|
7 |
Analýza vysokoškolského vzdělávání v Česku se zaměřením na absolventy 2001-2017 / Higher education analysis in the Czech Republic with a focus on graduates 2001-2017Pištorová, Markéta January 2018 (has links)
Higher education analysis in the Czech Republic with a focus on graduates 2001-2017 Abstract The aim of this thesis is the analysis of higher education graduates in the Czech Republic between 2001 and 2017, focussing on the type of study program, the field of study, the net duration of the study, the graduation age, the citizenship and the gender. As main data source served the register. Changes in the net duration of the study, depending on the above-mentioned factors, were summarized using a general linear model. The gender dependence on the field and the net duration of the study was modeled using logistic regression. In addition, demographic methods were used for net entry rates and net graduation rates. Theoretical introduction to the topic - Trow's Concept, Bologna Process and Strategic Materials of the Ministry of Education, Youth and Sports - is also included. The overview of higher education is complemented by the historical development of the higher education system in Czechoslovakia, information on the educational structure according to the SLDB 2011 and international comparisons within the OECD countries complements the overall view of higehr education. The main finding of this work is the fact that the amount and the structure of graduates reflect the implementation of the principles of the...
|
8 |
[en] POISSON REGRESSION MULTILEVEL MODEL: AN APLICATION TO SAEBS REPETENCE DATE / [es] MODELO JERÁRQUICO DE REGRESIÓN DE POISSON: UNA APLICACIÓN A LOS DATOS DE REPITENCIA DE SAEB / [pt] MODELO HIERÁRQUICO DE REGRESSÃO POISSON: UMA APLICAÇÃO AOS DADOS DE REPETÊNCIA DO SAEBELIANE DA SILVA CHRISTO 11 July 2001 (has links)
[pt] A maioria das pesquisas sociais e de comportamento
apresenta uma estrutura hierárquica, a qual pode ser
caracterizada pela existência de agrupamento das unidades
de análise.
Nesta dissertação empregou-se modelos multiníveis aos dados
de avaliação educacional do Sistema Nacional de Avaliação
Básica (SAEB). O objetivo foi analisar a Repetência Escolar
dos alunos de 4.a série do ensino fundamental na disciplina
de matemática. Foram feitas regressões de Poisson com a
variável Repetência como resposta e várias variáveis
explicativas associadas ao aluno, professor e escola. Nos
modelos foram considerados dois níveis de hierarquia (nível
1=aluno; nível 2=escola). Os trabalhos foram feitos no
software Mlwin o qual possibilita o uso de dados
multiníveis. / [en] The most of social and behaviour researches show
hierarquical structure.
In this dissertation, the evolution of education data s of
Brazilian National System for the Evolution Education
(SAEB) were used in the multilevel models. The aim was
analysed repetence of students in the primary school in
mathematics subject. Poisson Regressions were made with
Repetence as response variable and a lot of explanatory
variables were linked student, teacher and school. In this
models were considered two hierarchy levels (1-student and
2-school). The procedures were made in the software Mlwin
that allows using multileves data s. / [es] La mayoria de las investigaciones sociales y de
comportamiento presentan una extructura jerárquica, que
puede ser caracterizada por la existencia de agrupamientos
de las unidades de análisis. En esta disertación se emplean
modelos multiníveles en datos de evaluación educacional
del Sistema Nacional de Evaluación Básica (SAEB). EL
objetivo fue analizar la Repitencia Escolar de los alumnos
de 4ª grado de la primaria (4ª série, ensino fundamental)
en la disciplina de matemáticas. Se ajustaron regresiones
de Poison utilizando con la variable Repitencia como
respuesta y varias variables explicativas asociadas al
alumno, profesor y escuela. En los modelos fueron
considerados dos níveles de jerarquía (nivel 1=alumno;
nivel 2=escola), utilizando el el software Mlwin, que es
específico para el uso de datos multiníveles.
|
9 |
[en] STATISTICAL MODEL FOR PREDICTING THE SUPPLY OF HIGHER EDUCATION: 2015-2035 / [es] MODELO ESTADÍSTICO PARA LA PROYECCIÓN DE OFERTA DE EDUCACIÓN SUPERIOR: 2015-2035 / [pt] MODELO ESTATÍSTICO PARA A PROJEÇÃO DA OFERTA DE ENSINO SUPERIOR: 2015-2035CLARENA PATRICIA ARRIETA ARRIETA 03 October 2018 (has links)
[pt] Segundo o INEP/MEC, nos últimos 20 anos, o número de matrículas da educação superior de graduação no Brasil cresceu mais de duas vezes, com uma taxa de crescimento anual verificada a partir de 2001 em torno de 5,7 por cento ao ano. Ainda segundo esta instituição, em 2008 houve o ingresso de 1.505.819
novos estudantes nos cursos presenciais, ao mesmo tempo em que 1.479.318 vagas não foram ocupadas, sendo que 54,6 por cento do total de vagas ofertadas pelo setor privado. Tendo em conta que São Paulo é o maior estado do Brasil, é muito importante que o Ministério da Educação tome conhecimento de como
se dará a dinâmica da oferta de educação superior nos próximos 20 anos para que suas ações (políticas públicas, sobretudo) possam ser realizadas com êxito. O objetivo deste trabalho é aplicar modelagem estatística para estimar a oferta do ensino superior do Estado de São Paulo no período de 2015 a 2035, considerando dados da INEP de educação superior. A motivação para este trabalho é melhorar o planejamento da oferta de curso superior e fazer a replicação do modelo preditivo para outros estados do Brasil. A metodologia usada é modelagem estatística (modelos de regressão linear) e séries temporais
(Holt). Como resultado, têm-se as áreas e/os cursos onde o governo federal deve investir no futuro aprimorando seu planejamento. / [en] According to INEP/MEC, in the last 20 years, the number of
undergraduate higher education enrollments in Brazil has grown more than
twice, with an annual growth rate of 5,7 percent per year since 2001. According
to this institution, in 2008 there were 1.505.819 new students enrolled in
presential courses, while 1.479.318 vacancies were not filled, with 54.6 percent of the
total number of vacancies offered by the private sector. Given that São Paulo is
the largest state in Brazil, it is very important that the Ministry of Education
becomes aware of the dynamics of the offer of higher education in the next 20
years so that its actions (mainly public policies) can be successfully executed.
The objective of this study is to apply statistical modeling to estimate the
offer of higher education in the State of São Paulo in the period from 2015
to 2035, considering data from INEP about higher education. The motivation
for this work is to improve the planning of the offer of higher education and
to replicate the predictive model for other Brazilian states. The methodology
used concerns statistical modeling (linear regression models) and time series
(Holt). As a result, it is obtained the areas and/or courses where the federal
government should invest in the future, improving its planning. / [es] Según el INEP/MEC, en los últimos 20 años, el número de matrículas de educación superior en Brasil creció más de dos veces, con una tasa de crecimiento anual verificada a partir de 2001 en torno al 5,7 por ciento por año. Según esta institución, en 2008 hubo un ingreso de 1.505.819 nuevos estudiantes en los cursos presenciales, al mismo tiempo que 1.479.318 vacantes no fueron ocupadas, siendo el 54,6 por ciento del total de vacantes ofrecidas por el sector privado. Dado que São Paulo es el mayor estado de Brasil, es muy importante que el Ministerio de Educación tome conocimiento de cómo se dará la dinámica de la oferta de educación superior en los próximos 20 años para que sus acciones (políticas públicas, sobre todo) puedan realizarse con éxito. El objetivo de este trabajo es aplicar modelos estadísticos para estimar la oferta de educación superior del Estado de São Paulo en el período de 2015 a 2035, considerando datos de INEP de educación superior. La motivación para este trabajo es mejorar la planificación de la oferta de curso superior y hacer replicación del modelo predictivo para otros estados de Brasil. La metodología utilizada es
modelos estadístico (modelos de regresión lineal) y series tiempo (Holt). Como resultado, se tienen las áreas y/o cursos donde el gobierno federal debe invertir en el futuro mejorando su planificación.
|
10 |
Dolovací modul systému pro dolování z dat na platformě NetBeans / Data Mining Module of a Data Mining System on NetBeans PlatformVýtvar, Jaromír January 2010 (has links)
The aim of this work is to get basic overview about the process of obtaining knowledge from databases - datamining and to analyze the datamining system developed at FIT BUT on the NetBeans platform in order to create a new mining module. We decided to implement a module for mining outliers and to extend existing regression module with multiple linear regression using generalized linear models. New methods using existing methods of Oracle Data Mining.
|
Page generated in 0.5195 seconds