Global ETD Search

21	Development and application of statistical genetic methods to genomic prediction in Coffea canephora / Desenvolvimento e aplicação de métodos genético-estatísticos para predição genômica em Coffea canephora Ferrão, Luís Felipe Ventorim 07 April 2017 Genomic selection (GS) works by simultaneously selecting hundreds or thousands of markers that cover the genome, so that the majority of quantitative trait loci (QTL) are in linkage disequilibrium (LD) with these markers. Thus, markers associated with QTLs, regardless of the significance of their effects, are used to explain the genetic variation of a trait. Simulation and empirical results have shown that genomic prediction is accurate enough to support breeding programs, compared with traditional phenotypic analysis. To this end, an important step is the use of statistical genetic models able to predict phenotypic performance for important traits. Although some crops have benefited from this approach, studies in the genus Coffea are still in their infancy. Until now, there have been no studies of how predictive models perform across populations and environments, or of their performance for different complex traits. Therefore, the main objective of this research is to investigate important aspects of statistical modeling in order to enable a more comprehensive understanding of what makes a prediction model robust and, as a consequence, to apply it in practical breeding programs. Real data from two experimental populations of Coffea canephora, evaluated in two Brazilian locations, and SNPs identified by Genotyping-by-Sequencing (GBS) were used to investigate the genotype-phenotype relationship. 
In terms of statistical modeling, two classes of models were considered: i) mixed models, based on a genomic relationship matrix that defines the (co)variance between relatives (the GBLUP model); and ii) multilocus association models, in which thousands of markers are modeled simultaneously and the marker effects are summed to compute the genetic merit of individuals. The two approaches are treated in separate chapters. The chapter entitled "A mixed model to multiplicative harvest-location trial applied to genomic prediction in Coffea canephora" expands the traditional GBLUP to accommodate interaction effects (Genotype × Location and Genotype × Harvest). To this end, we tested appropriate (co)variance structures for modeling heterogeneity and correlation of genetic and residual effects. The proposed model, called MET.GBLUP, showed the best goodness of fit and the highest predictive ability compared with the other methods. The following chapter, entitled "Comparison of statistical methods and reliability of genomic prediction in Coffea canephora population", addresses different modeling assumptions within multilocus association models. The usual assumption that marker effects are drawn from a normal distribution was relaxed in order to look for a possible dependency between predictive performance and trait, conditional on the genetic architecture. Although the competing models are conceptually different, only minimal differences in predictive accuracy were observed in the comparative analysis. In terms of computational demand, the Bayesian models required longer analysis times. The results discussed in both chapters support the potential of genomic selection to reshape traditional breeding programs. In practice, compared with traditional phenotypic evaluation, it is expected to accelerate the breeding cycle in recurrent selection programs, maintain genetic diversity and increase the genetic gain per unit of time. 
/ Seleção Genômica pode ser definida como a seleção simultânea de centenas ou milhares de marcadores moleculares, os quais cobrem o genoma de forma densa, de modo que locos de caracteres quantitativos (QTL) estejam em desequilíbrio de ligação com uma parte desses marcadores. Assim, marcadores associados a QTLs, independentemente da significância dos seus efeitos, são utilizados na predição do mérito genético de um indivíduo para um determinado caráter. Simulações e estudos empíricos mostram que essa abordagem apresenta acurácia suficiente para garantir o sucesso em programas de melhoramento genético, quando comparado com os métodos tradicionais de seleção fenotípica. Para tanto, uma das etapas requeridas é o uso de modelos genético-estatísticos que contemplem a predição fidedigna da performance fenotípica da população sob estudo. Apesar da relevância, o número de estudos no gênero Coffea ainda são reduzidos, não havendo relatos sobre o desempenho desses modelos em diferentes populações e ambientes, ou mesmo, a sua performance para diferentes caracteres agronômicos do cafeeiro. Dessa forma, este estudo tem como finalidade investigar aspectos relacionados a modelagem estatística, a fim de compreender quais são os fatores que tornam os modelos preditivos mais acurados e utiliza-los em programas aplicados de melhoramento genético. Dados reais de duas populações de seleção recorrente de Coffea canephora, avaliados em dois ambientes e genotipados pela tecnologia de genotipagem por sequenciamento (GBS, do inglês Genotyping-by-Sequencing) foram considerados para o estudo da relação entre genótipo-fenótipo. 
Em termos de modelagem estatística, duas classes de modelos foram considerados: i) Modelos mistos, baseados no cálculo da matriz de parentesco realizado como medida de (co)variância genética entre indivíduos (modelo GBLUP); e ii) Modelos de associação multilocos, no qual milhares de marcadores moleculares são modelados simultaneamente e os efeitos estimados dos marcadores são somados, a fim de computar o mérito genético dos indivíduos. Ambas estratégias foram descritas em capítulos separados no formato de artigo científico. O capítulo intitulado \"A mixed model to multiplicative harvest-location trial applied to genomic prediction in Coffea canephora\" abordou uma expansão do modelo GBLUP de modo a contemplar efeitos de interações entre Genótipo × Colheita e Genótipo × Local. Para tanto, apropriadas estruturas de variância e covariância para modelagem da heterogeneidade e correlação dos efeitos genéticos e residuais foram testadas. O modelo proposto, denominado de MET.GBLUP, apresentou melhor qualidade de ajuste e capacidade preditiva, quando comparado com outros métodos. O capítulo em sequência, intitulado de \"Comparison of statistical methods and reliability of genomic prediction in Coffea canephora population\" investigou a capacidade preditiva de diferentes modelos de associação multilocos. A suposição usual de efeitos dos marcadores amostrados de uma distribuição normal foi relaxada, a fim de testar métodos alternativos que pudessem melhor descrever o fenômeno biológico e, consequentemente, resultar em maior capacidade preditiva. Embora os modelos testados sejam conceitualmente distintos, diferenças mínimas nos valores de acurácia de predição foram observadas nos cenários testados. Em termos de demanda computacional, modelos Bayesianos apresentaram maior tempo de análise. Os resultados descritos em ambos os capítulos apoiam o potencial do uso da seleção genômica em programas de melhoramento assistido de café. 
Em termos práticos, comparado com métodos tradicionais de avaliação fenotípica, é esperado que a implementação desses conceitos em programas de seleção recorrente possa acelerar o ciclo de melhoramento, manter a diversidade genética e, sobretudo, aumentar o ganho genético por unidade de tempo. Keywords: Café; Coffee; Genomic selection; Linear models; Marcadores moleculares; Modelos lineares; Molecular markers; Seleção genômica
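The two model classes named in this abstract can be illustrated with a minimal Python sketch. It assumes genotypes coded 0/1/2 and a VanRaden-style construction of the genomic relationship matrix; the abstract does not say which construction the thesis used, and the function names, genotypes and marker effects below are hypothetical toy values.

```python
import numpy as np


def genomic_relationship_matrix(genotypes):
    """Return a VanRaden-style G matrix from 0/1/2 genotype codes.

    Rows are individuals, columns are markers. This construction is an
    assumption made for illustration, not taken from the thesis.
    """
    genotypes = np.asarray(genotypes, dtype=float)
    allele_freq = genotypes.mean(axis=0) / 2.0        # frequency of the counted allele
    centered = genotypes - 2.0 * allele_freq          # centre each marker column
    scale = 2.0 * np.sum(allele_freq * (1.0 - allele_freq))
    return centered @ centered.T / scale


def genetic_merit(genotypes, marker_effects):
    """Multilocus view: sum marker effects over loci for each individual."""
    genotypes = np.asarray(genotypes, dtype=float)
    centered = genotypes - genotypes.mean(axis=0)
    return centered @ np.asarray(marker_effects, dtype=float)


# Hypothetical toy data: 4 individuals genotyped at 5 SNPs.
toy_genotypes = np.array([
    [0, 1, 2, 1, 0],
    [1, 1, 0, 2, 0],
    [2, 0, 1, 1, 1],
    [1, 2, 1, 0, 1],
])
toy_effects = np.array([0.5, -0.2, 0.1, 0.0, 0.3])  # made-up marker effects

G = genomic_relationship_matrix(toy_genotypes)
merit = genetic_merit(toy_genotypes, toy_effects)
print(np.round(G, 2))
print(np.round(merit, 2))
```

In a GBLUP analysis the matrix `G` would define the covariance of the random genetic effects, whereas the multilocus models estimate the marker effects first and then sum them as in `genetic_merit`.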
22	Predição genômica de híbridos de milho para caracteres de arquitetura oligogênica e sob diferentes parâmetros de penalização e correção de fenótipo / Genomic prediction of maize hybrids for traits with oligogenic architecture and under distinct shrinkage factors and phenotypic correction Galli, Giovanni 29 June 2016 O alcance de altas produtividades em milho (Zea mays L.) depende do desenvolvimento de híbridos, o principal produto explorado nos programas de melhoramento. O sucesso na obtenção deste tipo de cultivar é conseguido com extensivo cruzamento de linhagens, seguido de avaliações para identificação das combinações de maior potencial. Geralmente, o melhorista tem à sua disponibilidade grande número de linhagens, possibilitando a realização de centenas a milhares de cruzamentos distintos, dos quais apenas uma pequena quantidade pode ser avaliada experimentalmente devido a limitação de tempo e recursos. Com o advento da Seleção Genômica (GS) tornou-se possível predizer o comportamento destes indivíduos não avaliados com base em seu genoma. No decorrer do processo de consolidação da GS várias metodologias foram propostas. A aptidão destas em predizer desempenhos fenotípicos é dependente da sua capacidade de acomodar a arquitetura genética das características e lidar com a multicolinearidade das matrizes genômicas. Neste sentido, métodos baseados em modelos mistos podem apresentar menor eficiência na predição de características oligogênicas devido à não capacidade de representar a distribuição real do efeito dos QTL. Além disso, a regularização das predições na presença de multicolinearidade é realizada por meio de um parâmetro de penalização (λ), o qual pode ser estimado de várias formas e consequentemente modificar a acurácia dos modelos. Além do aprimoramento dos métodos, outro aspecto importante é o procedimento de correção dos dados fenotípicos previamente à GS, o qual não é consenso na comunidade científica. 
Diante do exposto, este trabalho objetivou: verificar o efeito das formas de obtenção do λ (via REML na GS e pela herdabilidade da característica) e da correção do fenótipo (valor genotípico e média ajustada) na GS e avaliar a eficiência da modelagem diferencial de QTL de maior efeito na capacidade preditiva da metodologia G-BLUP, comparando-a ao LASSO Bayesiano, BayesB e G-BLUP convencional. Para isso foram utilizadas informações de híbridos simples de milho tropical avaliados em cinco locais para produtividade de grãos, altura de planta e espiga no ano de 2015. Os dados genômicos foram obtidos com a plataforma Affymetrix® Axiom® Maize Genotyping Array de 616.201 SNPs. Foram estudados diferentes cenários de GS considerando os fatores supracitados, sendo estes comparados entre si por suas capacidades preditivas e seletivas. Os resultados obtidos indicam que a correção do fenótipo e a forma de estimação de λ afetam a capacidade preditiva. O uso de valores genotípicos como correção dos fenótipos e estimação de λ via REML apresentaram os melhores resultados. Foi também observado que a modelagem de SNPs de maior efeito como fator fixo aumenta discretamente a capacidade preditiva da metodologia G-BLUP para as características oligogênicas avaliadas (altura de planta e espiga), sendo indicado o uso do G-BLUP convencional. Complementarmente, observou-se que a GS apresentou modesta eficiência na seleção de híbridos superiores sob intensidades moderadas. Entretanto, a sua alta capacidade de selecionar sob baixa intensidade pode ser amplamente explorada nos programas de melhoramento de milho visando a seleção precoce direta. / The achievement of high yield in maize (Zea mays L.) relies on the development of hybrids, which is the main product of breeding programs. The success in obtaining this kind of cultivar is achieved through extensive crossing of inbred lines followed by field trials to identify the combinations with greatest potential. 
Generally, breeders have a large number of inbred lines at hand and can perform hundreds to thousands of different crosses, of which only a small portion can be evaluated experimentally because of time and resource limitations. Genomic Selection (GS) has made it possible to predict the phenotypes of unevaluated individuals based on their genomes. Many approaches have been proposed as GS became established. The ability of these approaches to predict phenotypic performance depends on their capacity to accommodate the genetic architecture of the traits and to deal with the multicollinearity of the genomic matrices. Hence, methods based on mixed model equations may predict oligogenic traits less efficiently because they cannot represent the real distribution of QTL effects. Moreover, predictions are regularized in the presence of multicollinearity through a shrinkage factor (λ), which can be estimated in a number of ways and may affect the accuracy of the models. In addition to the improvement of the models, the correction of the phenotypes used in the predictions is also important, and there is no consensus on it among researchers. Based on these facts, this study aimed to assess the effect on GS of the estimation of λ (by REML in the GS model and from the heritability of the traits) and of the correction of the phenotype (genotypic value and adjusted mean). It also aimed to evaluate the effect of differential modeling of major markers on the prediction accuracy of G-BLUP, comparing it to Bayesian LASSO, BayesB and ordinary G-BLUP. To those ends, tropical maize single-crosses evaluated at five sites for grain yield, plant height and ear height in 2015 were used. The genomic data were obtained with the Affymetrix® Axiom® Maize Genotyping Array of 616,201 SNPs. Distinct GS scenarios were studied considering the aforementioned factors and were compared by their prediction and selection accuracy. 
The results suggest that the correction of the phenotype and the way λ is estimated do affect prediction accuracies. Using genotypic values as the phenotype correction and estimating λ by REML gave the best results. It was also observed that modeling major SNPs as fixed effects improved the prediction accuracy of G-BLUP only slightly for the oligogenic traits evaluated (plant and ear height). Therefore, ordinary G-BLUP should be the method of choice to predict these traits. Additionally, GS showed modest efficiency for selecting superior hybrids under moderate intensities. However, its high effectiveness at selecting under low intensities might be exploited in maize breeding programs for early direct selection. Keywords: Arquitetura genética; BLUE; BLUP; Capacidade preditiva; Genetic architecture; Genomic selection; Prediction accuracy; Seleção genômica
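Of the two routes to λ compared in this abstract, the heritability route is easy to sketch in Python. The example assumes the standard relation λ = (1 − h²)/h² on the scale of individuals' genomic values, rescaled by 2Σp(1 − p) when the penalty is applied to marker effects; the abstract does not give the exact formula used in the thesis. The REML route is not shown, and the function names and toy data are hypothetical.

```python
import numpy as np


def shrinkage_from_heritability(h2, genotypes=None):
    """Return lambda = residual variance / genetic variance implied by h2.

    Without genotypes the value is on the scale of individuals' genomic
    values (G-BLUP). With 0/1/2 genotypes it is rescaled to the marker
    scale by 2 * sum(p * (1 - p)), as in ridge-regression BLUP.
    """
    if not 0.0 < h2 < 1.0:
        raise ValueError("h2 must lie strictly between 0 and 1")
    lam = (1.0 - h2) / h2
    if genotypes is None:
        return lam
    allele_freq = np.asarray(genotypes, dtype=float).mean(axis=0) / 2.0
    return lam * 2.0 * np.sum(allele_freq * (1.0 - allele_freq))


def ridge_marker_effects(genotypes, phenotypes, lam):
    """Solve (Z'Z + lambda * I) u = Z'y for the marker effects u."""
    z = np.asarray(genotypes, dtype=float)
    z = z - z.mean(axis=0)                      # centre marker columns
    y = np.asarray(phenotypes, dtype=float)
    y = y - y.mean()                            # remove the overall mean
    lhs = z.T @ z + lam * np.eye(z.shape[1])
    return np.linalg.solve(lhs, z.T @ y)


# Hypothetical toy data: 6 hybrids, 4 SNPs, one corrected phenotype each.
toy_genotypes = np.array([
    [0, 1, 2, 0],
    [1, 1, 0, 2],
    [2, 0, 1, 1],
    [1, 2, 1, 0],
    [0, 2, 2, 1],
    [2, 0, 0, 2],
])
toy_phenotypes = np.array([10.2, 9.1, 11.5, 10.0, 9.8, 11.0])

lam = shrinkage_from_heritability(0.4, toy_genotypes)
effects = ridge_marker_effects(toy_genotypes, toy_phenotypes, lam)
print(lam, np.round(effects, 3))
```

A lower heritability gives a larger λ and therefore stronger shrinkage of the marker effects toward zero, which is one way the choice of λ can change prediction accuracy.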
23	Genetics of disease resistance: application to bovine tuberculosis Tsairidou, Smaragda January 2016 Bovine tuberculosis (bTB) is a disease of significant economic importance, being one of the most persistent animal health problems in the UK and the Republic of Ireland and increasingly a public health concern, especially in the developing world. Limitations of the currently available diagnostic and control methods, along with our incomplete understanding of bTB transmission, prevent successful eradication. This thesis addresses the development of a complementary control strategy that is based on animal genetics and allows us to identify animals genetically predisposed to be more resistant to disease. Specifically, the aim of my PhD project is to investigate the genetic architecture of resistance to bTB and demonstrate the feasibility of whole-genome prediction for the control of bTB in cattle. Genomic selection for disease resistance in livestock populations will assist with reducing herd-level incidence and the severity of potential outbreaks. The first objective was to explore the estimation of breeding values for bTB resistance in UK dairy cattle, and to test these genomic predictions for situations in which disease phenotypes are not available on selection candidates. Using dense SNP chip data, the results of Chapter 2 demonstrate that genomic selection for bTB resistance is feasible (h² = 0.23 (SE = 0.06)) and that bTB resistance can be predicted using genetic markers, with an estimated prediction accuracy of r(g, ĝ) = 0.33 in these data. It was shown that genotypes help to predict disease state (AUC ≈ 0.58) and that animals lacking bTB phenotypes can be selected based on their genotypes. In Chapter 3, a novel approach is presented to identify loci displaying heterozygote (dis)advantage associated with resistance to M. 
bovis, hypothesising underlying non-additive genetic variation, and these results are compared with those obtained from standard genome scans. A marker was identified suggesting an association between locus heterozygosity and increased susceptibility to bTB, i.e. a heterozygote disadvantage, with heterozygotes significantly more frequent among the cases than among the controls (χ² = 11.50, p < 0.001). Secondly, this thesis focused on conducting a meta-analysis of two dairy cattle populations with bTB phenotypes and SNP chip genotypes, identifying genomic regions underlying bTB resistance and testing genomic predictions by means of cross-validation. In Chapter 4, exploration of the genetic architecture of the trait revealed that bTB resistance is a moderately polygenic, complex trait, with clusters of causal variants spread across a few major chromosomes collectively controlling the trait. A region putatively associated with bTB resistance was identified on chromosome 6, and this chromosome as a whole was shown to contribute a major proportion (hc² = 0.051) of the observed variation in this dataset. Genomic prediction for bTB was shown to be feasible even when only distantly related populations are combined (r(g, ĝ) = 0.33 (SE = 0.05)), with the chromosomal heritability results suggesting that the accuracy arises from the SNPs capturing linkage disequilibrium between markers and QTL, as well as additive relationships between animals (~80% of the estimated genomic h² is due to relatedness). To extend the analysis, in Chapter 5, high-density genotypes were inferred by means of genotype imputation, anticipating that these analyses would allow genomic regions associated with bTB resistance to be located more precisely and would increase the prediction accuracy. Genotype imputation was successful; however, using all imputed genotypes added little information. 
The limiting factors were found to be the number of animals and the trait definitions rather than the density of genotypes. Thirdly, a quantitative genetic analysis of actual Single Intradermal Comparative Cervical Test (SICCT) values collected during bTB herd testing was conducted to investigate whether selection for bTB resistance is likely to have an impact on the SICCT diagnostic test. This analysis demonstrated that the SICCT has a negligibly low heritability (h² = 0.0104 (SE = 0.0032)) and that any effect on the responsiveness to the test is likely to be small. In conclusion, breeding for disease resistance in livestock is feasible and we can predict the risk of bTB in cattle using genomic information. Further, putative QTLs associated with bTB resistance were identified, and exploration of the genetic architecture of bTB resistance revealed a moderately polygenic trait. These results suggest that, provided larger datasets with more phenotyped and genotyped animals become available, we can breed for bTB resistance and implement genomic selection in breeding programmes aiming to improve the disease status and overall health of the livestock population. With genomics, this selection can continue as the epidemic declines. 636.2
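The AUC quoted in this abstract measures how well genomic predictions rank infected animals above healthy ones. A minimal Python sketch of that statistic follows, using the pairwise (Mann-Whitney) formulation; the abstract does not describe how the thesis computed it, and the function name and risk scores are hypothetical.

```python
def auc_from_scores(case_scores, control_scores):
    """Probability that a random case scores higher than a random control.

    Ties count as one half (Mann-Whitney formulation of the AUC).
    """
    if len(case_scores) == 0 or len(control_scores) == 0:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical predicted genetic risk for infected (case) and healthy (control) animals.
case_risk = [0.6, 0.4, 0.7]
control_risk = [0.5, 0.3]
print(auc_from_scores(case_risk, control_risk))
```

In this toy example five of the six case-control pairs are ranked correctly, so the AUC is 5/6. A value of 0.5 corresponds to random ranking, which puts the reported AUC ≈ 0.58 in context as a modest but real improvement over chance.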
24	Parametric and semi-parametric models for predicting genomic breeding values of complex traits in Nelore cattle / Modelos estatísticos paramétricos e semiparamétricos para a predição de valores genéticos genômicos de características complexas em bovinos da raça Nelore Espigolan, Rafael [UNESP] 23 February 2017 Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) / O melhoramento genético animal visa melhorar a produtividade econômica das futuras gerações de espécies domésticas por meio da seleção. A maioria das características de interesse econômico na pecuária é de expressão quantitativa e complexa, isto é, são influenciadas por vários genes e afetadas por fatores ambientais. As análises estatísticas de informações de fenótipo e pedigree permite estimar os valores genéticos dos candidatos à seleção com base no modelo infinitesimal. Uma grande quantidade de dados genômicos está atualmente disponível para a identificação e seleção de indivíduos geneticamente superiores com o potencial de aumentar a acurácia de predição dos valores genéticos e, portanto, a eficiência dos programas de melhoramento genético animal. 
Vários estudos têm sido conduzidos com o objetivo de identificar metodologias apropriadas para raças e características específicas, o que resultará em estimativas de valores genéticos genômicos (GEBVs) mais acurados. Portanto, o objetivo deste estudo foi verificar a possibilidade de aplicação de modelos semiparamétricos para a seleção genômica e comparar a habilidade de predição com os modelos paramétricos para dados reais (características de carcaça, qualidade da carne, crescimento e reprodutiva) e simulados. As informações fenotípicas e de pedigree utilizadas foram fornecidas por onze fazendas pertencentes a quatro programas de melhoramento genético animal. Para as características de carcaça e qualidade da carne, o banco de dados continha 3.643 registros para área de olho de lombo (REA), 3.619 registros para espessura de gordura (BFT), 3.670 registros para maciez da carne (TEN) e 3.378 observações para peso de carcaça quente (HCW). Um total de 825.364 registros para peso ao sobreano (YW) e 166.398 para idade ao primeiro parto (AFC) foi utilizado para as características de crescimento e reprodutiva. Genótipos de 2.710, 2.656, 2.749, 2.495, 4.455 e 1.760 animais para REA, BFT, TEN, HCW, YW e AFC foram disponibilizados, respectivamente. Após o controle de qualidade, restaram dados de, aproximadamente, 450.000 polimorfismos de base única (SNP). Os modelos de análise utilizados foram BLUP genômico (GBLUP), single-step GBLUP (ssGBLUP), Bayesian LASSO (BL) e as abordagens semiparamétricas Reproducing Kernel Hilbert Spaces (RKHS) e Kernel Averaging (KA). Para cada característica foi realizada uma validação cruzada composta por cinco “folds” e replicada aleatoriamente trinta vezes. Os modelos estatísticos foram comparados em termos do erro do quadrado médio (MSE) e acurácia de predição (ACC). Os valores de ACC variaram de 0,39 a 0,40 (REA), 0,38 a 0,41 (BFT), 0,23 a 0,28 (TEN), 0,33 a 0,35 (HCW), 0,36 a 0,51 (YW) e 0,49 a 0,56 (AFC). 
Para todas as características, os modelos GBLUP e BL apresentaram acurácias de predição similares. Para REA, BFT e HCW, todos os modelos apresentaram ACC similares, entretanto a regressão RKHS obteve o melhor ajuste comparado ao KA. Para características com maior quantidade de registros fenotípicos comparada ao número de animais genotipados (YW e AFC) o modelo ssGBLUP é indicado. Considerando o desempenho geral, para todas as características estudadas, a regressão RKHS é, particularmente, uma alternativa interessante para a aplicação na seleção genômica, especialmente para características de baixa herdabilidade. No estudo de simulação, genótipos, pedigree e fenótipos para quatro características (A, B, C e D) foram simulados utilizando valores de herdabilidade baseados nos obtidos com os dados reais (0,09, 0,12, 0,36 e 0,39 para cada característica, respectivamente). O genoma simulado consistiu de 735.293 marcadores e 1.000 QTLs distribuídos aleatoriamente por 29 pares de autossomos, com comprimento variando de 40 a 146 centimorgans (cM), totalizando 2.333 cM. Assumiu-se que os QTLs explicavam 100% da variação genética. Considerando as frequências do alelo menor maiores ou iguais a 0,01, um total de 430.000 marcadores foram selecionados aleatoriamente. Os fenótipos foram obtidos pela soma dos resíduos (aleatoriamente amostrados de uma distribuição normal com média igual a zero) aos valores genéticos verdadeiros, e todo o processo de simulação foi replicado 10 vezes. A ACC foi calculada por meio da correlação entre o valor genético genômico estimado e o valor genético verdadeiro, simulados da 12a a 15a geração. A média do desequilíbrio de ligação, medido entre os pares de marcadores adjacentes para todas as características simuladas foi de 0,21 para as gerações recentes (12a, 13a e 14a), e 0,22 para a 15a geração. A ACC para as características simuladas A, B, C e D variou de 0,43 a 0,44, 0,47 a 0,48, 0,80 a 0,82 e 0,72 a 0,73, respectivamente. 
Diferentes metodologias de seleção genômica implementadas neste estudo mostraram valores similares de acurácia de predição, e o método mais adequado é dependente da característica explorada. Em geral, as regressões RKHS obtiveram melhor desempenho em termos de ACC com menor valor de MSE em comparação com os outros modelos. / Animal breeding aims to improve the economic productivity of future generations of domestic species through selection. Most traits of economic interest in livestock have a complex and quantitative expression, i.e. they are influenced by a large number of genes and affected by environmental factors. Statistical analysis of phenotype and pedigree information allows the breeding values of selection candidates to be estimated under the infinitesimal model. A large amount of genomic data is now available for identifying and selecting genetically superior individuals, with the potential to increase the accuracy of predicted genetic values and thus the efficiency of animal breeding programs. Numerous studies have been conducted to identify appropriate methodologies for specific breeds and traits, which will result in more accurate genomic estimated breeding values (GEBVs). Therefore, the objective of this study was to verify whether semi-parametric models can be applied to genomic selection and to compare their predictive ability with that of parametric models for real (carcass, meat quality, growth and reproductive traits) and simulated data. The phenotypic and pedigree information was provided by eleven farms belonging to four animal breeding programs. For carcass and meat quality traits, the data set contained 3,643 records for rib eye area (REA), 3,619 records for backfat thickness (BFT), 3,670 records for meat tenderness (TEN) and 3,378 observations for hot carcass weight (HCW). 
A total of 825,364 records for yearling weight (YW) and 166,398 for age at first calving (AFC) were used as growth and reproductive traits of Nelore cattle. Genotypes of 2,710, 2,656, 2,749, 2,495, 4,455 and 1,760 animals were available for REA, BFT, TEN, HCW, YW and AFC, respectively. After quality control, approximately 450,000 single nucleotide polymorphisms (SNPs) remained. The methods of analysis were genomic BLUP (GBLUP), single-step GBLUP (ssGBLUP), Bayesian LASSO (BL) and the semi-parametric approaches Reproducing Kernel Hilbert Spaces (RKHS) regression and Kernel Averaging (KA). For each trait, a five-fold cross-validation with thirty random replicates was carried out, and the models were compared in terms of their prediction mean squared error (MSE) and accuracy of prediction (ACC). The ACC ranged from 0.39 to 0.40 (REA), 0.38 to 0.41 (BFT), 0.23 to 0.28 (TEN), 0.33 to 0.35 (HCW), 0.36 to 0.51 (YW) and 0.49 to 0.56 (AFC). For all traits, the GBLUP and BL models showed very similar prediction accuracies. For REA, BFT and HCW, all models provided similar prediction accuracies; however, RKHS regression had the best fit across traits considering multiple-step models, and in comparison with KA. For traits with more phenotyped animals than genotyped animals (YW and AFC), ssGBLUP is recommended. Judged by overall performance across all traits, RKHS regression is particularly appealing for application in genomic selection, especially for low-heritability traits. Simulated genotypes, pedigree and phenotypes for four traits (A, B, C and D) were obtained using heritabilities based on the real data (0.09, 0.12, 0.36 and 0.39 for each trait, respectively). The simulated genome consisted of 735,293 markers and 1,000 QTLs randomly distributed over 29 pairs of autosomes, with lengths varying from 40 to 146 centimorgans (cM), totaling 2,333 cM. It was assumed that the QTLs explained 100% of the genetic variance. 
Considering minor allele frequencies greater than or equal to 0.01, a total of 430,000 markers were randomly selected. The phenotypes were generated by adding residuals, randomly drawn from a normal distribution with mean equal to zero, to the true breeding values, and the whole simulation process was replicated 10 times. ACC was quantified as the correlation between the predicted genomic breeding values and the true breeding values simulated for generations 12 to 15. The average linkage disequilibrium, measured between pairs of adjacent markers for all simulated traits, was 0.21 for recent generations (12, 13 and 14) and 0.22 for generation 15. The ACC for simulated traits A, B, C and D ranged from 0.43 to 0.44, 0.47 to 0.48, 0.80 to 0.82 and 0.72 to 0.73, respectively. The genomic selection methodologies implemented in this study showed similar accuracies of prediction, and the optimal method was sometimes trait dependent. In general, RKHS regressions were preferable in terms of ACC and provided the smallest MSE estimates compared with the other models. / FAPESP: 2014/00779-0 / FAPESP: 2015/13084-3 Keywords: Acurácia; Características de carcaça; Regressão RKHS; Seleção genômica; SNP; Accuracy; Carcass traits; Genomic selection; RKHS regression
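The validation scheme described in this abstract (five folds, thirty random replicates, models compared by ACC and MSE) can be sketched in Python as follows. The sketch takes ACC to be the correlation between predicted and observed values in the held-out fold, which is an assumption for the real-data analyses, and it uses a plain ridge regression as a placeholder for GBLUP, BL, RKHS or KA. All names and the simulated data are hypothetical.

```python
import numpy as np


def ridge_fit_predict(x_train, y_train, x_test, lam=1.0):
    """Placeholder predictor (ridge regression) standing in for the thesis models."""
    mean_x, mean_y = x_train.mean(axis=0), y_train.mean()
    z = x_train - mean_x
    lhs = z.T @ z + lam * np.eye(z.shape[1])
    effects = np.linalg.solve(lhs, z.T @ (y_train - mean_y))
    return mean_y + (x_test - mean_x) @ effects


def repeated_kfold(x, y, fit_predict, n_folds=5, n_repeats=30, seed=0):
    """Return mean ACC (correlation) and mean MSE over all folds and replicates."""
    rng = np.random.default_rng(seed)
    acc, mse = [], []
    for _ in range(n_repeats):
        # A fresh random split into folds for every replicate.
        folds = np.array_split(rng.permutation(len(y)), n_folds)
        for test_idx in folds:
            train_mask = np.ones(len(y), dtype=bool)
            train_mask[test_idx] = False
            pred = fit_predict(x[train_mask], y[train_mask], x[test_idx])
            acc.append(np.corrcoef(pred, y[test_idx])[0, 1])
            mse.append(np.mean((pred - y[test_idx]) ** 2))
    return float(np.mean(acc)), float(np.mean(mse))


# Hypothetical simulated data: 60 animals, 10 markers, strong signal.
rng = np.random.default_rng(1)
x = rng.normal(size=(60, 10))
y = x @ rng.normal(size=10) + rng.normal(scale=0.1, size=60)

mean_acc, mean_mse = repeated_kfold(x, y, ridge_fit_predict, n_repeats=3)
print(round(mean_acc, 3), round(mean_mse, 4))
```

Passing the predictor as a function keeps the splitting logic identical for every model, so differences in ACC and MSE reflect the models rather than the folds they happened to receive.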
25	Estudo do desequilíbrio de ligação e estimativa do tamanho efetivo em uma população da raça gir selecionada para crescimento pós-desmama / Linkage disequilibrium and effective size on population of gir zebu breed selected for post-weaning weights Toro Ospina, Alejandra Maria [UNESP] 24 February 2017 O objetivo deste estudo foi estimar o desequilíbrio de ligação (r2) nas distâncias de 25-50kb, 50-100kb, 100-500kb, 0,5-1Mb e o tamanho efetivo (Ne) nas gerações 0, 5, 10, 15, 20 em população da raça Gir selecionada para crescimento pós-desmama. Os animais utilizados no presente estudo foram provenientes do rebanho fechado do Instituto de Zootecnia, Sertãozinho, SP. Foram obtidos os genótipos de 155 animais com o painel BovineDL 33kb e 18 com painel HD imputado onde realizou-se controle de qualidade (CQ) para alelo de menor frequência (MAF) < 0,02 e call rate < 0,1. Depois do CQ permaneceram 27.236 SNPs e 155 animais do painel de 33 kb e 732.962 SNPs e 173 animais do painel HD Imputado. As análises de r2 foram realizadas pelo programa Plink e programa estatístico R Studio e o Ne por meio do DL. 
Os resultados das distâncias 25-50kb, 50-100kb, 100-500kb e 0,5-1Mb do r2 para o painel 33kb foram iguais a 0,29, 0,25, 0,16 e 0,032 respectivamente, e 0,35, 0,29, 0,18, 0,032 para o painel HD imputado demostrando que o DL permaneceu nas distâncias menores a 100kb, decaindo com o aumento das distâncias. Estes resultados foram maiores aos descritos na literatura para animais zebuínos, sugerindo como causa os segmentos longos de haplótipos que compartilham os animais aparentados. O Ne foi igual a 9, 17, 24, 30 e 30 animais nas gerações 0, 5, 10, 15, 20, observa-se que o Ne é maior na geração 20, com 30 animais, e decai drasticamente a partir da 5 geração com 17 animais, e sendo de 9 animais a última geração, um tamanho pequeno para uma população. Os valores encontrados neste estudo mostram alto DL e baixo Ne, provavelmente pelo sistema de seleção e a estrutura da população da raça Gir avaliada, que apresenta alto nível de endogamia, perda da variabilidade genética, uso intensivo de pequeno número de reprodutores, conduzindo a diminuição da deriva genética da população, ocasionando dificuldades na seleção dos animais. / The aim of this study was to estimate the linkage disequilibrium (r2) at distances of 25-50kb, 50-100kb, 100-500kb, 0,5-1Mb and the effective population size (Ne) in generations 0, 5, 10, 15, 20 in population of the selected Gir for yearling growth. The animals used in this study were from the closed herd Animal Science Institute, Sertãozinho, SP. the genotypes of 155 animals were obtained with BovineDL 33kb and 18 animals of panel HD, where quality control was held (QC) for minor allele frequency (MAF) <0.02 and call rate <0.1. After QC remained 27,236 SNPs and 155 animals to panel 33 kb, 732.962 SNPs and 173 the panel HD imputation. The r2 analyzes were performed by Plink program and R Studio statistical program and Ne through the DL. 
The results of r2 for distances 25-50kb, 50-100kb, 100-500kb and 0,5-1Mb were equal to 0.29, 0.25, 0.16 and 0.032, respectively, showing that the DL remained in smaller distances 100kb, decreasing with increasing distances. These results were higher than those reported in the literature for Zebu animals, suggesting a cause to long haplotype segments that share the related animals. Ne is equal to 9, 17, 24, 30 and 30 in the generations 0, 5, 10, 15, 20, it is observed that Ne is higher in generation 20 with 30 animals and decays sharply from 5 Generation 17 animals, and with 9 animals the latest generation, small size for a population. The values found in this study to DL and Ne, explain the selection system and the structure of the population of Gir evaluated, which has a high level of inbreeding, loss of genetic variability, intensive small number of players, leading to decreased drift population genetics, causing difficulties in the selection of the next generations. Seleção genômica Haplótipos Deriva genética Endogamia Genomic selection Haplotypes Genetic drift Inbreeding
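To make the two quantities in this abstract concrete, the sketch below computes r2 between two SNPs as the squared correlation of allele dosages and converts a mean r2 at a given distance into an Ne estimate. The abstract does not say which estimator was used; the conversion here assumes Sved's (1971) approximation E[r2] ≈ 1/(1 + 4·Ne·c) and a uniform recombination rate of 1 cM per Mb, and the function names and example inputs are hypothetical.

```python
def r_squared(g1, g2):
    """Squared Pearson correlation between allele dosages (0, 1, 2) at two SNPs."""
    n = len(g1)
    m1, m2 = sum(g1) / n, sum(g2) / n
    cov = sum((a - m1) * (b - m2) for a, b in zip(g1, g2))
    v1 = sum((a - m1) ** 2 for a in g1)
    v2 = sum((b - m2) ** 2 for b in g2)
    if v1 == 0 or v2 == 0:
        raise ValueError("monomorphic SNP: r2 is undefined")
    return cov * cov / (v1 * v2)


def ne_from_ld(mean_r2, distance_bp, cm_per_mb=1.0):
    """Effective size from mean r2 under Sved's approximation.

    c is the recombination distance in Morgans (assumed proportional to
    physical distance). Returns (Ne, generations_ago), where the estimate
    is taken to refer to roughly 1 / (2c) generations in the past.
    """
    c = distance_bp / 1e6 * cm_per_mb / 100.0
    ne = (1.0 / mean_r2 - 1.0) / (4.0 * c)
    return ne, 1.0 / (2.0 * c)


# Hypothetical inputs, not values from the abstract.
ne, generations = ne_from_ld(mean_r2=0.2, distance_bp=100_000)
```

Published estimators usually also correct r2 for sample size and phasing, so results from this simplified form will not necessarily match the Ne values reported above.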
26	Bridging genomics and quantitative genetics of Eucalyptus: genome-wide prediction and genetic parameter estimation for growth and wood properties using high-density SNP data / Conectando a genômica à genética quantitativa de Eucalyptus: predição genômica e estimação de parâmetros genéticos para crescimento e propriedades de madeira usando alta densidade de SNPs Lima, Bruno Marco de 25 April 2014 (has links) Convergence of quantitative genetics and genomics is becoming the way that fundamental genetics and applied breeding will be carried out in the next decades. This study bridges the quantitative genetics of complex growth and wood property traits with genomic technologies towards a more innovative approach to tree breeding. Planted forests play a major role in meeting the growing world demand for wood products and energy. Eucalypts stand out for their high productivity and versatile wood, resulting from advanced breeding programs associated with clonal propagation and modern silviculture. Despite their fast growth, breeding cycles still take several years, and wood property assessment is limited to a sample of trees in the late stages of selection because of the costs involved in wood phenotyping, so the range of genetic variation in wood properties is not exploited. In this study, we examined fifteen traits, including growth and wood chemical and physical properties, in 1,000 individuals sampled from an elite Eucalyptus breeding population. Near-infrared spectroscopy (NIRS) models were developed and used for high-throughput phenotyping of wood traits. High-density data for 29,090 SNPs were used to obtain accurate pedigree-record-free estimates of trait variance components, heritabilities, and genetic and phenotypic correlations, based on a realized relationship matrix, and these were compared with pedigree-based estimates. To the best of our knowledge, this is the first study to do this in plants. 
NIRS predictions were accurate for wood chemical traits and wood density, and variably successful for physical traits. Heritabilities were medium for growth (0.34 to 0.44), high for wood chemical traits (0.56 to 0.85) and variable for wood physical traits (0.11 to 0.63). High positive correlations among growth traits and negative correlations between cellulose and lignin content were observed, while correlations between wood chemical and physical traits and between growth and wood quality traits were low, although significant. Phenotypes and SNP markers were then used to build genomic predictive models using a marker density higher than in any previous genomic selection study in trees (1 SNP/21 kbp). Two models (RR-BLUP and Bayesian LASSO) that differ in the assumed distribution of marker effects were used for genomic predictions. Predictions were compared with those obtained by phenotypic BLUP. Predictive abilities were very similar for the two models and strongly correlated with the heritabilities. Accurate genomic-enabled predictions were obtained for wood chemical traits related to lignin, wood density and growth, although they were generally 15 to 25% lower than those achieved by phenotypic BLUP prediction. Nevertheless, genomic predictions yielded a coincidence above 70% in selecting the top 30 trees ranked by phenotypic selection for growth, wood density and S:G ratio, and 60% when tandem selection was applied. The results of this study open opportunities for an increased use of high-throughput NIRS phenotyping and genome-wide SNP genotyping in Eucalyptus breeding, allowing accurate pedigree-record-free estimation of genetic parameters and prediction of genomic breeding values for trees yet to be phenotyped. These applications should become routine in tree breeding programs in the years to come, significantly reducing the length of breeding cycles while optimizing resource allocation and the sustainability of the breeding endeavor. 
Genomic selection Heritability Molecular marker Tree breeding
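The "realized relationship matrix" used in this abstract for pedigree-free parameter estimation can be illustrated with a small sketch. The abstract does not state which formula was applied; the code below assumes the widely used VanRaden (2008) form, G = ZZ' / (2 Σ p(1 − p)), where Z holds allele dosages centered by twice the allele frequency. The function name and toy genotypes are hypothetical.

```python
def vanraden_g(genotypes):
    """Genomic relationship matrix from 0/1/2 allele dosages.

    genotypes: list of individuals, each a list of dosages per SNP.
    Allele frequencies are estimated from the sample itself.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    # Frequency of the counted allele at each SNP.
    p = [sum(ind[j] for ind in genotypes) / (2.0 * n) for j in range(m)]
    denom = 2.0 * sum(pj * (1.0 - pj) for pj in p)
    if denom == 0:
        raise ValueError("all SNPs are monomorphic")
    # Center each dosage by its expectation 2p.
    z = [[ind[j] - 2.0 * p[j] for j in range(m)] for ind in genotypes]
    return [
        [sum(z[i][k] * z[l][k] for k in range(m)) / denom for l in range(n)]
        for i in range(n)
    ]


# Three toy individuals genotyped at three SNPs.
toy = [[0, 1, 2], [2, 1, 0], [1, 1, 1]]
G = vanraden_g(toy)
```

In a GBLUP or variance-component analysis, a matrix of this kind takes the place of the pedigree-based numerator relationship matrix.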
27	Prise en compte d’informations a priori en sélection génomique dans un dispositif d’hybrides de tournesol (Helianthus annuus L.) / Taking into account a priori information in genomic selection in a sunflower hybrid design Bonnafous, Fanny 18 December 2017 (has links) Genomic selection (GS) is a powerful tool for predicting the phenotypes or genetic values of individuals not yet observed, based on a panel that is both phenotyped and genotyped. The GBLUP mixed models usually used take all markers into account simultaneously, assuming that their effects all follow the same Gaussian distribution. Knowledge of the biological mechanisms underlying phenotypic variation is therefore not taken into account in such modeling. The aim of this thesis is to integrate a priori knowledge into GBLUP models, such as genomic regions involved in the variation of the traits of interest or gene networks, in order to evaluate the potential for improving prediction accuracy. These models were applied to the sunflower species Helianthus annuus L. for three traits (flowering time, yield and leaf senescence) in 13 different environments. One of the main challenges of studies on sunflower hybrids is to model hybrid vigor, or heterosis. 
Different hypotheses, including dominance, over-dominance and epistasis, have been proposed to clarify the genetic mechanisms underlying heterosis, but their relative importance is not clearly known. In this context, the first part of this study aimed to test the efficiency of GS in a hybrid population obtained by crossing 36 female lines with 36 male lines. For this purpose, models taking non-additive effects into account were tested, and the results were validated experimentally in the field over two years. Since the prediction of the genetic values of the hybrids was conclusive, we then looked for a priori information to integrate into these models. SNPs involved in the variation of the three traits of interest were searched for using several GWAS models (additive and non-additive). Moreover, with a view to testing models that take epistatic interactions into account, SNPs located in known gene networks were searched for. The last part of the thesis integrated the genomic regions involved in trait variation into the GBLUP models. Two methods were used for this, namely modeling the a priori information in the random part (MultiBLUP model) or in the fixed part of the models. These methods did not show a significant improvement in prediction accuracy compared with GBLUP models without a priori information. Genomic selection Sunflower Heterosis Non-additive MultiBLUP
28	Application of selection index theory comprising genomic information to breeding programs of sport horses and pigs Haberland, Anne Marie 03 May 2013 (has links) This thesis presents three model calculations based on the selection index, which are used to assess the benefit of additional genomic information for current selection strategies in sport horses and pigs. Chapter one presents the classical tools of animal breeding (the selection index, economic modelling and the gene-flow method) as well as genomic selection. The software ZPlan+, in which these tools are combined, is also briefly introduced. The second chapter presents the potential of genomic selection for sport horse breeding. The gain in accuracy of a breeding value from adding genomic information was examined for three practically relevant selection steps: horses without own performance, horses with own performance, and horses with own and progeny performance records. The model calculation showed that adding genomic information could considerably increase the accuracy of breeding values, especially for young horses without own or progeny performance records. As soon as progeny records were assumed, however, the gain in accuracy from genomic information was negligible. For practical horse breeding, genomic selection is therefore recommended mainly for backing up provisional breeding approvals of young stallions after licensing and for performance-tested stallions without progeny records. The third chapter examined how the type of information source considered (conventional or genomic) affected the interaction of the parameters genetic correlation, heritability and economic weighting. 
The standard deviation of the respective index, which is directly proportional to genetic gain, was used as the criterion for comparison. Three indices with two breeding goal traits were compared, with information sources chosen to reflect pig breeding. The first index was set up for an animal with own performance, while the second index additionally assumed full-sib performance records. In the third index, own performance was combined with genomic information for both breeding goal traits. The accuracy of the genomic breeding values was varied by assuming reference populations of different sizes. The more information was included in the index, the more independent the standard deviation of the index became of the parameters genetic correlation, heritability and economic weighting. Comparable results were found for the full-sib index with 7 full sibs and the genomic index with a reference population of 1,000 animals. A reference population of 1,000 animals can thus be taken as a minimum for pig breeding. For lower heritabilities, the genomic index was clearly superior to the full-sib index. In the fourth chapter, a pig breeding program was optimized. For this purpose, different breeding goals and selection strategies against boar taint were compared. The breeding goal was either selection against the content of the main chemical components of boar taint (androstenone, skatole and indole) or selection against the human nose score assessed by test persons. Within selection against the chemical components, three information sources were compared: a field test in the form of a biopsy on the live boar, genomic selection, and the combination of both. 
Within selection against the human nose score, a station test of full and half sibs of the selection candidate and genomic selection were compared as information sources. The complex breeding program was modelled deterministically with the software ZPlan+. For selection against the chemical components of boar taint, genetic gain was highest when the own performance of the male selection candidates, in the form of a biopsy, was used as the information source. Because of the high heritability of the boar taint components, the benefit of genomic breeding values was clearly lower, and genomic selection was also more expensive. For selection against the less heritable human nose score, genomic selection gave higher genetic gain than station testing of the selection candidate's sibs. Even when the human nose score was regarded as the target trait, selection against the chemical components proved effective, because the (correlated) genetic gain in natural units for the human nose score was clearly higher with a biopsy than with direct selection against the human nose score. The fifth chapter discusses the potential of genomic selection for sport horse and pig breeding, with particular attention to options for enlarging the reference population and reducing costs. For sport horse breeding, ways of increasing genetic gain with conventional methods are also shown. 630 genomic selection selection index sport horse boar taint pig genetic gain Agriculture and forestry (PPN621302791)
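The index comparison in the third chapter of this thesis rests on the standard deviation of the selection index. A minimal sketch of that quantity for the simplest case (an animal with own performance on two breeding goal traits) is given below. It assumes the classical index b = P⁻¹Gw, with P the phenotypic and G the genetic covariance matrix and w the economic weights; it is not the ZPlan+ implementation, and all names and parameter values are illustrative.

```python
import math


def cov_matrix(sd1, sd2, corr):
    """2 x 2 covariance matrix from two standard deviations and a correlation."""
    c = corr * sd1 * sd2
    return [[sd1 * sd1, c], [c, sd2 * sd2]]


def matvec(a, x):
    """Product of a 2 x 2 matrix and a length-2 vector."""
    return [a[0][0] * x[0] + a[0][1] * x[1], a[1][0] * x[0] + a[1][1] * x[1]]


def solve2(a, y):
    """Solve the 2 x 2 system a @ b = y by Cramer's rule."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [(a[1][1] * y[0] - a[0][1] * y[1]) / det,
            (a[0][0] * y[1] - a[1][0] * y[0]) / det]


def own_performance_index(h2, rg, rp, w, sd_p=(1.0, 1.0)):
    """Index weights, index SD and accuracy for own records on two traits.

    h2: heritabilities, rg/rp: genetic/phenotypic correlation,
    w: economic weights, sd_p: phenotypic standard deviations.
    """
    sd_g = [math.sqrt(h) * s for h, s in zip(h2, sd_p)]
    P = cov_matrix(sd_p[0], sd_p[1], rp)
    G = cov_matrix(sd_g[0], sd_g[1], rg)
    Gw = matvec(G, w)
    b = solve2(P, Gw)                                   # b = P^-1 G w
    sd_index = math.sqrt(b[0] * Gw[0] + b[1] * Gw[1])   # sqrt(b'Pb) = sqrt(b'Gw)
    sd_goal = math.sqrt(w[0] * Gw[0] + w[1] * Gw[1])    # SD of the aggregate genotype
    return b, sd_index, sd_index / sd_goal


b, sd_index, accuracy = own_performance_index(
    h2=(0.25, 0.25), rg=0.0, rp=0.0, w=(1.0, 0.0))
```

With uncorrelated traits and all economic weight on the first trait, the index reduces to single-trait selection on own performance, so the accuracy equals the square root of the heritability (0.5 here).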
29	Métodos de redução de dimensionalidade aplicados na seleção genômica para características de carcaça em suínos / Dimensionality reduction methods applied to genomic selection for carcass traits in pigs Azevedo, Camila Ferreira 26 July 2012 (has links) The main contribution of molecular genetics to animal breeding is the direct use of DNA information to identify genetically superior individuals. Genome-wide selection (GWS), which consists of analyzing a large number of SNP (single nucleotide polymorphism) markers widely distributed across the genome, was conceived for this purpose. Using this information is a challenge, because the number of markers is much larger than the number of genotyped individuals (high dimensionality) and the markers are highly correlated (multicollinearity). The success of genome-wide selection therefore depends on methodologies that address these difficulties. Accordingly, the aim of this dissertation was to propose the application of Independent Component Regression (ICR), Principal Component Regression (PCR), Partial Least Squares (PLS) regression and Random Regression Best Linear Unbiased Predictor (RR-BLUP) to carcass traits in an F2 population of pigs originating from the cross of two males of the naturalized Brazilian breed Piau with 18 females of a commercial line (Large White × Landrace × Pietrain), developed at the Universidade Federal de Viçosa. The specific objectives were to estimate the Genomic Breeding Value (GBV) of each individual and to estimate the effects of the SNP markers in order to compare the methods. The results showed that the ICR method was the most efficient, since it provided the most accurate GBV estimates for most carcass traits. 
Dimensionality reduction methods Genomic selection CNPQ::CIENCIAS AGRARIAS
30	Estudos genômicos de características indicadoras de eficiência alimentar em duas populações de bovinos da raça Nelore / Genomic studies of feed efficiency traits in two Nelore populations Santos, Samuel Wallace Boer dos 31 July 2018 (has links) Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) / Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) / Feed efficiency traits are directly associated with the profitability and sustainability of beef cattle production. Feed conversion rate, residual feed intake, dry matter intake, feed efficiency and average daily gain are important traits for selecting more efficient animals within a production system, but, except for weight gain, they have not been considered as selection criteria because of the difficulty of obtaining phenotypes. With advances in genotyping and sequencing technologies, high-density chips of SNP (Single Nucleotide Polymorphism) markers spread across the genome have been developed. This molecular information has been used in genome-wide association (GWAS) and genomic selection (GS) studies. 
Basically, GWAS allows the identification of genetic variants with major effects on the phenotypic expression of traits of interest, while GS aims to predict the direct genomic value of selection candidates using only their molecular information, which has revolutionized animal breeding by shortening the generation interval and increasing the prediction accuracy of breeding values. 
Thus, the objectives of the present study were to: 1) identify chromosomal regions with major effects on feed efficiency traits in animals from two Nellore breeding programs (Instituto de Zootecnia, IZ, and Nellore Qualitas), in order to find possible differences and similarities between the populations; 2) evaluate the existence of candidate genes common to both populations; and 3) evaluate the possibility and benefits of combining these two Nellore populations in genomic selection studies. Phenotypic and genotypic information from 1,137 IZ animals and 817 Qualitas animals was used. The animals were genotyped with a high-density panel (Illumina BovineHD chip) or had their genotypes imputed to HD with the FImpute software. After quality control, 408,161 SNPs remained for analysis for IZ and 428,611 SNPs for Qualitas. The GWAS was performed for each population individually, using the GBLUP methodology. Single-trait models were used in the analyses, including, in addition to the random effects of animal and residual, the systematic effects of contemporary groups (CG), which were defined as sex, year of birth and facility for IZ, and year of test and pen for Qualitas. For IZ, the models for all traits also included the fixed effect of month of birth and, as covariates, age of animal (linear effect), age of dam (linear and quadratic effects) and the first two principal components (calculated from the G matrix). For ADG and DMI, the quadratic effect of age of animal was also added to the model as a covariate. For Qualitas, the linear effect of age of animal was included as a covariate for all traits. In the GWAS, some chromosomal regions of greater effect were found for each trait in both populations. However, no common regions were found. In GS, ten different approaches and schemes involving the two Nellore populations were used to compare the accuracy of genomic prediction. 
In general, genomic predictions combining both populations are feasible, but the benefits depend on the trait and the validation scheme. / CNPq: 132884/2016-0 / FAPESP: 2016/24228-9 / FAPESP: 2017/13411-0 Animal breeding Genomic selection Genome wide association studies
