Global ETD Search

801	Geochemical and Petrographic Characterization of the Transition Boundary between the MG2 package to MG3 package at Dwarsrivier Chrome Mine, Bushveld Complex, South Africa Ramushu, Adam Puleng January 2018 (has links) Magister Scientiae - MSc (Earth Science) / This study area is situated within the Eastern Bushveld complex at Dwarsrivier chrome mine, which is approximately 30 km from Steelpoort and 60km from Lydenburg in the Mpumalanga province. The primary aim of the project is to identify the petrological and geochemical characteristics that can be used to distinguish the various rock types of feldspathic pyroxenites, chromitites, anorthosites and chromitite pyroxenites and determine whether the various rock types are from the MG2 package and MG3 package were formed from a single or multiple magma pulses. The geochemical and mineralogical variation studies were carried out using cores from borehole DWR74 and DWR172 located on the farm Dwarsrivier 372 KT. Using the combination of various multivariate statistical techniques (factor, cluster and discriminant analysis) multi element diagrams and trace element ratios, the outcome of the study demonstrated that each of the four rock types can be sub-divided into two groups.
802	Seleção de genótipos de soja portadores ou não do gene RR por meio de análise multivariada e desempenho agronômico / Selection of soybean genotypes carrying or not the RR gene through multivariate analysis and agronomic performance Leite, Wallace de Sousa [UNESP] 19 February 2016 (has links) Submitted by WALLACE DE SOUSA LEITE null (leitewallace@hotmail.com) on 2016-03-08T01:08:24Z No. of bitstreams: 1 DISSERTAÇÃO - WALLACE LEITE CORREGIDA.. 2016.pdf: 1066689 bytes, checksum: f8ec8b34776ac48c519907f49bb30767 (MD5) / Approved for entry into archive by Ana Paula Grisoto (grisotoana@reitoria.unesp.br) on 2016-03-09T14:55:54Z (GMT) No. of bitstreams: 1 leite_ws_me_jabo.pdf: 1066689 bytes, checksum: f8ec8b34776ac48c519907f49bb30767 (MD5) / Made available in DSpace on 2016-03-09T14:55:54Z (GMT). No. of bitstreams: 1 leite_ws_me_jabo.pdf: 1066689 bytes, checksum: f8ec8b34776ac48c519907f49bb30767 (MD5) Previous issue date: 2016-02-19 / Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) / A seleção de genótipos superiores de soja é um processo complexo, dessa forma, técnicas exploratórias multivariadas podem ser aplicadas para selecionar genótipos, analisando simultaneamente todos os caracteres agronômicos em estudo. Mediante o exposto, o objetivo do presente trabalho, consistiu em selecionar genótipos de soja Roundup Ready com bons caracteres agronômicos por meio de análises multivariadas, identificar quais caracteres mais influenciam a produtividade de grãos, comparar o desempenho agronômico de genótipos de soja RR com convencionais, oriundos de cruzamentos biparentais e avaliar por meio da análise de trilha, a relação entre caracteres de importância agronômica com a produtividade de grãos. Na geração F5, 227 linhagens de soja, sendo estas portadoras ou não do gene RR, foram avaliadas em delineamento de blocos aumentados com testemunhas intercalares. Na geração F6, os genótipos foram separados em dois grupos (RR com 27 genótipos e Convencional com 23 genótipos) e avaliados em dois experimentos distintos, conduzidos em delineamento experimental de blocos ao acaso com três repetições. Foram avaliados os principais caracteres de interesse agronômico. Para as análises exploratórias multivariadas, utilizou-se a técnica de componentes principais e análise de agrupamento pelo método hierárquico de Ward e pelo método não hierárquico de k-médias. Na análise de componentes principais na geração F5 três autovalores foram superiores a um, explicando 67,58% da variância contida nas informações originais, sendo caracterizados pelos caracteres altura da planta na maturidade, acamamento, valor agronômico, número de ramos, número de vagens e produtividade de grãos que permitiram a discriminação de genótipos de soja Roundup Ready com bons atributos agronômicos. Os resultados das análises de agrupamento pelo método de K-médias e pelo o método de Ward quando comparados foram semelhantes, pois agruparam os genótipos específicos para os caracteres selecionados na análise de componentes principais, em um mesmo grupo. Na geração F6, Os genótipos de soja apresentaram desempenho produtivo diferenciado dentro do grupo RR e convencional estudados. Dentre os genótipos RR analisados, 3 genótipos obtiveram alta produtividade de grãos, com valores médios superiores a 4.575,5 kg ha-1. Para os genótipos convencionais, 10 genótipos, além da testemunha Conquista apresentaram superioridade para o caráter produtividade de grãos, com valores médios superiores a 3.511,4 kg ha-1. Concluiu-se com este trabalho que a análise de componentes principais permitiu a discriminação e seleção de 16 genótipos de soja Roundup Ready com superioridade agronômica. Os caracteres que compõe os componentes de produção (Número de ramos e de vagens) exerceram maior influência sobre a produtividade de grãos, pois relacionaram-se positivamente. Os genótipos de soja RR mais produtivos, apresentaram valores superiores quando comparados aos genótipos convencionais de maior rendimento. / The selection of superior genotypes of soybean is a complex process. Thus, the exploratory multivariate techniques to select genotypes is an alternative, analyzing simultaneously all traits under study. Therefore, the objectives of this study were to select Roundup Ready soybean genotypes with good agronomic traits through multivariate analysis, identify which trait has the highest influence in grain yield and to compare the agronomic performance of two groups of soybean genotypes – RR and conventional – originated of two-way crosses and evaluate through path analysis, the relationship between important agronomic traits with the grain yield. In the F5 generation, 227 lines of soybean – carrying or not the RR gene – have been evaluated in augmented randomized design with additional control. In the F6 generation, the genotypes were separated into two groups (RR with 27 genotypes and conventional with 23 genotypes) and evaluated in two separated experiments conducted in a randomized blocks design with three replications. The main traits of agronomic interest has been evaluated. For the multivariate analysis, it was applied the technique of principal components and hierarchical cluster analysis by the methods of Ward and non-hierarchical k-means. In the principal component analysis, there were three eigen values greater than one explaining 67.58% of the total variance in the F5 generation, characterized by the traits: plant height at maturity, lodging, agronomic value, number of branches, number of pods and grain yield. These traits allowed discriminating the RR genotypes with good agronomic traits. The results of cluster analyzes by the methods of K-means and Ward were similar, since they clustered the specific genotypes for the selected traits in the same group. In generation F6, the soybean genotypes showed different growth performance within the RR and conventional groups. Among the analyzed genotypes RR, 3 genotypes obtained high grain yield, with average values greater than 4575.5 kg ha-1. For conventional genotypes, 10 genotypes and the check Conquista showed superiority to the traits grain yield, with average values greater than 3511.4 kg ha-1. The principal component analysis allowed the discrimination and selection of 16 Roundup Ready soybean genotypes with agronomic superiority. The traits that compose the components of production (number of branches and pods) had the greatest influence on grain yield, once they were positively related. The most productive RR genotypes showed higher values, when compared to the highest yielding genotypes. / CNPq: 132034/2014-0 Glycine max Análise de agrupamento Componentes principais Desempenho de genótipos Melhoramento genético Seleção de caracteres Glycine max Breeding Cluster analysis Performance genotypes Principal components Traits selection
803	Variabilidade de caracteres nutricionais e de produtividade de grãos em cultivares de milho / Variability of nutritional and productivity characters of grains in maize cultivars Alves, Bruna Mendonça 19 February 2013 (has links) Coordenação de Aperfeiçoamento de Pessoal de Nível Superior / This study aimed to verify whether there is genetic variability regarding nutritional and productivity characters of grains among cultivars of early cycle, very early cycle and transgenic maize and whether variability exists, discuss the differences among the cultivars, in order to select cultivars for crossing. It was used data of three experiments conducted during the harvest 2009/2010, at the experimental area of the Plant Science Department of the Federal University of Santa Maria. During these experiments were analyzed 76 maize cultivars, being 36 of early cycle, 22 of very early cycle and 18 of transgenic cultivar. In each experiment, after the harvest, in each of the three replicates of each cultivar, it was measured the following variables: grain productivity, crude protein, lysine, methionine, cysteine, threonine, tryptophan, valine, isoleucine, leucine, phenylalanine, histidine, arginine, ethere extract, starch and amylose in percentage of crude material. For each experiment, it was made an analysis of variance of each variable, according to a randomized block design. The averages of cultivars were compared by using Scott-Knott test. For each experiment was used the cluster analysis. Initially, discarding of variables was done in three steps: 1) Removal of variables without significant importance for the cultivar (F test of ANOVA), 2) Removal of variables that increase multicollinearity in the correlation matrix among the variablesand 3) Removal of variables by main components analysis. With the remaining variables was determined the Mahalanobis distance matrix (D2) and the cultivar clustering was done by using UPGMA method. Posteriorly, it was built a dendogram and calculated the cophenetic correlation coefficient. In order to test the hypothesis of differences among the average profile, for each experiment, it was calculated the evaluation of multivariate analysis of variance. There is variability among the early cycle, very early cycle and transgenic cultivars regarding grain productivity and nutritional characters in maize. Except starch which showed no variability in early cycle cultivar and transgenic cultivar, and methionine variables for transgenic cultivars. Based on variables grain productivity, crude protein and amylose, three groups were formed for early maturity cultivars, three groups for veryearly cycle cultivars and two groups for transgenic cultivars. / O presente trabalho teve como objetivos, verificar se há variabilidade genética, em relação aos caracteres nutricionais e à produtividade de grãos, entre as cultivares de ciclo precoce, de ciclo superprecoce e transgênicas de milho, e, existindo variabilidade, avaliar a divergência genética entre as cultivares com a finalidade de selecionar cultivares para cruzamentos. Foram utilizados os dados de três experimentos conduzidos na safra agrícola 2009/2010, na área experimental do Departamento de Fitotecnia, da Universidade Federal de Santa Maria. Nesses experimentos, foram avaliadas 76 cultivares de milho, sendo 36 de ciclo precoce, 22 de ciclo superprecoce e 18 cultivares transgênicas. Em cada experimento, após a colheita, em cada uma das três repetições de cada cultivar, foram mensuradas as seguintes variáveis: produtividade de grãos, proteína bruta, lisina, metionina, cisteina, treonina, triptofano, valina, isoleucina, leucina, fenilalanina, histidina, arginina, extrato etéreo, amido e amilose em porcentagem da matéria bruta (%MB). Para cada experimento, foi realizada a análise de variância de cada variável, conforme o modelo matemático de blocos ao acaso. As médias das cultivares foram comparadas por meio do teste de Scott-Knott. Para cada experimento foi realizada a análise de agrupamento. Para isso, inicialmente foi feito o descarte de variáveis em três etapas: 1) retirada de variáveis sem efeito significativo para cultivar (teste F da ANOVA); 2) retirada de variáveis causadoras de multicolinearidade na matriz de correlação entre as variáveis e; 3) descarte de variáveis por meio da análise de componentes principais. Após, com as variáveis que permaneceram. Com as variáveis que permaneceram foi determinada a matriz de distância de Mahalanobis (D2) e o agrupamento das cultivares foi realizado por meio do método UPGMA. Posteriormente, foi construído um dendrograma e calculado o coeficiente de correlação cofenética (CCC). Para testar a hipótese da diferença entre os perfis de médias de cada grupo, para cada experimento, foi realizada a análise de variância multivariada. Há variabilidade entre as cultivares de ciclo precoce, de ciclo superprecoce e cultivares transgênicas em relação à produtividade de grãos e aos caracteres nutricionais de grãos de milho. O amido não apresentou variabilidade em cultivares de ciclo precoce e cultivares transgênicas e a variável metionina para cultivares transgênicas. Com base nas variáveis produtividade de grãos, proteína bruta e amilose, foram formados três grupos para as cultivares de ciclo precoce, três grupos para as cultivares de ciclo superprecoce e dois grupos para as cultivares transgênicas. Zea mays L. Divergência genética Distância generalizada de Mahalanobis Análise de agrupamento Zea mays L. Genetic divergence Mahalanobis generalized distance Cluster analysis CNPQ::CIENCIAS AGRARIAS::AGRONOMIA
804	MMD and Ward criterion in a RKHS : application to Kernel based hierarchical agglomerative clustering / Maximum Dean Discrepancy et critère de Ward dans un RKHS : application à la classification hierarchique à noyau Li, Na 01 December 2015 (has links) La classification non supervisée consiste à regrouper des objets afin de former des groupes homogènes au sens d’une mesure de similitude. C’est un outil utile pour explorer la structure d’un ensemble de données non étiquetées. Par ailleurs, les méthodes à noyau, introduites initialement dans le cadre supervisé, ont démontré leur intérêt par leur capacité à réaliser des traitements non linéaires des données en limitant la complexité algorithmique. En effet, elles permettent de transformer un problème non linéaire en un problème linéaire dans un espace de plus grande dimension. Dans ce travail, nous proposons un algorithme de classification hiérarchique ascendante utilisant le formalisme des méthodes à noyau. Nous avons tout d’abord recherché des mesures de similitude entre des distributions de probabilité aisément calculables à l’aide de noyaux. Parmi celles-ci, la maximum mean discrepancy a retenu notre attention. Afin de pallier les limites inhérentes à son usage, nous avons proposé une modification qui conduit au critère de Ward, bien connu en classification hiérarchique. Nous avons enfin proposé un algorithme itératif de clustering reposant sur la classification hiérarchique à noyau et permettant d’optimiser le noyau et de déterminer le nombre de classes en présence / Clustering, as a useful tool for unsupervised classification, is the task of grouping objects according to some measured or perceived characteristics of them and it has owned great success in exploring the hidden structure of unlabeled data sets. Kernel-based clustering algorithms have shown great prominence. They provide competitive performance compared with conventional methods owing to their ability of transforming nonlinear problem into linear ones in a higher dimensional feature space. In this work, we propose a Kernel-based Hierarchical Agglomerative Clustering algorithms (KHAC) using Ward’s criterion. Our method is induced by a recently arisen criterion called Maximum Mean Discrepancy (MMD). This criterion has firstly been proposed to measure difference between different distributions and can easily be embedded into a RKHS. Close relationships have been proved between MMD and Ward's criterion. In our KHAC method, selection of the kernel parameter and determination of the number of clusters have been studied, which provide satisfactory performance. Finally an iterative KHAC algorithm is proposed which aims at determining the optimal kernel parameter, giving a meaningful number of clusters and partitioning the data set automatically Classification automatique (statistique) Reconnaissance des formes (informatique) Apprentissage automatique Tests d'hypothèses (statistique) Cluster analysis Pattern recognition systems Machine learning Statistical hypothesis testing 620.004 52
805	ESTRUTURA E RELAÇÕES AMBIENTAIS DE GRUPOS FLORÍSTICOS EM FRAGMENTO DA FLORESTA OMBRÓFILA MISTA, RIO GRANDE DO SUL, BRASIL / STRUCTURE AND RELATIONSHIPS ENVIRONMENTAL OF FLORISTIC GROUPS ON FRAGMENT OF THE MIXED RAIN FOREST, RIO GRANDE DO SUL, BRAZIL Greff, Luiz Thiago Brondani 02 March 2012 (has links) Coordenação de Aperfeiçoamento de Pessoal de Nível Superior / This study was developed in a fragment of Mixed Ombrophylous Forest, located on National Forest of São Francisco de Paula in the state of Rio Grande do Sul, aiming to verify the formation of floristic groups and to characterize them in terms of structure, floristic composition and possible relations with the environment. Were allocated 16 plots of 50 by 50 m, totaling four hectares of sampling, where the vegetation with circumference at breast height. greater than 30 cm was measured. Soil characteristics and topographic variables were obtained to explore the environmental heterogeneity and possible relationship to the vegetation. The results were measured and identified 3171 trees distributed in 79 species, 55 genera, belonging to 32 botanical families. It was observed the formation of three floristic groups of which stands out the species Blepharocalyx salicifolius and Sebastiania commersoniana as characteristic species of a group that occurs in local small slope and soil relatively moist; Siphoneugena reitzii and Vernonanthura discolor as characterized an association with several exclusive species, small basal area and occur in places of great slopes and shallow soils, and also the species Araucaria angustifolia and Luehea divaricata characterize a third group, which occurs in areas of deep soil, good drainage, mean slope, and high basal area . It was observed through the logistic regression that changeable soil characteristics, topographical and structural of the vegetation affect in positive and negative way the occurrence probability them main species them florísticos groups. / Este estudo foi realizado em fragmento da Floresta Ombrófila Mista, situado na Floresta Nacional de São Francisco de Paula, no estado do Rio Grande do Sul, tendo como objetivo verificar a formação de grupos florísticos e caracterizá-los quanto à estrutura, composição florística e possíveis relações com o ambiente. Foram alocadas 16 unidades amostrais de 50 por 50 m, totalizando quatro hectares de área amostral, onde a vegetação com circunferência a altura do peito maior que 30 cm foi mensurada. Variáveis edáficas e topográficas foram obtidas para explorar a heterogeneidade ambiental e possíveis relações com a vegetação. Como resultados foram mensuradas e identificadas 3171 árvores, distribuídas em 79 espécies, 55 gêneros, pertencentes a 32 famílias botânicas. Verificou-se a formação de três grupos florísticos dos quais destacam-se as espécies Blepharocalyx salicifolius e Sebastiania commersoniana como espécies características de um grupo, que ocorre em locais de pequena declividade e em solos relativamente úmidos; Siphoneugena reitzii e Vernonanthura discolor caracterizam uma associação com várias espécies exclusivas, pequena área basal e que ocorrem em locais de grande declividade e solos rasos; e ainda, as espécies Araucaria angustifolia e Luehea divaricata caracterizam um terceiro grupo, que ocorre em locais de solo profundo, bem drenados, declividade média, e elevada área basal. Observou-se através da regressão logistíca que variáveis edáficas, topográficas e estruturais da vegetação afetam de maneira positiva e negativa a probabilidade de ocorrência das principais espécies dos grupos florísticos. . Análise de agrupamento Fitossociologia Floresta com araucária Regressão logística Cluster analysis Phytosociology Forest with araucaria Logistic regression
806	Estudo estat?stico sobre eventos de precipita??o intensa no nordeste do Brasil / Statistical analysis of the extreme rainfall events in northeastern Brazil Oliveira, Priscilla Teles de 16 April 2014 (has links) Made available in DSpace on 2014-12-17T14:12:03Z (GMT). No. of bitstreams: 1 PriscillaTO_TESE.pdf: 3493221 bytes, checksum: 724431c57caf371626a1f4eb15f4b92c (MD5) Previous issue date: 2014-04-16 / Coordena??o de Aperfei?oamento de Pessoal de N?vel Superior / The Northeast of Brazil (NEB) shows high climate variability, ranging from semiarid regions to a rainy regions. According to the latest report of the Intergovernmental Panel on Climate Change, the NEB is highly susceptible to climate change, and also heavy rainfall events (HRE). However, few climatology studies about these episodes were performed, thus the objective main research is to compute the climatology and trend of the episodes number and the daily rainfall rate associated with HRE in the NEB and its climatologically homogeneous sub regions; relate them to the weak rainfall events and normal rainfall events. The daily rainfall data of the hydrometeorological network managed by the Ag?ncia Nacional de ?guas, from 1972 to 2002. For selection of rainfall events used the technique of quantiles and the trend was identified using the Mann-Kendall test. The sub regions were obtained by cluster analysis, using as similarity measure the Euclidean distance and Ward agglomerative hierarchical method. The results show that the seasonality of the NEB is being intensified, i.e., the dry season is becoming drier and wet season getting wet. The El Ni?o and La Ni?a influence more on the amount of events regarding the intensity, but the sub-regions this influence is less noticeable. Using daily data reanalysis ERAInterim fields of anomalies of the composites of meteorological variables were calculated for the coast of the NEB, to characterize the synoptic environment. The Upper-level cyclonic vortex and the South atlantic convergene zone were identified as the main weather systems responsible for training of EPI on the coastland / O Nordeste do Brasil (NEB) apresenta alta variabilidade no clima, abrangendo desde regi?es semi-?ridas at? regi?es com alto ?ndice pluviom?trico. Segundo o ?ltimo relat?rio do Intergovernmental Panel on Climate Change, o NEB ? uma regi?o altamente suscept?vel ?s mudan?as clim?ticas, al?m de ser uma regi?o sujeita ? ocorr?ncia de eventos de precipita??o intensa (EPI); contudo, ainda existem poucos estudos sobre a climatologia destes epis?dios na regi?o. Neste sentido, o objetivo principal da pesquisa ? determinar a climatologia e tend?ncia dos EPI sobre o NEB e suas sub-regi?es climatologicamente homog?neas, comparando seu comportamento com a climatologia e tend?ncia dos eventos de precipita??o fraca e dos eventos de precipita??o normal. Para tanto, foram utilizados os dados di?rios de precipita??o da rede hidrometeorol?gica gerenciada pela Ag?ncia Nacional de ?guas, para o per?odo de 1972 a 2002. Por interm?dio da t?cnica dos quantis foram definidos os eventos de precipita??o e sua confian?a estat?stica foi analisada atrav?s do teste de Mann Kendall. As sub-regi?es foram obtidas por meio da an?lise de cluster, utilizando como medida de similaridade a dist?ncia euclidiana e o m?todo hier?rquico aglomerativo de Ward. Os resultados mostraram que a sazonalidade do NEB est? sendo intensificada, ou seja, a esta??o seca est? se tornando mais seca e esta??o chuvosa ficando mais chuvosa. Os fen?menos El Ni?o e La Ni?a influenciam mais em rela??o ? quantidade de eventos do que em rela??o ? intensidade, mas nas sub-regi?es esta influ?ncia ? menos percept?vel. Utilizando dados di?rios das rean?lises do ERA-Interim, campos das anomalias dos compostos de vari?veis meteorol?gicas foram calculados para o litoral do NEB, para caracteriza??o do ambiente sin?tico. Foram identificados os V?rtices Cicl?nicos de Altos N?veis e a Zona de Converg?ncia do Atl?ntico Sul como os principais sistemas meteorol?gicos respons?veis pela forma??o dos EPI no litoral
807	DFA e an?lise de agrupamento aplicadas a perfis de porosidade neutr?nico em po?os de petr?leo Silva, Francisco Wilton de Freitas 22 May 2009 (has links) Made available in DSpace on 2015-03-03T13:59:42Z (GMT). No. of bitstreams: 1 FranciscoWFA.pdf: 1362232 bytes, checksum: 33548c2a28a5c7d6034cf165f163a691 (MD5) Previous issue date: 2009-05-22 / ?Peng was the first to work with the Technical DFA (Detrended Fluctuation Analysis), a tool capable of detecting auto-long-range correlation in time series with non-stationary. In this study, the technique of DFA is used to obtain the Hurst exponent (H) profile of the electric neutron porosity of the 52 oil wells in Namorado Field, located in the Campos Basin -Brazil. The purpose is to know if the Hurst exponent can be used to characterize spatial distribution of wells. Thus, we verify that the wells that have close values of H are spatially close together. In this work we used the method of hierarchical clustering and non-hierarchical clustering method (the k-mean method). Then compare the two methods to see which of the two provides the best result. From this, was the parameter ? (index neighborhood) which checks whether a data set generated by the k- average method, or at random, so in fact spatial patterns. High values of ? indicate that the data are aggregated, while low values of ? indicate that the data are scattered (no spatial correlation). Using the Monte Carlo method showed that combined data show a random distribution of ? below the empirical value. So the empirical evidence of H obtained from 52 wells are grouped geographically. By passing the data of standard curves with the results obtained by the k-mean, confirming that it is effective to correlate well in spatial distribution / Peng foi o primeiro a trabalhar com a T?cnica DFA (Detrended Fluctuation Analysis), uma ferramenta capaz de detectar auto-correla??o de longo alcance em s?ries temporais com n?o-estacionaridade. Nesse trabalho, a t?cnica de DFA ? utilizada para obter o expoente de Hurst (H) do perfil el?trico de Porosidade Neutr?nica dos 52 po?os petrol?feros Campo de Namorado, situado na Bacia de Campos ? RJ. A finalidade ? saber se o expoente de Hurst pode ou n?o ser usado para se caracterizar uma distribui??o espacial dos po?os. Assim, queremos verificar se os po?os que apresentam valores pr?ximos de H est?o espacialmente pr?ximos entre si. Neste trabalho foi utilizado o m?todo de agrupamento hier?rquico e o m?todo de agrupamento n?o hier?rquico (m?todo do k-m?dia). Em seguida comparamos os dois m?todos para ver qual dos dois fornece o melhor resultado. A partir disso, foi criado o par?metro (?ndice de vizinhan?a) o qual verifica se um conjunto de dados gerados pelo m?todo km?dia, ou de forma aleat?ria, forma de fato padr?es espaciais. Altos valores de indicam que os dados est?o agregados, enquanto que baixos valores de indicam que os dados est?o espalhados (sem correla??o espacial). Com aux?lio do m?todo de Monte Carlo observou-se que dados agrupados aleatoriamente apresentam uma distribui??o de inferior ao valor emp?rico. Portanto os dados emp?ricos de H obtidos dos 52 po?os est?o agrupados espacialmente. Ao cruzar os dados das curvas de n?vel com os resultados obtidos pelo k-m?dia, confirmam que este ? eficaz para correlacionar po?os em distribui??o espacial Campo de Namorado Petr?leo Porosidade Neutr?nica DFA An?lise de Agrupamentos Campo de Namorado Oil Neutron Porosity DFA Cluster analysis
808	An?lise de Agrupamentos Com Base na Teoria da Informa??o: Uma Abordagem Representativa Ara?jo, Daniel Sabino Amorim de 18 March 2013 (has links) Made available in DSpace on 2014-12-17T14:55:09Z (GMT). No. of bitstreams: 1 DanielSAA_TESE_inicio_pag67.pdf: 3521346 bytes, checksum: 030bba7c8ca800b8151b345676b6759c (MD5) Previous issue date: 2013-03-18 / Coordena??o de Aperfei?oamento de Pessoal de N?vel Superior / Currently, one of the biggest challenges for the field of data mining is to perform cluster analysis on complex data. Several techniques have been proposed but, in general, they can only achieve good results within specific areas providing no consensus of what would be the best way to group this kind of data. In general, these techniques fail due to non-realistic assumptions about the true probability distribution of the data. Based on this, this thesis proposes a new measure based on Cross Information Potential that uses representative points of the dataset and statistics extracted directly from data to measure the interaction between groups. The proposed approach allows us to use all advantages of this information-theoretic descriptor and solves the limitations imposed on it by its own nature. From this, two cost functions and three algorithms have been proposed to perform cluster analysis. As the use of Information Theory captures the relationship between different patterns, regardless of assumptions about the nature of this relationship, the proposed approach was able to achieve a better performance than the main algorithms in literature. These results apply to the context of synthetic data designed to test the algorithms in specific situations and to real data extracted from problems of different fields / Atualmente, um dos maiores desafios para o campo de minera??o de dados ? realizar a an?lise de agrupamentos em dados complexos. At? o momento, diversas t?cnicas foram propostas mas, em geral, elas s? conseguem atingir bons resultados dentro de dom?nios espec?ficos, n?o permitindo, dessa maneira, que exista um consenso de qual seria a melhor forma para agrupar dados. Essas t?cnicas costumam falhar por fazer suposi??es nem sempre realistas sobre a distribui??o de probabilidade que modela os dados. Com base nisso, o trabalho proposto neste documento cria uma nova medida baseada no Potencial de Informa??o Cruzado que utiliza pontos representativos do conjunto de dados e a estat?stica extra?da diretamente deles para medir a intera??o entre grupos. A abordagem proposta permite usar todas as vantagens desse descritor de informa??o e contorna as limita??es impostas a ele pela sua pr?pria forma de funcionamento. A partir disso, duas fun??es custo de otimiza??o e tr?s algoritmos foram constru?dos para realizar a an?lise de agrupamentos. Como o uso de Teoria da Informa??o permite capturar a rela??o entre diferentes padr?es, independentemente de suposi??es sobre a natureza dessa rela??o, a abordagem proposta foi capaz de obter um desempenho superior aos principais algoritmos citados na literatura. Esses resultados valem tanto para o contexto de dados sint?ticos desenvolvidos para testar os algoritmos em situa??es espec?ficas quanto em dados extra?dos de problemas reais de diferentes naturezas CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA
809	Comunidades e fatores de virulência bacterianos na cavidde bucal de pacientes infantis com infecções endodônticas em dentes decíduos Sarmento, Naelka January 2017 (has links) A presente tese teve como objetivo realizar a descrição dos microrganismos que já foram isolados ou detectados em infecções endodônticas de dentes decíduos em pacientes infantis por meio de uma revisão sistemática, além de avaliar a composição bacteriana e a presença de genes de resistência a antibióticos em amostras de saliva (S), biofilme supragengival (SB), dentina (D) e câmara pulpar (RC) de dentes decíduos com infecções endodônticas. No Capítulo 1, realizou-se revisão sistemática em bancos de dados eletrônicos, tendo sido incluídos estudos clínicos que avaliaram presença de microrganismos em dentes decíduos com infecções endodônticas, por meio de análise microbiológica com cultivo ou de métodos moleculares. Foi realizada análise descritiva dos dados. A análise identificou 44 títulos, sendo revisados, na íntegra, 17 artigos. Foram selecionados 8 estudos clínicos, de acordo com os critérios de inclusão determinados. Por meio de busca manual, foram selecionados 2 artigos adicionais, totalizando 11 artigos excluídos desta revisão. Nos oito estudos clínicos incluídos na revisão sistemática, a identificação dos microrganismos envolvidos nas infecções endodônticas foi realizada por meio de várias técnicas como: cultura microbiológica, hibridização DNA-DNA, PCR e suas variações, clonagem, sequenciamento e pirossequenciamento, confirmando a diversidade de microrganismos envolvidos nas infecções endodônticas de dentes decíduos. A análise dos dados sugere que as infecções endodônticas em dentes decíduos são causadas por múltiplas combinações de espécies de micro-organismos, confirmando a sua natureza polibacteriana. No Capítulo 2, amostras de S, SB, D e RC foram coletadas de pacientes infantis com infecções endodônticas. O perfil das comunidades microbianas foram obtidos por meio da análise da região espaçadora intergênica relacionada aos genes 16S e 23S rRNA (PCR-RISA). Determinaram-se e índices de riqueza, dominância, índice de Shannon, índice de Chao-1 (alfa-diversidade) e análise multivariada de conglomerados (método UPGMA e índice de Similaridade de Bray-Curtis) e análise de coordenadas principais (PCoA) (beta-diversidade). Há um baixo grau de agrupamento entre as amostras de S, BS, D e RC, obtidas de um mesmo participante. Se presentes, os agrupamentos acontecem para sítios contíguos, mas com baixo percentual de similaridade. Amostras de um mesmo ecossistema obtidas de diferentes participantes abrigam comunidades bacterianas distintas, com baixa similaridade. Não parece haver uma relação entre a presença de um sinal/sintoma clínico e acréscimo no perfil de similaridade das comunidades bacterianas em RC. Não foram observadas diferenças estatisticamente significativas entre os índices de alfa-diversidade (riqueza, dominância, Shannon e Chao-1) entre S, SB, D e RC. O uso prévio de antibióticos não modificou os resultados de alfa diversidade ou de beta diversidade obtidos. No Capítulo 3, verificou-se a distribuição dos genes de resistência bacteriana aos principais grupos de antibióticos em S, SB, D, e RC dentes decíduos em pacientes infantis com infecções endodônticas e também de amostras de saliva dos responsáveis (R) por meio de PCR para os genes cfxA/cfxA2, blaTEM, blaZ, ampC, mecA, mefA, ermB, ermC, tetQ, tetM, tetW, linB, lsaB. Realizou-se análise estatística descritiva e análise multivariada de conglomerados (método UPGMA e índice de Similaridade de Bray-Curtis). Dos pacientes selecionados, 3/8 utilizaram antibiótico previamente à coleta. Nenhum gene de resistência foi observado em todos os ecossistemas de um mesmo participante. Os genes mais frequentemente detectados foram os genes de resistência à tetraciclina tetQ e tetW. Não foram detectados nas amostras os genes ampC, mecA, lnuB e lsaB. A presença simultânea de um gene em dois nichos ocorre em ecossistemas contíguos. Não se observa um comportamento uniforme quanto ao perfil de agrupamento de diferentes amostras de um mesmo participante, e nem entre as amostras de saliva do participante infantil (S) e seu responsável (R). Há múltiplos perfis de distribuição de genes de resistência a agentes antimicrobianos em amostras de ecossistemas bucais contíguos em um mesmo paciente portador de infecção endodôntica. A análise conjunta dos dados permite concluir que cada um dos ecossistemas da cavidade bucal de crianças portadoras de infecções endodônticas avaliado apresenta espécies bacterianas e fatores de virulência distribuídos de forma única e distintas, a partir de uma perspectiva de análise de diversidade. / This thesis aimed assessing information on the bacteria that were isolated/detected in teeth with endodontic infections from infant patients through a sistematic review of the literature. Furthermore, the bacterial composition and the presence of resistance genes to antimicrobial agents was determined in saliva (S), in supragengival biofilm (SB), in dentine (D) and in pulp cavity (RC) samples. In the Chapter 1, a sistematic review was conducted in electronic databasis. Clinical studies that evaluated the presence of microorganims in primary teeth with endodontic infections through culture and molecular methods were included. Fourty-four titles were selected and 17 articles were fully revised. Eight clinical studies were selected for data extraction. Two articles were included following the hand search. According to the data analysis, microbial identification was performed by culture, DNA-DNA hybridization, PCR, cloning and sequencing and next-generation sequencing methods. A high diversity in the microbial components identificated/detected was reported. Endodontic infections in primary teeth are polymicrobial, with a multi-species consortia. In the Chapter 2, the S, SB, D and RC samples were collected from infanti patients with endodontic infections. The ribossomal intergenic spacer analysis (PCR-RISA) for the 16S-23S rRNA genes interspacer region was employed to determine the bacterial fingerprint for each sample. Metrics for alfa and beta diversity were employed, such as richness, dominance, Shannon Index, Chao-1 Index, cluster analysis (UPGMA, Bray-Curtis Index) and principal coordenate analysis (PCoA). There was a low grouping profile for shared samples of S, BS, D and RC from the same participant. When detected, clustering behavior was observed for contiguous sites, with low percentual of similarity between them. Samples from the same site but from different subjects harboured distinct bacterial communities, with low similarity. No clinical sign/symptom was detected as a grouping factor for RC sample from different subjects. No statistical difference was detected for the alfa-diversity indexes among S, SB, D and RC. The previous exposition to antimicrobial agentes has no effect over the alfa- and beta-diversity indexes. In the Chapter 3, the distribution of bacterial genes for antimicrobial resistance in S, SB, D and RC was determined in samples from children with endodontic infections and their relatives (R) by PCR. The presence of the genes cfxA/cfxA2, blaTEM, blaZ, ampC, mecA, mefA, ermB, ermC, tetQ, tetM, tetW, linB, and lsaB was detecte in the samples. Descriptive statistical analysis and multivariate anlysis (cluster analyisis, UPGMA and Bray-Curtis similarity index) were carried out. Three out of 8 patients had antimicrobial agents previously to the apointment. No resistance gene was shared by all envirnments in the same participant. The most frequently detected genes were tetQ and tetW. The genes ampC, mecA, lnuB, and lsaB were not detected in any the samples. The same gene was detected only in two contiguous niches. Clustering analysis revealed no grouping pattern among the samples, despite they were or not from the same participant or his/her relative. Multiple profiles of resistance genes distribution were detected in the oral cavity samples from infant participants. The oral cavity in children with endodontic infection is a complex environment that harbours unique bacterial communities profiles and a distinct distribution of resitance genes to antimicrobial agents, considering an ecological perspective. Endodontia Dente decíduo Infecção Mouth Microbial ecology Drug resistance Polymerase chain reaction Bacteria Sistematic review Pulp cavity Primary teeth Saliva Cluster analysis Diversity
810	Extraction d'informations textuelles au sein de documents numérisés : cas des factures / Extracting textual information within scanned documents : case of invoices Pitou, Cynthia 28 September 2017 (has links) Le traitement automatique de documents consiste en la transformation dans un format compréhensible par un système informatique de données présentes au sein de documents et compréhensibles par l'Homme. L'analyse de document et la compréhension de documents sont les deux phases du processus de traitement automatique de documents. Étant donnée une image de document constituée de mots, de lignes et d'objets graphiques tels que des logos, l'analyse de documents consiste à extraire et isoler les mots, les lignes et les objets, puis à les regrouper au sein de blocs. Les différents blocs ainsi formés constituent la structure géométrique du document. La compréhension de documents fait correspondre à cette structure géométrique une structure logique en considérant des liaisons logiques (à gauche, à droite, au-dessus, en-dessous) entre les objets du document. Un système de traitement de documents doit être capable de : (i) localiser une information textuelle, (ii) identifier si cette information est pertinente par rapport aux autres informations contenues dans le document, (iii) extraire cette information dans un format compréhensible par un programme informatique. Pour la réalisation d'un tel système, les difficultés à surmonter sont liées à la variabilité des caractéristiques de documents, telles que le type (facture, formulaire, devis, rapport, etc.), la mise en page (police, style, agencement), la langue, la typographie et la qualité de numérisation du document. Dans ce mémoire, nous considérons en particulier des documents numérisés, également connus sous le nom d'images de documents. Plus précisément, nous nous intéressons à la localisation d'informations textuelles au sein d'images de factures, afin de les extraire à l'aide d'un moteur de reconnaissance de caractères. Les factures sont des documents très utilisés mais non standards. En effet, elles contiennent des informations obligatoires (le numéro de facture, le numéro siret de l'émetteur, les montants, etc.) qui, selon l'émetteur, peuvent être localisées à des endroits différents. Les contributions présentées dans ce mémoire s'inscrivent dans le cadre de la localisation et de l'extraction d'informations textuelles fondées sur des régions identifiées au sein d'une image de document.Tout d'abord, nous présentons une approche de décomposition d'une image de documents en sous-régions fondée sur la décomposition quadtree. Le principe de cette approche est de décomposer une image de documents en quatre sous-régions, de manière récursive, jusqu'à ce qu'une information textuelle d'intérêt soit extraite à l'aide d'un moteur de reconnaissance de caractères. La méthode fondée sur cette approche, que nous proposons, permet de déterminer efficacement les régions contenant une information d'intérêt à extraire.Dans une autre approche, incrémentale et plus flexible, nous proposons un système d'extraction d'informations textuelles qui consiste en un ensemble de régions prototypes et de chemins pour parcourir ces régions prototypes. Le cycle de vie de ce système comprend cinq étapes:- Construction d'un jeu de données synthétiques à partir d'images de factures réelles contenant les informations d'intérêts.- Partitionnement des données produites.- Détermination des régions prototypes à partir de la partition obtenue.- Détermination des chemins pour parcourir les régions prototypes, à partir du treillis de concepts d'un contexte formel convenablement construit.- Mise à jour du système de manière incrémentale suite à l'insertion de nouvelles données / Document processing is the transformation of a human understandable data in a computer system understandable format. Document analysis and understanding are the two phases of document processing. Considering a document containing lines, words and graphical objects such as logos, the analysis of such a document consists in extracting and isolating the words, lines and objects and then grouping them into blocks. The subsystem of document understanding builds relationships (to the right, left, above, below) between the blocks. A document processing system must be able to: locate textual information, identify if that information is relevant comparatively to other information contained in the document, extract that information in a computer system understandable format. For the realization of such a system, major difficulties arise from the variability of the documents characteristics, such as: the type (invoice, form, quotation, report, etc.), the layout (font, style, disposition), the language, the typography and the quality of scanning.This work is concerned with scanned documents, also known as document images. We are particularly interested in locating textual information in invoice images. Invoices are largely used and well regulated documents, but not unified. They contain mandatory information (invoice number, unique identifier of the issuing company, VAT amount, net amount, etc.) which, depending on the issuer, can take various locations in the document. The present work is in the framework of region-based textual information localization and extraction.First, we present a region-based method guided by quadtree decomposition. The principle of the method is to decompose the images of documents in four equals regions and each regions in four new regions and so on. Then, with a free optical character recognition (OCR) engine, we try to extract precise textual information in each region. A region containing a number of expected textual information is not decomposed further. Our method allows to determine accurately in document images, the regions containing text information that one wants to locate and retrieve quickly and efficiently.In another approach, we propose a textual information extraction model consisting in a set of prototype regions along with pathways for browsing through these prototype regions. The life cycle of the model comprises five steps:- Produce synthetic invoice data from real-world invoice images containing the textual information of interest, along with their spatial positions.- Partition the produced data.- Derive the prototype regions from the obtained partition clusters.- Derive pathways for browsing through the prototype regions, from the concept lattice of a suitably defined formal context.- Update incrementally the set of protype regions and the set of pathways, when one has to add additional data. Traitement automatique de documents Extraction de texte Décomposition quadtree Classification non supervisée Analyse formelle de concepts Treillis de concepts Document processing Text extraction Quadtree decomposition Cluster analysis Formal concept analysis Concept lattice

Search results