Global ETD Search

151	Múltiplas visões coordenadas para exploração de mapas de similaridade / Coordinated and multiple views for exploration of similarity maps Eler, Danilo Medeiros 18 March 2011 Atualmente, diversas áreas de aplicação necessitam de mecanismos mais efetivos para analisar dados provenientes de naturezas distintas. Tipicamente, esses dados são abstratos, não estruturados e possuem uma natureza multidimensional (e.g., coleções de documentos). Dados que não possuem uma natureza multidimensional podem ser representados como tal por meio da aplicação de algoritmos extratores de características (e.g., coleções de imagens). Assim, técnicas de visualização de informação projetadas para interpretar dados multidimensionais podem ser aproveitadas para analisar dados não estruturados. Esta tese empregou técnicas de visualização de informação para construir mapas de similaridade a partir de dados multidimensionais como uma forma de representação desses dados, uma vez que as técnicas para construí-los têm evoluído com a expansão dos campos de aplicação. Novas técnicas para coordenação de múltiplas visões foram desenvolvidas para permitir a exploração de conjuntos de dados, a partir de mapas de similaridade gerados por diferentes técnicas de construção de mapas, diferentes parâmetros ou ainda diferentes conjuntos de dados. As técnicas de coordenação desenvolvidas são baseadas em identificador, em distância, em tópicos, na identificação de tópicos em coleções que evoluem no tempo, e em uma técnica que combina o mapeamento de diferentes técnicas de coordenação. Esta tese também apresenta aplicações das técnicas de coordenação desenvolvidas e das ferramentas construídas para análise de coleções de documentos, coleções de imagens e dados volumétricos, empregando coordenações de mapas de similaridade. As técnicas de coordenação desenvolvidas são apoiadas por um modelo de coordenação que estende um modelo previamente proposto na literatura. 
O modelo estendido permite a configuração de técnicas de coordenação durante a exploração, admitindo diferentes tipos de mapeamentos. Uma característica importante do modelo é permitir o desenvolvimento de mapeamentos dinâmicos para técnicas de coordenação, isto é, mapeamentos que podem mudar o comportamento de acordo com a interação do usuário. Como resultado desta tese, está disponível um arcabouço para visualização coordenada de múltiplos mapas de similaridade, composto por um modelo, um conjunto de técnicas e um conjunto de ferramentas que efetivamente permitem a análise visual de conjuntos de dados multidimensionais. / Currently, various fields of application need more effective mechanisms to analyse data of differing natures. Typically, these data are abstract, unstructured and multidimensional (e.g. document collections). Data that lack a multidimensional description can be represented as such by means of feature extraction algorithms (e.g. image collections). Thus, information visualization techniques designed to interpret multidimensional data sets can be employed to analyse unstructured data. This thesis employed information visualization techniques that build similarity maps from multidimensional data as a form of data representation, since the techniques for constructing them have evolved as their fields of application have expanded. Novel techniques for coordinating multiple views were developed that allow data sets to be explored through similarity maps generated by different map-building techniques, different parameters or even different data sets. The coordination techniques developed are based on identity relationships, on distance relationships, on topic coverage (for text or other annotated data) and on the evolution of topic coverage (also for text). An approach that combines different coordination techniques was also developed. 
This thesis also reports on applications of the coordination techniques developed, and on tools built for the analysis of image, text and volumetric data employing coordinated similarity maps. The techniques developed in this work are supported by a coordination model that extends a model previously proposed in the literature. The extended model allows coordination techniques to be defined and configured during coordination tasks and supports various types of mappings. An important feature of the model is its support for dynamic mappings, that is, mappings whose behavior may change according to user interaction. The result of this thesis is a framework for coordinated visualization of multiple similarity maps, composed of a model, a set of techniques and a set of implemented tools that effectively support the visual analysis of multidimensional data sets. Mapas de similaridade Múltiplas visões coordenadas
152	Estudo da similaridade imperfeita em estruturas sujeitas a carregamentos de impacto. / Study of imperfect similarity in structures subjected to impact loadings. Oshiro, Roberto Eiki 09 June 2010 As leis usuais de redução de escala não produzem bons resultados em estruturas sujeitas a cargas de impacto, gerando uma semelhança imperfeita entre modelo e protótipo. Neste trabalho, utiliza-se a técnica de similaridade não direta através da alteração da velocidade inicial do corpo de impacto, gerando uma resposta do modelo idêntica à do protótipo. Três fatores que contribuem para a resposta não similar da estrutura em escala são estudados nessa tese: taxa de deformação e modelo com parâmetros geométricos e de material distorcidos em relação ao protótipo. Além disso, mostra-se como a técnica proposta pode ser usada para correção das distorções através da mudança da massa de impacto. Considerando-se todos esses elementos, um procedimento abrangente e simples que gera um modelo com comportamento similar ao do protótipo é criado. Para corroborar as hipóteses levantadas durante a tese e estudar o método de correção, três problemas analíticos e dois problemas numéricos são explorados. Em todas as análises, os resultados mostram uma melhora significativa na semelhança entre modelo e protótipo após a aplicação do método de correção apresentado. Ao longo do trabalho, as vantagens e limitações das técnicas desenvolvidas e as principais diferenças em relação a trabalhos anteriores são detidamente discutidas. / Current scaling laws are not capable of predicting the structural impact response of prototypes from the behavior of the corresponding scaled models. Here, the non-direct similitude technique is employed by changing the initial velocity of the impacting body so that model and prototype behave the same. 
Three main factors that contribute to the non-similar response of a scaled structure are investigated: strain rate, and model geometry and material parameters that are distorted in relation to the prototype. Moreover, it is shown how the proposed technique can be applied by altering the impact mass instead of its velocity. By considering all these aspects, a comprehensive and simple procedure is created that generates models similar to a given prototype. Three analytical and two numerical problems are used to present the main features of the technique. In all the cases analyzed, once the correction was applied, the behavior of the structure under analysis could be accurately predicted from the response of the model. Throughout this work, the limitations and advantages of the method are emphasized with reference to other published works. Escala Structure Estrutura Impact Impacto Scale Similaridade Similarity
153	Estudo e implementação de um gerador de tráfego com dependência de longa duração. / Study and implementation of a network traffic generator with long range dependency. Mello, Fernando Lemos de 10 November 2006 Medidas mostraram que o tráfego das redes multisserviço possui propriedades fractais tais como auto-similaridade e memória longa ou dependência de longa duração (LRD). A memória longa é caracterizada pela existência de um pólo na origem da função densidade espectral de potência (formato 1/f). Também foi constatado que o tráfego pode apresentar dependência de curta duração (SRD) em algumas escalas temporais. A utilização de um gerador de tráfego agregado "realista", que sintetize séries temporais fractais, é fundamental para a validação de algoritmos de controle de tráfego. Neste trabalho, a síntese de realizações aproximadas de dois tipos de processos aleatórios auto-similares é efetuada via transformada wavelet. O primeiro deles é denominado Ruído Gaussiano Fracionário (fGN) e o segundo Modelo Wavelet Multifractal (MWM). O método proposto também é capaz de sintetizar séries Gaussianas (fGN) e não-Gaussianas (MWM) com espectros mais genéricos do que 1/f, ou seja, séries que também apresentam dependência de curta duração. A geração é feita em dois estágios. O primeiro gera uma realização aproximada do fGN ou do MWM via Transformada Wavelet Discreta (DWT). O segundo estágio introduz SRD através de uma filtragem IIR da saída do primeiro estágio. Efetuou-se uma caracterização detalhada das séries resultantes, utilizando-se nas análises momentos estatísticos de 2ª., 3ª. e 4ª. ordens, além de testes estatísticos específicos para séries auto-similares. Adicionalmente, duas alternativas de conversão são apresentadas para que as séries temporais geradas sejam transformadas em séries de pacotes, que é o formato adequado para transmissão por um módulo gerador de pacotes. 
As séries de pacotes são novamente analisadas a fim de identificar se o método de conversão introduz distorção nas características auto-similares das séries sintetizadas. Mostra-se que as séries de pacotes auto-similares podem ser utilizadas em softwares simuladores de rede ou, alternativamente, serem utilizadas para injetar pacotes em redes de teste. Utilizando-se recursos do simulador NS-2, as séries de pacotes sintetizadas foram introduzidas em cenários de simulação adequados. Os resultados (medidas de atraso médio, perda de pacotes para o tráfego de interesse e tamanho da fila) dos cenários com tráfego interferente correspondente às séries de pacotes baseadas em modelos fGN e MWM foram comparados com resultados obtidos em cenários cujo tráfego interferente foi gerado com modelo Poisson. / Measurements have shown that multiservice network traffic has fractal properties such as self-similarity and long memory, or long-range dependence (LRD). Long memory is characterized by the existence of a pole at the origin of the power spectral density function (1/f shape). It has also been observed that traffic may present short-range dependence (SRD) at some time scales. The use of a "realistic" aggregate network traffic generator, one that synthesizes fractal time series, is fundamental to the validation of traffic control algorithms. In this work, approximate realizations of two kinds of self-similar random processes are synthesized via the wavelet transform. The first is Fractional Gaussian Noise (fGN) and the second is the Multifractal Wavelet Model (MWM). The proposed method is also capable of synthesizing Gaussian (fGN) and non-Gaussian (MWM) time series with spectra more general than 1/f, that is, time series that also exhibit short-range dependence. The generation is done in two stages. The first generates an approximate realization of fGN or MWM via the Discrete Wavelet Transform (DWT). 
The second introduces SRD through Infinite Impulse Response (IIR) filtering of the output of the first stage. A detailed characterization of the resulting series was carried out, using statistical moments of the first, second, third and fourth orders, as well as statistical tests specific to self-similar series. Additionally, two conversion alternatives are presented for transforming the synthesized time series into packet series, which is the format suitable for transmission by a packet generator module. The packet series are analyzed in turn to determine whether the conversion method introduces distortion in the self-similar characteristics of the synthesized series. It is shown that the self-similar packet series can be used in network simulation software or, alternatively, to inject packets into a testbed network. Using resources from the NS-2 simulator, the synthesized packet series were introduced into appropriate simulation scenarios. The results (average delay, packet loss for the traffic of interest and queue length) from scenarios with interfering traffic corresponding to the packet series based on the fGN and MWM models were compared with results from scenarios in which the interfering traffic was generated by a Poisson model. Auto-similaridade Gerador de tráfego Self-similarity Traffic generation Wavelets Wavelets
154	Tratamento de condições especiais para busca por similaridade em bancos de dados complexos / Treatment of special conditional for similarity searching in complex data bases Kaster, Daniel dos Santos 23 April 2012 A quantidade de dados complexos (imagens, vídeos, séries temporais e outros) tem crescido rapidamente. Dados complexos são adequados para serem recuperados por similaridade, o que significa definir consultas de acordo com um dado critério de similaridade. Além disso, dados complexos usualmente são associados com outras informações, geralmente de tipos de dados convencionais, que devem ser utilizadas em conjunto com operações por similaridade para responder a consultas complexas. Vários trabalhos propuseram técnicas para busca por similaridade, entretanto, a maioria das abordagens não foi concebida para ser integrada com um SGBD, tratando consultas por similaridade como operações isoladas, dissociadas do processador de consultas. O objetivo principal desta tese é propor alternativas algébricas, estruturas de dados e algoritmos para permitir um uso abrangente de consultas por similaridade associadas às demais operações de busca disponibilizadas pelos SGBDs relacionais e executar essas consultas compostas eficientemente. Para alcançar este objetivo, este trabalho apresenta duas contribuições principais. A primeira contribuição é a proposta de uma nova operação por similaridade, chamada consulta aos k-vizinhos mais próximos estendida com condições (ck-NNq), que estende a consulta aos k-vizinhos mais próximos (k-NNq) de maneira a fornecer uma condição adicional, modificando a semântica da operação. A operação proposta permite representar consultas demandadas por várias aplicações, que não eram capazes de ser representadas anteriormente, e permite homogeneamente integrar condições de filtragem complementares à k-NNq. 
A segunda contribuição é o desenvolvimento do FMI-SiR (user-defined Features, Metrics and Indexes for Similarity Retrieval), que é um módulo de banco de dados que permite executar consultas por similaridade integradas às demais operações do SGBD. O módulo permite incluir métodos de extração de características e funções de distância definidos pelo usuário no núcleo do gerenciador de banco de dados, fornecendo grande flexibilidade, e também possui um tratamento especial para imagens médicas. Além disso, foi verificado através de experimentos sobre bancos de dados reais que a implementação do FMI-SiR sobre o SGBD Oracle é capaz de consultar eficientemente grandes bancos de dados complexos. / The amount of complex data (images, videos, time series and others) has been growing at a very fast pace. Complex data are well suited to being searched by similarity, which means defining queries according to a given similarity criterion. Moreover, complex data are usually associated with other information, generally of conventional data types, which must be employed in conjunction with similarity operations to answer complex queries. Several works have proposed techniques for similarity searching; however, most approaches were not designed to be integrated into a DBMS, and treat similarity queries as isolated operations detached from the query processor. The main objective of this thesis is to propose algebraic alternatives, data structures and algorithms that allow wide use of similarity queries combined with the other search operations provided by relational DBMSs, and that execute such composite queries efficiently. To reach this goal, this work presents two main contributions. The first contribution is the proposal of a new similarity operation, called the condition-extended k-Nearest Neighbor query (ck-NNq), which extends the k-Nearest Neighbor query (k-NNq) with an additional condition, modifying the semantics of the operation. 
The proposed operation can represent queries required by several applications that could not be represented before, and allows complementary filtering conditions to be integrated homogeneously into the k-NNq. The second contribution is the development of FMI-SiR (user-defined Features, Metrics and Indexes for Similarity Retrieval), a database module that allows similarity queries to be executed in integration with the other DBMS operations. The module allows user-defined feature extraction methods and distance functions to be included in the database core, providing great flexibility, and also has special support for medical images. Moreover, experiments over real datasets verified that the implementation of FMI-SiR over the Oracle DBMS is able to search very large complex databases efficiently. Banco de dados Consultas por similaridade Multimedia databases Multimídia Similarity queries
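The abstract above does not spell out the semantics of the ck-NNq, so the following is only a minimal sketch under one assumed reading: the condition is evaluated inside the nearest-neighbor search rather than applied to its result afterwards. The record layout, the `modality` attribute and all function names are hypothetical and are not part of FMI-SiR or of the thesis.

```python
import heapq
import math

# Hypothetical records: a feature vector plus one conventional attribute.
RECORDS = [
    {"id": 1, "features": (0.0, 0.0), "modality": "CT"},
    {"id": 2, "features": (0.1, 0.0), "modality": "MR"},
    {"id": 3, "features": (0.2, 0.0), "modality": "MR"},
    {"id": 4, "features": (0.5, 0.0), "modality": "CT"},
    {"id": 5, "features": (0.9, 0.0), "modality": "CT"},
]


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn(records, query, k):
    """Plain k-NN: the k records whose features are closest to the query."""
    return heapq.nsmallest(k, records, key=lambda r: euclidean(r["features"], query))


def knn_then_filter(records, query, k, condition):
    """Condition applied AFTER the k-NN: may return fewer than k records."""
    return [r for r in knn(records, query, k) if condition(r)]


def conditioned_knn(records, query, k, condition):
    """Assumed ck-NN reading: k nearest among the records meeting the condition."""
    return knn((r for r in records if condition(r)), query, k)
```

With the sample data, a query at (0.0, 0.0) with k = 2 and the condition `modality == "CT"` keeps only record 1 when the filter is applied after the search, but returns records 1 and 4 when the condition is part of the search. This difference in results is one way an added condition can change the semantics of the operation.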
155	An Interpersonal Approach to Social Preference: Examining Patterns and Influences of Liking and Being Bothered by Interpersonal Behaviors of Others Tianwei Du (6619103) 10 June 2019 Interpersonal researchers have primarily assessed interpersonal behaviors using self-ratings of one’s own behaviors and third-person ratings of dyadic interactions. Only a limited number of studies have examined how individuals perceive others’ interpersonal behaviors in social situations. Using a sample of 470 undergraduate students, we examined patterns of liking and being bothered by others’ interpersonal behaviors as well as the influences of these patterns on individuals’ psychological functioning. Our findings showed that people tend to like interpersonal behaviors that are the most similar to their own and to be bothered by behaviors that are the least similar to their own. This pattern is more characteristic of the warmth dimension than of the dominance dimension and is consistent across different levels of intimacy between the evaluator and the subject being evaluated. We also found small but significant effects of interpersonal preference on social support, interpersonal problems, negative affect, and detachment, above and beyond the effects of individuals’ own interpersonal traits. The findings suggest that perception of others’ interpersonal behaviors relates specifically to one’s own interpersonal traits, and that these patterns of interpersonal perception have unique associations with one’s own affective and interpersonal experiences. Such findings highlight the importance of including perception of others in investigating interpersonal dynamics. Clinical Psychology Interpersonal circumplex liking similarity social perception
156	Diversidade e Padrões de Distribuição de Mamíferos dos Pampas do Uruguai e Brasil / Diversity and distributional patterns of pampean mammals of Uruguay and Brazil Morató, Diego Queirolo 02 July 2009 Pela primeira vez considerou-se a fauna de mamíferos dos Pampas do sul do Brasil e do Uruguai como um todo, independentemente de fronteiras políticas. Primeiramente, foi obtida informação sobre ocorrência das espécies na área de estudo a partir de diferentes fontes e logo se elaboraram mapas de distribuição para todas elas. Na seqüência, tratou-se de determinar por meio de análises quantitativas o padrão de distribuição geográfica das mesmas realizando, primeiramente, uma análise entre sub-regiões (UGOs) dentro da área de estudo, para logo compará-la com regiões vizinhas e, por último, com ambientes similares distribuídos em outros continentes. Também, analisou-se preliminarmente o estado de conservação das espécies de mamíferos que ocorrem dentro da área de estudo. Foi obtida informação de coleções científicas (2.080 registros), de literatura (439 registros de resumos de reuniões científicas e 868 de revistas periódicas, livros, dissertações, teses e relatórios), 63 de observação ou comunicação pessoal de diferentes pesquisadores e o restante de outras fontes. No total, consideraram-se 3.522 registros (1.738 no Uruguai e 1.784 Brasil) distribuídos em 1.041 localidades. Foram identificadas 125 espécies (80 para o Uruguai e 117 para o Rio Grande do Sul), sete delas são endêmicas e seis são consideradas como extintas. Os roedores junto aos quirópteros e marsupiais somam 68,6% dos gêneros e os dois primeiros também ultrapassam 60% do total de espécies. As maiores riquezas foram encontradas naquelas áreas mais estudadas. A análise de agrupamento das UGOs mostrou dois grupos bem definidos, um conformado pelo Uruguai e pela região da Campanha Gaúcha e o outro, pelo litoral Atlântico, leste e centro do Rio Grande do Sul. 
Em relação às regiões vizinhas, observou-se claramente um grupo formado pelas Províncias florestais, onde está incluída a área de estudo. Na análise das ecorregiões o grupo conformado por aquelas de origem tropical e/ou florestal também foi o que incluiu a área de estudo. Existiram diferenças significativas nos hábitos de locomoção e marginalmente significativas nos hábitos alimentares em comparação com as regiões vizinhas. No entanto, na comparação dos hábitos alimentares com outros ambientes similares de campos temperados distribuídos em outros continentes, sim houve diferença significativa indicando pouca semelhança entre a estrutura das comunidades. Por último, 31 espécies foram consideradas ameaçadas nos Pampas do Rio Grande do Sul e 21 no Uruguai, resultando num total de 47 para toda a região (37,6% do total). Este trabalho pretendeu colaborar com a geração de informação básica fundamental para a formulação de políticas de conservação que contemplem toda a região independentemente dos países que a compõem. / For the first time, the mammalian fauna of the Pampas of Uruguay and southern Brazil was considered as a whole, regardless of political borders. Information on species occurrence in the study area was obtained from different sources, and a distribution map was produced for each species. We then identified the pattern of geographical distribution of mammals in the region using quantitative analyses. First, we analyzed the sub-regions (OGUs) within the study area; we then compared the study area with neighbouring regions, and finally with similar environments on other continents. We also carried out a preliminary analysis of the conservation status of the species recorded. Information was obtained from scientific collections (2,080 records), from the literature (439 records from abstracts of scientific meetings and 868 from journals, books, dissertations, theses and reports), 63 from personal communications or observations, and the rest from other sources. 
In total, 3,522 records were considered (1,738 in Uruguay and 1,784 in Brazil), from 1,041 different localities. One hundred and twenty-five species were identified (80 in Uruguay and 117 in Rio Grande do Sul); seven of them are endemic and six are considered extinct. Rodents, together with bats and marsupials, account for 68.6% of the genera found, and the first two groups also exceed 60% of the total number of species. The greatest richness was found in the most studied areas. Cluster analysis showed two well-defined groups of OGUs: (1) Uruguay and the Campanha Gaúcha region, and (2) the Atlantic coast, east and center of Rio Grande do Sul State. Concerning the neighbouring regions, we observed a group formed by the forest Provinces, in which the study area was included. The Ecoregions analysis showed three groups, and the one composed of forest and/or tropical regions also included the study area. There were significant differences in locomotor habits, and marginally significant differences in diet, when comparing community structure with neighbouring regions. However, when comparing diet with similar temperate grassland environments on other continents, there was a significant difference, which indicates little similarity between the mammalian communities. Finally, we identified 31 threatened species in the Pampas of Rio Grande do Sul and 21 in Uruguay, for a total of 47 threatened species in the region (37.6% of the total). This work aims to contribute basic information essential to formulating conservation policies that consider the entire region, regardless of the countries that compose it. Biogeografia Biogeography Distribuição Distribution Mamíferos Mammals Pampas Pampas Similaridade Similarity
157	Blow-up and global similarity solutions for semilinear third-order dispersive PDEs Koçak, Hüseyin January 2015 No description available. 510
158	Similarity Reasoning over Semantic Context-Graphs Boteanu, Adrian 26 August 2015 Similarity is a central cognitive mechanism for humans which enables a broad range of perceptual and abstraction processes, including recognizing and categorizing objects, drawing parallels, and predicting outcomes. It has been studied computationally through models designed to replicate human judgment. The work presented in this dissertation leverages general-purpose semantic networks to derive similarity measures in a problem-independent manner. We model both general and relational similarity using connectivity between concepts within semantic networks. Our first contribution is to model general similarity using concept connectivity, which we use to partition vocabularies into topics without the need for document corpora. We apply this model to derive topics from unstructured dialog, specifically enabling an early literacy primer application to support parents in having better conversations with their young children as they use the primer together. Second, we model relational similarity in proportional analogies. To do so, we derive relational parallelism by searching semantic networks for similar path pairs that connect either side of the analogy statement. We then derive human-readable explanations from the resulting similar path pair. We show that our model can answer broad-vocabulary analogy questions designed for human test takers with high confidence. The third contribution is to enable symbolic plan repair in robot planning through object substitution. When a failure occurs due to unforeseen changes in the environment, such as missing objects, we enable the planning domain to be extended with a number of alternative objects so that the plan can be repaired and execution can continue. To evaluate this type of similarity, we use both general and relational similarity. 
We demonstrate that the task context is essential in establishing which objects are interchangeable. Analogy Robot Tasks Plan Repair Topic Modeling Semantic Similarity
159	Direct Demonstration of Self-Similarity in a Hydrodynamic Treatment of Polymer Self-Diffusion Merriam, Susan Carol 01 May 2002 The self-diffusion coefficient of a polymer in solution may be expanded in the concentration $c$ of the polymer, as seen in equation 1. The linear term would represent a perturbation due to the presence of another polymer; the $c^{2}$ term would represent a perturbation due to interactions of trios of polymers. Phillies determined the $c^{2}$ term of a virial expansion of the self-diffusion coefficient for trios of polymers interacting via a ring. Here I determine a correction to the $c^{2}$ term due to trios of polymers interacting via a figure-eight scattering diagram: the equivalent of four polymers interacting in a ring where the second polymer and the fourth polymer are the same. $D_{s}(c) = D_{0}(1 + \alpha D_{0} c + \beta D_{0}^{2} c^{2} + \ldots)$ (1) or, $D_{s}(c) = D_{0}(1 + \alpha D_{s}(c) c)$ (2). A $D_{0}$ may be replaced by $D_{s}(c)$ in equation 1 to arrive at equation 2. The left-hand side of equation 2 is the final self-diffusion coefficient, and the $D_{s}(c)$ on the right-hand side of this equation is the term at issue in the question of self-similarity. If the $D_{s}(c)$ on the right-hand side is given by equation 1, resulting in $\beta = \alpha^{2}$, the system may be said to exhibit self-similarity. I demonstrate self-similarity quantitatively for a polymer solution using a generalized Kirkwood-Riseman model of polymer dynamics. The major physical assumption of the model I use to derive equation 2 is that, in solution, polymer motions are dominantly governed by hydrodynamic interactions between the chains. First, I review the Kirkwood-Riseman model for intrachain hydrodynamic interactions. I then discuss Phillies' extension of this model to interchain interactions for duos or trios of polymers in a ring. 
I analytically calculate the hydrodynamic interaction tensor $T_{54321}$ from a multiple-scattering picture for five polymers in solution and verify this tensor by numerical differentiation. Finally, I perform the ensemble average of the self-interaction tensor $b_{1232}$ appropriate to the figure-eight scattering diagram, both analytically and with a Monte Carlo routine, thereby verifying equation 2 to second order in concentration. self-similarity polymer self-diffusion hydrodynamic Polymers Hydrodynamics
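The step from equations 1 and 2 to the condition beta = alpha^{2} is stated in the abstract without the intermediate algebra. A short worked substitution, using only the symbols of the abstract and keeping terms up to second order in c, might read as follows; it illustrates the consistency condition and is not the thesis's own derivation.

```latex
% Insert expansion (1) for the D_s(c) on the right-hand side of (2):
\begin{align}
D_{s}(c) &= D_{0}\bigl(1 + \alpha\, c\, D_{s}(c)\bigr) \\
         &= D_{0}\Bigl(1 + \alpha\, c\, D_{0}\bigl(1 + \alpha D_{0} c + \beta D_{0}^{2} c^{2} + \ldots\bigr)\Bigr) \\
         &= D_{0}\bigl(1 + \alpha D_{0} c + \alpha^{2} D_{0}^{2} c^{2} + O(c^{3})\bigr).
\end{align}
% Matching the c^2 coefficient against (1) gives the self-similarity condition:
\begin{equation}
\beta = \alpha^{2}.
\end{equation}
```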
160	Estudos e desenvolvimento de métodos baseados em harmônicos esféricos para análise de similaridade estrutural entre ligantes / Study and development of spherical harmonics based methods for similarity ligand analysis Caires, Fernando Ribeiro 19 October 2016 Descritores moleculares são essenciais em muitas aplicações de física e química computacional, como na análise de similaridade entre ligantes baseada em sua estrutura. Harmônicos esféricos têm sido utilizados como descritores da superfície molecular por serem uma forma compacta de descrição geométrica e por possuírem um descritor invariante por rotação. Assim, este trabalho propõe um método de análise de similaridade estrutural entre ligantes no qual se modela a superfície de uma molécula através de uma expansão em harmônicos esféricos realizada pelo programa LIRA. Os coeficientes encontrados são utilizados para percorrer o banco de dados DUD-E, com descritores previamente calculados, utilizando Distância Euclidiana e diversos valores de corte para selecionar compostos mais semelhantes. O potencial do método é avaliado usando o Ultrafast Shape Recognition (USR) como método padrão, pelo fato de ser uma excelente e rápida métrica para análise da similaridade de ligantes. Foram selecionadas 50 moléculas de diferentes tamanhos e composição de forma a representar todos os grupos moleculares presentes na DUD-E. Em seguida, cada molécula foi submetida à busca de similares variando-se valores de corte para o LIRA em que o conjunto de moléculas selecionadas foi comparado com as selecionadas pelo USR através de um processo de classificação binária e criação e interpretação de curvas ROC. Além do benchmarking, foi realizada a análise das componentes principais para determinar quais descritores são os mais importantes e carregam as melhores informações utilizadas na descrição da superfície da molécula. 
A partir das componentes principais, foi realizado um estudo do uso de funções peso, associando mais importância aos descritores adequados, e a redução da dimensionalidade do banco de dados, seleção de um novo conjunto de autovetores que formam as bases do espaço vetorial e uma nova descrição das moléculas para o novo espaço, no qual cada variação foi avaliada através de um novo benchmarking. O LIRA se mostrou tão rápido quanto o USR e apresentou grande potencial de seleção de moléculas similares, para a maioria das moléculas testadas, pois as curvas ROC apresentaram pontos acima da linha do aleatório. Tanto a redução da dimensionalidade quanto o uso de funções de ponderação agregaram valor à métrica deixando-a mais veloz, no caso da redução da quantidade de descritores, e seletiva, em ambos os casos. Dessa forma, o método proposto se mostrou eficiente em mensurar a similaridade entre ligantes de forma seletiva e rápida utilizando somente informações a respeito da superfície molecular. / Molecular descriptors are essential for many applications in computational chemistry and physics, such as ligand-based similarity searching. Spherical harmonics have previously been suggested as comprehensive descriptors of molecular structure because of their orthonormality and rotational invariance. Here we propose a ligand similarity analysis method in which a molecule's surface is modeled by an expansion in spherical harmonics, performed by the LIRA program, whose coefficients are used to search the DUD-E database, with all descriptors calculated in advance, using Euclidean distance and different cutoff values to select similar compounds. The method's potential is evaluated in a benchmark against Ultrafast Shape Recognition (USR), because USR is an excellent and fast metric for ligand similarity analysis. Fifty molecules of varying chemical composition and size were selected to represent all molecular groups in DUD-E. 
Each molecule was then submitted to a similarity search with different cutoff values for LIRA, and the subset selected was compared with the one selected by USR through binary classification and ROC curve analysis. Beyond benchmarking, a principal component analysis was performed to identify which coefficients are the most valuable for shape description. Based on the principal components, two further studies were carried out: weight functions were applied to the descriptors, giving more importance to those that carry more information, and dimensionality reduction was performed, in which a subset of eigenvectors was selected to form the new basis of the vector space and the molecules were re-described in the new space. Each variation was tested in a new benchmark. LIRA proved to be as fast as USR and showed great potential for selecting similar molecules, for the majority of the molecules tested, because the ROC curves had points above the random line. Dimensionality reduction and weight functions improved LIRA's results, increasing speed, through the use of fewer descriptors to model the molecule's surface, and selectivity, in both cases. In summary, the proposed method proved to be an efficient and fast tool for measuring similarity between ligands based on molecular shape. Harmônicos esféricos Ligands Ligantes Similaridade Similarity Spherical harmonics
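The search-and-compare procedure described in this abstract (Euclidean distance over precomputed descriptors, a cutoff, and a binary comparison against a reference selection) can be illustrated with a toy sketch. This is not LIRA or USR code: the descriptor vectors, molecule names and the reference set standing in for a USR selection are all made up for illustration.

```python
import math

# Hypothetical precomputed descriptor vectors (stand-ins for expansion coefficients).
DATABASE = {
    "mol_a": (1.0, 0.0, 0.0),
    "mol_b": (1.0, 0.1, 0.0),
    "mol_c": (0.0, 1.0, 0.0),
    "mol_d": (0.9, 0.0, 0.2),
}
QUERY = (1.0, 0.0, 0.0)
# Hypothetical reference selection, playing the role of the USR result.
REFERENCE = {"mol_a", "mol_b", "mol_d"}


def euclidean(a, b):
    """Euclidean distance between two descriptor vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_similar(query, database, cutoff):
    """Names of all molecules whose descriptor lies within `cutoff` of the query."""
    return {name for name, vec in database.items() if euclidean(query, vec) <= cutoff}


def roc_point(selected, reference, universe):
    """(false positive rate, true positive rate) of `selected` against `reference`."""
    tp = len(selected & reference)
    fp = len(selected - reference)
    fn = len(reference - selected)
    tn = len(universe - selected - reference)
    tpr = tp / (tp + fn) if tp + fn else 0.0
    fpr = fp / (fp + tn) if fp + tn else 0.0
    return fpr, tpr
```

Sweeping the cutoff and collecting one `roc_point` per value traces a ROC curve for the query molecule; a point above the diagonal (true positive rate greater than false positive rate) corresponds to the "above the random line" criterion mentioned in the abstract.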
