Global ETD Search

131	The Characteristics of Cold Air Outbreaks in the eastern United States and the influence of Atmospheric Circulation Patterns Smith, Erik T. 18 July 2017 (has links) No description available. Geography Climate Change Environmental Science Meteorology Climate Synoptic Climatology Cold Air Outbreaks Atmospheric Circulation Patterns United States Self-Organizing Maps Polar Vortex Extreme Weather Teleconnections
132	Detekce útoku pomocí analýzy systémových logů / Attack Detection by Analysis of the System's Logs Holub, Ondřej Unknown Date (has links) The thesis deals with the attack detection possibilities and the nonstandard behaviour. It focuses on problems with the IDS detection systems, the subsequent classification and methods which are being used for the attack detection. One part of the thesis presents the existing IDS systems and their properties which are necessary for the successful attack detection. Other parts describe methods to obtain information from the operating systems Microsoft Windows and it also analyses the theoretical methods of data abnormalities. The practical part focuses on the design and implementation of the HIDS application. The final application and its detection abilities are tested at the end of the practical part with the help of some model situations. In the conclusion, the thesis sums up the gained information and shows a possible way of the future development.
133	Estudo de representações multidimensionais para segmentação das fases do gesto / Study of multidimensional representations for the gesture phases segmentation Feitosa, Ricardo Alves 17 April 2018 (has links) Sistemas de análise de gestos têm se destacado por suas contribuições para a interação entre humanos, humanos e máquinas, e humanos e ambiente. Nessa interação, a gesticulação natural é vista como parte do sistema linguístico que suporta a comunicação, e qualquer sistema de informação que objetiva usar interação para suporte à decisão deveria ser capaz de interpretá-la. Essa interpretação pode ser realizada por meio da segmentação das fases do gesto. Para resolver essa tarefa, o estabelecimento de uma representação de dados eficiente para os gestos é um ponto crítico. A representação escolhida e sua associação a técnicas de análise podem ou não favorecer a solução sob implementação. Neste trabalho, formas de representação de gestos são submetidas aos algoritmos de reconhecimento de padrões MLP e SOM para elaborar um ambiente propício à identificação das representações mais discriminantes, quais aspectos as diferentes representações descrevem com eficiência, e como elas podem ser combinadas para melhorar a segmentação das fases do gesto. Para construção das representações multidimensionais são usados aspectos espaciais e temporais combinados com a normalização dos dados e a aplicação do filtro wavelet na busca pela representação mais discriminante para o reconhecimento das fases do gesto. Ambos os algoritmos alcançaram bons resultados com o uso dos aspectos temporais. O MLP conseguiu classificar todas as fases do gesto em configurações de representação contendo dados sobre todos os membros monitorados. O SOM apresentou boa capacidade para formar grupos contendo dados de uma mesma fase do gesto mesmo com o uso de poucas características na construção da representação, porém não foi possível identificar a proposta de uma nova fase do gesto com o aprendizado não supervisionado / Gestures analysis systems have stood out for their contributions to the interaction between humans, humans and machines, and humans and environments. In this interaction, natural gesticulation is seen as part of a linguistic system that supports the communication, and all information systems aiming at the use of such an interaction in making decisions should be able to interpret it. Such an interpretation can be carried out through the gesture phases segmentation. In order to solve this task, the establishment of an efficient data representation for gestures is a critical issue. The chosen representation as well as its combination with techniques for analysis can or can not favor the solution being developed. In this work, different forms representation for gestures are applied to pattern recognition algorithms MLP and SOM to create an adequate environment to identify the more discriminative representations, which aspect the different representations describe with more efficiency, and how they can be combined in order to improve gesture phases segmentation. To construct the multidimensional representations we use spatial and temporal aspects combined with the normalization of the data and the application of the wavelet filter in the search for the most discriminating representation for the recognition of the gesture phases. Both algorithms achieved good results with the use of temporal aspects. MLP was able to classify all gesture phases using representation settings containing data about all monitored members. SOM presented good ability to form groups containing data of the same gesture phase even with the use of few characteristics in the construction of the representation, but it was not possible to identify the proposal of a new gesture phase with unsupervised learning Aprendizado de Máquina Machine Learning Mapas Auto Organizáveis Multilayer Perceptron Pattern Recognition Perceptron Multicamadas Reconhecimento de Padrões Representação de Gestos Representation of Gestures Segmentação das Fases do Gesto Segmentation of Gesture Phases Self Organizing Maps
134	Redes neurais e algoritmos genéticos no estudo quimiossistemático da família Asteraceae / Neural Network and Genetic Algorithms in the Chemosystematic study of Asteraceae Family Correia, Mauro Vicentini 16 March 2010 (has links) No presente trabalho duas metodologias da área de inteligência artificial (Redes Neurais e Algoritmos Genéticos) foram utilizadas para realizar um estudo Quimiossistemático da família Asteraceae. A família Asteraceae é uma das maiores famílias entre as Angiospermas, conta com aproximadamente 24.000 espécies. As espécies da família produzem grande diversidade de metabólitos secundários, entre os quais merecem destaque os terpenóides, poliacetilenos, flavonóides e cumarinas. Para um melhor entendimento da diversidade química da família construiu-se um Banco de Dados com as ocorrências de doze classes de metabólitos (monoterpenos, sesquiterpenos, sesquiterpenos lactonizados, diterpenos, triterpenos, cumarinas, flavonóides, poliacetilenos, benzofuranos, benzopiranos, acetofenonas e fenilpropanóides) produzidos pelas espécies da família. A partir desse banco três diferentes estudos foram realizados. No primeiro estudo, utilizando os mapas auto-organizáveis de Kohonen e o banco de dados químico classificado segundo duas das mais recentes filogenias da família foi possível realizar com sucesso separações de tribos e gêneros da família Asteraceae. Também foi possível indicar que a informação química concorda mais com a filogenia de Funk (Funk et al. 2009) do que com a filogenia de Bremer (Bremer 1994, 1996). No estudo seguinte, onde se objetivou a criação de modelos de previsão dos números de ocorrências das doze classes de metabólitos, utilizando o perceptron de múltiplas camadas com algoritmo de retropropagação de erro, o resultado foi insatisfatório. Apesar de em algumas classes de metabólitos a fase de treino da rede apresentar resultados satisfatórios, a fase de teste mostrou que os modelos criados não são capazes de realizar previsão para dados aos quais eles não foram submetidos na fase de treino, e portanto não são modelos adequados para realizar previsões. Finalmente, o terceiro estudo consistiu na criação de modelos de regressão linear utilizando como método de seleção de variáveis os algoritmos genéticos. Nesse estudo foi possível indicar que os monoterpenos e os sesquiterpenos são bastante relacionados biossinteticamente, também foi possível indicar que existem relações biossintéticas entre monoterpenos e diterpenos e entre sesquiterpenos e triterpenos / In this study two methods of artificial intelligence (neural network and genetic algorithms) were used to work out a Chemosystematic study of the Asteraceae family. The family Asteraceae is one of the largest families among the Angiosperms, having about 24,000 species. The species of the family produce a large diversity of secondary metabolites, and some worth mentioning are the terpenoids, polyacetylenes, flavonoids and coumarins. For a better understanding of the chemical diversity of the family a database was built up with the occurrences of twelve classes of metabolites (monoterpenes, sesquiterpenes, lactonizadossesquiterpenes, diterpenes, triterpenes, coumarins, flavonoids, polyacetylenes, Benzofurans, benzopyrans, acetophenones and phenylpropanoids) produced by species of the family. From this database three different studies were conducted. In the first study, using the Kohonen self-organized map and the chemical data classified according to two of the most recent phylogenies of the family, it was possible to successfully separatethe tribes and genera of the Asteraceae family. It was also possible to indicate that the chemical information agrees with the phylogeny of Funk (Funk et al. 2009) than with the phylogeny of Bremer (Bremer 1994, 1996). In the next study, which aims at creating models to predict the number of occurrences of the twelve classes of metabolites using multi-layer perceptron with backpropagation algorithm error, the result was found unsatisfactory. Although in some classes of metabolites the training phase of the network has satisfactory results, the test phase showed that the models created are not able to make prevision for data to which they were submitted in the training phase and thus are not suitable models for predictions. Finally, the third study was the creation of linear regression models using a genetic algorithm method of variable selection. This study could indicate that the monoterpenes and sesquiterpenes are closely related biosynthetically, and was also possible to indicate that there are biosynthetic relations between monoterpenes and diterpenes and between sesquiterpenes and triterpenes Algoritmos Genéticos Asteraceae Asteraceae Banco de dados Chemosystematic Compositae (Estudo; Classificação) Genetic Algorithms Mapas Auto-Organizáveis Multi-layer Perceptron Natural products Neural Network Perceptron de Múltiplas Camadas Produtos naturais Quimiossistemática Redes Neurais Self-Organizing Maps
135	Aplicação de mapas auto-organizáveis na classificação de aberrações cromossômicas utilizando imagens de cromossomos humanos submetidos à radiação ionizante / Application of self-organizing maps for the classification of chromosomal aberrations using images of human chromosomes subjected to ionizing radiation Cunha, Kelly de Paula 15 April 2015 (has links) O presente trabalho é resultado da colaboração de pesquisadores do Centro de Engenharia Nuclear (CEN) e de pesquisadores do Centro de Biotecnologia (CB), ambos pertencentes ao IPEN, para o desenvolvimento de uma metodologia que visa auxiliar os profissionais citogeneticistas fornecendo uma ferramenta que automatize parte da rotina necessária para a avaliação qualitativa e quantitativa de danos biológicos em termos de aberração cromossômica. A técnica citogenética, sobre a qual esta ferramenta é desenvolvida, é a técnica de aberrações cromossômicas. Nela, são realizadas preparações citológicas de linfócitos de sangue periférico para que metáfases sejam analisadas e fotografadas ao microscópio e, com base na morfologia dos cromossomos, anomalias sejam investigadas. Quando esta tarefa é realizada manualmente, os cromossomos são analisados visualmente um a um pelo profissional citogeneticista, logo, trata-se de um processo minucioso em virtude da variação geral na aparência do cromossomo, do seu tamanho pequeno e do grande número de cromossomos por célula. Para um diagnóstico confiável, é necessário que várias células sejam analisadas, tornando-se uma tarefa repetitiva e demorada. Neste contexto, foi proposto o uso dos mapas auto-organizáveis para o reconhecimento automático de padrões morfológicos referentes às imagens de cromossomos humanos. Para isso, foi desenvolvido um método de extração de características por meio do qual é possível classificar os cromossomos em: dicêntricos, anéis, acrocêntricos, submetacêntricos e metacêntricos, com acerto de 93,4 % em relação ao diagnóstico dado por um profissional citogeneticista. / This work is a joint collaboration between Nuclear Energy Research Institute (IPEN), Nuclear Engineering Center and Biotechnology Center to develop a methodology aiming to assist cytogenetic professionals by providing a tool to automate part of the required routine to perform qualitative and quantitative evaluation of biological damage in terms of chromosomal aberration. The cytogenetic technique upon which this tool was developed, is the chromosome aberrations technique, in which cytological preparations of peripheral blood lymphocyte metaphases are performed to be analyzed and photographed under a microscope in order to investigating chromosomal aberration. Performed manually, the chromosomes are analyzed visually one by one by a cytogenetic professional, so it is a painstaking process due to the great deal of variation in the appearance of each chromosome, their small sizes and not to mention the high density of chromosomes per cell. In order to obtain a reliable diagnosis it is necessary that many cells be analyzed, which makes this a repetitive and time consuming process. In this context, the use of self-organizing maps for the automatic recognition of patterns relating to morphological pictures of human chromosomes has been proposed. For this, we developed a feature extraction method by which is possible to classify chromosomes in: dicentrics, ring-shaped, acrocentric, submetacentric and metacentric with 93.4% accuracy compared to diagnostic given by a professional cytogeneticist. aberrações cromossômicas artificial neural networks chromosomal aberrations chromosome classification classificação cromossômica diagnostic imaging diagnóstico por imagem Kohonen networks mapas auto-organizáveis redes de Kohonen redes neurais artificiais self-organizing maps
136	Estudo de representações multidimensionais para segmentação das fases do gesto / Study of multidimensional representations for the gesture phases segmentation Ricardo Alves Feitosa 17 April 2018 (has links) Sistemas de análise de gestos têm se destacado por suas contribuições para a interação entre humanos, humanos e máquinas, e humanos e ambiente. Nessa interação, a gesticulação natural é vista como parte do sistema linguístico que suporta a comunicação, e qualquer sistema de informação que objetiva usar interação para suporte à decisão deveria ser capaz de interpretá-la. Essa interpretação pode ser realizada por meio da segmentação das fases do gesto. Para resolver essa tarefa, o estabelecimento de uma representação de dados eficiente para os gestos é um ponto crítico. A representação escolhida e sua associação a técnicas de análise podem ou não favorecer a solução sob implementação. Neste trabalho, formas de representação de gestos são submetidas aos algoritmos de reconhecimento de padrões MLP e SOM para elaborar um ambiente propício à identificação das representações mais discriminantes, quais aspectos as diferentes representações descrevem com eficiência, e como elas podem ser combinadas para melhorar a segmentação das fases do gesto. Para construção das representações multidimensionais são usados aspectos espaciais e temporais combinados com a normalização dos dados e a aplicação do filtro wavelet na busca pela representação mais discriminante para o reconhecimento das fases do gesto. Ambos os algoritmos alcançaram bons resultados com o uso dos aspectos temporais. O MLP conseguiu classificar todas as fases do gesto em configurações de representação contendo dados sobre todos os membros monitorados. O SOM apresentou boa capacidade para formar grupos contendo dados de uma mesma fase do gesto mesmo com o uso de poucas características na construção da representação, porém não foi possível identificar a proposta de uma nova fase do gesto com o aprendizado não supervisionado / Gestures analysis systems have stood out for their contributions to the interaction between humans, humans and machines, and humans and environments. In this interaction, natural gesticulation is seen as part of a linguistic system that supports the communication, and all information systems aiming at the use of such an interaction in making decisions should be able to interpret it. Such an interpretation can be carried out through the gesture phases segmentation. In order to solve this task, the establishment of an efficient data representation for gestures is a critical issue. The chosen representation as well as its combination with techniques for analysis can or can not favor the solution being developed. In this work, different forms representation for gestures are applied to pattern recognition algorithms MLP and SOM to create an adequate environment to identify the more discriminative representations, which aspect the different representations describe with more efficiency, and how they can be combined in order to improve gesture phases segmentation. To construct the multidimensional representations we use spatial and temporal aspects combined with the normalization of the data and the application of the wavelet filter in the search for the most discriminating representation for the recognition of the gesture phases. Both algorithms achieved good results with the use of temporal aspects. MLP was able to classify all gesture phases using representation settings containing data about all monitored members. SOM presented good ability to form groups containing data of the same gesture phase even with the use of few characteristics in the construction of the representation, but it was not possible to identify the proposal of a new gesture phase with unsupervised learning Aprendizado de Máquina Mapas Auto Organizáveis Perceptron Multicamadas Reconhecimento de Padrões Representação de Gestos Segmentação das Fases do Gesto Machine Learning Multilayer Perceptron Pattern Recognition Representation of Gestures Segmentation of Gesture Phases Self Organizing Maps
137	Análise dos atropelamentos de mamíferos em uma rodovia no estado de São Paulo utilizando Self-Organizing Maps. / Using Self-Organizing Maps to analyse wildlife-vehicle collisions on a highway in São Paulo state. Tsuda, Larissa Sayuri 05 July 2018 (has links) A construção e ampliação de rodovias gera impactos significativos ao meio ambiente. Os principais impactos ao meio biótico são a supressão de vegetação, redução da riqueza e abundância de espécies de fauna como decorrência da fragmentação de habitats e aumento dos riscos de atropelamento de animais silvestres e domésticos. O objetivo geral do trabalho foi identificar padrões espaciais nos atropelamentos de fauna silvestre por espécie (nome popular) utilizando ferramentas de análise espacial e machine learning. Especificamente, buscou-se compreender a relação entre atropelamentos de animais silvestres e variáveis que representam características de uso e cobertura do solo e caracterização da rodovia, tais como formação florestal, corpos d\'água, silvicultura, áreas edificadas, velocidade máxima permitida, volume de tráfego, entre outras. Os atropelamentos de fauna silvestre foram analisados por espécie atropelada, a fim de identificar os padrões espaciais dos atropelamentos específicos para cada espécie. As ferramentas de análise espacial empregadas foram a Função K - para determinar o padrão de distribuição dos registros de atropelamento de fauna, o Estimador de Densidade de Kernel - para gerar estimativas de densidade de pontos sobre a rodovia, a Análise de Hotspots - para identificar os trechos mais críticos de atropelamento de fauna e, por fim, o Self-Organizing Maps (SOM), um tipo de rede neural artificial, que reorganiza amostras de dados n-dimensionais de acordo com a similaridade entre elas. Os resultados das análises de padrões pontuais foram importantes para entender que os pontos de atropelamento possuem padrões de distribuição espacial que variam por espécie. Os eventos ocorrem espacialmente agrupados e não estão homogeneamente distribuídos ao longo da rodovia. De maneira geral, os animais apresentam trechos de maior intensidade de atropelamento em locais distintos. O SOM permitiu analisar as relações entre múltiplas variáveis, lineares e não-lineares, tais como são os dados ecológicos, e encontrar padrões espaciais distintos por espécie. A maior parte dos animais foi atropelada próxima de fragmentos florestais e de corpos d\'água, e distante de cultivo de cana-de-açúcar, silvicultura e área edificada. Porém, uma parte considerável das mortes de animais dos tipos com maior número de atropelamentos ocorreu em áreas com paisagem diversificada, incluindo alta densidade de drenagem, fragmentos florestais, silvicultura e áreas edificadas. / The construction and expansion of roads cause significant impacts on the environment. The main potential impacts to biotic environment are vegetation suppression, reduction of the abundance and richness of species due to forest fragmentation and increase of animal (domestic and wildlife) vehicle collisions. The general objective of this work was to identify spatial patterns in wildlife-vehicle collisions individually per species by using spatial analysis and machine learning. Specifically, the relationship between wildlife-vehicle collisions and variables that represent land use and road characterization features - such as forests, water bodies, silviculture, sugarcane fields, built environment, speed limit and traffic volume - was investigated. The wildlife-vehicle collisions were analyzed per species, in order to identify the spatial patterns for each species separately. The spatial analysis tools used in this study were K-Function - to determine the distribution pattern of roadkill, Kernel Density Estimator (KDE) - to identify the location and intensity of hotspots and hotzones. Self-Organizing Maps (SOM), an artificial neural network (ANN), was selected to reorganize the multi-dimensional data according to the similarity between them. The results of the spatial pattern analysis were important to perceive that the point data pattern varies between species. The events occur spatially clustered and are not uniformly distributed along the highway. In general, wildlife-vehicle collsions have their hotzones in different locations. SOM was able to analyze the relationship between multiple variables, linear and non-linear, such as ecological data, and established distinct spatial patterns per each species. Most of the wildlife was run over close to forest area and water bodies, and distant from sugarcane, silviculture and built environments. But a considerable part of the wildlife-vehicle collisions occurred in areas with diverse landscape, including high density of water bodies, silviculture and built environments. Geographic Information Systems Geoprocessamento Geoprocessing GIS K Function KDE Kernel Density Estimator Machine learning Mapas auto-organizáveis Neural networks Redes neurais Road ecology Road safety Segurança rodoviária Self-Organizing Maps Sistema de informação geográfica SOM Wildlife-vehicle collisions
138	A three-dimensional representation method for noisy point clouds based on growing self-organizing maps accelerated on GPUs Orts-Escolano, Sergio 21 January 2014 (has links) The research described in this thesis was motivated by the need of a robust model capable of representing 3D data obtained with 3D sensors, which are inherently noisy. In addition, time constraints have to be considered as these sensors are capable of providing a 3D data stream in real time. This thesis proposed the use of Self-Organizing Maps (SOMs) as a 3D representation model. In particular, we proposed the use of the Growing Neural Gas (GNG) network, which has been successfully used for clustering, pattern recognition and topology representation of multi-dimensional data. Until now, Self-Organizing Maps have been primarily computed offline and their application in 3D data has mainly focused on free noise models, without considering time constraints. It is proposed a hardware implementation leveraging the computing power of modern GPUs, which takes advantage of a new paradigm coined as General-Purpose Computing on Graphics Processing Units (GPGPU). The proposed methods were applied to different problem and applications in the area of computer vision such as the recognition and localization of objects, visual surveillance or 3D reconstruction. 3D representation method Growing neural gas Self-organizing maps Topology preservation Parallel computing CUDA Real-time Point cloud 3D reconstruction GPGPU RGBD Noisy 3D data Object recognition
139	Contributions to 3D Data Registration and Representation Morell, Vicente 02 October 2014 (has links) Nowadays, new computers generation provides a high performance that enables to build computationally expensive computer vision applications applied to mobile robotics. Building a map of the environment is a common task of a robot and is an essential part to allow the robots to move through these environments. Traditionally, mobile robots used a combination of several sensors from different technologies. Lasers, sonars and contact sensors have been typically used in any mobile robotic architecture, however color cameras are an important sensor due to we want the robots to use the same information that humans to sense and move through the different environments. Color cameras are cheap and flexible but a lot of work need to be done to give robots enough visual understanding of the scenes. Computer vision algorithms are computational complex problems but nowadays robots have access to different and powerful architectures that can be used for mobile robotics purposes. The advent of low-cost RGB-D sensors like Microsoft Kinect which provide 3D colored point clouds at high frame rates made the computer vision even more relevant in the mobile robotics field. The combination of visual and 3D data allows the systems to use both computer vision and 3D processing and therefore to be aware of more details of the surrounding environment. The research described in this thesis was motivated by the need of scene mapping. Being aware of the surrounding environment is a key feature in many mobile robotics applications from simple robotic navigation to complex surveillance applications. In addition, the acquisition of a 3D model of the scenes is useful in many areas as video games scene modeling where well-known places are reconstructed and added to game systems or advertising where once you get the 3D model of one room the system can add furniture pieces using augmented reality techniques. In this thesis we perform an experimental study of the state-of-the-art registration methods to find which one fits better to our scene mapping purposes. Different methods are tested and analyzed on different scene distributions of visual and geometry appearance. In addition, this thesis proposes two methods for 3d data compression and representation of 3D maps. Our 3D representation proposal is based on the use of Growing Neural Gas (GNG) method. This Self-Organizing Maps (SOMs) has been successfully used for clustering, pattern recognition and topology representation of various kind of data. Until now, Self-Organizing Maps have been primarily computed offline and their application in 3D data has mainly focused on free noise models without considering time constraints. Self-organising neural models have the ability to provide a good representation of the input space. In particular, the Growing Neural Gas (GNG) is a suitable model because of its flexibility, rapid adaptation and excellent quality of representation. However, this type of learning is time consuming, specially for high-dimensional input data. Since real applications often work under time constraints, it is necessary to adapt the learning process in order to complete it in a predefined time. This thesis proposes a hardware implementation leveraging the computing power of modern GPUs which takes advantage of a new paradigm coined as General-Purpose Computing on Graphics Processing Units (GPGPU). Our proposed geometrical 3D compression method seeks to reduce the 3D information using plane detection as basic structure to compress the data. This is due to our target environments are man-made and therefore there are a lot of points that belong to a plane surface. Our proposed method is able to get good compression results in those man-made scenarios. The detected and compressed planes can be also used in other applications as surface reconstruction or plane-based registration algorithms. Finally, we have also demonstrated the goodness of the GPU technologies getting a high performance implementation of a CAD/CAM common technique called Virtual Digitizing. 3D representation method Growing Neural Gas Self-Organizing Maps Topology Preservation Parallel Computing CUDA Real-time Point Cloud GPGPU RGB-D Noisy 3D data 3D registration 3D compression
140	Marc integrador de les capacitats de Soft-Computing i de Knowledge Discovery dels Mapes Autoorganitzatius en el Raonament Basat en Casos Fornells Herrera, Albert 14 December 2007 (has links) El Raonament Basat en Casos (CBR) és un paradigma d'aprenentatge basat en establir analogies amb problemes prèviament resolts per resoldre'n de nous. Per tant, l'organització, l'accés i la utilització del coneixement previ són aspectes claus per tenir èxit en aquest procés. No obstant, la majoria dels problemes reals presenten grans volums de dades complexes, incertes i amb coneixement aproximat i, conseqüentment, el rendiment del CBR pot veure's minvat degut a la complexitat de gestionar aquest tipus de coneixement. Això ha fet que en els últims anys hagi sorgit una nova línia de recerca anomenada Soft-Computing and Intelligent Information Retrieval enfocada en mitigar aquests efectes. D'aquí neix el context d'aquesta tesi.Dins de l'ampli ventall de tècniques Soft-Computing per tractar coneixement complex, els Mapes Autoorganitzatius (SOM) destaquen sobre la resta per la seva capacitat en agrupar les dades en patrons, els quals permeten detectar relacions ocultes entre les dades. Aquesta capacitat ha estat explotada en treballs previs d'altres investigadors, on s'ha organitzat la memòria de casos del CBR amb SOM per tal de millorar la recuperació dels casos.La finalitat de la present tesi és donar un pas més enllà en la simple combinació del CBR i de SOM, de tal manera que aquí s'introdueixen les capacitats de Soft-Computing i de Knowledge Discovery de SOM en totes les fases del CBR per nodrir-les del nou coneixement descobert. A més a més, les mètriques de complexitat apareixen en aquest context com un instrument precís per modelar el funcionament de SOM segons la tipologia de les dades. L'assoliment d'aquesta integració es pot dividir principalment en quatre fites: (1) la definició d'una metodologia per determinar la millor manera de recuperar els casos tenint en compte la complexitat de les dades i els requeriments de l'usuari; (2) la millora de la fiabilitat de la proposta de solucions gràcies a les relacions entre els clústers i els casos; (3) la potenciació de les capacitats explicatives mitjançant la generació d'explicacions simbòliques; (4) el manteniment incremental i semi-supervisat de la memòria de casos organitzada per SOM.Tots aquests punts s'integren sota la plataforma SOMCBR, la qual és extensament avaluada sobre datasets provinents de l'UCI Repository i de dominis mèdics i telemàtics.Addicionalment, la tesi aborda de manera secundària dues línies de recerca fruït dels requeriments dels projectes on ha estat ubicada. D'una banda, s'aborda la definició de funcions de similitud específiques per definir com comparar un cas resolt amb un de nou mitjançant una variant de la Computació Evolutiva anomenada Evolució de Gramàtiques (GE). D'altra banda, s'estudia com definir esquemes de cooperació entre sistemes heterogenis per millorar la fiabilitat de la seva resposta conjunta mitjançant GE. Ambdues línies són integrades en dues plataformes, BRAIN i MGE respectivament, i són també avaluades amb els datasets anteriors. / El Razonamiento Basado en Casos (CBR) es un paradigma de aprendizaje basado en establecer analogías con problemas previamente resueltos para resolver otros nuevos. Por tanto, la organización, el acceso y la utilización del conocimiento previo son aspectos clave para tener éxito. No obstante, la mayoría de los problemas presentan grandes volúmenes de datos complejos, inciertos y con conocimiento aproximado y, por tanto, el rendimiento del CBR puede verse afectado debido a la complejidad de gestionarlos. Esto ha hecho que en los últimos años haya surgido una nueva línea de investigación llamada Soft-Computing and Intelligent Information Retrieval focalizada en mitigar estos efectos. Es aquí donde nace el contexto de esta tesis.Dentro del amplio abanico de técnicas Soft-Computing para tratar conocimiento complejo, los Mapas Autoorganizativos (SOM) destacan por encima del resto por su capacidad de agrupar los datos en patrones, los cuales permiten detectar relaciones ocultas entre los datos. Esta capacidad ha sido aprovechada en trabajos previos de otros investigadores, donde se ha organizado la memoria de casos del CBR con SOM para mejorar la recuperación de los casos.La finalidad de la presente tesis es dar un paso más en la simple combinación del CBR y de SOM, de tal manera que aquí se introducen las capacidades de Soft-Computing y de Knowledge Discovery de SOM en todas las fases del CBR para alimentarlas del conocimiento nuevo descubierto. Además, las métricas de complejidad aparecen en este contexto como un instrumento preciso para modelar el funcionamiento de SOM en función de la tipología de los datos. La consecución de esta integración se puede dividir principalmente en cuatro hitos: (1) la definición de una metodología para determinar la mejor manera de recuperar los casos teniendo en cuenta la complejidad de los datos y los requerimientos del usuario; (2) la mejora de la fiabilidad en la propuesta de soluciones gracias a las relaciones entre los clusters y los casos; (3) la potenciación de las capacidades explicativas mediante la generación de explicaciones simbólicas; (4) el mantenimiento incremental y semi-supervisado de la memoria de casos organizada por SOM. Todos estos puntos se integran en la plataforma SOMCBR, la cual es ampliamente evaluada sobre datasets procedentes del UCI Repository y de dominios médicos y telemáticos.Adicionalmente, la tesis aborda secundariamente dos líneas de investigación fruto de los requeri-mientos de los proyectos donde ha estado ubicada la tesis. Por un lado, se aborda la definición de funciones de similitud específicas para definir como comparar un caso resuelto con otro nuevo mediante una variante de la Computación Evolutiva denominada Evolución de Gramáticas (GE). Por otro lado, se estudia como definir esquemas de cooperación entre sistemas heterogéneos para mejorar la fiabilidad de su respuesta conjunta mediante GE. Ambas líneas son integradas en dos plataformas, BRAIN y MGE, las cuales también son evaluadas sobre los datasets anteriores. / Case-Based Reasoning (CBR) is an approach of machine learning based on solving new problems by identifying analogies with other previous solved problems. Thus, organization, access and management of this knowledge are crucial issues for achieving successful results. Nevertheless, the major part of real problems presents a huge amount of complex data, which also presents uncertain and partial knowledge. Therefore, CBR performance is influenced by the complex management of this knowledge. For this reason, a new research topic has appeared in the last years for tackling this problem: Soft-Computing and Intelligent Information Retrieval. This is the point where this thesis was born.Inside the wide variety of Soft-Computing techniques for managing complex data, the Self-Organizing Maps (SOM) highlight from the rest due to their capability for grouping data according to certain patterns using the relations hidden in data. This capability has been used in a wide range of works, where the CBR case memory has been organized with SOM for improving the case retrieval.The goal of this thesis is to take a step up in the simple combination of CBR and SOM. This thesis presents how to introduce the Soft-Computing and Knowledge Discovery capabilities of SOM inside all the steps of CBR to promote them with the discovered knowledge. Furthermore, complexity measures appear in this context as a mechanism to model the performance of SOM according to data topology. The achievement of this goal can be split in the next four points: (1) the definition of a methodology for setting up the best way of retrieving cases taking into account the data complexity and user requirements; (2) the improvement of the classification reliability through the relations between cases and clusters; (3) the promotion of the explaining capabilities by means of the generation of symbolic explanations; (4) the incremental and semi-supervised case-based maintenance. All these points are integrated in the SOMCBR framework, which has been widely tested in datasets from UCI Repository and from medical and telematic domains. Additionally, this thesis secondly tackles two additional research lines due to the requirements of a project in which it has been developed. First, the definition of similarity functions ad hoc a domain is analyzed using a variant of the Evolutionary Computation called Grammar Evolution (GE). Second, the definition of cooperation schemes between heterogeneous systems is also analyzed for improving the reliability from the point of view of GE. Both lines are developed in two frameworks, BRAIN and MGE respectively, which are also evaluated over the last explained datasets. Hybrid Systems Soft-Computing Self-Organizing Maps Case-Based Reasoning Sistemas Híbridos Soft-Computing Mapas Autoorganizativos Razonamiento Basado en Casos Sistemes Híbrids Soft-Computing Mapes Autooranitzatius Raonament Basat en Casos Les TIC i la seva Gestió 004

Search results