Global ETD Search

41	[en] A METHOD FOR INTERPRETING CONCEPT DRIFTS IN A STREAMING ENVIRONMENT / [pt] UM MÉTODO PARA INTERPRETAÇÃO DE MUDANÇAS DE REGIME EM UM AMBIENTE DE STREAMING JOAO GUILHERME MATTOS DE O SANTOS 10 August 2021 (has links) [pt] Em ambientes dinâmicos, os modelos de dados tendem a ter desempenho insatisfatório uma vez que a distribuição subjacente dos dados muda. Este fenômeno é conhecido como Concept Drift. Em relação a este tema, muito esforço tem sido direcionado ao desenvolvimento de métodos capazes de detectar tais fenômenos com antecedência suficiente para que os modelos possam se adaptar. No entanto, explicar o que levou ao drift e entender suas consequências ao modelo têm sido pouco explorado pela academia. Tais informações podem mudar completamente a forma como adaptamos os modelos. Esta dissertação apresenta uma nova abordagem, chamada Detector de Drift Interpretável, que vai além da identificação de desvios nos dados. Ele aproveita a estrutura das árvores de decisão para prover um entendimento completo de um drift, ou seja, suas principais causas, as regiões afetadas do modelo e sua severidade. / [en] In a dynamic environment, models tend to perform poorly once the underlying distribution shifts. This phenomenon is known as Concept Drift. In the last decade, considerable research effort has been directed towards developing methods capable of detecting such phenomena early enough so that models can adapt. However, not so much consideration is given to explain the drift, and such information can completely change the handling and understanding of the underlying cause. This dissertation presents a novel approach, called Interpretable Drift Detector, that goes beyond identifying drifts in data. It harnesses decision trees’ structure to provide a thorough understanding of a drift, i.e., its principal causes, the affected regions of a tree model, and its severity. Moreover, besides all information it provides, our method also outperforms benchmark drift detection methods in terms of falsepositive rates and true-positive rates across several different datasets available in the literature. [pt] ARVORE DE DECISAO [pt] ENTENDIMENTO DE DRIFT [pt] INTERPRETACAO DE DRIFT [pt] DETECCAO DE DRIFT [pt] MINERACAO DE DADOS DE TRAJETORIAS [en] DECISION TREE [en] DRIFT UNDERSTANDING [en] DRIFT INTERPRETATION [en] DRIFT DETECTION [en] TRAJECTORY DATA MINING
42	[en] ON THE RELATION BETWEEN REFACTORING AND CRITICAL INTERNAL ATTRIBUTES WHEN EVOLVING SOFTWARE FEATURES / [pt] SOBRE A RELAÇÃO ENTRE REFATORAÇÃO E ATRIBUTOS INTERNOS CRÍTICOS AO EVOLUIR FUNCIONALIDADES DE SOFTWARE EDUARDO MOREIRA FERNANDES 07 June 2021 (has links) [pt] Contexto: Várias mudanças de código aplicadas ao evoluir funcionalidades visam melhorar atributos internos de qualidade como coesão. Tais mudanças são as refatorações. Refatorações não dirigidas podem piorar, e não melhorar, atributos internos. Porém, o saber atual é insuficiente para gerir atributos internos durante a evolução do sistema. Objetivo: Nosso primeiro objetivo é entender como refatorações afetam atributos internos ao evoluir sistemas, mitigando limitações de escopo de estudos anteriores. Nosso segundo objetivo é atender uma carência por evidência quantitativa sobre como gerir atributos internos críticos via refatorações ao evoluir sistemas. Um atributo interno é crítico se sua medição assume valores anômalos. Baixa coesão é um exemplo de atributo crítico. Método: O primeiro estudo estende uma avaliação quantitativa da relação entre refatorações e cinco atributos internos: acoplamento, coesão, complexidade, herança e tamanho. Incluímos novas análises e resolvemos ameaças à validade da literatura. O segundo estudo contém estudos de caso qualitativos baseados em grupo focal. Em dois casos industriais, promovemos discussões sobre o quanto (e por que) atributos críticos são relevante ao evoluir funcionalidades. Por fim, cruzamos os achados dos dois estudos para discutir como gerir atributos críticos via refatoração ao evoluir funcionalidades. Resultados: Aproximadamente 64 por cento das refatorações melhoram ou não afetam os atributos internos. Desenvolvedores parecem refatorar até melhorar os atributos mais relevantes, ignorando outros atributos internos possivelmente críticos. Baixa coesão e alta complexidade são percebidos como relevantes e tornam mais difícil evoluir funcionalidades. Alto acoplamento, herança larga e tamanho largo são percebidos como irrelevantes ao implementar funcionalidades especialmente complexas, por exemplo. Ao cruzar dados entre estudos, discutimos como refatorações podem melhorar atributos internos, inclusive atributos críticos. Conclusões: Os achados dos nossos estudos podem apoiar a gestão de atributos críticos relevantes aos desenvolvedores, mas também preservar outros atributos que podem se tornar críticos. / [en] Context: Several software changes applied while evolving software features aim at improving internal quality attributes, e.g. cohesion. These changes are the refactorings. Non-assisted refactorings might worsen, rather than improve, internal attributes. However, current knowledge is insufficient for managing internal attributes during software evolution. Objective: Our first objective is assessing how refactorings affect internal attributes during software evolution by filling gaps of past work on study scope. Our second objective is filling gaps of qualitative evidence on how to manage critical internal attributes via refactorings while evolving features. An internal attribute is critical when its measurement has anomalous values. Low cohesion is an example of critical attribute. Method: Our first study extends a large quantitative assessment of the relationship between refactorings and five internal attributes: cohesion, complexity, coupling, inheritance, and size. We include a more detailed statistical analysis and address major threats to validity of past work. Our second study is a qualitative case study based on focus group. We selected two industry cases to promote discussions on how much (and why) critical attributes are relevant while evolving features. Finally, we crossed the findings from both conducted studies aimed at discussing how critical attributes can be addressed via refactoring when evolving features. Results: About 64 per cent of refactorings either improve or keep the internal attributes unaffected. Developers seem to perform refactorings until the most relevant internal attributes are improved, thereby neglecting other internal attributes that may be critical. Low cohesion and high complexity are perceived as relevant because they often make evolving features harder than usual. High coupling, large inheritance, and large size are perceived as irrelevant when developers implement especially complex features. By crossing the findings from both studies, we discuss how refactorings can improve internal attributes, especially the critical ones. Conclusions: The findings of our studies can support managing critical attributes that developers typically find relevant, while preserving other attributes that may become critical. [pt] GRUPO FOCAL [pt] ATRIBUTO INTERNO DE QUALIDADE [pt] FUNCIONALIDADE [pt] REFATORACAO [en] FOCUS GROUP [en] MINING SOFTWARE REPOSITORY [en] INTERNAL QUALITY ATTRIBUTES [en] FUNCTIONALITY [en] REFACTORING
43	[pt] MINERANDO O PROCESSO DE UM COQUEAMENTO RETARDADO ATRAVÉS DE AGRUPAMENTO DE ESTADOS / [en] MINING THE PROCESS OF A DELAYED COKER USING CLUSTERED STATES RAFAEL AUGUSTO GASETA FRANCA 25 November 2021 (has links) [pt] Procedimentos e processos são essenciais para garantir a qualidade de qualquer operação. Porém, o processo realizado na prática nem sempre está de acordo com o processo idealizado. Além disso, uma análise mais refinada de gargalos e inconsistências só é possível a partir do registro de eventos do processo (log). Mineração de processos (process mining) é uma área que reúne um conjunto de métodos para reconstruir, monitorar e aprimorar um processo a partir de seu registro de eventos. Mas, ao aplicar as soluções já existentes no log de uma unidade de coqueamento retardado, os resultados foram insatisfatórios. O núcleo do problema está na forma como o log está estruturado, carecendo de uma identificação de casos, essencial para a mineração do processo. Para contornar esse problema, aplicamos agrupamento hierárquico aglomerativo no log, separando as válvulas em grupos que exercem uma função na operação. Desenvolvemos uma ferramenta (PLANTSTATE) para avaliar a qualidade desses grupos no contexto da planta e ajustar conforme a necessidade do domínio. Identificando os momentos de ativação desses grupos no log chegamos a uma estrutura de sequência e paralelismo entre os grupos. Finalmente, propomos um modelo capaz de representar as relações entre os grupos, resultando em um processo que representa a operações em uma unidade de coqueamento retardado. / [en] Procedures and processes are essential to guarantee the quality of any operation. However, processes carried out in the real world are not always in accordance with the imagined process. Furthermore, a more refined analysis of obstacles and inconsistencies is only possible from the process events record (log). Process mining is an area that brings together a set of methods to rebuild, monitor and improve processes from their log. Nevertheless, when applying existing solutions to the log of a delayed coker unit, the results were unsatisfactory. The core of the problem is how the log is structured, lacking a case identification, essential for process mining. To deal with this issue, we apply agglomerative hierarchical clustering in the log, separating the valves into groups that perform a task in an operation. We developed a tool (PLANTSTATE) to assess the quality of these groups in the context of the plant and to adjust in accord to the needs of the domain. By identifying the moments of activation of these groups in the log we arrive at a structure of sequence and parallelism between the groups. Finally, we propose a model capable of representing the relationships between groups, resulting in a process that represents the operations in a delayed coker unit. [pt] APRENDIZADO DE MAQUINA [pt] COQUEAMENTO RETARDADO [pt] MINERACAO DE PROCESSOS [pt] CIENCIA DE DADOS [en] MACHINE LEARNING [en] DELAYED COKE [en] PROCESS MINING [en] DATA SCIENCE
44	[en] DEVIANCE MINING OF ONLINE PROCESSES WITH NONATOMIC EVENTS IN THE COVID-19 DOMAIN / [pt] MINERAÇÃO DE DESVIOS EM PROCESSOS ONLINE COM EVENTOS NÃO ATÔMICOS NO DOMÍNIO DA COVID-19 LUCAS SEIXAS JAZBIK 17 October 2022 (has links) [pt] As técnicas de mineração de processos vêm sendo aplicadas com sucesso como abordagens baseadas em dados e específicas do domínio para melhorar o desempenho do processo de negócios em várias organizações. Dentre suas aplicações, a Mineração de Desvios (Deviance Mining) visa descobrir as razões pelas quais um subconjunto das execuções de um processo de negócio desvia-se em relação aos seus resultados esperados ou desejáveis, produzindo assim insights para melhorar a operação do processo, tais descobertas podem ser feitas utilizando técnicas de aprendizado de tratamentos (Treatment Learning), que identificam os conjuntos de atributos mais influentes nos resultados. No entanto, apesar de os processos da vida real serem tipicamente compostos por eventos de duração não instantânea (eventos não atômicos), as abordagens existentes para mineração de processos, e para mineração de desvios em particular, endereçam exclusivamente eventos atômicos em seus experimentos. Este trabalho propõe um método orientado ao domínio para detectar automaticamente desvios em processos compostos por eventos não atômicos. O método utiliza a dimensão temporal dos eventos não atômicos para aplicar a mineração de desvios, gerando insights sobre como a duração e a ocorrência simultânea de eventos geram desvios e como esses desvios impactam os resultados dos processos. O método foi aplicado com sucesso no domínio da COVID-19, para descobrir quais sequências de intervenções não farmacêuticas mais contribuíram para diminuir a taxa de casos de COVID-19 em países ao redor do mundo. / [en] Process Mining techniques have been successfully applied as a datadriven and domain-aware approach for improving business process performance in several organizations. Among its applications, Deviance Mining aims at uncovering the reasons why a subset of the executions of a business process deviate with respect to its expected or desirable outcomes, thus producing insights towards improving the process operation, such discoveries can be made using treatment learning techniques, which identify the sets of attributes that are most influential in the results. However, despite the fact that real-life processes are typically composed by events with non-instantaneous duration (nonatomic events), existing approaches for process mining and deviance mining in particular only deal with atomic events in their experiments. This work proposes a domain-driven method for automatically detecting deviations in processes composed by non-atomic events. The method uses the temporal dimension of non-atomic events to apply deviance mining, generating insights on how the duration and the simultaneous occurrence of events generate deviations and how these deviations affect the results of the processes. The method was successfully applied in the COVID-19 domain, to find which domain-specific sequences of nonpharmaceutical interventions mostly contributed to slowing down the rate of COVID-19 cases in countries around the world. [pt] MELHORIA DE PROCESSOS [pt] PROCESSOS CONTINUOS [pt] MINERACAO DE DESVIOS [pt] EVENTOS NAO ATOMICOS [pt] COVID 19 [en] PROCESS IMPROVEMENTS [en] CONTINUOUS PROCESS [en] DEVIANCE MINING [en] NON-ATOMIC EVENTS [en] COVID 19
45	[en] BRANCH-CUT-AND-PRICE APPROACH FOR PROCESS DISCOVERY / [pt] UMA ABORDAGEM PARA MINERAÇÃO DE PROCESSOS USANDO GERAÇÃO DE COLUNAS E CORTES GEORGES MIRANDA SPYRIDES 28 May 2019 (has links) [pt] Descoberta de Processo significa determinar um modelo de processo a partir de um registro histórico de eventos de um processo de negócios. Muitos algoritmos de descoberta de processos tentam sintetizar uma rede de Petri que representa o registro localizando locais e arcos que relacionam as classes de eventos. Bergenthum et al (2007) e van der Werf et al. (2008) propõem formulações para este problema descobrir um place de cada vez, em que cada solução básica do conjunto de desigualdades representa um lugar candidato. Propomos uma formulação global de programação inteira que, dado um registro histórico, determina todos os places e arcos que definem uma rede de Petri de uma só vez. Este modelo é uma alternativa a seleção de locais, mas tem um problema de eficiência devido à grande quantidade de variáveis inteiras usadas. Também propomos um método de decomposição para o modelo ILP global para tratar cada place e suas restrições associadas como um subproblema separado. Conseguimos então executar o algoritmo em instâncias sintéticas grandes, o que é inédito para esta classe de mineradores de processo. / [en] Process Discovery amounts to determine a process model from an event log of a business process. Many process discovery algorithms try to synthesize a Petri net representing the log by finding places and arcs that relate the event classes. Bergenthum et al. (2007) and van der Werf et al. (2008) propose formulations for this problem discover one place at a time, in which each basic solution of the set of inequalities represents a candidate place. We propose a global integer programming formulation that, given a log, determines all places and arcs defining a Petri net. This model simplifies the selection of places but has an efficiency problem due to a large number of integer variables used. We also propose a decomposition method for the global ILP model to treat each place and their associated constraints as a separate sub-problem. We can run the algorithm on large synthetic instances, which is unprecedented for this kind of process miner. [pt] PROGRAMACAO INTEIRA [pt] DESCOBERTA DE PROCESSO [pt] MINERACAO DE PROCESSOS [pt] GERACAO DE COLUNAS [en] INTER LINEAR PROGRAMMIN [en] PROCESS DISCOVERY [en] PROCESS MINING [en] COLUMN GERERATION
46	[pt] ANÁLISE NUMÉRICA DE TALUDES DE MINERAÇÃO A CÉU ABERTO EM MACIÇO DE ROCHA FRATURADA / [en] NUMERICAL SLOPE ANALYSIS OF AN OPEN PIT MINE IN JOINTED ROCK MASS DEYSI MILAGRITOS GARCIA AGUILAR 26 October 2016 (has links) [pt] Esta dissertação tem como objetivo principal estudar o comportamento dos taludes da mineração Pampa de Pongo, situada no Peru, em termos de deslocamentos e fatores de segurança contra ruptura por cisalhamento. As escavações atingem atualmente profundidades superiores a 900m com o corpo mineral seccionado por várias falhas geológicas que podem aumentar o potencial de risco de colapso dos taludes. As análises tensão x deformação foram executadas pelo método dos elementos finitos, em modelos bi e tridimensionais, considerando a existência e a ausência do sistema de falhas para melhor estimar e compreender a influência destas nos deslocamentos do maciço com o avanço das escavações. Este estudo também investigou as diferenças observadas nos resultados obtidos com dois programas comerciais de elementos finitos para solução de problemas geotécnicos (Phase2 e Plaxis 2D) tendo em vista as diferentes metodologias por eles empregadas na simulação de fraturas em meios contínuos. Nas análises de estabilidade os valores dos fatores de segurança determinados pelo método dos elementos finitos, em simulações 2D e 3D, foram também comparados com aqueles calculados por método de equilíbrio limite (método das fatias). De modo geral, observou-se que os efeitos das descontinuidades são maiores nos campos de deslocamentos horizontais e que também tendem a diminuir os valores de fator de segurança, inclusive para limites mínimos abaixo dos recomendados por normas brasileiras. / [en] This thesis aims to study the behavior of the Pampa de Pongo open pit mine, situated in Peru, in terms of displacements and safety factors against shear failure. Excavations currently reach depths greater than 900m and the orebody is sectioned by several geological faults that may increase the potential risk of collapse of embankments. The stress x strain analyses were carried out by the finite element method, with two and three dimensional models, considering the existence and the absence of the fault system, to better estimate and understand the influence of these discontinuities in the displacements of the rock mass with the advancement of excavations. This study also investigated the differences in results obtained with two commercial finite element programs for solution of geotechnical problems (Phase2 and Plaxis 2D) since they use different techniques to simulate fractures in continuous media. In the stability analyses, the values of safety factors determined by the finite element method in 2D and 3D simulations were also compared with those calculated by the equilibrium method limit (method of slices). In general, it could be observed that the effects of discontinuities are larger on the horizontal displacement fields and they also tend to decrease the values of safety factors, even below the minimum thresholds recommended by Brazilian Standards. [pt] ESTABILIDADE DE TALUDE [pt] MINERACAO A CEU ABERTO [pt] TALUDE DE GRANDE ALTURA [pt] EQUILIBRIO LIMITE [pt] ANALISE DE ELEMENTOS FINITOS [en] ROCK SLOPE STABILITY [en] LIMIT BALANCE [en] FINITE ELEMENTS ANALYSIS
47	[en] INTELLIGENT ASSISTANCE FOR KDD-PROCESS ORIENTATION / [pt] ASSISTÊNCIA INTELIGENTE À ORIENTAÇÃO DO PROCESSO DE DESCOBERTA DE CONHECIMENTO EM BASES DE DADOS RONALDO RIBEIRO GOLDSCHMIDT 15 December 2003 (has links) [pt] A notória complexidade inerente ao processo de KDD - Descoberta de Conhecimento em Bases de Dados - decorre essencialmente de aspectos relacionados ao controle e à condução deste processo (Fayyad et al., 1996b; Hellerstein et al., 1999). De uma maneira geral, estes aspectos envolvem dificuldades em perceber inúmeros fatos cuja origem e os níveis de detalhe são os mais diversos e difusos, em interpretar adequadamente estes fatos, em conjugar dinamicamente tais interpretações e em decidir que ações devem ser realizadas de forma a procurar obter bons resultados. Como identificar precisamente os objetivos do processo, como escolher dentre os inúmeros algoritmos de mineração e de pré-processamento de dados existentes e, sobretudo, como utilizar adequadamente os algoritmos escolhidos em cada situação são alguns exemplos das complexas e recorrentes questões na condução de processos de KDD. Cabe ao analista humano a árdua tarefa de orientar a execução de processos de KDD. Para tanto, diante de cada cenário, o homem utiliza sua experiência anterior, seus conhecimentos e sua intuição para interpretar e combinar os fatos de forma a decidir qual a estratégia a ser adotada (Fayyad et al., 1996a, b; Wirth et al., 1998). Embora reconhecidamente úteis e desejáveis, são poucas as alternativas computacionais existentes voltadas a auxiliar o homem na condução do processo de KDD (Engels, 1996; Amant e Cohen, 1997; Livingston, 2001; Bernstein et al., 2002; Brazdil et al., 2003). Aliado ao exposto acima, a demanda por aplicações de KDD em diversas áreas vem crescendo de forma muito acentuada nos últimos anos (Buchanan, 2000). É muito comum não existirem profissionais com experiência em KDD disponíveis para atender a esta crescente demanda (Piatetsky-Shapiro, 1999). Neste contexto, a criação de ferramentas inteligentes que auxiliem o homem no controle do processo de KDD se mostra ainda mais oportuna (Brachman e Anand, 1996; Mitchell, 1997). Assim sendo, esta tese teve como objetivos pesquisar, propor, desenvolver e avaliar uma Máquina de Assistência Inteligente à Orientação do Processo de KDD que possa ser utilizada, fundamentalmente, como instrumento didático voltado à formação de profissionais especializados na área da Descoberta de Conhecimento em Bases de Dados. A máquina proposta foi formalizada com base na Teoria do Planejamento para Resolução de Problemas (Russell e Norvig, 1995) da Inteligência Artificial e implementada a partir da integração de funções de assistência utilizadas em diferentes níveis de controle do processo de KDD: Definição de Objetivos, Planejamento de Ações de KDD, Execução dos Planos de Ações de KDD e Aquisição e Formalização do Conhecimento. A Assistência à Definição de Objetivos tem como meta auxiliar o homem na identificação de tarefas de KDD cuja execução seja potencialmente viável em aplicações de KDD. Esta assistência foi inspirada na percepção de um certo tipo de semelhança no nível intensional apresentado entre determinados bancos de dados. Tal percepção auxilia na prospecção do tipo de conhecimento a ser procurado, uma vez que conjuntos de dados com estruturas similares tendem a despertar interesses similares mesmo em aplicações de KDD distintas. Conceitos da Teoria da Equivalência entre Atributos de Bancos de Dados (Larson et al., 1989) viabilizam a utilização de uma estrutura comum na qual qualquer base de dados pode ser representada. Desta forma, bases de dados, ao serem representadas na nova estrutura, podem ser mapeadas em tarefas de KDD, compatíveis com tal estrutura. Conceitos de Espaços Topológicos (Lipschutz, 1979) e recursos de Redes Neurais Artificiais (Haykin, 1999) são utilizados para viabilizar os mapeamentos entre padrões heterogêneos. Uma vez definidos os objetivos em uma aplicação de KDD, decisões sobre como tais objetivos podem ser alcançados se tornam necessárias. O primeiro passo envolve a escolha de qual algoritmo de mineração de dados é o mais apropriado para o problema em questão. A Assistência ao Planejamento de Ações de KDD auxilia o homem nesta escolha. Utiliza, para tanto, uma metodologia de ordenação dos algoritmos de mineração baseada no desempenho prévio destes algoritmos em problemas similares (Soares et al., 2001; Brazdil et al., 2003). Critérios de ordenação de algoritmos baseados em similaridade entre bases de dados nos níveis intensional e extensional foram propostos, descritos e avaliados. A partir da escolha de um ou mais algoritmos de mineração de dados, o passo seguinte requer a escolha de como deverá ser realizado o pré-processamento dos dados. Devido à diversidade de algoritmos de pré-processamento, são muitas as alternativas de combinação entre eles (Bernstein et al., 2002). A Assistência ao Planejamento de Ações de KDD também auxilia o homem na formulação e na escolha do plano ou dos planos de ações de KDD a serem adotados. Utiliza, para tanto, conceitos da Teoria do Planejamento para Resolução de Problemas. Uma vez escolhido um plano de ações de KDD, surge a necessidade de executá-lo. A execução de um plano de ações de KDD compreende a execução, de forma ordenada, dos algoritmos de KDD previstos no plano. A execução de um algoritmo de KDD requer conhecimento sobre ele. A Assistência à Execução dos Planos de Ações de KDD provê orientações específicas sobre algoritmos de KDD. Adicionalmente, esta assistência dispõe de mecanismos que auxiliam, de forma especializada, no processo de execução de algoritmos de KDD e na análise dos resultados obtidos. Alguns destes mecanismos foram descritos e avaliados. A execução da Assistência à Aquisição e Formalização do Conhecimento constitui-se em um requisito operacional ao funcionamento da máquina proposta. Tal assistência tem por objetivo adquirir e disponibilizar os conhecimentos sobre KDD em uma representação e uma organização que viabilizem o processamento das funções de assistência mencionadas anteriormente. Diversos recursos e técnicas de aquisição de conhecimento foram utilizados na concepção desta assistência. / [en] Generally speaking, such aspects involve difficulties in perceiving innumerable facts whose origin and levels of detail are highly diverse and diffused, in adequately interpreting these facts, in dynamically conjugating such interpretations, and in deciding which actions must be performed in order to obtain good results. How are the objectives of the process to be identified in a precise manner? How is one among the countless existing data mining and preprocessing algorithms to be selected? And most importantly, how can the selected algorithms be put to suitable use in each different situation? These are but a few examples of the complex and recurrent questions that are posed when KDD processes are performed. Human analysts must cope with the arduous task of orienting the execution of KDD processes. To this end, in face of each different scenario, humans resort to their previous experiences, their knowledge, and their intuition in order to interpret and combine the facts and therefore be able to decide on the strategy to be adopted (Fayyad et al., 1996a, b; Wirth et al., 1998). Although the existing computational alternatives have proved to be useful and desirable, few of them are designed to help humans to perform KDD processes (Engels, 1996; Amant and Cohen, 1997; Livingston, 2001; Bernstein et al., 2002; Brazdil et al., 2003). In association with the above-mentioned fact, the demand for KDD applications in several different areas has increased dramatically in the past few years (Buchanan, 2000). Quite commonly, the number of available practitioners with experience in KDD is not sufficient to satisfy this growing demand (Piatetsky-Shapiro, 1999). Within such a context, the creation of intelligent tools that aim to assist humans in controlling KDD processes proves to be even more opportune (Brachman and Anand, 1996; Mitchell, 1997). Such being the case, the objectives of this thesis were to investigate, propose, develop, and evaluate an Intelligent Machine for KDD-Process Orientation that is basically intended to serve as a teaching tool to be used in professional specialization courses in the area of Knowledge Discovery in Databases. The basis for formalization of the proposed machine was the Planning Theory for Problem-Solving (Russell and Norvig, 1995) in Artificial Intelligence. Its implementation was based on the integration of assistance functions that are used at different KDD process control levels: Goal Definition, KDD Action-Planning, KDD Action Plan Execution, and Knowledge Acquisition and Formalization. The Goal Definition Assistant aims to assist humans in identifying KDD tasks that are potentially executable in KDD applications. This assistant was inspired by the detection of a certain type of similarity between the intensional levels presented by certain databases. The observation of this fact helps humans to mine the type of knowledge that must be discovered since data sets with similar structures tend to arouse similar interests even in distinct KDD applications. Concepts from the Theory of Attribute Equivalence in Databases (Larson et al., 1989) make it possible to use a common structure in which any database may be represented. In this manner, when databases are represented in the new structure, it is possible to map them into KDD tasks that are compatible with such a structure. Topological space concepts and ANN resources as described in Topological Spaces (Lipschutz, 1979) and Artificial Neural Nets (Haykin, 1999) have been employed so as to allow mapping between heterogeneous patterns. After the goals have been defined in a KDD application, it is necessary to decide how such goals are to be achieved. The first step involves selecting the most appropriate data mining algorithm for the problem at hand. The KDD Action-Planning Assistant helps humans to make this choice. To this end, it makes use of a methodology for ordering the mining algorithms that is based on the previous experiences, their knowledge, and their intuition in order to interpret and combine the facts and therefore be able to decide on the strategy to be adopted (Fayyad et al., 1996a, b; Wirth et al., 1998). Although the existing computational alternatives have proved to be useful and desirable, few of them are designed to help humans to perform KDD processes (Engels, 1996; Amant & Cohen, 1997; Livingston, 2001; Bernstein et al., 2002; Brazdil et al., 2003). In association with the above-mentioned fact, the demand for KDD applications in several different areas has increased dramatically in the past few years (Buchanan, 2000). Quite commonly, the number of available practitioners with experience in KDD is not sufficient to satisfy this growing demand (Piatetsky-Shapiro, 1999). Within such a context, the creation of intelligent tools that aim to assist humans in controlling KDD processes proves to be even more opportune (Brachman & Anand, 1996; Mitchell, 1997). Such being the case, the objectives of this thesis were to investigate, propose, develop, and evaluate an Intelligent Machine for KDD-Process Orientation that is basically intended to serve as a teaching tool to be used in professional specialization courses in the area of Knowledge Discovery in Databases. The basis for formalization of the proposed machine was the Planning Theory for Problem-Solving (Russell and Norvig, 1995) in Artificial Intelligence. Its implementation was based on the integration of assistance functions that are used at different KDD process control levels: Goal Definition, KDD Action- Planning, KDD Action Plan Execution, and Knowledge Acquisition and Formalization. The Goal Definition Assistant aims to assist humans in identifying KDD tasks that are potentially executable in KDD applications. This assistant was inspired by the detection of a certain type of similarity between the intensional levels presented by certain databases. The observation of this fact helps humans to mine the type of knowledge that must be discovered since data sets with similar structures tend to arouse similar interests even in distinct KDD applications. Concepts from the Theory of Attribute Equivalence in Databases (Larson et al., 1989) make it possible to use a common structure in which any database may be represented. In this manner, when databases are represented in the new structure, it is possible to map them into KDD tasks that are compatible with such a structure. Topological space concepts and ANN resources as described in Topological Spaces (Lipschutz, 1979) and Artificial Neural Nets (Haykin, 1999) have been employed so as to allow mapping between heterogeneous patterns. After the goals have been defined in a KDD application, it is necessary to decide how such goals are to be achieved. The first step involves selecting the most appropriate data mining algorithm for the problem at hand. The KDD Action-Planning Assistant helps humans to make this choice. To this end, it makes use of a methodology for ordering the mining algorithms that is based on the previous performance of these algorithms in similar problems (Soares et al., 2001; Brazdil et al., 2003). Algorithm ordering criteria based on database similarity at the intensional and extensional levels were proposed, described and evaluated. The data mining algorithm or algorithms having been selected, the next step involves selecting the way in which data preprocessing is to be performed. Since there is a large variety of preprocessing algorithms, many are the alternatives for combining them (Bernstein et al., 2002). The KDD Action-Planning Assistant also helps humans to formulate and to select the KDD action plan or plans to be adopted. To this end, it makes use of concepts contained in the Planning Theory for Problem-Solving. Once a KDD action plan has been chosen, it is necessary to execute it. Executing a KDD action plan involves the ordered execution of the KDD algorithms that have been anticipated in the plan. Executing a KDD algorithm requires knowledge about it. The KDD Action Plan Execution Assistant provides specific guidance on KDD algorithms. In addition, this assistant is equipped with mechanisms that provide specialized assistance for performing the KDD algorithm execution process and for analyzing the results obtained. Some of these mechanisms have been described and evaluated. The execution of the Knowledge Acquisition and Formalization Assistant is an operational requirement for running the proposed machine. The objective of this assistant is to acquire knowledge about KDD and to make such knowledge available by representing and organizing it a way that makes it possible to process the above-mentioned assistance functions. A variety of knowledge acquisition resources and techniques were employed in the conception of this assistant. [pt] MINERACAO DE DADOS [en] DATA MINING [en] KNOWLEDGE DISCOVERY IN DATABASES [en] KDD TASK DEFINITION ASSISTANCE [pt] PLANEJAMENTO EM KDD [en] PLANNING IN KDD
48	[en] SEMANTIC INFERENCES IN INFORMATION RETRIEVAL FOR HYPERMEDIA APPLICATIONS / [pt] INFERÊNCIAS SEMÂNTICAS NA RECUPERAÇÃO DE INFORMAÇÕES PARA APLICAÇÕES HIPERMÍDIA CRISTIANO BRAZ ROCHA 27 October 2003 (has links) [pt] O problema de sobrecarga de informação é um dos mais sérios enfrentados atualmente. Para tentar resolver esse problema, áreas distintas como Gestão do Conhecimento, Web Semântica e Modelagem de Aplicações Hipermídia têm utilizado soluções parecidas que consistem basicamente na estruturação semântica da informação, para que ela seja mais facilmente acessada. Esta dissertação propõe uma infra-estrutura baseada em técnicas e algoritmos clássicos da área de Inteligência Artificial, que aproveita a crescente disponibilidade de modelos relativos a um domínio para permitir que as aplicações onde os mesmos estão definidos realizem inferências sobre o domínio em questão. Isso possibilita a introdução de diversas novas funcionalidades nessas aplicações. Foram propostas e desenvolvidas quatro novas funcionalidades, a principal sendo a busca semântica. As novas funcionalidades foram testadas com grande sucesso em duas aplicações: o site do Departamento de Informática da PUC-Rio e o Portal do Conhecimento a respeito da obra do grande pintor brasileiro Candido Portinari. / [en] The information overload problem is one of the most challenging problems being faced today. In order to solve this problem, different areas such as Knowledge Management, Semantic Web and Hypermedia Applications Modeling have used similar solutions that consist basically of semantically structuring the information so it can be better accessed. This dissertation proposes an infrastructure based on classic algorithms and techniques of Artificial Intelligence that utilizes the increase in the availability of domain specific models to enable the applications where they are defined to make inferences about these particular domains. These inferences enable the creation of new functionalities in these applications. Four new functionalities were proposed and implemented, the most important being a semantic search. The new functionalities presented were successfully tested in two existing applications: the website of the Computer Science Department of PUC-Rio and the Portinari Knowledge Portal that presents all the work of the famous brazilian painter Candido Portinari. [pt] MINERACAO DE DADOS [en] DATA MINING [pt] ONTOLOGIAS [en] ONTOLOGIES [pt] MARKETING INTERNACIONAL [en] INTERNATIONAL MARKETING [pt] INFERENCIAS [en] INFERENCES [pt] RECUPERACAO DE INFORMACAO [en] INFORMATION RETRIEVAL [pt] APLICACOES HIPERMIDIA [en] KNOWLEDGE MODELING [en] SPREAD ACTIVATION ALGORITHMS
49	[en] TIME SERIES ANALYSIS USING SINGULAR SPECTRUM ANALYSIS (SSA) AND BASED DENSITY CLUSTERING OF THE COMPONENTS / [pt] ANÁLISE DE SÉRIES TEMPORAIS USANDO ANÁLISE ESPECTRAL SINGULAR (SSA) E CLUSTERIZAÇÃO DE SUAS COMPONENTES BASEADA EM DENSIDADE KEILA MARA CASSIANO 19 June 2015 (has links) [pt] Esta tese propõe a utilização do DBSCAN (Density Based Spatial Clustering of Applications with Noise) para separar os componentes de ruído na fase de agrupamento das autotriplas da Análise Singular Espectral (SSA) de Séries Temporais. O DBSCAN é um método moderno de clusterização (revisto em 2013) e especialista em identificar ruído através de regiões de menor densidade. O método de agrupamento hierárquico até então é a última inovação na separação de ruído na abordagem SSA, implementado no pacote R- SSA. No entanto, o método de agrupamento hierárquico é muito sensível a ruído, não é capaz de separá-lo corretamente, não deve ser usado em conjuntos com diferentes densidades e não funciona bem no agrupamento de séries temporais de diferentes tendências, ao contrário dos métodos de aglomeração à base de densidade que são eficazes para separar o ruído a partir dos dados e dedicados para trabalhar bem em dados a partir de diferentes densidades. Este trabalho mostra uma melhor eficiência de DBSCAN sobre os outros métodos já utilizados nesta etapa do SSA, garantindo considerável redução de ruídos e proporcionando melhores previsões. O resultado é apoiado por avaliações experimentais realizadas para séries simuladas de modelos estacionários e não estacionários. A combinação de metodologias proposta também foi aplicada com sucesso na previsão de uma série real de velocidade do vento. / [en] This thesis proposes using DBSCAN (Density Based Spatial Clustering of Applications with Noise) to separate the noise components of eigentriples in the grouping stage of the Singular Spectrum Analysis (SSA) of Time Series. The DBSCAN is a modern (revised in 2013) and expert method at identify noise through regions of lower density. The hierarchical clustering method was the last innovation in noise separation in SSA approach, implemented on package R-SSA. However, is repeated in the literature that the hierarquical clustering method is very sensitive to noise, is unable to separate it correctly, and should not be used in clusters with varying densities and neither works well in clustering time series of different trends. Unlike, the methods of density based clustering are effective in separating the noise from the data and dedicated to work well on data from different densities This work shows better efficiency of DBSCAN over the others methods already used in this stage of SSA, because it allows considerable reduction of noise and provides better forecasting. The result is supported by experimental evaluations realized for simulated stationary and non-stationary series. The proposed combination of methodologies also was applied successfully to forecasting real series of wind s speed. [pt] MINERACAO DE DADOS [en] DATA MINING [pt] SERIES TEMPORAIS [en] TIME SERIES [pt] PREVISAO [en] FORECASTING [pt] ENERGIA EOLICA [en] WIND ENERGY [pt] MODELOS ARIMA [pt] ANALISE SINGULAR ESPECTRAL [pt] CLUSTERIZACAO BASEADA EM DENSIDADE [pt] DBSCAN [pt] PREVISAO SSA
50	[en] PURCHASING PORTFOLIO APPLIED IN STRATEGIES DEVELOPMENT: CASE STUDY ON A MINING COMPANY / [pt] UTILIZAÇÃO DO PORTFÓLIO DE COMPRAS PARA DESENVOLVIMENTO DE ESTRATÉGIAS: ESTUDO DE CASO EM UMA EMPRESA DE MINERAÇÃO FELIPPE GUERREIRO GASPAR DE OLIVEIRA 26 September 2016 (has links) [pt] Com a dinâmica da economia mundial e a crescente competitividade nos mercados, a área de compras ganha relevância e torna temas como risco de fornecimento, redução de custos, características do mercado fornecedor, parcerias estratégicas, entre outros, presentes no dia a dia das empresas. Neste cenário, uma estratégia de compras bem definida faz diferença. O desenvolvimento de uma estratégia de compras é complexo, pois as empresas lidam com uma grande variedade de itens e serviços, muitos fornecedores, bem como situações de compras heterogêneas e, por esses motivos, precisam de ferramentas analíticas avançadas para desenvolver estratégias eficazes. Devido a isso, existem várias metodologias que suportam este processo, como Análise SWOT, Cinco Forças de Porter e Portfólio de Compras. Apesar de ser relevante o portfólio de compras no desenvolvimento das estratégias, pouco se conhece sobre a real aderência dos mesmos na prática. Essa dissertação traz conceitos da literatura acadêmica sobre os portfólios de compras e apresenta um estudo de caso prático, sobre a utilização do mesmo no desenvolvimento da estratégia, dentro de uma grande empresa do setor de mineração. Com este trabalho pretende-se esclarecer para a literatura como o modelo de portfólio de compras é utilizado para o desenvolvimento de estratégias na prática, identificando as adaptações necessárias para torná-los aderentes a uma situação real. Também pretende-se identificar se existe alguma oportunidade de aperfeiçoamento do portfólio de compras utilizado pela organização corporativa estudada. / [en] With the dynamics of global economy and the increasing market competitiveness, procurement becomes more relevant making subjects like supply risk, cost reduction, supplier market characteristics, strategic partnerships, among others, present in the business day life. In scenario like this, a good purchasing strategy makes the difference. Developing a purchasing strategy is complex as companies deal with a great variety of items and services, as well as diversified purchasing situations, therefore, it is required to have advanced analytical tools in order to develop efficient strategies. Based on this, several methodologies support this process such as SWOT Analysis, the Porter s Five Forces and the Purchasing Portfolio. Despite the relevance of purchasing portfolio use in strategies development, there is little knowledge regarding its practical use. This dissertation brings concepts over the purchasing portfolio, presenting a practical case study on its use for strategy development within a big mining company. This work goal is to seek literature clearance of how Purchasing Portfolio is use in practical strategy development; identifying variation needs to adapt them into real situations. Furthermore, we expect to identify if there is any new improvement opportunity for the Purchasing Portfolio used in the corporate organization studied. [pt] ESTUDO DE CASO [pt] DIMENSAO DO PORTFOLIO DE COMPRA [pt] MODELO DE PORTFOLIO DE COMPRAS [pt] MINERACAO [pt] ESTRATEGIA DE COMPRA [pt] COMPRA [pt] SUPRIMENTO [en] CASE STUDY [en] PURCHASING PORTFOLIO MODEL [en] MINING [en] SUPPLY STRATEGIES [en] PURCHASES [en] PROCUREMENT

Search results