  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
131

Addressing Semantic Interoperability and Text Annotation Concerns in Electronic Health Records using Word Embedding, Ontology and Analogy

Naveed, Arjmand January 2021 (has links)
Electronic Health Records (EHRs) produce a huge number of databases that are updated dynamically. A major goal of interoperability in healthcare is to facilitate the seamless and secure exchange of healthcare-related data. Healthcare organisations face difficulties in exchanging patients' health information, laboratory reports and the like due to a lack of semantic interoperability. Hence, there is a need for semantic web technologies to address healthcare interoperability problems by enabling various healthcare entities (doctors, clinics, hospitals etc.), each with their own standards, to exchange data together with its semantics in a form that both machines and humans can understand. Thus, a framework with a similarity analyser that deals with semantic interoperability is proposed in this thesis. While dealing with semantic interoperability, another consideration was the use of word embedding and ontology for knowledge discovery. In the medical domain, the main challenge for a medical information extraction system is to find the required information by considering explicit and implicit clinical context with a high degree of precision and accuracy. For semantic similarity of medical text at different levels (concept, sentence and document level), various methods and techniques have been presented, but I ensured that the semantic content of a text includes the correct meaning of its words and sentences. A comparative analysis of the two pipelines, ontology followed by word embedding and vice versa, was carried out to determine which approach yields higher semantic similarity. Selecting a kidney cancer dataset as a use case, I concluded that each approach works better in different circumstances; however, the approach in which ontology is followed by word embedding, enriching the data first, showed better results.
Apart from enriching the EHR, extracting relevant information is also challenging. To address this challenge, the concept of analogy has been applied to explain similarities between two different contents, as analogies play a significant role in understanding new concepts. Analogy helps healthcare professionals communicate with patients effectively and helps patients understand their disease and treatment. I therefore utilised analogies in this thesis to support the extraction of relevant information from medical text. Since accessing EHRs has been challenging, tweet text is used as an alternative, social media having emerged as a relevant data source in recent years. An algorithm has been proposed to analyse medical tweets based on analogous words, and the results have been used to validate the proposed methods. Two experts from the medical domain gave their views on the proposed methods in comparison with a similar method named SemDeep. The quantitative and qualitative results show that the proposed analogy-based method brings diversity and is helpful in analysing a specific disease and in text classification.
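The sentence- and concept-level semantic similarity described above is commonly computed as the cosine between embedding vectors. A minimal sketch, with toy three-dimensional vectors and invented medical terms (a real system would use embeddings trained on a clinical corpus):

```python
import math

def cosine_similarity(u, v):
    """Cosine similarity between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

# Toy 3-dimensional "embeddings"; real vectors would be learned from
# a medical corpus (e.g. word2vec over clinical notes).
embeddings = {
    "kidney":    [0.9, 0.1, 0.2],
    "renal":     [0.85, 0.15, 0.25],
    "carcinoma": [0.2, 0.9, 0.1],
}

def most_similar(term, embeddings):
    """Rank the other terms by cosine similarity to `term`."""
    return sorted(
        (t for t in embeddings if t != term),
        key=lambda t: cosine_similarity(embeddings[term], embeddings[t]),
        reverse=True,
    )

print(most_similar("kidney", embeddings))  # 'renal' ranks above 'carcinoma'
```

Ontology-first enrichment, as favoured in the thesis, would expand each term with its ontology neighbours before this comparison is made.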
132

Learning lost temporal fuzzy association rules

Matthews, Stephen January 2012 (has links)
Fuzzy association rule mining discovers patterns in transactions, such as shopping baskets in a supermarket, or Web page accesses by a visitor to a Web site. Temporal patterns can be present in fuzzy association rules because the underlying process generating the data can be dynamic. However, existing solutions may not discover all interesting patterns because of a previously unrecognised problem that is revealed in this thesis: the contextual meaning of fuzzy association rules changes because of the dynamic nature of the data, so a static fuzzy representation and a traditional search method are inadequate. The Genetic Iterative Temporal Fuzzy Association Rule Mining (GITFARM) framework solves the problem by utilising flexible fuzzy representations from a fuzzy rule-based system (FRBS). The temporal, fuzzy and itemset spaces are searched simultaneously with a genetic algorithm (GA) to overcome the problem. The framework transforms the dataset into a graph for efficient searching. The choice of fuzzy representation provides a trade-off between an approximate and a descriptive model. A method for verifying the solution to the hypothesised problem is presented. The proposed GA-based solution was compared with a traditional approach that uses an exhaustive search method, and it was shown that the GA-based solution discovered rules that the traditional approach did not. This shows that simultaneously searching for rules and membership functions with a GA is a suitable solution for mining temporal fuzzy association rules. So, in practice, more knowledge can be discovered for making well-informed decisions that would otherwise be lost with a traditional approach.
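The fuzzy and temporal ingredients of such rules can be illustrated with a small sketch: a triangular membership function, a toy transaction log, and the fuzzy support of a rule inside and outside a time window. All names, shapes and numbers here are invented; the GITFARM framework itself searches over membership functions and windows with a GA rather than fixing them by hand.

```python
def triangular(x, a, b, c):
    """Triangular membership function with feet a, c and peak b (a < b < c)."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

# Hypothetical fuzzy set "high quantity" and a small transaction log:
# (time step, bread quantity, milk quantity).
HIGH = (2, 6, 10)
transactions = [
    (1, 5, 4), (2, 7, 6), (3, 6, 7), (8, 1, 0), (9, 2, 1),
]

def fuzzy_support(rows, high):
    """Mean of min memberships: support of {bread.high, milk.high}."""
    degs = [min(triangular(b, *high), triangular(m, *high)) for _, b, m in rows]
    return sum(degs) / len(degs)

def temporal_fuzzy_support(rows, high, t0, t1):
    """The same support restricted to a candidate time window [t0, t1]."""
    window = [r for r in rows if t0 <= r[0] <= t1]
    return fuzzy_support(window, high)

# The rule is much stronger inside the window 1..3 than over the whole log,
# which is exactly the kind of pattern a purely static search would miss.
print(temporal_fuzzy_support(transactions, HIGH, 1, 3))
print(fuzzy_support(transactions, HIGH))
```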
133

HIV Patient Monitoring Framework Through Knowledge Engineering

Otine, Charles January 2012 (has links)
Uganda has registered more than a million deaths since the HIV virus was first officially reported in the country over three decades ago. The government, in partnership with different groups, has implemented different programmes to address the epidemic. Support from donors and reductions in the price of treatment shifted the focus to antiretroviral therapy access for those affected. Presently only a quarter of the approximately one million people infected with HIV in Uganda are undergoing antiretroviral therapy. The number of patients poses a challenge for monitoring therapy, given the overall resource needs for healthcare in the country. Furthermore, the number on antiretroviral therapy is set to increase, in addition to the stringent requirements for tracking and monitoring each individual patient during therapy. This research aimed at developing a framework for adopting knowledge engineering in information systems for monitoring HIV/AIDS patients. An open source approach was adopted, given the resource-constrained context of the study, to ensure a cost-effective and sustainable solution. The research was motivated by the inconclusive literature on open source dimensional models for data warehouses and data mining for monitoring antiretroviral therapy. The first phase of the research involved a situational analysis of HIV in healthcare and of the different healthcare information systems in the country. An analysis of the strengths, weaknesses and opportunities of the healthcare system to adopt knowledge bases was carried out, and a dimensional model for implementing a data warehouse focused on monitoring HIV patients was proposed. The second phase involved the development of a knowledge base in the form of an open source data warehouse, its simulation and testing. The study involved interdisciplinary collaboration between different stakeholders in the research domain and adopted a participatory action research methodology.
This involved identifying the most appropriate technologies to foster this collaboration. An analysis was made of how stakeholders can take ownership of the basic HIV health information system architecture as their expertise in managing the systems grows, and make changes to draw even better results from system functionality. Data mining simulations were run on the data warehouse, out of which two machine learning models (regression and classification) were developed and tested using data from the warehouse. The models were used to predict patient viral load from CD4 count test figures and to classify cases of treatment failure with 83% accuracy. The research additionally presents an open source dimensional model for monitoring antiretroviral therapy and the status of information systems in healthcare. An architecture is presented showing the integration of the different knowledge engineering components in the study, including the data warehouse, the data mining platform and user interaction.
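The regression component, predicting viral load from CD4 counts, can be sketched as an ordinary least-squares fit. The figures below are synthetic illustrations, not the study's data; real coefficients would come from the warehouse.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx

# Synthetic illustration: CD4 counts (cells/mm3) and log10 viral load.
cd4 = [100, 200, 300, 400, 500, 600]
vl  = [5.2, 4.8, 4.1, 3.6, 3.1, 2.5]

slope, intercept = fit_line(cd4, vl)

def predict(cd4_count):
    """Predicted log10 viral load for a given CD4 count."""
    return slope * cd4_count + intercept

print(round(predict(350), 2))  # falls between the 300 and 400 observations
```

The negative slope reflects the usual inverse relationship between CD4 count and viral load; the classification model mentioned above would sit alongside this as a separate component.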
134

Knowledge discovery for moderating collaborative projects

Choudhary, Alok K. January 2009 (has links)
In today's global market environment, enterprises are increasingly turning towards collaboration in projects to leverage their resources, skills and expertise, and simultaneously address the challenges posed in diverse and competitive markets. Moderators, which are knowledge-based systems, have successfully been used to support collaborative teams by raising awareness of problems or conflicts. However, the functioning of a moderator is limited to the knowledge it has about the team members. Knowledge acquisition, learning and updating of knowledge are the major challenges for a Moderator's implementation. To address these challenges, a Knowledge discOvery And daTa minINg inteGrated (KOATING) framework is presented for Moderators, enabling them to continuously learn from the operational databases of the company and semi-automatically update the corresponding expert module. The architecture for the Universal Knowledge Moderator (UKM) shows how existing moderators can be extended to support global manufacturing. A method for designing and developing the knowledge acquisition module of the Moderator, for manual and semi-automatic update of knowledge, is documented using the Unified Modelling Language (UML). UML has been used to explore the static structure and dynamic behaviour, and to describe the system analysis, design and development aspects of the proposed KOATING framework. The proof of design is presented using a case study of a collaborative project in the form of a construction project supply chain. It is shown that Moderators can "learn" by extracting various kinds of knowledge from Post Project Reports (PPRs) using different types of text mining techniques. Furthermore, it is proposed that knowledge discovery integrated moderators can be used to support and enhance collaboration by identifying appropriate business opportunities and corresponding partners for the creation of a virtual organization.
A case study is presented in the context of a UK based SME. Finally, this thesis concludes by summarizing the thesis, outlining its novelties and contributions, and recommending future research.
135

A proposal for the protection of digital databases in Sri Lanka

Abeysekara, Thusitha Bernad January 2013 (has links)
Economic development in Sri Lanka has relied heavily on foreign and domestic investment. Digital databases are a new and attractive area for this investment. This thesis argues that such investment needs protection, which is crucial to attract future investment, and therefore proposes a digital database protection mechanism with a view to attracting investment in digital databases to Sri Lanka. The research examines various existing protection measures while focusing mainly on sui generis right protection, which covers qualitatively and/or quantitatively substantial investment in the obtaining, verification or presentation of the contents of digital databases. In digital databases, this process is carried out by computer programs, which establish meaningful and useful data patterns through data mining and subsequently use those patterns in knowledge discovery in databases. Those processes enhance the value and/or usefulness of the data and information. Computer programs need to be protected, this thesis proposes, by virtue of patent protection, because the process carried out by computer programs is a technical process, an area that patents are particularly suited to protect. All intellectual property concepts under the existing mechanisms address the issue of investment in databases in different ways; these include copyright, contract, unfair competition law and misappropriation, and sui generis right protection. Since the primary objective of the thesis is to introduce a protection system for encouraging qualitative and quantitative investment in digital databases in Sri Lanka, it suggests a set of mechanisms and rights which draws on the existing intellectual property protection mechanisms for databases.
The ultimate goal of the proposed protection mechanisms and rights is to improve the laws pertaining to the protection of digital databases in Sri Lanka in order to attract investment, to protect the rights and duties of the digital database users and owners/authors and, eventually, to bring positive economic effects to the country. Since digital database protection is a new concept in the Sri Lankan legal context, this research will provide guidelines for policy-makers, judges and lawyers in Sri Lanka and throughout the South Asian region.
136

An Exploratory Analysis of Twitter Keyword-Hashtag Networks and Knowledge Discovery Applications

Hamed, Ahmed A 01 January 2014 (has links)
The emergence of social media has impacted the way people think, communicate, behave, learn, and conduct research. In recent years, a large number of studies have analyzed and modeled this social phenomenon. Driven by commercial and social interests, social media has become an attractive subject for researchers. Accordingly, new models, algorithms, and applications to address specific domains and solve distinct problems have emerged. In this thesis, we propose a novel network model and a path mining algorithm called HashnetMiner to discover implicit knowledge that is not easily exposed using other network models. Our experiments using HashnetMiner have demonstrated anecdotal evidence of drug-drug interactions when applied to a drug reaction context. The proposed research comprises three parts built upon the common theme of utilizing hashtags in tweets. (1) Digital Recruitment on Twitter. We build an expert system shell for two different studies: a nicotine patch study, where the system reads streams of tweets in real time and decides whether to recruit the senders to participate in the study, and an environmental health study, where the system identifies individuals who can participate in a survey using Twitter. (2) Does Social Media Big Data Make the World Smaller? This work provides an exploratory analysis of large-scale keyword-hashtag (K-H) networks generated from Twitter. We use two different measures: the number of vertices that connect any two keywords, and the eccentricity of keyword vertices, a well-known centrality and shortest-path measure. Our analysis shows that K-H networks conform to the shrinking-world phenomenon and expose hidden paths among concepts. (3) We pose the following biomedical web science question: can patterns identified in Twitter hashtags provide clinicians with a powerful tool to extrapolate new medical therapies and/or drugs?
We present a systematic network mining method, HashnetMiner, that operates on networks of medical concepts and hashtags. To the best of our knowledge, this is the first effort to present Biomedical Web Science models and algorithms that address such a question by means of data mining and knowledge discovery using hashtag-based networks.
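The eccentricity measure used in part (2) can be sketched on a toy keyword-hashtag network. The drug and hashtag names below are invented for illustration; a chain of shared hashtags is the kind of hidden path HashnetMiner looks for.

```python
from collections import deque

# Toy bipartite keyword-hashtag network (illustrative names only).
edges = [
    ("aspirin", "#bloodthinner"), ("#bloodthinner", "warfarin"),
    ("warfarin", "#bleeding"), ("#bleeding", "ibuprofen"),
]

graph = {}
for u, v in edges:
    graph.setdefault(u, set()).add(v)
    graph.setdefault(v, set()).add(u)

def eccentricity(graph, source):
    """Greatest shortest-path distance from `source` to any reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nxt in graph[node]:
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return max(dist.values())

# Keywords joined only through a chain of hashtags suggest an implicit
# (and possibly interesting) link between the underlying concepts.
print(eccentricity(graph, "aspirin"))  # 4: the path length to "ibuprofen"
```

Low eccentricities across many keyword pairs are what the "shrinking world" observation above refers to.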
137

Traitement de données numériques par analyse formelle de concepts et structures de patrons / Mining numerical data with formal concept analysis and pattern structures

Kaytoue, Mehdi 22 April 2011 (has links)
Le sujet principal de cette thèse porte sur la fouille de données numériques et plus particulièrement de données d'expression de gènes. Ces données caractérisent le comportement de gènes dans diverses situations biologiques (temps, cellule, etc.). Un problème important consiste à établir des groupes de gènes partageant un même comportement biologique. Cela permet d'identifier les gènes actifs lors d'un processus biologique, comme par exemple les gènes actifs lors de la défense d'un organisme face à une attaque. Le cadre de la thèse s'inscrit donc dans celui de l'extraction de connaissances à partir de données biologiques. Nous nous proposons d'étudier comment la méthode de classification conceptuelle qu'est l'analyse formelle de concepts (AFC) peut répondre au problème d'extraction de familles de gènes. Pour cela, nous avons développé et expérimenté diverses méthodes originales en nous appuyant sur une extension peu explorée de l'AFC : les structures de patrons. Plus précisément, nous montrons comment construire un treillis de concepts synthétisant des familles de gènes à comportement similaire. L'originalité de ce travail est (i) de construire un treillis de concepts sans discrétisation préalable des données de manière efficace, (ii) d'introduire une relation de similarité entre les gènes et (iii) de proposer des ensembles minimaux de conditions nécessaires et suffisantes expliquant les regroupements formés. Les résultats de ces travaux nous amènent également à montrer comment les structures de patrons peuvent améliorer la prise de décision quant à la dangerosité de pratiques agricoles dans le vaste domaine de la fusion d'information. / The main topic of this thesis addresses the important problem of mining numerical data, and especially gene expression data.
These data characterize the behaviour of thousands of genes in various biological situations (time, cell, etc.). A difficult task consists in clustering genes to obtain classes of genes with similar behaviour, supposed to be involved together in a biological process. Accordingly, we are interested in designing and comparing methods in the field of knowledge discovery from biological data. We propose to study how the conceptual classification method called Formal Concept Analysis (FCA) can handle the problem of extracting interesting classes of genes. For this purpose, we have designed and experimented with several original methods based on an extension of FCA called pattern structures. More precisely, we show how to build a concept lattice synthesizing classes of genes with similar behaviour; the originality of this work lies in (i) building the concept lattice efficiently without prior discretization of the data, (ii) introducing a similarity relation between genes, and (iii) proposing minimal sets of necessary and sufficient conditions explaining the classes obtained. Furthermore, we show that these methods can enhance decision making on the harmfulness of agricultural practices in the vast domain of information fusion.
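The core operation behind interval pattern structures, the similarity (meet) of two numerical patterns as a componentwise convex hull, can be sketched as follows. The expression values are toy numbers; the thesis computes such meets lattice-wide and without prior discretization.

```python
def meet(p1, p2):
    """Similarity (meet) of two interval patterns: componentwise convex hull."""
    return [(min(a1, a2), max(b1, b2)) for (a1, b1), (a2, b2) in zip(p1, p2)]

# Expression of two genes across 3 biological situations (toy values),
# each single value encoded as a degenerate interval.
g1 = [(2.0, 2.0), (5.1, 5.1), (0.3, 0.3)]
g2 = [(2.4, 2.4), (4.9, 4.9), (0.1, 0.1)]

pattern = meet(g1, g2)
print(pattern)  # [(2.0, 2.4), (4.9, 5.1), (0.1, 0.3)]

def max_width(pattern):
    """Widest interval: small widths mean similar behaviour across situations."""
    return max(b - a for a, b in pattern)

print(max_width(pattern))
```

A concept in the resulting lattice pairs such an interval pattern with the set of all genes whose profiles fall inside it, which is how classes of genes with similar behaviour emerge without discretizing the data first.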
138

"Visualizações temporais em uma plataforma de software extensível e adaptável" / "Temporal visualizations in an extensible and adaptable software platform"

Shimabukuro, Milton Hirokazu 05 July 2004 (has links)
Repositórios com volumes de dados cada vez maiores foram viabilizados pelo desenvolvimento tecnológico, criando importantes fontes de informação em diversas áreas da atividade humana. Esses repositórios freqüentemente incluem informação sobre o comportamento temporal e o posicionamento espacial dos itens neles representados, os quais são extremamente relevantes para a análise dos dados. O processo de descoberta de conhecimento a partir de grandes volumes de dados tem sido objeto de estudo em diversas disciplinas, dentre elas a Visualização de Informação, cujas técnicas podem apoiar diversas etapas desse processo. Esta tese versa sobre o uso da Visualização Exploratória em conjuntos de dados com atributos temporais e espaciais, empregando a estratégia de múltiplas visualizações coordenadas para apoiar o tratamento de dados em estágios iniciais de processos de descoberta de conhecimento. São propostas duas novas representações visuais temporais – denominadas ‘Variação Temporal Uni-escala’ e ‘Variação Temporal Multi-escala’ – para apoiar a análise exploratória de dados temporais. Adicionalmente, é proposto um modelo de arquitetura de software – AdaptaVis, que permite a integração dessas e outras representações visuais em uma plataforma de visualização de informação flexível, extensível e adaptável às necessidades de diferentes usuários, tarefas e domínios de aplicação – a plataforma InfoVis. Sessões de uso realizadas com dados e usuários reais dos domínios de Climatologia e Negócios permitiram validar empiricamente as representações visuais e o modelo. O modelo AdaptaVis e a plataforma InfoVis estabelecem bases para a continuidade de diversas pesquisas em Visualização de Informação, particularmente o estudo de aspectos relacionados ao uso coordenado de múltiplas visualizações, à modelagem do processo de coordenação, e à integração entre múltiplas técnicas visuais e analíticas. 
/ Data repositories with ever increasing volumes have been made possible by the evolution in data collection technologies, creating important sources of information in several fields of human activity. Such data repositories often include information about both the temporal behavior and the spatial positioning of data items that will be relevant in future data analysis tasks. The process of discovering knowledge embedded in great volumes of data is a topic of study in several disciplines, including Information Visualization, which offers a range of techniques to support different stages of a discovery process. This thesis addresses the application of Exploratory Visualization techniques on datasets with temporal and spatial attributes, using the strategy of coordinating multiple data views, to assist data treatment on early stages of knowledge discovery processes. Two temporal visual representations are proposed – ‘Uni-scale Temporal Behavior’ and ‘Multi-scale Temporal Behavior’ – that support the exploratory analysis of temporal data. Moreover, a software architecture model is introduced – AdaptaVis, that allows the integration of these and other visualization techniques into a flexible, extensible and adaptable information visualization platform – called InfoVis – that may be tailored to meet the requirements of different users, tasks and application domains. Sessions conducted with real data and users from the Climatology and Business application domains allowed an empirical validation of both the visual representations and the model. The AdaptaVis model and the InfoVis platform establish the basis for further research on issues related to the coordinated use of multiple data views, the modeling of the coordination process and the integration amongst multiple visual and analytical techniques.
139

Análise de agrupamentos baseada na topologia dos dados e em mapas auto-organizáveis. / Data clustering based on data topology and self organizing-maps.

Boscarioli, Clodis 16 May 2008 (has links)
Cada vez mais, na conjuntura das grandes tomadas de decisões, a análise de dados massivamente armazenados se torna uma necessidade das mais variadas áreas de conhecimento. A análise de dados envolve a realização de diferentes tarefas, que podem ser realizadas por diferentes técnicas e estratégias como análise de agrupamento de dados. Esta pesquisa enfatiza a realização da tarefa de análise de agrupamento de dados (Data Clustering) usando SOM (Self-Organizing Maps) como principal artefato. SOM é uma rede neural artificial baseada em aprendizado competitivo e não-supervisionado, o que significa que o treinamento é inteiramente guiado pelos dados e que os neurônios do mapa competem entre si. Essa rede neural possui a habilidade de formar mapeamentos que quantizam os dados, preservando a sua topologia. Este trabalho introduz uma nova metodologia de análise de agrupamentos a partir de SOM, que considera o mapa topológico gerado por ele e a topologia dos dados no processo de agrupamento. Uma análise experimental e comparativa é apresentada, evidenciando a potencialidade da proposta, destacando, por fim, as principais contribuições do trabalho. / More than ever, in the context of large-scale decision making, the analysis of massively stored data has become a necessity in almost all knowledge areas. Data analysis involves different tasks, which can be performed with different techniques and strategies, such as data clustering. This research focuses on the data clustering task, using Self-Organizing Maps (SOM) as the principal artifact. SOM is an artificial neural network based on competitive, unsupervised learning, which means that training is entirely driven by the data and that the neurons of the map compete with one another. This neural network has the ability to form mappings that quantize the source data while preserving its topology. This work introduces a new SOM-based clustering analysis methodology that considers both the topological map produced by the SOM and the topology of the data in the clustering process. An experimental and comparative analysis is presented, demonstrating the potential of the proposal and highlighting the main contributions of the work.
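The competitive, unsupervised learning that a SOM performs can be sketched as a minimal one-dimensional map. The data and hyperparameters below are toy choices; a production SOM would typically use a 2-D grid and principled initialization.

```python
import math
import random

def train_som(data, width, epochs=50, lr0=0.5, seed=0):
    """Minimal 1-D SOM: competitive learning with a shrinking neighbourhood."""
    rng = random.Random(seed)
    # Initialise `width` neurons with random 2-D weight vectors in [0, 1).
    weights = [[rng.random(), rng.random()] for _ in range(width)]
    for epoch in range(epochs):
        lr = lr0 * (1 - epoch / epochs)                 # decaying learning rate
        radius = max(1.0, width / 2 * (1 - epoch / epochs))
        for x in data:
            # Competition: the best-matching unit is the closest neuron.
            bmu = min(range(width),
                      key=lambda i: sum((w - v) ** 2
                                        for w, v in zip(weights[i], x)))
            for i in range(width):
                # Cooperation: neighbours on the 1-D map grid also move,
                # which is what preserves the topology of the data.
                h = math.exp(-((i - bmu) ** 2) / (2 * radius ** 2))
                weights[i] = [w + lr * h * (v - w)
                              for w, v in zip(weights[i], x)]
    return weights

# Two well-separated 2-D clusters; after training, neuron weights migrate
# towards the data while keeping their order on the map.
data = [[0.1, 0.1], [0.15, 0.05], [0.9, 0.9], [0.85, 0.95]]
weights = train_som(data, width=4)
print(weights)
```

The clustering methodology above then operates on the trained map, using both the grid neighbourhood and the topology of the data to decide which neurons belong to the same cluster.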
140

O fenômeno blockchain na perspectiva da estratégia tecnológica: uma análise de conteúdo por meio da descoberta de conhecimento em texto

Fernandes, Marcelo Vighi 27 August 2018 (has links)
A revolução das Tecnologias de Informação e Comunicação (TIC) fez as empresas perceberem a importância da estratégia tecnológica para a sua sobrevivência. Blockchain é uma tecnologia descentralizada de gerenciamento de transações e dados desenvolvida, primeiramente, para a moeda digital bitcoin. O interesse na tecnologia blockchain tem aumentado desde que o termo foi cunhado. Esse interesse fez com que este fenômeno se tornasse, atualmente, um dos principais tópicos de pesquisa e publicação na Web. O objetivo principal deste trabalho é entender de que forma o fenômeno blockchain está impactando na estratégia tecnológica. Para tanto, foi realizado um estudo exploratório utilizando o processo de Descoberta de Conhecimento em Texto (DCT), com a utilização de ferramentas de mineração de textos, de forma a coletar e analisar o conteúdo de um conjunto de notícias publicadas na Web sobre a tecnologia blockchain. Foram extraídas 2.605 notícias da Web sobre blockchain, publicadas entre os anos 2015 e 2017, no idioma inglês. Como resultado do estudo, foram geradas 6 proposições, mostrando que este fenômeno está impactando a estratégia tecnológica da indústria financeira direcionando o foco deste setor para implementação de soluções em arquiteturas descentralizadas. Também foi verificado que o foco estratégico tecnológico das empresas impulsionou o desenvolvimento das tecnologias de blockchain privadas. Identificou-se, também, os benefícios trazidos por esta tecnologia para sistemas de pagamentos entre países, diminuindo os intermediários e melhorando os processos.
Ainda, foi possível mapear que esta tecnologia tem potencial para afetar as transações através de uma plataforma eletrônica comum. Em relação ao grau de maturidade desta tecnologia, foi realizada uma discussão dos achados das análises das notícias com a teoria da difusão da inovação e concluiu-se que esta tecnologia está no limiar entre as categorias de Innovators e Early Adopters. O mapa produzido por esta pesquisa ajudará empresas e profissionais na identificação de oportunidades de direcionamento das suas estratégias tecnológicas para a tecnologia de blockchain. / The Information and Communication Technologies (ICT) revolution made companies realize the importance of technology strategy for their survival. Blockchain is a decentralized transaction and data management technology first developed for the bitcoin digital currency. Interest in blockchain technology has increased since the term was coined, making this phenomenon one of the main topics of research and publication on the Web. The main objective of this work is to understand how the blockchain phenomenon is impacting technology strategy. To do so, an exploratory study was conducted using the Knowledge Discovery in Text (KDT) process, with text mining tools, to collect and analyze the contents of a set of news items published on the Web about blockchain technology. In total, 2,605 blockchain news items were extracted, all published between 2015 and 2017, in English. As a result of the study, 6 propositions were generated, showing that this phenomenon is impacting the technology strategy of the financial industry, directing the focus of this sector towards the implementation of solutions using decentralized architectures. It was also verified that companies' strategic technological focus boosted the development of private blockchain technologies.
Additionally, the benefits brought by this technology to cross-border payment systems were identified, reducing intermediaries and improving processes. It was also possible to establish that this technology has the potential to affect transactions through a common electronic platform. Regarding the degree of maturity of this technology, the findings were discussed against the theory of the diffusion of innovation, and it was concluded that this technology is at the threshold between the Innovators and Early Adopters categories. The map produced by this research will help companies and professionals identify opportunities to target their technology strategies towards blockchain technology.
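The term-counting step of a text mining pipeline such as the one used in the KDT process can be sketched as follows. The three-headline toy corpus stands in for the 2,605 collected news items, and the stopword list is illustrative.

```python
import re
from collections import Counter

# Illustrative mini-corpus standing in for the collected Web news items.
news = [
    "Banks pilot blockchain for cross-border payments to cut intermediaries",
    "Private blockchain platforms gain traction among financial firms",
    "Blockchain pilot reduces settlement time for cross-border payments",
]

STOPWORDS = {"for", "to", "the", "among", "of"}

def term_frequencies(docs):
    """Tokenise, drop stopwords, and count terms across the corpus."""
    counts = Counter()
    for doc in docs:
        tokens = re.findall(r"[a-z-]+", doc.lower())
        counts.update(t for t in tokens if t not in STOPWORDS)
    return counts

print(term_frequencies(news).most_common(3))  # 'blockchain' tops the list
```

Frequent co-occurring terms like these are the raw material from which content-analysis propositions, such as the six generated in the study, are then interpreted.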
