Spelling suggestions: "subject:"data mining -- 3research"" "subject:"data mining -- 1research""
1 |
Solar Data AnalysisRay, Mike C. T. 24 July 2013 (has links)
The solar industry has grown considerably in the last few years. This larger scale has introduced more problems as well as possibilities. One of those possibilities is analyzing the data coming from the sites that are now being monitored, and using the information to answer a variety of questions.
We have four questions which are of prime importance identified in this thesis:
1. Can data from customers be trusted?
2. Can we use data from existing sites to determine which sites need the most improvement?
3. Can we implement a location-based algorithm to reduce the amount of false positives for performance, or other alarms?
4. Can we improve upon the current predicted power algorithm?
We find that not only can we answer these questions definitively, but the improvements found are of significant value. Each of these items represents an important question that either directly or indirectly translates into increased revenue and engineering improvements for the solar industry as a whole.
|
2 |
Mining Biomedical Literature to Extract Pharmacokinetic Drug-Drug InteractionsKarnik, Shreyas 03 February 2014 (has links)
Indiana University-Purdue University Indianapolis (IUPUI) / Polypharmacy is a general clinical practice, there is a high chance that multiple administered drugs will interfere with each other, such phenomenon is called drug-drug interaction (DDI). DDI occurs when drugs administered change each other's pharmacokinetic (PK) or pharmacodynamic (PD) response. DDIs in many ways affect the overall effectiveness of the drug or at some times pose a risk of serious side effects to the patients thus, it becomes very challenging to for the successful drug development and clinical patient care. Biomedical literature is rich source for in-vitro and in-vivo DDI reports and there is growing need to automated methods to extract the DDI related information from unstructured text. In this work we present an ontology (PK ontology), which defines annotation guidelines for annotation of PK DDI studies. Using the ontology we have put together a corpora of PK DDI studies, which serves as excellent resource for training machine learning, based DDI extraction algorithms. Finally we demonstrate the use of PK ontology and corpora for extracting PK DDIs from biomedical literature using machine learning algorithms.
|
3 |
Interactive pattern mining of neuroscience dataWaranashiwar, Shruti Dilip 29 January 2014 (has links)
Indiana University-Purdue University Indianapolis (IUPUI) / Text mining is a process of extraction of knowledge from unstructured text documents. We have huge volumes of text documents in digital form. It is impossible to manually extract knowledge from these vast texts. Hence, text mining is used to find useful information from text through the identification and exploration of interesting patterns. The objective of this thesis in text mining area is to find compact but high quality frequent patterns from text documents related to neuroscience field. We try to prove that interactive sampling algorithm is efficient in terms of time when compared with exhaustive methods like FP Growth using RapidMiner tool. Instead of mining all frequent patterns, all of which may not be interesting to user, interactive method to mine only desired and interesting patterns is far better approach in terms of utilization of resources. This is especially observed with large number of keywords. In interactive patterns mining, a user gives feedback on whether a pattern is interesting or not. Using Markov Chain Monte Carlo (MCMC) sampling method, frequent patterns are generated in an interactive way. Thesis discusses extraction of patterns between the keywords related to some of the common disorders in neuroscience in an interactive way. PubMed database and keywords related to schizophrenia and alcoholism are used as inputs. This thesis reveals many associations between the different terms, which are otherwise difficult to understand by reading articles or journals manually. Graphviz tool is used to visualize associations.
|
Page generated in 0.0748 seconds