Spelling suggestions: "subject:"[een] LEARNING ALGORITHMS"" "subject:"[enn] LEARNING ALGORITHMS""
21 |
Transfer learning for classification of spatially varying dataJun, Goo 13 December 2010 (has links)
Many real-world datasets have spatial components that provide valuable information about characteristics of the data. In this dissertation, a novel framework for adaptive models that exploit spatial information in data is proposed. The proposed framework is mainly based on development and applications of Gaussian processes.
First, a supervised learning method is proposed for the classification of hyperspectral data with spatially adaptive model parameters. The proposed algorithm models spatially varying means of each spectral band of a given class using a Gaussian process regression model. For a given location, the predictive distribution of a given class is modeled by a multivariate Gaussian distribution with spatially adjusted parameters obtained from the proposed algorithm.
The Gaussian process model is generally regarded as a good tool for interpolation, but not for extrapolation. Moreover, the uncertainty of the predictive distribution increases as the distance from the training instances increases. To overcome this problem, a semi-supervised learning algorithm is presented for the classification of hyperspectral data with spatially adaptive model parameters. This algorithm fits the test data with a spatially adaptive mixture-of-Gaussians model, where the spatially varying parameters of each component are obtained by Gaussian process regressions with soft memberships using the mixture-of-Gaussian-processes model.
The proposed semi-supervised algorithm assumes a transductive setting, where the unlabeled data is considered to be similar to the training data. This is not true in general, however, since one may not know how many classes may existin the unexplored regions. A spatially adaptive nonparametric Bayesian framework is therefore proposed by applying spatially adaptive mechanisms to the mixture model with infinitely many components. In this method, each component in the mixture has spatially adapted parameters estimated by Gaussian process regressions, and spatial correlations between indicator variables are also considered.
In addition to land cover and land use classification applications based on hyperspectral imagery, the Gaussian process-based spatio-temporal model is also applied to predict ground-based aerosol optical depth measurements from satellite multispectral images, and to select the most informative ground-based sites by active learning. In this application, heterogeneous features with spatial and temporal information are incorporated together by employing a set of covariance functions, and it is shown that the spatio-temporal information exploited in this manner substantially improves the regression model.
The conventional meaning of spatial information usually refers to actual spatio-temporal locations in the physical world. In the final chapter of this dissertation, the meaning of spatial information is generalized to the parametrized low-dimensional representation of data in feature space, and a corresponding spatial modeling technique is exploited to develop a nearest-manifold classification algorithm. / text
|
22 |
Meta-learning strategies, implementations, and evaluations for algorithm selection /Köpf, Christian Rudolf. January 1900 (has links)
Thesis (doctorat)--Universität Ulm, 2005. / Includes bibliographical references (p. 227-248).
|
23 |
Αυτοπροσαρμοζόμενος αλγόριθμος για παιχνίδι μερικούς πληροφόρησηςΣώρρος, Νικόλαος 24 October 2012 (has links)
Στη παρούσα διπλωματική εργασία παρουσιάζεται η ανάπτυξη κώδικα σε γλώσσα προγραμματισμού Python με σκοπό να παίζει το παιχνίδι Bluff. Αναλυτικότερα το Bluff ανήκει στη κατηγορία των παιχνιδιών μερικούς πληροφόρησης και εκδοχές του περιλαμβάνουν το στοιχείο της τύχης άρα είναι και στοχαστικό. Στην ίδια κατηγορία παιχνιδιών εντάσεται και το πόκερ στο οποίο διεξάγεται εντονη ερευνητική δραστηριότητα αυτή τη περίοδο. Οι δυσκολίες που παρουσιάζει το εγχείρημα της κατασκευης ενός τέτοιου αλγόριθμου εγκειται στο μεγάλο χώρο καταστασης του παιχνιδιού και στην αδυναμια εφαρμογης της τεχνικής min max λόγω της δομής του παιχνιδιού. Επίσης ενας επιτυχημένος παίχτης bluff θα πρέπει να αναγνωρίζει ποτε ο αντίπαλος μπλοφάρει καθώς και να μπλοφάρει ο ίδιος. Τέλος όπως και στο ποκερ για να γίνεις μετρ στο παιχνίδι θα πρέπει να μεταβάλεις τη στρατηγική σου ανάλογα με τον αντίπαλο, θα πρέπει να εκμεταλεύεσαι τα λάθη του και ταυτόχρονα να μη γίνεσαι προβλέψιμος. Ο κώδικας μας εχει 3 versions. Στη πρώτη version ενας απλος μηχανισμός που στηρίζεται στους κανονες του παιχνιδίου υλοποιείται και εξετάζεται η επιτυχια του. Στη δευτερη εκδοση εισαγουμε το στοιχειο της μπλόφας ενώ στη τρίτη αφου μοντελοποιήσουμε τον αντίπαλο, λαμβάνουμε αποφάσεις με βάση αυτη τη μοντελοποίηση. / This diploma thesis deals with the problem of developing an algorithm that can play the game of Bluff. The programming language that is used is Python. Concretely the game of bluff belongs into the category of partial information games and some variations involve luck which makes it also stochastic. Intense research is conducted in poker which belongs to the same family of games. The main difficulty is the huge state space of these games due to uncertainty and the deficit of the min-max method. In addition a succesfull bluff player must recognize when the opponent is bluffing and must make bluffs on his own. One last thing that this game requires is to have dynamic strategies which means being able to change your strategy according to the opponent in order to maximize your wining by exploiting his errors. The algorithm builded has 3 versions. The first one simulated a beginner that sticks to the rules, makes no bluffs and raises according to probabilities. The second version introduces bluffing. The final version includes opponent modeling and making decision based on that.
|
24 |
Text Classification of Legitimate and Rogue online Privacy Policies : Manual Analysis and a Machine Learning Experimental ApproachRekanar, Kaavya January 2016 (has links)
No description available.
|
25 |
A Boosted-Window EnsembleElahi, Haroon January 2014 (has links)
Context. The problem of obtaining predictions from stream data involves training on the labeled instances and suggesting the class values for the unseen stream instances. The nature of the data-stream environments makes this task complicated. The large number of instances, the possibility of changes in the data distribution, presence of noise and drifting concepts are just some of the factors that add complexity to the problem. Various supervised-learning algorithms have been designed by putting together efficient data-sampling, ensemble-learning, and incremental-learning methods. The performance of the algorithm is dependent on the chosen methods. This leaves an opportunity to design new supervised-learning algorithms by using different combinations of constructing methods. Objectives. This thesis work proposes a fast and accurate supervised-learning algorithm for performing predictions on the data-streams. This algorithm is called as Boosted-Window Ensemble (BWE), which is invented using the mixture-of-experts technique. BWE uses Sliding Window, Online Boosting and incremental-learning for data-sampling, ensemble-learning, and maintaining a consistent state with the current stream data, respectively. In this regard, a sliding window method is introduced. This method uses partial-updates for sliding the window on the data-stream and is called Partially-Updating Sliding Window (PUSW). The investigation is carried out to compare two variants of sliding window and three different ensemble-learning methods for choosing the superior methods. Methods. The thesis uses experimentation approach for evaluating the Boosted-Window Ensemble (BWE). CPU-time and the Prediction accuracy are used as performance indicators, where CPU-time is the execution time in seconds. The benchmark algorithms include: Accuracy-Updated Ensemble1 (AUE1), Accuracy-Updated Ensemble2 (AUE2), and Accuracy-Weighted Ensemble (AWE). The experiments use nine synthetic and five real-world datasets for generating performance estimates. The Asymptotic Friedman test and the Wilcoxon Signed-Rank test are used for hypothesis testing. The Wilcoxon-Nemenyi-McDonald-Thompson test is used for performing post-hoc analysis. Results. The hypothesis testing suggests that: 1) both for the synthetic and real-wrold datasets, the Boosted Window Ensemble (BWE) has significantly lower CPU-time values than two benchmark algorithms (Accuracy-updated Ensemble1 (AUE1) and Accuracy-weighted Ensemble (AWE). 2) BWE returns similar prediction accuracy as AUE1 and AWE for synthetic datasets. 3) BWE returns similar prediction accuracy as the three benchmark algorithms for the real-world datasets. Conclusions. Experimental results demonstrate that the proposed algorithm can be as accurate as the state-of-the-art benchmark algorithms, while obtaining predictions from the stream data. The results further show that the use of Partially-Updating Sliding Window has resulted in lower CPU-time for BWE as compared with the chunk-based sliding window method used in AUE1, AUE2, and AWE.
|
26 |
Evaluation of Machine Learning Algorithms for Classification of Short-Chain Dehydrogenase/Reductase Protein Sequences / Evaluering av Maskininlärningsalgoritmer för Klassificering av Short-Chain Dehydrogenase/Reductase ProteinsekvenserOlandersson, Sandra January 2003 (has links)
The classification of protein sequences is a subfield in the area of Bioinformatics that attracts a substantial interest today. Machine Learning algorithms are here believed to be able to improve the performance of the classification phase. This thesis considers the application of different Machine Learning algorithms to the classification problem of a data set of short-chain dehydrogenases/reductases (SDR) proteins. The classification concerns both the division of the proteins into the two main families, Classic and Extended, and into their different subfamilies. The results of the different algorithms are compared to select the most appropriate algorithm for this particular classification problem. / Klassificeringen av proteinsekvenser är ett område inom Bioinformatik, vilket idag drar till sig ett stort intresse. Maskininlärningsalgoritmer anses här kunna förbättra utförandet av klassificeringsfasen. Denna uppsats rör tillämpandet av olika maskininlärningsalgoritmer för klassificering av ett dataset med short-chain dehydrogenases/reductases (SDR) proteiner. Klassificeringen rör både indelningen av proteinerna i två huvudklasser, Classic och Extended, och deras olika subklasser. Resultaten av de olika algoritmerna jämförs för att välja ut den mest lämpliga algoritmen för detta specifika klassificeringsproblem. / Sandra Olandersson Blåbärsvägen 27 372 38 Ronneby home: 0457-12084
|
27 |
Empirical Evaluation of Machine Learning Algorithms based on EMG, ECG and GSR Data to Classify Emotional StatesPandey, Amare Ketsela Tesfaye and Amrit January 2013 (has links)
The peripheral psychophysiological signals (EMG, ECG and GSR) of 13 participants were recorded in the well planned Cognition and Robotics lab at BTH University and 9 participants data were taken for further processing. Thirty(30) pictures of IAPS were shown to each participant individually as stimuli, and each picture was displayed for five-second intervals. Signal preprocessing, feature extraction and selection, models, datasets formation and data analysis and interpretation were done. The correlation between a combination of EMG, ECG and GSR signal and emotional states were investigated. 2- Dimensional valence-arousal model was used to represent emotional states. Finally, accuracy comparisons among selected machine learning classification algorithms have performed. Context: Psychophysiological measurement is one of the recent and popular ways to identify emotions when using computers or robots. It can be done using peripheral signals: Electromyography (EMG), Electrocardiography (ECG) and Galvanic Skin Response (GSR). The signals from these measurements are considered as reliable signals and can produce the required data. It is further carried out by preprocessing of data, feature selection and classification. Classification of EMG, ECG and GSR data can be conducted with appropriate machine learning algorithms for better accuracy results. Objectives: In this study, we investigate and analyzed with psychophysiological (EMG, ECG and GSR) data to find best classifier algorithm. Our main objective is to classify those data with appropriate machine learning techniques. Classifications of psychophysiological data are useful in emotion recognition. Therefore, our ultimate goal is to provide validated classified psychological measures for the automated adoption of human robot performance. Methods: We conducted a literature review in order to answer RQ1. The sources used are Inspec/ Compendex, IEEE, ACM Digital Library, Google Scholar and Springer Link. This helps us to identify suitable features required for the classification after reading the articles and papers that are peer reviewed as well as lie relevant to the area. Similarly, this helps us to select appropriate machine learning algorithms. We conducted an experiment in order to answer RQ2 and RQ3. A pilot experiment, then after main experiment was conducted in the Cognition and Robotics lab at the university. An experiment was conducted to take measures from EMG, ECG and GSR signal. Results: We obtained different accuracy results using different sets of datasets. The classification accuracy result was best given by the Support Vector Machine algorithm, which gives up to 59% classified emotional states correctly. Conclusions: The psychophysiological signals are very inconsistent with individual participant for specific emotion. Hence, the result we got from the experiment was higher with a single participant than all participants were together. Although, having large number of instances are good to train the classifier well. / The thesis is focused to classify emotional states from physiological signals. Features extraction and selection of the physiological signal was done, which was used for dataset formation and then classification of those emotional states. IAPS pictures were used to elicit emotional/affective states. Experiment was conducted with 13 participants in cognition and Robotics lab using biosensors EMG, ECG and GSR at BTH University. Nine participants data were taken for further preprocessing. We observed in our thesis the classification of emotions which could be analyzed by a combination of psychophysiological signal as Model A and Model B. Since signals of subjects are different for same emotional state, the accuracy was better for single participant than all participants together. Classification of emotional states is useful for HCI and HRI to manufacture emotional intelligence robot. So, it is essential to provide best classifier algorithms which can be helpful to detect emotions for developing emotional intelligence robots. Our work contribution lies in providing best algorithms for emotion recognition for psychophysiological data and selected features. Most of the results showed that SVM performed best with classification accuracy up to 59 % for single participant and 48.05 % for all participants together. For a single dataset and single participant, we found 60.17 % accuracy from MLP but it consumed more time and memory than other algorithms during classification. The rest of the algorithms like BNT, Naive Bayes, KNN and J48 also gave competitive accuracy to SVM. We conclude that SVM algorithm for emotion recognition from a combination of EMG, ECG and GSR is capable of handling and giving better classification accuracy among others. Tally between IAPS pictures with SAM helped to remove less correlated signals and to obtain better accuracies. Still the obtained results are small in percentage. Therefore, more participants are probably needed to get a better accuracy result over the whole dataset. / amarehenry@gmail.com ; Mobile: 0767042234 amrit.pandey111@gmail.com ; Mobile : 0704763190
|
28 |
Designing machine learning ensembles : a game coalition approachAlzubi, Omar A. January 2013 (has links)
No description available.
|
29 |
Nanoscale modeling of materials: post deposition morphological evolution of fcc metal surfacesKarim, Altaf January 1900 (has links)
Doctor of Philosophy / Department of Physics / Talat S. Rahman / This dissertation is an extensive study of several issues related to post deposition morphological evolution of fcc metal surfaces. These studies were carried out by probing the energetics and the dynamics of underlying atomistic mechanisms responsible for surface diffusion. An important aspect is the determination of relative probability of competing atomistic mechanisms and their contribution to controlling shapes and step edge patterns of nano structures on surfaces. In this scenario, the descent of adatoms from Ag islands on Ag(111) surface is examined. It shows an exchange mechanism to dominate over hopping and the process to favor the formation of (100)-microfacetted steps (A-type) over the (111)-microfacetted ones (B-type). Molecular dynamics simulations support these results at low temperature while at high temperature B-type step formation dominates. This change in the trend could happen if these processes leading to the formation of the A and B type steps have different values of their diffusion prefactors. This difference is confirmed on the basis of our calculations of the diffusion coefficients. Further, to understand the macroscopic properties of a system on the basis of its atomic scale information, spatial and temporal fluctuations of step edges on vicinal Cu(1 1 13) and Cu(1 1 19) surfaces is studied using kinetic Monte Carlo (KMC) simulations. These results show excellent agreement with experimental data, highlighting the role of mass transport along step edges, and also showing the validity of tools like KMC which aims at bridging the gap in length and time scales at which a range of interesting phenomena take place. To facilitate unbiased modeling of material properties, a novel way of performing KMC simulations is presented. In this approach the lists of diffusion processes are automatically collected during the simulation using a saddle-point search method in the potential energy landscape. The speed of the simulations is thus enhanced along with a substantial gain in reliability. Using this method the diffusion and coalescence of two-dimensional Cu and Ag adatom-island on Cu(111) and Ag(111) is studied. Together with input from molecular dynamics simulations, new processes involving the concerted motion of smaller islands are revealed. A significant difference in the scaling of the effective diffusion barriers with island size is observed for the sets of smaller (less than 10 atoms) and larger islands. In particular, the presence of concerted island motion leads to an almost linear increase in the effective diffusion barrier with size, while its absence accounts for strong size-dependent oscillations and anomalous behavior for trimers and heptamers. A crossover from diffusion due to the collective motion of the smaller island to a regime in which the island diffuses through the periphery dominated mass transport (large islands, 19 to 100 atoms) is predicted. For islands containing 19 to 100 atoms the scaling exponent is found to be in good agreement with that found in previous studies.
|
30 |
Robust Distribution-Free Learning Of Logic ExpressionsRajaraman, K 02 1900 (has links) (PDF)
No description available.
|
Page generated in 0.0282 seconds