1

Forward-Selection-Based Feature Selection for Genre Analysis and Recognition of Popular Music

Chen, Wei-Yu 09 September 2012
In this thesis, a genre recognition approach for Japanese popular music using an SVM (support vector machine) with forward feature selection is proposed. First, various common acoustic features are extracted from the digital signal of popular music songs, including sub-bands, energy, rhythm, tempo, and formants. The set of features most appropriate for genre identification is then selected by the proposed forward feature selection technique. Experiments conducted on a database of 296 Japanese popular music songs demonstrate that the proposed algorithm achieves a recognition accuracy of approximately 78.81%, and that the accuracy remains stable as the number of test songs increases.
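
The abstract does not include code, but the greedy forward-selection loop it describes can be sketched as below. This is a minimal illustration built on scikit-learn; the feature matrix X, labels y, 5-fold scoring, and the stopping rule are assumptions, not the author's actual setup.

```python
# Minimal sketch of greedy forward feature selection wrapped around an SVM.
# X: (n_songs, n_features) acoustic feature matrix; y: genre labels.
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

def forward_select(X, y, max_features=10):
    remaining = list(range(X.shape[1]))
    selected = []
    best_score = 0.0
    while remaining and len(selected) < max_features:
        # Try adding each remaining feature and keep the best performer.
        scores = [
            (cross_val_score(SVC(kernel="rbf"), X[:, selected + [f]], y, cv=5).mean(), f)
            for f in remaining
        ]
        score, feat = max(scores)
        if score <= best_score:
            break  # no candidate improves cross-validated accuracy; stop
        best_score = score
        selected.append(feat)
        remaining.remove(feat)
    return selected, best_score
```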
2

Cross Site Product Page Classification with Supervised Machine Learning / Webbsideöverskridande klassificering av produktsidor med övervakad maskininlärning

Huss, Jakob January 2016
This work outlines a technique for identifying webpages that contain product specifications. Using support vector machines, a product web page classifier was constructed and tested with various settings. The final classifier achieved a precision of 0.958 and a recall of 0.796 for product pages. These scores imply that the method could be considered a valid technique for real-world web classification tasks if additional features and more data were made available.
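
As a sketch of the kind of pipeline the abstract describes, the snippet below trains an SVM page classifier and reports precision and recall. TF-IDF text features and a linear SVM are illustrative assumptions here, not the thesis's exact feature set or settings.

```python
# Minimal sketch of a supervised product-page classifier with an SVM.
# pages: list of page texts; labels: 1 = product page, 0 = other.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC
from sklearn.model_selection import train_test_split
from sklearn.metrics import precision_score, recall_score

def evaluate(pages, labels):
    X_train, X_test, y_train, y_test = train_test_split(
        pages, labels, test_size=0.2, random_state=0)
    vec = TfidfVectorizer()
    clf = LinearSVC().fit(vec.fit_transform(X_train), y_train)
    pred = clf.predict(vec.transform(X_test))
    # Precision and recall for the positive (product page) class.
    return precision_score(y_test, pred), recall_score(y_test, pred)
```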
3

Clustering System and Clustering Support Vector Machine for Local Protein Structure Prediction

Zhong, Wei 02 August 2006
Protein tertiary structure plays a very important role in determining a protein's possible functional sites and its chemical interactions with other related proteins. Experimental methods to determine protein structure are time consuming and expensive. As a result, the gap between the number of known protein sequences and the number of solved structures has widened substantially with the advent of high-throughput sequencing techniques. The limitations of experimental methods motivate us to develop computational algorithms for protein structure prediction. In this work, a clustering system is used to predict local protein structure. First, recurring sequence clusters are explored with an improved K-means clustering algorithm, and carefully constructed sequence clusters are used to predict local protein structure. After obtaining the sequence clusters and motifs, we study how the sequence variation of a cluster influences its structural similarity. Analysis of this relationship shows that sequence clusters with tight sequence variation have high structural similarity, while clusters with wide sequence variation have poor structural similarity. Based on this knowledge, the established clustering system is used to predict the tertiary structure of local sequence segments. Test results indicate that the highest-quality clusters give highly reliable predictions and that high-quality clusters give reliable predictions. To improve the performance of the clustering system for local protein structure prediction, a novel computational model called Clustering Support Vector Machines (CSVMs) is proposed. In our previous work, the sequence-to-structure relationship was explored with the conventional K-means algorithm, which may not capture the nonlinear sequence-to-structure relationship effectively. We therefore consider using the Support Vector Machine (SVM) to capture this nonlinear relationship. However, SVMs do not scale well to huge datasets containing millions of samples, which motivates the CSVM model: taking advantage of both the theory of granular computing and advanced statistical learning methodology, a CSVM is built specifically for each information granule partitioned intelligently by the clustering algorithm. Compared with the clustering system introduced previously, our experimental results show that the accuracy of local structure prediction improves noticeably when CSVMs are applied.
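
The CSVM idea lends itself to a short sketch: partition the data with K-means, then train one SVM per cluster ("information granule"). The code below is an illustration, not the thesis's implementation; the sequence encodings X and local-structure labels y are assumed to be precomputed.

```python
# Minimal sketch of Clustering Support Vector Machines (CSVMs):
# K-means partitions the data, then one SVM is trained per granule.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVC

class ClusteringSVM:
    def __init__(self, n_clusters=50):
        self.kmeans = KMeans(n_clusters=n_clusters, n_init=10)
        self.models = {}

    def fit(self, X, y):
        granules = self.kmeans.fit_predict(X)
        for g in np.unique(granules):
            idx = granules == g
            if len(np.unique(y[idx])) == 1:
                self.models[g] = y[idx][0]  # single-class granule: constant label
            else:
                self.models[g] = SVC(kernel="rbf").fit(X[idx], y[idx])
        return self

    def predict(self, X):
        granules = self.kmeans.predict(X)
        out = []
        for g, x in zip(granules, X):
            m = self.models[g]
            out.append(m if not hasattr(m, "predict")
                       else m.predict(x.reshape(1, -1))[0])
        return np.array(out)
```

Training one small SVM per granule is what makes the approach tractable for datasets with millions of samples, since no single SVM ever sees the full training set.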
4

Hodnocení viability kardiomyocytů / Evaluation of viability of cardiomyocytes

Kremličková, Lenka January 2017
The aim of this diploma thesis is to become acquainted with the properties of the image data and the principle of their acquisition, to survey the literature on image segmentation methods in cardiac tissue imaging, and, last but not least, to find methods for classifying dead cardiomyocytes and analyzing their viability. Dead cardiomyocytes were analyzed for their shape and their similarity to a template created as the mean of dead cells. Another approach applied a method based on local binary patterns and the computation of features from simple and joint histograms.
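
A minimal sketch of the local-binary-pattern histogram features mentioned above, assuming grayscale cell images and using scikit-image's LBP operator; this is illustrative, not the thesis's code, and the parameter choices are assumptions.

```python
# Minimal sketch: LBP histogram as a feature vector for a cell image.
import numpy as np
from skimage.feature import local_binary_pattern

def lbp_histogram(image, P=8, R=1.0):
    # image: 2-D grayscale array; P neighbors at radius R.
    lbp = local_binary_pattern(image, P, R, method="uniform")
    # "uniform" LBP yields P + 2 distinct codes; histogram them as features.
    hist, _ = np.histogram(lbp, bins=P + 2, range=(0, P + 2), density=True)
    return hist
```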
5

Determining fixation stability of AMD patients using predictive eye estimation regression

Adelore, Temilade Adediwura 20 August 2008
Patients with macular degeneration (MD) often fixate with a preferred retinal locus (PRL). Eye movements made while fixating with the PRL (in MD patients) have been observed to be maladaptive compared to those made while fixating with the fovea (in normally sighted individuals). For example, in MD patients, PRL eye movements negatively affect fixation stability and re-fixation precision, making reading difficult and limiting the execution of other everyday activities. Abnormal eye movements from the PRL also affect research on the physiological adaptations to MD. Specifically, previous research on cortical reorganization using functional magnetic resonance imaging (fMRI) indicates a critical need to accurately determine an MD patient's point of gaze in order to better infer the existence of cortical reorganization. Unfortunately, standard MR-compatible hardware eye-tracking systems do not work well with these patients: their reduced fixation stability often overwhelms the tracking algorithms used by these systems. This research investigates the use of an existing magnetic resonance imaging (MRI) based technique called Predictive Eye Estimation Regression (PEER) to determine the point of gaze of MD patients and thus control for fixation instability. PEER makes use of the fluctuations in the MR signal caused by eye movements to identify the position of gaze. Engineering adaptations such as temporal resolution and brain coverage were applied to tailor PEER to MD patients. Participants were also evaluated on different fixation protocols, and the results were compared to those of the MP-1 micro-perimeter to test the efficacy of PEER. The fixation stability results obtained from PEER were similar to those obtained from the eye-tracking results of the MP-1. However, PEER's point-of-gaze estimates differed from the MP-1's in the fixation tests, and it cannot be concluded that this difference is specific to PEER. To resolve this issue, running an eye tracker in the scanner concurrently with PEER could provide further evidence of PEER's reliability. In addition, increasing the diversity of AMD patients in terms of scotoma types would help provide a better estimate of PEER's flexibility and robustness.
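
PEER's core idea, regressing gaze position from per-volume MR signal fluctuations, can be sketched roughly as below. This is a loose illustration only: the abstract does not specify PEER's preprocessing, calibration protocol, or regression model, so the ridge regression, array shapes, and function names here are all assumptions.

```python
# Rough sketch: learn a mapping from per-volume MR signal patterns to known
# calibration gaze positions, then estimate gaze for new volumes.
import numpy as np
from sklearn.linear_model import Ridge

def fit_peer(calib_volumes, calib_gaze):
    # calib_volumes: (n_volumes, n_voxels) flattened MR signal per volume
    # calib_gaze:    (n_volumes, 2) known (x, y) fixation targets
    model_x = Ridge(alpha=1.0).fit(calib_volumes, calib_gaze[:, 0])
    model_y = Ridge(alpha=1.0).fit(calib_volumes, calib_gaze[:, 1])
    return model_x, model_y

def estimate_gaze(models, volumes):
    model_x, model_y = models
    return np.column_stack([model_x.predict(volumes), model_y.predict(volumes)])
```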
6

A Semantic Triplet Based Story Classifier

January 2013
Text classification, in the artificial intelligence domain, is an activity in which text documents are automatically classified into predefined categories using machine learning techniques. An example is classifying uncategorized news articles into predefined categories such as "Business", "Politics", "Education", "Technology", etc. In this thesis, a supervised machine learning approach is followed, in which a module is first trained with pre-classified training data and the class of test data is then predicted. Good feature extraction is an important step in the machine learning approach, and hence the main component of this text classifier is semantic-triplet-based features, in addition to traditional features such as standard keyword-based features and statistical features based on shallow parsing (such as the density of POS tags and named entities). A triplet {Subject, Verb, Object} in a sentence is defined as a relation between subject and object, the relation being the predicate (verb). Triplet extraction is a five-step process that takes as input a corpus of web text documents, each consisting of one or more paragraphs, ranging from RSS feeds to lists of extremist websites. The input corpus feeds into the "Pronoun Resolution" step, which uses a heuristic approach to identify the noun phrases referenced by pronouns. The next step, "SRL Parser", is a shallow semantic parser that converts the pronoun-resolved paragraphs into an annotated predicate-argument format. The output of the SRL parser is processed by the "Triplet Extractor" algorithm, which forms triplets of the form {Subject, Verb, Object}. Generalization and reduction of triplet features is the next step; the reduced feature representation lowers computing time, yields better discriminatory behavior, and mitigates the curse-of-dimensionality phenomenon. For training and testing, a ten-fold cross-validation approach is followed: in each round an SVM classifier is trained with 90% of the labeled (training) data, and in the testing phase the classes of the remaining 10% (testing) data are predicted. In conclusion, this thesis proposes a model with semantic-triplet-based features for story classification. The effectiveness of the model is demonstrated against other traditional features used in the literature for text classification tasks. / Dissertation/Thesis / M.S. Computer Science 2013
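
The evaluation setup described above (an SVM over triplet-derived features, scored by ten-fold cross-validation) can be sketched as follows. The bag-of-triplets encoding is an illustrative assumption, and the upstream pronoun resolution and SRL parsing are taken as already done.

```python
# Minimal sketch: ten-fold cross-validation of an SVM over triplet features.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.svm import LinearSVC
from sklearn.model_selection import cross_val_score

def evaluate_triplet_classifier(triplet_strings, labels):
    # triplet_strings: one "subject verb object ..." string per document,
    # produced by the upstream triplet-extraction pipeline.
    X = CountVectorizer().fit_transform(triplet_strings)
    # cv=10: each round trains on 90% of labeled data, tests on the held-out 10%.
    return cross_val_score(LinearSVC(), X, labels, cv=10).mean()
```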
7

Improved in silico methods for target deconvolution in phenotypic screens

Mervin, Lewis January 2018
Target-based screening projects for bioactive (orphan) compounds have in many cases been shown to be insufficiently predictive of in vivo efficacy, leading to attrition in clinical trials. Partly for this reason, phenotypic screening has undergone a renaissance in both academia and the pharmaceutical industry. One key shortcoming of this paradigm shift is that the protein targets modulated must subsequently be elucidated, which is often a costly and time-consuming procedure. In this work, we have explored both improved methods and real-world case studies of how computational methods can help in the target elucidation of phenotypic screens. One limitation of previous methods has been the ability to assess the applicability domain of the models, that is, when the assumptions made by a model are fulfilled and for which input chemicals the models are reliably appropriate. Hence, a major focus of this work was to explore methods for calibrating machine learning algorithms using Platt scaling, isotonic regression scaling and Venn-Abers predictors, since the probabilities from well-calibrated classifiers can be interpreted at a confidence level and predictions specified at an acceptable error rate. Additionally, many current protocols only offer probabilities for affinity, so another key area for development was to expand the target prediction models with functional prediction (activation or inhibition). This extra level of annotation is important since the activation or inhibition of a target may positively or negatively impact the phenotypic response in a biological system. Furthermore, many existing methods do not utilize the wealth of bioactivity information held for orthologue species. We therefore also focused on an in-depth analysis of orthologue bioactivity data and its relevance and applicability towards expanding compound and target bioactivity space for predictive studies. The realized protocol was trained with 13,918,879 compound-target pairs, comprises 1,651 targets, and has been made available for public use on GitHub. The methodology was then applied to aid the target deconvolution of AstraZeneca phenotypic readouts, in particular for the rationalization of cytotoxicity and cytostaticity in the High-Throughput Screening (HTS) collection. Results from this work highlighted which targets are frequently linked to the cytotoxicity and cytostaticity of chemical structures, and provided insight into which compounds to select or remove from the collection for future screening projects. Overall, this project has furthered the field of in silico target deconvolution by improving the performance and applicability of current protocols and by rationalizing cytotoxicity, which has been shown to influence attrition in clinical trials.
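
As a sketch of the calibration step discussed above, the snippet below applies Platt scaling to a target-prediction classifier via scikit-learn. The random-forest base model and fold count are illustrative assumptions; isotonic regression is the method="isotonic" variant, while Venn-Abers predictors are not part of scikit-learn and are omitted here.

```python
# Minimal sketch: calibrate a classifier so its probabilities can be
# thresholded at a chosen confidence level / acceptable error rate.
from sklearn.calibration import CalibratedClassifierCV
from sklearn.ensemble import RandomForestClassifier

def calibrated_target_model(X, y):
    base = RandomForestClassifier(n_estimators=100)
    # method="sigmoid" = Platt scaling, fitted on internal cross-validation folds.
    model = CalibratedClassifierCV(base, method="sigmoid", cv=5).fit(X, y)
    return model  # model.predict_proba(X_new) yields calibrated probabilities
```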
