Global ETD Search

11	Novel Application of Neutrosophic Logic in Classifiers Evaluated under Region-Based Image Categorization System Ju, Wen 01 May 2011 (has links) Neutrosophic logic is a relatively new logic that is a generalization of fuzzy logic. In this dissertation, for the first time, neutrosophic logic is applied to the field of classifiers where a support vector machine (SVM) is adopted as the example to validate the feasibility and effectiveness of neutrosophic logic. The proposed neutrosophic set is integrated into a reformulated SVM, and the performance of the achieved classifier N-SVM is evaluated under an image categorization system. Image categorization is an important yet challenging research topic in computer vision. In this dissertation, images are first segmented by a hierarchical two-stage self organizing map (HSOM), using color and texture features. A novel approach is proposed to select the training samples of HSOM based on homogeneity properties. A diverse density support vector machine (DD-SVM) framework that extends the multiple-instance learning (MIL) technique is then applied to the image categorization problem by viewing an image as a bag of instances corresponding to the regions obtained from the image segmentation. Using the instance prototype, every bag is mapped to a point in the new bag space, and the categorization is transformed to a classification problem. Then, the proposed N-SVM based on the neutrosophic set is used as the classifier in the new bag space. N-SVM treats samples differently according to the weighting function, and it helps reduce the effects of outliers. Experimental results on a COREL dataset of 1000 general purpose images and a Caltech 101 dataset of 9000 images demonstrate the validity and effectiveness of the proposed method. image categorization neutrosophic logic support vector machine Computer Sciences
12	Improving Multiclass Text Classification with the Support Vector Machine Rennie, Jason D. M., Rifkin, Ryan 16 October 2001 (has links) We compare Naive Bayes and Support Vector Machines on the task of multiclass text classification. Using a variety of approaches to combine the underlying binary classifiers, we find that SVMs substantially outperform Naive Bayes. We present full multiclass results on two well-known text data sets, including the lowest error to date on both data sets. We develop a new indicator of binary performance to show that the SVM's lower multiclass error is a result of its improved binary performance. Furthermore, we demonstrate and explore the surprising result that one-vs-all classification performs favorably compared to other approaches even though it has no error-correcting properties. AI text classification support vector machine multiclass classification
13	Protein Backbone Reconstruction with Tool Preference Classification for Standard and Nonstandard Proteins Wu, Hsin-Fang 11 September 2012 (has links) Given a protein sequence and the C£\ coordinates on its backbone, the all-atom protein backbone reconstruction problem (PBRP) is to reconstruct the backbone by its 3D coordinates of N, C and O atoms. In the past few decades, many methods have been proposed for solving PBRP, such as ab initio, homology modeling, SABBAC, Wang¡¦s method, Chang¡¦s method, BBQ (Backbone Building from Quadrilaterals) and Chen¡¦s method. Chen found that, if they can choose the correct prediction tool to build the 3D coordinates of the desired atoms, the RMSD may be improved. In this thesis, we propose a method for solving PBRP based on Chen¡¦s method. We use tool preference classification on each atom of the residue, where the classification model is generated by SVM (Support Vector Machine). We rebuild the backbone by combing the prediction results of all atoms in all residues. The data sets used in our experiments are CASP7, CASP8 and CASP9, which have 65, 52 and 63 proteins, respectively. These data sets contain nonstandard amino acids as well as standard ones. We improve the average RMSDs of Chen¡¦s results in some cases. The average RMSDs of our method are 0.3496 in CASP7, 0.3084 in CASP8 and 0.3286 in CASP9. backbone bioinformatics standard protein support vector machine feature set
14	Accuracy Improvement for RNA Secondary Structure Prediction with SVM Chang, Chia-Hung 30 July 2008 (has links) Ribonucleic acid (RNA) sometimes occurs in a complex structure called pseudoknots. Prediction of RNA secondary structures has drawn much attention from both biologists and computer scientists. Consequently, many useful tools have been developed for RNA secondary structure prediction, with or without pseudoknots. These tools have their individual strength and weakness. As a result, we propose a hybrid feature extraction method which integrates two prediction tools pknotsRG and NUPACK with a support vector machine (SVM). We first extract some useful features from the target RNA sequence, and then decide its prediction tool preference with SVM classification. Our test data set contains 723 RNA sequences, where 202 pseudoknotted RNA sequences are obtained from PseudoBase, and 521 nested RNA sequences are obtained from RNA SSTRAND. Experimental results show that our method improves not only the overall accuracy but also the sensitivity and the selectivity of the target sequences. Our method serves as a preprocessing process in analyzing RNA sequences before employing the RNA secondary structure prediction tools. The ability to combine the existing methods and make the prediction tools more accurate is our main contribution. RNA secondary structure support vector machine machine learning classification
15	Characterizing The Distinguishability Of Microbial Genomes Perry, Scott 21 April 2010 (has links) The field of metagenomics has shown great promise in the ability to recover microbial DNA from communities whose members resist traditional cultivation techniques, although in most instances the recovered material comprises short anonymous genomic fragments rather than complete genome sequences. In order to effectively assess the microbial diversity and ecology represented in such samples, accurate methods for DNA classification capable of assigning metagenomic fragments into their most likely taxonomic unit are required. Existing DNA classification methods have shown high levels of accuracy in attempting to classify sequences derived from low-complexity communities, however genome distinguishability generally deteriorates for complex communities or those containing closely related organisms. The goal of this thesis was to identify factors both intrinsic or external to the genome that may lead to the improvement of existing DNA classification methods and to probe the fundamental limitations of composition-based genome distinguishability. To assess the suite of factors affecting the distinguishability of genomes, support vector machine classifiers were trained to discriminate between pairs of microbial genomes using the relative frequencies of oligonucleotide patterns calculated from orthologous genes or short genomic fragments, and the resulting classification accuracy scores used as the measure of genomic distinguishability. Models were generated in order to relate distinguishability to several measures of genomic and taxonomic similarity, and interesting outlier genome pairs were identified by large residuals to the fitted models. Examination of the outlier pairs identified numerous factors that influence genome distinguishability, including genome reduction, extreme G+C composition, lateral gene transfer, and habitat-induced genome convergence. Fragments containing multiple protein-coding and non-coding sequences showed an increased tendency for misclassification, except in cases where the genomes were very closely related. Analysis of the biological function annotations associated with each fragment demonstrated that certain functional role categories showed increased or decreased tendency for misclassification. The use of pre-processing steps including DNA recoding, unsupervised clustering, 'symmetrization' of oligonucleotide frequencies, and correction for G+C content did not improve distinguishability. Existing composition-based DNA classifiers will benefit from the results reported in this thesis. Sequence-segmentation approaches will improve genome distinguishability by decreasing fragment heterogeneity, while factors such as habitat, lifestyle, extreme G+C composition, genome reduction, and biological role annotations may be used to express confidence in the classification of individual fragments. Although genome distinguishability tends to be proportional to genomic and taxonomic relatedness, these trends can be violated for closely related genome pairs that have undergone rapid compositional divergence, or unrelated genome pairs that have converged in composition due to similar habitats or unusual selective pressures. Additionally, there are fundamental limits to the resolution of composition-based classifiers when applied to genomic fragments typical of current metagenomic studies. genome signature genome composition metagenomics support vector machine
16	Predicting homologous signaling pathways using machine learning Bostan, Babak Unknown Date No description available. signaling pathway machine learning support vector machine prediction
17	Predicting homologous signaling pathways using machine learning Bostan, Babak 11 1900 (has links) Understanding biochemical reactions inside cells of individual organisms is a key factor for improving our biological knowledge. Signaling pathways provide a road map for a wide range of these chemical reactions that convert one signal or stimulus into another. In general, each signaling pathway in a cell involves many different proteins, each with one or more specific roles that help to amplify a relatively small stimulus into an effective response. Since proteins are essential components of a cells activities, it is important to understand how they work and in particular, to determine which of species proteins participate in each role. Experimentally determining this mapping of proteins to roles is difficult and time consuming. Fortunately, many individual pathways have been annotated for some species, and the pathways of other species can often be inferred using protein homology and the protein properties. signaling pathway machine learning support vector machine prediction
18	Dynamic task scheduling onto heterogeneous machines using Support Vector Machine Park, Yongwon. Baskiyar, Sanjeev, January 2008 (has links) (PDF) Thesis (M.S.)--Auburn University, 2008. / Abstract. Includes bibliographical references (p. 26-29).
19	Machine learning and brain imaging in psychosis Zarogianni, Eleni January 2016 (has links) Over the past years early detection and intervention in schizophrenia have become a major objective in psychiatry. Early intervention strategies are intended to identify and treat psychosis prior to fulfilling diagnostic criteria for the disorder. To this aim, reliable early diagnostic biomarkers are needed in order to identify a high-risk state for psychosis and also predict transition to frank psychosis in those high-risk individuals destined to develop the disorder. Recently, machine learning methods have been successfully applied in the diagnostic classification of schizophrenia and in predicting transition to psychosis at an individual level based on magnetic resonance imaging (MRI) data and also neurocognitive variables. This work investigates the application of machine learning methods for the early identification of schizophrenia in subjects at high risk for developing the disorder. The dataset used in this work involves data from the Edinburgh High Risk Study (EHRS), which examined individuals at a heightened risk for developing schizophrenia for familial reasons, and the FePsy (Fruherkennung von Psychosen) study that was conducted in Basel and involves subjects at a clinical high-risk state for psychosis. The overriding aim of this thesis was to use machine learning, and specifically Support Vector Machine (SVM), in order to identify predictors of transition to psychosis in high-risk individuals, using baseline structural MRI data. There are three aims pertaining to this main one. (i) Firstly, our aim was to examine the feasibility of distinguishing at baseline those individuals who later developed schizophrenia from those who did not, yet had psychotic symptoms using SVM and baseline data from the EHRS study. (ii) Secondly, we intended to examine if our classification approach could generalize to clinical high-risk cohorts, using neuroanatomical data from the FePsy study. (iii) In a more exploratory context, we have also examined the diagnostic performance of our classifier by pooling the two datasets together. With regards to the first aim, our findings suggest that the early prediction of schizophrenia is feasible using a MRI-based linear SVM classifier operating at the single-subject level. Additionally, we have shown that the combination of baseline neuroanatomical data with measures of neurocognitive functioning and schizotypal cognition can improve predictive performance. The application of our pattern classification approach to baseline structural MRI data from the FePsy study highly replicated our previous findings. Our classification method identified spatially distributed networks that discriminate at baseline between subjects that later developed schizophrenia and other related psychoses and those that did not. Finally, a preliminary classification analysis using pooled datasets from the EHRS and the FePsy study supports the existence of a neuroanatomical pattern that differentiates between groups of high-risk subjects that develop psychosis against those who do not across research sites and despite any between-sites differences. Taken together, our findings suggest that machine learning is capable of distinguishing between cohorts of high risk subjects that later convert to psychosis and those that do not based on patterns of structural abnormalities that are present before disease onset. Our findings have some clinical implications in that machine learning-based approaches could advise or complement clinical decision-making in early intervention strategies in schizophrenia and related psychoses. Future work will be, however, required to tackle issues of reproducibility of early diagnostic biomarkers across research sites, where different assessment criteria and imaging equipment and protocols are used. In addition, future projects may also examine the diagnostic and prognostic value of multimodal neuroimaging data, possibly combined with other clinical, neurocognitive, genetic information. 616.89
20	Concrete Strength Prediction Modeling based on Support Vector Machine (SVM) Dhakal, Santosh 01 December 2015 (has links) Strength of concrete is the major parameter in the design of structures and is represented by the 28-day compressive strength of concrete. Many earlier studies proved that the compressive strength of concrete is not only related to w/c ratio but also rely on proportion of other constituent materials. Application of recently developed new generation admixtures for the production of high performance concrete, has made the concrete strength prediction complex and highly nonlinear challenging the research engineers and data scientists. Development of early accurate prediction model for concrete strength provides the mix designer a tentative idea to proportionate the mix ingredients accordingly reducing the number of trial mixes ultimately saving a lot of cost and time associated with it. In this study, we have proposed SVM regression tool to create the model for the prediction of concrete strength. Support vector machine (SVM) is a supervised machine learning technique based on statistical learning theory developed by Vapnik in 1995. SVM employs a kernel function to transform the data into high dimensional feature space and linear modeling is performed in the feature space to overcome the complexity related to highly nonlinear datasets. A dataset containing 425 observations of high performance concrete mix design with nine attribute variables from University of California, Irvine Repository are considered for this study. 395 datasets were used to train the model and 30 samples were taken as a test set by random sub sampling to test the model. Five-fold cross-validation technique was used to select the parameters of SVM. The metaparameter values ε = 0.001, C = 29.47 and γ = 10 are selected for creating the model. The model performance measures correlation coefficient (R), root mean square error (RMSE) values and residual plots suggest that the proposed SVM model is competent enough to predict the strength of concrete. The performance measures of proposed SVM model was compared with RVM model. Concrete Strength Modeling Prediction measures Support Vector Machine

Search results