Spelling suggestions: "subject:"contourmatching"" "subject:"formatching""
1 |
The Use of bioinformatics techniques to perform time-series trend matching and predictionTransell, Mark Marriott January 2012 (has links)
Process operators often have process faults and alarms due to recurring failures on process
equipment. It is also the case that some processes do not have enough input information
or process models to use conventional modelling or machine learning techniques for early
fault detection.
A proof of concept for online streaming prediction software based on matching process
behaviour to historical motifs has been developed, making use of the Basic Local
Alignment Search Tool (BLAST) used in the Bioinformatics field. Execution times of as
low as 1 second have been recorded, demonstrating that online matching is feasible.
Three techniques have been tested and compared in terms of their computational
effciency, robustness and selectivity, with results shown in Table 1:
• Symbolic Aggregate Approximation combined with PSI-BLAST
• Naive Triangular Representation with PSI-BLAST
• Dynamic Time Warping
Table 1: Properties of different motif-matching methods
Property SAX-PSIBLAST TER-PSIBLAST DTW
Noise tolerance (Selectivity) Acceptable Inconclusive Good
Vertical Shift tolerance None Perfect Poor
Matching speed Acceptable Acceptable Fast
Match speed scaling O < O(mn) O < O(mn) O(mn)
Dimensionality Reduction Tolerance Good Inconclusive Acceptable
It is recommended that a method using a weighted confidence measure for each technique
be investigated for the purpose of online process event handling and operator alerts. Keywords: SAX, BLAST, motif-matching, Dynamic Time Warping / Dissertation (MEng)--University of Pretoria, 2012. / Chemical Engineering / unrestricted
|
2 |
Pattern Discovery in Protein Structures and Interaction NetworksAhmed, Hazem Radwan A. 21 April 2014 (has links)
Pattern discovery in protein structures is a fundamental task in computational biology, with important applications in protein structure prediction, profiling and alignment. We propose a novel approach for pattern discovery in protein structures using Particle Swarm-based flying windows over potentially promising regions of the search space. Using a heuristic search, based on Particle Swarm Optimization (PSO) is, however, easily trapped in local optima due to the sparse nature of the problem search space. Thus, we introduce a novel fitness-based stagnation detection technique that effectively and efficiently restarts the search process to escape potential local optima.
The proposed fitness-based method significantly outperforms the commonly-used distance-based method when tested on eight classical and advanced (shifted/rotated) benchmark functions, as well as on two other applications for proteomic pattern matching and discovery. The main idea is to make use of the already-calculated fitness values of swarm particles, instead of their pairwise distance values, to predict an imminent stagnation situation. That is, the proposed fitness-based method does not require any computational overhead of repeatedly calculating pairwise distances between all particles at each iteration. Moreover, the fitness-based method is less dependent on the problem search space, compared with the distance-based method.
The proposed pattern discovery algorithms are first applied to protein contact maps, which are the 2D compact representation of protein structures. Then, they are extended to work on actual protein 3D structures and interaction networks, offering a novel and low-cost approach to protein structure classification and interaction prediction. Concerning protein structure classification, the proposed PSO-based approach correctly distinguishes between the positive and negative examples in two protein datasets over 50 trials. As for protein interaction prediction, the proposed approach works effectively on complex, mostly sparse protein interaction networks, and predicts high-confidence protein-protein interactions — validated by more than one computational and experimental source — through knowledge transfer between topologically-similar interaction patterns of close proximity.
Such encouraging results demonstrate that pattern discovery in protein structures and interaction networks are promising new applications of the fast-growing and far-reaching PSO algorithms, which is the main argument of this thesis. / Thesis (Ph.D, Computing) -- Queen's University, 2014-04-21 12:54:03.37
|
Page generated in 0.0682 seconds