This thesis introduces FlowGSP, a general-purpose sequence mining algorithm for flow graphs.
FlowGSP ranks sequences according to the frequency with which they occur and according to their
relative cost. This thesis also presents two parallel implementations of FlowGSP. The first implementation uses JavaTM threads and is designed for use on workstations equipped with multi-core
CPUs. The second implementation is distributed in nature and intended for use on clusters.
The thesis also presents results from an application of FlowGSP to mine program profiles in
the context of the development of a dynamic optimizing compiler. Interpreting patterns within raw
profiling data is extremely difficult and heavily reliant on human intuition.
FlowGSP has been tested on performance-counter profiles collected from the IBM WebSphere
Application Server. This investigation identifies a number of sequences which are known to be typical of WebSphere Application Server behavior, as well as some sequences which were previously
unknown.
Identifer | oai:union.ndltd.org:LACETR/oai:collectionscanada.gc.ca:AEU.10048/749 |
Date | 11 1900 |
Creators | Jocksch, Adam |
Contributors | Amaral, Jose Nelson (Computing Science), Gaudet, Vincent (Electical and Computer Engineering), Sander, Joerg (Computing Science) |
Source Sets | Library and Archives Canada ETDs Repository / Centre d'archives des thèses électroniques de Bibliothèque et Archives Canada |
Language | en_US |
Detected Language | English |
Type | Thesis |
Format | 393137 bytes, application/pdf |
Page generated in 0.0574 seconds