Performance Analysis of Cache-Aware Multicore Parallelization with Application to Optimization Theory

In previous work, a cache-aware sparse matrix multiplication for linear programming interior point methods was proposed. Its serial implementations achieved speedups ranging from 1.2 to 108.0 over the implementation in GLPK, an open-source linear programming solver. In this work, the same ideas and data structures are used to develop a cache-aware version of the sparse Cholesky decomposition implemented in GLPK. The serial implementation achieves a speedup of up to 2.5 on the problem set considered. The matrix multiplication and Cholesky decomposition are analysed using performance counters on both an AMD-based and an Intel-based system. The analysis shows that the applied blocking techniques reduce the number of floating point operations performed and that, for some problems, this reduction contributes more to the speedup than the improved cache utilization.

ntnudaim:7758

MTDT Computer Science

Complex Computer Systems

Identifier	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:ntnu-18991
Date	January 2012
Creators	Stensen, Kristoffer
Publisher	Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap
Source Sets	DiVA Archive at Uppsala University
Language	English
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/pdf
Rights	info:eu-repo/semantics/openAccess
