Exploiting parallelism of irregular problems and performance evaluation on heterogeneous multi-core architectures

dc.contributor.author: Xu, Meilian
dc.contributor.examiningcommittee: Li, Ben (Computer Science); Annakkage, Udaya (Electrical and Computer Engineering); Yang, Laurence (Computer Science, St. Francis Xavier University) [en_US]
dc.contributor.supervisor: Thulasiraman, Parimala (Computer Science) [en_US]
dc.date.accessioned: 2012-10-04T17:24:22Z
dc.date.available: 2012-10-04T17:24:22Z
dc.date.issued: 2012-10-04
dc.degree.discipline: Computer Science [en_US]
dc.degree.level: Doctor of Philosophy (Ph.D.) [en_US]
dc.description.abstract: In this thesis, we design, develop and implement parallel algorithms for irregular problems on heterogeneous multi-core architectures. Irregular problems exhibit random and unpredictable memory access patterns, poor spatial locality and input-dependent control flow. Heterogeneous multi-core processors vary in clock frequency, power dissipation, programming model (MIMD vs. SIMD), memory design and computing units (scalar versus vector). This heterogeneity makes it challenging to design efficient parallel algorithms for irregular problems on such processors. Techniques for mapping tasks or data on traditional parallel computers cannot be used as is on heterogeneous multi-core processors because the hardware varies. To understand how efficiently futuristic heterogeneous multi-core architectures run applications, we study several computation- and bandwidth-oriented irregular problems on one heterogeneous multi-core architecture, the IBM Cell Broadband Engine (Cell BE). The Cell BE consists of a general processor and eight specialized processors, and addresses vector/data-level parallelism and instruction-level parallelism simultaneously. Through these studies on the Cell BE, we provide discussion and insight on the performance of the applications on heterogeneous multi-core architectures. Verifying these experimental results requires performance modeling. Because heterogeneous multi-core architectures are so diverse, theoretical performance models developed for homogeneous multi-core architectures do not provide accurate results. Therefore, in this thesis we propose an analytical performance prediction model that considers the many architectural features of heterogeneous multi-cores (such as DMA transfers, number of instructions and operations, the processor frequency and DMA bandwidth). We show that the execution time from our prediction model is comparable to the execution time of the experimental results for a complex medical imaging application. [en_US]
dc.description.note: February 2013 [en_US]
dc.identifier.uri: http://hdl.handle.net/1993/9236
dc.language.iso: eng [en_US]
dc.rights: open access [en_US]
dc.subject: Heterogeneous multi-core architectures [en_US]
dc.subject: irregular problems [en_US]
dc.subject: IBM Cell Broadband Engine [en_US]
dc.subject: iterative reconstruction technique [en_US]
dc.subject: OS-SART (ordered subset simultaneous algebraic reconstruction technique) [en_US]
dc.subject: Microwave Tomography [en_US]
dc.subject: Performance Prediction Model [en_US]
dc.subject: FFT [en_US]
dc.title: Exploiting parallelism of irregular problems and performance evaluation on heterogeneous multi-core architectures [en_US]
dc.type: doctoral thesis [en_US]
Files
Original bundle
Name: xu_meilian.pdf
Size: 3.31 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 2.25 KB
Format: Item-specific license agreed to upon submission