Boolean:	`(bicycle AND helmet) OR (head AND protection)` (always group AND in parenthesis)
Title:	`title:(climate change)`
Author:	`author:("bohr niels" OR "bohr n")` (avoid only full first name)
Phrase:	`"water pump control"` (does not work with wildcards)
Wildcards:	`wom?n pharm*`

More...

Report

Automatic Loop Parallelization via Compiler Guided Refactoring

By Larsen, Per^1,2; Ladelsky, Razya³; Lidman, Jacob⁴; McKee, Sally A.⁴; Karlsson, Sven^1,2; Zaks, Ayal³

From

Embedded Systems Engineering, Department of Informatics and Mathematical Modeling, Technical University of Denmark¹

Department of Informatics and Mathematical Modeling, Technical University of Denmark²

IBM Research Laboratory³

Chalmers University of Technology⁴

Abstract

For many parallel applications, performance relies not on instruction-level parallelism, but on loop-level parallelism. Unfortunately, many modern applications are written in ways that obstruct automatic loop parallelization. Since we cannot identify sufficient parallelization opportunities for these codes in a static, off-line compiler, we developed an interactive compilation feedback system that guides the programmer in iteratively modifying application source, thereby improving the compiler’s ability to generate loop-parallel code.

We use this compilation system to modify two sequential benchmarks, finding that the code parallelized in this way runs up to 8.3 times faster on an octo-core Intel Xeon 5570 system and up to 12.5 times faster on a quad-core IBM POWER6 system. Benchmark performance varies significantly between the systems.

This suggests that semi-automatic parallelization should be combined with target-specific optimizations. Furthermore, comparing the first benchmark to hand-parallelized, hand-optimized pthreads and OpenMP versions, we find that code generated using our approach typically outperforms the pthreads code (within 93-339%).

It also performs competitively against the OpenMP code (within 75-111%). The second benchmark outperforms hand-parallelized and optimized OpenMP code (within 109-242%).

Language:	English
Publisher:	Technical University of Denmark
Year:	2011
Series:	Imm-technical Report-2011
Types:	Report
ORCIDs:	Karlsson, Sven

Automatic Loop Parallelization via Compiler Guided Refactoring

DTU Library

Address

Shortcuts

Log in?

Automatic Loop Parallelization via Compiler Guided Refactoring

DTU Library

Address

Shortcuts