About

Log in?

DTU users get better search results including licensed content and discounts on order fees.

Anyone can log in and get personalized features such as favorites, tags and feeds.

Log in as DTU user Log in as non-DTU user No thanks

DTU Findit

Conference paper

Towards Unifying OpenMP Under the Task-Parallel Paradigm Implementation and Performance of the taskloop Construct

From

Department of Applied Mathematics and Computer Science, Technical University of Denmark1

Embedded Systems Engineering, Department of Applied Mathematics and Computer Science, Technical University of Denmark2

Formal Methods, Department of Applied Mathematics and Computer Science, Technical University of Denmark3

OpenMP 4.5 introduced a task-parallel version of the classical thread-parallel for-loop construct: the taskloop construct. With this new construct, programmers are given the opportunity to choose between the two parallel paradigms to parallelize their for loops. However, it is unclear where and when the two approaches should be used when writing efficient parallel applications.In this paper, we explore the taskloop construct.

We study performance differences between traditional thread-parallel for loops and the new taskloop directive. We introduce an efficient implementation and compare our implementation to other taskloop implementations using micro-and kernel-benchmarks, as well as an application. We show that our taskloop implementation on average results in a 3.2% increase in peak performance when compared against corresponding parallel-for loops.

Language: English
Publisher: Springer
Year: 2016
Pages: 116-129
Proceedings: 12th International Workshop on OpenMP
Series: Lecture Notes in Computer Science
ISBN: 3319455494 , 3319455508 , 9783319455495 and 9783319455501
ISSN: 03029743
Types: Conference paper
DOI: 10.1007/978-3-319-45550-1_9
ORCIDs: Karlsson, Sven

DTU users get better search results including licensed content and discounts on order fees.

Log in as DTU user

Access

Analysis