Boolean:	`(bicycle AND helmet) OR (head AND protection)` (always group AND in parenthesis)
Title:	`title:(climate change)`
Author:	`author:("bohr niels" OR "bohr n")` (avoid only full first name)
Phrase:	`"water pump control"` (does not work with wildcards)
Wildcards:	`wom?n pharm*`

Other

Accelerating Dense Linear Algebra on the GPU

By Sørensen, Hans Henrik Brandenborg^1,2

From

Department of Informatics and Mathematical Modeling, Technical University of Denmark¹

Scientific Computing, Department of Informatics and Mathematical Modeling, Technical University of Denmark²

Abstract

GPUs have already become an integral part of high performance scientific computing, since they offer dedicated parallel hardware that can potentially accelerate the execution of many scientific applications. In this talk, I will consider the automatic performance acceleration of dense vector and matrix-vector operations on GPUs.

Such operations form the backbone of level 1 and level 2 routines in the Basic Linear Algebra Subroutines (BLAS) library and are therefore of great importance in many scientific applications. The target hardware is the most recent NVIDIA Tesla 20-series (Fermi architecture). Most of the techniques I discuss for accelerating dense linear algebra are applicable to memory-bound GPU algorithms in general.

Language:	English
Year:	2011
Proceedings:	Accelerating Computations : Workshop
Types:	Other

Accelerating Dense Linear Algebra on the GPU

DTU Library

Address

Shortcuts

Log in?

Accelerating Dense Linear Algebra on the GPU

DTU Library

Address

Shortcuts