Even More Fun
• Some of that overhead can be avoided when the destination of the GPU's data is graphics
• Texture memory can be shared between general-purpose computations and normal rendering
• For post-processing effects or visualizing particles, the pixel/vertex data never needs to leave the GPU
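The sharing described above can be sketched with the CUDA runtime's OpenGL interop API. This is a minimal sketch, not a complete program: it assumes a current OpenGL context, a vertex buffer object `vbo` that already holds `n` float4 particle positions, and a toy kernel `advance` invented for illustration.

```cuda
#include <cuda_gl_interop.h>   // CUDA <-> OpenGL interop declarations

// Hypothetical toy kernel: nudge each particle without ever copying
// the positions back to the host.
__global__ void advance(float4* pos, unsigned n, float dt) {
    unsigned i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) pos[i].y += dt * pos[i].w;
}

void update_particles(GLuint vbo, unsigned n, float dt) {
    cudaGraphicsResource* res;
    // In real code the buffer would be registered once at startup;
    // shown inline here for brevity.
    cudaGraphicsGLRegisterBuffer(&res, vbo, cudaGraphicsRegisterFlagsNone);

    // Map the VBO so CUDA can see it as an ordinary device pointer.
    cudaGraphicsMapResources(1, &res, 0);
    float4* pos;
    size_t bytes;
    cudaGraphicsResourceGetMappedPointer((void**)&pos, &bytes, res);

    advance<<<(n + 255) / 256, 256>>>(pos, n, dt);

    // Hand the buffer back to OpenGL for rendering; the data stays on the GPU.
    cudaGraphicsUnmapResources(1, &res, 0);
    cudaGraphicsUnregisterResource(res);
}
```

After unmapping, the same VBO can be drawn directly, which is what lets the pixel/vertex data avoid the round trip through host memory.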
Conclusions
Certain classes of problem appear in many different fields and involve highly data-parallel operations such as filtering, sorting, or integration. By taking advantage of the architectural decisions behind graphics processing units, such as their multiprocessing and native vector operations, these problems can be solved quickly and cheaply.
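As a minimal illustration of the data-parallel pattern summarized above, a sketch of an elementwise kernel where every array element is handled by its own thread (names and sizes are made up for the example; error checking omitted):

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// One thread per element: the grid covers the array, each thread
// filters/transforms a single value independently.
__global__ void scale(float* x, float a, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) x[i] *= a;
}

int main() {
    const int n = 1024;
    float h[n];
    for (int i = 0; i < n; ++i) h[i] = 1.0f;

    float* d;
    cudaMalloc(&d, n * sizeof(float));
    cudaMemcpy(d, h, n * sizeof(float), cudaMemcpyHostToDevice);

    scale<<<(n + 255) / 256, 256>>>(d, 2.0f, n);

    cudaMemcpy(h, d, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d);
    printf("%f\n", h[0]);
    return 0;
}
```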
References
• 1. Ziegler, Gernot. Introduction to the CUDA Architecture. [Online] 2009. http://www.cse.scitech.ac.uk/disco/workshops/200907/Day1_01_Intro_CUDA_Architecture.pdf.
• 2. NVIDIA Corporation. NVIDIA Compute PTX: Parallel Thread Execution ISA Version 1.1. 2007.
• 3. Göddeke, Dominik. Fast and Accurate Finite-Element Multigrid Solvers for PDE Simulations on GPU Clusters. Berlin: Logos Verlag, 2010. ISBN 978-3-8325-2768-6.
• 4. Accelerating molecular modeling applications with graphics processors. John E. Stone, James C. Phillips, Peter L. Freddolino, David J.