Overview of CUDA Function Examples and Download Links
Uploaded 2017-11-26 15:20:54 | PDF, 2.75 MB | 104 pages
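Download links for the CUDA development environment code, with short descriptions of what each sample does. Most of this is in Chapter 3, which briefly introduces the simple and advanced examples shipped with the toolkit. The manual also covers how to get started with CUDA development and introduces the CUDA API.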
CUDA SAMPLES
TRM-06704-001_v6.0 | February 2014
Reference Manual
www.nvidia.com
CUDA Samples TRM-06704-001_v6.0|ii
TABLE OF CONTENTS
Chapter 1. New Features
1.1. New Features in CUDA Toolkit 6.0
1.1.1. CUDA Version 6.0 Highlights
1.1.2. New CUDA 6.0 Code Samples
1.2. New Features in CUDA Toolkit 5.5
1.2.1. CUDA Version 5.5 Highlights
1.2.2. New CUDA 5.5 Code Samples
1.3. New Features in CUDA Toolkit 5.0
1.3.1. CUDA Version 5.0 Highlights
1.3.2. CUDA Dynamic Parallelism Samples in CUDA 5.0 and CUDA 5.5
1.3.3. New Revised CUDA Code Samples
1.4. New Features in CUDA Toolkit 4.2
1.5. New Features in CUDA Toolkit 4.1
Chapter 2. Getting Started
2.1. Supported OS Platforms and Compilers
2.1.1. Supported Windows Platforms
2.1.2. Supported Linux Platforms
2.1.3. Supported Mac Platforms
2.1.4. Supported Android Platforms
2.2. Installation Instructions
2.2.1. Windows Installation Instructions
2.2.2. Linux Installation Instructions
2.2.3. Mac OS X Installation Instructions
2.3. Using CUDA Samples to Create Your Own CUDA Projects
2.3.1. Creating CUDA Projects for Windows
2.3.2. Creating CUDA Projects for Linux
2.3.3. Creating CUDA Projects for Mac OS X
Chapter 3. Samples Reference
3.1. Simple Reference
cppOverload
Simple Quicksort (CUDA Dynamic Parallelism)
Simple Print (CUDA Dynamic Parallelism)
Simple Static GPU Device Library
Simple CUDA Callbacks
simpleAssert
Simple Cubemap Texture
Simple Peer-to-Peer Transfers with Multi-GPU
Using Inline PTX
Simple Layered Texture
simplePrintf
Simple Surface Write
Simple Multi Copy and Compute
Vector Addition
Vector Addition Driver API
Template using CUDA Runtime
Template
C++ Integration
asyncAPI
Clock
Simple Atomic Intrinsics
Pitch Linear Texture
simpleStreams
Simple Templates
Simple Texture
Simple Texture (Driver Version)
Simple Vote Intrinsics
simpleZeroCopy
Simple Multi-GPU
Matrix Multiplication (CUBLAS)
Matrix Multiplication (CUDA Runtime API Version)
Matrix Multiplication (CUDA Driver API Version)
Unified Memory Streams
simpleMPI
cudaOpenMP
3.2. Utilities Reference
Peer-to-Peer Bandwidth Latency Test with Multi-GPUs
Device Query
Device Query Driver API
Bandwidth Test
3.3. Graphics Reference
Bindless Texture
Volumetric Filtering with 3D Textures and Surface Writes
SLI D3D10 Texture
Simple D3D11 Texture
Simple Direct3D9 (Vertex Arrays)
Simple D3D9 Texture
Simple Direct3D10 (Vertex Array)
Simple Direct3D10 Render Target
Simple D3D10 Texture
Simple OpenGL
Simple Texture 3D
Mandelbrot
Marching Cubes Isosurfaces
Volume Rendering with 3D Textures
3.4. Imaging Reference
CUDA and OpenGL Interop of Images
Stereo Disparity Computation (SAD SIMD Intrinsics)
Optical Flow
CUDA Video Encode (C Library) API
Bilateral Filter
DCT8x8
1D Discrete Haar Wavelet Decomposition
CUDA Histogram
Box Filter
Post-Process in OpenGL
DirectX Texture Compressor (DXTC)
Image denoising
Sobel Filter
Recursive Gaussian Filter
CUDA Video Decoder D3D9 API
CUDA Video Decoder GL API
Bicubic B-spline Interpolation
FFT-Based 2D Convolution
CUDA Separable Convolution
Texture-based Separable Convolution
3.5. Finance Reference
Binomial Option Pricing
Black-Scholes Option Pricing
Niederreiter Quasirandom Sequence Generator
Monte Carlo Option Pricing with Multi-GPU support
Sobol Quasirandom Number Generator
Excel 2010 CUDA Integration Example
Excel 2007 CUDA Integration Example
3.6. Simulations Reference
VFlockingD3D10
Fluids (Direct3D Version)
Fluids (OpenGL Version)
CUDA FFT Ocean Simulation
Particles
CUDA N-Body Simulation
Smoke Particles
3.7. Advanced Reference
Quad Tree (CUDA Dynamic Parallelism)
LU Decomposition (CUDA Dynamic Parallelism)
Bezier Line Tessellation (CUDA Dynamic Parallelism)
Advanced Quicksort (CUDA Dynamic Parallelism)
simpleHyperQ
CUDA Parallel Prefix Sum with Shuffle Intrinsics (SHFL_Scan)
CUDA Segmentation Tree Thrust Library
NewDelete
Function Pointers
Interval Computing
CUDA C 3D FDTD
CUDA Context Thread Management
Matrix Multiplication (CUDA Driver API version with Dynamic Linking Version)
Scalar Product
Concurrent Kernels
Aligned Types
PTX Just-in-Time compilation
Eigenvalues
Fast Walsh Transform
Line of Sight
Matrix Transpose
CUDA Parallel Reduction
CUDA Parallel Prefix Sum (Scan)
threadFenceReduction
CUDA Radix Sort (Thrust Library)
CUDA Sorting Networks
Stream Priorities
Merge Sort
3.8. CUDA Libraries Reference
JPEG encode/decode and resize with NPP
simpleDevLibCUBLAS GPU Device API Library Functions (CUDA Dynamic Parallelism)
MersenneTwisterGP11213
GrabCut with NPP
Image Segmentation using Graphcuts with NPP
Histogram Equalization with NPP
FreeImage and NPP Interoperability
Box Filter with NPP
Preconditioned Conjugate Gradient
Random Fog
Monte Carlo Single Asian Option
Monte Carlo Estimation of Pi (batch QRNG)
Monte Carlo Estimation of Pi (batch PRNG)
Monte Carlo Estimation of Pi (batch inline QRNG)
Monte Carlo Estimation of Pi (inline PRNG)
ConjugateGradient
batchCUBLAS
Simple CUBLAS