TI OpenCL v01.01.xx
TI OpenCL™ Runtime Documentation Contents:
- Introduction
- OpenCL 1.1 Reference Material
- Compilation
- Memory Usage
  - Device Memory
  - How DDR3 is Partitioned for Linux System and OpenCL
  - Alternate Host malloc/free Extension for Zero Copy OpenCL Kernels
  - The OpenCL Memory Model
  - OpenCL Buffers
  - Buffer Read/Write vs. Map/Unmap
  - Discovering OpenCL Memory Sizes and Limits
  - Cache Operations
  - Large OpenCL buffers and Memory Beyond the 32-bit DSP Address Space
  - User Defined DSP Heap Extension
- Execution Model
- Extensions
  - Calling Standard C Code From OpenCL C Code
  - Calling Standard C code with OpenMP from OpenCL C code
  - C66x standard C compiler intrinsic functions
  - OpenCL C code using printf
  - DMA Control Using EdmaMgr Functions
  - Using Extended Memory on the 66AK2x device
  - Fast Global buffers in on-chip MSMC memory
  - OpenCL C Builtin Function Extensions
  - Cache Operations
- Environment Variables
- Optimization Tips
  - Optimization Techniques for Host Code
  - Optimization Techniques for Device (DSP) Code
    - Prefer Kernels with 1 work-item per work-group
    - Use Local Buffers
    - Use async_work_group_copy and async_work_group_strided_copy
    - Avoid DSP writes directly to DDR
    - Use the reqd_work_group_size attribute on kernels
    - Use the TI OpenCL extension that allows Standard C code to be called from OpenCL C code
    - Avoid OpenCL C Barriers
    - Use the most efficient data type on the DSP
    - Do Not Use Large Vector Types
    - Consecutive memory accesses
    - Prefer the CPU style of writing OpenCL code over the GPU style
  - Typical Steps to Optimize Device Code
  - Optimizing 3x3 Gaussian smoothing filter
  - Performance Data
- Examples
  - Building and Running
  - Example Descriptions
    - platforms example
    - simple example
    - mandelbrot, mandelbrot_native examples
    - ccode example
    - matmpy example
    - offline example
    - vecadd_openmp example
    - vecadd_openmp_t example
    - vecadd example
    - vecadd_mpax example
    - vecadd_mpax_openmp example
    - dsplib_fft example
    - ooo, ooo_map examples
    - null example
    - sgemm example
    - dgemm example
    - edmamgr example
    - dspheap example
  - Float compute example
  - Monte Carlo example
- Debug
- Profiling
- OpenCL on TI-RTOS
- Frequently Asked Questions
  - How do I get support for TI OpenCL products?
  - Which TI OpenCL Version is Installed?
  - Using Python OpenCL with the TI OpenCL implementation
  - Guidelines for porting Stand-alone DSP applications to OpenCL
  - OpenCL Interoperability with Host OpenMP
  - MCSDK-HPC to OpenCL Component Version Map
  - Does TI’s OpenCL support images and samplers?
  - Why does the OpenCL ICD installed on my platform not find the TI OpenCL implementation?
  - Why do I get messages about /var/lock/opencl when running OpenCL applications?
  - Why do I get DLOAD error messages when running OpenCL applications?
  - How do I limit log file sizes on EVM’s temporary file storage (tmpfs)?
- Readme
- Disclaimer
- Important Notice