Template class to offload some heavier computation to specialised hardware. Functions will be executed in CUDA if CUDA device is available else OpenMP routines will be used to execute the functions on CPU. More...
Modules | |
cpu | |
CPU optimized implementation for Offload. | |
Template class to offload some heavier computation to specialised hardware. Functions will be executed in CUDA if CUDA device is available else OpenMP routines will be used to execute the functions on CPU.