14 Commits (a4e1692647946d664e97fd9548c6b1643e9f60c2)

Author | SHA1 | Message | Date
--- | --- | --- | ---
jsteube | c79bed3b7d | Prepare for a more dynamic #pragma unroll use | 8 years ago
Jens Steube | 0b3743ce94 | - Added inline declaration to functions from simd.c, common.c, rp.c and types_ocl.c to increase performance | 8 years ago
Jens Steube | e6b9071b52 | Cleanup -m 9800 kernels to latest standard | 8 years ago
Jens Steube | d5d06b7023 | Cleanup -m 9700 kernels to latest standard | 8 years ago
jsteube | dad03e394d | Fixed two major problems | 8 years ago
Jens Steube | 7190dcf855 | Prepare NEW_SIMD_MODE for -a 1 kernels | 8 years ago
Gabriele 'matrix' Gristina | 0f0984fe86 | Fixed all gpu code (see PR #179 for details) | 8 years ago
Jens Steube | aa55adf652 | Revert some RC4 based kernels back to scalar processing | 9 years ago
Jens Steube | 1d3795a3ab | Converted _a3 kernels, use SIMD for CPU and GPU | 9 years ago
Jens Steube | a62b7ed06e | Upgrade kernel to support dynamic local work sizes | 9 years ago
jsteube | 331188167c | Replace the substring GPU to a more appropriate "device" or "kernel" substring depending on the context | 9 years ago
jsteube | 5f7c47b461 | Fix path to includes | 9 years ago
jsteube | a849313093 | Fix speed -m 9700, 9710, 9800 and 9810 for NV, also increases some on AMD | 9 years ago
jsteube | 0bf4e3c34a | - Dropped all vector code since new GPU's are all scalar, makes the code much easier | 9 years ago