Commit Graph

12 Commits (72e3821a4cffa30564f65b266b6044bc6f854173)

Author SHA1 Message Date
Jens Steube 72e3821a4c Simplify auto-tuning and benchmark routines
8 years ago
Jens Steube 0b3743ce94 - Added inline declaration to functions from simd.c, common.c, rp.c and types_ocl.c to increase performance
8 years ago
Jens Steube 63c7bda957 Cleanup -m 108xx kernels to latest standard
8 years ago
jsteube dad03e394d Fixed two major problems
8 years ago
Jens Steube 09dfc98797 Converted to new SIMD: -m 10800 -a 0
8 years ago
Jens Steube a62b7ed06e Upgrade kernel to support dynamic local work sizes
8 years ago
jsteube 331188167c Replace the substring GPU to a more appropriate "device" or "kernel" substring depending on the context
9 years ago
jsteube 61744662c0 Fix path to includes
9 years ago
jsteube 5f7c47b461 Fix path to includes
9 years ago
jsteube 2283d5c843 Fix more append_* functions in kernels
9 years ago
jsteube 50f39b3563 Fix append_* function calls
9 years ago
jsteube 0bf4e3c34a - Dropped all vector code since new GPU's are all scalar, makes the code much easier
9 years ago