Commit Graph

18 Commits (c172da1bd26cc49ee0feb668374ed900a0770a66)

Author SHA1 Message Date
Jens Steube 2899f53a15 Move files from include/ to OpenCL/ if they are used within kernels
8 years ago
jsteube c79bed3b7d Prepare for a more dynamic #pragma unroll use
8 years ago
jsteube a0221cd368 Fix broken -m 1500 and -m 3000
9 years ago
jsteube 6dac6b409e Add compiler kernel hints for algorithms with fixed workgroup size
9 years ago
Jens Steube c0d0ef60a5 Replace minus with select give a small boost in speed for -m 1500 and -m 3000
9 years ago
Jens Steube 01baa89bc0 Fix -m 3000 cracking performance
9 years ago
Jens Steube 55cadb6834 Fix broken -m 1500 and -m 3000 in -a 3 mode
9 years ago
jsteube dad03e394d Fixed two major problems
9 years ago
Jens Steube a62b7ed06e Upgrade kernel to support dynamic local work sizes
9 years ago
Jens Steube 245301c9b4 Started optimizing some of the OpenCL kernel for latest AMD Catalyst 15.12:
9 years ago
jsteube e8229af09b Fix for -m 1500, -m 3000: Missing "defined" keyword
9 years ago
jsteube 331188167c Replace the substring GPU to a more appropriate "device" or "kernel" substring depending on the context
9 years ago
jsteube 8b50b60484 More fixes for general OpenCL platforms
9 years ago
jsteube 5f7c47b461 Fix path to includes
9 years ago
jsteube acea69c561 final cleanup for -m 1500
9 years ago
jsteube 661225f03f Fix -m 1500 for NV
9 years ago
jsteube 76cc1631be More kernel fixes for function calls and vector datatypes
9 years ago
jsteube 0bf4e3c34a - Dropped all vector code since new GPU's are all scalar, makes the code much easier
9 years ago