6 Commits (9a1f35d604407f74c6ee53b51dacba705f382f1f)

Author | SHA1 | Message | Date
Jens Steube | 1d3795a3ab | Converted _a3 kernels, use SIMD for CPU and GPU | 9 years ago
Jens Steube | a62b7ed06e | Upgrade kernel to support dynamic local work sizes | 9 years ago
jsteube | 331188167c | Replace the substring GPU to a more appropriate "device" or "kernel" substring depending on the context | 9 years ago
jsteube | 5f7c47b461 | Fix path to includes | 9 years ago
jsteube | 3942ae02a2 | Speedup -m 5300 | 9 years ago
jsteube | 0bf4e3c34a | - Dropped all vector code since new GPU's are all scalar, makes the code much easier | 9 years ago