Commit Graph

12 Commits (3828ea354c46fd8d2ac6cf7753e8b0c7ec4c5b61)

Author SHA1 Message Date
Jens Steube 8f8d98665b Cleanup -m 1400 kernels to latest standard
8 years ago
Jens Steube 6cf3e8324d New SIMD code for -a 1 -m 1400
8 years ago
jsteube dad03e394d Fixed two major problems
8 years ago
Jens Steube e6e5005a6b Revert "Zero pws_buf before reuse"
8 years ago
Jens Steube b409e5e9e1 Zero pws_buf before reuse
8 years ago
Jens Steube 1d3795a3ab Converted _a3 kernels, use SIMD for CPU and GPU
9 years ago
Jens Steube a62b7ed06e Upgrade kernel to support dynamic local work sizes
9 years ago
jsteube 331188167c Replace the substring GPU to a more appropriate "device" or "kernel" substring depending on the context
9 years ago
jsteube 5f7c47b461 Fix path to includes
9 years ago
jsteube 2283d5c843 Fix more append_* functions in kernels
9 years ago
jsteube 50f39b3563 Fix append_* function calls
9 years ago
jsteube 0bf4e3c34a - Dropped all vector code since new GPU's are all scalar, makes the code much easier
9 years ago