Commit Graph

11 Commits (f9834f3dfaa7405c6cc047345810c1a5018b8bc5)

Author SHA1 Message Date
magnum a5be8a75ed Allow and support vector-width 16, which is current maximum for
9 years ago
jsteube 7645a6f00f Fix warp bug on -m 8500
9 years ago
Gabriele 'matrix' Gristina 0f0984fe86 Fixed all gpu code (see PR #179 for details)
9 years ago
Jens Steube 1d3795a3ab Converted _a3 kernels, use SIMD for CPU and GPU
9 years ago
Jens Steube a62b7ed06e Upgrade kernel to support dynamic local work sizes
9 years ago
jsteube 331188167c Replace the substring GPU to a more appropriate "device" or "kernel" substring depending on the context
9 years ago
jsteube 5f7c47b461 Fix path to includes
9 years ago
jsteube bd16fd4cfc Declare word_buf_r as __constant for _a3 kernel
9 years ago
jsteube ed35c11e2a Fix -m 8500 kernel
9 years ago
jsteube 76cc1631be More kernel fixes for function calls and vector datatypes
9 years ago
jsteube 0bf4e3c34a - Dropped all vector code since new GPU's are all scalar, makes the code much easier
9 years ago