Commit Graph

13 Commits (9b3d18f87d2bf9d537240f0a664eabf111aad4f3)

Author SHA1 Message Date
Jens Steube 79b3a1b7ca Cleanup -m 111xx kernels to latest standard
8 years ago
jsteube dad03e394d Fixed two major problems
8 years ago
magnum a5be8a75ed Allow and support vector-width 16, which is current maximum for
8 years ago
Jens Steube 737678284f Converted to new SIMD: -m 10100 -a 0
9 years ago
Jens Steube 06dc6ba656 Converted to new SIMD: -m 11100 -a 0
9 years ago
Jens Steube d8e58d5fd3 Prepare _a0 kernel for SIMD
9 years ago
Jens Steube 1d3795a3ab Converted _a3 kernels, use SIMD for CPU and GPU
9 years ago
Jens Steube a62b7ed06e Upgrade kernel to support dynamic local work sizes
9 years ago
jsteube 331188167c Replace the substring GPU to a more appropriate "device" or "kernel" substring depending on the context
9 years ago
jsteube 61744662c0 Fix path to includes
9 years ago
jsteube 5f7c47b461 Fix path to includes
9 years ago
jsteube 76cc1631be More kernel fixes for function calls and vector datatypes
9 years ago
jsteube 0bf4e3c34a - Dropped all vector code since new GPU's are all scalar, makes the code much easier
9 years ago