37 Commits (a2b5981303ee4ad91a2660e9b0a347c718762507)

Author SHA1 Message Date
Jens Steube a2b5981303 Fix some library names
5 years ago
Jens Steube be8f29ca39 Only warn about broken NVIDIA driver
5 years ago
Jens Steube 39e150fc1e Use xxx_v2 CUDA symbols
5 years ago
Jens Steube 33028314f0 Add hc_cuCtxSetCacheConfig()
5 years ago
Jens Steube fb82bfc169 Improve thread handling based on FIXED_LOCAL_SIZE
5 years ago
Jens Steube 3a3df091c7 Fix CUDA num_elements
5 years ago
Jens Steube 363e789b89 Assume local nvrtc.dll and cuda.dll on windows
5 years ago
Jens Steube a7d04adba3 Fix opencl_devices_active and backend_devices_active
5 years ago
Jens Steube 8ff8c5d536 Add LOCAL_VK to make use of __shared__
5 years ago
Jens Steube bbed0cd67a Fix test.sh and bitsliced algos
5 years ago
Jens Steube d0bd33c9d1 Rename CONSTANT_AS to CONSTANT_VK
5 years ago
Jens Steube 64c495dfa5 Use CUDA stream for all cuLaunchKernel() invocations
5 years ago
Jens Steube d94f582097 Replace CEILDIV() with round_up_multiple_64()
5 years ago
Jens Steube e9c04c2446 More CUDA implementation
5 years ago
Jens Steube 08dc1acc02 More CUDA rewrites
5 years ago
Jens Steube ec9925f3b1 Warnings self-check and autotune with CUDA
5 years ago
Jens Steube 4df00033d7 Prepare CUDA events
5 years ago
Jens Steube f2948460c9 Some first kernel invocations
5 years ago
Jens Steube 5ee033673c Disable name mangling in NVRTC's PTX output and more
5 years ago
Jens Steube 503304f36a Add some first CUDA device memory allocations and host buffer copies
5 years ago
Jens Steube 50a6e720ca More OpenCL variables rename
5 years ago
Jens Steube af8e317cf4 Begin renaming some OpenCL only variables
5 years ago
Jens Steube a6fa7a2749 Add support for some first CUDA module loader
5 years ago
Jens Steube 456c57a6d0 Set vector width size for CUDA
5 years ago
Jens Steube 3c4f4df771 Rename some more variables
5 years ago
Jens Steube 495d89f831 Find alias devices across different backend API's
5 years ago
Jens Steube 6fd936b43a Removed --opencl-platforms filter in order to force backend device numbers to stay constant
5 years ago
Jens Steube e3500ff4aa Add CUDA device attributes to -I
5 years ago
Jens Steube d862458ab5 Begin renaming API specific variables in backend section
5 years ago
Jens Steube d73c0ac8a9 More CUDA attribute queries
5 years ago
Jens Steube a415422123 Initialize CUDA devices and some first attribute queries
5 years ago
Jens Steube 58213c81d6 Add vector datatypes operators
5 years ago
Jens Steube 052e42ccef Fix CUDA_ARCH value
5 years ago
Jens Steube 06171958ee Add --gpu-architecture to NVRTC build options
5 years ago
Jens Steube 9faba41848 Use nvrtc to compile PTX (resulting PTX not yet used)
5 years ago
Jens Steube 4045e60021 Add nvrtc wrapper for later use
5 years ago
Jens Steube 4b986de5fb Prepare native CUDA hybrid integration
5 years ago