Commit Graph

348 Commits (6ad36db9bac451ad5586c8f3e406a9bdfceb88cb)

Author SHA1 Message Date
Jens Steube 1f6c82b6d1 Add hc_cuModuleLoadDataExLog wrapper function for more detailed error logging from CUDA
5 years ago
Jens Steube ce8a6fde0a Fix status screen current password query
5 years ago
Jens Steube f84eaa2e4d Fix bitsliced algorithm brute-force with CUDA
5 years ago
Jens Steube 523e0f7151 Fix free unallocated memory in case OpenCL initialization failed
5 years ago
Jens Steube bca03bb7ed CUDA offers a nice way to query available device memory, no need to brute force
5 years ago
Jens Steube a6bc1d3cc0 Experimental kernel-thread autotuner
5 years ago
Jens Steube d59474fded Testwise unlock full thread count on NVidia
5 years ago
Jens Steube d378aa7ab9 Show host memory requirement on startup
5 years ago
Jens Steube 46f737c5af Use real constant memory on CUDA
5 years ago
Jens Steube 5d14a59304 Need 3.x nvrtc minimum
5 years ago
Jens Steube 54feb62e94 brute-force nvrtc .dll name
5 years ago
Jens Steube a2b5981303 Fix some library names
5 years ago
Jens Steube be8f29ca39 Only warn about broken NVIDIA driver
5 years ago
Jens Steube 39e150fc1e Use xxx_v2 CUDA symbols
5 years ago
Jens Steube 33028314f0 Add hc_cuCtxSetCacheConfig()
5 years ago
Jens Steube fb82bfc169 Improve thread handling based on FIXED_LOCAL_SIZE
5 years ago
Jens Steube 3a3df091c7 Fix CUDA num_elements
5 years ago
Jens Steube 363e789b89 Assume local nvrtc.dll and cuda.dll on windows
5 years ago
Jens Steube a7d04adba3 Fix opencl_devices_active and backend_devices_active
5 years ago
Jens Steube 8ff8c5d536 Add LOCAL_VK to make use of __shared__
5 years ago
Jens Steube bbed0cd67a Fix test.sh and bitsliced algos
5 years ago
Jens Steube d0bd33c9d1 Rename CONSTANT_AS to CONSTANT_VK
5 years ago
Jens Steube 64c495dfa5 Use CUDA stream for all cuLaunchKernel() invocations
5 years ago
Jens Steube d94f582097 Replace CEILDIV() with round_up_multiple_64()
5 years ago
Jens Steube e9c04c2446 More CUDA implementation
5 years ago
Jens Steube 08dc1acc02 More CUDA rewrites
5 years ago
Jens Steube ec9925f3b1 Warnings self-check and autotune with CUDA
5 years ago
Jens Steube 4df00033d7 Prepare CUDA events
5 years ago
Jens Steube f2948460c9 Some first kernel invocations
5 years ago
Jens Steube 5ee033673c Disable name mangling in NVRTC's PTX output and more
5 years ago
Jens Steube 503304f36a Add some first CUDA device memory allocations and host buffer copies
5 years ago
Jens Steube 50a6e720ca More OpenCL variables rename
5 years ago
Jens Steube af8e317cf4 Begin renaming some OpenCL only variables
5 years ago
Jens Steube a6fa7a2749 Add support for some first CUDA module loader
5 years ago
Jens Steube 456c57a6d0 Set vector width size for CUDA
5 years ago
Jens Steube 3c4f4df771 Rename some more variables
5 years ago
Jens Steube 495d89f831 Find alias devices across different backend API's
5 years ago
Jens Steube 6fd936b43a Removed --opencl-platforms filter in order to force backend device numbers to stay constant
5 years ago
Jens Steube e3500ff4aa Add CUDA device attributes to -I
5 years ago
Jens Steube d862458ab5 Begin renaming API specific variables in backend section
5 years ago
Jens Steube d73c0ac8a9 More CUDA attribute queries
5 years ago
Jens Steube a415422123 Initialize CUDA devices and some first attribute queries
5 years ago
Jens Steube 58213c81d6 Add vector datatypes operators
5 years ago
Jens Steube 052e42ccef Fix CUDA_ARCH value
5 years ago
Jens Steube 06171958ee Add --gpu-architecture to NVRTC build options
5 years ago
Jens Steube 9faba41848 Use nvrtc to compile PTX (resulting PTX not yet used)
5 years ago
Jens Steube 4045e60021 Add nvrtc wrapper for later use
5 years ago
Jens Steube 4b986de5fb Prepare native CUDA hybrid integration
5 years ago