Commit Graph

63 Commits (3161aec3da8f763fe14461fa50c4a82b58717145)

Author SHA1 Message Date
Gabriele Gristina 3161aec3da fix the comments :)
5 years ago
Gabriele Gristina 5679ca3344 Rewrite hc_fopen to better handling file descriptor locking/unlocking functions, saving kernels binary from plain to gzip format
5 years ago
Gabriele Gristina 5d3ed3e754 Remove union from HCFILE, using std file ops in ocl_check_dri, remove debug comments
5 years ago
Gabriele Gristina c2e634c426 switch is_gzip from short to bool
5 years ago
Gabriele Gristina 481c752456 No more compress functions, update example.dict.gz, remove some comments
5 years ago
Gabriele Gristina 398c89c75c switch almost all FILE ops, potfile is the only missing
5 years ago
Gabriele Gristina b2529af172 remove original commented code
5 years ago
Gabriele Gristina 6cb4abd526 Add zlib support v2
5 years ago
Jens Steube 955bfeaa14 Improve performance of bitsliced algorithms on ROCm
5 years ago
Jens Steube 5e0eb288c9 Use __launch_bounds__ in CUDA as replacement for reqd_work_group_size() in OpenCL
5 years ago
Jens Steube c2fc849e2c Fix minimum threads_per_block check
5 years ago
Jens Steube 0568c0746a Emulate effect of reqd_work_group_size() in CUDA
5 years ago
Jens Steube 44ecc83d82 Do some CUDA and NVRTC version checks on startup
5 years ago
Jens Steube 03ed89684e Use --restrict nvrtc option by default
5 years ago
Jens Steube 87c336e822 Fix format warning in backend.c
5 years ago
Jens Steube 1f6c82b6d1 Add hc_cuModuleLoadDataExLog wrapper function for more detailed error logging from CUDA
5 years ago
Jens Steube ce8a6fde0a Fix status screen current password query
5 years ago
Jens Steube f84eaa2e4d Fix bitsliced algorithm brute-force with CUDA
5 years ago
Jens Steube 523e0f7151 Fix free unallocated memory in case OpenCL initialization failed
5 years ago
Jens Steube bca03bb7ed CUDA offers a nice way to query available device memory, no need to brute force
5 years ago
Jens Steube a6bc1d3cc0 Experimental kernel-thread autotuner
5 years ago
Jens Steube d59474fded Testwise unlock full thread count on NVidia
5 years ago
Jens Steube d378aa7ab9 Show host memory requirement on startup
5 years ago
Jens Steube 46f737c5af Use real constant memory on CUDA
5 years ago
Jens Steube 5d14a59304 Need 3.x nvrtc minimum
5 years ago
Jens Steube 54feb62e94 brute-force nvrtc .dll name
5 years ago
Jens Steube a2b5981303 Fix some library names
5 years ago
Jens Steube be8f29ca39 Only warn about broken NVIDIA driver
5 years ago
Jens Steube 39e150fc1e Use xxx_v2 CUDA symbols
5 years ago
Jens Steube 33028314f0 Add hc_cuCtxSetCacheConfig()
5 years ago
Jens Steube fb82bfc169 Improve thread handling based on FIXED_LOCAL_SIZE
5 years ago
Jens Steube 3a3df091c7 Fix CUDA num_elements
5 years ago
Jens Steube 363e789b89 Assume local nvrtc.dll and cuda.dll on windows
5 years ago
Jens Steube a7d04adba3 Fix opencl_devices_active and backend_devices_active
5 years ago
Jens Steube 8ff8c5d536 Add LOCAL_VK to make use of __shared__
5 years ago
Jens Steube bbed0cd67a Fix test.sh and bitsliced algos
5 years ago
Jens Steube d0bd33c9d1 Rename CONSTANT_AS to CONSTANT_VK
5 years ago
Jens Steube 64c495dfa5 Use CUDA stream for all cuLaunchKernel() invocations
5 years ago
Jens Steube d94f582097 Replace CEILDIV() with round_up_multiple_64()
5 years ago
Jens Steube e9c04c2446 More CUDA implementation
5 years ago
Jens Steube 08dc1acc02 More CUDA rewrites
5 years ago
Jens Steube ec9925f3b1 Warnings self-check and autotune with CUDA
5 years ago
Jens Steube 4df00033d7 Prepare CUDA events
5 years ago
Jens Steube f2948460c9 Some first kernel invocations
5 years ago
Jens Steube 5ee033673c Disable name mangling in NVRTC's PTX output and more
5 years ago
Jens Steube 503304f36a Add some first CUDA device memory allocations and host buffer copies
5 years ago
Jens Steube 50a6e720ca More OpenCL variables rename
5 years ago
Jens Steube af8e317cf4 Begin renaming some OpenCL only variables
5 years ago
Jens Steube a6fa7a2749 Add support for some first CUDA module loader
5 years ago
Jens Steube 456c57a6d0 Set vector width size for CUDA
5 years ago