Jukka Ojanen
cdf27a1cb3
Implement async run_cuda_kernel_memset() and run_cuda_kernel_memset32()
2021-07-27 18:56:59 +03:00
Jukka Ojanen
a642f7b233
Remove synchronous GPU memory copy functions
2021-07-26 15:36:42 +03:00
Jukka Ojanen
4bd7363674
Add missing HIP declarations
2021-07-25 22:18:16 +03:00
Jukka Ojanen
4263cafdcf
Add async CUDA memcpy functions: hc_cuMemcpyDtoDAsync(), hc_cuMemcpyDtoHAsync() and hc_cuMemcpyHtoDAsync(). Implement partially async CUDA memset and bzero kernels.
2021-07-20 12:23:39 +03:00
Jens Steube
ca3beacd93
Disable dynamic shared memory on HIP, because hipFuncSetAttribute() maps to cudaFuncSetAttribute() and not to cuFuncSetAttribute()
2021-07-11 14:30:49 +02:00
Jens Steube
1b84a9e53b
Add missing backports from code base v6.2.2
...
Fix context to thread management
Fix missing code in selftest.c, autotune.c, hashes.c, dispatch.c and backend.c
Use IS_HIP depending code makes it easier for future optimization related to inline assembly calls - instead of using IS_CUDA || IS_HIP
See TODO markers for more optimizations / next steps
2021-07-11 12:38:59 +02:00
Jens Steube
a22f8149fc
Merge branch 'HIP' into hip
2021-07-10 21:34:09 +02:00
reger-men
ea7b74389f
First draft HIP Version
2021-07-09 03:50:40 +00:00
nycex
470e844e5d
use XDG_CACHE_HOME for kernels
2021-06-05 00:38:43 +02:00
Jens Steube
5c6501444a
Kernels: Add standalone true UTF8 to UTF16 converter kernel that runs after amplifier. Use OPTS_TYPE_POST_AMP_UTF16LE from plugin
2021-05-20 14:34:24 +02:00
Jens Steube
0c2afde83b
Add support for clUnloadPlatformCompiler()
2021-05-02 08:15:25 +00:00
Jens Steube
a0eae9050c
OpenCL Runtime: Workaround JiT compiler deadlock on NVIDIA driver >= 465.89
2021-04-11 13:35:40 +02:00
Jens Steube
04d5e5a119
New Attack-Mode: Association Attack. Like JtR's single mode. Very early
...
stage. See hashcat Forum for detailed writeup.
2020-09-29 15:56:32 +02:00
Jens Steube
1fc37c25f9
OpenCL Kernels: Moved "gpu_decompress", "gpu_memset" and "gpu_atinit" into new OpenCL/shared.cl in order to reduce compile time
2020-02-01 09:00:48 +01:00
Jens Steube
346637ec43
Improve cujit logging
2020-01-30 11:44:57 +01:00
Jens Steube
66ae5125ce
Cache cubin instead of PTX to decrease startup time
2020-01-29 15:56:36 +01:00
Jens Steube
a8555fa048
Support use of all available CPU cores for hash-mode specific hooks
2019-11-03 12:05:52 +01:00
Rosen Penev
6ecb42b3ea
Run through Clang's readability-inconsistent-declaration-parameter-name
2019-08-03 22:37:38 -07:00
Jens Steube
5e0eb288c9
Use __launch_bounds__ in CUDA as replacement for reqd_work_group_size() in OpenCL
2019-06-16 18:01:26 +02:00
Jens Steube
0568c0746a
Emulate effect of reqd_work_group_size() in CUDA
2019-06-06 17:49:41 +02:00
Jens Steube
44ecc83d82
Do some CUDA and NVRTC version checks on startup
2019-06-05 10:53:48 +02:00
Jens Steube
1f6c82b6d1
Add hc_cuModuleLoadDataExLog wrapper function for more detailed error logging from CUDA
2019-06-01 07:47:30 +02:00
Jens Steube
ce8a6fde0a
Fix status screen current password query
2019-05-14 15:25:36 +02:00
Jens Steube
33028314f0
Add hc_cuCtxSetCacheConfig()
2019-05-09 00:04:05 +02:00
Jens Steube
4df00033d7
Prepare CUDA events
2019-05-04 10:44:03 +02:00
Jens Steube
f2948460c9
Some first kernel invocations
2019-05-04 10:13:43 +02:00
Jens Steube
5ee033673c
Disable name mangling in NVRTC's PTX output and more
2019-05-03 15:50:07 +02:00
Jens Steube
af8e317cf4
Begin renaming some OpenCL only variables
2019-05-02 17:12:59 +02:00
Jens Steube
a6fa7a2749
Add support for some first CUDA module loader
2019-05-02 14:58:52 +02:00
Jens Steube
d73c0ac8a9
More CUDA attribute queries
2019-04-28 18:54:26 +02:00
Jens Steube
a415422123
Initialize CUDA devices and some first attribute queries
2019-04-28 14:45:50 +02:00
Jens Steube
9faba41848
Use nvrtc to compile PTX (resulting PTX not yet used)
2019-04-26 13:28:44 +02:00
Jens Steube
4b986de5fb
Prepare native CUDA hybrid integration
2019-04-25 14:45:17 +02:00