Jukka Ojanen
c48e6a25a8
Enqueue several commands before clFlush()
2021-07-29 14:28:01 +03:00
Jukka Ojanen
9ed231c99c
Add comment to blocking OpenCL calls
2021-07-29 00:07:10 +03:00
Jukka Ojanen
e889cf98a9
Few cleanups in autotune
2021-07-28 23:50:58 +03:00
Jukka Ojanen
d7cc8d7cd2
Revert due to module_03200.c, module_25600.c and module_25800.c using device_param->kernel_dynamic_local_mem_size_memset
2021-07-28 19:49:02 +03:00
Jukka Ojanen
8b590f651b
Remove unused gpu_memset and its references
2021-07-28 19:26:30 +03:00
Jukka Ojanen
5dbe4958b8
Remove digests_shown_tmp buffer, used in check_cracked() to zero out hashes
2021-07-27 19:01:13 +03:00
Jukka Ojanen
cdf27a1cb3
Implement async run_cuda_kernel_memset() and run_cuda_kernel_memset32()
2021-07-27 18:56:59 +03:00
Jukka Ojanen
e8be7028cd
Handle errors inside check_cracked()
2021-07-27 16:28:07 +03:00
Jukka Ojanen
be33ccaa6c
check_hash() needs to return value to indicate possible error
2021-07-27 14:36:48 +03:00
Jukka Ojanen
ae44a3022b
Use fast event to synchronize selftest()
2021-07-27 13:00:39 +03:00
Jukka Ojanen
d7de3550b1
Add fast event for CUDA and HIP, and use it to synchronize check_hash()
2021-07-27 12:57:26 +03:00
Jukka Ojanen
73b8cda176
Avoid unnecessary memcpy in check_hash()
2021-07-26 22:25:50 +03:00
Jukka Ojanen
43a3622a77
Add necessary synchronization to check_hash() and check_cracked()
2021-07-26 20:57:24 +03:00
Jukka Ojanen
a642f7b233
Remove synchronous GPU memory copy functions
2021-07-26 15:36:42 +03:00
Jukka Ojanen
507d40fff6
Allow async execution of check_hash() and check_cracked()
2021-07-26 15:35:59 +03:00
Jukka Ojanen
de5200cffc
Allow async execution of backend
2021-07-26 15:17:25 +03:00
Jukka Ojanen
a86b44a750
Allow async execution of autotune() and prepare try_run() for async kernel execution
2021-07-26 01:51:19 +03:00
Jukka Ojanen
79404b1ff1
Allow async execution of selftest()
2021-07-25 22:19:50 +03:00
Jukka Ojanen
4bd7363674
Add missing HIP declarations
2021-07-25 22:18:16 +03:00
Jukka Ojanen
9f9333f2ef
Allow async execution of run_opencl_kernel_bzero(), run_hip_kernel_bzero() and run_opencl_kernel_bzero()
2021-07-22 15:18:10 +03:00
Jukka Ojanen
fafd24237d
Define HC_ALIGN macro to control data alignment and use common constant zero buffer in run_cuda_kernel_bzero(), run_hip_kernel_bzero() and run_opencl_kernel_bzero().
2021-07-22 14:24:03 +03:00
Jukka Ojanen
2c2988518d
Remove all calls to clFlush()
2021-07-22 13:59:19 +03:00
Jens Steube
5024865d87
Kernel Threads: Use warp size / wavefront size query instead of hardcoded values as base for kernel threads
...
Kernel Cache: Add kernel threads into hash computation which is later used in the kernel cache filename
Remove some unused function symbol lookups in HIP library
2021-07-22 11:46:47 +02:00
Jens Steube
a09efb2634
SCRYPT Kernels: Add more optimized values for some new NV/AMD GPUs
2021-07-22 11:46:11 +02:00
Jens Steube
0d64db07d5
Merge pull request #2892 from Xanadrel/patch-1
...
Add hashcat.hctune entries for Nvidia RTX 3060Ti & 3070 for scrypt based hash-modes
2021-07-22 11:38:41 +02:00
Xanadrel
1d7880fa30
Add hashcat.hctune entries for Nvidia RTX 3060Ti & 3070 for scrypt based hash-modes
2021-07-21 23:24:19 +02:00
Jens Steube
e83611742d
Brain Session: Adds hashconfig specific opti_type and opts_type parameters to hashcat session computation to cover features like -O and -M
2021-07-21 15:47:26 +02:00
Jens Steube
c990e252d3
Added option --multiply-accel-disable (short: -M) to disable multiply the kernel-accel with the multiprocessor count automatism
2021-07-21 15:47:05 +02:00
Jens Steube
a7a899e5a4
Backport changes from #2888 to HIP backend
2021-07-21 14:01:28 +02:00
Jens Steube
8ba907125a
Merge pull request #2888 from jtojanen/master
...
Calculate kernel dynamic memory size based on CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK_OPTIN
2021-07-21 13:28:12 +02:00
Jukka Ojanen
cb923d6e46
Replace CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK with CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK_OPTIN
2021-07-21 13:42:22 +03:00
Jukka Ojanen
55a475cb4b
Merge branch 'hashcat:master' into master
2021-07-20 21:27:51 +03:00
Jukka Ojanen
d23f2d6c2f
Calculation kernel dynamic memory size based on CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK
2021-07-20 21:27:29 +03:00
Jens Steube
41156bb0a5
Improve PIM fix for use on macOS
2021-07-20 15:10:49 +02:00
Jens Steube
a026171fb3
Merge pull request #2879 from jtojanen/master
...
Implement gpu_bzero
2021-07-20 13:00:37 +02:00
Jens Steube
7c5fe5e233
Slow Kernels: Set some of the slowest kernels to OPTS_TYPE_MP_MULTI_DISABLE
2021-07-20 12:47:40 +02:00
Jukka Ojanen
8674e23d79
Add async HIP memcpy functions: hc_hipMemcpyDtoDAsync(), hc_hipMemcpyDtoHAsync() and hc_hipMemcpyHtoDAsync(). Implement partially async HIP memset and bzero kernels.
2021-07-20 12:47:10 +03:00
Jens Steube
777512e862
Fixed false negative in all VeraCrypt hash-modes if both conditions are met: 1. use CPU for cracking and 2. PIM range was used
2021-07-20 11:31:31 +02:00
Jukka Ojanen
4263cafdcf
Add async CUDA memcpy functions: hc_cuMemcpyDtoDAsync(), hc_cuMemcpyDtoHAsync() and hc_cuMemcpyHtoDAsync(). Implement partially async CUDA memset and bzero kernels.
2021-07-20 12:23:39 +03:00
Jukka Ojanen
f07ff6f03d
Use kernel_bzero instead of kernel_memset when value is zero
2021-07-20 12:04:12 +03:00
Jukka Ojanen
4c0f6dd263
Merge branch 'hashcat:master' into master
2021-07-20 12:00:41 +03:00
Jukka Ojanen
ea5180ac46
Include missing bzero kernel parameters
2021-07-20 11:59:44 +03:00
Jens Steube
92dc57df28
Revert adding make_u32x() for constants to crypto primitives, fails on OpenCL
2021-07-20 10:34:34 +02:00
Jens Steube
bdb7999f07
Switch HIP vector datatypes to OpenCL like ext_vector_type()
2021-07-19 20:24:30 +02:00
Jens Steube
9421e1f89d
Fixed datatype in function sha384_hmac_init_vector_128() that could come into effect if vector datatype was manually set
2021-07-19 15:58:38 +02:00
Jens Steube
7d43b6e1eb
Clear tmps memory region after autotune use
2021-07-19 11:59:53 +02:00
Jens Steube
0504498e86
Merge pull request #2886 from pellekuiters/issue-2876
...
Auto-tuning outside kernel can cause unpredictable behavior for tmp_t structs on CUDA
2021-07-19 11:52:07 +02:00
pelle
ac3ccfcc51
Run init() and loop_prepare() during autotune to prevent possible corruptions during loop().
2021-07-19 11:07:24 +02:00
Jens Steube
0d8b4b74ad
More CUDA special backports to HIP
2021-07-18 22:56:22 +02:00
Jens Steube
257098a301
Get rid of hip/hip_runtime.h dependancy
2021-07-18 21:14:45 +02:00