Commit Graph

351 Commits (684ce0afcff73e4e9139db6ee37e0a59855b2d2e)

Author SHA1 Message Date
Jens Steube 8293964097 Fix coding convention
2 years ago
Gabriele Gristina cd363b32f6
Merge branch 'master' into metal_prepare_kernelIncludes_v2
2 years ago
Gabriele Gristina 01a28f80f7 Updated handling of POCL's known bugs
2 years ago
Gabriele Gristina a1ced24564 Fixed bug on benchmark engine, add some unstable warnings, updated negative status code
2 years ago
Gabriele Gristina 2e4a136758 Refactored standard kernel includes in order to support Apple Metal runtime, updated backend, test units and status code
2 years ago
Gabriele Gristina 7650894e02 fixed bug in benchmark engine, updated negative status code
2 years ago
Jens Steube dfd316c653
Merge pull request #3103 from matrix/backend_session_update_mp_rl
2 years ago
Jens Steube 7a9a1b37d0
Merge pull request #3104 from matrix/backend_session_update_mp
2 years ago
Jens Steube 56ef2b4bde
Merge pull request #3102 from matrix/backend_cuda_restyle
2 years ago
Jens Steube 045ca5cb7a Fixed method how OPTS_TYPE_AUX* kernels are called in association mode, for instance WPA/WPA2 kernels
2 years ago
Jens Steube 668d2179cd Kernels: Refactored standard kernel declaration to use a structure holding u32/u64 attributes to reduce the number of attributes
2 years ago
Gabriele Gristina 994083eaf5 Removed hc_clSetKernelArg() call from backend_session_update_mp()
2 years ago
Gabriele Gristina 0f0cf1fe08 Removed hc_clSetKernelArg() call from backend_session_update_mp_rl()
2 years ago
Gabriele Gristina f8ceb8785e CUDA Backend: moved functions to ext_cuda.c/ext_nvrtc.c and includes to ext_cuda.h/ext_nvrtc.h
2 years ago
Gabriele Gristina 78c7ee2af6 HIP Backend: moved functions to ext_hip.c/ext_hiprtc.c and includes to ext_hip.h/ext_hiprtc.h
2 years ago
Gabriele Gristina 26b6054cab OpenCL Backend: moved functions to ext_OpenCL.c and includes to ext_OpenCL.h
2 years ago
Gabriele Gristina 861e644057 OpenCL Backend: added workaround to make optimized kernels work on Apple Silicon
2 years ago
Jens Steube df6e5480ca Print module_extra_tuningdb_block undefined compute device warning only on GPU
2 years ago
Gabriele Gristina 3fd6dac523 Set default device-type to GPU with Apple M1
3 years ago
Gabriele Gristina 0fae3a4394 Added support for Apple Silicon compute devices
3 years ago
Jens Steube d4a54287b1 Add missing backslash for RUN_INSTRUCTION_CHECKS() on AMD
3 years ago
Jens Steube 3d53188cc3 Tuning Database: Added a warning if a module implements module_extra_tuningdb_block but the installed computing device is not found
3 years ago
Jens Steube 21f91c5bb8 Module Optimizaters: Added OPTS_TYPE_MAXIMUM_THREADS to deactivate the else branch route in the section to find -T before compilation
3 years ago
Gabriele Gristina 9be7bc71a5 OpenCL Backend: added workaround to support Apple Silicon
3 years ago
Jens Steube 53f6693495 Temporary enable HIP 4.4/ROCM 4.5 on Linux and globally set native thread count
3 years ago
Jens Steube f84aca82ca Backend types: The default filter for the device types is now set so that only the GPU is used, except for APPLE, where we set CPU
3 years ago
Jens Steube 49a68cd6c1 AMD Driver: Updated requirements for AMD Linux drivers to ROCm 4.5 or later due to new HIP interface
3 years ago
Jens Steube 576a71af5c Update minimum HIP version from 4.4 to upcoming 4.5
3 years ago
Jens Steube 756c29ec57 Add missing cleanup on windows if outdated HIP version is detected
3 years ago
Jens Steube 733f9c2d77 Add better detection future HIP 4.4
3 years ago
Jens Steube 07e58631a5 Backend devices: In non -S mode, limit the number of workitems so that no more than 4GB of host memory is required per backend device
3 years ago
Jens Steube 4b6654b503 Fix unstable plugin to driver warning
3 years ago
Jens Steube c1fd42fe72 Reduce work item maximum in -S mode even further, tested with NTLM
3 years ago
Jens Steube bd2cde31ae Back-end devices: In -S mode, limit the number of workitems so that no more than 2GB of host memory is required per backend device
3 years ago
Jens Steube 4ef1509bc7 Backend Devices: Reduce maximum workitems limited derived from available host memory down from to 8GB to 4GB per backend device
3 years ago
Jens Steube 721e1ea54d Fixed division by zero because backend_ctx->hardware_power_all was not re-inserted after refactoring device_param->hardware_power
3 years ago
Jens Steube 8c14fd85ea POCL: Added a workaround for an issue in POCL that uses a quote character as part of the path itself given to a path for the -I option
3 years ago
Jens Steube b4b2195fa5 OpenCL Runtime: The use of clUnloadPlatformCompiler () was disabled after some users received unexpected return codes
3 years ago
Jens Steube 85854236d1
Merge pull request #2935 from matrix/apple_gpu_workaround
3 years ago
Jens Steube 50e28ff306
Merge pull request #2926 from jtojanen/master
3 years ago
Jukka Ojanen 6b4786de84 Make blocking clEnqueueWriteBuffer() non-blocking
3 years ago
Jukka Ojanen f3cd32bffd Avoid memory copy in rebuild_pws_compressed_append() and assign instead
3 years ago
Jens Steube 76e388ea47 Memory Management: Check in OpenCL that CL_DEVICE_HOST_UNIFIED_MEMORY is true and if so, then half of the available memory
3 years ago
Jens Steube 9254603960 Fixed out-of-boundary read in PKZIP masterkey kernel if the password candidate has length zero.
3 years ago
Gabriele Gristina 72735ed7c4 update workaround
3 years ago
Gabriele Gristina fef18965c3 workaround to 'clEnqueueWriteBuffer(): CL_INVALID_VALUE' with apple gpu
3 years ago
Jukka Ojanen 5c0820b8d3 Assume hashconfig->tmp_size is the element size
3 years ago
Jukka Ojanen bcb4a8f391 Add clFlush() to backend_session_update_mp() and backend_session_update_mp_rl()
3 years ago
Jukka Ojanen 3b11b51113 Remove unnecessary hc_cuEventSynchronize() call
3 years ago
Jukka Ojanen 1edb649c1f Remove duplicate conditions
3 years ago
Jens Steube 5e1d37c82e Make unsupported AMD HIP runtime version error message a bit more human readable
3 years ago
Jens Steube cb69e2d413 Added some HIP version checks, fall back to OpenCL automatically
3 years ago
Jens Steube b2d1f42905 Fix self-test functionality if FIXED_LOCAL_SIZE_COMP is used
3 years ago
Jens Steube af5d346244
Merge pull request #2894 from jtojanen/master
3 years ago
Jens Steube 20a7b9f992 Tuning-Database: Add new module function module_extra_tuningdb_block() to extend hashcat.hctune content from a plugin
3 years ago
Jukka Ojanen 3d7ce7162b Merge branch 'master' of https://github.com/hashcat/hashcat
3 years ago
Jukka Ojanen 62a06f735f kernel_memset32(): assume offset and size are bytes, not elements
3 years ago
Jukka Ojanen fdbfae9a28 Modify OpenCL clEnqueueFillBuffer() workaround
3 years ago
Jukka Ojanen dbe2bad098 Maintain code style
3 years ago
Jukka Ojanen 1ee222d43f Replace free() with hcfree()
3 years ago
Jukka Ojanen e352a79a05 Extend context in gidd_to_pw_t()
3 years ago
Jens Steube d4997d1255 Added support for auto-tuning --kernel-threads (-T) on startup
3 years ago
Jukka Ojanen e154f9e781 Few cleanups
3 years ago
Jukka Ojanen c3195d0603 Merge branch 'master' of https://github.com/hashcat/hashcat
3 years ago
Jukka Ojanen 1064cce08c Synchronize before hooks
3 years ago
Jukka Ojanen 81c2ec3caf Small cleanup in gidd_to_pw_t()
3 years ago
Jukka Ojanen 7a8065d090 Do not call clWaitForEvents() after spin damper when we know that event status is CL_COMPLETE
3 years ago
Jukka Ojanen c48e6a25a8 Enqueue several commands before clFlush()
3 years ago
Jens Steube 03ed06849a
Merge branch 'HIP44' into master
3 years ago
Jens Steube d38d40c8ba Unlock all GPU threads for AMD GPUs if WaveFront size is 32 (basically new models)
3 years ago
Jukka Ojanen 9ed231c99c Add comment to blocking OpenCL calls
3 years ago
Jens Steube a4299b74af Memory Management: Refactored the code responsible for limiting kernel accel in order to avoid out of -host- memory situations
3 years ago
Jukka Ojanen d7cc8d7cd2 Revert due to module_03200.c, module_25600.c and module_25800.c using device_param->kernel_dynamic_local_mem_size_memset
3 years ago
Jukka Ojanen 8b590f651b Remove unused gpu_memset and its references
3 years ago
Jens Steube 9c0a37accf Update driver requirement
3 years ago
Jukka Ojanen cdf27a1cb3 Implement async run_cuda_kernel_memset() and run_cuda_kernel_memset32()
3 years ago
Jukka Ojanen d7de3550b1 Add fast event for CUDA and HIP, and use it to synchronize check_hash()
3 years ago
Jens Steube f6de3e61e0
Merge pull request #2906 from hashcat/master
3 years ago
Jens Steube fd2cb59d26 AMD GPUs: On Apple OpenCL platform, we ask for the preferred kernel thread size rather than hard-coding 32
3 years ago
Jukka Ojanen a642f7b233 Remove synchronous GPU memory copy functions
3 years ago
Jukka Ojanen de5200cffc Allow async execution of backend
3 years ago
Jens Steube 72e307fbce
Merge pull request #2901 from hashcat/master
3 years ago
Jens Steube 959a232828
Merge pull request #2885 from neheb/charfixes
3 years ago
Jens Steube 84a4058edf
Merge pull request #2900 from hashcat/master
3 years ago
Jens Steube 640d95a00f Vendor Detection: Add "Intel" as a valid vendor name for GPU on macOS
3 years ago
Jens Steube 5ffcaa980d HIP Backend: Added support to support HIP 4.4 and later, but added check to rule out older versions because they are incompatible
3 years ago
Rosen Penev adaf3f293b make const char pointers actually const
3 years ago
Jukka Ojanen 9f9333f2ef Allow async execution of run_opencl_kernel_bzero(), run_hip_kernel_bzero() and run_opencl_kernel_bzero()
3 years ago
Jukka Ojanen fafd24237d Define HC_ALIGN macro to control data alignment and use common constant zero buffer in run_cuda_kernel_bzero(), run_hip_kernel_bzero() and run_opencl_kernel_bzero().
3 years ago
Jukka Ojanen 2c2988518d Remove all calls to clFlush()
3 years ago
Jens Steube 5024865d87 Kernel Threads: Use warp size / wavefront size query instead of hardcoded values as base for kernel threads
3 years ago
Jens Steube c990e252d3 Added option --multiply-accel-disable (short: -M) to disable multiply the kernel-accel with the multiprocessor count automatism
3 years ago
Jens Steube a7a899e5a4 Backport changes from #2888 to HIP backend
3 years ago
Jukka Ojanen cb923d6e46 Replace CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK with CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK_OPTIN
3 years ago
Jukka Ojanen d23f2d6c2f Calculation kernel dynamic memory size based on CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK
3 years ago
Jukka Ojanen 8674e23d79 Add async HIP memcpy functions: hc_hipMemcpyDtoDAsync(), hc_hipMemcpyDtoHAsync() and hc_hipMemcpyHtoDAsync(). Implement partially async HIP memset and bzero kernels.
3 years ago
Jukka Ojanen 4263cafdcf Add async CUDA memcpy functions: hc_cuMemcpyDtoDAsync(), hc_cuMemcpyDtoHAsync() and hc_cuMemcpyHtoDAsync(). Implement partially async CUDA memset and bzero kernels.
3 years ago
Jukka Ojanen 4c0f6dd263
Merge branch 'hashcat:master' into master
3 years ago
Jukka Ojanen ea5180ac46 Include missing bzero kernel parameters
3 years ago
Jens Steube 257098a301 Get rid of hip/hip_runtime.h dependancy
3 years ago