Commit Graph

1447 Commits (f859f466efaa8bf885f25c8bf6c9a28039d401c3)

Author SHA1 Message Date
jsteube f859f466ef Fix -m 8300 in -a 0 mode
7 years ago
jsteube f5e04254dc Fix -m 10800 in -a 0 mode
7 years ago
jsteube d3b9febb30 Fix some double variable declarations
7 years ago
jsteube 40b57677cd OpenCL Kernels: Reactivate Dalibors XOR optimization on MD5_H on all MD5 based algorithms
7 years ago
jsteube 6d112aeb39 OpenCL Kernels: Rewritten Keccak kernel to run fully on registers and partially reversed last round
7 years ago
jsteube a378abee66 Add missing NEW_SIMD_CODE in -m 6600
7 years ago
jsteube 1c169af0ad Make -m 14100 a pure kernel only
7 years ago
jsteube 2b9888486e Make -m 14000 a pure kernel only and add volatile for asm statement
7 years ago
jsteube 99f416435e Fix invalid use of __constant in LM kernel
7 years ago
jsteube 6db2f4cc18 Fix typo
7 years ago
jsteube 918578bee1 Improve some NVidia specific inline assembly
7 years ago
jsteube 9de1e557bb More VEGA specific inline assembly to improve SHA1 based kernels
7 years ago
jsteube a0be36d7b8 Fix compile error caused by __add3()
7 years ago
jsteube 00e38cc2c6 Add VEGA specific inline assembly to improve all MD4, MD5, SHA1 and SHA256 based kernels
7 years ago
jsteube 7bfd343ec9 Optimized rule_op_mangle_dupechar_last(), rule_op_mangle_rotate_right(), rule_op_mangle_rotate_left() and append_block1() in rule engine
7 years ago
jsteube 52a97fee75 Improve rule engine performance by improving append_0x80_xxx() performance by using precomputed values from constant memory
7 years ago
jsteube 3260000357 Fix whirlpool pure kernel in -a 0 mode
7 years ago
jsteube e3810d054b Fix some use of pw_t tmp variable
7 years ago
jsteube 5e01ff4c53 Refactor some u32x to u32 where u32x is not needed
7 years ago
jsteube 1aa76eac15 Refactor use of __constant to match up with the user selected attack mode
7 years ago
jsteube 938c281ee0 Resurrect some volatile variables in order to correctly compile pure kernels on AMD drivers
7 years ago
jsteube 48fbe81a09 Add more inline assembly for AMD ROCm
7 years ago
jsteube 6c619155c3 Workaround ROCm compiler error in aes256_ExpandKey()
7 years ago
jsteube 8c9c36ee2a Fix out-of-bound access in aesXXX_InvertKey()
7 years ago
jsteube bed7e8f466 Remove unused truncate_block_xxx_xx() functions and update kernels to use the _S function
7 years ago
jsteube 51dc1c7db3 Use truncate_block_4x4_le_S() instead of truncate_block_4x4_le() in -m 6800
7 years ago
jsteube 9b73c464d2 Fix typo in macro
7 years ago
jsteube 7b443ee7ff Optimize performance of rule_op_mangle_title_sep(), rule_op_mangle_purgechar() and rule_op_mangle_replace()
7 years ago
jsteube 0de41c2716 Some more optimizations for rule engine
7 years ago
jsteube 9f8c5a253d More rule engine performance optimizations
7 years ago
jsteube 0783289e2f Optimized a0 pure kernel for AMD
7 years ago
jsteube a5659d5619 Also switch optimized kernels rule engine to make use of kernel rules in constant memory
7 years ago
jsteube 1d04de3a8e Limit kernel-loops in straight-mode to 256, therefore allow rules to be stored in constant memory
7 years ago
jsteube 51372438fe Allow OpenCL kernel inline assembly if ROCm drivers was detected
7 years ago
jsteube 8853884f2a Fix append_four_byte() in case sm8 is 0
7 years ago
jsteube f32e113942 Add missing case in append_block() in pure kernel rule engine
7 years ago
jsteube 6907981f08 Backport current state of optimized kernel rule engine to CPU
7 years ago
jsteube 508f1562f2 Fix --stdout kernels, gid_max was still set to u32
7 years ago
jsteube 319799bbbf Switch the datatypes of the variables responsible for work-item count and work-item size from u32 to u64
7 years ago
jsteube d9c906e134 Move 0x80 to hardcoded position for sha3-256 bit in order to allow ROCm compiler to use registers only
7 years ago
jsteube 694cc0b740 Remove all calls to overwrite_at_* functions
7 years ago
jsteube e984a829ea Remove no longer needed overwrite_at_* functions
7 years ago
jsteube bf299fe043 Optimized 3DES for rocm
7 years ago
jsteube ad1ce462d1 Get rid of ceil() in OpenCL kernels
7 years ago
jsteube 53f53fe014 Reduced number of required registers in SIP based on maximum possible esalt length
7 years ago
jsteube 9ee5da40e0 Workaround rocm compiler error for -m 15300
7 years ago
jsteube 88e995ddcf Replace some SIMD related function calls
7 years ago
jsteube 5b5bdf3889 Replace some SIMD related function calls
7 years ago
jsteube 967e96728d Make all the OpenCL kernel function includes static
7 years ago
jsteube 21e9c63d46 Fix rotl64() the same was as rotr64()
7 years ago