Jens Steube
|
a841af000f
|
Fix updated HIP version handling
|
2022-04-22 13:28:33 +02:00 |
|
Jens Steube
|
4791cca0aa
|
AMD Driver: Updated requirements for AMD Linux drivers to ROCm 5.1.1 or later due to new HIP interface
AMD Driver: Updated requirements for AMD Windows drivers to Adrenalin 22.3.1 or later due to new HIP interface
|
2022-04-19 13:02:51 +02:00 |
|
Jens Steube
|
9a8f3ebcab
|
Update HIP version check on Linux and Windows
|
2022-04-18 18:07:40 +02:00 |
|
Jens Steube
|
c916ae821d
|
Fix HIP include folder usage on Windows
|
2022-04-16 18:32:21 +02:00 |
|
philsmd
|
a24d723d66
|
minor code style fixes
|
2022-03-31 19:19:16 +02:00 |
|
philsmd
|
b51f536ac5
|
sanity check for u32 cast for combs_cnt, bfs_cnt
|
2022-03-13 14:41:34 +01:00 |
|
philsmd
|
5d9d3c7eb7
|
sanity check for u32 cast for combs_cnt, bfs_cnt
|
2022-03-13 14:25:09 +01:00 |
|
philsmd
|
cd33a8a200
|
fixes #2560: mention "main driver" within the error message
|
2022-03-09 18:15:54 +01:00 |
|
Will Crozier
|
edf7365cda
|
stdout mode: transfer indexes and compressed pw buffer from device in blocks, reducing overhead
|
2022-02-25 16:35:55 +00:00 |
|
Jens Steube
|
e9cb796528
|
Merge pull request #3186 from willcrozi/opencl-include-quoting
Improve handling of whitespace in kernel compiler include-path
|
2022-02-24 17:50:13 +01:00 |
|
Will Crozier
|
919ad5a1fd
|
warn/skip for AMD devices on Linux when OpenCL/HIP include-path has spaces
|
2022-02-20 20:14:57 +00:00 |
|
Giulio Garzia
|
c49238e9cd
|
OpenCL include-path workaround for Windows with AMD GPU
|
2022-02-20 17:55:16 +00:00 |
|
Gabriele Gristina
|
3156b487b7
|
Avoid quoting OpenCL include path when it's not required to
|
2022-02-20 17:55:16 +00:00 |
|
Gabriele Gristina
|
f890cd8134
|
Improved management of systems with multiple OpenCL platforms and show empty OpenCL platforms only in backend information mode
|
2022-02-19 21:17:56 +01:00 |
|
Gabriele Gristina
|
24de156ce8
|
Fixed backend active devices checks
|
2022-02-19 17:21:27 +01:00 |
|
Gabriele Gristina
|
fb12de4be6
|
fix style
|
2022-02-16 05:40:34 +01:00 |
|
Gabriele Gristina
|
7eb66e9936
|
Hardware Monitor: Add support for GPU device utilization readings using iokit on Apple Silicon (OpenCL and Metal)
|
2022-02-15 22:20:03 +01:00 |
|
Gabriele Gristina
|
3064c63c71
|
User Options: Change --backend-info/-I option type, from bool to int
|
2022-02-13 12:33:11 +01:00 |
|
Gabriele Gristina
|
b3d3b31c3e
|
Metal: added support for vectors up to 4
|
2022-02-10 21:53:08 +01:00 |
|
Jens Steube
|
be75e4b4ea
|
Rename STR() to M2S() to avoid future collisions and move from kernel source to command line parameter
|
2022-02-07 09:57:08 +01:00 |
|
Gabriele Gristina
|
7ac879f1e4
|
Fixed unused variable warning on Windows
|
2022-02-06 02:04:27 +01:00 |
|
Gabriele Gristina
|
829d49c8ba
|
resync src/backend.c
|
2022-02-06 01:17:19 +01:00 |
|
Gabriele Gristina
|
490702fcfa
|
Backends: added Metal host-code
|
2022-02-05 22:48:16 +01:00 |
|
Jens Steube
|
8293964097
|
Fix coding convention
|
2022-02-04 10:28:45 +01:00 |
|
Gabriele Gristina
|
cd363b32f6
|
Merge branch 'master' into metal_prepare_kernelIncludes_v2
|
2022-01-28 19:56:26 +01:00 |
|
Gabriele Gristina
|
01a28f80f7
|
Updated handling of POCL's known bugs
|
2022-01-22 22:40:18 +01:00 |
|
Gabriele Gristina
|
a1ced24564
|
Fixed bug in benchmark engine, added some unstable warnings, updated negative status code
|
2022-01-22 12:10:09 +01:00 |
|
Gabriele Gristina
|
2e4a136758
|
Refactored standard kernel includes in order to support Apple Metal runtime, updated backend, test units and status code
|
2022-01-18 22:52:14 +01:00 |
|
Gabriele Gristina
|
7650894e02
|
fixed bug in benchmark engine, updated negative status code
|
2022-01-11 04:58:30 +01:00 |
|
Jens Steube
|
dfd316c653
|
Merge pull request #3103 from matrix/backend_session_update_mp_rl
Removed hc_clSetKernelArg() call from backend_session_update_mp_rl()
|
2022-01-09 13:22:00 +01:00 |
|
Jens Steube
|
7a9a1b37d0
|
Merge pull request #3104 from matrix/backend_session_update_mp
Removed hc_clSetKernelArg() call from backend_session_update_mp()
|
2022-01-09 13:21:52 +01:00 |
|
Jens Steube
|
56ef2b4bde
|
Merge pull request #3102 from matrix/backend_cuda_restyle
CUDA Backend: moved functions to ext_cuda.c/ext_nvrtc.c and includes to ext_cuda.h/ext_nvrtc.h
|
2022-01-06 11:29:56 +01:00 |
|
Jens Steube
|
045ca5cb7a
|
Fixed how OPTS_TYPE_AUX* kernels are called in association mode, for instance WPA/WPA2 kernels
|
2022-01-05 10:46:00 +01:00 |
|
Jens Steube
|
668d2179cd
|
Kernels: Refactored standard kernel declaration to use a structure holding u32/u64 attributes to reduce the number of attributes
|
2022-01-04 18:07:18 +01:00 |
|
Gabriele Gristina
|
994083eaf5
|
Removed hc_clSetKernelArg() call from backend_session_update_mp()
|
2022-01-03 18:28:19 +01:00 |
|
Gabriele Gristina
|
0f0cf1fe08
|
Removed hc_clSetKernelArg() call from backend_session_update_mp_rl()
|
2022-01-03 17:16:27 +01:00 |
|
Gabriele Gristina
|
f8ceb8785e
|
CUDA Backend: moved functions to ext_cuda.c/ext_nvrtc.c and includes to ext_cuda.h/ext_nvrtc.h
|
2022-01-03 16:29:15 +01:00 |
|
Gabriele Gristina
|
78c7ee2af6
|
HIP Backend: moved functions to ext_hip.c/ext_hiprtc.c and includes to ext_hip.h/ext_hiprtc.h
|
2022-01-02 19:12:41 +01:00 |
|
Gabriele Gristina
|
26b6054cab
|
OpenCL Backend: moved functions to ext_OpenCL.c and includes to ext_OpenCL.h
|
2021-12-28 21:55:05 +01:00 |
|
Gabriele Gristina
|
861e644057
|
OpenCL Backend: added workaround to make optimized kernels work on Apple Silicon
|
2021-12-24 17:30:49 +01:00 |
|
Jens Steube
|
df6e5480ca
|
Print module_extra_tuningdb_block undefined compute device warning only on GPU
|
2021-12-23 19:23:35 +01:00 |
|
Gabriele Gristina
|
3fd6dac523
|
Set default device-type to GPU with Apple M1
|
2021-12-14 00:37:17 +01:00 |
|
Gabriele Gristina
|
0fae3a4394
|
Added support for Apple Silicon compute devices
|
2021-12-13 22:42:42 +01:00 |
|
Jens Steube
|
d4a54287b1
|
Add missing backslash for RUN_INSTRUCTION_CHECKS() on AMD
|
2021-12-09 14:03:32 +01:00 |
|
Jens Steube
|
3d53188cc3
|
Tuning Database: Added a warning if a module implements module_extra_tuningdb_block but the installed computing device is not found
|
2021-12-04 16:56:16 +01:00 |
|
Jens Steube
|
21f91c5bb8
|
Module Optimizers: Added OPTS_TYPE_MAXIMUM_THREADS to deactivate the else branch route in the section to find -T before compilation
Set the new flag based on some testing with RX6900XT
|
2021-11-13 17:27:06 +01:00 |
|
Gabriele Gristina
|
9be7bc71a5
|
OpenCL Backend: added workaround to support Apple Silicon
|
2021-11-11 00:57:54 +01:00 |
|
Jens Steube
|
53f6693495
|
Temporarily enable HIP 4.4/ROCm 4.5 on Linux and globally set native thread count
|
2021-11-10 19:32:54 +01:00 |
|
Jens Steube
|
f84aca82ca
|
Backend types: The default filter for the device types is now set so that only the GPU is used, except for APPLE, where we set CPU
|
2021-11-01 10:49:20 +01:00 |
|
Jens Steube
|
49a68cd6c1
|
AMD Driver: Updated requirements for AMD Linux drivers to ROCm 4.5 or later due to new HIP interface
|
2021-10-21 20:03:01 +02:00 |
|