Post by Solar DesignerWhat speeds are you getting on well's CPU for Argon2d at these settings?
With memory (de)allocation out of the loop, like we had for the Lyra2
and yescrypt benchmarks.
Also, please set m=1536, so we'd have exactly 1.5 MiB.
I made some tests although there is still a bug on super's AMD.
--dev=5 on super is working now. also I checked most of better speeds
in cracking run and were ok. I was setting gws every time due to this
bug on AMD and a bug with MEM_SIZE on nvidia
argon2i
CPU on well - 2480
GeForce GTX 960M - 1861
AMD Tahiti - 1288
GeForce GTX TITAN - 2805
memory: 1.5 MB
argon2d
CPU on well - 7808
GeForce GTX 960M - 4227
AMD Tahiti - 2742
GeForce GTX TITAN - 6083
memory: 1.5 MB
___________
well
***@well:~/f/run$ ./john --test --format=argon2d
Will run 8 OpenMP threads
Benchmarking: argon2d [Blake2 AVX]... (8xOMP)
memory per hash : 1.50 MB
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 7760 c/s real, 971 c/s virtual
Only one salt: 7808 c/s real, 976 c/s virtual
***@well:~/f/run$ ./john --test --format=argon2d --cost=1:1,1536:1536,5:5
Will run 8 OpenMP threads
Benchmarking: argon2d [Blake2 AVX]... (8xOMP)
memory per hash : 1.50 MB
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 1536, cost 3 (l) of 5
Many salts: 7288 c/s real, 911 c/s virtual
Only one salt: 7224 c/s real, 904 c/s virtual
***@well:~/f/run$ ./john --test --format=argon2i
Will run 8 OpenMP threads
Benchmarking: argon2i [Blake2 AVX]... (8xOMP)
memory per hash : 1.50 MB
using different password for benchmarking
DONE
Speed for cost 1 (t) of 3, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 2488 c/s real, 311 c/s virtual
Only one salt: 2480 c/s real, 310 c/s virtual
***@well:~/f/run$ ./john --test --format=argon2i --cost=3:3,1536:1536,5:5
Will run 8 OpenMP threads
Benchmarking: argon2i [Blake2 AVX]... (8xOMP)
memory per hash : 1.50 MB
using different password for benchmarking
DONE
Speed for cost 1 (t) of 3, cost 2 (m) of 1536, cost 3 (l) of 5
Many salts: 2392 c/s real, 298 c/s virtual
Only one salt: 2384 c/s real, 298 c/s virtual
________________________
Titan X
[***@super run]$ GWS=1024 ./john --test --format=argon2d-opencl --v=4 --dev=5
Benchmarking: argon2d-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__
-DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21
-D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
-DPLAINTEXT_LENGTH=125
Local worksize (LWS) 64, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 5206 c/s real, 5206 c/s virtual
Only one salt: 5206 c/s real, 5206 c/s virtual
[***@super run]$ LWS=32 GWS=1024 ./john --test --format=argon2d-opencl
--v=4 --dev=5
Benchmarking: argon2d-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__
-DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21
-D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
-DPLAINTEXT_LENGTH=125
Local worksize (LWS) 32, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 6083 c/s real, 6083 c/s virtual
Only one salt: 6083 c/s real, 6083 c/s virtual
[***@super run]$ GWS=2048 ./john --test --format=argon2i-opencl --v=4 --dev=5
Benchmarking: argon2i-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__
-DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21
-D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
-DPLAINTEXT_LENGTH=125
Local worksize (LWS) 64, global worksize (GWS) 2048
using different password for benchmarking
DONE
Speed for cost 1 (t) of 3, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 2805 c/s real, 2805 c/s virtual
Only one salt: 2805 c/s real, 2824 c/s virtual
[***@super run]$ LWS=32 GWS=1024 ./john --test --format=argon2d-opencl
--v=4 --dev=5 --cost=1:1,1536:1536,5:5
Benchmarking: argon2d-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__
-DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21
-D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
-DPLAINTEXT_LENGTH=125
Local worksize (LWS) 32, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 1536, cost 3 (l) of 5
Many salts: 5688 c/s real, 5688 c/s virtual
Only one salt: 5636 c/s real, 5585 c/s virtual
[***@super run]$ GWS=2048 ./john --test --format=argon2i-opencl --v=4
--dev=5 --cost=3:3,1536:1536,5:5
Benchmarking: argon2i-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__
-DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21
-D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
-DPLAINTEXT_LENGTH=125
Local worksize (LWS) 64, global worksize (GWS) 2048
using different password for benchmarking
DONE
Speed for cost 1 (t) of 3, cost 2 (m) of 1536, cost 3 (l) of 5
Many salts: 2694 c/s real, 2730 c/s virtual
Only one salt: 2712 c/s real, 2712 c/s virtual
_____________________
Tahiti
[***@super run]$ GWS=1024 ./john --test --format=argon2i-opencl --v=4
Benchmarking: argon2i-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Local worksize (LWS) 64, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 3, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 1288 c/s real, 204800 c/s virtual
Only one salt: 1288 c/s real, 204800 c/s virtual
[***@super run]$ LWS=32 GWS=1024 ./john --test --format=argon2d-opencl --v=4
Benchmarking: argon2d-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Local worksize (LWS) 32, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 2648 c/s real, 307200 c/s virtual
Only one salt: 2742 c/s real, 307200 c/s virtual
[***@super run]$ LWS=32 GWS=1024 ./john --test --format=argon2d-opencl
--v=4 --cost=1:1,1536:1536,5:5
Benchmarking: argon2d-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Local worksize (LWS) 32, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 1536, cost 3 (l) of 5
Many salts: 2671 c/s real, 307200 c/s virtual
Only one salt: 2671 c/s real, 307200 c/s virtual
[***@super run]$ GWS=1024 ./john --test --format=argon2i-opencl --v=4
--cost=3:3,1536:1536,5:5
Benchmarking: argon2i-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Local worksize (LWS) 64, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 3, cost 2 (m) of 1536, cost 3 (l) of 5
Many salts: 1211 c/s real, 204800 c/s virtual
Only one salt: 1211 c/s real, 204800 c/s virtual
__________________________
980m
***@none ~/Desktop/r/run $ GWS=1024 ./john --test --format=argon2i-opencl
Benchmarking: argon2i-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 0: GeForce GTX 960M
using different password for benchmarking
DONE
Speed for cost 1 (t) of 3, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 1878 c/s real, 1861 c/s virtual
Only one salt: 1861 c/s real, 1861 c/s virtual
***@none ~/Desktop/r/run $ GWS=512 ./john --test --format=argon2d-opencl
Benchmarking: argon2d-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 0: GeForce GTX 960M
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 3976 c/s real, 3938 c/s virtual
Only one salt: 3976 c/s real, 4015 c/s virtual
***@none ~/Desktop/r/run $ LWS=32 GWS=512 ./john --test --format=argon2d-opencl
Benchmarking: argon2d-opencl [Blake2 OpenCL]...
memory per hash : 1.50 MB
Device 0: GeForce GTX 960M
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 1536, cost 3 (l) of 1
Many salts: 4266 c/s real, 4227 c/s virtual
Only one salt: 4227 c/s real, 4266 c/s virtual