I am having this issue on 3 rigs now. The 8th GPU fails with an NVIDIA GPU driver error and then won’t get recognized again unless I rebuild the OS. Something is wrong with either the NVIDIA drivers or HiveOS.
Bad OCs on all cards. Don't use core offsets or power limits on modern cards.
To tune the OC per card, start at 0 across the board and find the highest stable mem clock.
Then find the lowest locked core clock that maintains full hashrate. That way the card only draws as much power as it needs to run at that core clock.
Your error says GPU 6 is causing the crash, but you should fix all the OCs and go from there. After doing the above steps, reduce the OC on any cards that still crash.
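The two tuning steps above can be sketched roughly as a pair of searches. This is only an illustration of the procedure, not a tool that talks to the rig: `is_stable` and `hashrate_at` are hypothetical callbacks that stand in for "apply the setting, let the miner run, and watch it", and the step sizes, limit, and 1% tolerance are assumed values, not recommendations.

```python
from typing import Callable


def highest_stable_mem(is_stable: Callable[[int], bool],
                       start: int = 0, step: int = 100, limit: int = 3000) -> int:
    """Raise the mem clock setting from `start` in `step` increments.

    Returns the last value that `is_stable` (hypothetical check) accepted.
    """
    best = start
    mem = start + step
    while mem <= limit and is_stable(mem):
        best = mem
        mem += step
    return best


def lowest_full_hashrate_core(hashrate_at: Callable[[int], float],
                              start: int, floor: int,
                              step: int = 15, tolerance: float = 0.01) -> int:
    """Lower the locked core clock from `start` in `step` decrements.

    Returns the lowest clock whose hashrate stays within `tolerance`
    of the hashrate measured at `start`.
    """
    baseline = hashrate_at(start)
    best = start
    core = start - step
    while core >= floor and hashrate_at(core) >= baseline * (1 - tolerance):
        best = core
        core -= step
    return best
```

In practice each callback call means changing the OC on one card and waiting long enough to see whether it crashes or loses hashrate, so do this one card at a time.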
You're on an older beta image. It wouldn't hurt to install the latest stable image, but that won't change much. Your drivers are fine, but you're still using core offsets (-500) and power limits. Your core clock should be around 1060 MHz on the 3070s and 1080 MHz on the 3080s.
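For reference, a locked core clock can also be set at the driver level with `nvidia-smi --lock-gpu-clocks`, outside of HiveOS's own overclocking settings. The sketch below only builds the command as a list of arguments and does not run it. The helper name `lock_core_cmd` and the `STARTING_CORE_MHZ` table are made up for illustration, and the values are just the starting points mentioned above, so treat them as assumptions to tune per card.

```python
from typing import Dict, List

# Starting points from the reply above (assumed, tune per card).
STARTING_CORE_MHZ: Dict[str, int] = {"3070": 1060, "3080": 1080}


def lock_core_cmd(gpu_index: int, core_mhz: int) -> List[str]:
    """Build an nvidia-smi command that pins min and max core clock
    of one GPU to the same value (i.e. a locked core clock)."""
    return [
        "nvidia-smi",
        "-i", str(gpu_index),
        f"--lock-gpu-clocks={core_mhz},{core_mhz}",
    ]


# Example: command for GPU 0, assuming it is a 3070.
cmd = lock_core_cmd(0, STARTING_CORE_MHZ["3070"])
print(" ".join(cmd))
```

Running the resulting command normally needs root, and `nvidia-smi --reset-gpu-clocks` undoes it.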