NVIDIA RTX A4000 + RTX A2000, I have “Force P0 state” and “Reduced idle power compsumpion” checked, OC had been stable since 3 months or so. I saw temperatures skyrocketing when it started mining (NBminer) , like +20 C, so I tried to downgrade. No luck, all GPUS were dead so I was forced to re-image the rig with latest stable. Not sure if it’s a compatibility issue with 515.x drivers and new Nvtool because I did not reinstall the new build yet, they are way faster than any 510.x so I have been using them since they were realized. Memory temps are not available but it’s a small issue, I get 7.3 shares per minute compared to 6.8 of any 510.x build (same hardware) so I’ll install the new build again with the fresh image very soon
yeah I know, I’m actually running 510.68.02 with no issue after re-imaging, when i downgraded I had 11 GPUs maked with a malfunction problem. If 515.x were the cause of the malfunction I’ll reproduce the issue, I will try again and report
I managed to reproduce the bug: 515.x are not compatible with nvtool 1.6.6. It’s quite dangerous for hardware because it messes up with OC and core temperatures suddenly rise. Probably 515.x branch contains major changes not jet supported.
No problems with previous nvtool version, working good with 515.x so bug was introduced in the latest 1.6.6