Hi, I just got my rig up and running. I have at the moment only one gpu in it and i can’t get it to run stable. It runs fine for like almost an hour then i get gpu driver error message and t hen the rig reboots and runs very unstable until I plug the power off for some minutes.
I updated to the new version of hiveos and nvidia drivers. works better now, but still crashes. I haven’t overclocket the card yet.
The card is used but worked fine before i plugged it in the rig.
any advice?
error codes:
GPU Health Data:
01:00.0 Temp: 57C Fan: 65% Power: 75W
Latest GPU driver errors list:
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics Exception: ESR 0x50c648=0xc000d 0x50c650=0x4 0x50c644=0xd3eff2 0x50c64c=0x17f
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics SM Warp Exception on (GPC 1, TPC 1): Out Of Range Register
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics SM Global Exception on (GPC 1, TPC 1): Physical Multiple Warp Errors
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics Exception: ESR 0x50ce48=0x9000d 0x50ce50=0x4 0x50ce44=0xd3eff2 0x50ce4c=0x17f
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics SM Warp Exception on (GPC 1, TPC 2): Out Of Range Register
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics SM Global Exception on (GPC 1, TPC 2): Physical Multiple Warp Errors
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics Exception: ESR 0x50d648=0xd 0x50d650=0x4 0x50d644=0xd3eff2 0x50d64c=0x17f
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics SM Warp Exception on (GPC 1, TPC 3): Out Of Range Register
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics SM Global Exception on (GPC 1, TPC 3): Physical Multiple Warp Errors
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics Exception: ESR 0x50de48=0xa000d 0x50de50=0x4 0x50de44=0xd3eff2 0x50de4c=0x17f
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics SM Warp Exception on (GPC 1, TPC 4): Out Of Range Register
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics SM Global Exception on (GPC 1, TPC 4): Physical Multiple Warp Errors
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics Exception: ESR 0x50e648=0x5000d 0x50e650=0x4 0x50e644=0xd3eff2 0x50e64c=0x17f
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics SM Warp Exception on (GPC 0, TPC 0): Out Of Range Register
Jun 14 17:51:40 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 13, pid=1050, Graphics Exception: ESR 0x504648=0x1000d 0x504650=0x0 0x504644=0xd3eff2 0x50464c=0x17f
and after reboot:
Jun 14 18:33:36 hppywrld-bud1 kernel: NVRM: Xid (PCI:0000:01:00): 61, pid=1020, 0a97(2a90) 00000000 00000000