HiveOS Crash

Hi,

HiveOS is crashing pretty often now. At least 2-3x a day. The miner keeps running after the crash, so I’m not sure what is going on. I downgraded to the Jan version of HiveOS to test that out and it still occurs. I know the miner is still running because occasionally I have the miner interface up and can see it running, but Telegram has notified me that the rig is offline. I also have tested leaving it up and verifying with the mining pool that the rig is in fact reporting. When I go to the rig and use the monitor I am unable to run any commands on the interface. Any ideas?

I am running HiveOS from a large external USB drive (500GB).

I have the log file on. I’ll wait for the next crash and post the report.

Log file.

Feb 7 13:50:01 First-Rig CRON[18485]: (root) CMD (agent-screen dontattach || echo “[date] STARTED BY CRON” >> /var/log/hive-agent.log)
Feb 7 13:50:31 First-Rig hive-watchdog[1119]: OK phoenixminer 103155 kHs >= 0.08 kHs
Feb 7 13:50:41 First-Rig hive-watchdog[1119]: OK LA(5m): 0.44 < 29.0, LA(1m): 0.35 < 58.0
Feb 7 13:50:41 First-Rig hive-watchdog[1119]: OK POWER: 289 W (250…9999 W)
Feb 7 13:51:31 First-Rig hive-watchdog[1119]: OK phoenixminer 103161 kHs >= 0.08 kHs
Feb 7 13:51:41 First-Rig hive-watchdog[1119]: OK LA(5m): 0.44 < 29.0, LA(1m): 0.35 < 58.0
Feb 7 13:51:41 First-Rig hive-watchdog[1119]: OK POWER: 292 W (250…9999 W)
Feb 7 13:51:47 First-Rig systemd-timesyncd[497]: Network configuration changed, trying to establish connection.
Feb 7 13:51:47 First-Rig systemd-networkd[711]: eth0: Configured
Feb 7 13:51:47 First-Rig systemd-timesyncd[497]: Synchronized to time server 192.168.4.1:123 (192.168.4.1).
Feb 7 13:51:47 First-Rig systemd[1]: Starting resolvconf-pull-resolved.service…
Feb 7 13:51:47 First-Rig systemd[1]: Started resolvconf-pull-resolved.service.
Feb 7 13:52:31 First-Rig hive-watchdog[1119]: OK phoenixminer 103146 kHs >= 0.08 kHs
Feb 7 13:52:41 First-Rig hive-watchdog[1119]: OK LA(5m): 0.48 < 29.0, LA(1m): 0.51 < 58.0
Feb 7 13:52:41 First-Rig hive-watchdog[1119]: OK POWER: 288 W (250…9999 W)
Feb 7 13:53:31 First-Rig hive-watchdog[1119]: OK phoenixminer 103152 kHs >= 0.08 kHs
Feb 7 13:53:41 First-Rig hive-watchdog[1119]: OK LA(5m): 0.57 < 29.0, LA(1m): 0.80 < 58.0
Feb 7 13:53:41 First-Rig hive-watchdog[1119]: OK POWER: 288 W (250…9999 W)
Feb 7 13:54:31 First-Rig hive-watchdog[1119]: OK phoenixminer 103145 kHs >= 0.08 kHs
Feb 7 13:54:41 First-Rig hive-watchdog[1119]: OK LA(5m): 0.57 < 29.0, LA(1m): 0.61 < 58.0
Feb 7 13:54:41 First-Rig hive-watchdog[1119]: OK POWER: 288 W (250…9999 W)
Feb 7 13:55:01 First-Rig CRON[2106]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1)
Feb 7 13:55:31 First-Rig hive-watchdog[1119]: OK phoenixminer 103146 kHs >= 0.08 kHs
Feb 7 13:55:41 First-Rig hive-watchdog[1119]: OK LA(5m): 0.60 < 29.0, LA(1m): 0.72 < 58.0
Feb 7 13:55:41 First-Rig hive-watchdog[1119]: OK POWER: 289 W (250…9999 W)
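For anyone reading a log like the one above after a crash: the watchdog writes a line roughly every minute, so a long silence between two entries is a hint about when the system stopped logging. Below is a minimal sketch of that idea, not an official HiveOS tool. The function names, the 3-minute threshold, and the placeholder year (syslog lines carry no year) are all assumptions made for illustration.

```python
from datetime import datetime, timedelta


def parse_timestamp(line, year=2000):
    """Parse a leading 'Feb 7 13:50:01' style timestamp; return None if absent.

    The year is a placeholder (assumption), because these log lines omit it.
    """
    parts = line.split()
    if len(parts) < 3:
        return None
    try:
        return datetime.strptime(
            f"{year} {parts[0]} {parts[1]} {parts[2]}", "%Y %b %d %H:%M:%S"
        )
    except ValueError:
        return None


def find_gaps(lines, max_gap=timedelta(minutes=3), year=2000):
    """Return (last_seen, next_seen) pairs where logging paused longer than max_gap."""
    gaps = []
    previous = None
    for line in lines:
        ts = parse_timestamp(line, year)
        if ts is None:
            continue  # skip lines without a recognizable timestamp
        if previous is not None and ts - previous > max_gap:
            gaps.append((previous, ts))
        previous = ts
    return gaps


if __name__ == "__main__":
    # Three lines copied from the excerpt above.
    sample = [
        "Feb 7 13:55:31 First-Rig hive-watchdog[1119]: OK phoenixminer 103146 kHs >= 0.08 kHs",
        "Feb 7 13:55:41 First-Rig hive-watchdog[1119]: OK LA(5m): 0.60 < 29.0, LA(1m): 0.72 < 58.0",
        "Feb 7 13:55:41 First-Rig hive-watchdog[1119]: OK POWER: 289 W (250…9999 W)",
    ]
    for last_seen, next_seen in find_gaps(sample):
        print(f"No log entries between {last_seen} and {next_seen}")
```

To use it on a full log, pass the lines of the file to `find_gaps`. The last timestamp before a reported gap is where to start reading backwards for clues. The sketch does not handle logs that span a year boundary.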

I also have 2 miners on the latest Hive OS, but I’m not on site,
so I can’t work out why this happens.
I just have someone who goes there and resets both miners.
Thanks, all, for any ideas.
Denis

So I switched from my 500GB external USB drive to a 16GB USB drive. No crashes so far. I’ve been up for 1 day 3 hours continuous. Hope that helps you.
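Since swapping the drive made the crashes stop here, one thing that may be worth checking on an affected rig is whether the system log contains storage-related kernel messages around the time of a crash. The following is a rough sketch under stated assumptions: the patterns are common examples of Linux kernel messages, not an exhaustive or HiveOS-specific list, and the function name is made up for illustration. An empty result does not rule out a drive problem.

```python
import re
import sys

# Illustrative patterns only (assumption): typical Linux kernel messages that
# can accompany a failing or disconnecting drive. Extend as needed.
STORAGE_PATTERNS = [
    re.compile(r"I/O error", re.IGNORECASE),
    re.compile(r"EXT4-fs error", re.IGNORECASE),
    re.compile(r"Remounting filesystem read-only", re.IGNORECASE),
    re.compile(r"USB disconnect", re.IGNORECASE),
]


def find_storage_errors(lines):
    """Return the log lines that match any of the storage-related patterns."""
    hits = []
    for line in lines:
        if any(pattern.search(line) for pattern in STORAGE_PATTERNS):
            hits.append(line.rstrip("\n"))
    return hits


if __name__ == "__main__":
    # Usage: python3 this_script.py <path-to-log-file>
    if len(sys.argv) > 1:
        with open(sys.argv[1], errors="replace") as log_file:
            for hit in find_storage_errors(log_file):
                print(hit)
```

If matches cluster just before the rig goes offline, that points toward the drive, cable, or USB port rather than the GPUs.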

I only use SSD drives, so I don’t think that is the problem.
Now I’m waiting for someone to go there and reset the miners, and then I’ll take a look at the log (which I have now activated).

Thanks a lot
Denis

How did you resolve the problem? I’m having the same issue and cannot figure it out.

I switched to a thumbdrive.

I really can’t believe that this is the solution :D I already changed the RAM etc. Will try a thumb drive tomorrow, thanks!

Hi, I have the same issue. How did you solve it?

I have now switched to an external drive. It crashes WAY LESS but still does! Also, my temps are about 5° LOWER on every card. Makes no sense at all.

scroll up.

Sounds like you need to find a good baseline now. If your cards are too hot, they will definitely crash. If the cards or risers are not powered correctly, the rig will crash. If the risers are not powered correctly, you could burn your house down.

Nah, you do not seem to understand. My setup is perfect. My cards were running at 50°C with the SSD. Now I have switched to the external drive as you suggested, and they run at 45°C. Like I said, it makes no sense at all.


OK, so I use an NVMe M.2 SSD and I don’t think that using a USB key will solve my problem. But as followgeo said, I have two risers powered by SATA cable. That could be the issue.

I had the same issue after moving from my thumb drive to an SSD. Ended up switching back to the thumb drive.
