I’ve been having a strange problem while mining Zcash using 4 - 1080ti GPU.
After a random amount of time, EWBF 0.3.4b miner gets stuck. It only happens when mining with 4 cards. It will mine just fine using only 2 GPU’s if they are plugged into x16 slots on the mb.
EWBF doesn’t get hung and quit or freeze the system. It keeps trying to restart and issues the following errors:
CUDA DEVICE 2 Thread exited with code 4
CUDA DEVICE 0 Thread exited with code 4
CUDA DEVICE 1 Thread exited with code 4
The power report which shows all cards using less then 70 watts each. and about 0 sol/s
Which is then followed with this list for each GPU installed:
Error: Looks like GPU2 are stopped. Restart attempt
Info: GPU2 are restarted.
CUDA DEVICE: 2 User selected resolver 0
CUDA DEVICE 2 Thread exited with code 46
From here, it doesn’t mine until the machine is entirely powered off then restarted. (Or the miner is closed and restarted after the cards attached to the risers are disabled in control panel.) So I guess you can say that the problem only effects the GPU’s on the risers.
My hardware is:
Mb is MSI z170A-M7 with 8gb RAM and Celeron G3930 KabyLake processor.
Power supply for mb and two on board GPU’s is Cobra Power 700watt Gold 80plus
Power Supply for the two PCIe-1x to 16x riser connected GPU’s is an HP 1200 Watt
Found here at NewEgg
The risers are PCIE164P-NO3 Ver 006.
Steps I’ve taken to resolve the issue:
Followed instructions for Z170 motherboard listed here in this thread…https://bitcointalk.org/index.php?topic=1873651.0
Replaced all power supplies with brand new ones listed above.
Completely reload Windows in UEFI mode on GPT.
Tried all of the drivers on NVidia site that I could download. Currently using 384.94
Tried changing the slot location that the boards or connected to. As well as doing a clean install for each card one at a time.
Replacing the riser cards. Unless I have 2 bad risers, the problem is not isolated to one riser or one card.
I’m considering purchasing another MB and risers but I sure would like to narrow this problem down before I throw a bunch of money at TRYING to find whatever is causing the issue accidentally…
If anyone out there has any ideas, I am all ears. My next step is to start looking in event viewer to find reoccurring errors that might help me.
Thanks for reading!