Re: 99.99% and stuck
Posted: Tue Apr 28, 2020 5:14 pm
by Nuitari
You should plan for 2 GB per GPU slot, plus whatever else is running, as some projects will use that much.
The upload is essentially the client telling the server that the work got dumped.
Hard to say what is going on from the information given. You could install atop, which keeps a snapshot of system usage every 10 minutes, so you can look back and spot any weird situation.
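The 2 GB per GPU slot rule of thumb above can be written out as simple arithmetic. This is only a rough sketch: it assumes the figure refers to system memory, and the function name `ram_budget_gb` and the 4 GB figure for "whatever else is running" are made up for illustration.

```python
def ram_budget_gb(gpu_slots, other_usage_gb=0.0, per_slot_gb=2.0):
    """Rough memory to plan for: per_slot_gb for each GPU slot plus everything else.

    per_slot_gb defaults to the 2 GB rule of thumb from the post;
    other_usage_gb is whatever the OS and other programs need (an assumption
    you have to supply yourself).
    """
    if gpu_slots < 0 or other_usage_gb < 0 or per_slot_gb < 0:
        raise ValueError("inputs must be non-negative")
    return gpu_slots * per_slot_gb + other_usage_gb


# Example: 4 GPU slots plus an assumed 4 GB for the OS and other programs.
print(ram_budget_gb(4, other_usage_gb=4.0))  # 12.0
```

If the result is close to or above the installed memory, that would be one thing to rule out before looking at drivers or clocks.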
Re: 99.99% and stuck
Posted: Thu Apr 30, 2020 4:06 pm
by CoronaCrusher
I'm finding that, for me, it is always one particular type of work unit that stalls and gets stuck at 99.99%. It's from Project 14436 (578, 4, 11) with a base credit of 53000, although I can't verify that the numbers in parentheses are the same each time. I've gotten at least 10 WUs like that that have stalled. Sometimes I can restart the computer and get it to revert to an old checkpoint, often around 35-45%, but half the time I have to just delete the WU to force my GPU to move on to a new one; otherwise it will sit at 99.99% for days. That type of WU doesn't always fail to complete on my system, but when I do get one that's stuck, it's ALWAYS that WU. I'm running an i5-8600K @ 4.7GHz and dual liquid-cooled GTX 1080s in SLI running at 2050MHz, but I've run into this issue at base clock as well, with different drivers, so there has to be something different about that particular WU that causes it to fail fairly often.
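For anyone who would rather not find out days later that a WU has been sitting at 99.99%, the check can be sketched roughly as below. This is a hypothetical helper, not part of the client: how you collect the (time, percent) samples is left open, and the thresholds (99% and one hour) are arbitrary assumptions.

```python
from datetime import datetime, timedelta


def is_stalled(samples, min_progress=99.0, max_idle=timedelta(hours=1)):
    """Return True if progress looks stuck near the end of a work unit.

    samples: list of (datetime, percent) tuples in chronological order.
    The WU counts as stalled when the latest percent is >= min_progress
    and that same value has been reported for at least max_idle.
    """
    if not samples:
        return False
    last_time, last_pct = samples[-1]
    if last_pct < min_progress:
        return False
    # Walk backwards to find when the current value was first seen.
    first_seen = last_time
    for t, pct in reversed(samples):
        if pct != last_pct:
            break
        first_seen = t
    return last_time - first_seen >= max_idle


# Example with made-up samples: 99.99% reported unchanged for two hours.
t0 = datetime(2020, 4, 30, 12, 0)
samples = [
    (t0, 98.0),
    (t0 + timedelta(minutes=30), 99.99),
    (t0 + timedelta(hours=2, minutes=30), 99.99),
]
print(is_stalled(samples))  # True
```

A WU that is still moving, or one that only just reached 99.99%, is not flagged.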
Re: 99.99% and stuck
Posted: Thu Apr 30, 2020 5:52 pm
by qe4
A little update... I tried underclocking all GPUs but that did not really help. I then disabled my Hawaii GPU and one 8GB RX480 that I suspected might be causing issues. Since then it has been running fine. There is still one failed WU from the Hawaii GPU that the client is attempting (but failing) to upload. But I am pretty happy that I got 2 of my RX480s running stable.
Re: 99.99% and stuck
Posted: Thu Apr 30, 2020 6:36 pm
by Joe_H
@CoronaCrusher - post the log of one of these WUs that reaches 99.99%, along with the beginning section of the log that shows the system info and client configuration.
The Welcome topic - viewtopic.php?f=66&t=26036 - has directions on finding and posting your log file.
However, most likely Project 14436 WUs push your GPU harder than those from other projects, and any overclock, factory or otherwise, needs reducing.
Re: 99.99% and stuck
Posted: Sat May 02, 2020 6:28 am
by 1TM
qe4 wrote: issue: They get stuck at 99.99% and don't upload.
- How do I cancel/remove a WU that is stuck?
I also had a unit stuck at 99.99%. What seems to have helped was:
1. press Finish
2. wait for the other WUs (if running several) to reach the nearest save point, such as 10%, 20%, ... 90%
3. shut down the PC and switch off the power supply if it has a switch
4. restart - FAHControl was able to find the checkpoint files and resume all WU runs (the one stuck at 99.99% resumed at 30%)