Page 1 of 1
Project: 3043 (Run 3, Clone 57, Gen 20)
Posted: Thu Mar 13, 2008 11:43 pm
by AlanH
This WU has run to 96% three times on my 4-core Mac Pro and then aborted. Same place, and same error message each time:
Warning long 1-4 interactions
Core status = 0 (0)
Client-core communications error: ERROR 0x0
After the latest abort, it downloaded a new core and then started on the same WU again
Any bets on the probability of success this time?
Each run has taken 24 hours, so we are down 3 days worth of folding, and counting. At my Mac's normal work rate, that's over 5,000 points up in smoke
Re: P3043 (r3 c57 g20)
Posted: Thu Mar 13, 2008 11:58 pm
by él Mero
There have been many reports regarding this issue. One of the suggestions has been to try and let it run to right before 95% and then stop the client, wait until all copies of FahCore_a1 have stopped and then restart. This may allow the WU to pass the error at 96%.
Re: Project: 3043 (Run 3, Clone 57, Gen 20)
Posted: Fri Mar 14, 2008 8:48 pm
by AlanH
Well, there's only one way I know on my system to stop the FaH process without aborting it. That's to reboot. Using the Preferences panel to stop it results in an abort, and trying to kill the processes seems to get it into a total mess. So I'll try rebooting in a little while, when it gets to 95%. I have little to lose, and it might give useful data ... or not!
Re: Project: 3043 (Run 3, Clone 57, Gen 20)
Posted: Fri Mar 14, 2008 10:28 pm
by él Mero
You could backup the data before you shut down the client (in those few necessary situations). If it's like you say that it aborts when shutting down. Trying to kill the processes manually is not a nice way to go about, it will most definitely result in an error.
Anxiously waiting for the result of the suggestion...
Re: Project: 3043 (Run 3, Clone 57, Gen 20)
Posted: Sat Mar 15, 2008 3:12 am
by AlanH
Well, it completed! I rebooted my Mac when the log showed 95% complete. It rebooted and picked up from the checkpoint, ran through the remaining 5% and uploaded the WU to the servers.
Now we have to ask ourselves - was it the reboot that cured the problem? Or would it have finished anyway, either because it reloaded the core, or just because it was the fourth attempt?
I'd still far rather be folding 1760 point P2605 units every 20 hours, though
Even when they finish first time, these other SMP units take longer and are worth fewer points.
Re: Project: 3043 (Run 3, Clone 57, Gen 20)
Posted: Sat Mar 15, 2008 3:40 am
by él Mero
Great stuff! Good luck getting those fine p2605s.
Well, third times the charm right? Therefore the reboot cured the problem