Project: 3043 (Run 3, Clone 57, Gen 20)

Moderators: Site Moderators, FAHC Science Team

AlanH
Posts: 57
Joined: Mon Dec 03, 2007 9:54 pm

Project: 3043 (Run 3, Clone 57, Gen 20)

Post by AlanH »

This WU has run to 96% three times on my 4-core Mac Pro and then aborted. Same place, and same error message each time:

Warning long 1-4 interactions
Core status = 0 (0)
Client-core communications error: ERROR 0x0

After the latest abort, it downloaded a new core and then started on the same WU again :cry: Any bets on the probability of success this time?

Each run has taken 24 hours, so we are down 3 days' worth of folding, and counting. At my Mac's normal work rate, that's over 5,000 points up in smoke :(
Folding for TeamCFC
- Mac Pro Dual 2.66GHz Xeon, 4 GBytes running Mac SMP2 client
él Mero
Posts: 49
Joined: Sun Dec 02, 2007 1:14 pm

Re: P3043 (r3 c57 g20)

Post by él Mero »

There have been many reports of this issue. One suggestion is to let the WU run to just before 95%, then stop the client, wait until all copies of FahCore_a1 have exited, and restart. This may allow the WU to get past the error at 96%.
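
For anyone who wants to script the "wait until all copies of FahCore_a1 have stopped" step, here is a minimal sketch rather than anything official. It assumes Python 3 is available and that the cores show up in `ps -axo comm` output under a name starting with FahCore_a1; the function names, poll interval and timeout are made up for illustration.

```python
import os
import subprocess
import time

CORE_NAME = "FahCore_a1"  # core name as given in the post above


def count_cores(ps_output, core_name=CORE_NAME):
    """Count lines of `ps -axo comm` output whose executable name starts with core_name."""
    count = 0
    for line in ps_output.splitlines():
        command = line.strip()
        # comm may be a full path, so compare only the last path component
        if command and os.path.basename(command).startswith(core_name):
            count += 1
    return count


def wait_for_cores_to_exit(poll_seconds=5.0, timeout_seconds=300.0):
    """Poll ps until no core is listed. Returns True if they all exited in time."""
    deadline = time.monotonic() + timeout_seconds
    while time.monotonic() < deadline:
        result = subprocess.run(
            ["ps", "-axo", "comm"], capture_output=True, text=True, check=True
        )
        if count_cores(result.stdout) == 0:
            return True
        time.sleep(poll_seconds)
    return False
```

Call `wait_for_cores_to_exit()` after stopping the client, and only restart once it returns True. It does not stop the client itself, since how to do that cleanly depends on the setup.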
AlanH
Posts: 57
Joined: Mon Dec 03, 2007 9:54 pm

Re: Project: 3043 (Run 3, Clone 57, Gen 20)

Post by AlanH »

Well, there's only one way I know on my system to stop the FaH process without aborting it. That's to reboot. Using the Preferences panel to stop it results in an abort, and trying to kill the processes seems to get it into a total mess. So I'll try rebooting in a little while, when it gets to 95%. I have little to lose, and it might give useful data ... or not!
él Mero
Posts: 49
Joined: Sun Dec 02, 2007 1:14 pm

Re: Project: 3043 (Run 3, Clone 57, Gen 20)

Post by él Mero »

If the client really does abort when you shut it down, as you say, you could back up the data first in those few cases where a shutdown is necessary. Killing the processes manually is not a good way to go about it; it will most definitely result in an error.

Anxiously waiting for the result of the suggestion...
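
The backup mentioned above could be as simple as copying the whole client folder somewhere safe before the shutdown. This is only a sketch: the function name and both paths are hypothetical, and it assumes the client keeps its checkpoint and work files inside a single folder.

```python
import shutil
import time
from pathlib import Path


def backup_client_dir(client_dir, backup_root):
    """Copy the whole client folder into a new timestamped folder under backup_root."""
    client_dir = Path(client_dir)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    dest = Path(backup_root) / f"{client_dir.name}-{stamp}"
    # copytree creates dest (and missing parents) and fails if dest already exists,
    # so an earlier backup is never overwritten
    shutil.copytree(client_dir, dest)
    return dest
```

For example, `backup_client_dir("/path/to/fah", "/path/to/backups")` with your own paths filled in. If the shutdown does wreck the WU, the copy can be put back before restarting.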
AlanH
Posts: 57
Joined: Mon Dec 03, 2007 9:54 pm

Re: Project: 3043 (Run 3, Clone 57, Gen 20)

Post by AlanH »

Well, it completed! I rebooted my Mac when the log showed 95% complete. It rebooted and picked up from the checkpoint, ran through the remaining 5% and uploaded the WU to the servers.

Now we have to ask ourselves - was it the reboot that cured the problem? Or would it have finished anyway, either because it reloaded the core, or just because it was the fourth attempt?

I'd still far rather be folding 1760 point P2605 units every 20 hours, though :) Even when they finish first time, these other SMP units take longer and are worth fewer points.
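
As a rough check on the "over 5,000 points" figure from my first post: this takes the P2605 rate quoted above (1760 points every 20 hours) as the normal work rate, which is an assumption, and the three failed 24-hour runs as the time lost.

```python
def points_lost(hours_lost, points_per_unit, hours_per_unit):
    """Points that would have been earned in hours_lost at the given rate."""
    return hours_lost * points_per_unit / hours_per_unit


# three aborted runs of 24 hours each, at 1760 points per 20-hour unit
lost = points_lost(hours_lost=3 * 24, points_per_unit=1760, hours_per_unit=20)
print(lost)  # 6336.0
```

So at that rate the three days come to a bit over 6,300 points.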
él Mero
Posts: 49
Joined: Sun Dec 02, 2007 1:14 pm

Re: Project: 3043 (Run 3, Clone 57, Gen 20)

Post by él Mero »

Great stuff! Good luck getting those fine p2605s.

Well, third time's the charm, right? Therefore the reboot cured the problem :wink: