Some WUs are faulty, but we have no way of checking whether this is one of them unless it is reassigned and then completed by someone else.
Some people have been able to stop the client just before the error and restart it, which seems to skip the error.
Others have been able to run qfix and report the partial result. The client then moves on to another WU.
Others have simply deleted the WU that they know is going to fail and repeated that process until they got something else. We don't recommend deleting a WU, but in this case it's justified.
Thank you for the reply, Bruce.
I just let it do what it normally does. This time it retried only once more after the initial EUE.
Then I was assigned another 2653 WU.
Perhaps the dreaded thing has died, nevermore to return!
[08:09:47] - Preparing to get new work unit...
[08:09:47] + Attempting to get work packet
[08:09:47] - Connecting to assignment server
[08:09:48] - Successful: assigned to (171.64.65.63).
[08:09:48] + News From Folding@Home: Welcome to Folding@Home
[08:09:48] Loaded queue successfully.
[08:09:50] + Closed connections
[08:09:50]
[08:09:50] + Processing work unit
[08:09:50] Core required: FahCore_a1.exe
[08:09:50] Core found.
[08:09:50] Working on Unit 01 [May 7 08:09:50]
[08:09:50] + Working ...
[08:09:50]
[08:09:50] *------------------------------*
[08:09:50] Folding@Home Gromacs SMP Core
[08:09:50] Version 1.74 (March 10, 2007)
[08:09:50]
[08:09:50] Preparing to commence simulation
[08:09:50] - Ensuring status. Please wait.
[08:10:07] - Assembly optimizations manually forced on.
[08:10:07] - Not checking prior termination.
[08:10:07] - Expanded 283952 -> 1506689 (decompressed 530.6 percent)
[08:10:07] - Starting from initial work packet
[08:10:07]
[08:10:07] Project: 3050 (Run 7, Clone 32, Gen 62)
[08:10:07]
[08:10:07] Assembly optimizations on if available.
[08:10:07] Entering M.D.
[08:10:13] Protein: 9676 p3050_SProtein: 96Writing local files
[08:10:13] Extra SSE boost OK.
[08:10:13]
[08:10:13] Extra SSE boost OK.
[08:10:13] Writing local files
[08:10:13] Completed 0 out of 10000000 steps (0 percent)
[08:22:19] Writing local files
[08:22:19] Completed 100000 out of 10000000 steps (1 percent)
[08:35:01] Writing local files
[08:35:01] Completed 200000 out of 10000000 steps (2 percent)
[08:47:43] Writing local files
[08:47:43] Completed 300000 out of 10000000 steps (3 percent)
[09:00:23] Writing local files
[09:00:23] Completed 400000 out of 10000000 steps (4 percent)
<snip>
[17:51:11] Completed 4700000 out of 10000000 steps (47 percent)
[18:03:29] Writing local files
[18:03:29] Completed 4800000 out of 10000000 steps (48 percent)
[18:15:47] Writing local files
[18:15:47] Completed 4900000 out of 10000000 steps (49 percent)
[18:28:06] Writing local files
[18:28:06] Completed 5000000 out of 10000000 steps (50 percent)
[18:30:43] Warning: long 1-4 interactions
[18:30:44] Gromacs cannot continue further.
[18:30:44] Going to send back what have done.
[18:30:44] logfile size: 133136
[18:30:44] - Writing 133672 bytes of core data to disk...
[18:30:44] ... Done.
[18:30:44] - Failed to delete work/wudata_01.sas
[18:30:44] - Failed to delete work/wudata_01.goe
[18:30:44] Warning: check for stray files
[18:32:44]
[18:32:44] Folding@home Core Shutdown: EARLY_UNIT_END
[18:32:44]
[18:32:44] Folding@home Core Shutdown: EARLY_UNIT_END
[18:32:48] CoreStatus = 7B (123)
[18:32:48] Client-core communications error: ERROR 0x7b
[18:32:48] Deleting current work unit & continuing...
[18:35:08] - Preparing to get new work unit...
[18:35:08] + Attempting to get work packet
[18:35:08] - Connecting to assignment server
[18:35:09] - Successful: assigned to (171.64.65.64).
[18:35:09] + News From Folding@Home: Welcome to Folding@Home
[18:35:09] Loaded queue successfully.
[18:35:16] + Closed connections
This WU appears to be faulty. Each failed iteration is taking over 10 hours of wall clock time. It is a shame to waste so many crunching cycles on this. Please advise on an appropriate course of action.
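For anyone who wants to put a number on the time lost, here is a rough Python sketch that reads client log lines like the ones above and reports how long one attempt ran before the EARLY_UNIT_END. It assumes the log format shown in this post (a `[HH:MM:SS]` prefix, a `+ Processing work unit` line at the start, and `Completed N out of M steps` progress lines). The function name `summarize_run` and the returned keys are made up for illustration and are not part of the client or any F@H tool.

```python
import re
from datetime import datetime, timedelta

# Assumed line shape, based on the log pasted above: "[HH:MM:SS] message"
LINE_RE = re.compile(r"^\[(\d{2}:\d{2}:\d{2})\]\s?(.*)$")
PROGRESS_RE = re.compile(r"Completed (\d+) out of (\d+) steps")


def summarize_run(log_lines):
    """Summarize one work unit attempt (hypothetical helper, not an F@H API)."""
    start = end = None
    last_step = total_steps = None
    early_unit_end = False
    prev_stamp = None
    day_offset = timedelta(0)

    for line in log_lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skips lines such as "<snip>" that carry no timestamp

        stamp = datetime.strptime(match.group(1), "%H:%M:%S")
        # The log has no dates, so treat a backwards jump as passing midnight.
        if prev_stamp is not None and stamp < prev_stamp:
            day_offset += timedelta(days=1)
        prev_stamp = stamp
        now = stamp + day_offset
        text = match.group(2)

        if start is None and "+ Processing work unit" in text:
            start = now
        progress = PROGRESS_RE.search(text)
        if progress:
            last_step = int(progress.group(1))
            total_steps = int(progress.group(2))
        if "EARLY_UNIT_END" in text:
            early_unit_end = True
            end = now

    wall_clock = end - start if start is not None and end is not None else None
    return {
        "early_unit_end": early_unit_end,
        "wall_clock": wall_clock,
        "last_step": last_step,
        "total_steps": total_steps,
    }


# A few lines copied from the log above
sample = [
    "[08:09:50] + Processing work unit",
    "[18:28:06] Completed 5000000 out of 10000000 steps (50 percent)",
    "[18:30:44] Gromacs cannot continue further.",
    "[18:32:44] Folding@home Core Shutdown: EARLY_UNIT_END",
]
result = summarize_run(sample)
print(result["wall_clock"], result["last_step"], result["early_unit_end"])
```

To run it over a whole log file, pass the lines from `open(path)` for wherever your client keeps its log. Note that it only handles a single attempt; a log containing several retries would need to be split at each `+ Processing work unit` line first.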
Thanks!
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengeance (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760W Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicone fan gaskets and silicone mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).