Code: Select all
[13:38:00] *------------------------------*
[13:38:00] Folding@Home Gromacs SMP Core
[13:38:00] Version 1.74 (November 27, 2006)
[13:38:00]
[13:38:00] Preparing to commence simulation
[13:38:00] - Ensuring status. Please wait.
[13:38:17] - Assembly optimizations manually forced on.
[13:38:17] - Not checking prior termination.
[13:38:18] - Expanded 4711387 -> 24426905 (decompressed 518.4 percent)
[13:38:18] - Starting from initial work packet
[13:38:18]
[13:38:18] Project: 2665 (Run 3, Clone 826, Gen 25)
[13:38:18]
[13:38:18] Assembly optimizations on if available.
[13:38:18] Entering M.D.
[13:38:24] Rejecting checkpoint
[13:38:25] Protein: HGG in waterExtra SSE boost OK.
[13:38:25]
[13:38:26] Extra SSE boost OK.
[13:38:26] Writing local files
[13:38:26] Completed 0 out of 250000 steps (0 percent)
[13:52:56] Writing local files
[13:52:56] Completed 2500 out of 250000 steps (1 percent)
[14:07:28] Writing local files
[14:07:29] Completed 5000 out of 250000 steps (2 percent)
[14:22:05] Writing local files
[14:22:06] Completed 7500 out of 250000 steps (3 percent)
[14:36:42] Writing local files
[14:36:42] Completed 10000 out of 250000 steps (4 percent)
[14:51:18] Writing local files
[14:51:18] Completed 12500 out of 250000 steps (5 percent)
[15:05:47] Writing local files
[15:05:47] Completed 15000 out of 250000 steps (6 percent)
[15:08:22] - Autosending finished units...
[15:08:22] Trying to send all finished work units
[15:08:22] + No unsent completed units remaining.
[15:08:22] - Autosend completed
[15:20:16] Writing local files
[15:20:16] Completed 17500 out of 250000 steps (7 percent)
[15:29:30]
[15:29:30] Folding@home Core Shutdown: INTERRUPTED
[15:29:34] CoreStatus = 66 (102)
[15:29:34] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
(NOT Shutdown by User - Terminal indicates Segmentation Fault)<------------
Identical results with fresh Core, reboot, etc.
[15:29:34] Killing all core threads
Folding@Home Client Shutdown.
Ok. I need to ask a question (or two or three):
![Smile :)](./images/smilies/icon_smile.gif)
Is there ANY way to prevent being reassigned this WU several times (or I am missing something).
![Smile :)](./images/smilies/icon_smile.gif)
(-delete x works fine but one gets the identical WU several times PLUS an unattended machine would simply remain idle until the stopped condition of the client was detected)
What is the proper technique for dealing with these situations?
I have one Quad that was assigned a WU, failed at frame 70 with failure ultimately being detected and WU being deleted.
Two different WUs were then assigned and completed successfully.
Then the previously reassigned defective WU was re-assigned, only to once again fail at frame 70.
I believe that I reported that one here.
I also do NOT like the fact that the WU count is reset to 0 when one encounters this sort of situation (probably by that "bad" packet from server that one sees).
The machine here has successfully completed over 125 WUs - had one bad one - count reset - ran 10 or 20 more to completion - and I assume is now reset to 0.
I think that that stinks.
![Sad :(](./images/smilies/icon_sad.gif)
I know how to stop a WU prior to failure and either run it on another machine or continue with restart (and I back the darned things up a few times as it progresses).
In most cases, the latter has permitted me to complete the WU.
Was not successful with this one.
![Sad :(](./images/smilies/icon_sad.gif)
BTW, I do NOT want to delete queue.dat since I monitor my farm from one server and the information contained in that file is useful.