Project: 2665 (Run 3, Clone 530, Gen 36)

Moderators: Site Moderators, FAHC Science Team

Post Reply
mikeb12
Posts: 28
Joined: Tue Feb 12, 2008 11:51 am
Location: South Carolina USA

Project: 2665 (Run 3, Clone 530, Gen 36)

Post by mikeb12 »

just reporting a wu. this is on a machine that's been folding 24/7 since oct 07. very unusual for this machine to eue...

Code: Select all

[06:10:33] Project: 2665 (Run 3, Clone 530, Gen 36)
[06:10:33] 
[06:10:33] Entering M.D.
[06:10:41] Rejecting checkpoint
[06:10:43] Protein: HGG in water
[06:10:43] Writing local files
[06:10:49] Extra SSE boost OK.
[06:10:50] Writing local files
[06:10:50] Completed 0 out of 250000 steps  (0 percent)
[06:27:29] Writing local files
[06:27:29] Completed 2500 out of 250000 steps  (1 percent)
[06:43:49] Writing local files
[06:43:49] Completed 5000 out of 250000 steps  (2 percent)
[07:00:11] Writing local files
[07:00:11] Completed 7500 out of 250000 steps  (3 percent)
[07:16:27] Writing local files
[07:16:27] Completed 10000 out of 250000 steps  (4 percent)
[07:32:42] Writing local files
[07:32:43] Completed 12500 out of 250000 steps  (5 percent)
[07:48:56] Writing local files
[07:48:56] Completed 15000 out of 250000 steps  (6 percent)
[08:05:11] Writing local files
[08:05:11] Completed 17500 out of 250000 steps  (7 percent)
[08:21:25] Writing local files
[08:21:26] Completed 20000 out of 250000 steps  (8 percent)
[08:37:41] Writing local files
[08:37:41] Completed 22500 out of 250000 steps  (9 percent)
[08:53:55] Writing local files
[08:53:56] Completed 25000 out of 250000 steps  (10 percent)
[09:10:09] Writing local files
[09:10:09] Completed 27500 out of 250000 steps  (11 percent)
[09:26:22] Writing local files
[09:26:22] Completed 30000 out of 250000 steps  (12 percent)
[09:42:35] Writing local files
[09:42:35] Completed 32500 out of 250000 steps  (13 percent)
[09:58:53] Writing local files
[09:58:53] Completed 35000 out of 250000 steps  (14 percent)
[10:15:06] Writing local files
[10:15:07] Completed 37500 out of 250000 steps  (15 percent)
[10:31:19] Writing local files
[10:31:19] Completed 40000 out of 250000 steps  (16 percent)
[10:47:33] Writing local files
[10:47:33] Completed 42500 out of 250000 steps  (17 percent)
[11:03:45] Writing local files
[11:03:45] Completed 45000 out of 250000 steps  (18 percent)
[11:19:58] Writing local files
[11:19:58] Completed 47500 out of 250000 steps  (19 percent)
[11:36:19] Writing local files
[11:36:19] Completed 50000 out of 250000 steps  (20 percent)
[11:52:47] Writing local files
[11:52:47] Completed 52500 out of 250000 steps  (21 percent)
[12:09:21] Writing local files
[12:09:21] Completed 55000 out of 250000 steps  (22 percent)
[12:25:52] Writing local files
[12:25:53] Completed 57500 out of 250000 steps  (23 percent)
[12:42:13] Writing local files
[12:42:13] Completed 60000 out of 250000 steps  (24 percent)
[12:58:56] Writing local files
[12:58:56] Completed 62500 out of 250000 steps  (25 percent)
[13:16:39] Writing local files
[13:16:40] Completed 65000 out of 250000 steps  (26 percent)
[13:34:02] Writing local files
[13:34:02] Completed 67500 out of 250000 steps  (27 percent)
[13:51:09] Writing local files
[13:51:09] Completed 70000 out of 250000 steps  (28 percent)
[14:07:58] Writing local files
[14:07:58] Completed 72500 out of 250000 steps  (29 percent)
[14:20:16] Warning:  long 1-4 interactions
[14:20:18] Quit 101 - NaN detected: (ener[0])
[14:20:18] 
[14:20:18] Simulation instability has been encountered. The run has entered a
[14:20:18]   state from which no further progress can be made.
[14:20:18] This may be the correct result of the simulation, however if you
[14:20:18]   often see other project units terminating early like this
[14:20:18]   too, you may wish to check the stability of your computer (issues
[14:20:18]   such as high temperature, overclocking, etc.).
[14:20:18] Going to send back what have done.
[14:20:18] logfile size: 63628
[14:20:18] - Writing 64177 bytes of core data to disk...
[14:20:18]   ... Done.
[14:20:18] - Failed to delete work/wudata_05.arc
[14:20:18] Warning:  check for stray files
[14:22:18] 
[14:22:18] Folding@home Core Shutdown: EARLY_UNIT_END
[14:22:18] 
[14:22:18] Folding@home Core Shutdown: EARLY_UNIT_END
[14:22:21] CoreStatus = 7B (123)
[14:22:21] Client-core communications error: ERROR 0x7b
[14:22:21] This is a sign of more serious problems, shutting down.
toTOW
Site Moderator
Posts: 6395
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 2665 (Run 3, Clone 530, Gen 36)

Post by toTOW »

I see another EUE report for this WU in the DB ...

You might try qfix to submit your partial results.
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
mikeb12
Posts: 28
Joined: Tue Feb 12, 2008 11:51 am
Location: South Carolina USA

Re: Project: 2665 (Run 3, Clone 530, Gen 36)

Post by mikeb12 »

I deleted work folder and queue.dat and untiinfo.txt after the error above and restarted the client.

but it picked the same wu up again anyway..

and It just eue'ed the second time, same place just now... I'm gonna delete the work folder and queue again and restart it, hopefully pick up a different wu... but that didn't work earlier today...

Code: Select all


--- Opening Log file [August 6 14:24:43 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.22 SMP Beta2

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: D:\Software\Folding\FOLDx64
Executable: D:\Software\Folding\FOLDx64\[email protected]
Arguments: -smp -local -advmethods 

[14:24:43] - Ask before connecting: No
[14:24:43] - User name: mikeb12 (Team 77826)
[14:24:43] - User ID: 504D8E7326393487
[14:24:43] - Machine ID: 1
[14:24:43] 
[14:24:43] Work directory not found. Creating...
[14:24:43] Could not open work queue, generating new queue...
[14:24:43] - Preparing to get new work unit...
[14:24:43] + Attempting to get work packet
[14:24:43] - Connecting to assignment server
[14:24:44] - Successful: assigned to (171.64.65.64).
[14:24:44] + News From Folding@Home: Welcome to Folding@Home
[14:24:44] Loaded queue successfully.
[14:25:05] + Closed connections
[14:25:05] 
[14:25:05] + Processing work unit
[14:25:05] Work type a1 not eligible for variable processors
[14:25:05] Core required: FahCore_a1.exe
[14:25:05] Core found.
[14:25:05] Using generic mpiexec calls
[14:25:05] Working on queue slot 01 [August 6 14:25:05 UTC]
[14:25:05] + Working ...
[14:25:05] 
[14:25:05] *------------------------------*
[14:25:05] Folding@Home Gromacs SMP Core
[14:25:05] Version 1.74 (March 10, 2007)
[14:25:05] 
[14:25:05] Preparing to commence simulation
[14:25:05] - Ensuring status. Please wait.
[14:25:11] - Starting from initial work packet
[14:25:11] 
[14:25:11] Project: 2665 (Run 3, Clone 530, Gen 36)
[14:25:11] 
[14:25:11] Assembly optimizations on if available.
[14:25:11] Entering M.D.
[14:25:31] al work packet
[14:25:31] 
[14:25:31] Project: 2665 (Run 3, Clone 530, Gen 36)
[14:25:31] 
[14:25:33] 65 (Run 3, Clone 530, Gen 36)
[14:25:33] 
[14:25:33] Entering M.D.
[14:25:40] Rejecting checkpoint
[14:25:42] Protein: HGG in water
[14:25:42] Writing local files
[14:25:49] Extra SSE boost OK.
[14:25:50] Writing local files
[14:25:50] Completed 0 out of 250000 steps  (0 percent)
[14:43:33] Writing local files
[14:43:33] Completed 2500 out of 250000 steps  (1 percent)
[14:59:50] Writing local files
[14:59:50] Completed 5000 out of 250000 steps  (2 percent)
[15:16:46] Writing local files
[15:16:47] Completed 7500 out of 250000 steps  (3 percent)
[15:33:42] Writing local files
[15:33:42] Completed 10000 out of 250000 steps  (4 percent)
[15:50:56] Writing local files
[15:50:56] Completed 12500 out of 250000 steps  (5 percent)
[16:08:04] Writing local files
[16:08:04] Completed 15000 out of 250000 steps  (6 percent)
[16:25:29] Writing local files
[16:25:29] Completed 17500 out of 250000 steps  (7 percent)
[16:41:43] Writing local files
[16:41:44] Completed 20000 out of 250000 steps  (8 percent)
[16:57:58] Writing local files
[16:57:58] Completed 22500 out of 250000 steps  (9 percent)
[17:14:10] Writing local files
[17:14:10] Completed 25000 out of 250000 steps  (10 percent)
[17:30:24] Writing local files
[17:30:24] Completed 27500 out of 250000 steps  (11 percent)
[17:46:27] Writing local files
[17:46:28] Completed 30000 out of 250000 steps  (12 percent)
[18:02:31] Writing local files
[18:02:31] Completed 32500 out of 250000 steps  (13 percent)
[18:18:34] Writing local files
[18:18:34] Completed 35000 out of 250000 steps  (14 percent)
[18:36:05] Writing local files
[18:36:05] Completed 37500 out of 250000 steps  (15 percent)
[18:53:25] Writing local files
[18:53:25] Completed 40000 out of 250000 steps  (16 percent)
[19:10:12] Writing local files
[19:10:12] Completed 42500 out of 250000 steps  (17 percent)
[19:27:09] Writing local files
[19:27:09] Completed 45000 out of 250000 steps  (18 percent)
[19:44:34] Writing local files
[19:44:35] Completed 47500 out of 250000 steps  (19 percent)
[20:01:14] Writing local files
[20:01:14] Completed 50000 out of 250000 steps  (20 percent)
[20:17:39] Writing local files
[20:17:39] Completed 52500 out of 250000 steps  (21 percent)
[20:35:13] Writing local files
[20:35:13] Completed 55000 out of 250000 steps  (22 percent)
[20:51:30] Writing local files
[20:51:30] Completed 57500 out of 250000 steps  (23 percent)
[21:08:30] Writing local files
[21:08:30] Completed 60000 out of 250000 steps  (24 percent)
[21:25:27] Writing local files
[21:25:27] Completed 62500 out of 250000 steps  (25 percent)
[21:42:11] Writing local files
[21:42:11] Completed 65000 out of 250000 steps  (26 percent)
[21:59:05] Writing local files
[21:59:05] Completed 67500 out of 250000 steps  (27 percent)
[22:16:14] Writing local files
[22:16:14] Completed 70000 out of 250000 steps  (28 percent)
[22:33:19] Writing local files
[22:33:19] Completed 72500 out of 250000 steps  (29 percent)
[22:46:08] Warning:  long 1-4 interactions
[22:46:11] Quit 101 - NaN detected: (ener[0])
[22:46:11] 
[22:46:11] Simulation instability has been encountered. The run has entered a
[22:46:11]   state from which no further progress can be made.
[22:46:11] This may be the correct result of the simulation, however if you
[22:46:11]   often see other project units terminating early like this
[22:46:11]   too, you may wish to check the stability of your computer (issues
[22:46:11]   such as high temperature, overclocking, etc.).
[22:46:11] Going to send back what have done.
[22:46:11] logfile size: 63628
[22:46:11] - Writing 64177 bytes of core data to disk...
[22:46:11]   ... Done.
[22:48:11] 
[22:48:11] Folding@home Core Shutdown: EARLY_UNIT_END
[22:48:11] 
[22:48:11] Folding@home Core Shutdown: EARLY_UNIT_END
[22:48:13] CoreStatus = 7B (123)
[22:48:13] Client-core communications error: ERROR 0x7b
[22:48:13] This is a sign of more serious problems, shutting down.
toTOW
Site Moderator
Posts: 6395
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 2665 (Run 3, Clone 530, Gen 36)

Post by toTOW »

It will try to process the same WU three times, unless you tell the server you got an EUE using qfix.

Here is a How to : viewtopic.php?f=8&t=191
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
mikeb12
Posts: 28
Joined: Tue Feb 12, 2008 11:51 am
Location: South Carolina USA

Re: Project: 2665 (Run 3, Clone 530, Gen 36)

Post by mikeb12 »

Ok, Thanks... got a new one... it did pick it up again (3rd time)......

didn't use qfix, but made it error by stopping the client and restarting before the processes were dead..
then rebooted, killed the work folder and queue, started again and it picked up a new unit..
I rarely get eue's or bad wu's and I've got 7 smp clients going 24/7... last one I reported was in March. so it's not a regular thing.
I've bookmarked qfix for next though. thanks for the link.. Mike :)

Code: Select all

[23:06:36] Working on queue slot 01 [August 6 23:06:36 UTC]
[23:06:36] + Working ...
[23:06:36] 
[23:06:36] *------------------------------*
[23:06:36] Folding@Home Gromacs SMP Core
[23:06:36] Version 1.74 (March 10, 2007)
[23:06:36] 
[23:06:36] Preparing to commence simulation
[23:06:36] - Ensuring status. Please wait.
[23:06:43] - Starting from initial work packet
[23:06:43] 
[23:06:43] Project: 2665 (Run 0, Clone 741, Gen 37)
[23:06:43] 
[23:06:44] Assembly optimizations on if available.
[23:06:44] Entering M.D.
[23:07:01] al work packet
[23:07:01] 
[23:07:01] Project: 2665 (Run 0, Clone 741, Gen 37)
[23:07:01] 
[23:07:02] Entering M.D.
[23:07:03] ne 741, Gen 37)
[23:07:03] 
[23:07:03] Entering M.D.
[23:07:10] Rejecting checkpoint
[23:07:14] Protein: HGG in water
[23:07:14] Writing local files
[23:07:21] Extra SSE boost OK.
[23:07:22] Writing local files
[23:07:22] Completed 0 out of 250000 steps  (0 percent)
Post Reply