Project: 2665 (Run 3, Clone 974, Gen 28) long 1-4 int...

Moderators: Site Moderators, FAHC Science Team

Post Reply
MT_Bender
Posts: 4
Joined: Thu Jun 05, 2008 6:00 am

Project: 2665 (Run 3, Clone 974, Gen 28) long 1-4 int...

Post by MT_Bender »

Hi
tonight I had 3 EUE's on this WU

Code: Select all

...
[03:41:22] Completed 245000 out of 250000 steps  (98 percent)
[03:56:22] Timered checkpoint triggered.
[03:59:11] Writing local files
[03:59:12] Completed 247500 out of 250000 steps  (99 percent)
[04:14:11] Timered checkpoint triggered.
[04:17:02] Writing local files
[04:17:02] Completed 250000 out of 250000 steps  (100 percent)
[04:17:02] Writing final coordinates.
[04:17:03] Past main M.D. loop
[04:17:03] Will end MPI now
[04:18:03] 
[04:18:03] Finished Work Unit:
[04:18:03] - Reading up to 21310704 from "work/wudata_05.arc": Read 21310704
[04:18:03] - Reading up to 556032 from "work/wudata_05.xtc": Read 556032
[04:18:04] goefile size: 0
[04:18:04] logfile size: 221653
[04:18:04] Leaving Run
[04:18:05] - Writing 22094761 bytes of core data to disk...
[04:18:05]   ... Done.
[04:18:05] - Failed to delete work/wudata_05.sas
[04:18:05] - Failed to delete work/wudata_05.goe
[04:18:05] Warning:  check for stray files
[04:18:05] - Shutting down core
[04:20:05] 
[04:20:05] Folding@home Core Shutdown: FINISHED_UNIT
[04:20:05] 
[04:20:05] Folding@home Core Shutdown: FINISHED_UNIT
[04:20:08] CoreStatus = 64 (100)
[04:20:08] Unit 5 finished with 65 percent of time to deadline remaining.
[04:20:08] Updated performance fraction: 0.693705
[04:20:08] Sending work to server
[04:20:08] Project: 2665 (Run 3, Clone 523, Gen 53)

[04:20:08] + Attempting to send results [October 2 04:20:08 UTC]
[04:20:08] - Reading file work/wuresults_05.dat from core
[04:20:08]   (Read 22094761 bytes from disk)
[04:20:08] Connecting to http://171.64.65.64:8080/
[04:30:52] Posted data.
[04:30:52] Initial: 0000; - Uploaded at ~33 kB/s
[04:30:52] - Averaged speed for that direction ~40 kB/s
[04:30:52] + Results successfully sent
[04:30:52] Thank you for your contribution to Folding@Home.
[04:30:52] + Number of Units Completed: 96

[04:30:53] Using generic mpiexec calls
[04:32:57] - Warning: Could not delete all work unit files (5): Core returned invalid code
[04:32:57] Trying to send all finished work units
[04:32:57] + No unsent completed units remaining.
[04:32:57] - Preparing to get new work unit...
[04:32:57] + Attempting to get work packet
[04:32:57] - Will indicate memory of 3071 MB
[04:32:57] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 11
[04:32:57] - Connecting to assignment server
[04:32:57] Connecting to http://assign.stanford.edu:8080/
[04:32:58] Posted data.
[04:32:58] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[04:32:58] + News From Folding@Home: Welcome to Folding@Home
[04:32:58] Loaded queue successfully.
[04:32:58] Connecting to http://171.64.65.64:8080/
[04:33:04] Posted data.
[04:33:04] Initial: 0000; - Receiving payload (expected size: 4721631)
[04:35:10] - Downloaded at ~36 kB/s
[04:35:10] - Averaged speed for that direction ~61 kB/s
[04:35:10] + Received work.
[04:35:10] Trying to send all finished work units
[04:35:10] + No unsent completed units remaining.
[04:35:10] + Closed connections
[04:35:10] 
[04:35:10] + Processing work unit
[04:35:10] Work type a1 not eligible for variable processors
[04:35:10] Core required: FahCore_a1.exe
[04:35:10] Core found.
[04:35:10] Using generic mpiexec calls
[04:35:10] Working on queue slot 06 [October 2 04:35:10 UTC]
[04:35:10] + Working ...
[04:35:10] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 06 -nocpulock -checkpoint 15 -verbose -lifeline 3564 -version 622'

[04:35:10] 
[04:35:10] *------------------------------*
[04:35:10] Folding@Home Gromacs SMP Core
[04:35:10] Version 1.74 (March 10, 2007)
[04:35:10] 
[04:35:10] Preparing to commence simulation
[04:35:10] - Ensuring status. Please wait.
[04:35:15] - Starting from initial work packet
[04:35:15] 
[04:35:15] Project: 2665 (Run 3, Clone 974, Gen 28)
[04:35:15] 
[04:35:16] Assembly optimizations on if available.
[04:35:16] Entering M.D.
[04:35:36] al work packet
[04:35:36] 
[04:35:36] Project: 2665 (Run 3, Clone 974, Gen 28)
[04:35:36] 
[04:35:40] Entering M.D.
[04:35:46] Rejecting checkpoint
[04:35:48] Protein: HGG in water
[04:35:48] Writing local files
[04:35:55] Extra SSE boost OK.
[04:35:56] Writing local files
[04:35:56] Completed 0 out of 250000 steps  (0 percent)
[04:43:20] Warning:  long 1-4 interactions
[04:43:20] Gromacs cannot continue further.
[04:43:20] Going to send back what have done.
[04:43:20] logfile size: 9422
[04:43:20] - Writing 9958 bytes of core data to disk...
[04:43:20]   ... Done.
[04:43:20] - Failed to delete work/wudata_06.sas
[04:43:20] - Failed to delete work/wudata_06.goe
[04:43:20] Warning:  check for stray files
[04:43:20] 
[04:43:20] Folding@home Core Shutdown: EARLY_UNIT_END
[04:43:20] 
[04:43:20] Folding@home Core Shutdown: EARLY_UNIT_END
[04:43:24] CoreStatus = 7B (123)
[04:43:24] Client-core communications error: ERROR 0x7b
[04:43:24] Deleting current work unit & continuing...
[04:43:24] Using generic mpiexec calls
[04:45:28] - Warning: Could not delete all work unit files (6): Core returned invalid code
[04:45:28] Trying to send all finished work units
[04:45:28] + No unsent completed units remaining.
[04:45:28] - Preparing to get new work unit...
[04:45:28] + Attempting to get work packet
[04:45:28] - Will indicate memory of 3071 MB
[04:45:28] - Connecting to assignment server
[04:45:28] Connecting to http://assign.stanford.edu:8080/
[04:45:29] Posted data.
[04:45:29] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[04:45:29] + News From Folding@Home: Welcome to Folding@Home
[04:45:29] Loaded queue successfully.
[04:45:29] Connecting to http://171.64.65.64:8080/
[04:45:35] Posted data.
[04:45:35] Initial: 0000; - Receiving payload (expected size: 4721631)
[04:47:40] - Downloaded at ~36 kB/s
[04:47:40] - Averaged speed for that direction ~56 kB/s
[04:47:40] + Received work.
[04:47:40] + Closed connections
[04:47:45] 
[04:47:45] + Processing work unit
[04:47:45] Work type a1 not eligible for variable processors
[04:47:45] Core required: FahCore_a1.exe
[04:47:45] Core found.
[04:47:45] Using generic mpiexec calls
[04:47:45] Working on queue slot 07 [October 2 04:47:45 UTC]
[04:47:45] + Working ...
[04:47:45] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 07 -nocpulock -checkpoint 15 -verbose -lifeline 3564 -version 622'

[04:47:45] 
[04:47:45] *------------------------------*
[04:47:45] Folding@Home Gromacs SMP Core
[04:47:45] Version 1.74 (March 10, 2007)
[04:47:45] 
[04:47:45] Preparing to commence simulation
[04:47:45] - Ensuring status. Please wait.
[04:47:50] - Starting from initial work packet
[04:47:50] 
[04:47:50] Project: 2665 (Run 3, Clone 974, Gen 28)
[04:47:50] 
[04:47:50] Assembly optimizations on if available.
[04:47:50] Entering M.D.
[04:48:13] al work packet
[04:48:13] 
[04:48:13] Project: 2665 (Run 3, Clone 974, Gen 28)
[04:48:13] 
[04:48:13] g from initial work packet
[04:48:13] 
[04:48:13] Project: 2665 (Run 3, Clone 974, Gen 28)
[04:48:13] 
[04:48:14] Entering M.D.
[04:48:23] Protein: HGG in water
[04:48:23] Writing local files
[04:48:24] Extra SSE boost OK.
[04:48:31] ps  (0 percent)
[04:56:58] Warning:  long 1-4 interactions
[04:56:58] Gromacs cannot continue further.
[04:56:58] Going to send back what have done.
[04:56:58] logfile size: 9422
[04:56:59] - Writing 9958 bytes of core data to disk...
[04:56:59]   ... Done.
[04:56:59] - Failed to delete work/wudata_07.sas
[04:56:59] - Failed to delete work/wudata_07.goe
[04:56:59] Warning:  check for stray files
[04:56:59] 
[04:56:59] Folding@home Core Shutdown: EARLY_UNIT_END
[04:56:59] 
[04:56:59] Folding@home Core Shutdown: EARLY_UNIT_END
[04:57:01] CoreStatus = 7B (123)
[04:57:01] Client-core communications error: ERROR 0x7b
[04:57:01] Deleting current work unit & continuing...
[04:57:01] Using generic mpiexec calls
[04:59:05] - Warning: Could not delete all work unit files (7): Core returned invalid code
[04:59:05] Trying to send all finished work units
[04:59:05] + No unsent completed units remaining.
[04:59:05] - Preparing to get new work unit...
[04:59:05] + Attempting to get work packet
[04:59:05] - Will indicate memory of 3071 MB
[04:59:05] - Connecting to assignment server
[04:59:05] Connecting to http://assign.stanford.edu:8080/
[04:59:06] Posted data.
[04:59:06] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[04:59:06] + News From Folding@Home: Welcome to Folding@Home
[04:59:06] Loaded queue successfully.
[04:59:06] Connecting to http://171.64.65.64:8080/
[04:59:12] Posted data.
[04:59:12] Initial: 0000; - Receiving payload (expected size: 4721631)
[05:01:09] - Downloaded at ~39 kB/s
[05:01:09] - Averaged speed for that direction ~52 kB/s
[05:01:09] + Received work.
[05:01:09] + Closed connections
[05:01:14] 
[05:01:14] + Processing work unit
[05:01:14] Work type a1 not eligible for variable processors
[05:01:14] Core required: FahCore_a1.exe
[05:01:14] Core found.
[05:01:14] Using generic mpiexec calls
[05:01:14] Working on queue slot 08 [October 2 05:01:14 UTC]
[05:01:14] + Working ...
[05:01:14] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 08 -nocpulock -checkpoint 15 -verbose -lifeline 3564 -version 622'

[05:01:14] 
[05:01:14] *------------------------------*
[05:01:14] Folding@Home Gromacs SMP Core
[05:01:14] Version 1.74 (March 10, 2007)
[05:01:14] 
[05:01:14] Preparing to commence simulation
[05:01:14] - Ensuring status. Please wait.
[05:01:19] - Starting from initial work packet
[05:01:19] 
[05:01:19] Project: 2665 (Run 3, Clone 974, Gen 28)
[05:01:19] 
[05:01:19] Assembly optimizations on if available.
[05:01:19] Entering M.D.
[05:01:40] al work packet
[05:01:40] 
[05:01:40] Project: 2665 (Run 3, Clone 974, Gen 28)
[05:01:40] 
[05:01:43] 65 (Run 3, Clone 974, Gen 28)
[05:01:43] 
[05:01:44] Entering M.D.
[05:01:50] Rejecting checkpoint
[05:01:52] Protein: HGG in water
[05:01:52] Writing local files
[05:01:59] Extra SSE boost OK.
[05:01:59] Writing local files
[05:02:00] Completed 0 out of 250000 steps  (0 percent)
[05:09:40] Warning:  long 1-4 interactions
[05:09:40] Gromacs cannot continue further.
[05:09:40] Going to send back what have done.
[05:09:40] logfile size: 9422
[05:09:40] - Writing 9958 bytes of core data to disk...
[05:09:40]   ... Done.
[05:09:40] - Failed to delete work/wudata_08.sas
[05:09:40] - Failed to delete work/wudata_08.goe
[05:09:40] Warning:  check for stray files
[05:09:40] 
[05:09:40] Folding@home Core Shutdown: EARLY_UNIT_END
[05:09:40] 
[05:09:40] Folding@home Core Shutdown: EARLY_UNIT_END
[05:09:42] CoreStatus = 7B (123)
[05:09:42] Client-core communications error: ERROR 0x7b
[05:09:42] - Attempting to download new core...
[05:09:42] + Downloading new core: FahCore_a1.exe
[05:09:42] Downloading core (/~pande/Win32/x86/Core_a1.fah from www.stanford.edu)
[05:09:43] Initial: AFDE; + 10240 bytes downloaded
[05:09:43] Initial: AD21; + 20480 bytes downloaded
[05:09:44] Initial: CC38; + 30720 bytes downloaded
...
[05:09:49] Initial: 34A0; + 778240 bytes downloaded
[05:09:50] Initial: DD6C; + 788480 bytes downloaded
[05:09:50] Initial: D2E9; + 789667 bytes downloaded
[05:09:50] Verifying core Core_a1.fah...
[05:09:50] Signature is VALID
[05:09:50] 
[05:09:50] Trying to unzip core FahCore_a1.exe
[05:09:50] Decompressed FahCore_a1.exe (2035712 bytes) successfully
[05:09:55] + Core successfully engaged
[05:09:55] Deleting current work unit & continuing...
[05:09:55] Using generic mpiexec calls
[05:11:59] - Warning: Could not delete all work unit files (8): Core returned invalid code
[05:11:59] Trying to send all finished work units
[05:11:59] + No unsent completed units remaining.
[05:11:59] - Preparing to get new work unit...
[05:11:59] + Attempting to get work packet
[05:11:59] - Will indicate memory of 3071 MB
[05:11:59] - Connecting to assignment server
[05:11:59] Connecting to http://assign.stanford.edu:8080/
[05:12:00] Posted data.
[05:12:00] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[05:12:00] + News From Folding@Home: Welcome to Folding@Home
[05:12:00] Loaded queue successfully.
[05:12:00] Connecting to http://171.64.65.64:8080/
[05:12:00] Posted data.
[05:12:00] Initial: 0000; - Error: Bad packet type from server, expected work assignment
[05:12:01] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[05:12:10] + Attempting to get work packet
[05:12:10] - Will indicate memory of 3071 MB
[05:12:10] - Connecting to assignment server
[05:12:10] Connecting to http://assign.stanford.edu:8080/
[05:12:11] Posted data.
[05:12:11] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[05:12:11] + News From Folding@Home: Welcome to Folding@Home
[05:12:11] Loaded queue successfully.
[05:12:11] Connecting to http://171.64.65.64:8080/
[05:12:16] Posted data.
[05:12:16] Initial: 0000; - Receiving payload (expected size: 4761024)
[05:14:30] - Downloaded at ~34 kB/s
[05:14:30] - Averaged speed for that direction ~49 kB/s
[05:14:30] + Received work.
[05:14:30] + Closed connections
[05:14:35] 
[05:14:35] + Processing work unit
[05:14:35] Work type a1 not eligible for variable processors
[05:14:35] Core required: FahCore_a1.exe
[05:14:35] Core found.
[05:14:35] Using generic mpiexec calls
[05:14:35] Working on queue slot 09 [October 2 05:14:35 UTC]
[05:14:35] + Working ...
[05:14:35] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 09 -nocpulock -checkpoint 15 -verbose -lifeline 3564 -version 622'

[05:14:35] 
[05:14:35] *------------------------------*
[05:14:35] Folding@Home Gromacs SMP Core
[05:14:35] Version 1.74 (March 10, 2007)
[05:14:35] 
[05:14:35] Preparing to commence simulation
[05:14:35] - Ensuring status. Please wait.
[05:14:40] - Starting from initial work packet
[05:14:40] 
[05:14:40] Project: 2665 (Run 3, Clone 903, Gen 54)
[05:14:40] 
[05:14:40] Assembly optimizations on if available.
[05:14:40] Entering M.D.
[05:15:00]  percent)
[05:15:00] - Starting from initial work packet
[05:15:01] 
[05:15:01] Project: 2665 (Run 3, Clone 903, Gen 54)
[05:15:01] 
[05:15:04] Entering M.D.
[05:15:11] Rejecting checkpoint
[05:15:13] Protein: HGG in water
[05:15:13] Writing local files
[05:15:20] Extra SSE boost OK.
[05:15:20] Writing local files
[05:15:20] Completed 0 out of 250000 steps  (0 percent)
[05:30:20] Timered checkpoint triggered.
[05:33:16] Writing local files
[05:33:16] Completed 2500 out of 250000 steps  (1 percent)
[05:48:16] Timered checkpoint triggered.
[05:51:01] Writing local files
...
now folding w/o issues
bye :wink:
toTOW
Site Moderator
Posts: 6359
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 2665 (Run 3, Clone 974, Gen 28) long 1-4 int...

Post by toTOW »

This one is a Bad WU, many people tried to fold it without success :(
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Post Reply