Project: 2665 (Run 2, Clone 149, Gen 39)

Moderators: Site Moderators, FAHC Science Team

Post Reply
ChelseaOilman
Posts: 1037
Joined: Sun Dec 02, 2007 3:47 pm
Location: Colorado @ 10,000 feet

Project: 2665 (Run 2, Clone 149, Gen 39)

Post by ChelseaOilman »

Code: Select all

[08:50:56] Working on queue slot 08 [September 2 08:50:56 UTC]
[08:50:56] + Working ...
[08:50:56] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -forceasm -verbose -lifeline 1892 -version 622'

[08:50:56] 
[08:50:56] *------------------------------*
[08:50:56] Folding@Home Gromacs SMP Core
[08:50:56] Version 1.74 (March 10, 2007)
[08:50:56] 
[08:50:56] Preparing to commence simulation
[08:50:56] - Ensuring status. Please wait.
[08:51:13] - Assembly optimizations manually forced on.
[08:51:13] - Not checking prior termination.
[08:51:28] - Expanded 4901502 -> 24810145 (decompressed 506.1 percent)
[08:51:28] - Starting from initial work packet
[08:51:28] 
[08:51:28] Project: 2665 (Run 2, Clone 149, Gen 39)
[08:51:28] 
[08:51:28] Assembly optimizations on if available.
[08:51:28] Entering M.D.
[08:51:35] Rejecting checkpoint
[08:51:37] NaN detected: x[17273][2]=9.23815 v[17273][2]=NaN
[08:51:37] utdown: BAD_CORE_FILES
[08:51:37] Finalizing output
[08:53:37] NaN detected: x[34361][2]=0.44656 v[34361][2]=NaN
[08:53:37] 
[08:53:37] Folding@home Core Shutdown: BAD_CORE_FILES
[08:53:37] Finalizing output
[08:55:40] CoreStatus = 1 (1)
[08:55:40] Client-core communications error: ERROR 0x1
[08:55:40] Deleting current work unit & continuing...
[08:55:40] Using generic mpiexec calls
[08:58:01] - Warning: Could not delete all work unit files (8): Core returned invalid code
[08:58:01] Trying to send all finished work units
[08:58:01] + No unsent completed units remaining.
[08:58:01] - Preparing to get new work unit...
[08:58:01] + Attempting to get work packet
[08:58:01] - Will indicate memory of 2046 MB
[08:58:01] - Connecting to assignment server
[08:58:01] Connecting to http://assign.stanford.edu:8080/
[08:58:01] Posted data.
[08:58:01] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[08:58:01] + News From Folding@Home: Welcome to Folding@Home
[08:58:01] Loaded queue successfully.
[08:58:01] Connecting to http://171.64.65.64:8080/
[08:58:07] Posted data.
[08:58:07] Initial: 0000; - Receiving payload (expected size: 4902014)
[08:58:11] - Downloaded at ~1196 kB/s
[08:58:11] - Averaged speed for that direction ~1264 kB/s
[08:58:11] + Received work.
[08:58:11] + Closed connections
[08:58:16] 
[08:58:16] + Processing work unit
[08:58:16] Work type a1 not eligible for variable processors
[08:58:16] Core required: FahCore_a1.exe
[08:58:16] Core found.
[08:58:16] Using generic mpiexec calls
[08:58:16] Working on queue slot 09 [September 2 08:58:16 UTC]
[08:58:16] + Working ...
[08:58:16] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 09 -checkpoint 15 -forceasm -verbose -lifeline 1892 -version 622'

[08:58:16] 
[08:58:16] *------------------------------*
[08:58:16] Folding@Home Gromacs SMP Core
[08:58:16] Version 1.74 (March 10, 2007)
[08:58:16] 
[08:58:16] Preparing to commence simulation
[08:58:16] - Ensuring status. Please wait.
[08:58:33] - Assembly optimizations manually forced on.
[08:58:33] - Not checking prior termination.
[08:58:47] - Expanded 4901502 -> 24810145 (decompressed 506.1 percent)
[08:58:47] - Starting from initial work packet
[08:58:47] 
[08:58:47] Project: 2665 (Run 2, Clone 149, Gen 39)
[08:58:47] 
[08:58:47] Assembly optimizations on if available.
[08:58:47] Entering M.D.
[08:58:55] Rejecting checkpoint
[08:58:57] NaN detected: x[17273][2]=9.23815 v[17273][2]=NaN
[08:58:57] utdown: BAD_CORE_FILES
[08:58:57] Finalizing output
[09:00:57] ES
[09:00:57] 
[09:00:57] Folding@home Core Shutdown: BAD_CORE_FILES
[09:00:58] aN
[09:00:58] 
[09:00:58] Folding@home Core Shutdown: BAD_CORE_FILES
[09:00:58] Finalizing output
[09:03:00] CoreStatus = 1 (1)
[09:03:00] Client-core communications error: ERROR 0x1
[09:03:00] Deleting current work unit & continuing...
[09:03:00] Using generic mpiexec calls
[09:05:21] - Warning: Could not delete all work unit files (9): Core returned invalid code
[09:05:21] Trying to send all finished work units
[09:05:21] + No unsent completed units remaining.
[09:05:21] - Preparing to get new work unit...
[09:05:21] + Attempting to get work packet
[09:05:21] - Will indicate memory of 2046 MB
[09:05:21] - Connecting to assignment server
[09:05:21] Connecting to http://assign.stanford.edu:8080/
[09:05:21] Posted data.
[09:05:21] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[09:05:21] + News From Folding@Home: Welcome to Folding@Home
[09:05:21] Loaded queue successfully.
[09:05:21] Connecting to http://171.64.65.64:8080/
[09:05:26] Posted data.
[09:05:26] Initial: 0000; - Receiving payload (expected size: 4902014)
[09:05:30] - Downloaded at ~1196 kB/s
[09:05:30] - Averaged speed for that direction ~1250 kB/s
[09:05:30] + Received work.
[09:05:30] + Closed connections
[09:05:35] 
[09:05:35] + Processing work unit
[09:05:35] Work type a1 not eligible for variable processors
[09:05:35] Core required: FahCore_a1.exe
[09:05:35] Core found.
[09:05:35] Using generic mpiexec calls
[09:05:35] Working on queue slot 00 [September 2 09:05:35 UTC]
[09:05:35] + Working ...
[09:05:35] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 00 -checkpoint 15 -forceasm -verbose -lifeline 1892 -version 622'

[09:05:35] 
[09:05:35] *------------------------------*
[09:05:35] Folding@Home Gromacs SMP Core
[09:05:35] Version 1.74 (March 10, 2007)
[09:05:35] 
[09:05:35] Preparing to commence simulation
[09:05:35] - Ensuring status. Please wait.
[09:05:52] - Assembly optimizations manually forced on.
[09:05:52] - Not checking prior termination.
[09:06:06] - Expanded 4901502 -> 24810145 (decompressed 506.1 percent)
[09:06:06] - Starting from initial work packet
[09:06:06] 
[09:06:06] Project: 2665 (Run 2, Clone 149, Gen 39)
[09:06:06] 
[09:06:06] Assembly optimizations on if available.
[09:06:06] Entering M.D.
[09:06:15] Rejecting checkpoint
[09:06:16] NaN detected: x[17273][2]=9.23815 v[17273][2]=NaN
[09:06:16] utdown: BAD_CORE_FILES
[09:06:16] Finalizing output
[09:08:16] NaN detected: x[34361][2]=0.44656 v[34361][2]=NaN
[09:08:16] 
[09:08:16] Folding@home Core Shutdown: BAD_CORE_FILES
[09:08:16] Finalizing output
[09:10:19] CoreStatus = 1 (1)
[09:10:19] Client-core communications error: ERROR 0x1
[09:10:19] - Attempting to download new core...
[09:10:19] + Downloading new core: FahCore_a1.exe
[09:10:19] Downloading core (/~pande/Win32/x86/Core_a1.fah from www.stanford.edu)
[09:10:19] Initial: AFDE; + 10240 bytes downloaded
<SNIP>
[09:10:21] Initial: D2E9; + 789667 bytes downloaded
[09:10:21] Verifying core Core_a1.fah...
[09:10:21] Signature is VALID
[09:10:21] 
[09:10:21] Trying to unzip core FahCore_a1.exe
[09:10:21] Decompressed FahCore_a1.exe (2035712 bytes) successfully
[09:10:26] + Core successfully engaged
[09:10:26] Deleting current work unit & continuing...
Related thread: BAD_CORE_FILES and NaN with 6.22b2 client
ChelseaOilman
Posts: 1037
Joined: Sun Dec 02, 2007 3:47 pm
Location: Colorado @ 10,000 feet

Re: Project: 2665 (Run 2, Clone 149, Gen 39)

Post by ChelseaOilman »

Received this WU again on another one of my machines last night. Log looks the same as before. This is a bad WU.
VijayPande
Pande Group Member
Posts: 2058
Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford

Re: Project: 2665 (Run 2, Clone 149, Gen 39)

Post by VijayPande »

I've manually stopped this WU.
Post Reply