Project: 2653 (Run 9, Clone 166, Gen 71) : Early Unit End

Posted: Tue Mar 04, 2008 8:10 pm
by toTOW
Here's the log:
[19:46:33] Completed 290000 out of 500000 steps (58 percent)
[19:49:11] Gromacs cannot continue further.
[19:49:11] Going to send back what have done.
[19:49:11] logfile size: 8782
[19:49:11] - Writing 9318 bytes of core data to disk...
[19:49:11] ... Done.
[19:49:11] - Failed to delete work/wudata_07.arc
[19:49:11] Warning: check for stray files
[19:49:11]
[19:49:11] Folding@home Core Shutdown: EARLY_UNIT_END
[19:49:11]
[19:49:11] Folding@home Core Shutdown: EARLY_UNIT_END
[19:49:14] CoreStatus = 7B (123)
[19:49:14] Client-core communications error: ERROR 0x7b
[19:49:14] Deleting current work unit & continuing...
I ran qfix and it recovered the result file, but the Work Server is down:
[19:58:41] + Attempting to send results
[19:58:41] - Reading file work/wuresults_07.dat from core
[19:58:41] (Read 9318 bytes from disk)
[19:58:41] Connecting to http://171.64.65.64:8080/
[19:58:43] - Couldn't send HTTP request to server
[19:58:43] + Could not connect to Work Server (results)
[19:58:43] (171.64.65.64:8080)
[19:58:43] - Error: Could not transmit unit 07 (completed March 4) to work server.
[19:58:43] - 1 failed uploads of this unit.
[19:58:43] Keeping unit 07 in queue.
[19:58:43] + Sent 0 of 1 completed units to the server
[19:58:43] - Failed to send all units to server
When I restarted the client, it tried the Work Server again and failed, then the Collection Server seemed to do its job, although it issued a strange message:
[19:59:05] + Attempting to send results
[19:59:05] Core found.
[19:59:05] - Reading file work/wuresults_07.dat from core
[19:59:05] (Read 9318 bytes from disk)
[19:59:05] Connecting to http://171.64.65.64:8080/
[19:59:05] - Ensuring status. Please wait.
[19:59:07] - Couldn't send HTTP request to server
[19:59:07] + Could not connect to Work Server (results)
[19:59:07] (171.64.65.64:8080)
[19:59:07] - Error: Could not transmit unit 07 (completed March 4) to work server.
[19:59:07] - 2 failed uploads of this unit.
[...]
[19:59:07] - Reading file work/wuresults_07.dat from core
[19:59:07] (Read 9318 bytes from disk)
[19:59:07] Connecting to http://171.64.122.76:8080/
[19:59:08] Posted data.
[19:59:08] Initial: 0000; - Uploaded at ~10 kB/s
[19:59:08] - Averaged speed for that direction ~76 kB/s
[19:59:08] - Core type used on unit not what server demands.
[19:59:08] Successfully sent unit 07 to Collection server.
[19:59:08] + Sent 1 of 1 completed units to the server
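For anyone comparing against a similar log, here is a minimal Python sketch that pairs each "Connecting to" line with the outcome that follows it. The function name `upload_attempts` and the matching rules are my own assumptions based only on the message wording in the logs above; this is not part of the client or of qfix.

```python
import re

# Matches a client log line of the form "[HH:MM:SS] message".
LINE = re.compile(r"^\[(\d{2}:\d{2}:\d{2})\]\s*(.*)$")

def upload_attempts(log_text):
    """Hypothetical helper: list each connection attempt and its outcome."""
    attempts = []
    for raw in log_text.splitlines():
        m = LINE.match(raw.strip())
        if not m:
            continue  # skip lines without a timestamp, such as "[...]"
        time, msg = m.groups()
        if msg.startswith("Connecting to "):
            attempts.append({
                "time": time,
                "server": msg[len("Connecting to "):],
                "outcome": "unknown",
            })
        elif attempts and attempts[-1]["outcome"] == "unknown":
            # Message wording is taken from the log above (assumed stable).
            if "Couldn't send HTTP request" in msg or "Could not connect" in msg:
                attempts[-1]["outcome"] = "failed"
            elif msg.startswith("Successfully sent"):
                attempts[-1]["outcome"] = "sent"
    return attempts

sample = """\
[19:59:05] Connecting to http://171.64.65.64:8080/
[19:59:07] - Couldn't send HTTP request to server
[19:59:07] Connecting to http://171.64.122.76:8080/
[19:59:08] Posted data.
[19:59:08] Successfully sent unit 07 to Collection server.
"""

for a in upload_attempts(sample):
    print(a["time"], a["server"], a["outcome"])
```

On the excerpt above it reports the 171.64.65.64 attempt as failed and the 171.64.122.76 attempt as sent. It only reads the client's own messages, so it cannot tell whether the server actually credited the unit.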
Can someone check whether:
- anyone was able to return the WU
- I got partial credit for it, despite the strange behaviour of the CS

Thanks :)

Re: Project: 2653 (Run 9, Clone 166, Gen 71) : Early Unit End

Posted: Tue Mar 04, 2008 8:12 pm
by 7im
No record of Project: 2653 (Run 9, Clone 166, Gen 71) being returned.

Maybe you could run it again, and see if it fails in the same place.

Re: Project: 2653 (Run 9, Clone 166, Gen 71) : Early Unit End

Posted: Tue Mar 04, 2008 8:16 pm
by toTOW
The client has deleted everything: the WU when it failed, and the result file after the strange behaviour of the CS.

The server for 2653 is in Reject mode (see my topic in the Issue with specific server section), so I didn't get the same WU again :( (now folding a p3064).