Page 1 of 1

1st WU, no credit - P2620 (R76, C13, G33) [CS 171.64.122.76]

Posted: Wed Sep 03, 2008 11:35 pm
by CannonFodder08
Just curious about how long it takes for a WU to get credited in the stats? My 'puter finished the WU above late last night, and there's been at least a few stats updates, yet I still have a "0" credit. Here's a partial copy & paste of the log:

[22:23:24] Trying to unzip core FahCore_78.exe
[22:23:24] Decompressed FahCore_78.exe (2338816 bytes) successfully
[22:23:29] + Core successfully engaged
[22:23:34]
[22:23:34] + Processing work unit
[22:23:34] Core required: FahCore_78.exe
[22:23:34] Core found.
[22:23:34] Working on queue slot 01 [September 1 22:23:34 UTC]
[22:23:34] + Working ...
[22:23:34]
[22:23:34] *------------------------------*
[22:23:34] Folding@Home Gromacs Core
[22:23:34] Version 1.90 (March 8, 2006)
[22:23:34]
[22:23:34] Preparing to commence simulation
[22:23:34] - Looking at optimizations...
[22:23:34] - Created dyn
[22:23:34] - Files status OK
[22:23:41] - Expanded 3634627 -> 18750173 (decompressed 515.8 percent)
[22:23:41] - Starting from initial work packet
[22:23:41]
[22:23:41] Project: 2620 (Run 76, Clone 13, Gen 33)
[22:23:41]
[22:23:41] Assembly optimizations on if available.
[22:23:41] Entering M.D.
[22:23:49] Protein: p2620_p1475_tet1_03_1 t= 20000.00000
[22:23:49]
[22:23:49] Writing local files
[22:23:55] Extra SSE boost OK.
[22:23:56] Writing local files
[22:23:56] Completed 0 out of 125000 steps (0%)

[snip]

[08:03:49] Completed 125000 out of 125000 steps (100%)
[08:03:49] Writing final coordinates.
[08:03:51] Past main M.D. loop
[08:04:51]
[08:04:51] Finished Work Unit:
[08:04:51] - Reading up to 3024264 from "work/wudata_01.arc": Read 3024264
[08:04:51] - Reading up to 5197688 from "work/wudata_01.xtc": Read 5197688
[08:04:51] goefile size: 0
[08:04:51] logfile size: 36345
[08:04:51] Leaving Run
[08:04:55] - Writing 8307825 bytes of core data to disk...
[08:05:01] Done: 8307313 -> 8084288 (compressed to 97.3 percent)
[08:05:01] ... Done.
[08:05:01] - Shutting down core
[08:05:01]
[08:05:01] Folding@home Core Shutdown: FINISHED_UNIT
[08:05:04] CoreStatus = 64 (100)
[08:05:04] Sending work to server
[08:05:04] Project: 2620 (Run 76, Clone 13, Gen 33)


[08:05:04] + Attempting to send results [September 3 08:05:04 UTC]
[08:05:04] - Couldn't send HTTP request to server
[08:05:04] (Got status 503)
[08:05:04] + Could not connect to Work Server (results)
[08:05:04] (171.64.65.65:8080)
[08:05:04] + Retrying using alternative port
[08:05:04] + Could not connect to Work Server (results)
[08:05:04] (171.64.65.65:80)
[08:05:04] - Error: Could not transmit unit 01 (completed September 3) to work server.
[08:05:04] Keeping unit 01 in queue.
[08:05:04] Project: 2620 (Run 76, Clone 13, Gen 33)


[08:05:04] + Attempting to send results [September 3 08:05:04 UTC]
[08:05:04] - Couldn't send HTTP request to server
[08:05:04] + Could not connect to Work Server (results)
[08:05:04] (171.64.65.65:8080)
[08:05:04] + Retrying using alternative port
[08:05:04] + Could not connect to Work Server (results)
[08:05:04] (171.64.65.65:80)
[08:05:04] - Error: Could not transmit unit 01 (completed September 3) to work server.


[08:05:04] + Attempting to send results [September 3 08:05:04 UTC]
[08:05:04] - Couldn't send HTTP request to server
[08:05:04] (Got status 503)
[08:05:04] + Could not connect to Work Server (results)
[08:05:04] (171.64.122.76:8080)
[08:05:04] + Retrying using alternative port
[08:05:04] - Couldn't send HTTP request to server
[08:05:04] (Got status 503)
[08:05:04] + Could not connect to Work Server (results)
[08:05:04] (171.64.122.76:80)
[08:05:04] Could not transmit unit 01 to Collection server; keeping in queue.
[08:05:04] - Preparing to get new work unit...
[08:05:04] + Attempting to get work packet
[08:05:04] - Connecting to assignment server
[08:05:05] - Successful: assigned to (130.49.240.81).
[08:05:05] + News From Folding@Home: Welcome to Folding@Home
[08:05:05] Loaded queue successfully.
[08:05:07] Project: 2620 (Run 76, Clone 13, Gen 33)


[08:05:07] + Attempting to send results [September 3 08:05:07 UTC]
[08:05:07] - Couldn't send HTTP request to server
[08:05:07] (Got status 503)
[08:05:07] + Could not connect to Work Server (results)
[08:05:07] (171.64.65.65:8080)
[08:05:07] + Retrying using alternative port
[08:05:07] - Couldn't send HTTP request to server
[08:05:07] + Could not connect to Work Server (results)
[08:05:07] (171.64.65.65:80)
[08:05:07] - Error: Could not transmit unit 01 (completed September 3) to work server.


[08:05:07] + Attempting to send results [September 3 08:05:07 UTC]
[08:06:15] + Results successfully sent
[08:06:15] Thank you for your contribution to Folding@Home.
[08:06:15] + Starting local stats count at 1
[08:06:15] Successfully sent unit 01 to Collection server.
[08:06:15] + Closed connections

Re: 1st WU, no credit - Project: 2620 (Run 76, Clone 13, Gen 33)

Posted: Wed Sep 03, 2008 11:56 pm
by kasson
It looks like there's an issue here--this was sent to the CS, but the work server never got it. I've contacted the CS team and we're looking into it. Thanks for the report.

Re: 1st WU, no credit - Project: 2620 (Run 76, Clone 13, Gen 33)

Posted: Sat Sep 06, 2008 1:00 am
by CannonFodder08
I just figured out how to see which of my Work Units I've received credit for, and apparently I've received credit for another WU done on the same machine as the WU mentioned in this thread, as well as a WU done on another machine, but the WU mentioned in the title of this thread is still missing. Will the missing WU finally end up on the work server as well, and credit issued, or will it end up as I've read in some other threads, i.e. the WU ends up being reassigned?

Re: 1st WU, no credit - Project: 2620 (Run 76, Clone 13, Gen 33)

Posted: Sat Sep 06, 2008 2:05 am
by stanmc
I have a similar problem concerning a 2620 WU. It is Project 2620 Run 21 Clone 10 Gen 36. It was returned, but the fahlog does not show which server received it. I would love to know if it was actually received and if I will get credit. I can show the part of the log that records the successful transmission of the unit. It was sent at 13:58:55 UTC on 4 September.

Re: 1st WU, no credit - Project: 2620 (Run 76, Clone 13, Gen 33)

Posted: Sat Sep 06, 2008 2:14 am
by Sahkuhnder
I don't know if it helps but I am still having issues with the same server as well. The WUs show as sent and received in the FAHlog but are never credited.

Most recent was last night - sent Project: 2613 (Run 53, Clone 7, Gen 31):

Code: Select all

[03:07:43] Completed 123750 out of 125000 steps  (99)
[04:08:13] Writing local files
[04:08:15] Completed 125000 out of 125000 steps  (100)
[04:08:15] Writing final coordinates.
[04:08:22] Past main M.D. loop
[04:09:22] 
[04:09:22] Finished Work Unit:
[04:09:22] - Reading up to 7393248 from "work/wudata_05.arc": Read 7393248
[04:09:23] - Reading up to 10464248 from "work/wudata_05.xtc": Read 10464248
[04:09:23] goefile size: 0
[04:09:23] logfile size: 43093
[04:09:23] Leaving Run
[04:09:25] - Writing 17950153 bytes of core data to disk...
[04:09:43] Done: 17949641 -> 17459349 (compressed to 97.2 percent)
[04:09:44]   ... Done.
[04:09:44] - Shutting down core
[04:09:45] 
[04:09:45] Folding@home Core Shutdown: FINISHED_UNIT
[04:09:49] CoreStatus = 64 (100)
[04:09:49] Sending work to server


[04:09:49] + Attempting to send results
[04:11:44] + Results successfully sent
[04:11:44] Thank you for your contribution to Folding@Home.
[04:11:44] + Number of Units Completed: 139

[04:11:48] - Preparing to get new work unit...
[04:11:48] + Attempting to get work packet
[04:11:48] - Connecting to assignment server
[04:11:48] - Successful: assigned to (171.64.65.65).
[04:11:48] + News From Folding@Home: Welcome to Folding@Home
[04:11:49] Loaded queue successfully.
[04:11:49] + Could not connect to Work Server
[04:11:49] - Error: Attempt #1  to get work failed, and no other work to do.
             Waiting before retry.
[04:12:03] + Attempting to get work packet
[04:12:03] - Connecting to assignment server
[04:12:03] - Successful: assigned to (171.64.65.65).
[04:12:03] + News From Folding@Home: Welcome to Folding@Home
[04:12:03] Loaded queue successfully.
[04:12:03] + Could not connect to Work Server
[04:12:03] - Error: Attempt #2  to get work failed, and no other work to do.
             Waiting before retry.
[04:12:28] + Attempting to get work packet
[04:12:28] - Connecting to assignment server
[04:12:28] - Successful: assigned to (171.64.65.65).
[04:12:28] + News From Folding@Home: Welcome to Folding@Home
[04:12:28] Loaded queue successfully.
[04:12:44] + Closed connections
[04:12:44] 
[04:12:44] + Processing work unit
[04:12:44] Core required: FahCore_78.exe
[04:12:44] Core found.
[04:12:44] Working on Unit 06 [September 5 04:12:44]
[04:12:44] + Working ...
[04:12:44] 
[04:12:44] *------------------------------*
[04:12:44] Folding@Home Gromacs Core
[04:12:44] Version 1.90 (March 8, 2006)
[04:12:44] 
[04:12:44] Preparing to commence simulation
[04:12:44] - Assembly optimizations manually forced on.
[04:12:44] - Not checking prior termination.
[04:12:51] - Expanded 3639992 -> 18751629 (decompressed 515.1 percent)
[04:12:52] - Starting from initial work packet
[04:12:52] 
[04:12:52] Project: 2621 (Run 93, Clone 63, Gen 39)
[04:12:52] 
[04:12:52] Assembly optimizations on if available.
[04:12:52] Entering M.D.
[04:13:01] Protein: p2621_p1475_tet1_03_1 t= 20000.00000
[04:13:01] 
[04:13:01] Writing local files
[04:13:08] Extra SSE boost OK.
[04:13:09] Writing local files
[04:13:09] Completed 0 out of 125000 steps  (0)
[04:32:41] Writing local files
[04:32:42] Completed 1250 out of 125000 steps  (1)

Re: 1st WU, no credit - Project: 2620 (Run 76, Clone 13, Gen 33)

Posted: Sat Sep 06, 2008 2:21 am
by bruce
CannonFodder08 wrote:I just figured out how to see which of my Work Units I've received credit for, and apparently I've received credit for another WU done on the same machine as the WU mentioned in this thread, as well as a WU done on another machine, but the WU mentioned in the title of this thread is still missing. Will the missing WU finally end up on the work server as well, and credit issued, or will it end up as I've read in some other threads, i.e. the WU ends up being reassigned?

As Kasson has said, the Collection Server will find it. There's no way to know how long it will take until they look into the problem, though.

The same is probably true for any other WUs that were uploaded to 171.64.122.76.

Title changed to reflect the fact that it's a problem with the CS.