Page 13 of 28

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 2:24 am
by VijayPande
We've updated the code on 171.67.108.21 with what we think could be a fix. I say "think" since this bug is very subtle so it's hard to see why it is failing in some cases, but not others.

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 2:28 am
by Tobit
Uploading fine but still reporting server does not have any record of the unit. Client also takes a long time to timeout when trying to connect to the 108.26 CS server.

Code: Select all

Launch directory: C:\fah\gpu1
Executable: [email protected]
Arguments: -send all -verbosity 9 

[02:21:21] - Ask before connecting: No
[02:21:21] - User name: Tobit (Team 33)
[02:21:21] - User ID: **************
[02:21:21] - Machine ID: 3
[02:21:21] 
[02:21:21] Loaded queue successfully.
[02:21:21] Attempting to return result(s) to server...
[02:21:21] Trying to send all finished work units
[02:21:21] Project: 5781 (Run 10, Clone 80, Gen 4)
[02:21:21] - Read packet limit of 540015616... Set to 524286976.


[02:21:21] + Attempting to send results [February 16 02:21:21 UTC]
[02:21:21] - Reading file work/wuresults_01.dat from core
[02:21:21]   (Read 168832 bytes from disk)
[02:21:21] Connecting to http://171.67.108.21:8080/
[02:21:22] Posted data.
[02:21:22] Initial: 0000; - Uploaded at ~165 kB/s
[02:21:22] - Averaged speed for that direction ~135 kB/s
[02:21:22] - Server does not have record of this unit. Will try again later.
[02:21:22] - Error: Could not transmit unit 01 (completed February 14) to work server.
[02:21:22] - 17 failed uploads of this unit.
[02:21:22] - Read packet limit of 540015616... Set to 524286976.


[02:21:22] + Attempting to send results [February 16 02:21:22 UTC]
[02:21:22] - Reading file work/wuresults_01.dat from core
[02:21:22]   (Read 168832 bytes from disk)
[02:21:22] Connecting to http://171.67.108.26:8080/
[02:24:00] ***** Got a SIGTERM signal (2)
[02:24:00] Killing all core threads

Folding@Home Client Shutdown.

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 2:32 am
by ikerekes
VijayPande wrote:Could people confirm the following:
- the problems are only seen in one server: 171.67.108.21
- the problems are only seen with configs with multiple GPUs in a single box

If either of the above aren't true for you, could you please post here?
Unfortunately neither is true one of my client (all of them are single GPU's) has 3 unit's ready for upload to servers 108.21 and 65.71 (the rest of them already overwritten).
Asus 9800GT/512M
Index 1: ready for upload 164 X min speed
server: 171.67.108.21:8080; project: 3470
Folding: run 10, clone 62, generation 0; benchmark 0; misc: 500, 200
issue: Fri Feb 12 20:27:41 2010; begin: Fri Feb 12 20:27:36 2010
end: Fri Feb 12 21:46:40 2010; due: Sun Feb 21 20:27:36 2010 (9 days)
--
Index 2: ready for upload
server: 171.64.65.71:8080; project: 10102
Folding: run 363, clone 0, generation 9; benchmark 0; misc: 500, 200
issue: Mon Feb 15 08:19:47 2010; begin: Mon Feb 15 08:19:47 2010
end: Mon Feb 15 09:02:25 2010; due: Thu Feb 18 08:19:47 2010 (3 days)
--
Index 3: ready for upload 24.7 X min speed
server: 171.64.65.71:8080; project: 10105
Folding: run 109, clone 6, generation 2; benchmark 0; misc: 500, 200
issue: Fri Feb 12 23:07:54 2010; begin: Fri Feb 12 23:07:49 2010
end: Sat Feb 13 02:02:26 2010; due: Mon Feb 15 23:07:49 2010 (3 days)
[/quote]

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 2:43 am
by weedacres
VijayPande wrote:We've updated the code on 171.67.108.21 with what we think could be a fix. I say "think" since this bug is very subtle so it's hard to see why it is failing in some cases, but not others.
It's uploaded a couple of recent workunits but is not dealing with those workunits that have not been previously sent.

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 3:07 am
by VijayPande
Our first goal is getting the recent ones to go back w/o issues. The next step is to see what we can do with the old ones.

Is anyone still having problems with recent WUs? (eg recent = assigned on Feb 15) TIA.

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 3:20 am
by weedacres
VijayPande wrote:Our first goal is getting the recent ones to go back w/o issues. The next step is to see what we can do with the old ones.

Is anyone still having problems with recent WUs? (eg recent = assigned on Feb 15) TIA.
I've uploaded 2 successfully since your post regarding the software change. Several more are due in the next hour.

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 3:32 am
by lambdapro
All recent WUs are working fine on my GTX260 now.
David

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 3:42 am
by ikerekes
VijayPande wrote:Our first goal is getting the recent ones to go back w/o issues. The next step is to see what we can do with the old ones.

Is anyone still having problems with recent WUs? (eg recent = assigned on Feb 15) TIA.
I don't know if server 171.64.65.71 counts, but I have restarted my GUI client just to see if my Feb 15 wu, would upload, but NO, it would not
"server doesn't have record of the wu"

Code: Select all

--- Opening Log file [February 16 03:22:25 UTC] 


# Windows GPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.23

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Documents and Settings\Ivan\Application Data\Folding@home-gpu


[03:22:25] - Ask before connecting: No
[03:22:25] - User name: ikerekes (Team 50619)
[03:22:25] - User ID: 3AC8B048259843DC
[03:22:25] - Machine ID: 2
[03:22:25] 
[03:22:25] Loaded queue successfully.
[03:22:25] Initialization complete
[03:22:25] 
[03:22:25] + Processing work unit
[03:22:25] Project: 3470 (Run 10, Clone 62, Gen 0)
[03:22:25] - Read packet limit of 540015616... Set to 524286976.


[03:22:25] + Attempting to send results [February 16 03:22:25 UTC]
[03:22:25] Core required: FahCore_11.exe
[03:22:25] Core found.
[03:22:25] Working on queue slot 07 [February 16 03:22:25 UTC]
[03:22:25] + Working ...
[03:22:25] 
[03:22:25] *------------------------------*
[03:22:25] Folding@Home GPU Core
[03:22:25] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[03:22:25] 
[03:22:25] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[03:22:25] Build host: amoeba
[03:22:25] Board Type: Nvidia
[03:22:25] Core      : 
[03:22:25] Preparing to commence simulation
[03:22:25] - Looking at optimizations...
[03:22:25] - Files status OK
[03:22:25] - Expanded 88670 -> 447307 (decompressed 504.4 percent)
[03:22:25] Called DecompressByteArray: compressed_data_size=88670 data_size=447307, decompressed_data_size=447307 diff=0
[03:22:25] - Digital signature verified
[03:22:25] 
[03:22:25] Project: 10105 (Run 439, Clone 8, Gen 3)
[03:22:25] 
[03:22:25] Assembly optimizations on if available.
[03:22:25] Entering M.D.
[03:22:26] - Couldn't send HTTP request to server
[03:22:26] + Could not connect to Work Server (results)
[03:22:26]     (171.67.108.21:8080)
[03:22:26] + Retrying using alternative port
[03:22:31] Will resume from checkpoint file
[03:22:31] Tpr hash work/wudata_07.tpr:  4113724815 1354760288 1656209051 1406404920 3687825015
[03:22:31] 
[03:22:31] Calling fah_main args: 14 usage=100
[03:22:31] 
[03:22:31] Working on p10105_lambda_370K
[03:22:33] Client config found, loading data.
[03:22:33] Starting GUI Server
[03:22:33] Resuming from checkpoint
[03:22:33] fcCheckPointResume: retreived and current tpr file hash:
[03:22:33]    0   4113724815   4113724815
[03:22:33]    1   1354760288   1354760288
[03:22:33]    2   1656209051   1656209051
[03:22:33]    3   1406404920   1406404920
[03:22:33]    4   3687825015   3687825015
[03:22:33] fcCheckPointResume: file hashes same.
[03:22:33] fcCheckPointResume: state restored.
[03:22:33] Verified work/wudata_07.log
[03:22:33] Verified work/wudata_07.edr
[03:22:33] Verified work/wudata_07.xtc
[03:22:33] Completed 70%
[03:22:45] - Couldn't send HTTP request to server
[03:22:45] + Could not connect to Work Server (results)
[03:22:45]     (171.67.108.21:80)
[03:22:45] - Error: Could not transmit unit 01 (completed February 13) to work server.
[03:22:45] - Read packet limit of 540015616... Set to 524286976.


[03:22:45] + Attempting to send results [February 16 03:22:45 UTC]
[03:24:22] Completed 71%
[03:26:12] Completed 72%
[03:27:57] + Could not connect to Work Server (results)
[03:27:57]     (171.67.108.26:8080)
[03:27:57] + Retrying using alternative port
[03:27:57] - Couldn't send HTTP request to server
[03:27:57]   (Got status 503)
[03:27:57] + Could not connect to Work Server (results)
[03:27:57]     (171.67.108.26:80)
[03:27:57]   Could not transmit unit 01 to Collection server; keeping in queue.
[03:27:57] Project: 10102 (Run 363, Clone 0, Gen 9)
[03:27:57] - Read packet limit of 540015616... Set to 524286976.


[[b]03:27:57] + Attempting to send results [February 16 03:27:57 UTC]
[03:28:00] - Couldn't send HTTP request to server
[03:28:00] + Could not connect to Work Server (results)
[03:28:00]     (171.64.65.71:8080)
[03:28:00] + Retrying using alternative port
[03:28:02] Completed 73%
[03:28:03] - Couldn't send HTTP request to server
[03:28:03] + Could not connect to Work Server (results)
[03:28:03]     (171.64.65.71:80)
[03:28:03] - Error: Could not transmit unit 02 (completed February 15) to work server.
[03:28:03] - Read packet limit of 540015616... Set to 524286976.


[03:28:03] + Attempting to send results [February 16 03:28:03 UTC]
[03:28:06] - Server does not have record of this unit. Will try again later.
[03:28:06]   Could not transmit unit 02 to Collection server; keeping in queue.[/b]
[03:28:06] Project: 10105 (Run 109, Clone 6, Gen 2)
[03:28:06] - Read packet limit of 540015616... Set to 524286976.


[03:28:06] + Attempting to send results [February 16 03:28:06 UTC]
[03:28:08] - Couldn't send HTTP request to server
[03:28:08] + Could not connect to Work Server (results)
[03:28:08]     (171.64.65.71:8080)
[03:28:08] + Retrying using alternative port
[03:28:10] - Couldn't send HTTP request to server
[03:28:10] + Could not connect to Work Server (results)
[03:28:10]     (171.64.65.71:80)
[03:28:10] - Error: Could not transmit unit 03 (completed February 13) to work server.
[03:28:10] - Read packet limit of 540015616... Set to 524286976.


[03:28:10] + Attempting to send results [February 16 03:28:10 UTC]
[03:28:12] - Server does not have record of this unit. Will try again later.
[03:28:12]   Could not transmit unit 03 to Collection server; keeping in queue.
[03:29:52] Completed 74%
[03:31:41] Completed 75%
[03:33:31] Completed 76%
[03:35:21] Completed 77%
PS. an other one on a different machine. Just finished but couldn't upload :twisted:

Code: Select all

04:16:23] + Processing work unit
[04:16:23] Core required: FahCore_11.exe
[04:16:23] Core found.
[04:16:23] Working on queue slot 02 [February 16 04:16:23 UTC]
[04:16:23] + Working ...
[04:16:23] - Calling '.\FahCore_11.exe -dir work/ -suffix 02 -checkpoint 15 -verbose -lifeline 8 -version 623'

[04:16:23] - Couldn't send HTTP request to server
[04:16:23] + Could not connect to Work Server (results)
[04:16:23]     (171.64.65.71:8080)
[04:16:23] + Retrying using alternative port
[04:16:23] Connecting to http://171.64.65.71:80/
[04:16:23] 
[04:16:23] *------------------------------*
[04:16:23] Folding@Home GPU Core
[04:16:23] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[04:16:23] 
[04:16:23] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[04:16:23] Build host: amoeba
[04:16:23] Board Type: Nvidia
[04:16:23] Core      : 
[04:16:23] Preparing to commence simulation
[04:16:23] - Looking at optimizations...
[04:16:23] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[04:16:23] - Created dyn
[04:16:23] - Files status OK
[04:16:23] - Expanded 88589 -> 447307 (decompressed 504.9 percent)
[04:16:23] Called DecompressByteArray: compressed_data_size=88589 data_size=447307, decompressed_data_size=447307 diff=0
[04:16:24] - Digital signature verified
[04:16:24] 
[04:16:24] Project: 10102 (Run 458, Clone 6, Gen 9)
[04:16:24] 
[04:16:24] Assembly optimizations on if available.
[04:16:24] Entering M.D.
[04:16:26] - Couldn't send HTTP request to server
[04:16:26] + Could not connect to Work Server (results)
[04:16:26]     (171.64.65.71:80)
[04:16:26] - Error: Could not transmit unit 01 (completed February 16) to work server.
[04:16:26] - 2 failed uploads of this unit.
[04:16:26] - Read packet limit of 540015616... Set to 524286976.


[04:16:26] + Attempting to send results [February 16 04:16:26 UTC]
[04:16:26] - Reading file work/wuresults_01.dat from core
[04:16:26]   (Read 169315 bytes from disk)
[04:16:26] Connecting to http://171.67.108.26:8080/
[04:16:30] Tpr hash work/wudata_02.tpr:  1851384266 2899504792 3853567251 2290650361 1657385430
[04:16:30] 
[04:16:30] Calling fah_main args: 14 usage=90
[04:16:30] 
[04:16:30] Working on p10102_lambda_370K
[04:16:32] Client config found, loading data.
[04:16:32] Starting GUI Server
[04:18:33] - Couldn't send HTTP request to server
[04:18:33] + Could not connect to Work Server (results)
[04:18:33]     (171.67.108.26:8080)
[04:18:33] + Retrying using alternative port
[04:18:33] Connecting to http://171.67.108.26:80/
[04:18:33] Completed 1%
[04:18:36] - Couldn't send HTTP request to server
[04:18:36] + Could not connect to Work Server (results)
[04:18:36]     (171.67.108.26:80)
[04:18:36]   Could not transmit unit 01 to Collection server; keeping in queue.
[04:18:36] + Sent 0 of 1 completed units to the server
[04:18:36] - Autosend completed
[04:20:35] Completed 2%
[04:22:36] Completed 3%

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 3:47 am
by Bobby-Uschi

Code: Select all

[20:32:59] Folding@Home GPU Core
[20:32:59] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[20:32:59] 
[20:32:59] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[20:32:59] Build host: amoeba
[20:32:59] Board Type: Nvidia
[20:32:59] Core      : 
[20:32:59] Preparing to commence simulation
[20:32:59] - Looking at optimizations...
[20:32:59] DeleteFrameFiles: successfully deleted file=work/wudata_09.ckp
[20:32:59] - Created dyn
[20:32:59] - Files status OK
[20:32:59] - Expanded 64986 -> 343707 (decompressed 528.8 percent)
[20:32:59] Called DecompressByteArray: compressed_data_size=64986 data_size=343707, decompressed_data_size=343707 diff=0
[20:32:59] - Digital signature verified
[20:32:59] 
[20:32:59] Project: 5783 (Run 7, Clone 19, Gen 17)
[20:32:59] 
[20:32:59] Assembly optimizations on if available.
[20:32:59] Entering M.D.
[20:33:05] Tpr hash work/wudata_09.tpr:  3242578799 830206300 3941946508 22098274 3559022915
[20:33:05] 
[20:33:05] Calling fah_main args: 14 usage=100
[20:33:05] 
[20:33:06] Working on GROwing Monsters And Cloning Shrimps
[20:33:11] Client config found, loading data.
[20:33:11] Starting GUI Server
[20:34:27] Completed 1%
[20:35:43] Completed 2%
[20:36:58] Completed 3%
[20:38:14] Completed 4%
[20:39:30] Completed 5%
[20:40:45] Completed 6%
[20:42:01] Completed 7%
[20:43:16] Completed 8%
[20:44:32] Completed 9%
[20:45:48] Completed 10%
[20:47:03] Completed 11%
[20:48:19] Completed 12%
[20:49:35] Completed 13%
[20:50:50] Completed 14%
[20:52:06] Completed 15%
[20:53:21] Completed 16%
[20:54:37] Completed 17%
[20:55:53] Completed 18%
[20:57:08] Completed 19%
[20:58:24] Completed 20%
[20:59:40] Completed 21%
[21:00:55] Completed 22%
[21:02:11] Completed 23%
[21:03:26] Completed 24%
[21:04:42] Completed 25%
[21:05:58] Completed 26%
[21:07:13] Completed 27%
[21:08:29] Completed 28%
[21:09:45] Completed 29%
[21:11:00] Completed 30%
[21:12:16] Completed 31%
[21:13:31] Completed 32%
[21:14:47] Completed 33%
[21:16:03] Completed 34%
[21:17:18] Completed 35%
[21:18:34] Completed 36%
[21:19:50] Completed 37%
[21:21:05] Completed 38%
[21:22:21] Completed 39%
[21:23:36] Completed 40%
[21:24:52] Completed 41%
[21:25:43] + Working...
[21:26:08] Completed 42%
[21:27:23] Completed 43%
[21:28:39] Completed 44%
[21:29:54] Completed 45%
[21:31:10] Completed 46%
[21:32:26] Completed 47%
[21:33:41] Completed 48%
[21:34:57] Completed 49%
[21:36:13] Completed 50%
[21:37:28] Completed 51%
[21:38:44] Completed 52%
[21:40:00] Completed 53%
[21:41:15] Completed 54%
[21:42:31] Completed 55%
[21:43:46] Completed 56%
[21:45:02] Completed 57%
[21:46:18] Completed 58%
[21:47:33] Completed 59%
[21:48:49] Completed 60%
[21:50:05] Completed 61%
[21:51:20] Completed 62%
[21:52:36] Completed 63%
[21:53:51] Completed 64%
[21:55:07] Completed 65%
[21:56:23] Completed 66%
[21:57:38] Completed 67%
[21:58:54] Completed 68%
[22:00:09] Completed 69%
[22:01:25] Completed 70%
[22:02:41] Completed 71%
[22:03:56] Completed 72%
[22:05:12] Completed 73%
[22:06:28] Completed 74%
[22:07:43] Completed 75%
[22:08:59] Completed 76%
[22:10:15] Completed 77%
[22:11:30] Completed 78%
[22:12:46] Completed 79%
[22:14:01] Completed 80%
[22:15:17] Completed 81%
[22:16:33] Completed 82%
[22:17:48] Completed 83%
[22:19:04] Completed 84%
[22:20:20] Completed 85%
[22:21:35] Completed 86%
[22:22:51] Completed 87%
[22:24:06] Completed 88%
[22:25:22] Completed 89%
[22:26:38] Completed 90%
[22:27:53] Completed 91%
[22:29:09] Completed 92%
[22:30:24] Completed 93%
[22:31:40] Completed 94%
[22:32:56] Completed 95%
[22:34:11] Completed 96%
[22:35:27] Completed 97%
[22:36:43] Completed 98%
[22:37:58] Completed 99%
[22:39:14] Completed 100%
[22:39:14] Successful run
[22:39:14] DynamicWrapper: Finished Work Unit: sleep=10000
[22:39:24] Reserved 145756 bytes for xtc file; Cosm status=0
[22:39:24] Allocated 145756 bytes for xtc file
[22:39:24] - Reading up to 145756 from "work/wudata_09.xtc": Read 145756
[22:39:24] Read 145756 bytes from xtc file; available packet space=786284708
[22:39:24] xtc file hash check passed.
[22:39:24] Reserved 22200 22200 786284708 bytes for arc file=<work/wudata_09.trr> Cosm status=0
[22:39:24] Allocated 22200 bytes for arc file
[22:39:24] - Reading up to 22200 from "work/wudata_09.trr": Read 22200
[22:39:24] Read 22200 bytes from arc file; available packet space=786262508
[22:39:24] trr file hash check passed.
[22:39:24] Allocated 560 bytes for edr file
[22:39:24] Read bedfile
[22:39:24] edr file hash check passed.
[22:39:24] Logfile not read.
[22:39:24] GuardedRun: success in DynamicWrapper
[22:39:24] GuardedRun: done
[22:39:24] Run: GuardedRun completed.
[22:39:24] + Opened results file
[22:39:24] - Writing 169028 bytes of core data to disk...
[22:39:24] Done: 168516 -> 167068 (compressed to 99.1 percent)
[22:39:24]   ... Done.
[22:39:24] DeleteFrameFiles: successfully deleted file=work/wudata_09.ckp
[22:39:24] Shutting down core 
[22:39:24] 
[22:39:24] Folding@home Core Shutdown: FINISHED_UNIT
[22:39:28] CoreStatus = 64 (100)
[22:39:28] Sending work to server
[22:39:28] Project: 5783 (Run 7, Clone 19, Gen 17)
[22:39:28] - Read packet limit of 540015616... Set to 524286976.


[22:39:28] + Attempting to send results [February 15 22:39:28 UTC]

Folding@Home Client Shutdown.


--- Opening Log file [February 15 22:52:46 UTC] 


# Windows GPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.23

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Dokumente und Einstellungen\Bob\Anwendungsdaten\Folding@home-gpu


[22:52:46] - Ask before connecting: No
[22:52:46] - User name: Bobby-Uschi (Team 34361)
[22:52:46] - User ID: 28D3F64619208C4B
[22:52:46] - Machine ID: 2
[22:52:46] 
[22:52:46] Loaded queue successfully.
[22:52:46] Initialization complete
[22:52:46] - Preparing to get new work unit...
[22:52:46] + Attempting to get work packet
[22:52:46] Project: 5783 (Run 7, Clone 19, Gen 17)
[22:52:46] - Connecting to assignment server
[22:52:46] - Read packet limit of 540015616... Set to 524286976.


[22:52:46] + Attempting to send results [February 15 22:52:46 UTC]
[22:52:48] - Successful: assigned to (171.67.108.11).
[22:52:48] + News From Folding@Home: Welcome to Folding@Home
[22:52:48] Loaded queue successfully.
[22:52:50] + Closed connections
[22:52:50] 
[22:52:50] + Processing work unit
[22:52:50] Core required: FahCore_11.exe
[22:52:50] Core found.
[22:52:50] Working on queue slot 00 [February 15 22:52:50 UTC]
[22:52:50] + Working ...
[22:52:51] 
[22:52:51] *------------------------------*
[22:52:51] Folding@Home GPU Core
[22:52:51] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[22:52:51] 
[22:52:51] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[22:52:51] Build host: amoeba
[22:52:51] Board Type: Nvidia
[22:52:51] Core      : 
[22:52:51] Preparing to commence simulation
[22:52:51] - Looking at optimizations...
[22:52:51] DeleteFrameFiles: successfully deleted file=work/wudata_00.ckp
[22:52:51] - Created dyn
[22:52:51] - Files status OK
[22:52:51] - Expanded 45436 -> 251112 (decompressed 552.6 percent)
[22:52:51] Called DecompressByteArray: compressed_data_size=45436 data_size=251112, decompressed_data_size=251112 diff=0
[22:52:51] - Digital signature verified
[22:52:51] 
[22:52:51] Project: 5772 (Run 1, Clone 200, Gen 2072)
[22:52:51] 
[22:52:51] Assembly optimizations on if available.
[22:52:51] Entering M.D.
[22:52:57] Tpr hash work/wudata_00.tpr:  2541501440 3000741762 3307526087 3638586476 490331188
[22:52:57] 
[22:52:57] Calling fah_main args: 14 usage=100
[22:52:57] 
[22:52:58] Working on Protein
[22:53:03] Client config found, loading data.
[22:53:03] Starting GUI Server
[22:53:37] Completed 1%
[22:54:12] Completed 2%
[22:54:47] Completed 3%
[22:55:22] Completed 4%
[22:55:56] Completed 5%
[22:56:31] Completed 6%
[22:57:06] Completed 7%
[22:57:41] Completed 8%
[22:58:15] Completed 9%
[22:58:50] Completed 10%
[22:59:25] Completed 11%
[23:00:00] Completed 12%
[23:00:34] Completed 13%
[23:01:09] Completed 14%
[23:01:44] Completed 15%
[23:02:19] Completed 16%
[23:02:53] Completed 17%
[23:03:28] Completed 18%
[23:04:03] Completed 19%
[23:04:38] Completed 20%
[23:05:12] Completed 21%
[23:05:47] Completed 22%
[23:06:22] Completed 23%
[23:06:57] Completed 24%
[23:07:31] Completed 25%
[23:08:06] Completed 26%
[23:08:41] Completed 27%
[23:09:16] Completed 28%
[23:09:50] Completed 29%
[23:10:25] Completed 30%
[23:11:00] Completed 31%
[23:11:35] Completed 32%
[23:12:09] Completed 33%
[23:12:44] Completed 34%
[23:13:19] Completed 35%
[23:13:54] Completed 36%
[23:14:28] Completed 37%
[23:15:03] Completed 38%
[23:15:38] Completed 39%
[23:16:13] Completed 40%
[23:16:47] Completed 41%
[23:17:22] Completed 42%
[23:17:57] Completed 43%
[23:18:32] Completed 44%
[23:19:06] Completed 45%
[23:19:41] Completed 46%
[23:20:16] Completed 47%
[23:20:51] Completed 48%
[23:21:25] Completed 49%
[23:22:00] Completed 50%
[23:22:35] Completed 51%
[23:23:10] Completed 52%
[23:23:44] Completed 53%
[23:24:19] Completed 54%
[23:24:54] Completed 55%
[23:25:29] Completed 56%
[23:26:03] Completed 57%
[23:26:26] - Unknown packet returned from server, expected ACK for results
[23:26:26] - Error: Could not transmit unit 09 (completed February 15) to work server.
[23:26:26]   Keeping unit 09 in queue.
[23:26:38] Completed 58%
[23:27:13] Completed 59%
[23:27:48] Completed 60%
[23:28:22] Completed 61%
[23:28:57] Completed 62%
[23:29:32] Completed 63%
[23:30:07] Completed 64%
[23:30:41] Completed 65%
[23:31:16] Completed 66%
[23:31:51] Completed 67%
[23:32:26] Completed 68%
[23:33:00] Completed 69%
[23:33:35] Completed 70%
[23:34:10] Completed 71%
[23:34:45] Completed 72%
[23:35:19] Completed 73%
[23:35:54] Completed 74%
[23:36:29] Completed 75%
[23:37:04] Completed 76%
[23:37:38] Completed 77%
[23:38:13] Completed 78%
[23:38:48] Completed 79%
[23:39:23] Completed 80%
[23:39:57] Completed 81%
[23:40:32] Completed 82%
[23:41:07] Completed 83%
[23:41:42] Completed 84%
[23:42:16] Completed 85%
[23:42:51] Completed 86%
[23:43:26] Completed 87%
[23:44:01] Completed 88%
[23:44:35] Completed 89%
[23:45:10] Completed 90%
[23:45:45] Completed 91%
[23:46:20] Completed 92%
[23:46:54] Completed 93%
[23:47:29] Completed 94%
[23:48:04] Completed 95%
[23:48:39] Completed 96%
[23:49:13] Completed 97%
[23:49:48] Completed 98%
[23:50:23] Completed 99%
[23:50:58] Completed 100%
[23:50:58] Successful run
[23:50:58] DynamicWrapper: Finished Work Unit: sleep=10000
[23:51:08] Reserved 76188 bytes for xtc file; Cosm status=0
[23:51:08] Allocated 76188 bytes for xtc file
[23:51:08] - Reading up to 76188 from "work/wudata_00.xtc": Read 76188
[23:51:08] Read 76188 bytes from xtc file; available packet space=786354276
[23:51:08] xtc file hash check passed.
[23:51:08] Reserved 15168 15168 786354276 bytes for arc file=<work/wudata_00.trr> Cosm status=0
[23:51:08] Allocated 15168 bytes for arc file
[23:51:08] - Reading up to 15168 from "work/wudata_00.trr": Read 15168
[23:51:08] Read 15168 bytes from arc file; available packet space=786339108
[23:51:08] trr file hash check passed.
[23:51:08] Allocated 560 bytes for edr file
[23:51:08] Read bedfile
[23:51:08] edr file hash check passed.
[23:51:08] Allocated 33327 bytes for logfile
[23:51:08] Read logfile
[23:51:08] GuardedRun: success in DynamicWrapper
[23:51:08] GuardedRun: done
[23:51:08] Run: GuardedRun completed.
[23:51:11] + Opened results file
[23:51:11] - Writing 125755 bytes of core data to disk...
[23:51:11] Done: 125243 -> 99616 (compressed to 79.5 percent)
[23:51:11]   ... Done.
[23:51:11] DeleteFrameFiles: successfully deleted file=work/wudata_00.ckp
[23:51:11] Shutting down core 
[23:51:11] 
[23:51:11] Folding@home Core Shutdown: FINISHED_UNIT
[23:51:15] CoreStatus = 64 (100)
[23:51:15] Sending work to server
[23:51:15] Project: 5772 (Run 1, Clone 200, Gen 2072)
[23:51:15] - Read packet limit of 540015616... Set to 524286976.


[23:51:15] + Attempting to send results [February 15 23:51:15 UTC]
[23:51:18] + Results successfully sent
[23:51:18] Thank you for your contribution to Folding@Home.
[23:51:18] + Number of Units Completed: 3269

[23:51:22] Project: 5783 (Run 7, Clone 19, Gen 17)
[23:51:22] - Read packet limit of 540015616... Set to 524286976.


[23:51:22] + Attempting to send results [February 15 23:51:22 UTC]
[23:51:30] - Server does not have record of this unit. Will try again later.
[23:51:30] - Error: Could not transmit unit 09 (completed February 15) to work server.
[23:51:30] - Read packet limit of 540015616... Set to 524286976.


[23:51:30] + Attempting to send results [February 15 23:51:30 UTC]
[00:41:42] + Could not connect to Work Server (results)
[00:41:42]     (171.67.108.26:8080)
[00:41:42] + Retrying using alternative port
[00:41:43] - Couldn't send HTTP request to server
[00:41:43]   (Got status 503)
[00:41:43] + Could not connect to Work Server (results)
[00:41:43]     (171.67.108.26:80)
[00:41:43]   Could not transmit unit 09 to Collection server; keeping in queue.
[00:41:43] - Preparing to get new work unit...
[00:41:43] + Attempting to get work packet
[00:41:43] - Connecting to assignment server
[00:41:44] - Successful: assigned to (171.67.108.11).
[00:41:44] + News From Folding@Home: Welcome to Folding@Home
[00:41:44] Loaded queue successfully.
[00:41:46] Project: 5783 (Run 7, Clone 19, Gen 17)
[00:41:46] - Read packet limit of 540015616... Set to 524286976.


[00:41:46] + Attempting to send results [February 16 00:41:46 UTC]
[00:41:53] - Server has already received unit.
[00:41:53] + Closed connections

Server has already received unit und noch x2

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 5:15 am
by Jolly-Swagman
Still not able to Upload to Servers, this is single GPU box 8800GT

Code: Select all

[04:35:49] Completed 96%
[04:37:45] Completed 97%
[04:39:42] Completed 98%
[04:41:39] Completed 99%
[04:43:36] Completed 100%
[04:43:36] Successful run
[04:43:36] DynamicWrapper: Finished Work Unit: sleep=10000
[04:43:46] Reserved 101288 bytes for xtc file; Cosm status=0
[04:43:46] Allocated 101288 bytes for xtc file
[04:43:46] - Reading up to 101288 from "work/wudata_02.xtc": Read 101288
[04:43:46] Read 101288 bytes from xtc file; available packet space=786329176
[04:43:46] xtc file hash check passed.
[04:43:46] Reserved 30216 30216 786329176 bytes for arc file=<work/wudata_02.trr> Cosm status=0
[04:43:46] Allocated 30216 bytes for arc file
[04:43:46] - Reading up to 30216 from "work/wudata_02.trr": Read 30216
[04:43:46] Read 30216 bytes from arc file; available packet space=786298960
[04:43:46] trr file hash check passed.
[04:43:46] Allocated 560 bytes for edr file
[04:43:46] Read bedfile
[04:43:46] edr file hash check passed.
[04:43:46] Logfile not read.
[04:43:46] GuardedRun: success in DynamicWrapper
[04:43:46] GuardedRun: done
[04:43:46] Run: GuardedRun completed.
[04:43:46] + Opened results file
[04:43:46] - Writing 132576 bytes of core data to disk...
[04:43:46] Done: 132064 -> 131609 (compressed to 99.6 percent)
[04:43:46]   ... Done.
[04:43:46] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[04:43:47] Shutting down core 
[04:43:47] 
[04:43:47] Folding@home Core Shutdown: FINISHED_UNIT
[04:43:49] CoreStatus = 64 (100)
[04:43:49] Sending work to server
[04:43:49] Project: 10105 (Run 422, Clone 3, Gen 3)


[04:43:49] + Attempting to send results [February 16 04:43:49 UTC]
[04:43:53] + Results successfully sent
[04:43:53] Thank you for your contribution to Folding@Home.
[04:43:53] + Number of Units Completed: 1328

[04:43:57] Project: 10103 (Run 782, Clone 6, Gen 4)


[04:43:57] + Attempting to send results [February 16 04:43:57 UTC]
[04:44:01] - Couldn't send HTTP request to server
[04:44:01] + Could not connect to Work Server (results)
[04:44:01]     (171.64.65.71:8080)
[04:44:01] + Retrying using alternative port
[04:44:05] - Couldn't send HTTP request to server
[04:44:05]   (Got status 400)
[04:44:05] + Could not connect to Work Server (results)
[04:44:05]     (171.64.65.71:80)
[04:44:05] - Error: Could not transmit unit 05 (completed February 13) to work server.


[04:44:05] + Attempting to send results [February 16 04:44:05 UTC]

Dual GPU 295-GTX GPU-0 GPU-1

Code: Select all

[19:48:55] + Attempting to send results [February 14 19:48:55 UTC]
[19:49:00] - Couldn't send HTTP request to server
[19:49:00] + Could not connect to Work Server (results)
[19:49:00]     (171.67.108.21:8080)
[19:49:00] + Retrying using alternative port
[19:52:00] - Couldn't send HTTP request to server
[19:52:00]   (Got status 504)
[19:52:00] + Could not connect to Work Server (results)
[19:52:00]     (171.67.108.21:80)
[19:52:00] - Error: Could not transmit unit 04 (completed February 7) to work server.


[19:52:00] + Attempting to send results [February 14 19:52:00 UTC]
[19:55:00] Completed 86%
[20:01:45] - Couldn't send HTTP request to server
[20:01:45] + Could not connect to Work Server (results)
[20:01:45]     (171.67.108.26:8080)
[20:01:45] + Retrying using alternative port
[20:01:46] - Couldn't send HTTP request to server
[20:01:46]   (Got status 503)
[20:01:46] + Could not connect to Work Server (results)
[20:01:46]     (171.67.108.26:80)
[20:01:46]   Could not transmit unit 04 to Collection server; keeping in queue.
[23:56:24] Completed 87%
[23:57:23] Completed 88%
[23:58:23] Completed 89%
[23:59:22] Completed 90%
[00:00:23] Completed 91%
[00:01:24] Completed 92%
[00:02:22] Completed 93%
[00:03:19] Completed 94%
[00:04:18] Completed 95%
[00:05:15] Completed 96%
[00:06:13] Completed 97%
[00:07:11] Completed 98%
[00:08:08] Completed 99%
[00:09:05] Completed 100%
[00:09:05] Successful run
[00:09:05] DynamicWrapper: Finished Work Unit: sleep=10000
[00:09:15] Reserved 65928 bytes for xtc file; Cosm status=0
[00:09:15] Allocated 65928 bytes for xtc file
[00:09:15] - Reading up to 65928 from "work/wudata_07.xtc": Read 65928
[00:09:15] Read 65928 bytes from xtc file; available packet space=786364536
[00:09:15] xtc file hash check passed.
[00:09:15] Reserved 6456 6456 786364536 bytes for arc file=<work/wudata_07.trr> Cosm status=0
[00:09:15] Allocated 6456 bytes for arc file
[00:09:15] - Reading up to 6456 from "work/wudata_07.trr": Read 6456
[00:09:15] Read 6456 bytes from arc file; available packet space=786358080
[00:09:15] trr file hash check passed.
[00:09:15] Allocated 560 bytes for edr file
[00:09:15] Read bedfile
[00:09:15] edr file hash check passed.
[00:09:15] Logfile not read.
[00:09:15] GuardedRun: success in DynamicWrapper
[00:09:15] GuardedRun: done
[00:09:15] Run: GuardedRun completed.
[00:09:16] + Opened results file
[00:09:16] - Writing 73456 bytes of core data to disk...
[00:09:16] Done: 72944 -> 69518 (compressed to 95.3 percent)
[00:09:16]   ... Done.
[00:09:16] DeleteFrameFiles: successfully deleted file=work/wudata_07.ckp
[00:09:17] Shutting down core 
[00:09:17] 
[00:09:17] Folding@home Core Shutdown: FINISHED_UNIT
[00:09:19] CoreStatus = 64 (100)
[00:09:19] Sending work to server
[00:09:19] Project: 3469 (Run 2, Clone 175, Gen 0)


[00:09:19] + Attempting to send results [February 15 00:09:19 UTC]
[00:09:22] - Couldn't send HTTP request to server
[00:09:22] + Could not connect to Work Server (results)
[00:09:22]     (171.67.108.21:8080)
[00:09:22] + Retrying using alternative port
[00:12:22] - Couldn't send HTTP request to server
[00:12:22]   (Got status 504)
[00:12:22] + Could not connect to Work Server (results)
[00:12:22]     (171.67.108.21:80)
[00:12:22] - Error: Could not transmit unit 07 (completed February 15) to work server.
[00:12:22]   Keeping unit 07 in queue.
[00:12:22] Project: 5781 (Run 8, Clone 352, Gen 3)


[00:12:22] + Attempting to send results [February 15 00:12:22 UTC]
[00:12:26] - Couldn't send HTTP request to server
[00:12:26] + Could not connect to Work Server (results)
[00:12:26]     (171.67.108.21:8080)
[00:12:26] + Retrying using alternative port
[00:15:27] - Couldn't send HTTP request to server
[00:15:27]   (Got status 504)
[00:15:27] + Could not connect to Work Server (results)
[00:15:27]     (171.67.108.21:80)
[00:15:27] - Error: Could not transmit unit 04 (completed February 7) to work server.


[00:15:27] + Attempting to send results [February 15 00:15:27 UTC]
[00:16:04] - Server does not have record of this unit. Will try again later.
[00:16:04]   Could not transmit unit 04 to Collection server; keeping in queue.
[00:16:04] Project: 3469 (Run 2, Clone 175, Gen 0)


[00:16:04] + Attempting to send results [February 15 00:16:04 UTC]
[00:16:07] - Server has already received unit.
[00:16:07] - Preparing to get new work unit...
[00:16:07] + Attempting to get work packet
[00:16:07] - Connecting to assignment server
[00:16:08] - Successful: assigned to (171.67.108.21).
[00:16:08] + News From Folding@Home: Welcome to Folding@Home
[00:16:08] Loaded queue successfully.
[00:16:09] - Couldn't send HTTP request to server
[00:16:09] + Could not connect to Work Server
[00:16:09] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[00:16:20] + Attempting to get work packet
[00:16:20] - Connecting to assignment server
[00:16:21] - Successful: assigned to (171.67.108.21).
[00:16:21] + News From Folding@Home: Welcome to Folding@Home
[00:16:21] Loaded queue successfully.
[00:16:22] - Couldn't send HTTP request to server
[00:16:22] + Could not connect to Work Server
[00:16:22] - Attempt #2  to get work failed, and no other work to do.
Waiting before retry.
[00:16:42] + Attempting to get work packet
[00:16:42] - Connecting to assignment server
[00:16:43] - Successful: assigned to (171.67.108.21).
[00:16:43] + News From Folding@Home: Welcome to Folding@Home
[00:16:43] Loaded queue successfully.
[00:16:44] - Couldn't send HTTP request to server
[00:16:44] + Could not connect to Work Server
[00:16:44] - Attempt #3  to get work failed, and no other work to do.
Waiting before retry.
[00:17:17] + Attempting to get work packet
[00:17:17] - Connecting to assignment server
[00:17:18] - Successful: assigned to (171.67.108.21).
[00:17:18] + News From Folding@Home: Welcome to Folding@Home
[00:17:18] Loaded queue successfully.
[00:17:19] - Couldn't send HTTP request to server
[00:17:19] + Could not connect to Work Server
[00:17:19] - Attempt #4  to get work failed, and no other work to do.
Waiting before retry.
[00:18:00] + Attempting to get work packet
[00:18:00] - Connecting to assignment server
[00:18:01] - Successful: assigned to (171.67.108.21).
[00:18:01] + News From Folding@Home: Welcome to Folding@Home
[00:18:01] Loaded queue successfully.
[00:18:02] - Couldn't send HTTP request to server
[00:18:02] + Could not connect to Work Server
[00:18:02] - Attempt #5  to get work failed, and no other work to do.
Waiting before retry.
[00:19:25] + Attempting to get work packet
[00:19:25] - Connecting to assignment server
[00:19:26] - Successful: assigned to (171.67.108.21).
[00:19:26] + News From Folding@Home: Welcome to Folding@Home
[00:19:26] Loaded queue successfully.
[00:19:27] - Couldn't send HTTP request to server
[00:19:27] + Could not connect to Work Server
[00:19:27] - Attempt #6  to get work failed, and no other work to do.
Waiting before retry.
[00:22:13] + Attempting to get work packet
[00:22:13] - Connecting to assignment server
[00:22:14] - Successful: assigned to (171.67.108.21).
[00:22:14] + News From Folding@Home: Welcome to Folding@Home
[00:22:14] Loaded queue successfully.
[00:22:15] - Couldn't send HTTP request to server
[00:22:15] + Could not connect to Work Server
[00:22:15] - Attempt #7  to get work failed, and no other work to do.
Waiting before retry.
[00:27:44] + Attempting to get work packet
[00:27:44] - Connecting to assignment server
[00:27:45] - Successful: assigned to (171.67.108.21).
[00:27:45] + News From Folding@Home: Welcome to Folding@Home
[00:27:45] Loaded queue successfully.
[00:27:46] - Couldn't send HTTP request to server
[00:27:46] + Could not connect to Work Server
[00:27:46] - Attempt #8  to get work failed, and no other work to do.
Waiting before retry.
[00:38:34] + Attempting to get work packet
[00:38:34] - Connecting to assignment server
[00:38:36] - Successful: assigned to (171.67.108.21).
[00:38:36] + News From Folding@Home: Welcome to Folding@Home
[00:38:36] Loaded queue successfully.
[00:38:36] - Couldn't send HTTP request to server
[00:38:36] + Could not connect to Work Server
[00:38:36] - Attempt #9  to get work failed, and no other work to do.
Waiting before retry.

Folding@Home Client Shutdown.

Code: Select all

[19:18:29] Loaded queue successfully.
[19:18:29] 
[19:18:29] + Processing work unit
[19:18:29] Core required: FahCore_11.exe
[19:18:29] Core found.
[19:18:29] Project: 5781 (Run 7, Clone 636, Gen 3)
[19:18:29] Working on queue slot 00 [February 14 19:18:29 UTC]


[19:18:29] + Working ...
[19:18:29] + Attempting to send results [February 14 19:18:29 UTC]
[19:18:29] 
[19:18:29] *------------------------------*
[19:18:29] Folding@Home GPU Core
[19:18:29] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[19:18:29] 
[19:18:29] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[19:18:29] Build host: amoeba
[19:18:29] Board Type: Nvidia
[19:18:29] Core      : 
[19:18:29] Preparing to commence simulation
[19:18:29] - Looking at optimizations...
[19:18:29] - Files status OK
[19:18:29] - Expanded 65003 -> 344387 (decompressed 529.8 percent)
[19:18:29] Called DecompressByteArray: compressed_data_size=65003 data_size=344387, decompressed_data_size=344387 diff=0
[19:18:29] - Digital signature verified
[19:18:29] 
[19:18:29] Project: 5781 (Run 18, Clone 24, Gen 3)
[19:18:29] 
[19:18:29] Assembly optimizations on if available.
[19:18:29] Entering M.D.
[19:18:34] - Couldn't send HTTP request to server
[19:18:34] + Could not connect to Work Server (results)
[19:18:34]     (171.67.108.21:8080)
[19:18:34] + Retrying using alternative port
[19:18:35] Will resume from checkpoint file
[19:18:35] Tpr hash work/wudata_00.tpr:  1321234567 3844498680 506652074 1581133975 1932259826
[19:18:35] 
[19:18:35] Calling fah_main args: 14 usage=100
[19:18:35] 
[19:18:36] Working on Great Red Owns Many ACres of Sand
[19:19:42] Client config found, loading data.
[19:19:42] Starting GUI Server
[19:19:45] Resuming from checkpoint
[19:19:45] fcCheckPointResume: retreived and current tpr file hash:
[19:19:45]    0   1321234567   1321234567
[19:19:45]    1   3844498680   3844498680
[19:19:45]    2    506652074    506652074
[19:19:45]    3   1581133975   1581133975
[19:19:45]    4   1932259826   1932259826
[19:19:45] fcCheckPointResume: file hashes same.
[19:19:45] fcCheckPointResume: state restored.
[19:19:45] Verified work/wudata_00.log
[19:19:45] Verified work/wudata_00.edr
[19:19:45] Verified work/wudata_00.xtc
[19:19:45] Completed 1%
[19:21:33] - Couldn't send HTTP request to server
[19:21:33]   (Got status 504)
[19:21:33] + Could not connect to Work Server (results)
[19:21:33]     (171.67.108.21:80)
[19:21:33] - Error: Could not transmit unit 04 (completed February 7) to work server.


[19:21:33] + Attempting to send results [February 14 19:21:33 UTC]
[20:01:45] - Unknown packet returned from server, expected ACK for results
[20:01:45]   Could not transmit unit 04 to Collection server; keeping in queue.

Folding@Home Client Shutdown.

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 8:36 am
by DavidMudkips
Returned several 576x and 1010x WUs without a problem today, but ran into the "Server does not have record of this unit. Will try again later." error on a 10105 just now.

EDIT: the 10105 got sent successfully after my next WU finished

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 9:33 am
by jima13
FWIW... scanned the logs of my 7 gpu clients and only found one reject>

[07:09:26] Project: 10103 (Run 896, Clone 8, Gen 5)


[07:09:26] + Attempting to send results [February 16 07:09:26 UTC]
[07:09:28] - Couldn't send HTTP request to server
[07:09:28] + Could not connect to Work Server (results)
[07:09:28] (171.64.65.71:8080)
[07:09:28] + Retrying using alternative port
[07:09:29] - Couldn't send HTTP request to server
[07:09:29] + Could not connect to Work Server (results)
[07:09:29] (171.64.65.71:80)
[07:09:29] - Error: Could not transmit unit 03 (completed February 16) to work server.


[07:09:29] + Attempting to send results [February 16 07:09:29 UTC]
[07:12:34] - Server does not have record of this unit. Will try again later.
[07:12:34] Could not transmit unit 03 to Collection server; keeping in queue.

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 12:35 pm
by SnW
Still WU's that just won't upload , even the ones i just Folded :(

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 12:40 pm
by noorman
SnW wrote:Still WU's that just won't upload , even the ones i just Folded :(
.

Yes, I also got such a report from a fellow Folder / I passed on the problem to Vijay Pande to look at.

It seems that 171.64.65.71 has the same problem as 171.67.108.21 (of which I got a similar report)

EDIT: Reported this one too, because it 's the same report with both of these; might be the same problem.

.

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Posted: Tue Feb 16, 2010 2:37 pm
by cdb
Any new WUs I'm getting since the server came back on yesterday are folding and sending, but I still can't upload any WUs from the weekend and they're causing my cards to pause whilst they fail trying. Shall I just delete them?

Code: Select all

[14:25:02] - Machine ID: 4
[14:25:02] 
[14:25:02] Loaded queue successfully.
[14:25:02] Initialization complete
[14:25:02] 
[14:25:02] + Processing work unit
[14:25:02] Core required: FahCore_11.exe
[14:25:02] Core found.
[14:25:02] - Autosending finished units... [February 16 14:25:02 UTC]
[14:25:02] Trying to send all finished work units
[14:25:02] Project: 5781 (Run 9, Clone 599, Gen 4)
[14:25:02] - Read packet limit of 540015616... Set to 524286976.


[14:25:02] + Attempting to send results [February 16 14:25:02 UTC]
[14:25:02] - Reading file work/wuresults_04.dat from core
[14:25:02]   (Read 167878 bytes from disk)
[14:25:02] Connecting to http://171.67.108.21:8080/
[14:25:02] Working on queue slot 02 [February 16 14:25:02 UTC]
[14:25:02] + Working ...
[14:25:02] - Calling '.\FahCore_11.exe -dir work/ -suffix 02 -priority 96 -checkpoint 15 -verbose -lifeline 2996 -version 623'

[14:25:02] 
[14:25:02] *------------------------------*
[14:25:02] Folding@Home GPU Core
[14:25:02] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[14:25:02] 
[14:25:02] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[14:25:02] Build host: amoeba
[14:25:02] Board Type: Nvidia
[14:25:02] Core      : 
[14:25:02] Preparing to commence simulation
[14:25:02] - Looking at optimizations...
[14:25:02] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[14:25:03] - Created dyn
[14:25:03] - Files status OK
[14:25:03] - Expanded 88645 -> 447307 (decompressed 504.6 percent)
[14:25:03] Called DecompressByteArray: compressed_data_size=88645 data_size=447307, decompressed_data_size=447307 diff=0
[14:25:03] - Digital signature verified
[14:25:03] 
[14:25:03] Project: 10105 (Run 313, Clone 1, Gen 4)
[14:25:03] 
[14:25:03] Assembly optimizations on if available.
[14:25:03] Entering M.D.
[14:25:04] - Couldn't send HTTP request to server
[14:25:04] + Could not connect to Work Server (results)
[14:25:04]     (171.67.108.21:8080)
[14:25:04] + Retrying using alternative port
[14:25:04] Connecting to http://171.67.108.21:80/
[14:25:09] Tpr hash work/wudata_02.tpr:  1804672923 1201156577 3376530980 1872270874 3847857237
[14:25:09] 
[14:25:09] Calling fah_main args: 14 usage=100
[14:25:09] 
[14:25:09] Working on p10105_lambda_370K
[14:25:11] Client config found, loading data.
[14:25:11] Starting GUI Server
[14:25:25] - Couldn't send HTTP request to server
[14:25:25] + Could not connect to Work Server (results)
[14:25:25]     (171.67.108.21:80)
[14:25:25] - Error: Could not transmit unit 04 (completed February 14) to work server.
[14:25:25] - 21 failed uploads of this unit.
[14:25:25] - Read packet limit of 540015616... Set to 524286976.


[14:25:25] + Attempting to send results [February 16 14:25:25 UTC]
[14:25:25] - Reading file work/wuresults_04.dat from core
[14:25:25]   (Read 167878 bytes from disk)
[14:25:25] Connecting to http://171.67.108.26:8080/
[14:27:11] Completed 1%