Page 1 of 1

WS 171.64.65.56 not accepting/assigning WUs

Posted: Thu Dec 01, 2011 10:14 am
by Hisuichan
Good morning! I still can't upload a p11294 WU to either 171.64.65.56 (work server) or 171.67.108.26 (collection server) over 10 hours after completing it.

Thankfully I managed to upload different WUs to a different servers in the meantime, so my initial worry that the v7 client could crumble over stacking finished unsent WUs has vanished. Guess all I have to do is wait then until the server issues are resolved.

(Note: Since yesterday midnight I have been posting in a different thread about servers not accepting my WU - however the only similarity with the topic was the collection server, which needs no immediate reporting. So I deleted those posts and started a new thread to focus more on the work server.)

Code: Select all

23:54:56:Sending unit results: id:01 state:SEND error:OK project:11294 run:8 clone:376 gen:72 core:0x16 unit:0x0000004a0a3b1e5c4d9a1d3a18528d7b
23:54:56:Unit 01: Uploading 2.37MiB to 171.64.65.56
23:54:56:Connecting to 171.64.65.56:8080
23:55:58:SocketDevice::write(): Send error: 10054: An existing connection was forcibly closed by the remote host.
23:55:58:SocketDevice::write(): Socket not open
23:55:58:WARNING: Exception: Failed to send results to work server: Upload failed
23:55:58:Trying to send results to collection server
23:55:58:Unit 01: Uploading 2.37MiB to 171.67.108.26
23:55:58:Connecting to 171.67.108.26:8080
23:56:00:WARNING: WorkServer connection failed on port 8080 trying 80
23:56:00:Connecting to 171.67.108.26:80
23:56:01:ERROR: Exception: Failed to connect to 171.67.108.26:80: A socket operation was attempted to an unreachable network.
23:56:01:Sending unit results: id:01 state:SEND error:OK project:11294 run:8 clone:376 gen:72 core:0x16 unit:0x0000004a0a3b1e5c4d9a1d3a18528d7b
23:56:02:Unit 01: Uploading 2.37MiB to 171.64.65.56
23:56:02:Connecting to 171.64.65.56:8080
23:58:38:SocketDevice::write(): Send error: 10054: An existing connection was forcibly closed by the remote host.
23:58:38:SocketDevice::write(): Socket not open
23:58:38:WARNING: Exception: Failed to send results to work server: Upload failed
23:58:38:Trying to send results to collection server
... Rinse repeat ...
09:43:56:Sending unit results: id:01 state:SEND error:OK project:11294 run:8 clone:376 gen:72 core:0x16 unit:0x0000004a0a3b1e5c4d9a1d3a18528d7b
09:43:56:Unit 01: Uploading 2.37MiB to 171.64.65.56
09:43:56:Connecting to 171.64.65.56:8080
09:44:37:SocketDevice::write(): Send error: 10054: An existing connection was forcibly closed by the remote host.
09:44:37:SocketDevice::write(): Socket not open
09:44:37:WARNING: Exception: Failed to send results to work server: Upload failed
09:44:37:Trying to send results to collection server
09:44:37:Unit 01: Uploading 2.37MiB to 171.67.108.26
09:44:37:Connecting to 171.67.108.26:8080
09:44:39:WARNING: WorkServer connection failed on port 8080 trying 80
09:44:39:Connecting to 171.67.108.26:80
09:44:40:ERROR: Exception: Failed to connect to 171.67.108.26:80: A socket operation was attempted to an unreachable network.

Re: WS 171.64.65.56 and its CS not accepting WUs

Posted: Thu Dec 01, 2011 12:24 pm
by mattuconn
Same Issue. I have one completed and waiting to send for about 16hrs now.

Collection server is 171.67.108.26

Code: Select all

03:12:53:Enabled folding slot 01: READY smp:7
03:12:53:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
03:12:53:Unit 01: Uploading 2.37MiB to 171.64.65.56
03:12:53:Starting Unit 02
03:12:53:Connecting to 171.64.65.56:8080
03:12:53:Running core: C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_16.fah/FahCore_16.exe -dir 02 -suffix 01 -lifeline 3496 -version 701 -checkpoint 15 -gpu 0
03:12:53:Started core on PID 2700
03:12:53:FahCore 0x16 started
03:12:53:Starting Unit 00
03:12:53:Running core: C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -lifeline 3496 -version 701 -checkpoint 15 -np 7
03:12:53:Started core on PID 2756
03:12:53:FahCore 0xa4 started
03:12:53:Unit 02:
03:12:53:Server connection id=1 on 0.0.0.0:36330 from 127.0.0.1
03:12:53:Unit 02:*------------------------------*
03:12:53:Unit 02:Folding@Home GPU Core
03:12:53:Unit 02:Version 2.11 (Thu Dec 9 15:00:14 PST 2010)
03:12:53:Unit 02:
03:12:53:Unit 02:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 15.00.30729.01 for 80x86 
03:12:53:Unit 02:Build host: user-f6d030f24f
03:12:53:Unit 02:Board Type: AMD/OpenCL
03:12:53:Unit 02:Core      : x=16
03:12:53:Unit 02: Window's signal control handler registered.
03:12:53:Unit 02:Preparing to commence simulation
03:12:53:Unit 02:- Ensuring status. Please wait.
03:12:53:Unit 00:
03:12:53:Unit 00:*------------------------------*
03:12:53:Unit 00:Folding@Home Gromacs GB Core
03:12:53:Unit 00:Version 2.27 (Dec. 15, 2010)
03:12:53:Unit 00:
03:12:53:Unit 00:Preparing to commence simulation
03:12:53:Unit 00:- Ensuring status. Please wait.
03:13:03:Unit 00:- Looking at optimizations...
03:13:03:Unit 02:- Looking at optimizations...
03:13:03:Unit 00:- Working with standard loops on this execution.
03:13:03:Unit 02:- Working with standard loops on this execution.
03:13:03:Unit 00:- Previous termination of core was improper.
03:13:03:Unit 02:- Previous termination of core was improper.
03:13:03:Unit 00:- Files status OK
03:13:03:Unit 02:- Files status OK
03:13:03:Unit 02:sizeof(CORE_PACKET_HDR) = 512 file=<>
03:13:03:Unit 02:- Expanded 42506 -> 171163 (decompressed 402.6 percent)
03:13:03:Unit 02:Called DecompressByteArray: compressed_data_size=42506 data_size=171163, decompressed_data_size=171163 diff=0
03:13:03:Unit 02:- Digital signature verified
03:13:03:Unit 02:
03:13:03:Unit 02:Project: 11293 (Run 9, Clone 190, Gen 0)
03:13:03:Unit 02:
03:13:03:Unit 02:Entering M.D.
03:13:03:Unit 00:- Expanded 2053956 -> 5365960 (decompressed 261.2 percent)
03:13:03:Unit 00:Called DecompressByteArray: compressed_data_size=2053956 data_size=5365960, decompressed_data_size=5365960 diff=0
03:13:03:Unit 00:- Digital signature verified
03:13:03:Unit 00:
03:13:03:Unit 00:Project: 7808 (Run 2, Clone 350, Gen 11)
03:13:03:Unit 00:
03:13:03:Unit 00:Entering M.D.
03:13:04:Unit 02:Will resume from checkpoint file 02/wudata_01.ckp
03:13:04:Unit 02:Tpr hash 02/wudata_01.tpr:  3901370273 4098663553 4094707657 2830800939 1407606670
03:13:05:Unit 02:Working on ALZHEIMER DISEASE AMYLOID
03:13:05:Unit 02:Client config unavailable.
03:13:05:Unit 02:Starting GUI Server
03:13:08:Unit 02:Resuming from checkpoint
03:13:08:Unit 02:fcCheckPointResume: retreived and current tpr file hash:
03:13:08:Unit 02:   0   3901370273   3901370273
03:13:08:Unit 02:   1   4098663553   4098663553
03:13:08:Unit 02:   2   4094707657   4094707657
03:13:08:Unit 02:   3   2830800939   2830800939
03:13:08:Unit 02:   4   1407606670   1407606670
03:13:08:Unit 02:fcCheckPointResume: file hashes same.
03:13:08:Unit 02:fcCheckPointResume: state restored.
03:13:08:Unit 02:fcCheckPointResume: name 02/wudata_01.log Verified 02/wudata_01.log
03:13:08:Unit 02:fcCheckPointResume: name 02/wudata_01.trr Verified 02/wudata_01.trr
03:13:08:Unit 02:fcCheckPointResume: name 02/wudata_01.xtc Verified 02/wudata_01.xtc
03:13:08:Unit 02:fcCheckPointResume: name 02/wudata_01.edr Verified 02/wudata_01.edr
03:13:08:Unit 02:fcCheckPointResume: state restored 2
03:13:08:Unit 02:Resumed from checkpoint
03:13:08:Unit 02:Setting checkpoint frequency: 500000
03:13:08:Unit 02:Completed  12500001 out of 50000000 steps (25%).
03:13:09:Unit 00:Using Gromacs checkpoints
03:13:09:Unit 00:Mapping NT from 7 to 7 
03:13:09:Unit 00:Resuming from checkpoint
03:13:10:Unit 00:Verified 00/wudata_01.log
03:13:10:Unit 00:Verified 00/wudata_01.trr
03:13:10:Unit 00:Verified 00/wudata_01.xtc
03:13:10:Unit 00:Verified 00/wudata_01.edr
03:13:10:Unit 00:Completed 700880 out of 1500000 steps  (46%)
03:13:19:WARNING: Exception: Failed to send results to work server: Upload failed
03:13:19:Trying to send results to collection server
03:13:19:Unit 01: Uploading 2.37MiB to 171.67.108.26
03:13:19:Connecting to 171.67.108.26:8080
03:13:20:WARNING: WorkServer connection failed on port 8080 trying 80
03:13:20:Connecting to 171.67.108.26:80
03:13:22:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
03:13:22:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
03:13:22:Unit 01: Uploading 2.37MiB to 171.64.65.56
03:13:22:Connecting to 171.64.65.56:8080
03:13:43:WARNING: Exception: Failed to send results to work server: Upload failed
03:13:43:Trying to send results to collection server
03:13:43:Unit 01: Uploading 2.37MiB to 171.67.108.26
03:13:43:Connecting to 171.67.108.26:8080
03:13:44:WARNING: WorkServer connection failed on port 8080 trying 80
03:13:44:Connecting to 171.67.108.26:80
03:13:46:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
03:14:22:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
03:14:22:Unit 01: Uploading 2.37MiB to 171.64.65.56
03:14:22:Connecting to 171.64.65.56:8080
03:14:45:WARNING: Exception: Failed to send results to work server: Upload failed
03:14:45:Trying to send results to collection server
03:14:45:Unit 01: Uploading 2.37MiB to 171.67.108.26
03:14:45:Connecting to 171.67.108.26:8080
03:14:47:WARNING: WorkServer connection failed on port 8080 trying 80
03:14:47:Connecting to 171.67.108.26:80
03:14:48:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
03:15:59:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
03:15:59:Unit 01: Uploading 2.37MiB to 171.64.65.56
03:15:59:Connecting to 171.64.65.56:8080
03:16:22:WARNING: Exception: Failed to send results to work server: Upload failed
03:16:22:Trying to send results to collection server
03:16:22:Unit 01: Uploading 2.37MiB to 171.67.108.26
03:16:22:Connecting to 171.67.108.26:8080
03:16:24:WARNING: WorkServer connection failed on port 8080 trying 80
03:16:24:Connecting to 171.67.108.26:80
03:16:25:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
03:16:34:Unit 00:Completed 705000 out of 1500000 steps  (47%)
03:18:36:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
03:18:36:Unit 01: Uploading 2.37MiB to 171.64.65.56
03:18:36:Connecting to 171.64.65.56:8080
03:18:59:WARNING: Exception: Failed to send results to work server: Upload failed
03:18:59:Trying to send results to collection server
03:18:59:Unit 01: Uploading 2.37MiB to 171.67.108.26
03:18:59:Connecting to 171.67.108.26:8080
03:19:01:WARNING: WorkServer connection failed on port 8080 trying 80
03:19:01:Connecting to 171.67.108.26:80
03:19:02:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
03:20:32:Unit 02:Completed  13000000 out of 50000000 steps (26%).
03:22:51:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
03:22:51:Unit 01: Uploading 2.37MiB to 171.64.65.56
03:22:51:Connecting to 171.64.65.56:8080
03:23:14:WARNING: Exception: Failed to send results to work server: Upload failed
03:23:14:Trying to send results to collection server
03:23:14:Unit 01: Uploading 2.37MiB to 171.67.108.26
03:23:14:Connecting to 171.67.108.26:8080
03:23:15:WARNING: WorkServer connection failed on port 8080 trying 80
03:23:15:Connecting to 171.67.108.26:80
03:23:17:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
03:26:58:Unit 02:Completed  13500000 out of 50000000 steps (27%).
03:29:14:Unit 00:Completed 720000 out of 1500000 steps  (48%)
03:29:42:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
03:29:42:Unit 01: Uploading 2.37MiB to 171.64.65.56
03:29:42:Connecting to 171.64.65.56:8080
03:30:05:WARNING: Exception: Failed to send results to work server: Upload failed
03:30:05:Trying to send results to collection server
03:30:05:Unit 01: Uploading 2.37MiB to 171.67.108.26
03:30:05:Connecting to 171.67.108.26:8080
03:30:07:WARNING: WorkServer connection failed on port 8080 trying 80
03:30:07:Connecting to 171.67.108.26:80
03:30:08:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
03:31:23:Unit 02:Completed  14000000 out of 50000000 steps (28%).
03:35:48:Unit 02:Completed  14500000 out of 50000000 steps (29%).
03:40:14:Unit 02:Completed  15000000 out of 50000000 steps (30%).
03:40:48:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
03:40:48:Unit 01: Uploading 2.37MiB to 171.64.65.56
03:40:48:Connecting to 171.64.65.56:8080
03:41:11:WARNING: Exception: Failed to send results to work server: Upload failed
03:41:11:Trying to send results to collection server
03:41:11:Unit 01: Uploading 2.37MiB to 171.67.108.26
03:41:11:Connecting to 171.67.108.26:8080
03:41:12:WARNING: WorkServer connection failed on port 8080 trying 80
03:41:12:Connecting to 171.67.108.26:80
03:41:14:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
03:41:17:Unit 00:Completed 735000 out of 1500000 steps  (49%)
03:44:41:Unit 02:Completed  15500000 out of 50000000 steps (31%).
03:49:09:Unit 02:Completed  16000000 out of 50000000 steps (32%).
03:53:16:Unit 00:Completed 750000 out of 1500000 steps  (50%)
03:53:36:Unit 02:Completed  16500000 out of 50000000 steps (33%).
03:58:03:Unit 02:Completed  17000000 out of 50000000 steps (34%).
03:58:44:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
03:58:44:Unit 01: Uploading 2.37MiB to 171.64.65.56
03:58:44:Connecting to 171.64.65.56:8080
03:59:07:WARNING: Exception: Failed to send results to work server: Upload failed
03:59:07:Trying to send results to collection server
03:59:07:Unit 01: Uploading 2.37MiB to 171.67.108.26
03:59:07:Connecting to 171.67.108.26:8080
03:59:09:WARNING: WorkServer connection failed on port 8080 trying 80
03:59:09:Connecting to 171.67.108.26:80
03:59:11:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
04:02:30:Unit 02:Completed  17500000 out of 50000000 steps (35%).
04:05:12:Unit 00:Completed 765000 out of 1500000 steps  (51%)
04:06:57:Unit 02:Completed  18000000 out of 50000000 steps (36%).
04:11:25:Unit 02:Completed  18500000 out of 50000000 steps (37%).
04:15:49:Unit 02:Completed  19000000 out of 50000000 steps (38%).
04:17:14:Unit 00:Completed 780000 out of 1500000 steps  (52%)
04:20:16:Unit 02:Completed  19500000 out of 50000000 steps (39%).
04:24:42:Unit 02:Completed  20000000 out of 50000000 steps (40%).
04:27:47:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
04:27:47:Unit 01: Uploading 2.37MiB to 171.64.65.56
04:27:47:Connecting to 171.64.65.56:8080
04:28:12:WARNING: Exception: Failed to send results to work server: Upload failed
04:28:12:Trying to send results to collection server
04:28:12:Unit 01: Uploading 2.37MiB to 171.67.108.26
04:28:12:Connecting to 171.67.108.26:8080
04:28:13:WARNING: WorkServer connection failed on port 8080 trying 80
04:28:13:Connecting to 171.67.108.26:80
04:28:14:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
04:29:08:Unit 02:Completed  20500000 out of 50000000 steps (41%).
04:29:12:Unit 00:Completed 795000 out of 1500000 steps  (53%)
04:33:30:Unit 02:Completed  21000000 out of 50000000 steps (42%).
04:37:57:Unit 02:Completed  21500000 out of 50000000 steps (43%).
04:41:16:Unit 00:Completed 810000 out of 1500000 steps  (54%)
04:42:23:Unit 02:Completed  22000000 out of 50000000 steps (44%).
04:46:51:Unit 02:Completed  22500000 out of 50000000 steps (45%).
04:51:16:Unit 02:Completed  23000000 out of 50000000 steps (46%).
04:53:14:Unit 00:Completed 825000 out of 1500000 steps  (55%)
04:55:43:Unit 02:Completed  23500000 out of 50000000 steps (47%).
05:00:10:Unit 02:Completed  24000000 out of 50000000 steps (48%).
05:04:37:Unit 02:Completed  24500000 out of 50000000 steps (49%).
05:05:11:Unit 00:Completed 840000 out of 1500000 steps  (56%)
05:09:04:Unit 02:Completed  25000000 out of 50000000 steps (50%).
05:13:31:Unit 02:Completed  25500000 out of 50000000 steps (51%).
05:14:45:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
05:14:45:Unit 01: Uploading 2.37MiB to 171.64.65.56
05:14:45:Connecting to 171.64.65.56:8080
05:15:08:WARNING: Exception: Failed to send results to work server: Upload failed
05:15:08:Trying to send results to collection server
05:15:08:Unit 01: Uploading 2.37MiB to 171.67.108.26
05:15:08:Connecting to 171.67.108.26:8080
05:15:10:WARNING: WorkServer connection failed on port 8080 trying 80
05:15:10:Connecting to 171.67.108.26:80
05:15:11:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
05:17:08:Unit 00:Completed 855000 out of 1500000 steps  (57%)
05:17:58:Unit 02:Completed  26000000 out of 50000000 steps (52%).
05:22:24:Unit 02:Completed  26500000 out of 50000000 steps (53%).
05:26:50:Unit 02:Completed  27000000 out of 50000000 steps (54%).
05:29:06:Unit 00:Completed 870000 out of 1500000 steps  (58%)
05:31:35:Unit 02:Completed  27500000 out of 50000000 steps (55%).
05:36:02:Unit 02:Completed  28000000 out of 50000000 steps (56%).
05:40:27:Unit 02:Completed  28500000 out of 50000000 steps (57%).
05:41:43:Unit 00:Completed 885000 out of 1500000 steps  (59%)
05:44:54:Unit 02:Completed  29000000 out of 50000000 steps (58%).
05:49:21:Unit 02:Completed  29500000 out of 50000000 steps (59%).
05:53:37:Unit 00:Completed 900000 out of 1500000 steps  (60%)
05:53:48:Unit 02:Completed  30000000 out of 50000000 steps (60%).
05:58:14:Unit 02:Completed  30500000 out of 50000000 steps (61%).
06:02:41:Unit 02:Completed  31000000 out of 50000000 steps (62%).
06:05:34:Unit 00:Completed 915000 out of 1500000 steps  (61%)
06:07:07:Unit 02:Completed  31500000 out of 50000000 steps (63%).
06:11:34:Unit 02:Completed  32000000 out of 50000000 steps (64%).
06:15:59:Unit 02:Completed  32500000 out of 50000000 steps (65%).
06:17:30:Unit 00:Completed 930000 out of 1500000 steps  (62%)
06:20:25:Unit 02:Completed  33000000 out of 50000000 steps (66%).
06:24:51:Unit 02:Completed  33500000 out of 50000000 steps (67%).
06:29:18:Unit 02:Completed  34000000 out of 50000000 steps (68%).
06:29:23:Unit 00:Completed 945000 out of 1500000 steps  (63%)
06:30:46:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
06:30:46:Unit 01: Uploading 2.37MiB to 171.64.65.56
06:30:46:Connecting to 171.64.65.56:8080
06:31:09:WARNING: Exception: Failed to send results to work server: Upload failed
06:31:09:Trying to send results to collection server
06:31:09:Unit 01: Uploading 2.37MiB to 171.67.108.26
06:31:09:Connecting to 171.67.108.26:8080
06:31:11:WARNING: WorkServer connection failed on port 8080 trying 80
06:31:11:Connecting to 171.67.108.26:80
06:31:12:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
06:33:40:Unit 02:Completed  34500000 out of 50000000 steps (69%).
06:38:03:Unit 02:Completed  35000000 out of 50000000 steps (70%).
06:41:24:Unit 00:Completed 960000 out of 1500000 steps  (64%)
06:42:30:Unit 02:Completed  35500000 out of 50000000 steps (71%).
06:46:56:Unit 02:Completed  36000000 out of 50000000 steps (72%).
06:51:23:Unit 02:Completed  36500000 out of 50000000 steps (73%).
06:53:18:Unit 00:Completed 975000 out of 1500000 steps  (65%)
06:55:48:Unit 02:Completed  37000000 out of 50000000 steps (74%).
07:00:14:Unit 02:Completed  37500000 out of 50000000 steps (75%).
07:04:41:Unit 02:Completed  38000000 out of 50000000 steps (76%).
07:05:12:Unit 00:Completed 990000 out of 1500000 steps  (66%)
07:09:08:Unit 02:Completed  38500000 out of 50000000 steps (77%).
07:13:34:Unit 02:Completed  39000000 out of 50000000 steps (78%).
07:17:05:Unit 00:Completed 1005000 out of 1500000 steps  (67%)
07:18:00:Unit 02:Completed  39500000 out of 50000000 steps (79%).
07:22:27:Unit 02:Completed  40000000 out of 50000000 steps (80%).
07:26:54:Unit 02:Completed  40500000 out of 50000000 steps (81%).
07:28:58:Unit 00:Completed 1020000 out of 1500000 steps  (68%)
07:31:20:Unit 02:Completed  41000000 out of 50000000 steps (82%).
07:35:47:Unit 02:Completed  41500000 out of 50000000 steps (83%).
07:40:13:Unit 02:Completed  42000000 out of 50000000 steps (84%).
07:40:51:Unit 00:Completed 1035000 out of 1500000 steps  (69%)
07:44:38:Unit 02:Completed  42500000 out of 50000000 steps (85%).
07:49:04:Unit 02:Completed  43000000 out of 50000000 steps (86%).
07:52:45:Unit 00:Completed 1050000 out of 1500000 steps  (70%)
07:53:31:Unit 02:Completed  43500000 out of 50000000 steps (87%).
07:57:57:Unit 02:Completed  44000000 out of 50000000 steps (88%).
08:02:24:Unit 02:Completed  44500000 out of 50000000 steps (89%).
08:04:39:Unit 00:Completed 1065000 out of 1500000 steps  (71%)
08:06:49:Unit 02:Completed  45000000 out of 50000000 steps (90%).
08:11:15:Unit 02:Completed  45500000 out of 50000000 steps (91%).
08:15:41:Unit 02:Completed  46000000 out of 50000000 steps (92%).
08:16:32:Unit 00:Completed 1080000 out of 1500000 steps  (72%)
08:20:06:Unit 02:Completed  46500000 out of 50000000 steps (93%).
08:24:32:Unit 02:Completed  47000000 out of 50000000 steps (94%).
08:28:24:Unit 00:Completed 1095000 out of 1500000 steps  (73%)
08:28:58:Unit 02:Completed  47500000 out of 50000000 steps (95%).
08:33:24:Unit 02:Completed  48000000 out of 50000000 steps (96%).
08:33:46:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
08:33:46:Unit 01: Uploading 2.37MiB to 171.64.65.56
08:33:46:Connecting to 171.64.65.56:8080
08:34:07:WARNING: Exception: Failed to send results to work server: Upload failed
08:34:07:Trying to send results to collection server
08:34:07:Unit 01: Uploading 2.37MiB to 171.67.108.26
08:34:07:Connecting to 171.67.108.26:8080
08:34:08:WARNING: WorkServer connection failed on port 8080 trying 80
08:34:08:Connecting to 171.67.108.26:80
08:34:10:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
08:37:51:Unit 02:Completed  48500000 out of 50000000 steps (97%).
08:40:14:Unit 00:Completed 1110000 out of 1500000 steps  (74%)
08:42:17:Unit 02:Completed  49000000 out of 50000000 steps (98%).
08:46:43:Unit 02:Completed  49500000 out of 50000000 steps (99%).
08:46:44:Connecting to assign-GPU.stanford.edu:80
08:46:44:News: Welcome to Folding@Home
08:46:44:Assigned to work server 171.67.108.44
08:46:44:Requesting new work unit for slot 00: RUNNING gpu:0:"Juniper XT [AMD Radeon HD 6000 Series]" from 171.67.108.44
08:46:44:Connecting to 171.67.108.44:8080
08:46:45:Slot 00: Downloading 42.01KiB
08:46:45:Slot 00: Download complete
08:46:45:Received Unit: id:03 state:DOWNLOAD error:OK project:11293 run:3 clone:197 gen:0 core:0x16 unit:0x000000006652edbc4d92565cb2f3d434
08:51:09:Unit 02:Completed  50000000 out of 50000000 steps (100%).
08:51:34:Unit 02:Finished fah_main
08:51:34:Unit 02:
08:51:34:Unit 02:Successful run
08:51:34:Unit 02:DynamicWrapper: Finished Work Unit: sleep=10000
08:51:45:Unit 02:Reserved 2446072 bytes for xtc file; Cosm status=0
08:51:45:Unit 02:Allocated 2446072 bytes for xtc file
08:51:45:Unit 02:- Reading up to 2446072 from "02/wudata_01.xtc": Read 2446072
08:51:45:Unit 02:Read 2446072 bytes from xtc file; available packet space=783984392
08:51:45:Unit 02:xtc file hash check passed.
08:51:45:Unit 02:Reserved 75840 75840 783984392 bytes for arc file=<02/wudata_01.trr> Cosm status=0
08:51:45:Unit 02:Allocated 75840 bytes for arc file
08:51:45:Unit 02:- Reading up to 75840 from "02/wudata_01.trr": Read 75840
08:51:45:Unit 02:Read 75840 bytes from arc file; available packet space=783908552
08:51:45:Unit 02:trr file hash check passed.
08:51:45:Unit 02:Allocated 544 bytes for edr file
08:51:45:Unit 02:Read bedfile
08:51:45:Unit 02:edr file hash check passed.
08:51:45:Unit 02:Allocated 120211 bytes for logfile
08:51:45:Unit 02:Read logfile
08:51:45:Unit 02:GuardedRun: success in DynamicWrapper
08:51:45:Unit 02:GuardedRun: done
08:51:45:Unit 02:Run: GuardedRun completed.
08:51:48:Unit 02:+ Opened results file
08:51:48:Unit 02:- Writing 2643179 bytes of core data to disk...
08:51:48:Unit 02:Done: 2642667 -> 2487724 (compressed to 94.1 percent)
08:51:48:Unit 02:  ... Done.
08:51:49:Unit 02:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
08:51:49:Unit 02:Shutting down core 
08:51:49:Unit 02:
08:51:49:Unit 02:Folding@home Core Shutdown: FINISHED_UNIT
08:51:49:FahCore, running Unit 02, returned: FINISHED_UNIT (100 = 0x64)
08:51:49:Sending unit results: id:02 state:SEND error:OK project:11293 run:9 clone:190 gen:0 core:0x16 unit:0x000000006652edbc4d92579bdde64670
08:51:49:Unit 02: Uploading 2.37MiB to 171.67.108.44
08:51:49:Starting Unit 03
08:51:49:Connecting to 171.67.108.44:8080
08:51:49:Running core: C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_16.fah/FahCore_16.exe -dir 03 -suffix 01 -lifeline 3496 -version 701 -checkpoint 15 -gpu 0
08:51:49:Started core on PID 4368
08:51:49:FahCore 0x16 started
08:51:50:Unit 03:
08:51:50:Unit 03:*------------------------------*
08:51:50:Unit 03:Folding@Home GPU Core
08:51:50:Unit 03:Version 2.11 (Thu Dec 9 15:00:14 PST 2010)
08:51:50:Unit 03:
08:51:50:Unit 03:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 15.00.30729.01 for 80x86 
08:51:50:Unit 03:Build host: user-f6d030f24f
08:51:50:Unit 03:Board Type: AMD/OpenCL
08:51:50:Unit 03:Core      : x=16
08:51:50:Unit 03: Window's signal control handler registered.
08:51:50:Unit 03:Preparing to commence simulation
08:51:50:Unit 03:- Looking at optimizations...
08:51:50:Unit 03:DeleteFrameFiles: successfully deleted file=03/wudata_01.ckp
08:51:50:Unit 03:- Created dyn
08:51:50:Unit 03:- Files status OK
08:51:50:Unit 03:sizeof(CORE_PACKET_HDR) = 512 file=<>
08:51:50:Unit 03:- Expanded 42507 -> 171163 (decompressed 402.6 percent)
08:51:50:Unit 03:Called DecompressByteArray: compressed_data_size=42507 data_size=171163, decompressed_data_size=171163 diff=0
08:51:50:Unit 03:- Digital signature verified
08:51:50:Unit 03:
08:51:50:Unit 03:Project: 11293 (Run 3, Clone 197, Gen 0)
08:51:50:Unit 03:
08:51:50:Unit 03:Assembly optimizations on if available.
08:51:50:Unit 03:Entering M.D.
08:51:51:Unit 03:Tpr hash 03/wudata_01.tpr:  4102228708 1199708522 346979139 1815837837 4288333640
08:51:51:Unit 03:Working on ALZHEIMER DISEASE AMYLOID
08:51:51:Unit 03:Client config unavailable.
08:51:51:Unit 03:Starting GUI Server
08:51:55:Unit 02: 13.50%
08:51:57:Unit 03:Setting checkpoint frequency: 500000
08:51:57:Unit 03:Completed         3 out of 50000000 steps (0%).
08:52:01:Unit 02: 28.48%
08:52:05:Unit 00:Completed 1125000 out of 1500000 steps  (75%)
08:52:07:Unit 02: 43.46%
08:52:13:Unit 02: 58.60%
08:52:19:Unit 02: 73.58%
08:52:25:Unit 02: 88.73%
08:52:30:Unit 02: Upload complete
08:52:30:Server responded WORK_ACK (400)
08:52:30:Final credit estimate, 1835.00 points
08:52:30:Cleaning up Unit 02
08:55:58:Unit 03:Completed    500000 out of 50000000 steps (1%).
09:00:24:Unit 03:Completed   1000000 out of 50000000 steps (2%).
09:03:54:Unit 00:Completed 1140000 out of 1500000 steps  (76%)
09:04:51:Unit 03:Completed   1500000 out of 50000000 steps (3%).
09:09:17:Unit 03:Completed   2000000 out of 50000000 steps (4%).
09:13:44:Unit 03:Completed   2500000 out of 50000000 steps (5%).
09:15:44:Unit 00:Completed 1155000 out of 1500000 steps  (77%)
09:18:11:Unit 03:Completed   3000000 out of 50000000 steps (6%).
09:22:38:Unit 03:Completed   3500000 out of 50000000 steps (7%).
09:27:05:Unit 03:Completed   4000000 out of 50000000 steps (8%).
09:27:33:Unit 00:Completed 1170000 out of 1500000 steps  (78%)
09:31:31:Unit 03:Completed   4500000 out of 50000000 steps (9%).
09:35:58:Unit 03:Completed   5000000 out of 50000000 steps (10%).
09:39:21:Unit 00:Completed 1185000 out of 1500000 steps  (79%)
09:40:25:Unit 03:Completed   5500000 out of 50000000 steps (11%).
09:44:51:Unit 03:Completed   6000000 out of 50000000 steps (12%).
09:49:18:Unit 03:Completed   6500000 out of 50000000 steps (13%).
09:51:11:Unit 00:Completed 1200000 out of 1500000 steps  (80%)
09:53:44:Unit 03:Completed   7000000 out of 50000000 steps (14%).
09:58:10:Unit 03:Completed   7500000 out of 50000000 steps (15%).
10:02:37:Unit 03:Completed   8000000 out of 50000000 steps (16%).
10:03:00:Unit 00:Completed 1215000 out of 1500000 steps  (81%)
10:07:03:Unit 03:Completed   8500000 out of 50000000 steps (17%).
10:11:30:Unit 03:Completed   9000000 out of 50000000 steps (18%).
10:14:49:Unit 00:Completed 1230000 out of 1500000 steps  (82%)
10:15:57:Unit 03:Completed   9500000 out of 50000000 steps (19%).
10:20:23:Unit 03:Completed  10000000 out of 50000000 steps (20%).
10:24:50:Unit 03:Completed  10500000 out of 50000000 steps (21%).
10:26:39:Unit 00:Completed 1245000 out of 1500000 steps  (83%)
10:29:17:Unit 03:Completed  11000000 out of 50000000 steps (22%).
10:33:43:Unit 03:Completed  11500000 out of 50000000 steps (23%).
10:38:10:Unit 03:Completed  12000000 out of 50000000 steps (24%).
10:38:29:Unit 00:Completed 1260000 out of 1500000 steps  (84%)
10:42:36:Unit 03:Completed  12500000 out of 50000000 steps (25%).
10:47:03:Unit 03:Completed  13000000 out of 50000000 steps (26%).
10:50:18:Unit 00:Completed 1275000 out of 1500000 steps  (85%)
10:51:30:Unit 03:Completed  13500000 out of 50000000 steps (27%).
10:55:57:Unit 03:Completed  14000000 out of 50000000 steps (28%).
11:00:23:Unit 03:Completed  14500000 out of 50000000 steps (29%).
11:02:07:Unit 00:Completed 1290000 out of 1500000 steps  (86%)
11:04:50:Unit 03:Completed  15000000 out of 50000000 steps (30%).
11:09:16:Unit 03:Completed  15500000 out of 50000000 steps (31%).
11:13:43:Unit 03:Completed  16000000 out of 50000000 steps (32%).
11:13:56:Unit 00:Completed 1305000 out of 1500000 steps  (87%)
11:18:10:Unit 03:Completed  16500000 out of 50000000 steps (33%).
11:22:37:Unit 03:Completed  17000000 out of 50000000 steps (34%).
11:25:42:Unit 00:Completed 1320000 out of 1500000 steps  (88%)
11:27:04:Unit 03:Completed  17500000 out of 50000000 steps (35%).
11:31:30:Unit 03:Completed  18000000 out of 50000000 steps (36%).
11:35:57:Unit 03:Completed  18500000 out of 50000000 steps (37%).
11:37:31:Unit 00:Completed 1335000 out of 1500000 steps  (89%)
11:40:24:Unit 03:Completed  19000000 out of 50000000 steps (38%).
11:44:50:Unit 03:Completed  19500000 out of 50000000 steps (39%).
11:49:17:Unit 03:Completed  20000000 out of 50000000 steps (40%).
11:49:22:Unit 00:Completed 1350000 out of 1500000 steps  (90%)
11:52:46:Sending unit results: id:01 state:SEND error:OK project:11294 run:3 clone:365 gen:68 core:0x16 unit:0x0000004c0a3b1e5c4d9a1c804889b65c
11:52:46:Unit 01: Uploading 2.37MiB to 171.64.65.56
11:52:46:Connecting to 171.64.65.56:8080
11:53:09:WARNING: Exception: Failed to send results to work server: Upload failed
11:53:09:Trying to send results to collection server
11:53:09:Unit 01: Uploading 2.37MiB to 171.67.108.26
11:53:09:Connecting to 171.67.108.26:8080
11:53:11:WARNING: WorkServer connection failed on port 8080 trying 80
11:53:11:Connecting to 171.67.108.26:80
11:53:12:ERROR: Exception: Failed to connect to 171.67.108.26:80: No connection could be made because the target machine actively refused it.
11:53:44:Unit 03:Completed  20500000 out of 50000000 steps (41%).
11:58:11:Unit 03:Completed  21000000 out of 50000000 steps (42%).
12:01:14:Unit 00:Completed 1365000 out of 1500000 steps  (91%)
12:02:42:Unit 03:Completed  21500000 out of 50000000 steps (43%).
12:07:15:Unit 03:Completed  22000000 out of 50000000 steps (44%).
12:11:47:Unit 03:Completed  22500000 out of 50000000 steps (45%).
12:13:17:Unit 00:Completed 1380000 out of 1500000 steps  (92%)
12:16:15:Unit 03:Completed  23000000 out of 50000000 steps (46%).
12:20:46:Unit 03:Completed  23500000 out of 50000000 steps (47%).

Re: [Resolved] WS 171.64.65.56 and its CS not accepting WUs

Posted: Thu Dec 01, 2011 3:20 pm
by Hisuichan
Just came back from uni, 171.64.65.56 has now accepted my WU, case closed.

Re: [Resolved] WS 171.64.65.56 and its CS not accepting WUs

Posted: Fri Dec 02, 2011 2:29 am
by mattuconn
all good, just need a day or two.

WS 171.64.65.56 not accepting and not assigning WUs

Posted: Thu Dec 08, 2011 11:30 am
by Hisuichan
WS 171.64.65.56 is acting up again, and this time it seems the server is being incredibly busy. My v7 client has been trying to upload a WU there, same errors as in the log in my first post.

However, what's worse this time is that I can't get WUs at all either, and that since 3 AM (8 hours ago). I guess the server(s in the 171.64.65.x range) need(s) a restart.

Code: Select all

03:35:29:Connecting to assign-GPU.stanford.edu:80
03:35:30:News: Welcome to Folding@Home
03:35:30:Assigned to work server 171.64.65.56
03:35:30:Requesting new work unit for slot 01: RUNNING gpu:0:"Barts XT [ATI Radeon HD 6800 Series]" from 171.64.65.56
03:35:30:Connecting to 171.64.65.56:8080
03:36:00:ERROR: Exception: 10002: Received short response, expected 512 bytes, got 0
Is there only one server my HD 6870 gets assigned to? I mean 99% of the WUs that have been assigned to me in the last 2 months were p11294, the rest was p11293, so it gets rather... boring. Either way, my GPU has been sitting idle for too long now, it wants to crunch some stuff.

Edit: After 9 hours, I finally got assigned to a different server, and I got a... 11293! :eo

Code: Select all

12:05:41:Connecting to assign-GPU.stanford.edu:80
12:05:51:News: Welcome to Folding@Home
12:05:51:Assigned to work server 171.67.108.44
12:05:51:Requesting new work unit for slot 01: READY gpu:0:"Barts XT [ATI Radeon HD 6800 Series]" from 171.67.108.44
12:05:51:Connecting to 171.67.108.44:8080
12:06:01:Slot 01: Downloading 42.27KiB
12:06:01:Slot 01: Download complete
12:06:01:Received Unit: id:03 state:DOWNLOAD error:OK project:11293 run:0 clone:210 gen:0 core:0x16 unit:0x000000006652edbc4d9255ca47ad02fd
12:06:01:Starting Unit 03

Re: WS 171.64.65.56 not accepting/assigning WUs

Posted: Fri Dec 09, 2011 2:11 am
by Grandpa_01
Could somebody go and kick this server I have 3 ATI WU's waiting for it to be kicked. As you can see the computer has no problem connecting to other CS or AS just this one.

Code: Select all

01:37:05:News: Welcome to Folding@Home
01:37:05:Assigned to work server 171.64.65.56
01:37:05:Requesting new work unit for slot 01: READY gpu:0:"Broadway XT [Mobility Radeon HD 5800 Series]" from 171.64.65.56
01:37:05:Connecting to 171.64.65.56:8080
01:37:26:WARNING: WorkServer connection failed on port 8080 trying 80
01:37:26:Connecting to 171.64.65.56:80
01:37:47:ERROR: Exception: Failed to connect to 171.64.65.56:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
01:38:05:Connecting to assign-GPU.stanford.edu:80
01:38:05:News: Welcome to Folding@Home
01:38:05:Assigned to work server 171.64.65.56
01:38:05:Requesting new work unit for slot 01: READY gpu:0:"Broadway XT [Mobility Radeon HD 5800 Series]" from 171.64.65.56
01:38:05:Connecting to 171.64.65.56:8080
01:38:26:WARNING: WorkServer connection failed on port 8080 trying 80
01:38:26:Connecting to 171.64.65.56:80
01:38:47:ERROR: Exception: Failed to connect to 171.64.65.56:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
01:39:42:Connecting to assign-GPU.stanford.edu:80
01:39:42:News: Welcome to Folding@Home
01:39:42:Assigned to work server 171.64.65.56
01:39:42:Requesting new work unit for slot 01: READY gpu:0:"Broadway XT [Mobility Radeon HD 5800 Series]" from 171.64.65.56
01:39:42:Connecting to 171.64.65.56:8080
01:40:15:ERROR: Exception: 10002: Received short response, expected 512 bytes, got 0
01:41:45:Unit 02:Completed 990000 out of 1000000 steps  (99%)
01:41:46:Connecting to assign3.stanford.edu:8080
01:41:46:News: Welcome to Folding@Home
01:41:46:Assigned to work server 171.67.108.58
01:41:46:Requesting new work unit for slot 00: RUNNING smp:6 from 171.67.108.58
01:41:46:Connecting to 171.67.108.58:8080
01:41:46:Slot 00: Downloading 671.18KiB
01:41:47:Slot 00: Download complete
01:41:47:Received Unit: id:04 state:DOWNLOAD error:OK project:8001 run:0 clone:0 gen:0 core:0xa4 unit:0x000000016652edca4eded7319a05f949
01:41:47:Downloading core from http://www.stanford.edu/~pande/Win32/AMD64/beta/Core_a4.fah
01:41:47:Connecting to www.stanford.edu:80
01:41:47:FahCore a4: Downloading 2.89MiB
01:41:51:FahCore a4: Download complete
01:41:51:Valid core signature
01:41:52:Unpacked 9.59MiB to cores/www.stanford.edu/~pande/Win32/AMD64/beta/Core_a4.fah/FahCore_a4.exe
01:42:20:Connecting to assign-GPU.stanford.edu:80
01:42:20:News: Welcome to Folding@Home
01:42:20:Assigned to work server 171.64.65.56
01:42:20:Requesting new work unit for slot 01: READY gpu:0:"Broadway XT [Mobility Radeon HD 5800 Series]" from 171.64.65.56
01:42:20:Connecting to 171.64.65.56:8080
01:42:41:WARNING: WorkServer connection failed on port 8080 trying 80
01:42:41:Connecting to 171.64.65.56:80
01:43:00:ERROR: Exception: 10002: Received short response, expected 512 bytes, got 0
01:43:00:Slot 00 finishing
01:43:00:Slot 01 finishing
01:50:11:Unit 02:Completed 1000000 out of 1000000 steps  (100%)
01:50:13:Unit 02:DynamicWrapper: Finished Work Unit: sleep=10000
01:50:23:Unit 02:
01:50:23:Unit 02:Finished Work Unit:
01:50:23:Unit 02:- Reading up to 9346392 from "02/wudata_01.trr": Read 9346392
01:50:23:Unit 02:trr file hash check passed.
01:50:23:Unit 02:- Reading up to 1416776 from "02/wudata_01.xtc": Read 1416776
01:50:23:Unit 02:xtc file hash check passed.
01:50:23:Unit 02:edr file hash check passed.
01:50:23:Unit 02:logfile size: 29434
01:50:23:Unit 02:Leaving Run
01:50:23:Unit 02:- Writing 10799966 bytes of core data to disk...
01:50:26:Unit 02:Done: 10799454 -> 10309629 (compressed to 95.4 percent)
01:50:26:Unit 02:  ... Done.
01:50:29:Unit 02:- Shutting down core
01:50:29:Unit 02:
01:50:29:Unit 02:Folding@home Core Shutdown: FINISHED_UNIT
01:50:30:FahCore, running Unit 02, returned: FINISHED_UNIT (100 = 0x64)
01:50:30:Sending unit results: id:02 state:SEND error:OK project:11051 run:0 clone:32 gen:23 core:0xa3 unit:0x000000190a3b1e5b4db73f63fb119147
01:50:30:Unit 02: Uploading 9.83MiB to 171.64.65.55
01:50:30:Connecting to 171.64.65.55:8080
01:50:36:Unit 02: 9.30%
01:50:42:Unit 02: 20.06%
01:50:48:Unit 02: 30.71%
01:50:54:Unit 02: 41.24%
01:51:00:Unit 02: 51.88%
01:51:06:Unit 02: 62.61%
01:51:12:Unit 02: 73.10%
01:51:18:Unit 02: 83.75%
01:51:24:Unit 02: 94.12%
01:51:29:Unit 02: Upload complete
01:51:29:Server responded WORK_ACK (400)
01:51:29:Final credit estimate, 3512.00 points
01:51:29:Cleaning up Unit 02

Re: WS 171.64.65.56 not accepting/assigning WUs

Posted: Fri Dec 09, 2011 3:24 am
by Hisuichan
Aaaand I'm being assigned to this server again. 24 hours later, server still ailing, can't upload, can't get WUs.
I wonder why I didn't keep getting assigned to the other server that gave me 11293 WUs, or any other server that is reliably working, instead of forcing me into an neverending loop that just makes that poor .56 server ache more.
(Also Grandpa_01, you just need one more post to see magic happening.)

Re: WS 171.64.65.56 not accepting/assigning WUs

Posted: Fri Dec 09, 2011 4:24 am
by bruce
I'd be happy to kick the server, but Stanford is 400 miles away and only the Pande Group has ssh passwords (and I wouldn't know what to do with one if I had one). I have attempted to notify the right people.

Re: WS 171.64.65.56 not accepting/assigning WUs

Posted: Fri Dec 09, 2011 8:11 am
by Grandpa_01
Thanks bruce that is a kick in my book. :lol:

Re: WS 171.64.65.56 not accepting/assigning WUs

Posted: Fri Dec 09, 2011 1:39 pm
by Hisuichan
Well the server's not busy anymore, reached a peak of 210-something yesterday and is back to <1 levels - however, it is actively rejecting connections now. щ(゚Д゚щ)

Dealing with new stuff every day, so bear with my (maybe impatient-sounding) posts - anything that can be done there, or is it just a matter of waiting that needs no reporting?

Re: WS 171.64.65.56 not accepting/assigning WUs

Posted: Fri Dec 09, 2011 6:08 pm
by bruce
I've reported it again. Its status changed to rejecting connections at about the same time the issue with CPU_LOAD was resolved. :(

Re: WS 171.64.65.56 not accepting/assigning WUs

Posted: Fri Dec 09, 2011 7:02 pm
by Hisuichan
Thanks bruce, the server is accepting connections now. :)

Re: WS 171.64.65.56 not accepting/assigning WUs

Posted: Sun Dec 11, 2011 5:14 am
by Hisuichan
Well, seems a lot of servers are back to rejecting connections again, including - you guessed it - our old friend 65.56 here! :(