Failed to connect to 171.64.65.104:80
Posted: Sat Mar 12, 2016 3:08 pm
by DarkFoss
I'm guessing it is down again? My computer has been trying to send in a WU since yesterday.
Code:
04:53:50:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:9211 run:2 clone:11 gen:22 core:0x21 unit:0x00000075664f2dd055ee292a0f6788f6
04:53:51:WU01:FS00:Uploading 17.50MiB to 171.64.65.104
04:53:51:WU01:FS00:Connecting to 171.64.65.104:8080
04:53:52:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
04:53:52:WU01:FS00:Connecting to 171.64.65.104:80
04:53:53:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.64.65.104:80: No connection could be made because the target machine actively refused it.
04:53:53:WU01:FS00:Trying to send results to collection server
04:53:53:WU01:FS00:Uploading 17.50MiB to 171.65.103.160
04:53:53:WU01:FS00:Connecting to 171.65.103.160:8080
04:53:59:WU01:FS00:Upload 21.43%
04:54:05:WU01:FS00:Upload 46.07%
04:54:11:WU01:FS00:Upload 70.71%
04:54:17:WU01:FS00:Upload 95.00%
04:54:18:WU01:FS00:Upload complete
04:54:18:WU01:FS00:Server responded PLEASE_WAIT (464)
04:54:18:WARNING:WU01:FS00:Failed to send results, will try again later
Re: 171.65.103.160 Server responded PLEASE_WAIT
Posted: Sun Mar 13, 2016 5:15 am
by bruce
There are two problems here.
04:53:51:WU01:FS00:Uploading 17.50MiB to 171.64.65.104
The first failure should have been reported against 171.64.65.104, which is the primary work server for your project.
The second failure is a report from a Collection Server which is no longer in service.
You've delayed the fix by reporting it against the wrong server (171.65.103.160).
I've contacted the owner of the WS and that will be fixed. When it's fixed, the CS will not be needed so there will be no error for 171.65.103.160.
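To see which server each failure belongs to, a log excerpt can be scanned for the address the client was connecting to when each warning appeared. The following is only a rough Python sketch: the function name `failures_by_server` and the regular expressions are assumptions based on the log lines quoted in this thread, not anything provided by FAHClient.

```python
import re

# Patterns are assumptions based on the log lines quoted in this thread.
CONNECT = re.compile(r"Connecting to (\d+\.\d+\.\d+\.\d+):(\d+)")
FAILURE = re.compile(r"(Failed to connect to|Server responded PLEASE_WAIT)")

def failures_by_server(log_text):
    """Return (ip, reason) pairs, pairing each failure with the most
    recent 'Connecting to' address seen before it."""
    current_ip = None
    found = []
    for line in log_text.splitlines():
        m = CONNECT.search(line)
        if m:
            current_ip = m.group(1)
            continue
        m = FAILURE.search(line)
        if m and current_ip is not None:
            reason = "refused" if m.group(1).startswith("Failed") else "PLEASE_WAIT"
            found.append((current_ip, reason))
    return found

# Lines copied from the first log in this thread.
sample = """\
04:53:51:WU01:FS00:Connecting to 171.64.65.104:8080
04:53:52:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
04:53:52:WU01:FS00:Connecting to 171.64.65.104:80
04:53:53:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.64.65.104:80: No connection could be made because the target machine actively refused it.
04:53:53:WU01:FS00:Connecting to 171.65.103.160:8080
04:54:18:WU01:FS00:Server responded PLEASE_WAIT (464)
"""
print(failures_by_server(sample))
# -> [('171.64.65.104', 'refused'), ('171.65.103.160', 'PLEASE_WAIT')]
```

The first pair is the work server failure that needs reporting; the second is only the fallback to the collection server.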
Re: Failed to connect to 171.64.65.104:80
Posted: Sun Mar 13, 2016 6:36 pm
by DarkFoss
I'm sorry for the errors on my part. Thank you Bruce for the corrections, I'll try to be more attentive in the future.
Re: Failed to connect to 171.64.65.104:80
Posted: Sun Mar 13, 2016 8:32 pm
by kwerboom
I'm having a similar problem with the same servers. What's wrong with the server? What's going to happen to the completed work unit that can't upload?
Code:
20:05:41:WU00:FS01:0x21:Completed 2500000 out of 2500000 steps (100%)
20:05:45:WU00:FS01:0x21:Saving result file logfile_01.txt
20:05:45:WU00:FS01:0x21:Saving result file checkpointState.xml
20:05:47:WU00:FS01:0x21:Saving result file checkpt.crc
20:05:47:WU00:FS01:0x21:Saving result file log.txt
20:05:47:WU00:FS01:0x21:Saving result file positions.xtc
20:05:48:WU00:FS01:0x21:Folding@home Core Shutdown: FINISHED_UNIT
20:05:49:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
20:05:49:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:9208 run:1 clone:20 gen:33 core:0x21 unit:0x00000089664f2dd055edd3e58f603712
20:05:49:WU00:FS01:Uploading 17.50MiB to 171.64.65.104
20:05:49:WU00:FS01:Connecting to 171.64.65.104:8080
20:05:50:WARNING:WU00:FS01:WorkServer connection failed on port 8080 trying 80
20:05:50:WU00:FS01:Connecting to 171.64.65.104:80
20:05:51:WARNING:WU00:FS01:Exception: Failed to send results to work server: Failed to connect to 171.64.65.104:80: No connection could be made because the target machine actively refused it.
20:05:51:WU00:FS01:Trying to send results to collection server
20:05:51:WU00:FS01:Uploading 17.50MiB to 171.65.103.160
20:05:51:WU00:FS01:Connecting to 171.65.103.160:8080
20:05:57:WU00:FS01:Upload 3.21%
20:06:03:WU00:FS01:Upload 6.43%
20:06:09:WU00:FS01:Upload 9.29%
20:06:15:WU00:FS01:Upload 12.14%
20:06:21:WU00:FS01:Upload 15.36%
20:06:27:WU00:FS01:Upload 18.21%
20:06:33:WU00:FS01:Upload 21.43%
20:06:39:WU00:FS01:Upload 24.64%
20:06:45:WU00:FS01:Upload 27.50%
20:06:51:WU00:FS01:Upload 30.72%
20:06:57:WU00:FS01:Upload 32.50%
20:07:03:WU00:FS01:Upload 35.72%
20:07:09:WU00:FS01:Upload 38.57%
20:07:15:WU00:FS01:Upload 41.43%
20:07:21:WU00:FS01:Upload 44.29%
20:07:28:WU00:FS01:Upload 47.14%
20:07:34:WU00:FS01:Upload 50.72%
20:07:40:WU00:FS01:Upload 53.57%
20:07:46:WU00:FS01:Upload 56.79%
20:07:52:WU00:FS01:Upload 59.64%
20:07:58:WU00:FS01:Upload 62.86%
20:08:04:WU00:FS01:Upload 65.72%
20:08:10:WU00:FS01:Upload 68.93%
20:08:16:WU00:FS01:Upload 71.79%
20:08:22:WU00:FS01:Upload 74.64%
20:08:28:WU00:FS01:Upload 77.50%
20:08:34:WU00:FS01:Upload 80.36%
20:08:40:WU00:FS01:Upload 83.57%
20:08:46:WU00:FS01:Upload 85.72%
20:08:52:WU00:FS01:Upload 88.93%
20:08:58:WU00:FS01:Upload 91.43%
20:09:04:WU00:FS01:Upload 94.65%
20:09:10:WU00:FS01:Upload 97.50%
20:09:16:WU00:FS01:Upload complete
20:09:16:WU00:FS01:Server responded PLEASE_WAIT (464)
20:09:16:WARNING:WU00:FS01:Failed to send results, will try again later
20:09:17:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:9208 run:1 clone:20 gen:33 core:0x21 unit:0x00000089664f2dd055edd3e58f603712
20:09:17:WU00:FS01:Uploading 17.50MiB to 171.64.65.104
20:09:17:WU00:FS01:Connecting to 171.64.65.104:8080
20:09:18:WARNING:WU00:FS01:WorkServer connection failed on port 8080 trying 80
20:09:18:WU00:FS01:Connecting to 171.64.65.104:80
20:09:19:WARNING:WU00:FS01:Exception: Failed to send results to work server: Failed to connect to 171.64.65.104:80: No connection could be made because the target machine actively refused it.
20:09:19:WU00:FS01:Trying to send results to collection server
20:09:19:WU00:FS01:Uploading 17.50MiB to 171.65.103.160
20:09:19:WU00:FS01:Connecting to 171.65.103.160:8080
Re: Failed to connect to 171.64.65.104:80
Posted: Mon Mar 14, 2016 1:15 am
by jadeshi
Sorry about this, guys. The WS went down for unknown reasons. I'm working on this now, and I'll give you guys an update when this is dealt with.
Re: Failed to connect to 171.64.65.104:80
Posted: Mon Mar 14, 2016 5:43 am
by bruce
It has been a few hours, and I'll report what I observe: (It's not fixed yet so, as expected, jadeshi has not provided an update.)
If you look at serverstat you'll notice that vspg14b (AKA 171.64.65.104) is reported as "Reject" in the connections column. As long as it's rejecting connections, no uploads or downloads can take place. Once it has been fixed, that'll change to "Accepting."
As far as what will happen to the WUs is concerned, I expect that they'll be uploaded once that status changes.
Since it's late Sunday night at Stanford, it'll probably be another half day (night) before it changes unless it requires a significant hardware repair.
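The "actively refused" wording in the client logs is what a TCP client sees when nothing is listening on the port, which is consistent with the "Reject" status. Below is a minimal sketch of checking this yourself, trying port 8080 and then 80 in the same order the client log shows. The helper name `probe` and the 5-second timeout are assumptions, and it only tests whether a TCP connection opens, not whether the server would accept a WU.

```python
import socket

def probe(host, ports=(8080, 80), timeout=5.0, connect=socket.create_connection):
    """Try each port in order; return {port: status} where status is
    'open', 'refused', or 'unreachable'."""
    results = {}
    for port in ports:
        try:
            conn = connect((host, port), timeout)
        except ConnectionRefusedError:
            results[port] = "refused"       # nothing listening on that port
        except OSError:
            results[port] = "unreachable"   # timeout, no route, DNS failure, ...
        else:
            conn.close()
            results[port] = "open"
    return results
```

Calling `probe("171.64.65.104")` returns one status per port. The `connect` parameter exists only so the function can be exercised without touching the network.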
Problems uploading to 171.65.103.160
Posted: Wed Mar 16, 2016 4:40 am
by Nick200
Hi
Have been having difficulties for three days now trying to upload a completed WU to 171.65.103.160.
I have checked the server page and that says VSPMF93 is accepting. But not in my case!
I have been watching the credit value of the WU drop from over 60K to an estimated 45K. I will give up and dump this WU soon.
Log file attached below
Any suggestions?
Nick200
Code:
*********************** Log Started 2016-03-16T04:31:02Z ***********************
04:31:02:************************* Folding@home Client *************************
04:31:02: Website: http://folding.stanford.edu/
04:31:02: Copyright: (c) 2009-2014 Stanford University
04:31:02: Author: Joseph Coffland <[email protected]>
04:31:02: Args: --open-web-control
04:31:02: Config: C:/Users/nickm/AppData/Roaming/FAHClient/config.xml
04:31:02:******************************** Build ********************************
04:31:02: Version: 7.4.4
04:31:02: Date: Mar 4 2014
04:31:02: Time: 20:26:54
04:31:02: SVN Rev: 4130
04:31:02: Branch: fah/trunk/client
04:31:02: Compiler: Intel(R) C++ MSVC 1500 mode 1200
04:31:02: Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
04:31:02: /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
04:31:02: Platform: win32 XP
04:31:02: Bits: 32
04:31:02: Mode: Release
04:31:02:******************************* System ********************************
04:31:02: CPU: Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz
04:31:02: CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
04:31:02: CPUs: 8
04:31:02: Memory: 15.85GiB
04:31:02: Free Memory: 9.37GiB
04:31:02: Threads: WINDOWS_THREADS
04:31:02: OS Version: 6.2
04:31:02: Has Battery: false
04:31:02: On Battery: false
04:31:02: UTC Offset: 13
04:31:02: PID: 15348
04:31:02: CWD: C:/Users/nickm/AppData/Roaming/FAHClient
04:31:02: OS: Windows 10 Pro Insider Preview
04:31:02: OS Arch: AMD64
04:31:02: GPUs: 2
04:31:02: GPU 0: NVIDIA:5 GM204 [GeForce GTX 980]
04:31:02: GPU 1: NVIDIA:5 GM204 [GeForce GTX 980]
04:31:02: CUDA: 5.2
04:31:02: CUDA Driver: 8000
04:31:02:Win32 Service: false
04:31:02:***********************************************************************
04:31:02:<config>
04:31:02: <!-- Slot Control -->
04:31:02: <power v='full'/>
04:31:02:
04:31:02: <!-- User Information -->
04:31:02: <passkey v='********************************'/>
04:31:02: <team v='142900'/>
04:31:02: <user v='Montague-Cripps'/>
04:31:02:
04:31:02: <!-- Folding Slots -->
04:31:02: <slot id='0' type='CPU'>
04:31:02: <paused v='true'/>
04:31:02: </slot>
04:31:02: <slot id='1' type='GPU'>
04:31:02: <paused v='true'/>
04:31:02: </slot>
04:31:02: <slot id='2' type='GPU'>
04:31:02: <paused v='true'/>
04:31:02: </slot>
04:31:02:</config>
04:31:02:Trying to access database...
04:31:02:Successfully acquired database lock
04:31:02:Enabled folding slot 00: PAUSED cpu:6 (by user)
04:31:02:Enabled folding slot 01: PAUSED gpu:0:GM204 [GeForce GTX 980] (by user)
04:31:02:Enabled folding slot 02: PAUSED gpu:1:GM204 [GeForce GTX 980] (by user)
04:31:02:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:9209 run:0 clone:58 gen:65 core:0x21 unit:0x000000d1664f2dd055edef78da0b6f35
04:31:05:WU01:FS01:Uploading 17.50MiB to 171.64.65.104
04:31:05:WU01:FS01:Connecting to 171.64.65.104:8080
04:31:07:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
04:31:07:WU01:FS01:Connecting to 171.64.65.104:80
04:31:09:WARNING:WU01:FS01:Exception: Failed to send results to work server: Failed to connect to 171.64.65.104:80: No connection could be made because the target machine actively refused it.
04:31:09:WU01:FS01:Trying to send results to collection server
04:31:09:WU01:FS01:Uploading 17.50MiB to 171.65.103.160
04:31:09:WU01:FS01:Connecting to 171.65.103.160:8080
04:31:10:FS00:Unpaused
04:31:10:FS01:Unpaused
04:31:10:FS02:Unpaused
04:31:10:WU03:FS00:Starting
04:31:10:WU03:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/nickm/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 03 -suffix 01 -version 704 -lifeline 15348 -checkpoint 15 -np 6
04:31:10:WU03:FS00:Started FahCore on PID 15060
04:31:10:WU03:FS00:Core PID:14924
04:31:10:WU03:FS00:FahCore 0xa4 started
04:31:11:WU03:FS00:0xa4:
04:31:11:WU03:FS00:0xa4:*------------------------------*
04:31:11:WU03:FS00:0xa4:Folding@Home Gromacs GB Core
04:31:11:WU03:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
04:31:11:WU03:FS00:0xa4:
04:31:11:WU03:FS00:0xa4:Preparing to commence simulation
04:31:11:WU03:FS00:0xa4:- Looking at optimizations...
04:31:11:WU04:FS02:Starting
04:31:11:WU04:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/nickm/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 04 -suffix 01 -version 704 -lifeline 15348 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
04:31:11:WU04:FS02:Started FahCore on PID 14480
04:31:11:WU04:FS02:Core PID:15144
04:31:11:WU04:FS02:FahCore 0x18 started
04:31:12:WU03:FS00:0xa4:- Files status OK
04:31:12:WU03:FS00:0xa4:- Expanded 1216019 -> 2903220 (decompressed 238.7 percent)
04:31:12:WU03:FS00:0xa4:Called DecompressByteArray: compressed_data_size=1216019 data_size=2903220, decompressed_data_size=2903220 diff=0
04:31:12:WU03:FS00:0xa4:- Digital signature verified
04:31:12:WU03:FS00:0xa4:
04:31:12:WU03:FS00:0xa4:Project: 11622 (Run 0, Clone 213, Gen 28)
04:31:12:WU03:FS00:0xa4:
04:31:12:WU03:FS00:0xa4:Assembly optimizations on if available.
04:31:12:WU00:FS01:Starting
04:31:12:WU03:FS00:0xa4:Entering M.D.
04:31:12:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/nickm/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 00 -suffix 01 -version 704 -lifeline 15348 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
04:31:12:WU00:FS01:Started FahCore on PID 3544
04:31:12:WU00:FS01:Core PID:4056
04:31:12:WU00:FS01:FahCore 0x18 started
04:31:12:WU04:FS02:0x18:*********************** Log Started 2016-03-16T04:31:11Z ***********************
04:31:12:WU04:FS02:0x18:Project: 9159 (Run 130, Clone 0, Gen 171)
04:31:12:WU04:FS02:0x18:Unit: 0x000000baab404154567457c19ec0aa15
04:31:12:WU04:FS02:0x18:CPU: 0x00000000000000000000000000000000
04:31:12:WU04:FS02:0x18:Machine: 2
04:31:12:WU04:FS02:0x18:Digital signatures verified
04:31:12:WU04:FS02:0x18:Folding@home GPU core18
04:31:12:WU04:FS02:0x18:Version 0.0.4
04:31:12:WU04:FS02:0x18: Found a checkpoint file
04:31:12:WU00:FS01:0x18:*********************** Log Started 2016-03-16T04:31:12Z ***********************
04:31:12:WU00:FS01:0x18:Project: 9137 (Run 6, Clone 0, Gen 383)
04:31:12:WU00:FS01:0x18:Unit: 0x000001b60a3b1e61556647c09f8fdb8e
04:31:12:WU00:FS01:0x18:CPU: 0x00000000000000000000000000000000
04:31:12:WU00:FS01:0x18:Machine: 1
04:31:12:WU00:FS01:0x18:Digital signatures verified
04:31:12:WU00:FS01:0x18:Folding@home GPU core18
04:31:12:WU00:FS01:0x18:Version 0.0.4
04:31:12:WU00:FS01:0x18: Found a checkpoint file
04:31:15:WU01:FS01:Upload 7.50%
04:31:18:WU03:FS00:0xa4:Using Gromacs checkpoints
04:31:18:WU03:FS00:0xa4:Mapping NT from 6 to 6
04:31:18:13:127.0.0.1:New Web connection
04:31:18:WU03:FS00:0xa4:Resuming from checkpoint
04:31:18:WU03:FS00:0xa4:Verified 03/wudata_01.log
04:31:18:WU03:FS00:0xa4:Verified 03/wudata_01.trr
04:31:19:WU03:FS00:0xa4:Verified 03/wudata_01.xtc
04:31:19:WU03:FS00:0xa4:Verified 03/wudata_01.edr
04:31:19:WU03:FS00:0xa4:Completed 1012840 out of 1250000 steps (81%)
04:31:21:WU01:FS01:Upload 15.36%
04:31:27:WU01:FS01:Upload 24.29%
04:31:33:WU01:FS01:Upload 32.15%
04:31:33:WU04:FS02:0x18:Completed 600000 out of 2500000 steps (24%)
04:31:33:WU04:FS02:0x18:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
04:31:39:WU00:FS01:0x18:Completed 700000 out of 2500000 steps (28%)
04:31:39:WU00:FS01:0x18:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
04:31:39:WU01:FS01:Upload 40.36%
04:31:45:WU01:FS01:Upload 44.29%
04:31:51:WU01:FS01:Upload 51.79%
04:31:58:WU01:FS01:Upload 56.79%
04:32:03:Removing old file 'configs/config-20160305-002239.xml'
04:32:03:Saving configuration to config.xml
04:32:03:<config>
04:32:03: <!-- Slot Control -->
04:32:03: <power v='full'/>
04:32:03:
04:32:03: <!-- User Information -->
04:32:03: <passkey v='********************************'/>
04:32:03: <team v='142900'/>
04:32:03: <user v='Montague-Cripps'/>
04:32:03:
04:32:03: <!-- Folding Slots -->
04:32:03: <slot id='0' type='CPU'/>
04:32:03: <slot id='1' type='GPU'/>
04:32:03: <slot id='2' type='GPU'/>
04:32:03:</config>
04:32:04:WU01:FS01:Upload 62.86%
04:32:10:WU01:FS01:Upload 67.51%
04:32:16:WU01:FS01:Upload 74.29%
04:32:22:WU01:FS01:Upload 82.87%
04:32:28:WU01:FS01:Upload 91.44%
04:32:34:WU01:FS01:Upload 98.94%
04:32:35:WU01:FS01:Upload complete
04:32:36:WU01:FS01:Server responded PLEASE_WAIT (464)
04:32:36:WARNING:WU01:FS01:Failed to send results, will try again later
04:32:36:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:9209 run:0 clone:58 gen:65 core:0x21 unit:0x000000d1664f2dd055edef78da0b6f35
04:32:36:WU01:FS01:Uploading 17.50MiB to 171.64.65.104
04:32:36:WU01:FS01:Connecting to 171.64.65.104:8080
04:32:38:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
04:32:38:WU01:FS01:Connecting to 171.64.65.104:80
04:32:40:WARNING:WU01:FS01:Exception: Failed to send results to work server: Failed to connect to 171.64.65.104:80: No connection could be made because the target machine actively refused it.
04:32:40:WU01:FS01:Trying to send results to collection server
04:32:40:WU01:FS01:Uploading 17.50MiB to 171.65.103.160
04:32:40:WU01:FS01:Connecting to 171.65.103.160:8080
04:32:46:WU01:FS01:Upload 8.22%
04:32:52:WU01:FS01:Upload 15.72%
04:32:56:WU04:FS02:0x18:Completed 625000 out of 2500000 steps (25%)
04:32:58:WU01:FS01:Upload 23.93%
04:33:04:WU01:FS01:Upload 32.15%
04:33:10:WU01:FS01:Upload 38.58%
04:33:16:WU01:FS01:Upload 45.36%
04:33:22:WU01:FS01:Upload 50.72%
04:33:22:WU00:FS01:0x18:Completed 725000 out of 2500000 steps (29%)
04:33:28:WU01:FS01:Upload 59.29%
04:33:34:WU01:FS01:Upload 67.15%
04:33:40:WU01:FS01:Upload 75.37%
04:33:46:WU01:FS01:Upload 83.94%
04:33:52:WU01:FS01:Upload 91.44%
04:33:58:WU01:FS01:Upload 100.00%
04:33:59:WU01:FS01:Upload complete
04:33:59:WU01:FS01:Server responded PLEASE_WAIT (464)
04:33:59:WARNING:WU01:FS01:Failed to send results, will try again later
04:34:12:WU04:FS02:0x18:Completed 650000 out of 2500000 steps (26%)
04:34:13:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:9209 run:0 clone:58 gen:65 core:0x21 unit:0x000000d1664f2dd055edef78da0b6f35
04:34:13:WU01:FS01:Uploading 17.50MiB to 171.64.65.104
04:34:13:WU01:FS01:Connecting to 171.64.65.104:8080
04:34:15:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
04:34:15:WU01:FS01:Connecting to 171.64.65.104:80
04:34:17:WARNING:WU01:FS01:Exception: Failed to send results to work server: Failed to connect to 171.64.65.104:80: No connection could be made because the target machine actively refused it.
04:34:17:WU01:FS01:Trying to send results to collection server
04:34:17:WU01:FS01:Uploading 17.50MiB to 171.65.103.160
04:34:17:WU01:FS01:Connecting to 171.65.103.160:8080
04:34:23:WU01:FS01:Upload 6.43%
04:34:29:WU01:FS01:Upload 13.57%
04:34:35:WU01:FS01:Upload 20.36%
04:34:41:WU01:FS01:Upload 28.93%
04:34:47:WU01:FS01:Upload 37.50%
04:34:53:WU01:FS01:Upload 44.29%
04:34:56:WU00:FS01:0x18:Completed 750000 out of 2500000 steps (30%)
04:34:59:WU01:FS01:Upload 51.79%
04:35:05:WU01:FS01:Upload 59.65%
04:35:11:WU01:FS01:Upload 67.86%
04:35:17:WU01:FS01:Upload 72.87%
04:35:23:WU01:FS01:Upload 78.94%
04:35:29:WU01:FS01:Upload 86.44%
04:35:29:WU04:FS02:0x18:Completed 675000 out of 2500000 steps (27%)
04:35:35:WU01:FS01:Upload 90.72%
04:35:41:WU01:FS01:Upload 99.65%
04:35:42:WU01:FS01:Upload complete
04:35:42:WU01:FS01:Server responded PLEASE_WAIT (464)
04:35:42:WARNING:WU01:FS01:Failed to send results, will try again later
Mod edit: Merged with correct topic, please report the problem WS - 171.64.65.104
Re: Failed to connect to 171.64.65.104:80
Posted: Wed Mar 16, 2016 1:45 pm
by gupsterg
Hi,
I've been getting "reject" on 171.64.65.104 for several days. Will the WU waiting to upload get rejected due to the deadline?
Re: Failed to connect to 171.64.65.104:80
Posted: Wed Mar 16, 2016 3:05 pm
by Joe_H
gupsterg wrote: Hi,
I've been getting "reject" on 171.64.65.104 for several days. Will the WU waiting to upload get rejected due to the deadline?
The preferred deadline for projects on this server is listed as 7 days, and the final as 10 days. So as long as the repairs needed are completed before the final deadline is reached for the WU on your system, it will get accepted after the WS is back up.
Re: Failed to connect to 171.64.65.104:80
Posted: Thu Mar 17, 2016 6:21 pm
by Seagull181005
Not sure if it is related or not but I just got a "Server did not like results, dumping" warning from this server. The WU in question missed the timeout deadline by less than an hour.
Code:
*********************** Log Started 2016-03-15T11:08:14Z ***********************
11:08:14:************************* Folding@home Client *************************
11:08:14: Website: http://folding.stanford.edu/
11:08:14: Copyright: (c) 2009-2014 Stanford University
11:08:14: Author: Joseph Coffland <[email protected]>
11:08:14: Args: --child --lifeline 1166 /etc/fahclient/config.xml --run-as
11:08:14: fahclient --pid-file=/var/run/fahclient.pid --daemon
11:08:14: Config: /etc/fahclient/config.xml
11:08:14:******************************** Build ********************************
11:08:14: Version: 7.4.4
11:08:14: Date: Mar 4 2014
11:08:14: Time: 12:02:38
11:08:14: SVN Rev: 4130
11:08:14: Branch: fah/trunk/client
11:08:14: Compiler: GNU 4.4.7
11:08:14: Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
11:08:14: -fno-unsafe-math-optimizations -msse2
11:08:14: Platform: linux2 3.2.0-1-amd64
11:08:14: Bits: 64
11:08:14: Mode: Release
11:08:14:******************************* System ********************************
11:08:14: CPU: AMD Athlon(tm) II X2 270 Processor
11:08:14: CPU ID: AuthenticAMD Family 16 Model 6 Stepping 3
11:08:14: CPUs: 2
11:08:14: Memory: 3.86GiB
11:08:14:Free Memory: 3.47GiB
11:08:14: Threads: POSIX_THREADS
11:08:14: OS Version: 3.13
11:08:14:Has Battery: false
11:08:14: On Battery: false
11:08:14: UTC Offset: 0
11:08:14: PID: 1168
11:08:14: CWD: /var/lib/fahclient
11:08:14: OS: Linux 3.13.0-83-generic x86_64
11:08:14: OS Arch: AMD64
11:08:14: GPUs: 1
11:08:14: GPU 0: NVIDIA:3 GK208 [GeForce GT 630]
11:08:14: CUDA: 3.5
11:08:14:CUDA Driver: 6050
11:08:14:***********************************************************************
11:08:14:<config>
11:08:14: <!-- Client Control -->
11:08:14: <fold-anon v='true'/>
11:08:14:
11:08:14: <!-- Network -->
11:08:14: <proxy v=':8080'/>
11:08:14:
11:08:14: <!-- Slot Control -->
11:08:14: <power v='full'/>
11:08:14:
11:08:14: <!-- User Information -->
11:08:14: <passkey v='********************************'/>
11:08:14: <team v='163'/>
11:08:14: <user v='Seagull181005'/>
11:08:14:
11:08:14: <!-- Folding Slots -->
11:08:14: <slot id='1' type='GPU'/>
11:08:14:</config>
11:08:14:Switching to user fahclient
11:08:14:Trying to access database...
11:08:14:Successfully acquired database lock
11:08:14:Enabled folding slot 01: READY gpu:0:GK208 [GeForce GT 630]
11:08:14:WU01:FS01:Starting
11:08:14:WU01:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/web.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21 -dir 01 -suffix 01 -version 704 -lifeline 1168 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
11:08:14:WU01:FS01:Started FahCore on PID 1177
11:08:14:WU01:FS01:Core PID:1181
11:08:14:WU01:FS01:FahCore 0x21 started
11:08:15:WU01:FS01:0x21:*********************** Log Started 2016-03-15T11:08:14Z ***********************
11:08:15:WU01:FS01:0x21:Project: 9208 (Run 0, Clone 16, Gen 152)
11:08:15:WU01:FS01:0x21:Unit: 0x0000014c664f2dd055edd357ba4c7274
11:08:15:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
11:08:15:WU01:FS01:0x21:Machine: 1
11:08:15:WU01:FS01:0x21:Digital signatures verified
11:08:15:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
11:08:15:WU01:FS01:0x21:Version 0.0.17
11:08:15:WU01:FS01:0x21: Found a checkpoint file
11:09:28:WU01:FS01:0x21:Completed 1600000 out of 2500000 steps (64%)
11:09:28:WU01:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
12:28:33:WU01:FS01:0x21:Completed 1625000 out of 2500000 steps (65%)
13:45:02:WU01:FS01:0x21:Completed 1650000 out of 2500000 steps (66%)
14:54:38:FS01:Paused
14:54:38:FS01:Shutting core down
14:54:38:WU01:FS01:0x21:Caught signal SIGINT(2) on PID 1181
14:54:38:WU01:FS01:0x21:Exiting, please wait. . .
14:54:39:WU01:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
14:54:57:Removing old file 'configs/config-20160309-213616.xml'
14:54:57:Saving configuration to /etc/fahclient/config.xml
14:54:57:<config>
14:54:57: <!-- Client Control -->
14:54:57: <fold-anon v='true'/>
14:54:57:
14:54:57: <!-- Network -->
14:54:57: <proxy v=':8080'/>
14:54:57:
14:54:57: <!-- Slot Control -->
14:54:57: <power v='full'/>
14:54:57:
14:54:57: <!-- User Information -->
14:54:57: <passkey v='********************************'/>
14:54:57: <team v='163'/>
14:54:57: <user v='Seagull181005'/>
14:54:57:
14:54:57: <!-- Folding Slots -->
14:54:57: <slot id='1' type='GPU'>
14:54:57: <paused v='true'/>
14:54:57: </slot>
14:54:57:</config>
******************************* Date: 2016-03-15 *******************************
19:24:10:FS01:Unpaused
19:24:10:WU01:FS01:Starting
19:24:10:WU01:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/web.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21 -dir 01 -suffix 01 -version 704 -lifeline 1168 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
19:24:10:WU01:FS01:Started FahCore on PID 6735
19:24:10:WU01:FS01:Core PID:6739
19:24:10:WU01:FS01:FahCore 0x21 started
19:24:10:WU01:FS01:0x21:*********************** Log Started 2016-03-15T19:24:10Z ***********************
19:24:10:WU01:FS01:0x21:Project: 9208 (Run 0, Clone 16, Gen 152)
19:24:10:WU01:FS01:0x21:Unit: 0x0000014c664f2dd055edd357ba4c7274
19:24:10:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
19:24:10:WU01:FS01:0x21:Machine: 1
19:24:10:WU01:FS01:0x21:Digital signatures verified
19:24:10:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
19:24:10:WU01:FS01:0x21:Version 0.0.17
19:24:10:WU01:FS01:0x21: Found a checkpoint file
19:24:22:Removing old file 'configs/config-20160309-214626.xml'
19:24:22:Saving configuration to /etc/fahclient/config.xml
19:24:22:<config>
19:24:22: <!-- Client Control -->
19:24:22: <fold-anon v='true'/>
19:24:22:
19:24:22: <!-- Network -->
19:24:22: <proxy v=':8080'/>
19:24:22:
19:24:22: <!-- Slot Control -->
19:24:22: <power v='full'/>
19:24:22:
19:24:22: <!-- User Information -->
19:24:22: <passkey v='********************************'/>
19:24:22: <team v='163'/>
19:24:22: <user v='Seagull181005'/>
19:24:22:
19:24:22: <!-- Folding Slots -->
19:24:22: <slot id='1' type='GPU'/>
19:24:22:</config>
19:25:08:WU01:FS01:0x21:Completed 1600000 out of 2500000 steps (64%)
19:25:08:WU01:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
20:44:52:WU01:FS01:0x21:Completed 1625000 out of 2500000 steps (65%)
22:03:03:WU01:FS01:0x21:Completed 1650000 out of 2500000 steps (66%)
23:21:37:WU01:FS01:0x21:Completed 1675000 out of 2500000 steps (67%)
00:38:12:WU01:FS01:0x21:Completed 1700000 out of 2500000 steps (68%)
******************************* Date: 2016-03-16 *******************************
01:55:20:WU01:FS01:0x21:Completed 1725000 out of 2500000 steps (69%)
03:11:58:WU01:FS01:0x21:Completed 1750000 out of 2500000 steps (70%)
04:29:12:WU01:FS01:0x21:Completed 1775000 out of 2500000 steps (71%)
05:46:46:WU01:FS01:0x21:Completed 1800000 out of 2500000 steps (72%)
07:04:39:WU01:FS01:0x21:Completed 1825000 out of 2500000 steps (73%)
******************************* Date: 2016-03-16 *******************************
08:22:18:WU01:FS01:0x21:Completed 1850000 out of 2500000 steps (74%)
09:39:56:WU01:FS01:0x21:Completed 1875000 out of 2500000 steps (75%)
10:57:32:WU01:FS01:0x21:Completed 1900000 out of 2500000 steps (76%)
12:15:28:WU01:FS01:0x21:Completed 1925000 out of 2500000 steps (77%)
13:33:06:WU01:FS01:0x21:Completed 1950000 out of 2500000 steps (78%)
******************************* Date: 2016-03-16 *******************************
14:50:44:WU01:FS01:0x21:Completed 1975000 out of 2500000 steps (79%)
16:08:22:WU01:FS01:0x21:Completed 2000000 out of 2500000 steps (80%)
17:27:58:WU01:FS01:0x21:Completed 2025000 out of 2500000 steps (81%)
18:47:39:WU01:FS01:0x21:Completed 2050000 out of 2500000 steps (82%)
20:05:47:WU01:FS01:0x21:Completed 2075000 out of 2500000 steps (83%)
******************************* Date: 2016-03-16 *******************************
21:24:28:WU01:FS01:0x21:Completed 2100000 out of 2500000 steps (84%)
22:42:00:WU01:FS01:0x21:Completed 2125000 out of 2500000 steps (85%)
23:58:50:WU01:FS01:0x21:Completed 2150000 out of 2500000 steps (86%)
01:15:37:WU01:FS01:0x21:Completed 2175000 out of 2500000 steps (87%)
02:32:58:WU01:FS01:0x21:Completed 2200000 out of 2500000 steps (88%)
******************************* Date: 2016-03-17 *******************************
03:49:59:WU01:FS01:0x21:Completed 2225000 out of 2500000 steps (89%)
05:06:06:WU01:FS01:0x21:Completed 2250000 out of 2500000 steps (90%)
06:22:10:WU01:FS01:0x21:Completed 2275000 out of 2500000 steps (91%)
07:38:18:WU01:FS01:0x21:Completed 2300000 out of 2500000 steps (92%)
08:54:43:WU01:FS01:0x21:Completed 2325000 out of 2500000 steps (93%)
******************************* Date: 2016-03-17 *******************************
10:10:50:WU01:FS01:0x21:Completed 2350000 out of 2500000 steps (94%)
11:26:53:WU01:FS01:0x21:Completed 2375000 out of 2500000 steps (95%)
12:42:59:WU01:FS01:0x21:Completed 2400000 out of 2500000 steps (96%)
13:59:20:WU01:FS01:0x21:Completed 2425000 out of 2500000 steps (97%)
15:15:24:WU01:FS01:0x21:Completed 2450000 out of 2500000 steps (98%)
******************************* Date: 2016-03-17 *******************************
16:34:10:WU01:FS01:0x21:Completed 2475000 out of 2500000 steps (99%)
16:34:11:WU00:FS01:Connecting to 171.67.108.45:80
16:34:13:WU00:FS01:Assigned to work server 171.67.108.144
16:34:13:WU00:FS01:Requesting new work unit for slot 01: RUNNING gpu:0:GK208 [GeForce GT 630] from 171.67.108.144
16:34:13:WU00:FS01:Connecting to 171.67.108.144:8080
16:34:26:WU00:FS01:Downloading 8.26MiB
16:34:32:WU00:FS01:Download 38.57%
16:34:38:WU00:FS01:Download 83.95%
16:34:40:WU00:FS01:Download complete
16:34:40:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13106 run:4 clone:28 gen:24 core:0x21 unit:0x0000001cab436c9056be6927cc133666
17:52:24:WU01:FS01:0x21:Completed 2500000 out of 2500000 steps (100%)
17:52:42:WU01:FS01:0x21:Saving result file logfile_01.txt
17:52:42:WU01:FS01:0x21:Saving result file checkpointState.xml
17:52:46:WU01:FS01:0x21:Saving result file checkpt.crc
17:52:46:WU01:FS01:0x21:Saving result file log.txt
17:52:46:WU01:FS01:0x21:Saving result file positions.xtc
17:52:49:WU01:FS01:0x21:Folding@home Core Shutdown: FINISHED_UNIT
17:52:50:WU01:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
17:52:50:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:9208 run:0 clone:16 gen:152 core:0x21 unit:0x0000014c664f2dd055edd357ba4c7274
17:52:50:WU01:FS01:Uploading 17.50MiB to 171.64.65.104
17:52:50:WU00:FS01:Starting
17:52:50:WU01:FS01:Connecting to 171.64.65.104:8080
17:52:50:WU00:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/web.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21 -dir 00 -suffix 01 -version 704 -lifeline 1168 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
17:52:50:WU00:FS01:Started FahCore on PID 24289
17:52:50:WU00:FS01:Core PID:24293
17:52:50:WU00:FS01:FahCore 0x21 started
17:52:50:WU00:FS01:0x21:*********************** Log Started 2016-03-17T17:52:50Z ***********************
17:52:50:WU00:FS01:0x21:Project: 13106 (Run 4, Clone 28, Gen 24)
17:52:50:WU00:FS01:0x21:Unit: 0x0000001cab436c9056be6927cc133666
17:52:50:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
17:52:50:WU00:FS01:0x21:Machine: 1
17:52:50:WU00:FS01:0x21:Reading tar file core.xml
17:52:50:WU00:FS01:0x21:Reading tar file integrator.xml
17:52:50:WU00:FS01:0x21:Reading tar file state.xml
17:52:52:WU00:FS01:0x21:Reading tar file system.xml
17:52:52:WU00:FS01:0x21:Digital signatures verified
17:52:52:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
17:52:52:WU00:FS01:0x21:Version 0.0.17
17:52:56:WU01:FS01:Upload 3.93%
17:53:03:WU01:FS01:Upload 7.50%
17:53:11:WU01:FS01:Upload 12.14%
17:53:18:WU01:FS01:Upload 16.78%
17:53:24:WU01:FS01:Upload 19.28%
17:53:30:WU01:FS01:Upload 23.57%
17:53:37:WU01:FS01:Upload 27.14%
17:53:43:WU01:FS01:Upload 28.92%
17:53:49:WU01:FS01:Upload 33.21%
17:53:57:WU01:FS01:Upload 36.78%
17:54:03:WU01:FS01:Upload 41.06%
17:54:04:WU00:FS01:0x21:Completed 0 out of 520000 steps (0%)
17:54:04:WU00:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
17:54:10:WU01:FS01:Upload 43.56%
17:54:16:WU01:FS01:Upload 48.20%
17:54:22:WU01:FS01:Upload 49.99%
17:54:28:WU01:FS01:Upload 54.63%
17:54:35:WU01:FS01:Upload 57.84%
17:54:41:WU01:FS01:Upload 61.41%
17:54:47:WU01:FS01:Upload 64.27%
17:54:53:WU01:FS01:Upload 67.48%
17:55:00:WU01:FS01:Upload 70.34%
17:55:06:WU01:FS01:Upload 73.91%
17:55:13:WU01:FS01:Upload 77.84%
17:55:20:WU01:FS01:Upload 81.41%
17:55:27:WU01:FS01:Upload 84.26%
17:55:33:WU01:FS01:Upload 88.19%
17:55:41:WU01:FS01:Upload 91.40%
17:55:48:WU01:FS01:Upload 95.33%
17:55:54:WU01:FS01:Upload 99.26%
17:55:59:WU01:FS01:Upload complete
17:55:59:WU01:FS01:Server responded WORK_QUIT (404)
17:55:59:WARNING:WU01:FS01:Server did not like results, dumping
17:55:59:WU01:FS01:Cleaning up
18:05:09:WU00:FS01:0x21:Completed 5200 out of 520000 steps (1%)
18:16:27:WU00:FS01:0x21:Completed 10400 out of 520000 steps (2%)
Re: Failed to connect to 171.64.65.104:80
Posted: Thu Mar 17, 2016 6:44 pm
by Joe_H
When was the WU first assigned? The log just shows processing from about 64%. If the time of assignment was more than 10 days before turn-in, the WU would not be accepted. After 7 days it should be accepted, but only get base points for credit.
Based on a sampling of frame times, your GT 630 should be able to complete a WU from this project in about 5.5 days if run continuously. That is within the preferred deadline. But if not run continuously the deadlines will be reached fairly easily.
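An estimate of this kind can be roughly reproduced from the log above: each 1% "Completed" line marks one frame, so the gap between consecutive timestamps is the frame time. This is a sketch only, assuming an uninterrupted stretch with no midnight rollover between the sampled lines. `frame_times` is a hypothetical helper, and the timestamps are the 74% to 78% lines from the log.

```python
from datetime import datetime

def frame_times(stamps):
    """Seconds between consecutive HH:MM:SS stamps (same day assumed)."""
    parsed = [datetime.strptime(s, "%H:%M:%S") for s in stamps]
    return [(b - a).total_seconds() for a, b in zip(parsed, parsed[1:])]

# Timestamps of the 74%..78% "Completed" lines in the log above.
stamps = ["08:22:18", "09:39:56", "10:57:32", "12:15:28", "13:33:06"]
gaps = frame_times(stamps)
mean_gap = sum(gaps) / len(gaps)
days_per_wu = mean_gap * 100 / 86400   # 100 frames of 1% each
print(f"{mean_gap:.0f} s per frame, about {days_per_wu:.1f} days per WU")
# -> 4662 s per frame, about 5.4 days per WU
```

That lands close to the 5.5-day figure, leaving little slack before the 7-day preferred deadline if the GPU is paused for any length of time.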
Re: Failed to connect to 171.64.65.104:80
Posted: Thu Mar 17, 2016 8:08 pm
by Seagull181005
It was assigned around 1700Z on the 10th Mar. As I say it missed the first deadline by about an hour.
It was run most of the time but not continuously - sometimes I want to use my GPU for things other than folding! If it were a deadline issue, wouldn't the error message be more descriptive?
Re: Failed to connect to 171.64.65.104:80
Posted: Thu Mar 17, 2016 8:12 pm
by jadeshi
Alright guys, thanks for the wait. Try it now, and let me know if any other issues come up.
Re: Failed to connect to 171.64.65.104:80
Posted: Thu Mar 17, 2016 9:47 pm
by Joe_H
Seagull181005 wrote: It was assigned around 1700Z on the 10th Mar. As I say it missed the first deadline by about an hour.
It was run most of the time but not continuously - sometimes I want to use my GPU for things other than folding! If it were a deadline issue, wouldn't the error message be more descriptive?
Since it was just over the preferred deadline, but not the final, then the WU not being accepted would have been for another reason. It could have been corrupted, or just another symptom of the problems jadeshi has been working on since the weekend with this WS. So far there have not been any successful returns of this WU.
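The arithmetic behind "just over the preferred deadline, but not the final" can be checked with a few lines of Python. This is a sketch under stated assumptions: the 7-day and 10-day values are the ones quoted earlier in the thread, the assignment time is the approximate "around 1700Z" figure from the post, and treating the boundaries as inclusive is a guess, since the server's exact rule is not shown here. `deadline_status` is a hypothetical helper.

```python
from datetime import datetime, timedelta

PREFERRED = timedelta(days=7)   # preferred deadline quoted earlier in the thread
FINAL = timedelta(days=10)      # final deadline quoted earlier in the thread

def deadline_status(assigned, returned):
    """Classify a return time against the two deadlines."""
    elapsed = returned - assigned
    if elapsed <= PREFERRED:
        return "within preferred deadline"
    if elapsed <= FINAL:
        return "past preferred, before final"
    return "past final deadline"

# Approximate assignment time from the post; return time is the
# "Upload complete" line in the log (17:55:59 on 2016-03-17).
assigned = datetime(2016, 3, 10, 17, 0, 0)
returned = datetime(2016, 3, 17, 17, 55, 59)
print(returned - assigned, "->", deadline_status(assigned, returned))
# -> 7 days, 0:55:59 -> past preferred, before final
```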
As for using your GPU for other things, that is understood. But what I was trying to get across is that your GT 630 is about the minimum required to complete many of the recent projects using the Core_17 and later GPU folding cores. You do have the option of opting in to get the older Core_15 WUs; they have deadlines of 5 weeks for the preferred deadline and nearly 7 weeks for the final one.
Re: Failed to connect to 171.64.65.104:80
Posted: Thu Mar 17, 2016 11:32 pm
by Seagull181005
Joe_H wrote: As for using your GPU for other things, that is understood. But what I was trying to get across is that your GT 630 is about the minimum required to complete many of the recent projects using the Core_17 and later GPU folding cores. You do have the option of opting in to get the older Core_15 WUs; they have deadlines of 5 weeks for the preferred deadline and nearly 7 weeks for the final one.
OK thanks for that, I'll bear it in mind if I keep failing deadlines.
How do I opt in to Core_15? It is not obvious to me from looking at the fahclient interface or the FAQ.