Had to work overtime today so I made the effort to come home on my dinner break and restart my slots folding after "finishing" them at the start of the expensive Time-of-use electricity rates this morning.
When I returned home this evening it was to 7/8 slots stuck: Waiting on "WS Assignment" Rebooted the stuck systems but still no joy.
All the stuck slots are all getting sent to WS 54.157.202.86 mskcc1.foldingathome.org
*********************** Log Started 2021-07-27T03:40:05Z ***********************
03:40:05:******************************* libFAH ********************************
03:40:05: Date: Oct 8 2020
03:40:05: Time: 19:34:47
03:40:05: Revision: 06b99f7701e0d3f883dd14a78b459ad27da23809
03:40:05: Branch: master
03:40:05: Compiler: GNU 8.3.0
03:40:05: Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05: -O3 -funroll-loops -fno-pie
03:40:05: Platform: linux2 5.8.0-1-amd64
03:40:05: Bits: 64
03:40:05: Mode: Release
03:40:05:****************************** FAHClient ******************************
03:40:05: Version: 7.6.20
03:40:05: Author: Joseph Coffland <[email protected]>
03:40:05: Copyright: 2020 foldingathome.org
03:40:05: Homepage: https://foldingathome.org/
03:40:05: Date: Oct 12 2020
03:40:05: Time: 22:00:41
03:40:05: Revision: c858fe2a8342bfa3e116e00b394d8dfa322ecd18
03:40:05: Branch: master
03:40:05: Compiler: GNU 8.3.0
03:40:05: Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05: -O3 -funroll-loops -fno-pie
03:40:05: Platform: linux2 5.8.0-1-amd64
03:40:05: Bits: 64
03:40:05: Mode: Release
03:40:05: Args: --child /etc/fahclient/config.xml --run-as fahclient
03:40:05: --pid-file=/var/run/fahclient.pid --daemon
03:40:05: Config: /etc/fahclient/config.xml
03:40:05:******************************** CBang ********************************
03:40:05: Date: Oct 8 2020
03:40:05: Time: 19:34:20
03:40:05: Revision: ab0a6d9e35982b831a74cb2706c569fe46bac2af
03:40:05: Branch: master
03:40:05: Compiler: GNU 8.3.0
03:40:05: Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05: -O3 -funroll-loops -fno-pie -fPIC
03:40:05: Platform: linux2 5.8.0-1-amd64
03:40:05: Bits: 64
03:40:05: Mode: Release
03:40:05:******************************* System ********************************
03:40:05: CPU: AMD Ryzen 9 3950X 16-Core Processor
03:40:05: CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
03:40:05: CPUs: 32
03:40:05: Memory: 31.37GiB
03:40:05: Free Memory: 27.07GiB
03:40:05: Threads: POSIX_THREADS
03:40:05: OS Version: 5.4
03:40:05: Has Battery: false
03:40:05: On Battery: false
03:40:05: UTC Offset: -4
03:40:05: PID: 2062
03:40:05: CWD: /var/lib/fahclient
03:40:05: OS: Linux 5.4.0-74-generic x86_64
03:40:05: OS Arch: AMD64
03:40:05: GPUs: 2
03:40:05: GPU 0: Bus:9 Slot:0 Func:0 NVIDIA:7 TU106 [GeForce RTX 2060 SUPER]
03:40:05: GPU 1: Bus:10 Slot:0 Func:0 NVIDIA:8 TU104 [GeForce RTX 2070 SUPER]
03:40:05: 8218
03:40:05: CUDA Device 0: Platform:0 Device:0 Bus:10 Slot:0 Compute:7.5 Driver:11.2
03:40:05: CUDA Device 1: Platform:0 Device:1 Bus:9 Slot:0 Compute:7.5 Driver:11.2
03:40:05:OpenCL Device 0: Platform:0 Device:0 Bus:10 Slot:0 Compute:1.2 Driver:460.84
03:40:05:OpenCL Device 1: Platform:0 Device:1 Bus:9 Slot:0 Compute:1.2 Driver:460.84
03:40:05:***********************************************************************
03:40:05:<config>
03:40:05: <!-- Client Control -->
03:40:05: <fold-anon v='true'/>
03:40:05:
03:40:05: <!-- Folding Slot Configuration -->
03:40:05: <cause v='COVID_19'/>
03:40:05: <gpu v='false'/>
03:40:05:
03:40:05: <!-- HTTP Server -->
03:40:05: <allow v='127.0.0.1 *****************'/>
03:40:05:
03:40:05: <!-- Network -->
03:40:05: <proxy v=':8080'/>
03:40:05:
03:40:05: <!-- Remote Command Server -->
03:40:05: <command-allow-no-pass v='127.0.0.1 ********************'/>
03:40:05:
03:40:05: <!-- Slot Control -->
03:40:05: <pause-on-battery v='false'/>
03:40:05: <pause-on-start v='true'/>
03:40:05: <power v='full'/>
03:40:05:
03:40:05: <!-- User Information -->
03:40:05: <passkey v='*****'/>
03:40:05: <team v='*********'/>
03:40:05: <user v='************'/>
03:40:05:
03:40:05: <!-- Folding Slots -->
03:40:05: <slot id='1' type='GPU'>
03:40:05: <pci-bus v='9'/>
03:40:05: <pci-slot v='0'/>
03:40:05: </slot>
03:40:05: <slot id='0' type='GPU'>
03:40:05: <pci-bus v='10'/>
03:40:05: <pci-slot v='0'/>
03:40:05: </slot>
03:40:05:</config>
03:40:05:Trying to access database...
03:40:05:Successfully acquired database lock
03:40:05:FS01:Initialized folding slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - PAUSED by user
03:40:05:FS00:Initialized folding slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - PAUSED by user
03:40:54:FS01:Unpaused
03:40:54:FS00:Unpaused
03:40:54:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:40:54:WU00:FS01:Assigned to work server 54.157.202.86
03:40:54:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:40:54:WU00:FS01:Connecting to 54.157.202.86:8080
03:40:54:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:40:54:WU01:FS00:Assigned to work server 54.157.202.86
03:40:54:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:40:54:WU01:FS00:Connecting to 54.157.202.86:8080
03:40:54:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:40:55:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:40:55:WU00:FS01:Assigned to work server 54.157.202.86
03:40:55:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:40:55:WU00:FS01:Connecting to 54.157.202.86:8080
03:40:55:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:40:55:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:40:55:WU01:FS00:Assigned to work server 54.157.202.86
03:40:55:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:40:55:WU01:FS00:Connecting to 54.157.202.86:8080
03:40:55:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:40:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:41:55:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:41:55:WU00:FS01:Assigned to work server 54.157.202.86
03:41:55:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:41:55:WU00:FS01:Connecting to 54.157.202.86:8080
03:41:55:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:41:55:WU01:FS00:Assigned to work server 54.157.202.86
03:41:55:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:41:55:WU01:FS00:Connecting to 54.157.202.86:8080
03:41:56:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:41:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:43:32:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:43:32:WU00:FS01:Assigned to work server 54.157.202.86
03:43:32:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:43:32:WU00:FS01:Connecting to 54.157.202.86:8080
03:43:32:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:43:33:WU01:FS00:Assigned to work server 54.157.202.86
03:43:33:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:43:33:WU01:FS00:Connecting to 54.157.202.86:8080
03:43:33:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:43:33:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:46:09:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:46:09:WU00:FS01:Assigned to work server 54.157.202.86
03:46:09:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:46:09:WU00:FS01:Connecting to 54.157.202.86:8080
03:46:10:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:46:10:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:46:10:WU01:FS00:Assigned to work server 54.157.202.86
03:46:10:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:46:10:WU01:FS00:Connecting to 54.157.202.86:8080
03:46:10:ERROR:WU01:FS00:Exception: Server did not assign work unit
gordonbb wrote:Had to work overtime today so I made the effort to come home on my dinner break and restart my slots folding after "finishing" them at the start of the expensive Time-of-use electricity rates this morning.
When I returned home this evening it was to 7/8 slots stuck: Waiting on "WS Assignment" Rebooted the stuck systems but still no joy.
All the stuck slots are all getting sent to WS 54.157.202.86 mskcc1.foldingathome.org
*********************** Log Started 2021-07-27T03:40:05Z ***********************
03:40:05:******************************* libFAH ********************************
03:40:05: Date: Oct 8 2020
03:40:05: Time: 19:34:47
03:40:05: Revision: 06b99f7701e0d3f883dd14a78b459ad27da23809
03:40:05: Branch: master
03:40:05: Compiler: GNU 8.3.0
03:40:05: Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05: -O3 -funroll-loops -fno-pie
03:40:05: Platform: linux2 5.8.0-1-amd64
03:40:05: Bits: 64
03:40:05: Mode: Release
03:40:05:****************************** FAHClient ******************************
03:40:05: Version: 7.6.20
03:40:05: Author: Joseph Coffland <[email protected]>
03:40:05: Copyright: 2020 foldingathome.org
03:40:05: Homepage: https://foldingathome.org/
03:40:05: Date: Oct 12 2020
03:40:05: Time: 22:00:41
03:40:05: Revision: c858fe2a8342bfa3e116e00b394d8dfa322ecd18
03:40:05: Branch: master
03:40:05: Compiler: GNU 8.3.0
03:40:05: Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05: -O3 -funroll-loops -fno-pie
03:40:05: Platform: linux2 5.8.0-1-amd64
03:40:05: Bits: 64
03:40:05: Mode: Release
03:40:05: Args: --child /etc/fahclient/config.xml --run-as fahclient
03:40:05: --pid-file=/var/run/fahclient.pid --daemon
03:40:05: Config: /etc/fahclient/config.xml
03:40:05:******************************** CBang ********************************
03:40:05: Date: Oct 8 2020
03:40:05: Time: 19:34:20
03:40:05: Revision: ab0a6d9e35982b831a74cb2706c569fe46bac2af
03:40:05: Branch: master
03:40:05: Compiler: GNU 8.3.0
03:40:05: Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05: -O3 -funroll-loops -fno-pie -fPIC
03:40:05: Platform: linux2 5.8.0-1-amd64
03:40:05: Bits: 64
03:40:05: Mode: Release
03:40:05:******************************* System ********************************
03:40:05: CPU: AMD Ryzen 9 3950X 16-Core Processor
03:40:05: CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
03:40:05: CPUs: 32
03:40:05: Memory: 31.37GiB
03:40:05: Free Memory: 27.07GiB
03:40:05: Threads: POSIX_THREADS
03:40:05: OS Version: 5.4
03:40:05: Has Battery: false
03:40:05: On Battery: false
03:40:05: UTC Offset: -4
03:40:05: PID: 2062
03:40:05: CWD: /var/lib/fahclient
03:40:05: OS: Linux 5.4.0-74-generic x86_64
03:40:05: OS Arch: AMD64
03:40:05: GPUs: 2
03:40:05: GPU 0: Bus:9 Slot:0 Func:0 NVIDIA:7 TU106 [GeForce RTX 2060 SUPER]
03:40:05: GPU 1: Bus:10 Slot:0 Func:0 NVIDIA:8 TU104 [GeForce RTX 2070 SUPER]
03:40:05: 8218
03:40:05: CUDA Device 0: Platform:0 Device:0 Bus:10 Slot:0 Compute:7.5 Driver:11.2
03:40:05: CUDA Device 1: Platform:0 Device:1 Bus:9 Slot:0 Compute:7.5 Driver:11.2
03:40:05:OpenCL Device 0: Platform:0 Device:0 Bus:10 Slot:0 Compute:1.2 Driver:460.84
03:40:05:OpenCL Device 1: Platform:0 Device:1 Bus:9 Slot:0 Compute:1.2 Driver:460.84
03:40:05:***********************************************************************
03:40:05:<config>
03:40:05: <!-- Client Control -->
03:40:05: <fold-anon v='true'/>
03:40:05:
03:40:05: <!-- Folding Slot Configuration -->
03:40:05: <cause v='COVID_19'/>
03:40:05: <gpu v='false'/>
03:40:05:
03:40:05: <!-- HTTP Server -->
03:40:05: <allow v='127.0.0.1 *****************'/>
03:40:05:
03:40:05: <!-- Network -->
03:40:05: <proxy v=':8080'/>
03:40:05:
03:40:05: <!-- Remote Command Server -->
03:40:05: <command-allow-no-pass v='127.0.0.1 ********************'/>
03:40:05:
03:40:05: <!-- Slot Control -->
03:40:05: <pause-on-battery v='false'/>
03:40:05: <pause-on-start v='true'/>
03:40:05: <power v='full'/>
03:40:05:
03:40:05: <!-- User Information -->
03:40:05: <passkey v='*****'/>
03:40:05: <team v='*********'/>
03:40:05: <user v='************'/>
03:40:05:
03:40:05: <!-- Folding Slots -->
03:40:05: <slot id='1' type='GPU'>
03:40:05: <pci-bus v='9'/>
03:40:05: <pci-slot v='0'/>
03:40:05: </slot>
03:40:05: <slot id='0' type='GPU'>
03:40:05: <pci-bus v='10'/>
03:40:05: <pci-slot v='0'/>
03:40:05: </slot>
03:40:05:</config>
03:40:05:Trying to access database...
03:40:05:Successfully acquired database lock
03:40:05:FS01:Initialized folding slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - PAUSED by user
03:40:05:FS00:Initialized folding slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - PAUSED by user
03:40:54:FS01:Unpaused
03:40:54:FS00:Unpaused
03:40:54:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:40:54:WU00:FS01:Assigned to work server 54.157.202.86
03:40:54:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:40:54:WU00:FS01:Connecting to 54.157.202.86:8080
03:40:54:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:40:54:WU01:FS00:Assigned to work server 54.157.202.86
03:40:54:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:40:54:WU01:FS00:Connecting to 54.157.202.86:8080
03:40:54:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:40:55:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:40:55:WU00:FS01:Assigned to work server 54.157.202.86
03:40:55:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:40:55:WU00:FS01:Connecting to 54.157.202.86:8080
03:40:55:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:40:55:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:40:55:WU01:FS00:Assigned to work server 54.157.202.86
03:40:55:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:40:55:WU01:FS00:Connecting to 54.157.202.86:8080
03:40:55:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:40:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:41:55:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:41:55:WU00:FS01:Assigned to work server 54.157.202.86
03:41:55:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:41:55:WU00:FS01:Connecting to 54.157.202.86:8080
03:41:55:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:41:55:WU01:FS00:Assigned to work server 54.157.202.86
03:41:55:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:41:55:WU01:FS00:Connecting to 54.157.202.86:8080
03:41:56:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:41:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:43:32:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:43:32:WU00:FS01:Assigned to work server 54.157.202.86
03:43:32:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:43:32:WU00:FS01:Connecting to 54.157.202.86:8080
03:43:32:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:43:33:WU01:FS00:Assigned to work server 54.157.202.86
03:43:33:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:43:33:WU01:FS00:Connecting to 54.157.202.86:8080
03:43:33:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:43:33:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:46:09:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:46:09:WU00:FS01:Assigned to work server 54.157.202.86
03:46:09:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:46:09:WU00:FS01:Connecting to 54.157.202.86:8080
03:46:10:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:46:10:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:46:10:WU01:FS00:Assigned to work server 54.157.202.86
03:46:10:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:46:10:WU01:FS00:Connecting to 54.157.202.86:8080
03:46:10:ERROR:WU01:FS00:Exception: Server did not assign work unit
The fix was to change my Client Preference from "COVID" to "Any" and reboot. 7/8 slots now on other Work Servers but one assigned to 54.157.202.86 mskcc1.foldingathome.org and still stuck waiting
I've had a few GPUs stuck waiting for work from this work server as well. I was able to get work by changing the preferred cause for the affected slots. That changed the work server the slot was assigned to and the GPUs are now folding again.
My Slots continued on happily overnight after I changed the Preferences to "Any" from "COVID"
One possible hint is that this server in the Stats shows JUST 1.34TB free considerably lower than all the other active servers so perhaps it has some Resource Constraints preventing it from assigning tasks.
I also noted that both the F@H main page and Post Era show a 0% completion rate for the latest Moonshot so perhaps some more tasty COVID WUs are being prepped for our GPUs to work on!