Page 1 of 1

Can't get work for GPU

Posted: Thu Mar 19, 2020 8:04 pm
by Seneca
Hi there,

simple thing, but I find no way around ... I'm running trhe FAH client on an Windows 10 machine with a Core i7-2600k CPU, 32 GB RAM (16 GB as RAMdisk) and a NVIDIA RTX2800super GPU.

Unfortunately the slot seem to not getting work regulary - yesterday both CPU and GPU slots parts were idling around, today the GPU is idle while the CPU was fortunte enough to get some work ...

Any hint ?

Seneca 0=0

Annex: Last part of Log ....

Code: Select all

19:47:21:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Frank\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah/FahCore_a7.exe -dir 00 -suffix 01 -version 705 -lifeline 10244 -checkpoint 5 -np 6
19:47:21:WU00:FS00:Started FahCore on PID 6676
19:47:21:WU00:FS00:Core PID:13032
19:47:21:WU00:FS00:FahCore 0xa7 started
19:47:21:WU00:FS00:0xa7:*********************** Log Started 2020-03-19T19:47:21Z ***********************
19:47:21:WU00:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
19:47:21:WU00:FS00:0xa7:       Type: 0xa7
19:47:21:WU00:FS00:0xa7:       Core: Gromacs
19:47:21:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 705 -lifeline 6676 -checkpoint 5 -np 6
19:47:21:WU00:FS00:0xa7:************************************ CBang *************************************
19:47:21:WU00:FS00:0xa7:       Date: Oct 26 2019
19:47:21:WU00:FS00:0xa7:       Time: 01:38:25
19:47:21:WU00:FS00:0xa7:   Revision: c46a1a011a24143739ac7218c5a435f66777f62f
19:47:21:WU00:FS00:0xa7:     Branch: master
19:47:21:WU00:FS00:0xa7:   Compiler: Visual C++ 2008
19:47:21:WU00:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
19:47:21:WU00:FS00:0xa7:   Platform: win32 10
19:47:21:WU00:FS00:0xa7:       Bits: 64
19:47:21:WU00:FS00:0xa7:       Mode: Release
19:47:21:WU00:FS00:0xa7:************************************ System ************************************
19:47:21:WU00:FS00:0xa7:        CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
19:47:21:WU00:FS00:0xa7:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
19:47:21:WU00:FS00:0xa7:       CPUs: 8
19:47:21:WU00:FS00:0xa7:     Memory: 31.98GiB
19:47:21:WU00:FS00:0xa7:Free Memory: 17.99GiB
19:47:21:WU00:FS00:0xa7:    Threads: WINDOWS_THREADS
19:47:21:WU00:FS00:0xa7: OS Version: 6.2
19:47:21:WU00:FS00:0xa7:Has Battery: false
19:47:21:WU00:FS00:0xa7: On Battery: false
19:47:21:WU00:FS00:0xa7: UTC Offset: 1
19:47:21:WU00:FS00:0xa7:        PID: 13032
19:47:21:WU00:FS00:0xa7:        CWD: C:\Users\Frank\AppData\Roaming\FAHClient\work
19:47:21:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
19:47:21:WU00:FS00:0xa7:    Version: 0.0.18
19:47:21:WU00:FS00:0xa7:     Author: Joseph Coffland <[email protected]>
19:47:21:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
19:47:21:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
19:47:21:WU00:FS00:0xa7:       Date: Oct 26 2019
19:47:21:WU00:FS00:0xa7:       Time: 01:52:30
19:47:21:WU00:FS00:0xa7:   Revision: c1e3513b1bc0c16013668f2173ee969e5995b38e
19:47:21:WU00:FS00:0xa7:     Branch: master
19:47:21:WU00:FS00:0xa7:   Compiler: Visual C++ 2008
19:47:21:WU00:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
19:47:21:WU00:FS00:0xa7:   Platform: win32 10
19:47:21:WU00:FS00:0xa7:       Bits: 64
19:47:21:WU00:FS00:0xa7:       Mode: Release
19:47:21:WU00:FS00:0xa7:************************************ Build *************************************
19:47:21:WU00:FS00:0xa7:       SIMD: avx_256
19:47:21:WU00:FS00:0xa7:********************************************************************************
19:47:21:WU00:FS00:0xa7:Project: 14400 (Run 0, Clone 3357, Gen 123)
19:47:21:WU00:FS00:0xa7:Unit: 0x0000008580fccb095dcad794b2f8bf67
19:47:21:WU00:FS00:0xa7:Digital signatures verified
19:47:21:WU00:FS00:0xa7:Calling: mdrun -s frame123.tpr -o frame123.trr -x frame123.xtc -cpi state.cpt -cpt 5 -nt 6
19:47:21:WU00:FS00:0xa7:Steps: first=15375000 total=125000
19:47:21:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
19:47:21:WU01:FS01:Connecting to 18.218.241.186:80
19:47:24:WU00:FS00:0xa7:Completed 62812 out of 125000 steps (50%)
19:47:31:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
19:47:31:ERROR:WU01:FS01:Exception: Could not get an assignment
19:47:32:ERROR:Receive error: 10053: Eine bestehende Verbindung wurde softwaregesteuert
19:47:32:ERROR:durch den Hostcomputer abgebrochen.
19:48:58:WU01:FS01:Connecting to 65.254.110.245:8080
19:48:58:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
19:48:58:WU01:FS01:Connecting to 18.218.241.186:80
19:48:59:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
19:48:59:ERROR:WU01:FS01:Exception: Could not get an assignment
19:49:38:WU00:FS00:0xa7:Completed 63750 out of 125000 steps (51%)
19:51:35:WU01:FS01:Connecting to 65.254.110.245:8080
19:51:35:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
19:51:35:WU01:FS01:Connecting to 18.218.241.186:80
19:51:36:WU01:FS01:Assigned to work server 128.252.203.10
19:51:36:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:TU104 [GeForce RTX 2080 Super] from 128.252.203.10
19:51:36:WU01:FS01:Connecting to 128.252.203.10:8080
19:52:38:WU00:FS00:0xa7:Completed 65000 out of 125000 steps (52%)
19:53:20:ERROR:WU01:FS01:Exception: 10002: Received short response, expected 512 bytes, got 0
19:55:35:WU00:FS00:0xa7:Completed 66250 out of 125000 steps (53%)
19:55:49:WU01:FS01:Connecting to 65.254.110.245:8080
19:55:50:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
19:55:50:WU01:FS01:Connecting to 18.218.241.186:80
19:55:50:WU01:FS01:Assigned to work server 155.247.166.220
19:55:50:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:TU104 [GeForce RTX 2080 Super] from 155.247.166.220
19:55:50:WU01:FS01:Connecting to 155.247.166.220:8080
19:55:51:ERROR:WU01:FS01:Exception: 10001: Server responded: HTTP_SERVICE_UNAVAILABLE
19:58:33:WU00:FS00:0xa7:Completed 67500 out of 125000 steps (54%)
20:01:27:WU00:FS00:0xa7:Completed 68750 out of 125000 steps (55%)

Re: Can't get work for GPU

Posted: Thu Mar 19, 2020 8:23 pm
by Seneca
Yeehaa ! Looks like it got fixed ... just got work for GPU, too.

Nonetheless: Any hint why there are so log gaps w/o work while the serer stats tell there's much work around ?

Re: Can't get work for GPU

Posted: Thu Mar 19, 2020 9:09 pm
by Joe_H
The Work Servers (WS) can only handle so many connections at a time, that includes for download and for upload of WU's. They dis add two large WS's on Azure yesterday and it looks they are starting to see what their max rate looks like from my latest look at the server status page.