Page 1 of 2
CPUs not getting WS Assignments
Posted: Wed Jul 26, 2017 12:56 pm
by Jimbert
My Nvidia GTX 1080 Ti 11380 is working great, but my CPUs have been waiting for WS Assignments for at least a day. The computer is an Intel Core(TM) i7-7700K CPU @ 4.20GHz, Family 6 Model 158 Stepping 9 running Windows 10 Pro. The Folding@home client is Version 7.4.4
Here is the salient log file snippit.
09:03:03:WU00:FS00:Connecting to 171.67.108.45:8080
09:03:04:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.45:8080': Empty work server assignment
09:03:04:WU00:FS00:Connecting to 171.64.65.35:80
09:03:07:WARNING:WU00:FS00:Failed to get assignment from '171.64.65.35:80': Empty work server assignment
09:03:07:ERROR:WU00:FS00:Exception: Could not get an assignment
09:03:24:WU01:FS01:0x21:Completed 1300000 out of 2000000 steps (65%)
Is something wrong on my end or is it the server?
Jim
Re: CPUs not getting WS Assignments
Posted: Wed Jul 26, 2017 1:33 pm
by Joe_H
Most likely the cause is on your end. With a 4-core i7, the default settings on the Full slider setting will request CPU:7 assignments and there are few if any of those left. In the future that value will no longer be available. Either set the slider to Medium which will change the request to CPU:6, or set the CPU slot to 6 CPU threads using Configure in FAHControl.
The alternative is installing the public beta, version 7.4.16. That has new code to negotiate with the servers for the highest available CPU count up to the requested number submitted by the client.
Re: CPUs not getting WS Assignments
Posted: Wed Jul 26, 2017 3:34 pm
by JimboPalmer
Joe_H is right.
The back story is that F@H has issues with 'large' prime number of CPUs, (3 is OK, 5 is iffy, 7 and up are bad) and multiple of them (so 14 is bad even though it is not prime)
In the past, they tested each project and blocked each prime as it failed. Lately they have started projects already blocking primes and multiples of primes.
While they wrote code to avoid primes in your client, it is not yet the stable version. so not many folks have it, yet.
Re: CPUs not getting WS Assignments
Posted: Wed Jul 26, 2017 4:00 pm
by Jimbert
Thanks for your speedy reply.
If I set folding power to medium, will the GPU still run full tilt?
Jim
Re: CPUs not getting WS Assignments
Posted: Wed Jul 26, 2017 4:41 pm
by JimboPalmer
I would use the Advanced Client, then choose Configure, and Slots, then select the CPU slot and press Edit.
Under CPUs it is likely to have -1 selected, which lets the client choose, and I would change it to 6.
Then OK and Save, and then Exit if you are all done.
Re: CPUs not getting WS Assignments
Posted: Wed Jul 26, 2017 4:59 pm
by Jimbert
Got it - thank you guys. The CPUs started folding as soon as I changed to medium power and restarted the client. I just changed back to full power with CPU slots set to 6.
The Nvidia GTX 1080 Ti is a beast. I'm now folding @ (pun intended) close to or slightly over 1 million points per day.
Jim
Re: CPUs not getting WS Assignments
Posted: Thu Jul 27, 2017 12:04 am
by bruce
With 1 GPU and an 8-way CPU, Full power on FAHClient 7.4.4 sets CPUs to 7 and Medium sets it to 6.
The GPU is either folding or not folding -- no intermediate settings.
On FAHClient 7.4.16, the client and the server will negotiate the maximum setting that's safe.
If you're actually running that WU with 7 active CPUs, the chances of an early failure are likely. Please don't complain if it fails before reaching 100%. With a setting of 6, it's likely stable.
The whole point of the recent changes was to maximize the chances of a successful completion, even if it only uses 86% of your CPUs.
Actually, there's a minor improvement in GPU speed when your total CPU utilization is less than 100%.
Depending on several factors, you may be really running it with 6 or with 7.
Re: CPUs not getting WS Assignments
Posted: Thu Jan 31, 2019 9:47 pm
by Mactin
I have two CPU clients not folding for a few hours now, one Mac, one Windows.
From my Windows client :
Code: Select all
21:37:58:WU01:FS00:Connecting to assign3.stanford.edu:8080
21:37:59:WU01:FS00:News:
21:37:59:WU01:FS00:Assigned to work server 128.252.203.4
21:37:59:WU01:FS00:Requesting new work unit for slot 00: READY cpu:10 from 128.252.203.4
21:37:59:WU01:FS00:Connecting to 128.252.203.4:8080
21:37:59:ERROR:WU01:FS00:Exception: Server did not assign work unit
21:39:35:WU01:FS00:Connecting to assign3.stanford.edu:8080
21:39:35:WU01:FS00:News:
21:39:35:WU01:FS00:Assigned to work server 128.252.203.4
21:39:36:WU01:FS00:Requesting new work unit for slot 00: READY cpu:10 from 128.252.203.4
21:39:36:WU01:FS00:Connecting to 128.252.203.4:8080
21:39:36:ERROR:WU01:FS00:Exception: Server did not assign work unit
21:42:12:WU01:FS00:Connecting to assign3.stanford.edu:8080
21:42:13:WU01:FS00:News:
21:42:13:WU01:FS00:Assigned to work server 128.252.203.4
21:42:13:WU01:FS00:Requesting new work unit for slot 00: READY cpu:10 from 128.252.203.4
21:42:13:WU01:FS00:Connecting to 128.252.203.4:8080
21:42:13:ERROR:WU01:FS00:Exception: Server did not assign work unit
Re: CPUs not getting WS Assignments
Posted: Fri Feb 01, 2019 1:23 am
by JimboPalmer
Folks are going to look at the title and think "Oh we solved that". You are MUCH better served to start a new thread.
Including the first 100 lines of the Log can give those wise enough to help, the information they need to help.
Best of luck!
Re: CPUs not getting WS Assignments
Posted: Fri Feb 01, 2019 1:59 am
by Joe_H
I had this happen to one of my computers earlier today, removing client-type advanced from the configuration resulted in a WU download.
Re: CPUs not getting WS Assignments
Posted: Fri Feb 01, 2019 2:48 am
by Mactin
Joe, thanks.
Removing client-type got the Mac, Win10 and Win7 CPU clients going.
Re: CPUs not getting WS Assignments
Posted: Fri Feb 01, 2019 3:23 am
by Joe_H
I did some further checking, WS 128.252.203.4 went down earlier in the day and there wee some issues after it came back up. It appearss to be operating normally now, will look again later to see if there is an y update.
Re: CPUs not getting WS Assignments
Posted: Fri Feb 01, 2019 11:57 am
by goodyca
I have one client that is still unable to download from 128.252.203.4 and no other servers are tried. I do not have the client-type set.
Re: CPUs not getting WS Assignments
Posted: Fri Feb 01, 2019 12:01 pm
by goodyca
After I submitted the above posting, the client downloaded a unit from 128.252.203.2.
Re: CPUs not getting WS Assignments
Posted: Fri Feb 01, 2019 3:34 pm
by Mactin
During the night, all three of my CPU clients successfully downloaded WUs without client-type attributes.
This morning, I reinstated client-type for my Mac client. Upon finishing the WU, it failed to download a new WU. I soon noticed by the sound of the fan. I removed it and it connected to a different WS and got work.
Code: Select all
09:52:55:WU01:FS00:0xa7:Completed 2475000 out of 2500000 steps (99%)
09:55:03:WU01:FS00:0xa7:Completed 2500000 out of 2500000 steps (100%)
09:55:06:WU01:FS00:0xa7:Saving result file ../logfile_01.txt
09:55:06:WU01:FS00:0xa7:Saving result file frame40.trr
09:55:06:WU01:FS00:0xa7:Saving result file md.log
09:55:06:WU01:FS00:0xa7:Saving result file science.log
09:55:06:WU01:FS00:0xa7:Saving result file traj_comp.xtc
09:55:06:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
09:55:06:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14143 run:1 clone:77 gen:40 core:0xa7 unit:0x0000002d0002894c5bffef219ecac799
09:55:06:WU01:FS00:Uploading 3.55MiB to 155.247.166.220
09:55:06:WU01:FS00:Connecting to 155.247.166.220:8080
09:55:12:WU01:FS00:Upload complete
09:55:12:WU01:FS00:Server responded WORK_ACK (400)
09:55:12:WU01:FS00:Final credit estimate, 5163.00 points
09:55:12:WU01:FS00:Cleaning up
******************************* Date: 2019-02-01 *******************************
15:14:51:WU01:FS00:Connecting to 65.254.110.245:8080
15:14:51:WU01:FS00:Assigned to work server 128.252.203.4
15:14:51:WU01:FS00:Requesting new work unit for slot 00: RUNNING cpu:8 from 128.252.203.4
15:14:51:WU01:FS00:Connecting to 128.252.203.4:8080
15:14:52:ERROR:WU01:FS00:Exception: Server did not assign work unit
15:14:52:WU01:FS00:Connecting to 65.254.110.245:8080
15:14:53:WU01:FS00:Assigned to work server 128.252.203.4
15:14:53:WU01:FS00:Requesting new work unit for slot 00: RUNNING cpu:8 from 128.252.203.4
15:14:53:WU01:FS00:Connecting to 128.252.203.4:8080
15:14:53:ERROR:WU01:FS00:Exception: Server did not assign work unit
15:15:52:WU01:FS00:Connecting to 65.254.110.245:8080
15:15:53:WU01:FS00:Assigned to work server 128.252.203.4
15:15:53:WU01:FS00:Requesting new work unit for slot 00: RUNNING cpu:8 from 128.252.203.4
15:15:53:WU01:FS00:Connecting to 128.252.203.4:8080
15:15:53:ERROR:WU01:FS00:Exception: Server did not assign work unit
15:17:29:WU01:FS00:Connecting to 65.254.110.245:8080
15:17:30:WU01:FS00:Assigned to work server 128.252.203.4
15:17:30:WU01:FS00:Requesting new work unit for slot 00: RUNNING cpu:8 from 128.252.203.4
15:17:30:WU01:FS00:Connecting to 128.252.203.4:8080
15:17:30:ERROR:WU01:FS00:Exception: Server did not assign work unit
15:20:07:WU01:FS00:Connecting to 65.254.110.245:8080
15:20:07:WU01:FS00:Assigned to work server 128.252.203.4
15:20:07:WU01:FS00:Requesting new work unit for slot 00: READY cpu:8 from 128.252.203.4
15:20:07:WU01:FS00:Connecting to 128.252.203.4:8080
15:20:08:ERROR:WU01:FS00:Exception: Server did not assign work unit
15:21:07:WU01:FS00:Connecting to 65.254.110.245:8080
15:21:07:WU01:FS00:Assigned to work server 155.247.166.219
15:21:07:WU01:FS00:Requesting new work unit for slot 00: READY cpu:8 from 155.247.166.219
15:21:07:WU01:FS00:Connecting to 155.247.166.219:8080
15:21:09:WU01:FS00:Downloading 2.52MiB
15:21:14:WU01:FS00:Download complete
15:21:14:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:14096 run:102 clone:11 gen:5 core:0xa7 unit:0x000000050002894b5c005ea800cbccc7
15:21:14:WU01:FS00:Starting
15:21:14:WU01:FS00:Running FahCore: /usr/local/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/cores.foldingathome.org/OSX/AMD64/AVX/Core_a7.fah/FahCore_a7" -dir 01 -suffix 01 -version 705 -lifeline 67 -checkpoint 10 -np 8
15:21:14:WU01:FS00:Started FahCore on PID 1501
15:21:14:WU01:FS00:Core PID:1502
15:21:14:WU01:FS00:FahCore 0xa7 started
15:21:14:WU01:FS00:0xa7:*********************** Log Started 2019-02-01T15:21:14Z ***********************
15:21:14:WU01:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
15:21:14:WU01:FS00:0xa7: Type: 0xa7
15:21:14:WU01:FS00:0xa7: Core: Gromacs
15:21:14:WU01:FS00:0xa7: Website: https://foldingathome.org/
15:21:14:WU01:FS00:0xa7: Copyright: (c) 2009-2018 foldingathome.org
15:21:14:WU01:FS00:0xa7: Author: Joseph Coffland <[email protected]>
15:21:14:WU01:FS00:0xa7: Args: -dir 01 -suffix 01 -version 705 -lifeline 1501 -checkpoint 10 -np 8
15:21:14:WU01:FS00:0xa7: Config: <none>
15:21:14:WU01:FS00:0xa7:************************************ Build *************************************
15:21:14:WU01:FS00:0xa7: Version: 0.0.17
15:21:14:WU01:FS00:0xa7: Date: Apr 25 2018
15:21:14:WU01:FS00:0xa7: Time: 10:52:35
15:21:14:WU01:FS00:0xa7: Repository: Git
15:21:14:WU01:FS00:0xa7: Revision: fd11abfb405c921e66db1226933e9dd2d18d2acc
15:21:14:WU01:FS00:0xa7: Branch: master
15:21:14:WU01:FS00:0xa7: Compiler: GNU 4.2.1 Compatible Apple LLVM 7.3.0 (clang-703.0.31)
15:21:14:WU01:FS00:0xa7: Options: -std=gnu++98 -O3 -funroll-loops -mmacosx-version-min=10.6
15:21:14:WU01:FS00:0xa7: -Wno-unused-local-typedefs
15:21:14:WU01:FS00:0xa7: Platform: darwin 15.6.0
15:21:14:WU01:FS00:0xa7: Bits: 64
15:21:14:WU01:FS00:0xa7: Mode: Release
15:21:14:WU01:FS00:0xa7: SIMD: avx_256
15:21:14:WU01:FS00:0xa7:************************************ System ************************************
15:21:14:WU01:FS00:0xa7: CPU: Intel(R) Core(TM) i7-4770HQ CPU @ 2.20GHz
15:21:14:WU01:FS00:0xa7: CPU ID: GenuineIntel Family 6 Model 70 Stepping 1
15:21:14:WU01:FS00:0xa7: CPUs: 8
15:21:14:WU01:FS00:0xa7: Memory: 16.00GiB
15:21:14:WU01:FS00:0xa7:Free Memory: 3.84GiB
15:21:14:WU01:FS00:0xa7: Threads: POSIX_THREADS
15:21:14:WU01:FS00:0xa7: OS Version: 10.13
15:21:14:WU01:FS00:0xa7:Has Battery: true
15:21:14:WU01:FS00:0xa7: On Battery: false
15:21:14:WU01:FS00:0xa7: UTC Offset: -5
15:21:14:WU01:FS00:0xa7: PID: 1502
15:21:14:WU01:FS00:0xa7: CWD: /Library/Application Support/FAHClient/work
15:21:14:WU01:FS00:0xa7: OS: Darwin 17.7.0 x86_64
15:21:14:WU01:FS00:0xa7: OS Arch: AMD64
15:21:14:WU01:FS00:0xa7:********************************************************************************
15:21:14:WU01:FS00:0xa7:Project: 14096 (Run 102, Clone 11, Gen 5)
15:21:14:WU01:FS00:0xa7:Unit: 0x000000050002894b5c005ea800cbccc7
15:21:14:WU01:FS00:0xa7:Reading tar file core.xml
15:21:14:WU01:FS00:0xa7:Reading tar file frame5.tpr
15:21:14:WU01:FS00:0xa7:Digital signatures verified
15:21:14:WU01:FS00:0xa7:Calling: mdrun -s frame5.tpr -o frame5.trr -cpt 10 -nt 8
15:21:15:WU01:FS00:0xa7:Steps: first=2500000 total=500000
15:21:25:WU01:FS00:0xa7:Completed 1 out of 500000 steps (0%)
15:25:03:WU01:FS00:0xa7:Completed 5000 out of 500000 steps (1%)
15:28:31:WU01:FS00:0xa7:Completed 10000 out of 500000 steps (2%)
My Win7 client will finish in a little more than 75 mins. I will set the client-type to normal and report back.
EDIT: client-type to normal worked