flaw with one of my GPU's resulting in BAD_WORK_UNI

Moderators: Site Moderators, FAHC Science Team

Post Reply
kevenstu01
Posts: 2
Joined: Mon Mar 30, 2020 1:53 pm

flaw with one of my GPU's resulting in BAD_WORK_UNI

Post by kevenstu01 »

hi there

i am contacting you to report a issue i encountered with the WU's that where assigned to the GPU of my ALIENWARE laptop that i added to my temporary folding aide in response to COVID-19

system: alienware m14x r1
GPU: Nvidia GT 555m

when i added my laptop the cpu started folding immediately no errors as of yet
but when the GPU got a WU it failed (it got assigned 2 maybe 3 WU's, all of which failed)

when i saw that the gpu would only fail i removed it from folding (to prevent it from unnecessarily hindering the research)
i suspected that the gpu might have an issue since it as trouble rendering game without a external monitor plugged in

(i set my logs to 0 to help preserve my laptop ssd but i did not know that PRCG would not be included in the WU fail error (which would be a good idea to add) so i apologize that i cannot tell you which one failed)

Code: Select all

13:20:36:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
13:20:57:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
13:20:58:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
13:21:18:WARNING:WU02:FS01:Exception: Failed to send results to work server: Failed to connect to 128.252.203.10:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
13:22:15:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
still i am glad that i can help by providing 4 slot to folding (1 GPU 3 CPU's) (not including the laptop GPU)
1x Intel Core2 Quad CPU Q8300 @ 2.50GHz (4 CPUs)
1x Intel Core i7-2670QM @ 2.20GHz (8 CPUs)
1x Intel Core i7-8700K CPU @ 3.70GHz (12 CPUs)
1x NVidia Geforce RTX 2070 SUPER
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: flaw with one of my GPU's resulting in BAD_WORK_UNI

Post by bruce »

welcome to foldingforum.org, kevenstu01

The PRCG numbers mayb be in a message just a few lines before the error message you posted. but they probably don't matter much. Three different WUs is a pattern.

Did the error occurr immediately when the WU started processing or was there a time-lag of several seconds (or more). The problem MIGHT be that If the WU is overheating your GPU by running it at 99% but that generally takes a little time to get up to a critical temperature.

It might also be a OS/FAH configuration issue on your GPU or even a defective GPU.

Post the beginning of FAH's log.txt.
kevenstu01
Posts: 2
Joined: Mon Mar 30, 2020 1:53 pm

Re: flaw with one of my GPU's resulting in BAD_WORK_UNI

Post by kevenstu01 »

hi

i am not trying to fix the GPU so it can fold since i might just be wasting my time and also be unnecessarily pulling WU's on a GPU that might never be able to run it.
let me explain:

i do not remember when it started but my GPU had started having issue running games on it(the game would see the card but would render off the CPU anyway, even when it was selected).
the only way i could get my GPU to run games would be to connect a monitor to one of the dedicated GPU ports (in this case HDMI or MINI Displayport) and switch the desktop to only that monitor, then the game would render of the GPU correctly.
since the issue started the computer was formatted and upgraded many times so i know that the issue is on a hardware level.

since my laptop is already short of thermal throttling (if its not already) on cpu load i decided that i would not risk crashing my computer.
(i prefer to run 1 stable slot then 2 unstable slots)

i reported it so that you may reassign the WU's (if you can figure out which one where assigned to this GPU)
i do not require assistance to fix it since i will not be assigning this GPU to be folding to prevent throttling and stability issue

as of the error the WU would almost immediately fail after starting (status would stay at 0.00%)

here is the entire logs since i installed it on my alienware laptop:

Code: Select all

*********************** Log Started 2020-03-30T12:29:34Z ***********************
12:29:35:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:29:35:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
12:29:35:ERROR:WU01:FS01:Exception: Could not get an assignment
12:29:35:ERROR:WU00:FS00:Exception: Server did not assign work unit
12:29:36:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:29:36:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:29:36:WARNING:WU00:FS00:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
12:29:36:ERROR:WU00:FS00:Exception: Could not get an assignment
12:29:37:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
12:29:37:ERROR:WU01:FS01:Exception: Could not get an assignment
12:30:36:WARNING:WU00:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:30:36:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:30:36:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
12:30:36:ERROR:WU01:FS01:Exception: Could not get an assignment
12:30:57:ERROR:Receive error: 10054: An existing connection was forcibly closed by the remote host.
12:32:13:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:32:13:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
12:32:13:ERROR:WU01:FS01:Exception: Could not get an assignment
12:34:50:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:34:50:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
12:34:50:ERROR:WU01:FS01:Exception: Could not get an assignment
12:39:05:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:39:05:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
12:39:05:ERROR:WU01:FS01:Exception: Could not get an assignment
12:45:56:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:45:56:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
12:45:56:ERROR:WU01:FS01:Exception: Could not get an assignment
12:57:02:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
12:57:02:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
12:57:02:ERROR:WU01:FS01:Exception: Could not get an assignment
13:14:59:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
13:16:06:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
13:16:07:WARNING:WU02:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
13:16:07:WARNING:WU02:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
13:16:07:ERROR:WU02:FS01:Exception: Could not get an assignment
13:16:07:WARNING:WU02:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
13:16:08:WARNING:WU02:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
13:16:08:ERROR:WU02:FS01:Exception: Could not get an assignment
13:17:07:WARNING:WU02:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
13:17:07:WARNING:WU02:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
13:17:07:ERROR:WU02:FS01:Exception: Could not get an assignment
13:20:36:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
13:20:57:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
13:20:58:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
13:21:18:WARNING:WU02:FS01:Exception: Failed to send results to work server: Failed to connect to 128.252.203.10:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
13:22:15:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
13:22:16:WARNING:WU02:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
13:22:16:WARNING:WU02:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
13:22:16:ERROR:WU02:FS01:Exception: Could not get an assignment
13:22:16:WARNING:WU02:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
13:22:17:WARNING:WU02:FS01:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
13:22:17:ERROR:WU02:FS01:Exception: Could not get an assignment
13:23:16:WARNING:WU00:FS00:Changed SMP threads from 7 to 8 this can cause some work units to fail
13:23:16:WARNING:WU00:FS00:AS lowered CPUs from 8 to 7
13:49:22:Saving configuration to config.xml
13:49:53:Saving configuration to config.xml
13:50:09:WU00:FS00:0xa7:Completed 750000 out of 2500000 steps (30%)
13:52:46:WU00:FS00:0xa7:Completed 775000 out of 2500000 steps (31%)
13:55:24:WU00:FS00:0xa7:Completed 800000 out of 2500000 steps (32%)
13:58:04:WU00:FS00:0xa7:Completed 825000 out of 2500000 steps (33%)
14:00:41:WU00:FS00:0xa7:Completed 850000 out of 2500000 steps (34%)
14:03:19:WU00:FS00:0xa7:Completed 875000 out of 2500000 steps (35%)
14:05:56:WU00:FS00:0xa7:Completed 900000 out of 2500000 steps (36%)
14:08:34:WU00:FS00:0xa7:Completed 925000 out of 2500000 steps (37%)
14:11:12:WU00:FS00:0xa7:Completed 950000 out of 2500000 steps (38%)
14:13:49:WU00:FS00:0xa7:Completed 975000 out of 2500000 steps (39%)
14:16:27:WU00:FS00:0xa7:Completed 1000000 out of 2500000 steps (40%)
14:19:05:WU00:FS00:0xa7:Completed 1025000 out of 2500000 steps (41%)
14:21:44:WU00:FS00:0xa7:Completed 1050000 out of 2500000 steps (42%)
14:24:23:WU00:FS00:0xa7:Completed 1075000 out of 2500000 steps (43%)
14:27:01:WU00:FS00:0xa7:Completed 1100000 out of 2500000 steps (44%)
14:29:39:WU00:FS00:0xa7:Completed 1125000 out of 2500000 steps (45%)
14:32:18:WU00:FS00:0xa7:Completed 1150000 out of 2500000 steps (46%)
14:34:56:WU00:FS00:0xa7:Completed 1175000 out of 2500000 steps (47%)
14:37:34:WU00:FS00:0xa7:Completed 1200000 out of 2500000 steps (48%)
14:40:11:WU00:FS00:0xa7:Completed 1225000 out of 2500000 steps (49%)
15:22:57:Saving configuration to config.xml
15:22:57:<config>
15:22:57:  <!-- Network -->
15:22:57:  <proxy v=':8080'/>
15:22:57:
15:22:57:  <!-- Remote Command Server -->
15:22:57:  <password v='**************'/>
15:22:57:
15:22:57:  <!-- Slot Control -->
15:22:57:  <power v='full'/>
15:22:57:
15:22:57:  <!-- User Information -->
15:22:57:  <passkey v='********************************'/>
15:22:57:  <team v='223518'/>
15:22:57:  <user v='kevenstu01'/>
15:22:57:
15:22:57:  <!-- Folding Slots -->
15:22:57:  <slot id='0' type='CPU'/>
15:22:57:</config>
15:23:25:Removing old file 'configs/config-20200330-121008.xml'
15:23:25:Saving configuration to config.xml
15:23:25:<config>
15:23:25:  <!-- Network -->
15:23:25:  <proxy v=':8080'/>
15:23:25:
15:23:25:  <!-- Remote Command Server -->
15:23:25:  <password v='**************'/>
15:23:25:
15:23:25:  <!-- Slot Control -->
15:23:25:  <power v='full'/>
15:23:25:
15:23:25:  <!-- User Information -->
15:23:25:  <passkey v='********************************'/>
15:23:25:  <team v='223518'/>
15:23:25:  <user v='kevenstu01'/>
15:23:25:
15:23:25:  <!-- Folding Slots -->
15:23:25:  <slot id='0' type='CPU'/>
15:23:25:</config>
15:24:52:WU00:FS00:0xa7:Completed 1650000 out of 2500000 steps (66%)
16:51:14:WARNING:WU01:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
16:51:14:WARNING:WU01:FS00:Failed to get assignment from '18.218.241.186:80': No WUs available for this configuration
16:51:14:ERROR:WU01:FS00:Exception: Could not get an assignment
16:51:15:WARNING:WU01:FS00:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
also i am sorry about the duplicate post i had an error when posting and thought it did not post as it did not appear at first (did not see the message that said that approval was required) so i deleted the post with no answer.

thank you for your time.
Post Reply