Bad Work Unit for Project 12214

SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070


Post by SandyG »

I've seen bad work units pop up a few times for project 12214.

I had never seen this error before today, and the hardware seems to be working OK. Here are log entries from two of the errors.

Both cards are 4090s (FS03 and FS04), and this is on a Linux box running Mint (Ubuntu-based). Card temps look good, and other work units have been processing fine.

Both are from today, and I don't see any other reports for this project, so it might be me, but I'm not sure. Just a heads up, I guess.

02:28:33:WU02:FS03:0x22:ERROR:Kinetic energy error of 1719.85, threshold of 800
02:28:33:WU02:FS03:0x22:ERROR:Reference Kinetic Energy: 787018 | Given Kinetic Energy: 788738
02:28:33:WU02:FS03:0x22:Saving result file ../logfile_01.txt
02:28:33:WU02:FS03:0x22:Saving result file science.log
02:28:33:WU02:FS03:0x22:Saving result file state.xml.bz2
02:28:33:WU02:FS03:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
02:28:34:WARNING:WU02:FS03:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:28:34:WU02:FS03:Sending unit results: id:02 state:SEND error:FAULTY project:12214 run:0 clone:122 gen:0 core:0x22 unit:0x0000007a0000000000002fb600000000
...
02:28:34:WU02:FS03:Uploading 20.94MiB to 158.130.118.23
02:28:34:WU02:FS03:Connecting to 158.130.118.23:8080
02:28:34:WU04:FS03:Connecting to assign1.foldingathome.org:80
02:28:35:WU04:FS03:Assigned to work server 129.32.209.202
02:28:35:WU04:FS03:Requesting new work unit for slot 03: gpu:182:0 AD102 [GeForce RTX 4090] from 129.32.209.202
02:28:35:WU04:FS03:Connecting to 129.32.209.202:8080
02:28:35:WU06:FS03:Upload 52.36%
02:28:36:WU04:FS03:Downloading 59.71MiB
02:28:37:WU00:FS02:0x22:Completed 437500 out of 1250000 steps (35%)
02:28:37:WU00:FS02:0x22:Checkpoint completed at step 437500
02:28:40:WU02:FS03:Upload 8.06%
02:28:40:WU03:FS04:0x22:Completed 500000 out of 2500000 steps (20%)
02:28:40:WU03:FS04:0x22:Checkpoint completed at step 500000
02:28:42:WU04:FS03:Download 0.63%
02:28:42:WU06:FS03:Upload 62.95%
02:28:47:WU02:FS03:Upload 16.41%
02:28:48:WU04:FS03:Download 3.45%
02:28:48:WU06:FS03:Upload 70.37%
02:28:48:WU00:FS02:0x22:Completed 450000 out of 1250000 steps (36%)
02:28:53:WU02:FS03:Upload 27.46%
02:28:54:WU04:FS03:Download 6.59%
02:28:54:WU06:FS03:Upload 77.58%
02:28:59:WU01:FS01:0x22:Completed 1900000 out of 2500000 steps (76%)
02:28:59:WU01:FS01:0x22:Checkpoint completed at step 1900000
02:29:00:WU02:FS03:Upload 39.39%
02:29:00:WU00:FS02:0x22:Completed 462500 out of 1250000 steps (37%)
02:29:00:WU04:FS03:Download 9.42%
02:29:01:WU06:FS03:Upload 85.21%
02:29:03:WU03:FS04:0x22:Completed 525000 out of 2500000 steps (21%)
02:29:06:WU04:FS03:Download 11.62%
02:29:06:WU02:FS03:Upload 50.14%
02:29:07:WU06:FS03:Upload 92.84%
02:29:11:WU00:FS02:0x22:Completed 475000 out of 1250000 steps (38%)
02:29:12:WU04:FS03:Download 14.24%
02:29:13:WU02:FS03:Upload 60.88%
02:29:18:WU04:FS03:Download 17.79%
02:29:19:WU02:FS03:Upload 74.01%
02:29:20:WU06:FS03:Upload complete
02:29:20:WU06:FS03:Server responded WORK_ACK (400)
02:29:20:WU06:FS03:Final credit estimate, 389316.00 points
02:29:20:WU06:FS03:Cleaning up
02:29:22:WU00:FS02:0x22:Completed 487500 out of 1250000 steps (39%)
02:29:24:WU04:FS03:Download 21.35%
02:29:25:WU02:FS03:Upload 96.09%
02:29:25:WU03:FS04:0x22:Completed 550000 out of 2500000 steps (22%)
02:29:26:WU03:FS04:0x22:Checkpoint completed at step 550000
02:29:29:WU02:FS03:Upload complete
02:29:29:WU02:FS03:Server responded WORK_ACK (400)
02:29:29:WU02:FS03:Cleaning up


02:59:33:WU01:FS04:0x22:ERROR:Kinetic energy error of 1719.85, threshold of 800
02:59:33:WU01:FS04:0x22:ERROR:Reference Kinetic Energy: 787018 | Given Kinetic Energy: 788738
02:59:33:WU01:FS04:0x22:Saving result file ../logfile_01.txt
02:59:33:WU01:FS04:0x22:Saving result file science.log
02:59:33:WU01:FS04:0x22:Saving result file state.xml.bz2
02:59:33:WU01:FS04:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
02:59:33:WARNING:WU01:FS04:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:59:33:WU01:FS04:Sending unit results: id:01 state:SEND error:FAULTY project:12214 run:0 clone:143 gen:0 core:0x22 unit:0x0000008f0000000000002fb600000000
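
For anyone who wants to pull the failed units out of a longer log before reporting them, here is a minimal sketch in Python. It only assumes the line layout visible in the excerpts above; the function name `faulty_units` and the regular expression are my own illustration and are not part of the Folding@home client.

```python
import re

# Matches the "Sending unit results" line as it appears in the log above.
# The field layout is taken from these excerpts only (an assumption).
FAULTY_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):(?P<wu>WU\d+):(?P<slot>FS\d+):"
    r"Sending unit results:.*?error:(?P<error>\w+) "
    r"project:(?P<project>\d+) run:(?P<run>\d+) "
    r"clone:(?P<clone>\d+) gen:(?P<gen>\d+)"
)


def faulty_units(lines):
    """Return one dict per work unit that was sent back with error:FAULTY."""
    found = []
    for line in lines:
        match = FAULTY_RE.match(line.strip())
        if match and match.group("error") == "FAULTY":
            found.append({
                "time": match.group("time"),
                "slot": match.group("slot"),
                "project": int(match.group("project")),
                "run": int(match.group("run")),
                "clone": int(match.group("clone")),
                "gen": int(match.group("gen")),
            })
    return found


# Sample lines copied from the log excerpts in this post.
sample = [
    "02:28:33:WU02:FS03:0x22:ERROR:Kinetic energy error of 1719.85, threshold of 800",
    "02:28:34:WU02:FS03:Sending unit results: id:02 state:SEND error:FAULTY project:12214 run:0 clone:122 gen:0 core:0x22 unit:0x0000007a0000000000002fb600000000",
    "02:29:20:WU06:FS03:Server responded WORK_ACK (400)",
    "02:59:33:WU01:FS04:Sending unit results: id:01 state:SEND error:FAULTY project:12214 run:0 clone:143 gen:0 core:0x22 unit:0x0000008f0000000000002fb600000000",
]

for unit in faulty_units(sample):
    print(unit)
```

The project, run, clone, and gen numbers are usually what a researcher needs to look up the affected unit, so the sketch keeps just those plus the slot and timestamp.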
toTOW
Site Moderator
Posts: 6349
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: Bad Work Unit for Project 12214

Post by toTOW »

Reported to the researcher.

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
lgsmith
Scientist
Posts: 12
Joined: Fri Jan 20, 2023 10:49 pm

Re: Bad Work Unit for Project 12214

Post by lgsmith »

I inspected some of the logs I'm getting back and it looks like this system is running into the 'kinetic energy' bug that is present in the version of OpenMM that's used by core 0x22. I'd changed the settings to address these issues for most of my projects, but apparently this one needed to be _higher_. Thanks for letting us know!
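
As a rough illustration of what the two ERROR lines in the report are saying, the sketch below parses them and compares the numbers. It assumes the reported error is simply the absolute difference between the given and reference kinetic energies, which matches the logged values to within rounding. That is my reading of the log, not a statement of how core 0x22 computes the check, and the function name `check_kinetic_energy` is hypothetical.

```python
import re

# Patterns based only on the two ERROR lines quoted earlier in the thread.
KE_ERROR_RE = re.compile(
    r"Kinetic energy error of (?P<err>[\d.]+), threshold of (?P<thr>[\d.]+)"
)
KE_VALUES_RE = re.compile(
    r"Reference Kinetic Energy: (?P<ref>[\d.]+) \| "
    r"Given Kinetic Energy: (?P<given>[\d.]+)"
)


def check_kinetic_energy(error_line, values_line):
    """Compare the logged error with |given - reference| and the threshold."""
    err = KE_ERROR_RE.search(error_line)
    vals = KE_VALUES_RE.search(values_line)
    if err is None or vals is None:
        raise ValueError("lines do not look like kinetic energy error entries")
    reported = float(err.group("err"))
    threshold = float(err.group("thr"))
    # Assumption: the error is the absolute difference of the two energies.
    recomputed = abs(float(vals.group("given")) - float(vals.group("ref")))
    return {
        "reported": reported,
        "recomputed": recomputed,
        "threshold": threshold,
        "exceeds_threshold": reported > threshold,
    }


result = check_kinetic_energy(
    "02:28:33:WU02:FS03:0x22:ERROR:Kinetic energy error of 1719.85, threshold of 800",
    "02:28:33:WU02:FS03:0x22:ERROR:Reference Kinetic Energy: 787018 | Given Kinetic Energy: 788738",
)
print(result)
```

With the logged values, the recomputed difference is 1720, against a reported error of 1719.85 and a threshold of 800, so the unit fails the check by a wide margin in absolute terms even though the two energies differ by well under one percent.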
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070


Post by SandyG »

lgsmith wrote: Sun Jul 30, 2023 9:19 pm I inspected some of the logs I'm getting back and it looks like this system is running into the 'kinetic energy' bug that is present in the version of OpenMM that's used by core 0x22. I'd changed the settings to address these issues for most of my projects, but apparently this one needed to be _higher_. Thanks for letting us know!
No problem, just glad it's something that is known and has a fix... and not my hardware :D

Glad to help!

Sandy