Page 1 of 1

WORK_QUIT for WU 11752 (Run 0, Clone 9163, Gen 14)

Posted: Sun May 24, 2020 6:48 pm
by emf
Dunno what happened with this WU. It completed well within it's timeout period and didn't seem to throw any errors during computation. It doesn't appear in the /wu query as having been completed by anyone else, either.

Code: Select all

12:49:48:WU02:FS00:Connecting to 18.218.241.186:80
12:49:48:WU02:FS00:Assigned to work server 140.163.4.231
12:49:48:WU02:FS00:Requesting new work unit for slot 00: READY gpu:0:GP106 [GeForce GTX 1060 6GB] 4372 from 140.163.4.231
12:49:48:WU02:FS00:Connecting to 140.163.4.231:8080
12:50:09:WU02:FS00:Downloading 13.15MiB
12:50:15:WU02:FS00:Download 25.66%
12:50:21:WU02:FS00:Download 53.23%
12:50:27:WU02:FS00:Download 80.32%
12:50:31:WU02:FS00:Download complete
12:50:31:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:11752 run:0 clone:9163 gen:14 core:0x22 unit:0x000000238ca304e75e6bbff6f50010ac
12:50:31:WU02:FS00:Starting
12:50:31:WU02:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/Core_22.fah/FahCore_22 -dir 02 -suffix 01 -version 706 -lifeline 1476 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
12:50:31:WU02:FS00:Started FahCore on PID 660
12:50:31:WU02:FS00:Core PID:664
12:50:31:WU02:FS00:FahCore 0x22 started
12:50:32:WU02:FS00:0x22:*********************** Log Started 2020-05-24T12:50:31Z ***********************
12:50:32:WU02:FS00:0x22:*************************** Core22 Folding@home Core ***************************
12:50:32:WU02:FS00:0x22:       Type: 0x22
12:50:32:WU02:FS00:0x22:       Core: Core22
12:50:32:WU02:FS00:0x22:    Website: https://foldingathome.org/
12:50:32:WU02:FS00:0x22:  Copyright: (c) 2009-2018 foldingathome.org
12:50:32:WU02:FS00:0x22:     Author: John Chodera <[email protected]> and Rafal Wiewiora
12:50:32:WU02:FS00:0x22:             <[email protected]>
12:50:32:WU02:FS00:0x22:       Args: -dir 02 -suffix 01 -version 706 -lifeline 660 -checkpoint 15
12:50:32:WU02:FS00:0x22:             -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
12:50:32:WU02:FS00:0x22:             0 -gpu 0
12:50:32:WU02:FS00:0x22:     Config: <none>
12:50:32:WU02:FS00:0x22:************************************ Build *************************************
12:50:32:WU02:FS00:0x22:    Version: 0.0.5
12:50:32:WU02:FS00:0x22:       Date: Apr 22 2020
12:50:32:WU02:FS00:0x22:       Time: 03:57:11
12:50:32:WU02:FS00:0x22: Repository: Git
12:50:32:WU02:FS00:0x22:   Revision: 2d69202c898bd9bb3e093f51cd32bf411c2a0388
12:50:32:WU02:FS00:0x22:     Branch: HEAD
12:50:32:WU02:FS00:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
12:50:32:WU02:FS00:0x22:    Options: -std=c++11 -O3 -funroll-loops
12:50:32:WU02:FS00:0x22:   Platform: linux2 4.19.76-linuxkit
12:50:32:WU02:FS00:0x22:       Bits: 64
12:50:32:WU02:FS00:0x22:       Mode: Release
12:50:32:WU02:FS00:0x22:************************************ System ************************************
12:50:32:WU02:FS00:0x22:        CPU: Intel(R) Xeon(R) CPU E5430 @ 2.66GHz
12:50:32:WU02:FS00:0x22:     CPU ID: GenuineIntel Family 6 Model 23 Stepping 6
12:50:32:WU02:FS00:0x22:       CPUs: 4
12:50:32:WU02:FS00:0x22:     Memory: 7.79GiB
12:50:32:WU02:FS00:0x22:Free Memory: 4.25GiB
12:50:32:WU02:FS00:0x22:    Threads: POSIX_THREADS
12:50:32:WU02:FS00:0x22: OS Version: 4.15
12:50:32:WU02:FS00:0x22:Has Battery: false
12:50:32:WU02:FS00:0x22: On Battery: false
12:50:32:WU02:FS00:0x22: UTC Offset: 0
12:50:32:WU02:FS00:0x22:        PID: 664
12:50:32:WU02:FS00:0x22:        CWD: /var/lib/fahclient/work
12:50:32:WU02:FS00:0x22:         OS: Linux 4.15.0-99-generic x86_64
12:50:32:WU02:FS00:0x22:    OS Arch: AMD64
12:50:32:WU02:FS00:0x22:********************************************************************************
12:50:32:WU02:FS00:0x22:Project: 11752 (Run 0, Clone 9163, Gen 14)
12:50:32:WU02:FS00:0x22:Unit: 0x000000238ca304e75e6bbff6f50010ac
12:50:32:WU02:FS00:0x22:Reading tar file core.xml
12:50:32:WU02:FS00:0x22:Reading tar file integrator.xml
12:50:32:WU02:FS00:0x22:Reading tar file state.xml
12:50:33:WU02:FS00:0x22:Reading tar file system.xml
12:50:34:WU02:FS00:0x22:Digital signatures verified
12:50:34:WU02:FS00:0x22:Folding@home GPU Core22 Folding@home Core
12:50:34:WU02:FS00:0x22:Version 0.0.5
12:51:02:WU02:FS00:0x22:Completed 0 out of 1000000 steps (0%)
12:51:02:WU02:FS00:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
12:54:11:WU02:FS00:0x22:Completed 10000 out of 1000000 steps (1%)
12:57:22:WU02:FS00:0x22:Completed 20000 out of 1000000 steps (2%)
13:00:33:WU02:FS00:0x22:Completed 30000 out of 1000000 steps (3%)
13:03:44:WU02:FS00:0x22:Completed 40000 out of 1000000 steps (4%)
13:06:56:WU02:FS00:0x22:Completed 50000 out of 1000000 steps (5%)
13:10:16:WU02:FS00:0x22:Completed 60000 out of 1000000 steps (6%)
13:13:28:WU02:FS00:0x22:Completed 70000 out of 1000000 steps (7%)
13:16:39:WU02:FS00:0x22:Completed 80000 out of 1000000 steps (8%)
13:19:51:WU02:FS00:0x22:Completed 90000 out of 1000000 steps (9%)
13:23:02:WU02:FS00:0x22:Completed 100000 out of 1000000 steps (10%)
13:26:23:WU02:FS00:0x22:Completed 110000 out of 1000000 steps (11%)
13:29:35:WU02:FS00:0x22:Completed 120000 out of 1000000 steps (12%)
13:32:46:WU02:FS00:0x22:Completed 130000 out of 1000000 steps (13%)
13:35:58:WU02:FS00:0x22:Completed 140000 out of 1000000 steps (14%)
13:39:09:WU02:FS00:0x22:Completed 150000 out of 1000000 steps (15%)
13:42:30:WU02:FS00:0x22:Completed 160000 out of 1000000 steps (16%)
13:45:42:WU02:FS00:0x22:Completed 170000 out of 1000000 steps (17%)
13:48:53:WU02:FS00:0x22:Completed 180000 out of 1000000 steps (18%)
13:52:04:WU02:FS00:0x22:Completed 190000 out of 1000000 steps (19%)
13:55:16:WU02:FS00:0x22:Completed 200000 out of 1000000 steps (20%)
13:58:37:WU02:FS00:0x22:Completed 210000 out of 1000000 steps (21%)
14:01:49:WU02:FS00:0x22:Completed 220000 out of 1000000 steps (22%)
14:05:00:WU02:FS00:0x22:Completed 230000 out of 1000000 steps (23%)
14:08:12:WU02:FS00:0x22:Completed 240000 out of 1000000 steps (24%)
14:11:24:WU02:FS00:0x22:Completed 250000 out of 1000000 steps (25%)
14:14:45:WU02:FS00:0x22:Completed 260000 out of 1000000 steps (26%)
14:17:56:WU02:FS00:0x22:Completed 270000 out of 1000000 steps (27%)
14:21:07:WU02:FS00:0x22:Completed 280000 out of 1000000 steps (28%)
14:24:19:WU02:FS00:0x22:Completed 290000 out of 1000000 steps (29%)
14:27:31:WU02:FS00:0x22:Completed 300000 out of 1000000 steps (30%)
14:30:52:WU02:FS00:0x22:Completed 310000 out of 1000000 steps (31%)
14:34:03:WU02:FS00:0x22:Completed 320000 out of 1000000 steps (32%)
14:37:14:WU02:FS00:0x22:Completed 330000 out of 1000000 steps (33%)
14:40:26:WU02:FS00:0x22:Completed 340000 out of 1000000 steps (34%)
14:43:39:WU02:FS00:0x22:Completed 350000 out of 1000000 steps (35%)
14:47:03:WU02:FS00:0x22:Completed 360000 out of 1000000 steps (36%)
14:50:18:WU02:FS00:0x22:Completed 370000 out of 1000000 steps (37%)
14:53:34:WU02:FS00:0x22:Completed 380000 out of 1000000 steps (38%)
14:56:49:WU02:FS00:0x22:Completed 390000 out of 1000000 steps (39%)
15:00:04:WU02:FS00:0x22:Completed 400000 out of 1000000 steps (40%)
15:03:28:WU02:FS00:0x22:Completed 410000 out of 1000000 steps (41%)
15:06:42:WU02:FS00:0x22:Completed 420000 out of 1000000 steps (42%)
15:09:58:WU02:FS00:0x22:Completed 430000 out of 1000000 steps (43%)
15:13:12:WU02:FS00:0x22:Completed 440000 out of 1000000 steps (44%)
15:16:27:WU02:FS00:0x22:Completed 450000 out of 1000000 steps (45%)
15:19:51:WU02:FS00:0x22:Completed 460000 out of 1000000 steps (46%)
15:23:06:WU02:FS00:0x22:Completed 470000 out of 1000000 steps (47%)
15:26:21:WU02:FS00:0x22:Completed 480000 out of 1000000 steps (48%)
15:29:36:WU02:FS00:0x22:Completed 490000 out of 1000000 steps (49%)
15:32:51:WU02:FS00:0x22:Completed 500000 out of 1000000 steps (50%)
15:36:15:WU02:FS00:0x22:Completed 510000 out of 1000000 steps (51%)
15:39:30:WU02:FS00:0x22:Completed 520000 out of 1000000 steps (52%)
15:42:44:WU02:FS00:0x22:Completed 530000 out of 1000000 steps (53%)
15:45:55:WU02:FS00:0x22:Completed 540000 out of 1000000 steps (54%)
15:49:06:WU02:FS00:0x22:Completed 550000 out of 1000000 steps (55%)
15:52:27:WU02:FS00:0x22:Completed 560000 out of 1000000 steps (56%)
15:55:39:WU02:FS00:0x22:Completed 570000 out of 1000000 steps (57%)
15:58:50:WU02:FS00:0x22:Completed 580000 out of 1000000 steps (58%)
16:02:02:WU02:FS00:0x22:Completed 590000 out of 1000000 steps (59%)
16:05:13:WU02:FS00:0x22:Completed 600000 out of 1000000 steps (60%)
16:08:34:WU02:FS00:0x22:Completed 610000 out of 1000000 steps (61%)
16:11:46:WU02:FS00:0x22:Completed 620000 out of 1000000 steps (62%)
16:14:58:WU02:FS00:0x22:Completed 630000 out of 1000000 steps (63%)
16:18:09:WU02:FS00:0x22:Completed 640000 out of 1000000 steps (64%)
16:21:21:WU02:FS00:0x22:Completed 650000 out of 1000000 steps (65%)
16:24:42:WU02:FS00:0x22:Completed 660000 out of 1000000 steps (66%)
16:27:53:WU02:FS00:0x22:Completed 670000 out of 1000000 steps (67%)
16:31:05:WU02:FS00:0x22:Completed 680000 out of 1000000 steps (68%)
16:34:17:WU02:FS00:0x22:Completed 690000 out of 1000000 steps (69%)
16:37:29:WU02:FS00:0x22:Completed 700000 out of 1000000 steps (70%)
16:40:49:WU02:FS00:0x22:Completed 710000 out of 1000000 steps (71%)
16:44:01:WU02:FS00:0x22:Completed 720000 out of 1000000 steps (72%)
16:47:13:WU02:FS00:0x22:Completed 730000 out of 1000000 steps (73%)
16:50:25:WU02:FS00:0x22:Completed 740000 out of 1000000 steps (74%)
16:53:37:WU02:FS00:0x22:Completed 750000 out of 1000000 steps (75%)
16:56:57:WU02:FS00:0x22:Completed 760000 out of 1000000 steps (76%)
17:00:09:WU02:FS00:0x22:Completed 770000 out of 1000000 steps (77%)
17:03:20:WU02:FS00:0x22:Completed 780000 out of 1000000 steps (78%)
17:06:32:WU02:FS00:0x22:Completed 790000 out of 1000000 steps (79%)
17:09:44:WU02:FS00:0x22:Completed 800000 out of 1000000 steps (80%)
17:13:04:WU02:FS00:0x22:Completed 810000 out of 1000000 steps (81%)
17:16:16:WU02:FS00:0x22:Completed 820000 out of 1000000 steps (82%)
17:19:28:WU02:FS00:0x22:Completed 830000 out of 1000000 steps (83%)
17:22:39:WU02:FS00:0x22:Completed 840000 out of 1000000 steps (84%)
17:25:51:WU02:FS00:0x22:Completed 850000 out of 1000000 steps (85%)
17:29:12:WU02:FS00:0x22:Completed 860000 out of 1000000 steps (86%)
17:32:24:WU02:FS00:0x22:Completed 870000 out of 1000000 steps (87%)
17:35:35:WU02:FS00:0x22:Completed 880000 out of 1000000 steps (88%)
17:38:47:WU02:FS00:0x22:Completed 890000 out of 1000000 steps (89%)
17:41:59:WU02:FS00:0x22:Completed 900000 out of 1000000 steps (90%)
17:45:20:WU02:FS00:0x22:Completed 910000 out of 1000000 steps (91%)
17:48:32:WU02:FS00:0x22:Completed 920000 out of 1000000 steps (92%)
17:51:43:WU02:FS00:0x22:Completed 930000 out of 1000000 steps (93%)
17:54:55:WU02:FS00:0x22:Completed 940000 out of 1000000 steps (94%)
17:58:07:WU02:FS00:0x22:Completed 950000 out of 1000000 steps (95%)
18:01:28:WU02:FS00:0x22:Completed 960000 out of 1000000 steps (96%)
18:04:40:WU02:FS00:0x22:Completed 970000 out of 1000000 steps (97%)
18:07:51:WU02:FS00:0x22:Completed 980000 out of 1000000 steps (98%)
18:11:03:WU02:FS00:0x22:Completed 990000 out of 1000000 steps (99%)
18:14:15:WU02:FS00:0x22:Completed 1000000 out of 1000000 steps (100%)
18:14:25:WU02:FS00:0x22:Saving result file ../logfile_01.txt
18:14:25:WU02:FS00:0x22:Saving result file checkpointState.xml
18:14:31:WU02:FS00:0x22:Saving result file checkpt.crc
18:14:31:WU02:FS00:0x22:Saving result file positions.xtc
18:14:34:WU02:FS00:0x22:Saving result file science.log
18:14:34:WU02:FS00:0x22:Folding@home Core Shutdown: FINISHED_UNIT
18:14:35:WU02:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
18:14:35:WU02:FS00:Sending unit results: id:02 state:SEND error:NO_ERROR project:11752 run:0 clone:9163 gen:14 core:0x22 unit:0x000000238ca304e75e6bbff6f50010ac
18:14:35:WU02:FS00:Uploading 24.34MiB to 140.163.4.231
18:14:35:WU02:FS00:Connecting to 140.163.4.231:8080
18:14:41:WU02:FS00:Upload 2.82%
18:14:48:WU02:FS00:Upload 4.62%
18:14:54:WU02:FS00:Upload 6.68%
18:15:01:WU02:FS00:Upload 8.47%
18:15:07:WU02:FS00:Upload 10.53%
18:15:14:WU02:FS00:Upload 12.84%
18:15:20:WU02:FS00:Upload 14.64%
18:15:27:WU02:FS00:Upload 16.95%
18:15:33:WU02:FS00:Upload 19.00%
18:15:39:WU02:FS00:Upload 20.80%
18:15:45:WU02:FS00:Upload 22.60%
18:15:51:WU02:FS00:Upload 24.65%
18:15:57:WU02:FS00:Upload 26.45%
18:16:03:WU02:FS00:Upload 28.50%
18:16:09:WU02:FS00:Upload 30.30%
18:16:16:WU02:FS00:Upload 32.10%
18:16:22:WU02:FS00:Upload 34.41%
18:16:28:WU02:FS00:Upload 36.46%
18:16:34:WU02:FS00:Upload 38.26%
18:16:40:WU02:FS00:Upload 40.31%
18:16:46:WU02:FS00:Upload 42.62%
18:16:52:WU02:FS00:Upload 44.68%
18:16:58:WU02:FS00:Upload 46.47%
18:17:04:WU02:FS00:Upload 48.53%
18:17:10:WU02:FS00:Upload 50.84%
18:17:16:WU02:FS00:Upload 52.89%
18:17:22:WU02:FS00:Upload 55.20%
18:17:28:WU02:FS00:Upload 57.26%
18:17:35:WU02:FS00:Upload 59.57%
18:17:42:WU02:FS00:Upload 62.14%
18:17:48:WU02:FS00:Upload 64.45%
18:17:54:WU02:FS00:Upload 66.50%
18:18:01:WU02:FS00:Upload 68.81%
18:18:07:WU02:FS00:Upload 71.12%
18:18:13:WU02:FS00:Upload 73.18%
18:18:19:WU02:FS00:Upload 75.49%
18:18:25:WU02:FS00:Upload 77.54%
18:18:32:WU02:FS00:Upload 79.85%
18:18:38:WU02:FS00:Upload 82.42%
18:18:44:WU02:FS00:Upload 84.22%
18:18:51:WU02:FS00:Upload 86.79%
18:18:57:WU02:FS00:Upload 89.10%
18:19:03:WU02:FS00:Upload 90.89%
18:19:10:WU02:FS00:Upload 93.46%
18:19:17:WU02:FS00:Upload 95.77%
18:19:24:WU02:FS00:Upload 96.80%
18:19:31:WU02:FS00:Upload 99.11%
18:19:35:WU02:FS00:Upload complete
18:19:36:WU02:FS00:Server responded WORK_QUIT (404)
18:19:36:WARNING:WU02:FS00:Server did not like results, dumping
18:19:36:WU02:FS00:Cleaning up

Re: WORK_QUIT for WU 11752 (Run 0, Clone 9163, Gen 14)

Posted: Mon May 25, 2020 9:01 am
by PantherX
This message:
18:19:36:WARNING:WU02:FS00:Server did not like results, dumping

Generally means that the validation checks on the Server end has failed for that WU. It could be a number of reasons like network issues between your client and server, hardware that's on the edge of stability, etc. If it was a server issue or accidental configuration, then there would be a lot more reports for that Server or Project.

Re: WORK_QUIT for WU 11752 (Run 0, Clone 9163, Gen 14)

Posted: Sat Jul 04, 2020 10:15 am
by Knish
wouldn't that previously mentioned project be completed by somebody by now? https://apps.foldingathome.org/wu#proje ... 163&gen=14

sry for the dumb questions, but what are the odds that a lot of ppl end up trying that WU and the server rejects it after they upload it for all of them?

... or for this one too? - https://apps.foldingathome.org/wu#proje ... 522&gen=41 (i uploaded that on 07-04T 07:02:09 Z but it won't be listed bc server rejected it)

Re: WORK_QUIT for WU 11752 (Run 0, Clone 9163, Gen 14)

Posted: Sat Jul 04, 2020 4:01 pm
by DRQ
OK, not the same project but the same group of projects and same server, I think:
"...
14:31:32:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
14:31:32:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:11751 run:0 clone:116 gen:39 core:0x22 unit:0x0000005c8ca304e75e6a80388a048582
14:31:32:WU01:FS00:Uploading 14.65MiB to 140.163.4.231
14:31:32:WU01:FS00:Connecting to 140.163.4.231:8080
14:31:38:WU01:FS00:Upload 20.47%
14:31:44:WU01:FS00:Upload 44.78%
14:31:50:WU01:FS00:Upload 69.09%
14:31:56:WU01:FS00:Upload 93.40%
14:31:57:WU01:FS00:Upload complete
14:31:58:WU01:FS00:Server responded WORK_QUIT (404)
14:31:58:WARNING:WU01:FS00:Server did not like results, dumping
..."

I'm posting this here because PantherX suggested there would be more reports if there was a problem; where would these be? Should I be reporting something?

Re: WORK_QUIT for WU 11752 (Run 0, Clone 9163, Gen 14)

Posted: Sat Jul 04, 2020 6:38 pm
by bruce
Knish wrote:wouldn't that previously mentioned project be completed by somebody by now?
If the server is "full" or for some other reason can't accept the uploads, the WU can't be completed until the problem is corrected.

A WU is not completed until the completed result is uploaded.

Re: WORK_QUIT for WU 11752 (Run 0, Clone 9163, Gen 14)

Posted: Sun Jul 05, 2020 7:59 am
by DRQ
bruce wrote:If the server is "full" or for some other reason can't accept the uploads, the WU can't be completed until the problem is corrected.

A WU is not completed until the completed result is uploaded.
If the server is full then why are more WUs being issued?

Perhaps this topic should be in the "Issues with a specific server" section.

Re: WORK_QUIT for WU 11752 (Run 0, Clone 9163, Gen 14)

Posted: Sun Jul 05, 2020 8:07 am
by DRQ
DRQ wrote:Perhaps this topic should be in the "Issues with a specific server" section.
Looks like there have been problems with this server already reported in "Can't upload to 140.163.4.231 again"

I see that some people already know this but adding a link so that everyone can follow if they wish.