prjectID 16525 7 hrs then had total 12 1/2 complete

If you think it might be a driver problem, see viewforum.php?f=79

Moderators: Site Moderators, FAHC Science Team

Post Reply
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

prjectID 16525 7 hrs in then had total 12 1/2 complete.

jek81 id

assigned sometime early morning 8am chicago time. checked it few 6-7 hrs later, hfm client shows blank for completed.

How do i find out what happened.

Ain't about points but narrows this project down, 2 million pts. 12 1/2 hour prject.
Image
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

3060 TI overclocked 2115 MHz.
Image
bollix47
Posts: 2959
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by bollix47 »

P16525 is a slow one .... my 3060 ti takes over 12 hours to complete that project.

As far as how to check things we usually start with the log.txt file.

See viewtopic.php?t=26036
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

bollix47 wrote: Tue Nov 19, 2024 10:54 pm P16525 is a slow one .... my 3060 ti takes over 12 hours to complete that project.

As far as how to check things we usually start with the log.txt file.

See viewtopic.php?t=26036
Big project, but this dont make sense.

Code: Select all


15:54:27:WU01:FS01:0x23:Project: 16525 (Run 239, Clone 2, Gen 33)
15:54:27:WU01:FS01:0x23:Digital signatures verified
15:54:27:WU01:FS01:0x23:Folding@home GPU Core23 Folding@home Core
15:54:27:WU01:FS01:0x23:Version 8.0.3
15:54:27:WU01:FS01:0x23:  Checkpoint write interval: 50000 steps (2%) [50 total]
15:54:27:WU01:FS01:0x23:  JSON viewer frame write interval: 25000 steps (1%) [100 total]
15:54:27:WU01:FS01:0x23:  XTC frame write interval: 25000 steps (1%) [100 total]
15:54:27:WU01:FS01:0x23:  Global context and integrator variables write interval: disabled
15:54:27:WU01:FS01:0x23:There are 4 platforms available.
15:54:27:WU01:FS01:0x23:Platform 0: Reference
15:54:27:WU01:FS01:0x23:Platform 1: CPU
15:54:27:WU01:FS01:0x23:Platform 2: OpenCL
15:54:27:WU01:FS01:0x23:  opencl-device 0 specified
15:54:27:WU01:FS01:0x23:Platform 3: CUDA
15:54:27:WU01:FS01:0x23:  cuda-device 0 specified
15:54:31:WU02:FS00:0xa9:Completed 171551 out of 250000 steps (68%)
15:55:13:WU01:FS01:0x23:Attempting to create CUDA context:
15:55:13:WU01:FS01:0x23:  Configuring platform CUDA
15:55:23:WU01:FS01:0x23:  Using CUDA on CUDA Platform and gpu 0
15:55:23:WU01:FS01:0x23:  GPU info: Platform: CUDA
15:55:23:WU01:FS01:0x23:  GPU info: PlatformIndex: 0
15:55:23:WU01:FS01:0x23:  GPU info: Device: NVIDIA GeForce RTX 3060 Ti
15:55:23:WU01:FS01:0x23:  GPU info: DeviceIndex: 0
15:55:23:WU01:FS01:0x23:  GPU info: Vendor: 0x10de
15:55:23:WU01:FS01:0x23:  GPU info: PCI: 01:00:00
15:55:23:WU01:FS01:0x23:  GPU info: Compute: 8.6
15:55:23:WU01:FS01:0x23:  GPU info: Driver: 12.3
15:55:23:WU01:FS01:0x23:  GPU info: GPU: true
15:55:24:WU01:FS01:0x23:Completed 0 out of 2500000 steps (0%)
15:55:26:WU01:FS01:0x23:Checkpoint completed at step 0
15:55:29:WARNING:WU02:FS00:Detected clock skew (1 mins 03 secs), I/O delay, laptop hibernation or other slowdown noted, adjusting time estimates
15:55:29:WARNING:WU01:FS01:Detected clock skew (1 mins 02 secs), I/O delay, laptop hibernation or other slowdown noted, adjusting time estimates
15:55:32:ERROR:Receive error: 10053: An established connection was aborted by the software in your host machine.
15:55:35:ERROR:Receive error: 10053: An established connection was aborted by the software in your host machine.
Image
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

rest

Code: Select all

15:55:55:9:127.0.0.1:New Web session
15:57:09:WU02:FS00:0xa9:Completed 172500 out of 250000 steps (69%)
16:03:02:WU01:FS01:0x23:Completed 25000 out of 2500000 steps (1%)
16:03:44:WU02:FS00:0xa9:Completed 175000 out of 250000 steps (70%)
16:10:10:WU02:FS00:0xa9:Completed 177500 out of 250000 steps (71%)
16:10:40:WU01:FS01:0x23:Completed 50000 out of 2500000 steps (2%)
16:10:43:WU01:FS01:0x23:Checkpoint completed at step 50000
16:16:41:WU02:FS00:0xa9:Completed 180000 out of 250000 steps (72%)
16:18:18:WU01:FS01:0x23:Completed 75000 out of 2500000 steps (3%)
16:23:08:WU02:FS00:0xa9:Completed 182500 out of 250000 steps (73%)
16:25:51:WU01:FS01:0x23:Completed 100000 out of 2500000 steps (4%)
16:25:54:WU01:FS01:0x23:Checkpoint completed at step 100000
16:29:36:WU02:FS00:0xa9:Completed 185000 out of 250000 steps (74%)
16:33:27:WU01:FS01:0x23:Completed 125000 out of 2500000 steps (5%)
16:36:01:WU02:FS00:0xa9:Completed 187500 out of 250000 steps (75%)
16:41:01:WU01:FS01:0x23:Completed 150000 out of 2500000 steps (6%)
16:41:04:WU01:FS01:0x23:Checkpoint completed at step 150000
16:42:28:WU02:FS00:0xa9:Completed 190000 out of 250000 steps (76%)
16:48:37:WU01:FS01:0x23:Completed 175000 out of 2500000 steps (7%)
16:48:56:WU02:FS00:0xa9:Completed 192500 out of 250000 steps (77%)
16:55:21:WU02:FS00:0xa9:Completed 195000 out of 250000 steps (78%)
16:56:10:WU01:FS01:0x23:Completed 200000 out of 2500000 steps (8%)
16:56:13:WU01:FS01:0x23:Checkpoint completed at step 200000
17:01:47:WU02:FS00:0xa9:Completed 197500 out of 250000 steps (79%)
17:03:47:WU01:FS01:0x23:Completed 225000 out of 2500000 steps (9%)
17:08:12:WU02:FS00:0xa9:Completed 200000 out of 250000 steps (80%)
17:11:20:WU01:FS01:0x23:Completed 250000 out of 2500000 steps (10%)
17:11:23:WU01:FS01:0x23:Checkpoint completed at step 250000
17:14:38:WU02:FS00:0xa9:Completed 202500 out of 250000 steps (81%)
17:18:56:WU01:FS01:0x23:Completed 275000 out of 2500000 steps (11%)
17:21:05:WU02:FS00:0xa9:Completed 205000 out of 250000 steps (82%)
17:26:29:WU01:FS01:0x23:Completed 300000 out of 2500000 steps (12%)
17:26:32:WU01:FS01:0x23:Checkpoint completed at step 300000
17:27:34:WU02:FS00:0xa9:Completed 207500 out of 250000 steps (83%)
17:34:03:WU02:FS00:0xa9:Completed 210000 out of 250000 steps (84%)
17:34:05:WU01:FS01:0x23:Completed 325000 out of 2500000 steps (13%)
17:40:28:WU02:FS00:0xa9:Completed 212500 out of 250000 steps (85%)
17:41:38:WU01:FS01:0x23:Completed 350000 out of 2500000 steps (14%)
17:41:42:WU01:FS01:0x23:Checkpoint completed at step 350000
17:46:52:WU02:FS00:0xa9:Completed 215000 out of 250000 steps (86%)
17:49:15:WU01:FS01:0x23:Completed 375000 out of 2500000 steps (15%)
17:53:15:WU02:FS00:0xa9:Completed 217500 out of 250000 steps (87%)
17:56:47:WU01:FS01:0x23:Completed 400000 out of 2500000 steps (16%)
17:56:50:WU01:FS01:0x23:Checkpoint completed at step 400000
17:59:37:WU02:FS00:0xa9:Completed 220000 out of 250000 steps (88%)
18:04:24:WU01:FS01:0x23:Completed 425000 out of 2500000 steps (17%)
18:05:58:WU02:FS00:0xa9:Completed 222500 out of 250000 steps (89%)
18:11:57:WU01:FS01:0x23:Completed 450000 out of 2500000 steps (18%)
18:12:00:WU01:FS01:0x23:Checkpoint completed at step 450000
18:12:22:WU02:FS00:0xa9:Completed 225000 out of 250000 steps (90%)
18:18:51:WU02:FS00:0xa9:Completed 227500 out of 250000 steps (91%)
18:19:33:WU01:FS01:0x23:Completed 475000 out of 2500000 steps (19%)
18:25:17:WU02:FS00:0xa9:Completed 230000 out of 250000 steps (92%)
18:27:06:WU01:FS01:0x23:Completed 500000 out of 2500000 steps (20%)
18:27:09:WU01:FS01:0x23:Checkpoint completed at step 500000
18:31:46:WU02:FS00:0xa9:Completed 232500 out of 250000 steps (93%)
18:34:42:WU01:FS01:0x23:Completed 525000 out of 2500000 steps (21%)
18:38:13:WU02:FS00:0xa9:Completed 235000 out of 250000 steps (94%)
18:42:15:WU01:FS01:0x23:Completed 550000 out of 2500000 steps (22%)
18:42:18:WU01:FS01:0x23:Checkpoint completed at step 550000
18:44:42:WU02:FS00:0xa9:Completed 237500 out of 250000 steps (95%)
18:49:51:WU01:FS01:0x23:Completed 575000 out of 2500000 steps (23%)
18:51:08:WU02:FS00:0xa9:Completed 240000 out of 250000 steps (96%)
18:57:24:WU01:FS01:0x23:Completed 600000 out of 2500000 steps (24%)
18:57:27:WU01:FS01:0x23:Checkpoint completed at step 600000
18:57:36:WU02:FS00:0xa9:Completed 242500 out of 250000 steps (97%)
19:04:08:WU02:FS00:0xa9:Completed 245000 out of 250000 steps (98%)
19:05:00:WU01:FS01:0x23:Completed 625000 out of 2500000 steps (25%)
19:10:33:WU02:FS00:0xa9:Completed 247500 out of 250000 steps (99%)
19:10:33:WU00:FS00:Connecting to assign1.foldingathome.org:80
19:10:33:WU00:FS00:Assigned to work server 144.121.86.56
Image
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

I'm sorry, rabbit hole.

Code: Select all

*********************** Log Started 2024-11-19T15:54:09Z ***********************
15:54:09:FS01:Initialized folding slot 01: gpu:1:0 GA104 [GeForce RTX 3060 Ti]
15:54:25:FS01:Unpaused
15:54:26:WU01:FS01:Starting
15:54:26:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\ProgramData\FAHClient\cores/cores.foldingathome.org/openmm-core-23/windows-10-64bit/release/0x23-8.0.3/Core_23.fah/FahCore_23.exe -dir 01 -suffix 01 -version 706 -lifeline 8572 -checkpoint 3 -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu-vendor nvidia -gpu 0 -gpu-usage 100
15:54:26:WU01:FS01:Started FahCore on PID 2456
15:54:26:WU01:FS01:Core PID:940
15:54:26:WU01:FS01:FahCore 0x23 started
15:54:27:WU01:FS01:0x23:*********************** Log Started 2024-11-19T15:54:26Z ***********************
15:54:27:WU01:FS01:0x23:*************************** Core23 Folding@home Core ***************************
15:54:27:WU01:FS01:0x23:       Core: Core23
15:54:27:WU01:FS01:0x23:       Type: 0x23
15:54:27:WU01:FS01:0x23:    Version: 8.0.3
15:54:27:WU01:FS01:0x23:     Author: Joseph Coffland <[email protected]>
15:54:27:WU01:FS01:0x23:  Copyright: 2022 foldingathome.org
15:54:27:WU01:FS01:0x23:   Homepage: https://foldingathome.org/
15:54:27:WU01:FS01:0x23:       Date: Aug 3 2023
15:54:27:WU01:FS01:0x23:       Time: 08:39:06
15:54:27:WU01:FS01:0x23:   Compiler: Visual C++
15:54:27:WU01:FS01:0x23:    Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
15:54:27:WU01:FS01:0x23:             -DOPENMM_VERSION="\"8.0.0\""
15:54:27:WU01:FS01:0x23:   Platform: win32 10
15:54:27:WU01:FS01:0x23:       Bits: 64
15:54:27:WU01:FS01:0x23:       Mode: Release
15:54:27:WU01:FS01:0x23:Maintainers: John Chodera <[email protected]> and Peter Eastman
15:54:27:WU01:FS01:0x23:             <[email protected]>
15:54:27:WU01:FS01:0x23:       Args: -dir 01 -suffix 01 -version 706 -lifeline 2456 -checkpoint 3
15:54:27:WU01:FS01:0x23:             -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu-vendor
15:54:27:WU01:FS01:0x23:             nvidia -gpu 0 -gpu-usage 100
15:54:27:WU01:FS01:0x23:************************************ libFAH ************************************
15:54:27:WU01:FS01:0x23:       Date: Aug 3 2023
15:54:27:WU01:FS01:0x23:       Time: 08:37:55
15:54:27:WU01:FS01:0x23:   Compiler: Visual C++
15:54:27:WU01:FS01:0x23:    Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
15:54:27:WU01:FS01:0x23:   Platform: win32 10
15:54:27:WU01:FS01:0x23:       Bits: 64
15:54:27:WU01:FS01:0x23:       Mode: Release
15:54:27:WU01:FS01:0x23:************************************ CBang *************************************
15:54:27:WU01:FS01:0x23:    Version: 1.7.2
15:54:27:WU01:FS01:0x23:     Author: Joseph Coffland <[email protected]>
15:54:27:WU01:FS01:0x23:        Org: Cauldron Development LLC
15:54:27:WU01:FS01:0x23:  Copyright: Cauldron Development LLC, 2003-2023
15:54:27:WU01:FS01:0x23:   Homepage: https://cauldrondevelopment.com/
15:54:27:WU01:FS01:0x23:    License: GPL 2+
15:54:27:WU01:FS01:0x23:       Date: Aug 3 2023
15:54:27:WU01:FS01:0x23:       Time: 08:37:14
15:54:27:WU01:FS01:0x23:   Compiler: Visual C++
15:54:27:WU01:FS01:0x23:    Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
15:54:27:WU01:FS01:0x23:   Platform: win32 10
15:54:27:WU01:FS01:0x23:       Bits: 64
15:54:27:WU01:FS01:0x23:       Mode: Release
15:54:27:WU01:FS01:0x23:************************************ System ************************************
15:54:27:WU01:FS01:0x23:        CPU: Intel(R) Core(TM) i5-10600K CPU @ 4.10GHz
15:54:27:WU01:FS01:0x23:     CPU ID: GenuineIntel Family 6 Model 165 Stepping 5
15:54:27:WU01:FS01:0x23:       CPUs: 12
15:54:27:WU01:FS01:0x23:     Memory: 31.93GiB
15:54:27:WU01:FS01:0x23:Free Memory: 28.45GiB
15:54:27:WU01:FS01:0x23:    Threads: WINDOWS_THREADS
15:54:27:WU01:FS01:0x23: OS Version: 6.2
15:54:27:WU01:FS01:0x23:Has Battery: false
15:54:27:WU01:FS01:0x23: On Battery: false
15:54:27:WU01:FS01:0x23: UTC Offset: -6
15:54:27:WU01:FS01:0x23:        PID: 940
15:54:27:WU01:FS01:0x23:        CWD: C:\ProgramData\FAHClient\work
15:54:27:WU01:FS01:0x23:       Exec: C:\ProgramData\FAHClient\cores\cores.foldingathome.org\openmm-core-23\windows-10-64bit\release\0x23-8.0.3\Core_23.fah\FahCore_23.exe
15:54:27:WU01:FS01:0x23:************************************ OpenMM ************************************
15:54:27:WU01:FS01:0x23:    Version: 8.0.0
15:54:27:WU01:FS01:0x23:********************************************************************************
15:54:27:WU01:FS01:0x23:Project: 16525 (Run 239, Clone 2, Gen 33)
15:54:27:WU01:FS01:0x23:Digital signatures verified
15:54:27:WU01:FS01:0x23:Folding@home GPU Core23 Folding@home Core
15:54:27:WU01:FS01:0x23:Version 8.0.3
15:54:27:WU01:FS01:0x23:  Checkpoint write interval: 50000 steps (2%) [50 total]
15:54:27:WU01:FS01:0x23:  JSON viewer frame write interval: 25000 steps (1%) [100 total]
15:54:27:WU01:FS01:0x23:  XTC frame write interval: 25000 steps (1%) [100 total]
15:54:27:WU01:FS01:0x23:  Global context and integrator variables write interval: disabled
15:54:27:WU01:FS01:0x23:There are 4 platforms available.
15:54:27:WU01:FS01:0x23:Platform 0: Reference
15:54:27:WU01:FS01:0x23:Platform 1: CPU
15:54:27:WU01:FS01:0x23:Platform 2: OpenCL
15:54:27:WU01:FS01:0x23:  opencl-device 0 specified
15:54:27:WU01:FS01:0x23:Platform 3: CUDA
15:54:27:WU01:FS01:0x23:  cuda-device 0 specified
15:55:13:WU01:FS01:0x23:Attempting to create CUDA context:
15:55:13:WU01:FS01:0x23:  Configuring platform CUDA
15:55:23:WU01:FS01:0x23:  Using CUDA on CUDA Platform and gpu 0
15:55:23:WU01:FS01:0x23:  GPU info: Platform: CUDA
15:55:23:WU01:FS01:0x23:  GPU info: PlatformIndex: 0
15:55:23:WU01:FS01:0x23:  GPU info: Device: NVIDIA GeForce RTX 3060 Ti
15:55:23:WU01:FS01:0x23:  GPU info: DeviceIndex: 0
15:55:23:WU01:FS01:0x23:  GPU info: Vendor: 0x10de
15:55:23:WU01:FS01:0x23:  GPU info: PCI: 01:00:00
15:55:23:WU01:FS01:0x23:  GPU info: Compute: 8.6
15:55:23:WU01:FS01:0x23:  GPU info: Driver: 12.3
15:55:23:WU01:FS01:0x23:  GPU info: GPU: true
15:55:24:WU01:FS01:0x23:Completed 0 out of 2500000 steps (0%)
15:55:26:WU01:FS01:0x23:Checkpoint completed at step 0
15:55:29:WARNING:WU01:FS01:Detected clock skew (1 mins 02 secs), I/O delay, laptop hibernation or other slowdown noted, adjusting time estimates
16:03:02:WU01:FS01:0x23:Completed 25000 out of 2500000 steps (1%)
16:10:40:WU01:FS01:0x23:Completed 50000 out of 2500000 steps (2%)
16:10:43:WU01:FS01:0x23:Checkpoint completed at step 50000
16:18:18:WU01:FS01:0x23:Completed 75000 out of 2500000 steps (3%)
16:25:51:WU01:FS01:0x23:Completed 100000 out of 2500000 steps (4%)
16:25:54:WU01:FS01:0x23:Checkpoint completed at step 100000
16:33:27:WU01:FS01:0x23:Completed 125000 out of 2500000 steps (5%)
16:41:01:WU01:FS01:0x23:Completed 150000 out of 2500000 steps (6%)
16:41:04:WU01:FS01:0x23:Checkpoint completed at step 150000
16:48:37:WU01:FS01:0x23:Completed 175000 out of 2500000 steps (7%)
16:56:10:WU01:FS01:0x23:Completed 200000 out of 2500000 steps (8%)
16:56:13:WU01:FS01:0x23:Checkpoint completed at step 200000
17:03:47:WU01:FS01:0x23:Completed 225000 out of 2500000 steps (9%)
17:11:20:WU01:FS01:0x23:Completed 250000 out of 2500000 steps (10%)
17:11:23:WU01:FS01:0x23:Checkpoint completed at step 250000
17:18:56:WU01:FS01:0x23:Completed 275000 out of 2500000 steps (11%)
17:26:29:WU01:FS01:0x23:Completed 300000 out of 2500000 steps (12%)
17:26:32:WU01:FS01:0x23:Checkpoint completed at step 300000
17:34:05:WU01:FS01:0x23:Completed 325000 out of 2500000 steps (13%)
17:41:38:WU01:FS01:0x23:Completed 350000 out of 2500000 steps (14%)
17:41:42:WU01:FS01:0x23:Checkpoint completed at step 350000
17:49:15:WU01:FS01:0x23:Completed 375000 out of 2500000 steps (15%)
17:56:47:WU01:FS01:0x23:Completed 400000 out of 2500000 steps (16%)
17:56:50:WU01:FS01:0x23:Checkpoint completed at step 400000
18:04:24:WU01:FS01:0x23:Completed 425000 out of 2500000 steps (17%)
18:11:57:WU01:FS01:0x23:Completed 450000 out of 2500000 steps (18%)
18:12:00:WU01:FS01:0x23:Checkpoint completed at step 450000
18:19:33:WU01:FS01:0x23:Completed 475000 out of 2500000 steps (19%)
18:27:06:WU01:FS01:0x23:Completed 500000 out of 2500000 steps (20%)
18:27:09:WU01:FS01:0x23:Checkpoint completed at step 500000
18:34:42:WU01:FS01:0x23:Completed 525000 out of 2500000 steps (21%)
18:42:15:WU01:FS01:0x23:Completed 550000 out of 2500000 steps (22%)
18:42:18:WU01:FS01:0x23:Checkpoint completed at step 550000
18:49:51:WU01:FS01:0x23:Completed 575000 out of 2500000 steps (23%)
18:57:24:WU01:FS01:0x23:Completed 600000 out of 2500000 steps (24%)
18:57:27:WU01:FS01:0x23:Checkpoint completed at step 600000
19:05:00:WU01:FS01:0x23:Completed 625000 out of 2500000 steps (25%)
19:11:57:WU01:FS01:0x23:An exception occurred at step 647618: Error invoking kernel: CUDA_ERROR_LAUNCH_TIMEOUT (702)
19:11:57:WU01:FS01:0x23:ERROR:98: Attempting to restart from last good checkpoint by restarting core.
19:11:57:WU01:FS01:0x23:Folding@home Core Shutdown: CORE_RESTART
19:12:00:WARNING:WU01:FS01:FahCore returned an unknown error code which probably indicates that it crashed
19:12:00:WARNING:WU01:FS01:FahCore returned: UNKNOWN_ENUM (-1073740791 = 0xc0000409)
Image
BobWilliams757
Posts: 521
Joined: Fri Apr 03, 2020 2:22 pm
Hardware configuration: ASRock X370M PRO4
Ryzen 2400G APU
16 GB DDR4-3200
MSI GTX 1660 Super Gaming X

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by BobWilliams757 »

The project crashed multiple times and failed. Though it can happen to anyone at times, it's often an indication of unstable hardware or software. In this case the next person completed it, which once again points to some type of instabllity on your system.

https://apps.foldingathome.org/wu ... 33

Overclocking on F@H can create instabilities that don't show up in benchmarks or games, and most that do it tend to have milder overclocks, and then they watch for any errors in the logs and adjust from there. Chances are that same work unit running at stock clocks would have completed unless there are hardware issues in the system.

Usually HFM will show failed work units along with completed ones if you look in the "Tools" then "work unit history tab". I've had a few work units somehow sneak through HFM over time, but I'm not sure what the cause is.
Fold them if you get them!
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

BobWilliams757 wrote: Wed Nov 20, 2024 10:39 pm The project crashed multiple times and failed. Though it can happen to anyone at times, it's often an indication of unstable hardware or software. In this case the next person completed it, which once again points to some type of instabllity on your system.

https://apps.foldingathome.org/wu ... 33

Overclocking on F@H can create instabilities that don't show up in benchmarks or games, and most that do it tend to have milder overclocks, and then they watch for any errors in the logs and adjust from there. Chances are that same work unit running at stock clocks would have completed unless there are hardware issues in the system.

Usually HFM will show failed work units along with completed ones if you look in the "Tools" then "work unit history tab". I've had a few work units somehow sneak through HFM over time, but I'm not sure what the cause is.

Can i view all failed wu's? on https://apps.foldingathome.org/cpu?q=jek81
Image
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

JEK81 wrote: Thu Nov 21, 2024 1:46 am
BobWilliams757 wrote: Wed Nov 20, 2024 10:39 pm The project crashed multiple times and failed. Though it can happen to anyone at times, it's often an indication of unstable hardware or software. In this case the next person completed it, which once again points to some type of instabllity on your system.

https://apps.foldingathome.org/wu ... 33

Overclocking on F@H can create instabilities that don't show up in benchmarks or games, and most that do it tend to have milder overclocks, and then they watch for any errors in the logs and adjust from there. Chances are that same work unit running at stock clocks would have completed unless there are hardware issues in the system.

Usually HFM will show failed work units along with completed ones if you look in the "Tools" then "work unit history tab". I've had a few work units somehow sneak through HFM over time, but I'm not sure what the cause is.

Can i view all failed wu's? on https://apps.foldingathome.org/cpu?q=jek81
yup thats how i determined failed, on that 165 project, i see click finished column twice shows blank as failed.
Image
BobWilliams757
Posts: 521
Joined: Fri Apr 03, 2020 2:22 pm
Hardware configuration: ASRock X370M PRO4
Ryzen 2400G APU
16 GB DDR4-3200
MSI GTX 1660 Super Gaming X

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by BobWilliams757 »

You can check for completion of any work unit you have the PRCG for. You can get this from the logs or HFM if it logged the work unit. I *think* HFM requires a return to the server, and if the system crashes in a way where it doesn't return the work unit doesn't log. But you can also check PRCG in your logs as well, and then track them down.

You can check overall completion rates through the "Bonus Status" app. https://apps.foldingathome.org/bonus

In most cases HFM will identify where the problems are, if a trend exists. If you have work units failing with no trends that you can see, it could just be a general instability that crashes work units in a more random way. In the case of any overclocks, running for a period without an overclock oten shows the trend of the overclock creating the instability. Unlike many games and other benchmarks, folding will shut you done and return the work unit after errors..... it's not forgiving since science desires to be exacting.

If the system is overall stable for folding, reaching to a 95% or higher completion rate shouldn't be difficult. And that is accounting for the occasional user error of pausing it and forgetting to resume, long term power outages, a new driver with issues, etc. Stability always wins for folding. If it's stable, you can let it run for days, weeks, and even months without errors or very few errors.


ETA: The link you posted to "recent CPUs" will not show all bad work units, only if it's the most recent with that CPUID. If the work unit was bad or otherwise failed the "Got Bonus" column will show a 0 rather than a 1. For many people it only shows the most recent completed work units.
Fold them if you get them!
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

BobWilliams757 wrote: Thu Nov 21, 2024 3:04 am You can check for completion of any work unit you have the PRCG for. You can get this from the logs or HFM if it logged the work unit. I *think* HFM requires a return to the server, and if the system crashes in a way where it doesn't return the work unit doesn't log. But you can also check PRCG in your logs as well, and then track them down.

You can check overall completion rates through the "Bonus Status" app. https://apps.foldingathome.org/bonus

In most cases HFM will identify where the problems are, if a trend exists. If you have work units failing with no trends that you can see, it could just be a general instability that crashes work units in a more random way. In the case of any overclocks, running for a period without an overclock oten shows the trend of the overclock creating the instability. Unlike many games and other benchmarks, folding will shut you done and return the work unit after errors..... it's not forgiving since science desires to be exacting.

If the system is overall stable for folding, reaching to a 95% or higher completion rate shouldn't be difficult. And that is accounting for the occasional user error of pausing it and forgetting to resume, long term power outages, a new driver with issues, etc. Stability always wins for folding. If it's stable, you can let it run for days, weeks, and even months without errors or very few errors.


ETA: The link you posted to "recent CPUs" will not show all bad work units, only if it's the most recent with that CPUID. If the work unit was bad or otherwise failed the "Got Bonus" column will show a 0 rather than a 1. For many people it only shows the most recent completed work units.
85.47%. I believe im failing wu's. my system is stable.

Image
Last edited by JEK81 on Thu Nov 21, 2024 3:53 am, edited 1 time in total.
Image
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

Image
Image
JEK81
Posts: 41
Joined: Sun Apr 23, 2023 6:39 pm

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by JEK81 »

Passkey was displayed. 😞, on net now. camt do anything about it now. 😡@myself.
Image
BobWilliams757
Posts: 521
Joined: Fri Apr 03, 2020 2:22 pm
Hardware configuration: ASRock X370M PRO4
Ryzen 2400G APU
16 GB DDR4-3200
MSI GTX 1660 Super Gaming X

Re: prjectID 16525 7 hrs then had total 12 1/2 complete

Post by BobWilliams757 »

If your username is unique you don't need to enter a pass key to find out bonus status.

And if your system is stable, the percentage of completed work units versus uncompleted will be a higher percentage.
Fold them if you get them!
Post Reply