canot abort unit

Moderators: Site Moderators, FAHC Science Team

Post Reply
Lamberto Vitali
Posts: 80
Joined: Fri Feb 25, 2022 12:21 am

canot abort unit

Post by Lamberto Vitali »

we need way of abort units, after my pc crash i had to reboot and unit is stuck, cannot fold it cannot abort it, this software needs to be rewrite, it sux
Whompithian
Posts: 39
Joined: Thu Jun 25, 2020 12:40 am

Re: canot abort unit

Post by Whompithian »

Expert CLI command (WARNING: this will dump ALL active work units and report them as dumped to the collection servers - all progress will be permanently lost):

Code: Select all

systemctl stop FAHClient.service
FAHClient --config=/etc/fahclient/config.xml --dump all
This is for Linux. I am not sure what the equivalent would be for Windows.
Lamberto Vitali
Posts: 80
Joined: Fri Feb 25, 2022 12:21 am

Re: canot abort unit

Post by Lamberto Vitali »

1 - i use windows
2 - i want to dump one work unit only
3 - this should be in the gui
gunnarre
Posts: 559
Joined: Sun May 24, 2020 7:23 pm
Location: Norway

Re: canot abort unit

Post by gunnarre »

Perhaps you can post the relevant parts of the log?

2: You can try deleting the slot that is running the WU, watch the client dump it, and re-creating the slot afterwards.

3: No it shouldn't, because otherwise people would be habitually dumping WUs if they don't like the PPD projection, to get more cryptobucks from the schemes that reward folders. Instead, the client should ideally by itself notice that the WU has failed and auto-dump it without user intervention.
Image
Online: GTX 1660 Super + occasional CPU folding in the cold.
Offline: Radeon HD 7770, GTX 1050 Ti 4G OC, RX580
Lamberto Vitali
Posts: 80
Joined: Fri Feb 25, 2022 12:21 am

Re: canot abort unit

Post by Lamberto Vitali »

2. i can do that and have done it befor, but i shouldnt have to
3. but you can pick covid for exampl in your prefs to do the same thing

Log (not sure if thisshows it not sure how to see which bits of log refer to which slots - pc locked up had to power off - other slot kept running):

01:05:54:WU00:FS00:0xa8:Project: 18433 (Run 40, Clone 10, Gen 182)
01:05:54:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
01:05:54:WU00:FS00:0xa8:Digital signatures verified
01:05:54:WU00:FS00:0xa8:Calling: mdrun -c frame182.gro -s frame182.tpr -x frame182.xtc -cpi state.cpt -cpt 15 -nt 8 -ntmpi 1
01:05:54:WU00:FS00:0xa8:ERROR:Guru Meditation #caf469f2bd338b73.1a48148e815157d (280964.281439) '00/01/pullx.xvg'
01:05:55:WARNING:WU00:FS00:FahCore returned an unknown error code which probably indicates that it crashed
01:05:55:WARNING:WU00:FS00:FahCore returned: UNKNOWN_ENUM (-1073740791 = 0xc0000409)
toTOW
Site Moderator
Posts: 6373
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: canot abort unit

Post by toTOW »

Can you post the lines that follow ?

This is a corrupted checkpoint, but the client usually dump the WU automatically after such errors.
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Lamberto Vitali
Posts: 80
Joined: Fri Feb 25, 2022 12:21 am

Re: canot abort unit

Post by Lamberto Vitali »

01:05:55:WU00:FS00:Starting
01:05:55:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\ProgramData\FAHClient\cores/cores.foldingathome.org/win/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8.exe -dir 00 -suffix 01 -version 706 -lifeline 12240 -checkpoint 15 -np 8
01:05:55:WU00:FS00:Started FahCore on PID 20176
01:05:55:WU00:FS00:Core PID:20184
01:05:55:WU00:FS00:FahCore 0xa8 started
01:05:56:WU00:FS00:0xa8:*********************** Log Started 2022-03-09T01:05:55Z ***********************
01:05:56:WU00:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
01:05:56:WU00:FS00:0xa8: Core: Gromacs
01:05:56:WU00:FS00:0xa8: Type: 0xa8
01:05:56:WU00:FS00:0xa8: Version: 0.0.12
01:05:56:WU00:FS00:0xa8: Author: Joseph Coffland <[email protected]>
01:05:56:WU00:FS00:0xa8: Copyright: 2020 foldingathome.org
01:05:56:WU00:FS00:0xa8: Homepage: https://foldingathome.org/
01:05:56:WU00:FS00:0xa8: Date: Jan 16 2021
01:05:56:WU00:FS00:0xa8: Time: 12:29:40
01:05:56:WU00:FS00:0xa8: Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
01:05:56:WU00:FS00:0xa8: Branch: master
01:05:56:WU00:FS00:0xa8: Compiler: Visual C++ 2019 16.7
01:05:56:WU00:FS00:0xa8: Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
01:05:56:WU00:FS00:0xa8: Platform: win32 10
01:05:56:WU00:FS00:0xa8: Bits: 64
01:05:56:WU00:FS00:0xa8: Mode: Release
01:05:56:WU00:FS00:0xa8: SIMD: avx2_256
01:05:56:WU00:FS00:0xa8: OpenMP: ON
01:05:56:WU00:FS00:0xa8: CUDA: OFF
01:05:56:WU00:FS00:0xa8: Args: -dir 00 -suffix 01 -version 706 -lifeline 20176 -checkpoint 15 -np
01:05:56:WU00:FS00:0xa8: 8
01:05:56:WU00:FS00:0xa8:************************************ libFAH ************************************
01:05:56:WU00:FS00:0xa8: Date: Jan 16 2021
01:05:56:WU00:FS00:0xa8: Time: 11:24:13
01:05:56:WU00:FS00:0xa8: Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
01:05:56:WU00:FS00:0xa8: Branch: master
01:05:56:WU00:FS00:0xa8: Compiler: Visual C++ 2019 16.7
01:05:56:WU00:FS00:0xa8: Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
01:05:56:WU00:FS00:0xa8: Platform: win32 10
01:05:56:WU00:FS00:0xa8: Bits: 64
01:05:56:WU00:FS00:0xa8: Mode: Release
01:05:56:WU00:FS00:0xa8:************************************ CBang *************************************
01:05:56:WU00:FS00:0xa8: Date: Jan 16 2021
01:05:56:WU00:FS00:0xa8: Time: 11:23:53
01:05:56:WU00:FS00:0xa8: Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
01:05:56:WU00:FS00:0xa8: Branch: master
01:05:56:WU00:FS00:0xa8: Compiler: Visual C++ 2019 16.7
01:05:56:WU00:FS00:0xa8: Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
01:05:56:WU00:FS00:0xa8: Platform: win32 10
01:05:56:WU00:FS00:0xa8: Bits: 64
01:05:56:WU00:FS00:0xa8: Mode: Release
01:05:56:WU00:FS00:0xa8:************************************ System ************************************
01:05:56:WU00:FS00:0xa8: CPU: AMD Ryzen 9 3900XT 12-Core Processor
01:05:56:WU00:FS00:0xa8: CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
01:05:56:WU00:FS00:0xa8: CPUs: 24
01:05:56:WU00:FS00:0xa8: Memory: 63.93GiB
01:05:56:WU00:FS00:0xa8:Free Memory: 50.92GiB
01:05:56:WU00:FS00:0xa8: Threads: WINDOWS_THREADS
01:05:56:WU00:FS00:0xa8: OS Version: 6.2
01:05:56:WU00:FS00:0xa8:Has Battery: true
01:05:56:WU00:FS00:0xa8: On Battery: false
01:05:56:WU00:FS00:0xa8: UTC Offset: 0
01:05:56:WU00:FS00:0xa8: PID: 20184
01:05:56:WU00:FS00:0xa8: CWD: C:\ProgramData\FAHClient\work
01:05:56:WU00:FS00:0xa8:********************************************************************************
01:05:56:WU00:FS00:0xa8:Project: 18433 (Run 40, Clone 10, Gen 182)
01:05:56:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
01:05:56:WU00:FS00:0xa8:Digital signatures verified
01:05:56:WU00:FS00:0xa8:Calling: mdrun -c frame182.gro -s frame182.tpr -x frame182.xtc -cpi state.cpt -cpt 15 -nt 8 -ntmpi 1
01:05:56:WU00:FS00:0xa8:ERROR:Guru Meditation #caf469f2bd338b73.1a48148e815157d (280964.281439) '00/01/pullx.xvg'
01:05:56:WARNING:WU00:FS00:FahCore returned an unknown error code which probably indicates that it crashed
01:05:56:WARNING:WU00:FS00:FahCore returned: UNKNOWN_ENUM (-1073740791 = 0xc0000409)
toTOW
Site Moderator
Posts: 6373
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: canot abort unit

Post by toTOW »

So it keeps repeating ? :(

In Webcontrol or FAHControl, Pause folding slot.

Go to C:\ProgramData\FAHClient\work and delete 00 folder. It will get rid of this corrupted work files.

Resume folding slot.
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Lamberto Vitali
Posts: 80
Joined: Fri Feb 25, 2022 12:21 am

Re: canot abort unit

Post by Lamberto Vitali »

yes doesnt seem to cancel itself, ah your suggestions worked thankyou !
Post Reply