cannot abort unit

Posted: Wed Mar 09, 2022 1:13 am
by Lamberto Vitali
We need a way to abort units. After my PC crashed I had to reboot, and now a unit is stuck: I cannot fold it and I cannot abort it. This software needs to be rewritten, it sucks.

Re: cannot abort unit

Posted: Wed Mar 09, 2022 6:00 am
by Whompithian
Expert CLI command (WARNING: this will dump ALL active work units and report them as dumped to the collection servers - all progress will be permanently lost):

Code:

systemctl stop FAHClient.service
FAHClient --config=/etc/fahclient/config.xml --dump all
This is for Linux. I am not sure what the equivalent would be for Windows.

Re: cannot abort unit

Posted: Wed Mar 09, 2022 8:40 am
by Lamberto Vitali
1 - I use Windows
2 - I want to dump only one work unit
3 - This should be in the GUI

Re: cannot abort unit

Posted: Wed Mar 09, 2022 8:48 am
by gunnarre
Perhaps you can post the relevant parts of the log?

2: You can try deleting the slot that is running the WU, watching the client dump it, and re-creating the slot afterwards.

3: No, it shouldn't, because otherwise people would habitually dump WUs whenever they didn't like the PPD projection, to get more cryptobucks from the schemes that reward folders. Instead, the client should ideally notice by itself that the WU has failed and auto-dump it without user intervention.

Re: cannot abort unit

Posted: Wed Mar 09, 2022 9:09 am
by Lamberto Vitali
2. I can do that and have done it before, but I shouldn't have to.
3. But you can pick COVID, for example, in your preferences to do the same thing.

Log (not sure if this shows it, and I'm not sure how to tell which parts of the log refer to which slots; the PC locked up and I had to power it off; the other slot kept running):

01:05:54:WU00:FS00:0xa8:Project: 18433 (Run 40, Clone 10, Gen 182)
01:05:54:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
01:05:54:WU00:FS00:0xa8:Digital signatures verified
01:05:54:WU00:FS00:0xa8:Calling: mdrun -c frame182.gro -s frame182.tpr -x frame182.xtc -cpi state.cpt -cpt 15 -nt 8 -ntmpi 1
01:05:54:WU00:FS00:0xa8:ERROR:Guru Meditation #caf469f2bd338b73.1a48148e815157d (280964.281439) '00/01/pullx.xvg'
01:05:55:WARNING:WU00:FS00:FahCore returned an unknown error code which probably indicates that it crashed
01:05:55:WARNING:WU00:FS00:FahCore returned: UNKNOWN_ENUM (-1073740791 = 0xc0000409)
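
On telling which log lines belong to which slot: every line in the excerpt above carries a WUxx:FSxx tag, where FSxx appears to be the folding slot and WUxx the work unit. A rough Python sketch that filters a log by slot number is below. It assumes the line layout seen in this thread (timestamp, optional WARNING/ERROR level, then the tags); the function name lines_for_slot and the third sample line are made up for illustration.

```python
import re

# Layout assumed from the excerpts in this thread:
#   HH:MM:SS:[WARNING:|ERROR:]WUxx:FSxx:message
LINE_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):"
    r"(?:(?P<level>WARNING|ERROR):)?"
    r"WU(?P<wu>\d+):FS(?P<slot>\d+):"
    r"(?P<message>.*)$"
)


def lines_for_slot(log_lines, slot):
    """Return the log lines tagged with the given folding slot number."""
    selected = []
    for line in log_lines:
        match = LINE_RE.match(line.strip())
        if match and int(match.group("slot")) == slot:
            selected.append(line.rstrip("\n"))
    return selected


if __name__ == "__main__":
    sample = [
        "01:05:54:WU00:FS00:0xa8:Digital signatures verified",
        "01:05:55:WARNING:WU00:FS00:FahCore returned: UNKNOWN_ENUM (-1073740791 = 0xc0000409)",
        # Hypothetical line for a second slot, not taken from the real log.
        "01:05:55:WU01:FS01:0x22:some message from the other slot",
    ]
    for line in lines_for_slot(sample, 0):
        print(line)
```

Lines without a WUxx:FSxx tag (client start-up messages, for instance) are simply skipped, so this only helps for per-slot output.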

Re: cannot abort unit

Posted: Wed Mar 09, 2022 4:42 pm
by toTOW
Can you post the lines that follow?

This is a corrupted checkpoint, but the client usually dumps the WU automatically after such errors.

Re: cannot abort unit

Posted: Thu Mar 10, 2022 3:39 am
by Lamberto Vitali
01:05:55:WU00:FS00:Starting
01:05:55:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\ProgramData\FAHClient\cores/cores.foldingathome.org/win/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8.exe -dir 00 -suffix 01 -version 706 -lifeline 12240 -checkpoint 15 -np 8
01:05:55:WU00:FS00:Started FahCore on PID 20176
01:05:55:WU00:FS00:Core PID:20184
01:05:55:WU00:FS00:FahCore 0xa8 started
01:05:56:WU00:FS00:0xa8:*********************** Log Started 2022-03-09T01:05:55Z ***********************
01:05:56:WU00:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
01:05:56:WU00:FS00:0xa8: Core: Gromacs
01:05:56:WU00:FS00:0xa8: Type: 0xa8
01:05:56:WU00:FS00:0xa8: Version: 0.0.12
01:05:56:WU00:FS00:0xa8: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
01:05:56:WU00:FS00:0xa8: Copyright: 2020 foldingathome.org
01:05:56:WU00:FS00:0xa8: Homepage: https://foldingathome.org/
01:05:56:WU00:FS00:0xa8: Date: Jan 16 2021
01:05:56:WU00:FS00:0xa8: Time: 12:29:40
01:05:56:WU00:FS00:0xa8: Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
01:05:56:WU00:FS00:0xa8: Branch: master
01:05:56:WU00:FS00:0xa8: Compiler: Visual C++ 2019 16.7
01:05:56:WU00:FS00:0xa8: Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
01:05:56:WU00:FS00:0xa8: Platform: win32 10
01:05:56:WU00:FS00:0xa8: Bits: 64
01:05:56:WU00:FS00:0xa8: Mode: Release
01:05:56:WU00:FS00:0xa8: SIMD: avx2_256
01:05:56:WU00:FS00:0xa8: OpenMP: ON
01:05:56:WU00:FS00:0xa8: CUDA: OFF
01:05:56:WU00:FS00:0xa8: Args: -dir 00 -suffix 01 -version 706 -lifeline 20176 -checkpoint 15 -np
01:05:56:WU00:FS00:0xa8: 8
01:05:56:WU00:FS00:0xa8:************************************ libFAH ************************************
01:05:56:WU00:FS00:0xa8: Date: Jan 16 2021
01:05:56:WU00:FS00:0xa8: Time: 11:24:13
01:05:56:WU00:FS00:0xa8: Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
01:05:56:WU00:FS00:0xa8: Branch: master
01:05:56:WU00:FS00:0xa8: Compiler: Visual C++ 2019 16.7
01:05:56:WU00:FS00:0xa8: Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
01:05:56:WU00:FS00:0xa8: Platform: win32 10
01:05:56:WU00:FS00:0xa8: Bits: 64
01:05:56:WU00:FS00:0xa8: Mode: Release
01:05:56:WU00:FS00:0xa8:************************************ CBang *************************************
01:05:56:WU00:FS00:0xa8: Date: Jan 16 2021
01:05:56:WU00:FS00:0xa8: Time: 11:23:53
01:05:56:WU00:FS00:0xa8: Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
01:05:56:WU00:FS00:0xa8: Branch: master
01:05:56:WU00:FS00:0xa8: Compiler: Visual C++ 2019 16.7
01:05:56:WU00:FS00:0xa8: Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
01:05:56:WU00:FS00:0xa8: Platform: win32 10
01:05:56:WU00:FS00:0xa8: Bits: 64
01:05:56:WU00:FS00:0xa8: Mode: Release
01:05:56:WU00:FS00:0xa8:************************************ System ************************************
01:05:56:WU00:FS00:0xa8: CPU: AMD Ryzen 9 3900XT 12-Core Processor
01:05:56:WU00:FS00:0xa8: CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
01:05:56:WU00:FS00:0xa8: CPUs: 24
01:05:56:WU00:FS00:0xa8: Memory: 63.93GiB
01:05:56:WU00:FS00:0xa8:Free Memory: 50.92GiB
01:05:56:WU00:FS00:0xa8: Threads: WINDOWS_THREADS
01:05:56:WU00:FS00:0xa8: OS Version: 6.2
01:05:56:WU00:FS00:0xa8:Has Battery: true
01:05:56:WU00:FS00:0xa8: On Battery: false
01:05:56:WU00:FS00:0xa8: UTC Offset: 0
01:05:56:WU00:FS00:0xa8: PID: 20184
01:05:56:WU00:FS00:0xa8: CWD: C:\ProgramData\FAHClient\work
01:05:56:WU00:FS00:0xa8:********************************************************************************
01:05:56:WU00:FS00:0xa8:Project: 18433 (Run 40, Clone 10, Gen 182)
01:05:56:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
01:05:56:WU00:FS00:0xa8:Digital signatures verified
01:05:56:WU00:FS00:0xa8:Calling: mdrun -c frame182.gro -s frame182.tpr -x frame182.xtc -cpi state.cpt -cpt 15 -nt 8 -ntmpi 1
01:05:56:WU00:FS00:0xa8:ERROR:Guru Meditation #caf469f2bd338b73.1a48148e815157d (280964.281439) '00/01/pullx.xvg'
01:05:56:WARNING:WU00:FS00:FahCore returned an unknown error code which probably indicates that it crashed
01:05:56:WARNING:WU00:FS00:FahCore returned: UNKNOWN_ENUM (-1073740791 = 0xc0000409)
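
For anyone puzzled by the two numbers in the last line: they are the same 32-bit value, printed once as a signed integer and once as unsigned hex. A small Python sketch of the conversion is below; the helper names are my own. As far as I know, 0xC0000409 is the Windows NTSTATUS code STATUS_STACK_BUFFER_OVERRUN, which only says how the core process died, not why.

```python
def to_unsigned32(code):
    """Reinterpret a signed 32-bit exit code as an unsigned 32-bit value."""
    return code & 0xFFFFFFFF


def to_signed32(value):
    """Reinterpret an unsigned 32-bit value as a signed 32-bit integer."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value & 0x80000000 else value


if __name__ == "__main__":
    print(hex(to_unsigned32(-1073740791)))  # 0xc0000409
    print(to_signed32(0xC0000409))          # -1073740791
```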

Re: cannot abort unit

Posted: Fri Mar 11, 2022 11:33 am
by toTOW
So it keeps repeating? :(

In Web Control or FAHControl, pause the folding slot.

Go to C:\ProgramData\FAHClient\work and delete the 00 folder. This gets rid of the corrupted work files.

Resume the folding slot.
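
If you would rather script the delete step, here is a minimal Python sketch. It only removes the folder; pausing and resuming the slot still has to be done by hand in FAHControl or Web Control. The function name remove_work_folder and the dry_run switch are my own inventions, and the path is the one given above, so adjust it if your data directory is elsewhere.

```python
import shutil
from pathlib import Path


def remove_work_folder(work_dir, folder_name="00", dry_run=True):
    """Delete one work unit folder under the client's work directory.

    Returns the path that was (or would be) removed, or None if it is absent.
    """
    # Accept a plain folder name only, so the work directory itself
    # (or anything outside it) can never be deleted by accident.
    if folder_name in ("", ".", "..") or Path(folder_name).name != folder_name:
        raise ValueError(f"not a plain folder name: {folder_name!r}")
    target = Path(work_dir) / folder_name
    if not target.is_dir():
        return None
    if not dry_run:
        shutil.rmtree(target)
    return target


if __name__ == "__main__":
    # Pause the folding slot first. dry_run=True only reports the folder;
    # switch to dry_run=False once the reported path looks right.
    found = remove_work_folder(r"C:\ProgramData\FAHClient\work", "00", dry_run=True)
    print("Would remove:", found) if found else print("Nothing to remove")
```

The dry run is the default on purpose: deleting the folder throws away all progress on that work unit, so it is worth seeing the path before anything is removed.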

Re: cannot abort unit

Posted: Fri Mar 11, 2022 12:17 pm
by Lamberto Vitali
Yes, it doesn't seem to cancel itself. Ah, your suggestions worked, thank you!