
Bad wu from project 14520?

Posted: Tue Sep 29, 2020 6:27 pm
by gs60
Hello,

This just popped up. A new WU was downloaded from project 16805, so everything is off and running again. It appears the client retries the failing WU 5 times and then moves on.

Code:

*********************** Log Started 2020-09-28T21:50:21Z ***********************
21:50:21:Trying to access database...
21:50:21:Successfully acquired database lock
21:50:21:Read GPUs.txt
21:50:22:Enabled folding slot 00: PAUSED cpu:10 (by user)
21:50:22:Enabled folding slot 01: PAUSED gpu:0:Ellesmere XT [Radeon RX 470/480/570/580/590] (by user)
21:50:22:****************************** FAHClient ******************************
21:50:22:        Version: 7.6.13
21:50:22:         Author: Joseph Coffland <[email protected]>
21:50:22:      Copyright: 2020 foldingathome.org
21:50:22:       Homepage: https://foldingathome.org/
21:50:22:           Date: Apr 27 2020
21:50:22:           Time: 21:21:01
21:50:22:       Revision: 5a652817f46116b6e135503af97f18e094414e3b
21:50:22:         Branch: master
21:50:22:       Compiler: Visual C++ 2008
21:50:22:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
21:50:22:       Platform: win32 10
21:50:22:           Bits: 32
21:50:22:           Mode: Release
21:50:22:         Config: C:\Users\gary\AppData\Roaming\FAHClient\config.xml
21:50:22:******************************** CBang ********************************
21:50:22:           Date: Apr 24 2020
21:50:22:           Time: 17:07:55
21:50:22:       Revision: ea081a3b3b0f4a37c4d0440b4f1bc184197c7797
21:50:22:         Branch: master
21:50:22:       Compiler: Visual C++ 2008
21:50:22:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
21:50:22:       Platform: win32 10
21:50:22:           Bits: 32
21:50:22:           Mode: Release
21:50:22:******************************* System ********************************
21:50:22:            CPU: AMD Ryzen 5 3600 6-Core Processor
21:50:22:         CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
21:50:22:           CPUs: 12
21:50:22:         Memory: 7.93GiB
21:50:22:    Free Memory: 6.35GiB
21:50:22:        Threads: WINDOWS_THREADS
21:50:22:     OS Version: 6.2
21:50:22:    Has Battery: false
21:50:22:     On Battery: false
21:50:22:     UTC Offset: -7
21:50:22:            PID: 5956
21:50:22:            CWD: C:\Users\gary\AppData\Roaming\FAHClient
21:50:22:  Win32 Service: false
21:50:22:             OS: Windows 10 Home
21:50:22:        OS Arch: AMD64
21:50:22:           GPUs: 1
21:50:22:          GPU 0: Bus:8 Slot:0 Func:0 AMD:5 Ellesmere XT [Radeon RX
21:50:22:                 470/480/570/580/590]
21:50:22:           CUDA: Not detected: Failed to open dynamic library 'nvcuda.dll': The
21:50:22:                 specified module could not be found.
21:50:22:
21:50:22:OpenCL Device 0: Platform:0 Device:0 Bus:8 Slot:0 Compute:1.2 Driver:3110.7
21:50:22:******************************* libFAH ********************************
21:50:22:           Date: Apr 15 2020
21:50:22:           Time: 14:53:14
21:50:22:       Revision: 216968bc7025029c841ed6e36e81a03a316890d3
21:50:22:         Branch: master
21:50:22:       Compiler: Visual C++ 2008
21:50:22:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
21:50:22:       Platform: win32 10
21:50:22:           Bits: 32
21:50:22:           Mode: Release
21:50:22:***********************************************************************
21:50:22:<config>
21:50:22:  <!-- HTTP Server -->
21:50:22:  <allow v='127.0.0.1 192.168.43.0/24'/>
21:50:22:
21:50:22:  <!-- Network -->
21:50:22:  <proxy v=':8080'/>
21:50:22:
21:50:22:  <!-- Remote Command Server -->
21:50:22:  <command-allow-no-pass v='127.0.0.1 192.168.43.0/24'/>
21:50:22:
21:50:22:  <!-- User Information -->
21:50:22:  <passkey v='*****'/>
21:50:22:  <team v='259095'/>
21:50:22:  <user v='Gary_And_Shirley'/>
21:50:22:
21:50:22:  <!-- Folding Slots -->
21:50:22:  <slot id='0' type='CPU'>
21:50:22:    <paused v='true'/>
21:50:22:  </slot>
21:50:22:  <slot id='1' type='GPU'>
21:50:22:    <paused v='true'/>
21:50:22:  </slot>
21:50:22:</config>

Code:

18:19:36:WU01:FS00:FahCore 0xa7 started
18:19:36:WU01:FS00:0xa7:*********************** Log Started 2020-09-29T18:19:36Z ***********************
18:19:36:WU01:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
18:19:36:WU01:FS00:0xa7:       Type: 0xa7
18:19:36:WU01:FS00:0xa7:       Core: Gromacs
18:19:36:WU01:FS00:0xa7:       Args: -dir 01 -suffix 01 -version 706 -lifeline 4376 -checkpoint 15 -np
18:19:36:WU01:FS00:0xa7:             10
18:19:36:WU01:FS00:0xa7:************************************ CBang *************************************
18:19:36:WU01:FS00:0xa7:       Date: Nov 27 2019
18:19:36:WU01:FS00:0xa7:       Time: 03:40:09
18:19:36:WU01:FS00:0xa7:   Revision: d25803215b59272441049dfa05a0a9bf7a6e3c48
18:19:36:WU01:FS00:0xa7:     Branch: master
18:19:36:WU01:FS00:0xa7:   Compiler: Visual C++ 2008
18:19:36:WU01:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
18:19:36:WU01:FS00:0xa7:   Platform: win32 10
18:19:36:WU01:FS00:0xa7:       Bits: 64
18:19:36:WU01:FS00:0xa7:       Mode: Release
18:19:36:WU01:FS00:0xa7:************************************ System ************************************
18:19:36:WU01:FS00:0xa7:        CPU: AMD Ryzen 5 3600 6-Core Processor
18:19:36:WU01:FS00:0xa7:     CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
18:19:36:WU01:FS00:0xa7:       CPUs: 12
18:19:36:WU01:FS00:0xa7:     Memory: 7.93GiB
18:19:36:WU01:FS00:0xa7:Free Memory: 4.83GiB
18:19:36:WU01:FS00:0xa7:    Threads: WINDOWS_THREADS
18:19:36:WU01:FS00:0xa7: OS Version: 6.2
18:19:36:WU01:FS00:0xa7:Has Battery: false
18:19:36:WU01:FS00:0xa7: On Battery: false
18:19:36:WU01:FS00:0xa7: UTC Offset: -7
18:19:36:WU01:FS00:0xa7:        PID: 8412
18:19:36:WU01:FS00:0xa7:        CWD: C:\Users\gary\AppData\Roaming\FAHClient\work
18:19:36:WU01:FS00:0xa7:******************************** Build - libFAH ********************************
18:19:36:WU01:FS00:0xa7:    Version: 0.0.19
18:19:36:WU01:FS00:0xa7:     Author: Joseph Coffland <[email protected]>
18:19:36:WU01:FS00:0xa7:  Copyright: 2019 foldingathome.org
18:19:36:WU01:FS00:0xa7:   Homepage: https://foldingathome.org/
18:19:36:WU01:FS00:0xa7:       Date: Nov 25 2019
18:19:36:WU01:FS00:0xa7:       Time: 17:12:41
18:19:36:WU01:FS00:0xa7:   Revision: d5b5c747532224f986b7cd02c968ed9a20c16d6e
18:19:36:WU01:FS00:0xa7:     Branch: master
18:19:36:WU01:FS00:0xa7:   Compiler: Visual C++ 2008
18:19:36:WU01:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
18:19:36:WU01:FS00:0xa7:   Platform: win32 10
18:19:36:WU01:FS00:0xa7:       Bits: 64
18:19:36:WU01:FS00:0xa7:       Mode: Release
18:19:36:WU01:FS00:0xa7:************************************ Build *************************************
18:19:36:WU01:FS00:0xa7:       SIMD: avx_256
18:19:36:WU01:FS00:0xa7:********************************************************************************
18:19:36:WU01:FS00:0xa7:Project: 14520 (Run 0, Clone 752, Gen 267)
18:19:36:WU01:FS00:0xa7:Unit: 0x0000015280fccb0a5e34947c984bad4a
18:19:36:WU01:FS00:0xa7:Reading tar file core.xml
18:19:36:WU01:FS00:0xa7:Reading tar file frame267.tpr
18:19:36:WU01:FS00:0xa7:Digital signatures verified
18:19:36:WU01:FS00:0xa7:Calling: mdrun -s frame267.tpr -o frame267.trr -x frame267.xtc -cpt 15 -nt 10
18:19:36:WU01:FS00:0xa7:Steps: first=66750000 total=250000
18:19:36:WU01:FS00:0xa7:ERROR:
18:19:36:WU01:FS00:0xa7:ERROR:-------------------------------------------------------
18:19:36:WU01:FS00:0xa7:ERROR:Program GROMACS, VERSION 5.0.4-20191026-456f0d636-unknown
18:19:36:WU01:FS00:0xa7:ERROR:Source code file: C:\build\fah\core-a7-avx-release\windows-10-64bit-core-a7-avx-release\gromacs-core\build\gromacs\src\gromacs\mdlib\domdec.c, line: 6902
18:19:36:WU01:FS00:0xa7:ERROR:
18:19:36:WU01:FS00:0xa7:ERROR:Fatal error:
18:19:36:WU01:FS00:0xa7:ERROR:There is no domain decomposition for 10 ranks that is compatible with the given box and a minimum cell size of 1.46925 nm
18:19:36:WU01:FS00:0xa7:ERROR:Change the number of ranks or mdrun option -rcon or -dds or your LINCS settings
18:19:36:WU01:FS00:0xa7:ERROR:Look in the log file for details on the domain decomposition
18:19:36:WU01:FS00:0xa7:ERROR:For more information and tips for troubleshooting, please check the GROMACS
18:19:36:WU01:FS00:0xa7:ERROR:website at http://www.gromacs.org/Documentation/Errors
18:19:36:WU01:FS00:0xa7:ERROR:-------------------------------------------------------
18:19:41:WU01:FS00:0xa7:WARNING:Unexpected exit
18:19:41:WARNING:WU01:FS00:FahCore returned: EARLY_UNIT_END (123 = 0x7b)

...

18:21:41:WARNING:WU01:FS00:Too many errors, failing
18:21:41:WU01:FS00:Sending unit results: id:01 state:SEND error:FAILED project:14520 run:0 clone:752 gen:267 core:0xa7 unit:0x0000015280fccb0a5e34947c984bad4a

Re: Bad wu from project 14520?

Posted: Tue Sep 29, 2020 10:16 pm
by PantherX
Thanks for reporting that. I will be informing the Project Owner about this :)

Re: Bad wu from project 14520?

Posted: Wed Sep 30, 2020 5:04 am
by PantherX
FYI, I have received an update: changes were made on the server to prevent CPU:10 slots from being assigned that project. Thanks for your report!
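
For anyone wondering why 10 threads in particular trips the GROMACS error in the log, here is a rough sketch of the idea. Domain decomposition splits the simulation box into an nx × ny × nz grid of cells, one per rank, and each cell has to be at least the minimum cell size along every axis. The minimum cell size (1.46925 nm) comes from the error message; the box dimensions are NOT in the log, so the 6 nm cubic box below is purely a made-up value for illustration, and `feasible_grids` is a hypothetical helper, not anything from GROMACS or FAHClient. Real GROMACS applies further constraints that this simplified model ignores.

```python
from itertools import product

MIN_CELL_NM = 1.46925      # minimum cell size reported in the error message
BOX_NM = (6.0, 6.0, 6.0)   # ASSUMPTION: hypothetical box, not taken from the log


def feasible_grids(ranks, box=BOX_NM, min_cell=MIN_CELL_NM):
    """Return every (nx, ny, nz) with nx*ny*nz == ranks whose cells
    are at least min_cell wide along each axis (simplified model)."""
    # Largest number of cells that fit along each box edge.
    limits = [int(edge // min_cell) for edge in box]
    grids = []
    for nx, ny in product(range(1, limits[0] + 1), range(1, limits[1] + 1)):
        if ranks % (nx * ny) != 0:
            continue
        nz = ranks // (nx * ny)
        if nz <= limits[2]:
            grids.append((nx, ny, nz))
    return grids


if __name__ == "__main__":
    for ranks in (8, 9, 10, 12):
        grids = feasible_grids(ranks)
        status = "ok" if grids else "no decomposition"
        print(f"{ranks} ranks: {status}")
```

With this assumed box, at most 4 cells fit along each axis, so 10 (which needs a factor of 5 or 10 on some axis) has no valid grid, while 8, 9, and 12 do. That would be consistent with the server-side fix of keeping CPU:10 slots away from the project, though the actual box for project 14520 may differ.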

Re: Bad wu from project 14520?

Posted: Wed Sep 30, 2020 1:51 pm
by gs60
Thank you for all your help!