I've been having some trouble with this core, or possibly with my setup. I'm new to folding: I only really started last weekend, after I got sick of my GPU sitting idle. My setup may be unconventional, but I'm happy to debug however I can; I'm just not too familiar with tracking down issues like this yet.
I'm running Ubuntu 19.10 with an RTX 2080 Super, the 440.59 drivers, and CUDA 10.2. The client runs in Docker (nvidia-container-runtime), which might be complicating things. I've had pretty good luck running other GPU cores for the past few days, but this afternoon Core22 started erroring out. Every time this work unit starts, I see an immediate "FahCore returned: INTERRUPTED (102 = 0x66)". None of the following helped: pausing and restarting, restarting the container, rebooting the PC to be sure, or clearing out the client's `cores/` and `work/` folders and restarting.
Any advice on how to track down what that error code means and where it's coming from?
Full logs for that core, in isolation:
00:51:18:WU01:FS01:Starting
00:51:18:WU01:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/opt/folding/cores/cores.foldingathome.org/v7/lin/64bit/Core_22.fah/FahCore_22 -dir 01 -suffix 01 -version 705 -lifeline 8 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
00:51:18:WU01:FS01:Started FahCore on PID 70
00:51:18:WU01:FS01:Core PID:74
00:51:18:WU01:FS01:FahCore 0x22 started
00:51:18:WU01:FS01:0x22:*********************** Log Started 2020-03-11T00:51:18Z ***********************
00:51:18:WU01:FS01:0x22:*************************** Core22 Folding@home Core ***************************
00:51:18:WU01:FS01:0x22: Type: 0x22
00:51:18:WU01:FS01:0x22: Core: Core22
00:51:18:WU01:FS01:0x22: Website: https://foldingathome.org/
00:51:18:WU01:FS01:0x22: Copyright: (c) 2009-2018 foldingathome.org
00:51:18:WU01:FS01:0x22: Author: John Chodera <[email protected]> and Rafal Wiewiora
00:51:18:WU01:FS01:0x22: <[email protected]>
00:51:18:WU01:FS01:0x22: Args: -dir 01 -suffix 01 -version 705 -lifeline 70 -checkpoint 15
00:51:18:WU01:FS01:0x22: -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
00:51:18:WU01:FS01:0x22: 0 -gpu 0
00:51:18:WU01:FS01:0x22: Config: <none>
00:51:18:WU01:FS01:0x22:************************************ Build *************************************
00:51:18:WU01:FS01:0x22: Version: 0.0.2
00:51:18:WU01:FS01:0x22: Date: Dec 6 2019
00:51:18:WU01:FS01:0x22: Time: 21:20:17
00:51:18:WU01:FS01:0x22: Repository: Git
00:51:18:WU01:FS01:0x22: Revision: f87d92b58abdf7e6bf2e173cfbc4dc3e837c7042
00:51:18:WU01:FS01:0x22: Branch: core22
00:51:18:WU01:FS01:0x22: Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
00:51:18:WU01:FS01:0x22: Options: -std=gnu++98 -O3 -funroll-loops
00:51:18:WU01:FS01:0x22: Platform: linux2 4.9.87-linuxkit-aufs
00:51:18:WU01:FS01:0x22: Bits: 64
00:51:18:WU01:FS01:0x22: Mode: Release
00:51:18:WU01:FS01:0x22:************************************ System ************************************
00:51:18:WU01:FS01:0x22: CPU: Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz
00:51:18:WU01:FS01:0x22: CPU ID: GenuineIntel Family 6 Model 158 Stepping 12
00:51:18:WU01:FS01:0x22: CPUs: 16
00:51:18:WU01:FS01:0x22: Memory: 31.30GiB
00:51:18:WU01:FS01:0x22:Free Memory: 15.37GiB
00:51:18:WU01:FS01:0x22: Threads: POSIX_THREADS
00:51:18:WU01:FS01:0x22: OS Version: 5.5
00:51:18:WU01:FS01:0x22:Has Battery: false
00:51:18:WU01:FS01:0x22: On Battery: false
00:51:18:WU01:FS01:0x22: UTC Offset: 0
00:51:18:WU01:FS01:0x22: PID: 74
00:51:18:WU01:FS01:0x22: CWD: /var/opt/folding/work
00:51:18:WU01:FS01:0x22: OS: Linux 5.5.5-050505-generic x86_64
00:51:18:WU01:FS01:0x22: OS Arch: AMD64
00:51:18:WU01:FS01:0x22:********************************************************************************
00:51:18:WU01:FS01:0x22:Project: 11741 (Run 0, Clone 2360, Gen 1)
00:51:18:WU01:FS01:0x22:Unit: 0x000000018ca304f15e67d8cb67bdf2b9
00:51:18:WU01:FS01:0x22:Reading tar file core.xml
00:51:18:WU01:FS01:0x22:Reading tar file integrator.xml
00:51:18:WU01:FS01:0x22:Reading tar file state.xml
00:51:18:WU01:FS01:0x22:Reading tar file system.xml
00:51:19:WU01:FS01:0x22:Digital signatures verified
00:51:19:WU01:FS01:0x22:Folding@home GPU Core22 Folding@home Core
00:51:19:WU01:FS01:0x22:Version 0.0.2
00:51:19:85:127.0.0.1:New Web connection
00:51:21:WU01:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
Plus the logs from startup:
00:50:15:INFO(1):Read GPUs.txt
00:50:15:Removing old file 'logs/log-20200306-023030.txt'
00:50:15:************************* Folding@home Client *************************
00:50:15: Website: https://foldingathome.org/
00:50:15: Copyright: (c) 2009-2018 foldingathome.org
00:50:15: Author: Joseph Coffland <[email protected]>
00:50:15: Args: --web-allow=0/0 --allow=0/0 --cpu-usage=35 --session-lifetime=0
00:50:15: --session-timeout=0 --command-enable=true
00:50:15: --command-address=0.0.0.0 --command-allow-no-pass=0/0
00:50:15: --command-port=36330
00:50:15: Config: /var/opt/folding/config.xml
00:50:15:******************************** Build ********************************
00:50:15: Version: 7.5.1
00:50:15: Date: May 11 2018
00:50:15: Time: 19:59:04
00:50:15: Repository: Git
00:50:15: Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
00:50:15: Branch: master
00:50:15: Compiler: GNU 6.3.0 20170516
00:50:15: Options: -std=gnu++98 -O3 -funroll-loops
00:50:15: Platform: linux2 4.14.0-3-amd64
00:50:15: Bits: 64
00:50:15: Mode: Release
00:50:15:******************************* System ********************************
00:50:15: CPU: Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz
00:50:15: CPU ID: GenuineIntel Family 6 Model 158 Stepping 12
00:50:15: CPUs: 16
00:50:15: Memory: 31.30GiB
00:50:15: Free Memory: 19.51GiB
00:50:15: Threads: POSIX_THREADS
00:50:15: OS Version: 5.5
00:50:15: Has Battery: false
00:50:15: On Battery: false
00:50:15: UTC Offset: 0
00:50:15: PID: 8
00:50:15: CWD: /var/opt/folding
00:50:15: OS: Linux 5.5.5-050505-generic x86_64
00:50:15: OS Arch: AMD64
00:50:15: GPUs: 1
00:50:15: GPU 0: Bus:1 Slot:0 Func:0 NVIDIA:8 TU104 [GeForce RTX 2080 Super]
00:50:15: CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:7.5 Driver:10.2
00:50:15:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:440.59
00:50:15:***********************************************************************
00:50:15:<config>
00:50:15: <!-- Slot Control -->
00:50:15: <power v='MEDIUM'/>
00:50:15:
00:50:15: <!-- User Information -->
00:50:15: <passkey v='********************************'/>
00:50:15: <team v='224497'/>
00:50:15: <user v='iavas_ALL_1HGuzc3yMQT2gABNc8Q6B1eaSWY934J55i'/>
00:50:15:
00:50:15: <!-- Folding Slots -->
00:50:15: <slot id='0' type='CPU'/>
00:50:15: <slot id='1' type='GPU'/>
00:50:15:</config>
00:50:15:Trying to access database...
00:50:15:Successfully acquired database lock
00:50:15:Enabled folding slot 00: READY cpu:14
00:50:15:Enabled folding slot 01: READY gpu:0:TU104 [GeForce RTX 2080 Super]
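In case it helps anyone narrow this down, here is a minimal Python sketch for pulling every "FahCore returned" line out of a client log, so it is easy to see whether the failure is tied to one work unit or slot. The regular expression is based only on the line format shown in the logs above; the names `RETURN_RE` and `core_returns` are my own and are not part of the FAH client.

```python
import re

# Pattern for lines such as:
#   00:51:21:WU01:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
# Derived from the log excerpt in this post, not from any official log spec.
RETURN_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):WU(?P<wu>\d+):FS(?P<slot>\d+):"
    r"FahCore returned: (?P<name>\w+) "
    r"\((?P<dec>\d+) = 0x(?P<hex>[0-9a-fA-F]+)\)"
)


def core_returns(lines):
    """Return (time, work unit, slot, status name, code) for each core exit."""
    results = []
    for line in lines:
        match = RETURN_RE.match(line.strip())
        if match is None:
            continue
        code = int(match.group("dec"))
        # The log prints the code in decimal and hex; skip lines where
        # the two disagree, since that suggests a mangled line.
        if code != int(match.group("hex"), 16):
            continue
        results.append(
            (
                match.group("time"),
                match.group("wu"),
                match.group("slot"),
                match.group("name"),
                code,
            )
        )
    return results


# Two lines copied from the log above.
sample = [
    "00:51:19:WU01:FS01:0x22:Version 0.0.2",
    "00:51:21:WU01:FS01:FahCore returned: INTERRUPTED (102 = 0x66)",
]

for entry in core_returns(sample):
    print(entry)
```

To run it against a real log, pass an open file instead of `sample`, for example `core_returns(open("log.txt"))`; the file name here is an assumption about where the client writes its current log.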