Page 2 of 4
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 7:44 pm
by bikeaddict
Keep in mind that Core 22 will eventually be discontinued and new projects will start using 23. Right now there are more tasks for Core 22 than for 23, but who knows when that will change.
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 7:48 pm
by muziqaz
bikeaddict wrote: ↑Sat Dec 09, 2023 7:44 pm
Keep in mind that Core 22 will eventually be discontinued and new projects will start using 23. Right now there are more tasks for Core 22 than for 23, but who knows when that will change.
Now that we realised that there are certain limitations, this will be heavily considered once the time comes.
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 8:56 pm
by verdeva
Would my CPU suffer from this condition? Same symptoms.
20:49:12:WU00:FS00:0xa8: CPU: AMD Ryzen 7 1700 Eight-Core Processor
20:49:12:WU00:FS00:0xa8: CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1
Are there any workarounds, I'm dead in the water on this machine.
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 9:09 pm
by muziqaz
verdeva wrote: ↑Sat Dec 09, 2023 8:56 pm
Would my CPU suffer from this condition? Same symptoms.
20:49:12:WU00:FS00:0xa8: CPU: AMD Ryzen 7 1700 Eight-Core Processor
20:49:12:WU00:FS00:0xa8: CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1
Are there any workarounds, I'm dead in the water on this machine.
Your CPU supports required instructions. So I guess this is not the issue then. WTF?
Please post the log snippet with the error (or when it is not working)
Full system info would be helpful too
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 9:26 pm
by verdeva
I split the CPU into two 4 core processors and one is now folding Project: 16977 (Run 86, Clone 244, Gen 592). Slot 0 has the following log snip:
Code: Select all
21:24:13:WU00:FS00:0xa8:********************************************************************************
21:24:13:WU00:FS00:0xa8:Project: 12405 (Run 112, Clone 6, Gen 2)
21:24:13:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
21:24:13:WU00:FS00:0xa8:Digital signatures verified
21:24:13:WU00:FS00:0xa8:Calling: mdrun -c frame2.gro -s frame2.tpr -x frame2.xtc -cpt 15 -nt 4 -ntmpi 1
21:24:13:WU00:FS00:0xa8:Steps: first=5000000 total=10000000
21:24:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:24:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:25:13:WU00:FS00:Starting
21:25:13:WU00:FS00:Removing old file 'work/00/logfile_01-20231209-205312.txt'
21:25:13:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 00 -suffix 01 -version 706 -lifeline 1012 -checkpoint 15 -np 4
21:25:13:WU00:FS00:Started FahCore on PID 71241
21:25:13:WU00:FS00:Core PID:71245
21:25:13:WU00:FS00:FahCore 0xa8 started
21:25:13:WU00:FS00:0xa8:*********************** Log Started 2023-12-09T21:25:13Z ***********************
21:25:13:WU00:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
21:25:13:WU00:FS00:0xa8: Core: Gromacs
21:25:13:WU00:FS00:0xa8: Type: 0xa8
21:25:13:WU00:FS00:0xa8: Version: 0.0.12
21:25:13:WU00:FS00:0xa8: Author: Joseph Coffland <[email protected]>
21:25:13:WU00:FS00:0xa8: Copyright: 2020 foldingathome.org
21:25:13:WU00:FS00:0xa8: Homepage: https://foldingathome.org/
21:25:13:WU00:FS00:0xa8: Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8: Time: 19:24:44
21:25:13:WU00:FS00:0xa8: Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8: Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8: -fdata-sections -O3 -funroll-loops -fno-pie
21:25:13:WU00:FS00:0xa8: Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8: Bits: 64
21:25:13:WU00:FS00:0xa8: Mode: Release
21:25:13:WU00:FS00:0xa8: SIMD: avx2_256
21:25:13:WU00:FS00:0xa8: OpenMP: ON
21:25:13:WU00:FS00:0xa8: CUDA: OFF
21:25:13:WU00:FS00:0xa8: Args: -dir 00 -suffix 01 -version 706 -lifeline 71241 -checkpoint 15 -np
21:25:13:WU00:FS00:0xa8: 4
21:25:13:WU00:FS00:0xa8:************************************ libFAH ************************************
21:25:13:WU00:FS00:0xa8: Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8: Time: 19:21:38
21:25:13:WU00:FS00:0xa8: Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8: Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8: -fdata-sections -O3 -funroll-loops -fno-pie
21:25:13:WU00:FS00:0xa8: Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8: Bits: 64
21:25:13:WU00:FS00:0xa8: Mode: Release
21:25:13:WU00:FS00:0xa8:************************************ CBang *************************************
21:25:13:WU00:FS00:0xa8: Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8: Time: 19:21:24
21:25:13:WU00:FS00:0xa8: Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8: Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8: -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
21:25:13:WU00:FS00:0xa8: Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8: Bits: 64
21:25:13:WU00:FS00:0xa8: Mode: Release
21:25:13:WU00:FS00:0xa8:************************************ System ************************************
21:25:13:WU00:FS00:0xa8: CPU: AMD Ryzen 7 1700 Eight-Core Processor
21:25:13:WU00:FS00:0xa8: CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1
21:25:13:WU00:FS00:0xa8: CPUs: 16
21:25:13:WU00:FS00:0xa8: Memory: 15.61GiB
21:25:13:WU00:FS00:0xa8:Free Memory: 478.75MiB
21:25:13:WU00:FS00:0xa8: Threads: POSIX_THREADS
21:25:13:WU00:FS00:0xa8: OS Version: 5.4
21:25:13:WU00:FS00:0xa8:Has Battery: false
21:25:13:WU00:FS00:0xa8: On Battery: false
21:25:13:WU00:FS00:0xa8: UTC Offset: -8
21:25:13:WU00:FS00:0xa8: PID: 71245
21:25:13:WU00:FS00:0xa8: CWD: /var/lib/fahclient/work
21:25:13:WU00:FS00:0xa8:********************************************************************************
21:25:13:WU00:FS00:0xa8:Project: 12405 (Run 112, Clone 6, Gen 2)
21:25:13:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
21:25:13:WU00:FS00:0xa8:Digital signatures verified
21:25:13:WU00:FS00:0xa8:Calling: mdrun -c frame2.gro -s frame2.tpr -x frame2.xtc -cpt 15 -nt 4 -ntmpi 1
21:25:13:WU00:FS00:0xa8:Steps: first=5000000 total=10000000
21:25:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:25:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 9:34 pm
by muziqaz
verdeva wrote: ↑Sat Dec 09, 2023 9:26 pm
I split the CPU into two 4 core processors and one is now folding Project: 16977 (Run 86, Clone 244, Gen 592). Slot 0 has the following log snip:
Code: Select all
21:24:13:WU00:FS00:0xa8:********************************************************************************
21:24:13:WU00:FS00:0xa8:Project: 12405 (Run 112, Clone 6, Gen 2)
21:24:13:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
21:24:13:WU00:FS00:0xa8:Digital signatures verified
21:24:13:WU00:FS00:0xa8:Calling: mdrun -c frame2.gro -s frame2.tpr -x frame2.xtc -cpt 15 -nt 4 -ntmpi 1
21:24:13:WU00:FS00:0xa8:Steps: first=5000000 total=10000000
21:24:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:24:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:25:13:WU00:FS00:Starting
21:25:13:WU00:FS00:Removing old file 'work/00/logfile_01-20231209-205312.txt'
21:25:13:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 00 -suffix 01 -version 706 -lifeline 1012 -checkpoint 15 -np 4
21:25:13:WU00:FS00:Started FahCore on PID 71241
21:25:13:WU00:FS00:Core PID:71245
21:25:13:WU00:FS00:FahCore 0xa8 started
21:25:13:WU00:FS00:0xa8:*********************** Log Started 2023-12-09T21:25:13Z ***********************
21:25:13:WU00:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
21:25:13:WU00:FS00:0xa8: Core: Gromacs
21:25:13:WU00:FS00:0xa8: Type: 0xa8
21:25:13:WU00:FS00:0xa8: Version: 0.0.12
21:25:13:WU00:FS00:0xa8: Author: Joseph Coffland <[email protected]>
21:25:13:WU00:FS00:0xa8: Copyright: 2020 foldingathome.org
21:25:13:WU00:FS00:0xa8: Homepage: https://foldingathome.org/
21:25:13:WU00:FS00:0xa8: Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8: Time: 19:24:44
21:25:13:WU00:FS00:0xa8: Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8: Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8: -fdata-sections -O3 -funroll-loops -fno-pie
21:25:13:WU00:FS00:0xa8: Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8: Bits: 64
21:25:13:WU00:FS00:0xa8: Mode: Release
21:25:13:WU00:FS00:0xa8: SIMD: avx2_256
21:25:13:WU00:FS00:0xa8: OpenMP: ON
21:25:13:WU00:FS00:0xa8: CUDA: OFF
21:25:13:WU00:FS00:0xa8: Args: -dir 00 -suffix 01 -version 706 -lifeline 71241 -checkpoint 15 -np
21:25:13:WU00:FS00:0xa8: 4
21:25:13:WU00:FS00:0xa8:************************************ libFAH ************************************
21:25:13:WU00:FS00:0xa8: Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8: Time: 19:21:38
21:25:13:WU00:FS00:0xa8: Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8: Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8: -fdata-sections -O3 -funroll-loops -fno-pie
21:25:13:WU00:FS00:0xa8: Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8: Bits: 64
21:25:13:WU00:FS00:0xa8: Mode: Release
21:25:13:WU00:FS00:0xa8:************************************ CBang *************************************
21:25:13:WU00:FS00:0xa8: Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8: Time: 19:21:24
21:25:13:WU00:FS00:0xa8: Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8: Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8: -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
21:25:13:WU00:FS00:0xa8: Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8: Bits: 64
21:25:13:WU00:FS00:0xa8: Mode: Release
21:25:13:WU00:FS00:0xa8:************************************ System ************************************
21:25:13:WU00:FS00:0xa8: CPU: AMD Ryzen 7 1700 Eight-Core Processor
21:25:13:WU00:FS00:0xa8: CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1
21:25:13:WU00:FS00:0xa8: CPUs: 16
21:25:13:WU00:FS00:0xa8: Memory: 15.61GiB
21:25:13:WU00:FS00:0xa8:Free Memory: 478.75MiB
21:25:13:WU00:FS00:0xa8: Threads: POSIX_THREADS
21:25:13:WU00:FS00:0xa8: OS Version: 5.4
21:25:13:WU00:FS00:0xa8:Has Battery: false
21:25:13:WU00:FS00:0xa8: On Battery: false
21:25:13:WU00:FS00:0xa8: UTC Offset: -8
21:25:13:WU00:FS00:0xa8: PID: 71245
21:25:13:WU00:FS00:0xa8: CWD: /var/lib/fahclient/work
21:25:13:WU00:FS00:0xa8:********************************************************************************
21:25:13:WU00:FS00:0xa8:Project: 12405 (Run 112, Clone 6, Gen 2)
21:25:13:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
21:25:13:WU00:FS00:0xa8:Digital signatures verified
21:25:13:WU00:FS00:0xa8:Calling: mdrun -c frame2.gro -s frame2.tpr -x frame2.xtc -cpt 15 -nt 4 -ntmpi 1
21:25:13:WU00:FS00:0xa8:Steps: first=5000000 total=10000000
21:25:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:25:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
Your log shows nothing failing, and it also shows no core_23 work done.
We need log showing where it is failing
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 9:37 pm
by verdeva
Here is a dump of my system info:
Code: Select all
System: Kernel: 5.4.0-167-generic x86_64 bits: 64 compiler: gcc v: 9.4.0
Desktop: Cinnamon 4.6.7 wm: muffin dm: LightDM Distro: Linux Mint 20 Ulyana
base: Ubuntu 20.04 focal
Machine: Type: Desktop Mobo: Micro-Star model: B450 TOMAHAWK MAX II (MS-7C02) v: 3.0
serial: <filter> UEFI: American Megatrends LLC. v: H.40 date: 02/03/2021
CPU: Topology: 8-Core model: AMD Ryzen 7 1700 bits: 64 type: MT MCP arch: Zen rev: 1
L2 cache: 4096 KiB
flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 95992
Speed: 3155 MHz min/max: N/A Core speeds (MHz): 1: 3155 2: 3200 3: 3200 4: 2563 5: 3200
6: 3200 7: 3160 8: 3200 9: 3012 10: 3200 11: 3268 12: 2581 13: 3200 14: 3208 15: 3014
16: 3200
Graphics: Device-1: NVIDIA GT218 [GeForce 210] vendor: ASUSTeK driver: nvidia v: 340.108
bus ID: 26:00.0 chip ID: 10de:0a65
Display: x11 server: X.Org 1.20.13 driver: nvidia
unloaded: fbdev,modesetting,nouveau,vesa resolution: 1920x1080~60Hz
OpenGL: renderer: GeForce 210/PCIe/SSE2 v: 3.3.0 NVIDIA 340.108 direct render: Yes
Audio: Device-1: NVIDIA High Definition Audio vendor: ASUSTeK driver: snd_hda_intel v: kernel
bus ID: 26:00.1 chip ID: 10de:0be3
Device-2: AMD Family 17h HD Audio vendor: Micro-Star MSI driver: snd_hda_intel
v: kernel bus ID: 28:00.3 chip ID: 1022:1457
Sound Server: ALSA v: k5.4.0-167-generic
Network: Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: Micro-Star MSI
driver: r8169 v: kernel port: f000 bus ID: 22:00.0 chip ID: 10ec:8168
IF: enp34s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
Drives: Local Storage: total: 1.05 TiB used: 118.35 GiB (11.0%)
ID-1: /dev/nvme0n1 vendor: Intel model: SSDPEKNW010T9 size: 953.87 GiB speed: 31.6 Gb/s
lanes: 4 serial: <filter>
ID-2: /dev/sda vendor: Samsung model: SSD 840 PRO Series size: 119.24 GiB
speed: 6.0 Gb/s serial: <filter>
Partition: ID-1: / size: 937.33 GiB used: 118.34 GiB (12.6%) fs: ext4 dev: /dev/nvme0n1p2
USB: Hub: 1-0:1 info: Full speed (or root) Hub ports: 10 rev: 2.0 chip ID: 1d6b:0002
Device-1: 1-9:2 info: Chicony USB Optical Mouse type: Mouse driver: hid-generic,usbhid
rev: 2.0 chip ID: 04f2:0939
Hub: 2-0:1 info: Full speed (or root) Hub ports: 4 rev: 3.1 chip ID: 1d6b:0003
Hub: 3-0:1 info: Full speed (or root) Hub ports: 4 rev: 2.0 chip ID: 1d6b:0002
Device-2: 3-2:2 info: Chicony KU-0833 Keyboard type: Keyboard,HID
driver: hid-generic,usbhid rev: 2.0 chip ID: 04f2:0833
Hub: 4-0:1 info: Full speed (or root) Hub ports: 4 rev: 3.0 chip ID: 1d6b:0003
Sensors: System Temperatures: cpu: 43.9 C mobo: N/A gpu: nvidia temp: 45 C
Fan Speeds (RPM): N/A
Repos: No active apt repos in: /etc/apt/sources.list
Active apt repos in: /etc/apt/sources.list.d/official-package-repositories.list
1: deb http: //packages.linuxmint.com ulyana main upstream import backport
2: deb http: //archive.ubuntu.com/ubuntu focal main restricted universe multiverse
3: deb http: //archive.ubuntu.com/ubuntu focal-updates main restricted universe multiverse
4: deb http: //archive.ubuntu.com/ubuntu focal-backports main restricted universe multiverse
5: deb http: //security.ubuntu.com/ubuntu/ focal-security main restricted universe multiverse
6: deb http: //archive.canonical.com/ubuntu/ focal partner
Info: Processes: 356 Uptime: 1h 06m Memory: 15.61 GiB used: 2.68 GiB (17.1%) Init: systemd
v: 245 runlevel: 5 Compilers: gcc: 9.4.0 alt: 9 Client: Unknown python3.8 client
inxi: 3.0.38
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 9:42 pm
by muziqaz
verdeva wrote: ↑Sat Dec 09, 2023 9:37 pm
Here is a dump of my system info:
Code: Select all
System: Kernel: 5.4.0-167-generic x86_64 bits: 64 compiler: gcc v: 9.4.0
Desktop: Cinnamon 4.6.7 wm: muffin dm: LightDM Distro: Linux Mint 20 Ulyana
base: Ubuntu 20.04 focal
Machine: Type: Desktop Mobo: Micro-Star model: B450 TOMAHAWK MAX II (MS-7C02) v: 3.0
serial: <filter> UEFI: American Megatrends LLC. v: H.40 date: 02/03/2021
CPU: Topology: 8-Core model: AMD Ryzen 7 1700 bits: 64 type: MT MCP arch: Zen rev: 1
L2 cache: 4096 KiB
flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 95992
Speed: 3155 MHz min/max: N/A Core speeds (MHz): 1: 3155 2: 3200 3: 3200 4: 2563 5: 3200
6: 3200 7: 3160 8: 3200 9: 3012 10: 3200 11: 3268 12: 2581 13: 3200 14: 3208 15: 3014
16: 3200
Graphics: Device-1: NVIDIA GT218 [GeForce 210] vendor: ASUSTeK driver: nvidia v: 340.108
bus ID: 26:00.0 chip ID: 10de:0a65
Display: x11 server: X.Org 1.20.13 driver: nvidia
unloaded: fbdev,modesetting,nouveau,vesa resolution: 1920x1080~60Hz
OpenGL: renderer: GeForce 210/PCIe/SSE2 v: 3.3.0 NVIDIA 340.108 direct render: Yes
Audio: Device-1: NVIDIA High Definition Audio vendor: ASUSTeK driver: snd_hda_intel v: kernel
bus ID: 26:00.1 chip ID: 10de:0be3
Device-2: AMD Family 17h HD Audio vendor: Micro-Star MSI driver: snd_hda_intel
v: kernel bus ID: 28:00.3 chip ID: 1022:1457
Sound Server: ALSA v: k5.4.0-167-generic
Network: Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: Micro-Star MSI
driver: r8169 v: kernel port: f000 bus ID: 22:00.0 chip ID: 10ec:8168
IF: enp34s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
Drives: Local Storage: total: 1.05 TiB used: 118.35 GiB (11.0%)
ID-1: /dev/nvme0n1 vendor: Intel model: SSDPEKNW010T9 size: 953.87 GiB speed: 31.6 Gb/s
lanes: 4 serial: <filter>
ID-2: /dev/sda vendor: Samsung model: SSD 840 PRO Series size: 119.24 GiB
speed: 6.0 Gb/s serial: <filter>
Partition: ID-1: / size: 937.33 GiB used: 118.34 GiB (12.6%) fs: ext4 dev: /dev/nvme0n1p2
USB: Hub: 1-0:1 info: Full speed (or root) Hub ports: 10 rev: 2.0 chip ID: 1d6b:0002
Device-1: 1-9:2 info: Chicony USB Optical Mouse type: Mouse driver: hid-generic,usbhid
rev: 2.0 chip ID: 04f2:0939
Hub: 2-0:1 info: Full speed (or root) Hub ports: 4 rev: 3.1 chip ID: 1d6b:0003
Hub: 3-0:1 info: Full speed (or root) Hub ports: 4 rev: 2.0 chip ID: 1d6b:0002
Device-2: 3-2:2 info: Chicony KU-0833 Keyboard type: Keyboard,HID
driver: hid-generic,usbhid rev: 2.0 chip ID: 04f2:0833
Hub: 4-0:1 info: Full speed (or root) Hub ports: 4 rev: 3.0 chip ID: 1d6b:0003
Sensors: System Temperatures: cpu: 43.9 C mobo: N/A gpu: nvidia temp: 45 C
Fan Speeds (RPM): N/A
Repos: No active apt repos in: /etc/apt/sources.list
Active apt repos in: /etc/apt/sources.list.d/official-package-repositories.list
1: deb http: //packages.linuxmint.com ulyana main upstream import backport
2: deb http: //archive.ubuntu.com/ubuntu focal main restricted universe multiverse
3: deb http: //archive.ubuntu.com/ubuntu focal-updates main restricted universe multiverse
4: deb http: //archive.ubuntu.com/ubuntu focal-backports main restricted universe multiverse
5: deb http: //security.ubuntu.com/ubuntu/ focal-security main restricted universe multiverse
6: deb http: //archive.canonical.com/ubuntu/ focal partner
Info: Processes: 356 Uptime: 1h 06m Memory: 15.61 GiB used: 2.68 GiB (17.1%) Init: systemd
v: 245 runlevel: 5 Compilers: gcc: 9.4.0 alt: 9 Client: Unknown python3.8 client
inxi: 3.0.38
I doubt your issue is related to this topic. Your GPU cannot get core_23 WUs, as it is too old and too slow. Please, create another thread detailing what is failing and where, with related fahlogs.
Thanks
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 9:49 pm
by bikeaddict
This a CPU task on Core A8 crashing for verdeva. Some kind of crash may be causing INTERRUPTED (102 = 0x66). Check output of
for any crashes.
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 9:50 pm
by verdeva
Forgive me, im an old dial-up guy! I cannot get past the Interrupted log line, same as the first poster. Just loops for hours.
Note: This is a PCU folding unit. Don't assume it's on GPU.
21:40:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:40:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
Probably don't understand the question.
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 10:04 pm
by verdeva
Executing journalctl -p err produces the following (just the last block copied) No crashing indicated.
-- Logs begin at Tue 2021-07-06 05:55:01 PDT, end at Sat 2023-12-09 13:54:51 PST. --
Jul 06 07:15:13 dave-AB350M-HD3 systemd-coredump[1422459]: Process 1412949 (FahCore_a8) of user 126 dumped core.
Stack trace of thread 1412954:
#0 0x0000000000a02687 n/a (FahCore_a8 + 0x602687)
#1 0x00000000008c49cd n/a (FahCore_a8 + 0x4c49cd)
#2 0x0000000000ef4b5e n/a (FahCore_a8 + 0xaf4b5e)
#3 0x00007f2a9b738609 start_thread (libpthread.so.0 + 0x9609)
#4 0x00007f2a9b50a293 __clone (libc.so.6 + 0x122293)
Stack trace of thread 1412956:
#0 0x0000000000a03c0d n/a (FahCore_a8 + 0x603c0d)
#1 0x00000000008c49cd n/a (FahCore_a8 + 0x4c49cd)
#2 0x0000000000ef4b5e n/a (FahCore_a8 + 0xaf4b5e)
#3 0x00007f2a9b738609 start_thread (libpthread.so.0 + 0x9609)
#4 0x00007f2a9b50a293 __clone (libc.so.6 + 0x122293)
Stack trace of thread 1412961:
#0 0x0000000000a03cb2 n/a (FahCore_a8 + 0x603cb2)
#1 0x00000000008c49cd n/a (FahCore_a8 + 0x4c49cd)
#2 0x0000000000ef4b5e n/a (FahCore_a8 + 0xaf4b5e)
#3 0x00007f2a9b738609 start_thread (libpthread.so.0 + 0x9609)
#4 0x00007f2a9b50a293 __clone (libc.so.6 + 0x122293)
lines 1-23
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 10:35 pm
by muziqaz
Your OS is killing fahcore process for some reason. Unstable hardware would be one of the reasons.
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 10:38 pm
by muziqaz
Ah, you are out of RAM. It says 478something MB of memory free
I am not sure why Fahclient thinks you only have so little memory left :/
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 10:50 pm
by muziqaz
wdanwatts wrote: ↑Fri Dec 01, 2023 4:11 am
When I run the software for any length of time, it gives me a core 23 project and then I get this:
Code: Select all
03:56:50:WU00:FS00:Starting
03:56:50:WU00:FS00:Removing old file 'work/00/logfile_01-20231201-032446.txt'
03:56:50:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/openmm-core-23/centos-7.9.2009-64bit/release/0x23-8.0.3/Core_23.fah/FahCore_23 -dir 00 -suffix 01 -version 706 -lifeline 1612 -checkpoint 30 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
03:56:50:WU00:FS00:Started FahCore on PID 40865
03:56:50:WU00:FS00:Core PID:40869
03:56:50:WU00:FS00:FahCore 0x23 started
03:56:51:WU00:FS00:0x23:*********************** Log Started 2023-12-01T03:56:50Z ***********************
03:56:51:WU00:FS00:0x23:*************************** Core23 Folding@home Core ***************************
03:56:51:WU00:FS00:0x23: Core: Core23
03:56:51:WU00:FS00:0x23: Type: 0x23
03:56:51:WU00:FS00:0x23: Version: 8.0.3
03:56:51:WU00:FS00:0x23: Author: Joseph Coffland <[email protected]>
03:56:51:WU00:FS00:0x23: Copyright: 2022 foldingathome.org
03:56:51:WU00:FS00:0x23: Homepage: https://foldingathome.org/
03:56:51:WU00:FS00:0x23: Date: Aug 3 2023
03:56:51:WU00:FS00:0x23: Time: 08:28:22
03:56:51:WU00:FS00:0x23: Revision: 199cb870317d05441d0a301287d9ef61254fa32b
03:56:51:WU00:FS00:0x23: Branch: HEAD
03:56:51:WU00:FS00:0x23: Compiler: GNU 7.5.0
03:56:51:WU00:FS00:0x23: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
03:56:51:WU00:FS00:0x23: -fdata-sections -O3 -funroll-loops -fno-pie
03:56:51:WU00:FS00:0x23: -DOPENMM_VERSION="\"8.0.0\""
03:56:51:WU00:FS00:0x23: Platform: linux 5.15.0-1041-azure
03:56:51:WU00:FS00:0x23: Bits: 64
03:56:51:WU00:FS00:0x23: Mode: Release
03:56:51:WU00:FS00:0x23:Maintainers: John Chodera <[email protected]> and Peter Eastman
03:56:51:WU00:FS00:0x23: <[email protected]>
03:56:51:WU00:FS00:0x23: Args: -dir 00 -suffix 01 -version 706 -lifeline 40865 -checkpoint 30
03:56:51:WU00:FS00:0x23: -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
03:56:51:WU00:FS00:0x23: 0 -gpu 0
03:56:51:WU00:FS00:0x23:************************************ libFAH ************************************
03:56:51:WU00:FS00:0x23: Date: Aug 3 2023
03:56:51:WU00:FS00:0x23: Time: 08:27:48
03:56:51:WU00:FS00:0x23: Revision: 112c2234abe20611a05652defc3c7f854cbf927f
03:56:51:WU00:FS00:0x23: Branch: HEAD
03:56:51:WU00:FS00:0x23: Compiler: GNU 7.5.0
03:56:51:WU00:FS00:0x23: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
03:56:51:WU00:FS00:0x23: -fdata-sections -O3 -funroll-loops -fno-pie
03:56:51:WU00:FS00:0x23: Platform: linux 5.15.0-1041-azure
03:56:51:WU00:FS00:0x23: Bits: 64
03:56:51:WU00:FS00:0x23: Mode: Release
03:56:51:WU00:FS00:0x23:************************************ CBang *************************************
03:56:51:WU00:FS00:0x23: Version: 1.7.2
03:56:51:WU00:FS00:0x23: Author: Joseph Coffland <[email protected]>
03:56:51:WU00:FS00:0x23: Org: Cauldron Development LLC
03:56:51:WU00:FS00:0x23: Copyright: Cauldron Development LLC, 2003-2023
03:56:51:WU00:FS00:0x23: Homepage: https://cauldrondevelopment.com/
03:56:51:WU00:FS00:0x23: License: GPL 2+
03:56:51:WU00:FS00:0x23: Date: Aug 3 2023
03:56:51:WU00:FS00:0x23: Time: 08:27:30
03:56:51:WU00:FS00:0x23: Revision: eae4b58965bdd4d54ea9eb77972674352b37a547
03:56:51:WU00:FS00:0x23: Branch: HEAD
03:56:51:WU00:FS00:0x23: Compiler: GNU 7.5.0
03:56:51:WU00:FS00:0x23: Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
03:56:51:WU00:FS00:0x23: -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
03:56:51:WU00:FS00:0x23: Platform: linux 5.15.0-1041-azure
03:56:51:WU00:FS00:0x23: Bits: 64
03:56:51:WU00:FS00:0x23: Mode: Release
03:56:51:WU00:FS00:0x23:************************************ System ************************************
03:56:51:WU00:FS00:0x23: CPU: AMD Phenom(tm) II X2 545 Processor
03:56:51:WU00:FS00:0x23: CPU ID: AuthenticAMD Family 16 Model 4 Stepping 2
03:56:51:WU00:FS00:0x23: CPUs: 2
03:56:51:WU00:FS00:0x23: Memory: 3.81GiB
03:56:51:WU00:FS00:0x23:Free Memory: 744.96MiB
03:56:51:WU00:FS00:0x23: Threads: POSIX_THREADS
03:56:51:WU00:FS00:0x23: OS Version: 6.5
03:56:51:WU00:FS00:0x23:Has Battery: false
03:56:51:WU00:FS00:0x23: On Battery: false
03:56:51:WU00:FS00:0x23: UTC Offset: -6
03:56:51:WU00:FS00:0x23: PID: 40869
03:56:51:WU00:FS00:0x23: CWD: /var/lib/fahclient/work
03:56:51:WU00:FS00:0x23: Exec: /var/lib/fahclient/cores/cores.foldingathome.org/openmm-core-23/centos-7.9.2009-64bit/release/0x23-8.0.3/Core_23.fah/FahCore_23
03:56:51:WU00:FS00:0x23:************************************ OpenMM ************************************
03:56:51:WU00:FS00:0x23: Version: 8.0.0
03:56:51:WU00:FS00:0x23:********************************************************************************
03:56:51:WU00:FS00:0x23:Project: 12248 (Run 0, Clone 105, Gen 10)
03:56:51:WU00:FS00:0x23:Digital signatures verified
03:56:51:WU00:FS00:0x23:Folding@home GPU Core23 Folding@home Core
03:56:51:WU00:FS00:0x23:Version 8.0.3
03:56:51:WU00:FS00:0x23: Checkpoint write interval: 50000 steps (2%) [50 total]
03:56:51:WU00:FS00:0x23: JSON viewer frame write interval: 25000 steps (1%) [100 total]
03:56:51:WU00:FS00:0x23: XTC frame write interval: 25000 steps (1%) [100 total]
03:56:51:WU00:FS00:0x23: Global context and integrator variables write interval: disabled
03:56:51:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
It continually cycles. I can stop this behavior by removing the GPU slot and re-entering it. That lasts until another core 23 job is assigned.
I'm runnung Fedora Linux 39 (Workstation Edition) on an AMD Phenom™ II X2 545 × 2 with a NVIDIA GeForce GTX 1660 SUPER GPU. It had been running ~ 1 million 'points' per day.
Running the command suggested in another thread
Code: Select all
./var/lib/fahclient/cores/cores.foldingathome.org/openmm-core-23/centos-7.9.2009-64bit/release/0x23-8.0.3/Core_23.fah/FahCore_23 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
gets a reply
Code: Select all
error while loading shared libraries: libOpenMM.so.8.0: cannot open shared object file: No such file or directory
even though the 'offending' library is in that folder, and removing the Core_23 directory (which causes a new one to be rebuilt) causes no different behavior.
Is this core cursed, or does Fedora 39 have a folding problem?
In those logs, the next step after
Code: Select all
Global context and integrator variables write interval: disabled
would be this:
Code: Select all
22:43:04:WU00:FS02:0x23:There are 3 platforms available.
22:43:04:WU00:FS02:0x23:Platform 0: Reference
22:43:04:WU00:FS02:0x23:Platform 1: CPU
22:43:04:WU00:FS02:0x23:Platform 2: OpenCL
22:43:04:WU00:FS02:0x23: opencl-device 0 specified
22:43:06:WU00:FS02:0x23:Attempting to create OpenCL context:
22:43:06:WU00:FS02:0x23: Configuring platform OpenCL
22:43:10:WU00:FS02:0x23: Using OpenCL on OpenCL platformId 0 and gpu 0
22:43:10:WU00:FS02:0x23: GPU info: Platform: OpenCL: AMD Accelerated Parallel Processing
22:43:10:WU00:FS02:0x23: GPU info: PlatformIndex: 0
22:43:10:WU00:FS02:0x23: GPU info: Device: gfx1100
22:43:10:WU00:FS02:0x23: GPU info: DeviceIndex: 0
22:43:10:WU00:FS02:0x23: GPU info: Vendor: 0x1002
22:43:10:WU00:FS02:0x23: GPU info: PCI: 03:00:00
22:43:10:WU00:FS02:0x23: GPU info: Compute: 2.0
22:43:10:WU00:FS02:0x23: GPU info: Driver: 3592.0
22:43:10:WU00:FS02:0x23: GPU info: GPU: true
22:43:10:WU00:FS02:0x23:Completed 0 out of 2500000 steps (0%)
Obviously yours would be:
Platform 0: Reference
Platform 1: CPU
Platform 2: CUDA
Platform 3: OpenCL
what does clinfo and CUDA equivalent return? Though Fahclient would cry if none of the required platforms/devices were seen/available.
Is this on v7 of fahclient, or v8?
Re: FahCore 23 broken on Fedora 39
Posted: Sat Dec 09, 2023 11:32 pm
by verdeva
Unstable HW?
Running two 4 core processor slots 0/1 with folding units: 12405 (Run 112, Clone 6, Gen 2) and 16977 (Run 86, Clone 244, Gen 592) respectively.
Slot 0 stops instantly at step 0, with the interrupted message until FahCore Run attempts again.
Slot 1 is at 24%. No issues
Are there unique HW req to run unit 12405? No stability issues have been noted except for this interrupted message. My system HW and settings are unchanged for over a year,
Just read your post about free memory. System monitor reports using 12% of 15.6 for over 13 Gb free. Fah control logs show free mem 475 and 911 Mb for slots 0/1 respectively. How can Fah see less memory?
Your help is really appreciated, Thanks Dave