FahCore 23 broken on Fedora 39

If you think it might be a driver problem, see viewforum.php?f=79

Moderators: Site Moderators, FAHC Science Team

bikeaddict
Posts: 210
Joined: Sun May 03, 2020 1:20 am

Re: FahCore 23 broken on Fedora 39

Post by bikeaddict »

Keep in mind that Core 22 will eventually be discontinued and new projects will start using 23. Right now there are more tasks for Core 22 than for 23, but who knows when that will change.
muziqaz
Posts: 946
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 7950x3D, 5950x, 5800x3D, 3900x
7900xtx, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: FahCore 23 broken on Fedora 39

Post by muziqaz »

bikeaddict wrote: Sat Dec 09, 2023 7:44 pm Keep in mind that Core 22 will eventually be discontinued and new projects will start using 23. Right now there are more tasks for Core 22 than for 23, but who knows when that will change.
Now that we realised that there are certain limitations, this will be heavily considered once the time comes.
FAH Omega tester
verdeva
Posts: 30
Joined: Mon Dec 03, 2007 1:40 pm
Location: Seattle, WA

Re: FahCore 23 broken on Fedora 39

Post by verdeva »

Would my CPU suffer from this condition? Same symptoms.

20:49:12:WU00:FS00:0xa8: CPU: AMD Ryzen 7 1700 Eight-Core Processor
20:49:12:WU00:FS00:0xa8: CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1

Are there any workarounds, I'm dead in the water on this machine.
muziqaz
Posts: 946
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 7950x3D, 5950x, 5800x3D, 3900x
7900xtx, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: FahCore 23 broken on Fedora 39

Post by muziqaz »

verdeva wrote: Sat Dec 09, 2023 8:56 pm Would my CPU suffer from this condition? Same symptoms.

20:49:12:WU00:FS00:0xa8: CPU: AMD Ryzen 7 1700 Eight-Core Processor
20:49:12:WU00:FS00:0xa8: CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1

Are there any workarounds, I'm dead in the water on this machine.
Your CPU supports required instructions. So I guess this is not the issue then. WTF?

Please post the log snippet with the error (or when it is not working)

Full system info would be helpful too
FAH Omega tester
verdeva
Posts: 30
Joined: Mon Dec 03, 2007 1:40 pm
Location: Seattle, WA

Re: FahCore 23 broken on Fedora 39

Post by verdeva »

I split the CPU into two 4 core processors and one is now folding Project: 16977 (Run 86, Clone 244, Gen 592). Slot 0 has the following log snip:

Code: Select all

21:24:13:WU00:FS00:0xa8:********************************************************************************
21:24:13:WU00:FS00:0xa8:Project: 12405 (Run 112, Clone 6, Gen 2)
21:24:13:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
21:24:13:WU00:FS00:0xa8:Digital signatures verified
21:24:13:WU00:FS00:0xa8:Calling: mdrun -c frame2.gro -s frame2.tpr -x frame2.xtc -cpt 15 -nt 4 -ntmpi 1
21:24:13:WU00:FS00:0xa8:Steps: first=5000000 total=10000000
21:24:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:24:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:25:13:WU00:FS00:Starting
21:25:13:WU00:FS00:Removing old file 'work/00/logfile_01-20231209-205312.txt'
21:25:13:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 00 -suffix 01 -version 706 -lifeline 1012 -checkpoint 15 -np 4
21:25:13:WU00:FS00:Started FahCore on PID 71241
21:25:13:WU00:FS00:Core PID:71245
21:25:13:WU00:FS00:FahCore 0xa8 started
21:25:13:WU00:FS00:0xa8:*********************** Log Started 2023-12-09T21:25:13Z ***********************
21:25:13:WU00:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
21:25:13:WU00:FS00:0xa8:       Core: Gromacs
21:25:13:WU00:FS00:0xa8:       Type: 0xa8
21:25:13:WU00:FS00:0xa8:    Version: 0.0.12
21:25:13:WU00:FS00:0xa8:     Author: Joseph Coffland <[email protected]>
21:25:13:WU00:FS00:0xa8:  Copyright: 2020 foldingathome.org
21:25:13:WU00:FS00:0xa8:   Homepage: https://foldingathome.org/
21:25:13:WU00:FS00:0xa8:       Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8:       Time: 19:24:44
21:25:13:WU00:FS00:0xa8:   Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
21:25:13:WU00:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8:       Bits: 64
21:25:13:WU00:FS00:0xa8:       Mode: Release
21:25:13:WU00:FS00:0xa8:       SIMD: avx2_256
21:25:13:WU00:FS00:0xa8:     OpenMP: ON
21:25:13:WU00:FS00:0xa8:       CUDA: OFF
21:25:13:WU00:FS00:0xa8:       Args: -dir 00 -suffix 01 -version 706 -lifeline 71241 -checkpoint 15 -np
21:25:13:WU00:FS00:0xa8:             4
21:25:13:WU00:FS00:0xa8:************************************ libFAH ************************************
21:25:13:WU00:FS00:0xa8:       Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8:       Time: 19:21:38
21:25:13:WU00:FS00:0xa8:   Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
21:25:13:WU00:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8:       Bits: 64
21:25:13:WU00:FS00:0xa8:       Mode: Release
21:25:13:WU00:FS00:0xa8:************************************ CBang *************************************
21:25:13:WU00:FS00:0xa8:       Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8:       Time: 19:21:24
21:25:13:WU00:FS00:0xa8:   Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
21:25:13:WU00:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8:       Bits: 64
21:25:13:WU00:FS00:0xa8:       Mode: Release
21:25:13:WU00:FS00:0xa8:************************************ System ************************************
21:25:13:WU00:FS00:0xa8:        CPU: AMD Ryzen 7 1700 Eight-Core Processor
21:25:13:WU00:FS00:0xa8:     CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1
21:25:13:WU00:FS00:0xa8:       CPUs: 16
21:25:13:WU00:FS00:0xa8:     Memory: 15.61GiB
21:25:13:WU00:FS00:0xa8:Free Memory: 478.75MiB
21:25:13:WU00:FS00:0xa8:    Threads: POSIX_THREADS
21:25:13:WU00:FS00:0xa8: OS Version: 5.4
21:25:13:WU00:FS00:0xa8:Has Battery: false
21:25:13:WU00:FS00:0xa8: On Battery: false
21:25:13:WU00:FS00:0xa8: UTC Offset: -8
21:25:13:WU00:FS00:0xa8:        PID: 71245
21:25:13:WU00:FS00:0xa8:        CWD: /var/lib/fahclient/work
21:25:13:WU00:FS00:0xa8:********************************************************************************
21:25:13:WU00:FS00:0xa8:Project: 12405 (Run 112, Clone 6, Gen 2)
21:25:13:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
21:25:13:WU00:FS00:0xa8:Digital signatures verified
21:25:13:WU00:FS00:0xa8:Calling: mdrun -c frame2.gro -s frame2.tpr -x frame2.xtc -cpt 15 -nt 4 -ntmpi 1
21:25:13:WU00:FS00:0xa8:Steps: first=5000000 total=10000000
21:25:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:25:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
muziqaz
Posts: 946
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 7950x3D, 5950x, 5800x3D, 3900x
7900xtx, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: FahCore 23 broken on Fedora 39

Post by muziqaz »

verdeva wrote: Sat Dec 09, 2023 9:26 pm I split the CPU into two 4 core processors and one is now folding Project: 16977 (Run 86, Clone 244, Gen 592). Slot 0 has the following log snip:

Code: Select all

21:24:13:WU00:FS00:0xa8:********************************************************************************
21:24:13:WU00:FS00:0xa8:Project: 12405 (Run 112, Clone 6, Gen 2)
21:24:13:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
21:24:13:WU00:FS00:0xa8:Digital signatures verified
21:24:13:WU00:FS00:0xa8:Calling: mdrun -c frame2.gro -s frame2.tpr -x frame2.xtc -cpt 15 -nt 4 -ntmpi 1
21:24:13:WU00:FS00:0xa8:Steps: first=5000000 total=10000000
21:24:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:24:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:25:13:WU00:FS00:Starting
21:25:13:WU00:FS00:Removing old file 'work/00/logfile_01-20231209-205312.txt'
21:25:13:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 00 -suffix 01 -version 706 -lifeline 1012 -checkpoint 15 -np 4
21:25:13:WU00:FS00:Started FahCore on PID 71241
21:25:13:WU00:FS00:Core PID:71245
21:25:13:WU00:FS00:FahCore 0xa8 started
21:25:13:WU00:FS00:0xa8:*********************** Log Started 2023-12-09T21:25:13Z ***********************
21:25:13:WU00:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
21:25:13:WU00:FS00:0xa8:       Core: Gromacs
21:25:13:WU00:FS00:0xa8:       Type: 0xa8
21:25:13:WU00:FS00:0xa8:    Version: 0.0.12
21:25:13:WU00:FS00:0xa8:     Author: Joseph Coffland <[email protected]>
21:25:13:WU00:FS00:0xa8:  Copyright: 2020 foldingathome.org
21:25:13:WU00:FS00:0xa8:   Homepage: https://foldingathome.org/
21:25:13:WU00:FS00:0xa8:       Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8:       Time: 19:24:44
21:25:13:WU00:FS00:0xa8:   Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
21:25:13:WU00:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8:       Bits: 64
21:25:13:WU00:FS00:0xa8:       Mode: Release
21:25:13:WU00:FS00:0xa8:       SIMD: avx2_256
21:25:13:WU00:FS00:0xa8:     OpenMP: ON
21:25:13:WU00:FS00:0xa8:       CUDA: OFF
21:25:13:WU00:FS00:0xa8:       Args: -dir 00 -suffix 01 -version 706 -lifeline 71241 -checkpoint 15 -np
21:25:13:WU00:FS00:0xa8:             4
21:25:13:WU00:FS00:0xa8:************************************ libFAH ************************************
21:25:13:WU00:FS00:0xa8:       Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8:       Time: 19:21:38
21:25:13:WU00:FS00:0xa8:   Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
21:25:13:WU00:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8:       Bits: 64
21:25:13:WU00:FS00:0xa8:       Mode: Release
21:25:13:WU00:FS00:0xa8:************************************ CBang *************************************
21:25:13:WU00:FS00:0xa8:       Date: Jan 16 2021
21:25:13:WU00:FS00:0xa8:       Time: 19:21:24
21:25:13:WU00:FS00:0xa8:   Compiler: GNU 8.3.0
21:25:13:WU00:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
21:25:13:WU00:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
21:25:13:WU00:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
21:25:13:WU00:FS00:0xa8:       Bits: 64
21:25:13:WU00:FS00:0xa8:       Mode: Release
21:25:13:WU00:FS00:0xa8:************************************ System ************************************
21:25:13:WU00:FS00:0xa8:        CPU: AMD Ryzen 7 1700 Eight-Core Processor
21:25:13:WU00:FS00:0xa8:     CPU ID: AuthenticAMD Family 23 Model 1 Stepping 1
21:25:13:WU00:FS00:0xa8:       CPUs: 16
21:25:13:WU00:FS00:0xa8:     Memory: 15.61GiB
21:25:13:WU00:FS00:0xa8:Free Memory: 478.75MiB
21:25:13:WU00:FS00:0xa8:    Threads: POSIX_THREADS
21:25:13:WU00:FS00:0xa8: OS Version: 5.4
21:25:13:WU00:FS00:0xa8:Has Battery: false
21:25:13:WU00:FS00:0xa8: On Battery: false
21:25:13:WU00:FS00:0xa8: UTC Offset: -8
21:25:13:WU00:FS00:0xa8:        PID: 71245
21:25:13:WU00:FS00:0xa8:        CWD: /var/lib/fahclient/work
21:25:13:WU00:FS00:0xa8:********************************************************************************
21:25:13:WU00:FS00:0xa8:Project: 12405 (Run 112, Clone 6, Gen 2)
21:25:13:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
21:25:13:WU00:FS00:0xa8:Digital signatures verified
21:25:13:WU00:FS00:0xa8:Calling: mdrun -c frame2.gro -s frame2.tpr -x frame2.xtc -cpt 15 -nt 4 -ntmpi 1
21:25:13:WU00:FS00:0xa8:Steps: first=5000000 total=10000000
21:25:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:25:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
Your log shows nothing failing, and it also shows no core_23 work done.
We need log showing where it is failing
FAH Omega tester
verdeva
Posts: 30
Joined: Mon Dec 03, 2007 1:40 pm
Location: Seattle, WA

Re: FahCore 23 broken on Fedora 39

Post by verdeva »

Here is a dump of my system info:

Code: Select all

System:    Kernel: 5.4.0-167-generic x86_64 bits: 64 compiler: gcc v: 9.4.0 
           Desktop: Cinnamon 4.6.7 wm: muffin dm: LightDM Distro: Linux Mint 20 Ulyana 
           base: Ubuntu 20.04 focal 
Machine:   Type: Desktop Mobo: Micro-Star model: B450 TOMAHAWK MAX II (MS-7C02) v: 3.0 
           serial: <filter> UEFI: American Megatrends LLC. v: H.40 date: 02/03/2021 
CPU:       Topology: 8-Core model: AMD Ryzen 7 1700 bits: 64 type: MT MCP arch: Zen rev: 1 
           L2 cache: 4096 KiB 
           flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 95992 
           Speed: 3155 MHz min/max: N/A Core speeds (MHz): 1: 3155 2: 3200 3: 3200 4: 2563 5: 3200 
           6: 3200 7: 3160 8: 3200 9: 3012 10: 3200 11: 3268 12: 2581 13: 3200 14: 3208 15: 3014 
           16: 3200 
Graphics:  Device-1: NVIDIA GT218 [GeForce 210] vendor: ASUSTeK driver: nvidia v: 340.108 
           bus ID: 26:00.0 chip ID: 10de:0a65 
           Display: x11 server: X.Org 1.20.13 driver: nvidia 
           unloaded: fbdev,modesetting,nouveau,vesa resolution: 1920x1080~60Hz 
           OpenGL: renderer: GeForce 210/PCIe/SSE2 v: 3.3.0 NVIDIA 340.108 direct render: Yes 
Audio:     Device-1: NVIDIA High Definition Audio vendor: ASUSTeK driver: snd_hda_intel v: kernel 
           bus ID: 26:00.1 chip ID: 10de:0be3 
           Device-2: AMD Family 17h HD Audio vendor: Micro-Star MSI driver: snd_hda_intel 
           v: kernel bus ID: 28:00.3 chip ID: 1022:1457 
           Sound Server: ALSA v: k5.4.0-167-generic 
Network:   Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: Micro-Star MSI 
           driver: r8169 v: kernel port: f000 bus ID: 22:00.0 chip ID: 10ec:8168 
           IF: enp34s0 state: up speed: 1000 Mbps duplex: full mac: <filter> 
Drives:    Local Storage: total: 1.05 TiB used: 118.35 GiB (11.0%) 
           ID-1: /dev/nvme0n1 vendor: Intel model: SSDPEKNW010T9 size: 953.87 GiB speed: 31.6 Gb/s 
           lanes: 4 serial: <filter> 
           ID-2: /dev/sda vendor: Samsung model: SSD 840 PRO Series size: 119.24 GiB 
           speed: 6.0 Gb/s serial: <filter> 
Partition: ID-1: / size: 937.33 GiB used: 118.34 GiB (12.6%) fs: ext4 dev: /dev/nvme0n1p2 
USB:       Hub: 1-0:1 info: Full speed (or root) Hub ports: 10 rev: 2.0 chip ID: 1d6b:0002 
           Device-1: 1-9:2 info: Chicony USB Optical Mouse type: Mouse driver: hid-generic,usbhid 
           rev: 2.0 chip ID: 04f2:0939 
           Hub: 2-0:1 info: Full speed (or root) Hub ports: 4 rev: 3.1 chip ID: 1d6b:0003 
           Hub: 3-0:1 info: Full speed (or root) Hub ports: 4 rev: 2.0 chip ID: 1d6b:0002 
           Device-2: 3-2:2 info: Chicony KU-0833 Keyboard type: Keyboard,HID 
           driver: hid-generic,usbhid rev: 2.0 chip ID: 04f2:0833 
           Hub: 4-0:1 info: Full speed (or root) Hub ports: 4 rev: 3.0 chip ID: 1d6b:0003 
Sensors:   System Temperatures: cpu: 43.9 C mobo: N/A gpu: nvidia temp: 45 C 
           Fan Speeds (RPM): N/A 
Repos:     No active apt repos in: /etc/apt/sources.list 
           Active apt repos in: /etc/apt/sources.list.d/official-package-repositories.list 
           1: deb http: //packages.linuxmint.com ulyana main upstream import backport
           2: deb http: //archive.ubuntu.com/ubuntu focal main restricted universe multiverse
           3: deb http: //archive.ubuntu.com/ubuntu focal-updates main restricted universe multiverse
           4: deb http: //archive.ubuntu.com/ubuntu focal-backports main restricted universe multiverse
           5: deb http: //security.ubuntu.com/ubuntu/ focal-security main restricted universe multiverse
           6: deb http: //archive.canonical.com/ubuntu/ focal partner
Info:      Processes: 356 Uptime: 1h 06m Memory: 15.61 GiB used: 2.68 GiB (17.1%) Init: systemd 
           v: 245 runlevel: 5 Compilers: gcc: 9.4.0 alt: 9 Client: Unknown python3.8 client 
           inxi: 3.0.38 
muziqaz
Posts: 946
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 7950x3D, 5950x, 5800x3D, 3900x
7900xtx, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: FahCore 23 broken on Fedora 39

Post by muziqaz »

verdeva wrote: Sat Dec 09, 2023 9:37 pm Here is a dump of my system info:

Code: Select all

System:    Kernel: 5.4.0-167-generic x86_64 bits: 64 compiler: gcc v: 9.4.0 
           Desktop: Cinnamon 4.6.7 wm: muffin dm: LightDM Distro: Linux Mint 20 Ulyana 
           base: Ubuntu 20.04 focal 
Machine:   Type: Desktop Mobo: Micro-Star model: B450 TOMAHAWK MAX II (MS-7C02) v: 3.0 
           serial: <filter> UEFI: American Megatrends LLC. v: H.40 date: 02/03/2021 
CPU:       Topology: 8-Core model: AMD Ryzen 7 1700 bits: 64 type: MT MCP arch: Zen rev: 1 
           L2 cache: 4096 KiB 
           flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 95992 
           Speed: 3155 MHz min/max: N/A Core speeds (MHz): 1: 3155 2: 3200 3: 3200 4: 2563 5: 3200 
           6: 3200 7: 3160 8: 3200 9: 3012 10: 3200 11: 3268 12: 2581 13: 3200 14: 3208 15: 3014 
           16: 3200 
Graphics:  Device-1: NVIDIA GT218 [GeForce 210] vendor: ASUSTeK driver: nvidia v: 340.108 
           bus ID: 26:00.0 chip ID: 10de:0a65 
           Display: x11 server: X.Org 1.20.13 driver: nvidia 
           unloaded: fbdev,modesetting,nouveau,vesa resolution: 1920x1080~60Hz 
           OpenGL: renderer: GeForce 210/PCIe/SSE2 v: 3.3.0 NVIDIA 340.108 direct render: Yes 
Audio:     Device-1: NVIDIA High Definition Audio vendor: ASUSTeK driver: snd_hda_intel v: kernel 
           bus ID: 26:00.1 chip ID: 10de:0be3 
           Device-2: AMD Family 17h HD Audio vendor: Micro-Star MSI driver: snd_hda_intel 
           v: kernel bus ID: 28:00.3 chip ID: 1022:1457 
           Sound Server: ALSA v: k5.4.0-167-generic 
Network:   Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: Micro-Star MSI 
           driver: r8169 v: kernel port: f000 bus ID: 22:00.0 chip ID: 10ec:8168 
           IF: enp34s0 state: up speed: 1000 Mbps duplex: full mac: <filter> 
Drives:    Local Storage: total: 1.05 TiB used: 118.35 GiB (11.0%) 
           ID-1: /dev/nvme0n1 vendor: Intel model: SSDPEKNW010T9 size: 953.87 GiB speed: 31.6 Gb/s 
           lanes: 4 serial: <filter> 
           ID-2: /dev/sda vendor: Samsung model: SSD 840 PRO Series size: 119.24 GiB 
           speed: 6.0 Gb/s serial: <filter> 
Partition: ID-1: / size: 937.33 GiB used: 118.34 GiB (12.6%) fs: ext4 dev: /dev/nvme0n1p2 
USB:       Hub: 1-0:1 info: Full speed (or root) Hub ports: 10 rev: 2.0 chip ID: 1d6b:0002 
           Device-1: 1-9:2 info: Chicony USB Optical Mouse type: Mouse driver: hid-generic,usbhid 
           rev: 2.0 chip ID: 04f2:0939 
           Hub: 2-0:1 info: Full speed (or root) Hub ports: 4 rev: 3.1 chip ID: 1d6b:0003 
           Hub: 3-0:1 info: Full speed (or root) Hub ports: 4 rev: 2.0 chip ID: 1d6b:0002 
           Device-2: 3-2:2 info: Chicony KU-0833 Keyboard type: Keyboard,HID 
           driver: hid-generic,usbhid rev: 2.0 chip ID: 04f2:0833 
           Hub: 4-0:1 info: Full speed (or root) Hub ports: 4 rev: 3.0 chip ID: 1d6b:0003 
Sensors:   System Temperatures: cpu: 43.9 C mobo: N/A gpu: nvidia temp: 45 C 
           Fan Speeds (RPM): N/A 
Repos:     No active apt repos in: /etc/apt/sources.list 
           Active apt repos in: /etc/apt/sources.list.d/official-package-repositories.list 
           1: deb http: //packages.linuxmint.com ulyana main upstream import backport
           2: deb http: //archive.ubuntu.com/ubuntu focal main restricted universe multiverse
           3: deb http: //archive.ubuntu.com/ubuntu focal-updates main restricted universe multiverse
           4: deb http: //archive.ubuntu.com/ubuntu focal-backports main restricted universe multiverse
           5: deb http: //security.ubuntu.com/ubuntu/ focal-security main restricted universe multiverse
           6: deb http: //archive.canonical.com/ubuntu/ focal partner
Info:      Processes: 356 Uptime: 1h 06m Memory: 15.61 GiB used: 2.68 GiB (17.1%) Init: systemd 
           v: 245 runlevel: 5 Compilers: gcc: 9.4.0 alt: 9 Client: Unknown python3.8 client 
           inxi: 3.0.38 
I doubt your issue is related to this topic. Your GPU cannot get core_23 WUs, as it is too old and too slow. Please, create another thread detailing what is failing and where, with related fahlogs.
Thanks
FAH Omega tester
bikeaddict
Posts: 210
Joined: Sun May 03, 2020 1:20 am

Re: FahCore 23 broken on Fedora 39

Post by bikeaddict »

This a CPU task on Core A8 crashing for verdeva. Some kind of crash may be causing INTERRUPTED (102 = 0x66). Check output of

Code: Select all

journalctl -p err
for any crashes.
verdeva
Posts: 30
Joined: Mon Dec 03, 2007 1:40 pm
Location: Seattle, WA

Re: FahCore 23 broken on Fedora 39

Post by verdeva »

Forgive me, im an old dial-up guy! I cannot get past the Interrupted log line, same as the first poster. Just loops for hours.

Note: This is a PCU folding unit. Don't assume it's on GPU.

21:40:17:WU00:FS00:0xa8:Completed 1 out of 5000000 steps (0%)
21:40:18:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)

Probably don't understand the question.
verdeva
Posts: 30
Joined: Mon Dec 03, 2007 1:40 pm
Location: Seattle, WA

Re: FahCore 23 broken on Fedora 39

Post by verdeva »

Executing journalctl -p err produces the following (just the last block copied) No crashing indicated.

-- Logs begin at Tue 2021-07-06 05:55:01 PDT, end at Sat 2023-12-09 13:54:51 PST. --
Jul 06 07:15:13 dave-AB350M-HD3 systemd-coredump[1422459]: Process 1412949 (FahCore_a8) of user 126 dumped core.

Stack trace of thread 1412954:
#0 0x0000000000a02687 n/a (FahCore_a8 + 0x602687)
#1 0x00000000008c49cd n/a (FahCore_a8 + 0x4c49cd)
#2 0x0000000000ef4b5e n/a (FahCore_a8 + 0xaf4b5e)
#3 0x00007f2a9b738609 start_thread (libpthread.so.0 + 0x9609)
#4 0x00007f2a9b50a293 __clone (libc.so.6 + 0x122293)

Stack trace of thread 1412956:
#0 0x0000000000a03c0d n/a (FahCore_a8 + 0x603c0d)
#1 0x00000000008c49cd n/a (FahCore_a8 + 0x4c49cd)
#2 0x0000000000ef4b5e n/a (FahCore_a8 + 0xaf4b5e)
#3 0x00007f2a9b738609 start_thread (libpthread.so.0 + 0x9609)
#4 0x00007f2a9b50a293 __clone (libc.so.6 + 0x122293)

Stack trace of thread 1412961:
#0 0x0000000000a03cb2 n/a (FahCore_a8 + 0x603cb2)
#1 0x00000000008c49cd n/a (FahCore_a8 + 0x4c49cd)
#2 0x0000000000ef4b5e n/a (FahCore_a8 + 0xaf4b5e)
#3 0x00007f2a9b738609 start_thread (libpthread.so.0 + 0x9609)
#4 0x00007f2a9b50a293 __clone (libc.so.6 + 0x122293)
lines 1-23
muziqaz
Posts: 946
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 7950x3D, 5950x, 5800x3D, 3900x
7900xtx, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: FahCore 23 broken on Fedora 39

Post by muziqaz »

Your OS is killing fahcore process for some reason. Unstable hardware would be one of the reasons.
FAH Omega tester
muziqaz
Posts: 946
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 7950x3D, 5950x, 5800x3D, 3900x
7900xtx, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: FahCore 23 broken on Fedora 39

Post by muziqaz »

Ah, you are out of RAM. It says 478something MB of memory free

I am not sure why Fahclient thinks you only have so little memory left :/
FAH Omega tester
muziqaz
Posts: 946
Joined: Sun Dec 16, 2007 6:22 pm
Hardware configuration: 7950x3D, 5950x, 5800x3D, 3900x
7900xtx, Radeon 7, 5700xt, 6900xt, RX 550 640SP
Location: London
Contact:

Re: FahCore 23 broken on Fedora 39

Post by muziqaz »

wdanwatts wrote: Fri Dec 01, 2023 4:11 am When I run the software for any length of time, it gives me a core 23 project and then I get this:

Code: Select all

03:56:50:WU00:FS00:Starting
03:56:50:WU00:FS00:Removing old file 'work/00/logfile_01-20231201-032446.txt'
03:56:50:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/openmm-core-23/centos-7.9.2009-64bit/release/0x23-8.0.3/Core_23.fah/FahCore_23 -dir 00 -suffix 01 -version 706 -lifeline 1612 -checkpoint 30 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
03:56:50:WU00:FS00:Started FahCore on PID 40865
03:56:50:WU00:FS00:Core PID:40869
03:56:50:WU00:FS00:FahCore 0x23 started
03:56:51:WU00:FS00:0x23:*********************** Log Started 2023-12-01T03:56:50Z ***********************
03:56:51:WU00:FS00:0x23:*************************** Core23 Folding@home Core ***************************
03:56:51:WU00:FS00:0x23:       Core: Core23
03:56:51:WU00:FS00:0x23:       Type: 0x23
03:56:51:WU00:FS00:0x23:    Version: 8.0.3
03:56:51:WU00:FS00:0x23:     Author: Joseph Coffland <[email protected]>
03:56:51:WU00:FS00:0x23:  Copyright: 2022 foldingathome.org
03:56:51:WU00:FS00:0x23:   Homepage: https://foldingathome.org/
03:56:51:WU00:FS00:0x23:       Date: Aug 3 2023
03:56:51:WU00:FS00:0x23:       Time: 08:28:22
03:56:51:WU00:FS00:0x23:   Revision: 199cb870317d05441d0a301287d9ef61254fa32b
03:56:51:WU00:FS00:0x23:     Branch: HEAD
03:56:51:WU00:FS00:0x23:   Compiler: GNU 7.5.0
03:56:51:WU00:FS00:0x23:    Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
03:56:51:WU00:FS00:0x23:             -fdata-sections -O3 -funroll-loops -fno-pie
03:56:51:WU00:FS00:0x23:             -DOPENMM_VERSION="\"8.0.0\""
03:56:51:WU00:FS00:0x23:   Platform: linux 5.15.0-1041-azure
03:56:51:WU00:FS00:0x23:       Bits: 64
03:56:51:WU00:FS00:0x23:       Mode: Release
03:56:51:WU00:FS00:0x23:Maintainers: John Chodera <[email protected]> and Peter Eastman
03:56:51:WU00:FS00:0x23:             <[email protected]>
03:56:51:WU00:FS00:0x23:       Args: -dir 00 -suffix 01 -version 706 -lifeline 40865 -checkpoint 30
03:56:51:WU00:FS00:0x23:             -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
03:56:51:WU00:FS00:0x23:             0 -gpu 0
03:56:51:WU00:FS00:0x23:************************************ libFAH ************************************
03:56:51:WU00:FS00:0x23:       Date: Aug 3 2023
03:56:51:WU00:FS00:0x23:       Time: 08:27:48
03:56:51:WU00:FS00:0x23:   Revision: 112c2234abe20611a05652defc3c7f854cbf927f
03:56:51:WU00:FS00:0x23:     Branch: HEAD
03:56:51:WU00:FS00:0x23:   Compiler: GNU 7.5.0
03:56:51:WU00:FS00:0x23:    Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
03:56:51:WU00:FS00:0x23:             -fdata-sections -O3 -funroll-loops -fno-pie
03:56:51:WU00:FS00:0x23:   Platform: linux 5.15.0-1041-azure
03:56:51:WU00:FS00:0x23:       Bits: 64
03:56:51:WU00:FS00:0x23:       Mode: Release
03:56:51:WU00:FS00:0x23:************************************ CBang *************************************
03:56:51:WU00:FS00:0x23:    Version: 1.7.2
03:56:51:WU00:FS00:0x23:     Author: Joseph Coffland <[email protected]>
03:56:51:WU00:FS00:0x23:        Org: Cauldron Development LLC
03:56:51:WU00:FS00:0x23:  Copyright: Cauldron Development LLC, 2003-2023
03:56:51:WU00:FS00:0x23:   Homepage: https://cauldrondevelopment.com/
03:56:51:WU00:FS00:0x23:    License: GPL 2+
03:56:51:WU00:FS00:0x23:       Date: Aug 3 2023
03:56:51:WU00:FS00:0x23:       Time: 08:27:30
03:56:51:WU00:FS00:0x23:   Revision: eae4b58965bdd4d54ea9eb77972674352b37a547
03:56:51:WU00:FS00:0x23:     Branch: HEAD
03:56:51:WU00:FS00:0x23:   Compiler: GNU 7.5.0
03:56:51:WU00:FS00:0x23:    Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
03:56:51:WU00:FS00:0x23:             -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
03:56:51:WU00:FS00:0x23:   Platform: linux 5.15.0-1041-azure
03:56:51:WU00:FS00:0x23:       Bits: 64
03:56:51:WU00:FS00:0x23:       Mode: Release
03:56:51:WU00:FS00:0x23:************************************ System ************************************
03:56:51:WU00:FS00:0x23:        CPU: AMD Phenom(tm) II X2 545 Processor
03:56:51:WU00:FS00:0x23:     CPU ID: AuthenticAMD Family 16 Model 4 Stepping 2
03:56:51:WU00:FS00:0x23:       CPUs: 2
03:56:51:WU00:FS00:0x23:     Memory: 3.81GiB
03:56:51:WU00:FS00:0x23:Free Memory: 744.96MiB
03:56:51:WU00:FS00:0x23:    Threads: POSIX_THREADS
03:56:51:WU00:FS00:0x23: OS Version: 6.5
03:56:51:WU00:FS00:0x23:Has Battery: false
03:56:51:WU00:FS00:0x23: On Battery: false
03:56:51:WU00:FS00:0x23: UTC Offset: -6
03:56:51:WU00:FS00:0x23:        PID: 40869
03:56:51:WU00:FS00:0x23:        CWD: /var/lib/fahclient/work
03:56:51:WU00:FS00:0x23:       Exec: /var/lib/fahclient/cores/cores.foldingathome.org/openmm-core-23/centos-7.9.2009-64bit/release/0x23-8.0.3/Core_23.fah/FahCore_23
03:56:51:WU00:FS00:0x23:************************************ OpenMM ************************************
03:56:51:WU00:FS00:0x23:    Version: 8.0.0
03:56:51:WU00:FS00:0x23:********************************************************************************
03:56:51:WU00:FS00:0x23:Project: 12248 (Run 0, Clone 105, Gen 10)
03:56:51:WU00:FS00:0x23:Digital signatures verified
03:56:51:WU00:FS00:0x23:Folding@home GPU Core23 Folding@home Core
03:56:51:WU00:FS00:0x23:Version 8.0.3
03:56:51:WU00:FS00:0x23:  Checkpoint write interval: 50000 steps (2%) [50 total]
03:56:51:WU00:FS00:0x23:  JSON viewer frame write interval: 25000 steps (1%) [100 total]
03:56:51:WU00:FS00:0x23:  XTC frame write interval: 25000 steps (1%) [100 total]
03:56:51:WU00:FS00:0x23:  Global context and integrator variables write interval: disabled
03:56:51:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
It continually cycles. I can stop this behavior by removing the GPU slot and re-entering it. That lasts until another core 23 job is assigned.
I'm runnung Fedora Linux 39 (Workstation Edition) on an AMD Phenom™ II X2 545 × 2 with a NVIDIA GeForce GTX 1660 SUPER GPU. It had been running ~ 1 million 'points' per day.
Running the command suggested in another thread

Code: Select all

./var/lib/fahclient/cores/cores.foldingathome.org/openmm-core-23/centos-7.9.2009-64bit/release/0x23-8.0.3/Core_23.fah/FahCore_23 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
gets a reply

Code: Select all

 error while loading shared libraries: libOpenMM.so.8.0: cannot open shared object file: No such file or directory
even though the 'offending' library is in that folder, and removing the Core_23 directory (which causes a new one to be rebuilt) causes no different behavior.
Is this core cursed, or does Fedora 39 have a folding problem?
In those logs, the next step after

Code: Select all

Global context and integrator variables write interval: disabled
would be this:

Code: Select all

22:43:04:WU00:FS02:0x23:There are 3 platforms available.
22:43:04:WU00:FS02:0x23:Platform 0: Reference
22:43:04:WU00:FS02:0x23:Platform 1: CPU
22:43:04:WU00:FS02:0x23:Platform 2: OpenCL
22:43:04:WU00:FS02:0x23:  opencl-device 0 specified
22:43:06:WU00:FS02:0x23:Attempting to create OpenCL context:
22:43:06:WU00:FS02:0x23:  Configuring platform OpenCL
22:43:10:WU00:FS02:0x23:  Using OpenCL on OpenCL platformId 0 and gpu 0
22:43:10:WU00:FS02:0x23:  GPU info: Platform: OpenCL: AMD Accelerated Parallel Processing
22:43:10:WU00:FS02:0x23:  GPU info: PlatformIndex: 0
22:43:10:WU00:FS02:0x23:  GPU info: Device: gfx1100
22:43:10:WU00:FS02:0x23:  GPU info: DeviceIndex: 0
22:43:10:WU00:FS02:0x23:  GPU info: Vendor: 0x1002
22:43:10:WU00:FS02:0x23:  GPU info: PCI: 03:00:00
22:43:10:WU00:FS02:0x23:  GPU info: Compute: 2.0
22:43:10:WU00:FS02:0x23:  GPU info: Driver: 3592.0
22:43:10:WU00:FS02:0x23:  GPU info: GPU: true
22:43:10:WU00:FS02:0x23:Completed 0 out of 2500000 steps (0%)
Obviously yours would be:
Platform 0: Reference
Platform 1: CPU
Platform 2: CUDA
Platform 3: OpenCL

what does clinfo and CUDA equivalent return? Though Fahclient would cry if none of the required platforms/devices were seen/available.
Is this on v7 of fahclient, or v8?
FAH Omega tester
verdeva
Posts: 30
Joined: Mon Dec 03, 2007 1:40 pm
Location: Seattle, WA

Re: FahCore 23 broken on Fedora 39

Post by verdeva »

Unstable HW?
Running two 4 core processor slots 0/1 with folding units: 12405 (Run 112, Clone 6, Gen 2) and 16977 (Run 86, Clone 244, Gen 592) respectively.

Slot 0 stops instantly at step 0, with the interrupted message until FahCore Run attempts again.
Slot 1 is at 24%. No issues

Are there unique HW req to run unit 12405? No stability issues have been noted except for this interrupted message. My system HW and settings are unchanged for over a year,

Just read your post about free memory. System monitor reports using 12% of 15.6 for over 13 Gb free. Fah control logs show free mem 475 and 911 Mb for slots 0/1 respectively. How can Fah see less memory?


Your help is really appreciated, Thanks Dave
Post Reply