Page 1 of 1

Verify [AMD GPU on Arch Linux]

Posted: Sat Mar 28, 2020 6:53 pm
by raooowty
Mod note: added OS and GPU type to Subject line

Hello,

I'm oldschool user (some odd 10 years ago), now out of hibernation ready to Fold again.
I understand WUs are scarce. I get WUs for CPU, and my first WU for GFX did not seem to work somehow (that's what the log is about):
If any additional info is needed, I happily provide it.

My question is; does my setup look ready for gfx-folding?

Code: Select all

$ uname -r
5.5.11-arch1-1

Code: Select all

$ yay -Q foldingathome ocl-icd opencl-mesa mesa fahcontrol 
foldingathome 7.5.1-1
ocl-icd 2.2.12-3
opencl-mesa 19.3.4-2
mesa 19.3.4-2
fahcontrol 7.5.1-1
Image

Here is the main log from my "best attempt" at GFX folding. When googling for error code, I retrieved necro-old-threads that all had to do with aged drivers. I cannot see how mine are.

Code: Select all

15:13:31:WU02:FS01:Connecting to 18.218.241.186:80
15:13:32:WU02:FS01:Assigned to work server 13.90.152.57
15:13:32:WU02:FS01:Requesting new work unit for slot 01: READY gpu:0:Pitcairn [Radeon HD 7800] from 13.90.152.57
15:13:32:WU02:FS01:Connecting to 13.90.152.57:8080
15:13:54:WU02:FS01:Downloading 50.74MiB
15:14:00:WU02:FS01:Download 6.78%
15:14:06:WU02:FS01:Download 14.54%
15:14:12:WU02:FS01:Download 22.91%
15:14:18:WU02:FS01:Download 30.06%
15:14:24:WU02:FS01:Download 36.96%
15:14:30:WU02:FS01:Download 45.09%
15:14:36:WU02:FS01:Download 56.17%
15:14:42:WU02:FS01:Download 65.17%
15:14:49:WU02:FS01:Download 73.17%
15:14:55:WU02:FS01:Download 80.44%
15:15:01:WU02:FS01:Download 88.69%
15:15:06:WU02:FS01:Download complete
15:15:06:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:11779 run:0 clone:4000 gen:10 core:0x22 unit:0x000000150d5a98395e73c5a4942c2776
15:15:06:WU02:FS01:Starting
15:15:06:WU02:FS01:Running FahCore: /opt/fah/FAHCoreWrapper /opt/fah/cores/cores.foldingathome.org/v7/lin/64bit/Core_22.fah/FahCore_22 -dir 02 -suffix 01 -version 705 -lifeline 21971 -checkpoint 15 -gpu-vendor amd -opencl-platform 0 -opencl-device 0 -gpu 0
15:15:06:WU02:FS01:Started FahCore on PID 29101
15:15:06:WU02:FS01:Core PID:29105
15:15:06:WU02:FS01:FahCore 0x22 started
15:15:07:WU02:FS01:0x22:*********************** Log Started 2020-03-28T15:15:06Z ***********************
15:15:07:WU02:FS01:0x22:*************************** Core22 Folding@home Core ***************************
15:15:07:WU02:FS01:0x22:       Type: 0x22
15:15:07:WU02:FS01:0x22:       Core: Core22
15:15:07:WU02:FS01:0x22:    Website: https://foldingathome.org/
15:15:07:WU02:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
15:15:07:WU02:FS01:0x22:     Author: John Chodera <[email protected]> and Rafal Wiewiora
15:15:07:WU02:FS01:0x22:             <[email protected]>
15:15:07:WU02:FS01:0x22:       Args: -dir 02 -suffix 01 -version 705 -lifeline 29101 -checkpoint 15
15:15:07:WU02:FS01:0x22:             -gpu-vendor amd -opencl-platform 0 -opencl-device 0 -gpu 0
15:15:07:WU02:FS01:0x22:     Config: <none>
15:15:07:WU02:FS01:0x22:************************************ Build *************************************
15:15:07:WU02:FS01:0x22:    Version: 0.0.2
15:15:07:WU02:FS01:0x22:       Date: Dec 6 2019
15:15:07:WU02:FS01:0x22:       Time: 21:20:17
15:15:07:WU02:FS01:0x22: Repository: Git
15:15:07:WU02:FS01:0x22:   Revision: f87d92b58abdf7e6bf2e173cfbc4dc3e837c7042
15:15:07:WU02:FS01:0x22:     Branch: core22
15:15:07:WU02:FS01:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
15:15:07:WU02:FS01:0x22:    Options: -std=gnu++98 -O3 -funroll-loops
15:15:07:WU02:FS01:0x22:   Platform: linux2 4.9.87-linuxkit-aufs
15:15:07:WU02:FS01:0x22:       Bits: 64
15:15:07:WU02:FS01:0x22:       Mode: Release
15:15:07:WU02:FS01:0x22:************************************ System ************************************
15:15:07:WU02:FS01:0x22:        CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
15:15:07:WU02:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
15:15:07:WU02:FS01:0x22:       CPUs: 8
15:15:07:WU02:FS01:0x22:     Memory: 15.56GiB
15:15:07:WU02:FS01:0x22:Free Memory: 6.84GiB
15:15:07:WU02:FS01:0x22:    Threads: POSIX_THREADS
15:15:07:WU02:FS01:0x22: OS Version: 5.5
15:15:07:WU02:FS01:0x22:Has Battery: false
15:15:07:WU02:FS01:0x22: On Battery: false
15:15:07:WU02:FS01:0x22: UTC Offset: 1
15:15:07:WU02:FS01:0x22:        PID: 29105
15:15:07:WU02:FS01:0x22:        CWD: /opt/fah/work
15:15:07:WU02:FS01:0x22:         OS: Linux 5.5.11-arch1-1 x86_64
15:15:07:WU02:FS01:0x22:    OS Arch: AMD64
15:15:07:WU02:FS01:0x22:********************************************************************************
15:15:07:WU02:FS01:0x22:Project: 11779 (Run 0, Clone 4000, Gen 10)
15:15:07:WU02:FS01:0x22:Unit: 0x000000150d5a98395e73c5a4942c2776
15:15:07:WU02:FS01:0x22:Reading tar file core.xml
15:15:07:WU02:FS01:0x22:Reading tar file integrator.xml
15:15:07:WU02:FS01:0x22:Reading tar file state.xml
15:15:07:WU02:FS01:0x22:Reading tar file system.xml
15:15:07:WU02:FS01:0x22:Digital signatures verified
15:15:07:WU02:FS01:0x22:Folding@home GPU Core22 Folding@home Core
15:15:07:WU02:FS01:0x22:Version 0.0.2
15:15:07:WU02:FS01:0x22:ERROR:126: Bad platformId size.
15:15:07:WU02:FS01:0x22:Saving result file ../logfile_01.txt
15:15:07:WU02:FS01:0x22:Saving result file science.log
15:15:07:WU02:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
15:15:07:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:15:07:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:11779 run:0 clone:4000 gen:10 core:0x22 unit:0x000000150d5a98395e73c5a4942c2776
15:15:07:WU02:FS01:Uploading 7.00KiB to 13.90.152.57
15:15:07:WU02:FS01:Connecting to 13.90.152.57:8080
15:15:28:WU02:FS01:Upload complete
15:15:28:WU02:FS01:Server responded WORK_ACK (400)
15:15:28:WU02:FS01:Cleaning up
16:52:56:FS01:Paused
16:52:56:FS01:Unpaused
16:52:56:WU00:FS01:Connecting to 65.254.110.245:8080
16:52:57:WARNING:WU00:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
16:52:57:WU00:FS01:Connecting to 18.218.241.186:80

Re: Verify

Posted: Sat Mar 28, 2020 8:47 pm
by Joe_H
There are some posts from recent users no setting up for Arch Linux, perhaps someone will get you pointed towards them.

In the meantime, to use your HD 7800 series cards you will need to switch to the AMD proprietary drivers and OpenCL support, the GPU folding core does not work with the open source Mesa drivers shown in your second Code block.

Re: Verify

Posted: Sun Mar 29, 2020 7:14 am
by raooowty
Joe_H wrote:There are some posts from recent users no setting up for Arch Linux, perhaps someone will get you punted towards them.

In the meantime, to use your HD 7800 series cards you will need to switch to the AMD proprietary drivers and OpenCL support, the GPU folding core does not work with the open source Mesa drivers shown in your second Code block.
Thank you!

Re: Verify

Posted: Sun Mar 29, 2020 9:04 am
by AmunRa

Re: Verify

Posted: Sun Mar 29, 2020 11:09 am
by raooowty
Thank you AmunRa, very helpful post.

Unfortunately it does not add to my attempts as of yet.

Since the time of my posting I've progressed slightly and attempted the following:

- removing opencl-mesa
- adding opencl-amd
-> this resulted in foldingathome.service no longer being able to start, with the following error:

Code: Select all

FAHClient[592]: amdgpu_device_initialize: DRM version is 2.50.0 but this driver is only compatible with 3.x.x.
- removing opencl-amd
- adding rocm-amdgpu-pro-orca (per friend recommendation: https://wiki.archlinux.org/index.php/GPGPU#AMD/ATI )
-> this resulted in foldingathome.service no longer being able to start, with the following error:

Code: Select all

FAHClient[592]: amdgpu_device_initialize: DRM version is 2.50.0 but this driver is only compatible with 3.x.x.
- removing rocm-amdgpu-pro-orca
- re:adding opencl-mesa
-> service starts again, and I can at least fold via the CPU.

Output of

Code: Select all

clinfo

Code: Select all

$ clinfo
Number of platforms                               1
  Platform Name                                   Clover
  Platform Vendor                                 Mesa
  Platform Version                                OpenCL 1.1 Mesa 19.3.4
  Platform Profile                                FULL_PROFILE
  Platform Extensions                             cl_khr_icd
  Platform Extensions function suffix             MESA

  Platform Name                                   Clover
Number of devices                                 1
  Device Name                                     AMD PITCAIRN (DRM 2.50.0, 5.5.11-arch1-1, LLVM 9.0.1)
  Device Vendor                                   AMD
  Device Vendor ID                                0x1002
  Device Version                                  OpenCL 1.1 Mesa 19.3.4
  Driver Version                                  19.3.4
  Device OpenCL C Version                         OpenCL C 1.1 
  Device Type                                     GPU
  Device Profile                                  FULL_PROFILE
  Device Available                                Yes
  Compiler Available                              Yes
  Max compute units                               20
  Max clock frequency                             1010MHz
  Max work item dimensions                        3
  Max work item sizes                             256x256x256
  Max work group size                             256
  Preferred work group size multiple              64
  Preferred / native vector sizes                 
    char                                                16 / 16      
    short                                                8 / 8       
    int                                                  4 / 4       
    long                                                 2 / 2       
    half                                                 8 / 8        (cl_khr_fp16)
    float                                                4 / 4       
    double                                               2 / 2        (cl_khr_fp64)
  Half-precision Floating-point support           (cl_khr_fp16)
    Denormals                                     No
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 No
    Round to infinity                             No
    IEEE754-2008 fused multiply-add               No
    Support is emulated in software               No
  Single-precision Floating-point support         (core)
    Denormals                                     No
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 No
    Round to infinity                             No
    IEEE754-2008 fused multiply-add               No
    Support is emulated in software               No
    Correctly-rounded divide and sqrt operations  No
  Double-precision Floating-point support         (cl_khr_fp64)
    Denormals                                     Yes
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 Yes
    Round to infinity                             Yes
    IEEE754-2008 fused multiply-add               Yes
    Support is emulated in software               No
  Address bits                                    64, Little-Endian
  Global memory size                              2147483648 (2GiB)
  Error Correction support                        No
  Max memory allocation                           1503238553 (1.4GiB)
  Unified memory for Host and Device              No
  Minimum alignment for any data type             128 bytes
  Alignment of base address                       32768 bits (4096 bytes)
  Global Memory cache type                        None
  Image support                                   No
  Local memory type                               Local
  Local memory size                               32768 (32KiB)
  Max number of constant args                     16
  Max constant buffer size                        1503238553 (1.4GiB)
  Max size of kernel argument                     1024
  Queue properties                                
    Out-of-order execution                        No
    Profiling                                     Yes
  Profiling timer resolution                      0ns
  Execution capabilities                          
    Run OpenCL kernels                            Yes
    Run native kernels                            No
  Device Extensions                               cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_fp64 cl_khr_fp16

NULL platform behavior
  clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...)  Clover
  clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...)   Success [MESA]
  clCreateContext(NULL, ...) [default]            Success [MESA]
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT)  Success (1)
    Platform Name                                 Clover
    Device Name                                   AMD PITCAIRN (DRM 2.50.0, 5.5.11-arch1-1, LLVM 9.0.1)
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU)  Success (1)
    Platform Name                                 Clover
    Device Name                                   AMD PITCAIRN (DRM 2.50.0, 5.5.11-arch1-1, LLVM 9.0.1)
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL)  Success (1)
    Platform Name                                 Clover
    Device Name                                   AMD PITCAIRN (DRM 2.50.0, 5.5.11-arch1-1, LLVM 9.0.1)

ICD loader properties
  ICD loader Name                                 OpenCL ICD Loader
  ICD loader Vendor                               OCL Icd free software
  ICD loader Version                              2.2.12
  ICD loader Profile                              OpenCL 2.2
[zrk@main ~]$ clinfo
Number of platforms                               1
  Platform Name                                   Clover
  Platform Vendor                                 Mesa
  Platform Version                                OpenCL 1.1 Mesa 19.3.4
  Platform Profile                                FULL_PROFILE
  Platform Extensions                             cl_khr_icd
  Platform Extensions function suffix             MESA

  Platform Name                                   Clover
Number of devices                                 1
  Device Name                                     AMD PITCAIRN (DRM 2.50.0, 5.5.11-arch1-1, LLVM 9.0.1)
  Device Vendor                                   AMD
  Device Vendor ID                                0x1002
  Device Version                                  OpenCL 1.1 Mesa 19.3.4
  Driver Version                                  19.3.4
  Device OpenCL C Version                         OpenCL C 1.1 
  Device Type                                     GPU
  Device Profile                                  FULL_PROFILE
  Device Available                                Yes
  Compiler Available                              Yes
  Max compute units                               20
  Max clock frequency                             1010MHz
  Max work item dimensions                        3
  Max work item sizes                             256x256x256
  Max work group size                             256
  Preferred work group size multiple              64
  Preferred / native vector sizes                 
    char                                                16 / 16      
    short                                                8 / 8       
    int                                                  4 / 4       
    long                                                 2 / 2       
    half                                                 8 / 8        (cl_khr_fp16)
    float                                                4 / 4       
    double                                               2 / 2        (cl_khr_fp64)
  Half-precision Floating-point support           (cl_khr_fp16)
    Denormals                                     No
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 No
    Round to infinity                             No
    IEEE754-2008 fused multiply-add               No
    Support is emulated in software               No
  Single-precision Floating-point support         (core)
    Denormals                                     No
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 No
    Round to infinity                             No
    IEEE754-2008 fused multiply-add               No
    Support is emulated in software               No
    Correctly-rounded divide and sqrt operations  No
  Double-precision Floating-point support         (cl_khr_fp64)
    Denormals                                     Yes
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 Yes
    Round to infinity                             Yes
    IEEE754-2008 fused multiply-add               Yes
    Support is emulated in software               No
  Address bits                                    64, Little-Endian
  Global memory size                              2147483648 (2GiB)
  Error Correction support                        No
  Max memory allocation                           1503238553 (1.4GiB)
  Unified memory for Host and Device              No
  Minimum alignment for any data type             128 bytes
  Alignment of base address                       32768 bits (4096 bytes)
  Global Memory cache type                        None
  Image support                                   No
  Local memory type                               Local
  Local memory size                               32768 (32KiB)
  Max number of constant args                     16
  Max constant buffer size                        1503238553 (1.4GiB)
  Max size of kernel argument                     1024
  Queue properties                                
    Out-of-order execution                        No
    Profiling                                     Yes
  Profiling timer resolution                      0ns
  Execution capabilities                          
    Run OpenCL kernels                            Yes
    Run native kernels                            No
  Device Extensions                               cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_fp64 cl_khr_fp16

NULL platform behavior
  clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...)  Clover
  clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...)   Success [MESA]
  clCreateContext(NULL, ...) [default]            Success [MESA]
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT)  Success (1)
    Platform Name                                 Clover
    Device Name                                   AMD PITCAIRN (DRM 2.50.0, 5.5.11-arch1-1, LLVM 9.0.1)
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU)  Success (1)
    Platform Name                                 Clover
    Device Name                                   AMD PITCAIRN (DRM 2.50.0, 5.5.11-arch1-1, LLVM 9.0.1)
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL)  Success (1)
    Platform Name                                 Clover
    Device Name                                   AMD PITCAIRN (DRM 2.50.0, 5.5.11-arch1-1, LLVM 9.0.1)

ICD loader properties
  ICD loader Name                                 OpenCL ICD Loader
  ICD loader Vendor                               OCL Icd free software
  ICD loader Version                              2.2.12
  ICD loader Profile                              OpenCL 2.2
However, there is this also, where Im using Kernel driver 'radeon'. I dont know what to make of that at this point.

Code: Select all

$ lspci -k | egrep 'VGA|in use'
        Kernel driver in use: snb_uncore
        Kernel driver in use: pcieport
        Kernel driver in use: i915
        Kernel driver in use: mei_me
        Kernel driver in use: ehci-pci
        Kernel driver in use: snd_hda_intel
        Kernel driver in use: pcieport
        Kernel driver in use: pcieport
        Kernel driver in use: pcieport
        Kernel driver in use: pcieport
        Kernel driver in use: pcieport
        Kernel driver in use: pcieport
        Kernel driver in use: ehci-pci
        Kernel driver in use: lpc_ich
        Kernel driver in use: ahci
        Kernel driver in use: i801_smbus
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Pitcairn XT [Radeon HD 7870 GHz Edition]
       [b] Kernel driver in use: radeon[/b]
        Kernel driver in use: snd_hda_intel
        Kernel driver in use: ahci
        Kernel driver in use: firewire_ohci
        Kernel driver in use: xhci_hcd
        Kernel driver in use: xhci_hcd
        Kernel driver in use: r8169
        Kernel driver in use: ahci
I'm not sure what to make of this.

Re: Verify [AMD GPU on Arch Linux]

Posted: Sat May 01, 2021 8:33 am
by ng0177
some similar issues here:

Code: Select all

Description:	EndeavourOS Linux
5.11.16-arch1-1
x11

$ pacman -Q --info cuda
Name            : cuda
Version         : 11.3.0-1

$ pacman -Q --info ocl-icd
Name            : ocl-icd
Version         : 2.2.14-1

AMD: ~ $ pacman -Q --info foldingathome
Name            : foldingathome
Version         : 7.6.21-1
Optional Deps   : clinfo: for folding with an NVIDIA GPU [installed]
                  cuda: for folding with an NVIDIA GPU [installed]
                  ocl-icd: for folding with any GPU [installed]
                  opencl-driver: for folding with any GPU [installed]

/var/log/foldingathome/log.txt
...
18:25:09:       GPUs: 1
18:25:09:      GPU 0: Bus:39 Slot:0 Func:0 NVIDIA:7 GP106 [GeForce GTX 1060 6GB] 4372
18:25:09:       CUDA: Not detected: cuInit() returned 999
18:25:09:     OpenCL: Not detected: clGetPlatformIDs() returned -1001
...
18:35:09:WARNING:FS01:No CUDA or OpenCL 1.2+ support detected for GPU slot 0
1: gpu:39:0 GP106 [GeForce GTX 1060 6GB] 4372.  Disabling.

$ sudo systemctl status foldingathome-nvidia
○ foldingathome-nvidia.service - Folding@home helper for NVIDIA GPUs
     Loaded: loaded (/usr/lib/systemd/system/foldingathome-nvidia.service; disa>
     Active: inactive (dead)

Re: Verify [AMD GPU on Arch Linux]

Posted: Sat May 01, 2021 6:37 pm
by bruce
Please paste the first part of FAH's log in a future post. (See below)

FAH configures GPUs when it starts up.

Have you installed OpenCL?