Client suddenly stopped working

Moderators: Site Moderators, FAHC Science Team

Post Reply
kcramakrishna
Posts: 4
Joined: Wed Apr 15, 2020 12:58 am

Client suddenly stopped working

Post by kcramakrishna »

I have been running FAH for a few weeks now. I run this manually on Manjaro. I sometimes interrupt jobs midway when I need to shutdown my laptop and resume them later. For this, I 'pause' the job and then shut down the client and then the FAHControl. This has worked perfectly all the time.

I fold on the GPU (RTX 1070 Mobile) and have disabled CPU folding.

Yesterday, I interrupted a job at around 90% to shutdown the laptop. This morning, when I tried to restart the job, I get an error:
ERROR:No compute devices matched GPU #0 INTEL:1 [xxx]. You may need to update your graphics drivers.
I have no idea on why my Nvidia card/driver is not recognised. I tried setting GPU to 2 in config.xml but that did not work and I reverted to original config.xml

I did some googling and someone suggested doing a 'sudo clinfo'. Since this is my first post, I am not allowed to post URLs since this is my first post here but this was on the archlinux bbs

Code: Select all

bbs.archlinux.org slash viewtopic.php?id=254001
I did this too but there is no improvement.

Noticed something which I did not understand. See the message "NULL platform behavior" in the output below:

Code: Select all

[kc@kc-manjaro configs]$ sudo clinfo
[sudo] password for kc: 
Number of platforms                               1
  Platform Name                                   NVIDIA CUDA
  Platform Vendor                                 NVIDIA Corporation
  Platform Version                                OpenCL 1.2 CUDA 10.2.159
  Platform Profile                                FULL_PROFILE
  Platform Extensions                             cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Platform Extensions function suffix             NV

  Platform Name                                   NVIDIA CUDA
Number of devices                                 1
  Device Name                                     GeForce RTX 2070
  Device Vendor                                   NVIDIA Corporation
  Device Vendor ID                                0x10de
  Device Version                                  OpenCL 1.2 CUDA
  Driver Version                                  440.82
  Device OpenCL C Version                         OpenCL C 1.2 
  Device Type                                     GPU
  Device Topology (NV)                            PCI-E, 01:00.0
  Device Profile                                  FULL_PROFILE
  Device Available                                Yes
  Compiler Available                              Yes
  Linker Available                                Yes
  Max compute units                               36
  Max clock frequency                             1440MHz
  Compute Capability (NV)                         7.5
  Device Partition                                (core)
    Max number of sub-devices                     1
    Supported partition types                     None
    Supported affinity domains                    (n/a)
  Max work item dimensions                        3
  Max work item sizes                             1024x1024x64
  Max work group size                             1024
  Preferred work group size multiple              32
  Warp size (NV)                                  32
  Preferred / native vector sizes                 
    char                                                 1 / 1       
    short                                                1 / 1       
    int                                                  1 / 1       
    long                                                 1 / 1       
    half                                                 0 / 0        (n/a)
    float                                                1 / 1       
    double                                               1 / 1        (cl_khr_fp64)
  Half-precision Floating-point support           (n/a)
  Single-precision Floating-point support         (core)
    Denormals                                     Yes
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 Yes
    Round to infinity                             Yes
    IEEE754-2008 fused multiply-add               Yes
    Support is emulated in software               No
    Correctly-rounded divide and sqrt operations  Yes
  Double-precision Floating-point support         (cl_khr_fp64)
    Denormals                                     Yes
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 Yes
    Round to infinity                             Yes
    IEEE754-2008 fused multiply-add               Yes
    Support is emulated in software               No
  Address bits                                    64, Little-Endian
  Global memory size                              8361279488 (7.787GiB)
  Error Correction support                        No
  Max memory allocation                           2090319872 (1.947GiB)
  Unified memory for Host and Device              No
  Integrated memory (NV)                          No
  Minimum alignment for any data type             128 bytes
  Alignment of base address                       4096 bits (512 bytes)
  Global Memory cache type                        Read/Write
  Global Memory cache size                        1179648 (1.125MiB)
  Global Memory cache line size                   128 bytes
  Image support                                   Yes
    Max number of samplers per kernel             32
    Max size for 1D images from buffer            268435456 pixels
    Max 1D or 2D image array size                 2048 images
    Max 2D image size                             32768x32768 pixels
    Max 3D image size                             16384x16384x16384 pixels
    Max number of read image args                 256
    Max number of write image args                32
  Local memory type                               Local
  Local memory size                               49152 (48KiB)
  Registers per block (NV)                        65536
  Max number of constant args                     9
  Max constant buffer size                        65536 (64KiB)
  Max size of kernel argument                     4352 (4.25KiB)
  Queue properties                                
    Out-of-order execution                        Yes
    Profiling                                     Yes
  Prefer user sync for interop                    No
  Profiling timer resolution                      1000ns
  Execution capabilities                          
    Run OpenCL kernels                            Yes
    Run native kernels                            No
    Kernel execution timeout (NV)                 Yes
  Concurrent copy and kernel execution (NV)       Yes
    Number of async copy engines                  3
  printf() buffer size                            1048576 (1024KiB)
  Built-in kernels                                (n/a)
  Device Extensions                               cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics

NULL platform behavior
  clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...)  NVIDIA CUDA
  clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...)   Success [NV]
  clCreateContext(NULL, ...) [default]            Success [NV]
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT)  No platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU)  No platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR)  No devices found in platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM)  Invalid device type for platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL)  No platform

ICD loader properties
  ICD loader Name                                 OpenCL ICD Loader
  ICD loader Vendor                               OCL Icd free software
  ICD loader Version                              2.2.12
  ICD loader Profile                              OpenCL 2.2
I don't remember updating any drivers or packages on my distro yesterday. Any ideas on what changed or how to fix this ?
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Client suddenly stopped working

Post by bruce »

Please post segments of FAH's log. We need the first couple pages of the log where FAH detects the hardware and sets it up plus segments showing applicable errors. What messages are you getting and what WUs were have you seen?

See instructions below.
kcramakrishna
Posts: 4
Joined: Wed Apr 15, 2020 12:58 am

Re: Client suddenly stopped working

Post by kcramakrishna »

Apologies for the delay. Here are some relevant details:

Here is example of the working config and log file:

Code: Select all

*********************** Log Started 2020-04-13T10:44:52Z ***********************
10:44:52:************************* Folding@home Client *************************
10:44:52:        Website: https://foldingathome.org/
10:44:52:      Copyright: (c) 2009-2018 foldingathome.org
10:44:52:         Author: Joseph Coffland <[email protected]>
10:44:52:           Args: 
10:44:52:         Config: /home/kc/FAHClient/config.xml
10:44:52:******************************** Build ********************************
10:44:52:        Version: 7.5.1
10:44:52:           Date: May 11 2018
10:44:52:           Time: 19:59:04
10:44:52:     Repository: Git
10:44:52:       Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
10:44:52:         Branch: master
10:44:52:       Compiler: GNU 6.3.0 20170516
10:44:52:        Options: -std=gnu++98 -O3 -funroll-loops
10:44:52:       Platform: linux2 4.14.0-3-amd64
10:44:52:           Bits: 64
10:44:52:           Mode: Release
10:44:52:******************************* System ********************************
10:44:52:            CPU: Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz
10:44:52:         CPU ID: GenuineIntel Family 6 Model 158 Stepping 12
10:44:52:           CPUs: 16
10:44:52:         Memory: 31.17GiB
10:44:52:    Free Memory: 28.88GiB
10:44:52:        Threads: POSIX_THREADS
10:44:52:     OS Version: 5.6
10:44:52:    Has Battery: true
10:44:52:     On Battery: false
10:44:52:     UTC Offset: 5
10:44:52:            PID: 2251
10:44:52:            CWD: /home/kc/FAHClient
10:44:52:             OS: Linux 5.6.3-2-MANJARO x86_64
10:44:52:        OS Arch: AMD64
10:44:52:           GPUs: 1
10:44:52:          GPU 0: Bus:1 Slot:0 Func:0 NVIDIA:7 TU106M [GeForce RTX 2070 Mobile]
10:44:52:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:7.5 Driver:10.2
10:44:52:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:440.82
10:44:52:***********************************************************************
10:44:52:Trying to access database...
10:44:52:Enabled folding slot 01: READY gpu:0:TU106M [GeForce RTX 2070 Mobile]
10:50:12:WU00:FS01:Assigned to work server 140.163.4.241
10:50:12:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:TU106M [GeForce RTX 2070 Mobile] from 140.163.4.241
10:50:23:WU00:FS01:Downloading 7.92MiB
10:50:29:WU00:FS01:Download 18.93%
10:50:37:WU00:FS01:Download 24.45%

Here is the log file for the config which is Not working:

Code: Select all

*********************** Log Started 2020-04-15T00:25:22Z ***********************
00:25:22:************************* Folding@home Client *************************
00:25:22:        Website: https://foldingathome.org/
00:25:22:      Copyright: (c) 2009-2018 foldingathome.org
00:25:22:         Author: Joseph Coffland <[email protected]>
00:25:22:           Args: 
00:25:22:         Config: /home/kc/FAHClient/config.xml
00:25:22:******************************** Build ********************************
00:25:22:        Version: 7.5.1
00:25:22:           Date: May 11 2018
00:25:22:           Time: 19:59:04
00:25:22:     Repository: Git
00:25:22:       Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
00:25:22:         Branch: master
00:25:22:       Compiler: GNU 6.3.0 20170516
00:25:22:        Options: -std=gnu++98 -O3 -funroll-loops
00:25:22:       Platform: linux2 4.14.0-3-amd64
00:25:22:           Bits: 64
00:25:22:           Mode: Release
00:25:22:******************************* System ********************************
00:25:22:            CPU: Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz
00:25:22:         CPU ID: GenuineIntel Family 6 Model 158 Stepping 12
00:25:22:           CPUs: 16
00:25:22:         Memory: 31.17GiB
00:25:22:    Free Memory: 29.15GiB
00:25:22:        Threads: POSIX_THREADS
00:25:22:     OS Version: 5.6
00:25:22:    Has Battery: true
00:25:22:     On Battery: false
00:25:22:     UTC Offset: 5
00:25:22:            PID: 2191
00:25:22:            CWD: /home/kc/FAHClient
00:25:22:             OS: Linux 5.6.3-2-MANJARO x86_64
00:25:22:        OS Arch: AMD64
00:25:22:           GPUs: 2
00:25:22:          GPU 0: Bus:0 Slot:2 Func:0 INTEL:1 [xxx]
00:25:22:          GPU 1: Bus:1 Slot:0 Func:0 NVIDIA:7 TU106M [GeForce RTX 2070 Mobile]
00:25:22:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:7.5 Driver:10.2
00:25:22:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:440.82
00:25:22:***********************************************************************
00:25:22:Trying to access database...
ESC[91m00:25:22:ERROR:No compute devices matched GPU #0 INTEL:1 [xxx].  You may need to update your graphics drivers.ESC[0m
ESC[93m00:25:22:WARNING:WU00:No longer matches Slot 1's configuration and there are no other matching slots, dumpingESC[0m
00:25:22:WU00:FS01:Sending unit results: id:00 state:SEND error:DUMPED project:11743 run:0 clone:3406 gen:56 core:0x22 unit:0x000000528ca304f15e69d6a393f8dce6
00:25:23:WU00:FS01:Server responded WORK_ACK (400)
00:25:23:WU00:FS01:Cleaning up

As you can see, there is a new intel device detected and it is trying to use this device. I tried setting gpu-Id to '1' instead of '0' but that did not help.

I tried configuring again:

Code: Select all

[kc@kc-manjaro FAHClient]$ FAHClient --configure
13:43:23:INFO(1):Read GPUs.txt
User name [Anonymous]: kcramakrishna
Team number [0]: 224497
Passkey: 8xxxxxxxxxxxxxx1
Enable SMP [true]: 
Enable GPU [true]: true
Name of configuration file [config.xml]: 
Overwrite 'config.xml'? [y/N] [false]: y
This did not detect nor configure any GPU.

Code: Select all

[kc@kc-manjaro FAHClient]$ cat config.xml 
<config>
  <!-- User Information -->
  <passkey v='8xxxxx1'/>
  <team v='224497'/>
  <user v='kcramakrishna'/>

  <!-- Folding Slots -->
  <slot id='0' type='CPU'/>
</config>
I am running hybrid driver. Let me check this:

Code: Select all

[kc@kc-manjaro FAHClient]$ sudo mhwd -a pci nonfree 0300
[sudo] password for kc: 
> Using config 'video-nvidia-440xx' for device: 0000:01:00.0 (0300:10de:1f50) Display controller nVidia Corporation TU106M [GeForce RTX 2070 Mobile]
Error: config 'video-nvidia-440xx' conflicts with config(s): video-hybrid-intel-nvidia-440xx-prime
Looks like this conflict may be an issue.

Let me remove the hybrid driver:

Code: Select all

[kc@kc-manjaro FAHClient]$ sudo mhwd -r pci video-intel-nvidia-440xx-prime
Error: config 'video-intel-nvidia-440xx-prime' is not installed!
[kc@kc-manjaro FAHClient]$ mhwd -li
> Installed PCI configs:
--------------------------------------------------------------------------------
                  NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
           video-linux            2018.05.04                true            PCI
video-hybrid-intel-nvidia-440xx-prime            2019.10.25               false            PCI


Warning: No installed USB configs!

[kc@kc-manjaro FAHClient]$ sudo mhwd -r pci video-hybrid-intel-nvidia-440xx-prime 
[sudo] password for kc: 
> Removing video-hybrid-intel-nvidia-440xx-prime...
Sourcing /etc/mhwd-x86_64.conf
Has lib32 support: true
Sourcing /var/lib/mhwd/local/pci/video-hybrid-intel-nvidia-440xx-prime/MHWDCONFIG
Processing classid: 0300
Sourcing /var/lib/mhwd/scripts/include/0300
checking dependencies...
error: failed to prepare transaction (could not satisfy dependencies)
:: removing nvidia-440xx-utils breaks dependency 'nvidia-utils' required by nvidia-system-monitor-qt
Error: pacman failed!
Error: script failed!
Realized that I don't seem to have 2 conflicting drivers but then why am I getting this error? I am of course posting to Manjaro forums about this but in case that is not resolved, can I still run FAH ?
(I updated to latest Linux 5.6 Kernel on Manjaro but that was on 11Apr and I have been folding successfully till about 24 hrs back. System is shut down every night.)

Please let me know if you need any more info.
kcramakrishna
Posts: 4
Joined: Wed Apr 15, 2020 12:58 am

Re: Client suddenly stopped working

Post by kcramakrishna »

I removed the driver conflict. Removed the hybrid driver and installed the regular driver:

Code: Select all

[kc@kc-manjaro FAHClient]$ sudo mhwd -r pci video-hybrid-intel-nvidia-440xx-prime  nvidia-440xx-utils
Error: config 'nvidia-440xx-utils' is not installed!
[kc@kc-manjaro FAHClient]$ sudo mhwd -i pci video-nvidia-440xx
!! Rebooted system and:

Code: Select all

[kc@kc-manjaro FAHClient]$ sudo mhwd -a pci nonfree 0300
[sudo] password for kc: 
> Skipping already installed config 'video-nvidia-440xx' for device: 0000:01:00.0 (0300:10de:1f50) Display controller nVidia Corporation TU106M [GeForce RTX 2070 Mobile]
No more conflicts.

Ran FAHClient --configure again and tried running the client. No improvement.

Code: Select all

15:14:25:******************************* System ********************************
15:14:25:            CPU: Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz
15:14:25:         CPU ID: GenuineIntel Family 6 Model 158 Stepping 12
15:14:25:           CPUs: 16
15:14:25:         Memory: 31.17GiB
15:14:25:    Free Memory: 25.53GiB
15:14:25:        Threads: POSIX_THREADS
15:14:25:     OS Version: 5.6
15:14:25:    Has Battery: true
15:14:25:     On Battery: false
15:14:25:     UTC Offset: 5
15:14:25:            PID: 3848
15:14:25:            CWD: /home/kc/FAHClient
15:14:25:             OS: Linux 5.6.3-2-MANJARO x86_64
15:14:25:        OS Arch: AMD64
15:14:25:           GPUs: 2
15:14:25:          GPU 0: Bus:0 Slot:2 Func:0 INTEL:1 [xxx]
15:14:25:          GPU 1: Bus:1 Slot:0 Func:0 NVIDIA:7 TU106M [GeForce RTX 2070 Mobile]
15:14:25:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:7.5 Driver:10.2
15:14:25:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:440.82
15:14:25:***********************************************************************
15:14:25:<config>
15:14:25:  <!-- User Information -->
15:14:25:  <passkey v='********************************'/>
15:14:25:  <team v='224497'/>
15:14:25:  <user v='kcramakrishna'/>
15:14:25:
15:14:25:  <!-- Folding Slots -->
15:14:25:</config>
15:14:25:Trying to access database...
15:14:25:Successfully acquired database lock
15:14:25:Enabled folding slot 00: READY cpu:7
GPU is not showing up in the FAHControl. So unable to fold on GPU.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Client suddenly stopped working

Post by bruce »

Computers with hybrid devices can present a real challenge unless you go into the BIOS and disable that function.

FAHClient 7.5.1 does not support folding on Intel GPUs. but that doesn't mean you don't have one. FAH needs to be able to detect your GPUs in a consistent manner.
According to this log, the INTEL GPU is GPU 0 and the NVIDIA GPU is GPU 1.
GPU 0: Bus:0 Slot:2 Func:0 INTEL:1 [xxx] GPU 1: Bus:1 Slot:0 Func:0 NVIDIA:7 TU106M [GeForce RTX 2070 Mobile]
CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:7.5 Driver:10.2
OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:440.82

then your client indicates that SOMETHING HAS CHANGED.
ESC[91m00:25:22:ERROR:No compute devices matched GPU #0 INTEL:1 [xxx]. You may need to update your graphics drivers.
ESC[93m00:25:22:WARNING:WU00:No longer matches Slot 1's configuration and there are no other matching slots, dumping
The GPU that Slot 1 had prevously been configured for was GPU 0 when, in fact, it must have been what is not GPU 1.

You can reconfigure to fix this, but only if you'll always have the NVidia GPU recognized to be at GPU N for a particular N.
kcramakrishna
Posts: 4
Joined: Wed Apr 15, 2020 12:58 am

[SOLVED] Re: Client suddenly stopped working

Post by kcramakrishna »

Thanks for the info. I was swamped with work and could not get time to sort this out. I was folding on Windows (on the same hardware) for the past few days. Booted back into Manjaro today and tried to get things working. I succeeded in getting things working. Below is what I did:

(My initial problem was that it stopped working overnight without any changes to either my system or FAH settings. )

I updated my Manjaro system (which also updated my FAHControl and viewer.)
I tried configuring again:
[kc@kc-manjaro FAHClient]$ FAHClient --configure
This detected but still did not configure my GPUs.

Now, I opened FAHControl -> Configure -> Slots -> Add -> (radio button) GPU. I clicked on "Save" and presto, I see my Intel GPU.
I again went FAHControl -> Configure -> Slots -> Add -> (radio button) GPU and this time it added my nvidia GPU too.

I paused my CPU and Intel GPUs to prevent them running any jobs.

My nvidia GPU downloaded the job but is not running it.

Code: Select all

13:53:26:WU02:FS02:Download complete
13:53:26:WU02:FS02:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:14415 run:0 clone:1170 gen:4 core:0x22 unit:0x0000000c0d5262775e839e598e491b4a
13:53:26:WU02:FS02:Starting
13:53:26:ERROR:WU02:FS02:Failed to start core: OpenCL device matching slot 2 not found, try setting 'opencl-index' manually                                                                                                       
13:53:27:WU02:FS02:Starting
13:53:27:ERROR:WU02:FS02:Failed to start core: OpenCL device matching slot 2 not found, try setting 'opencl-index' manually                                                                                                       
^C13:54:02:Caught signal SIGINT(2) on PID 37278
13:54:02:Exiting, please wait. . .
13:54:04:Clean exit
[kc@kc-manjaro FAHClient]$ sudo clinfo
[sudo] password for kc: 
Number of platforms                               0
I tried running a job on my intel GPU and here is what it said

Code: Select all

13:54:57:WU01:FS01:Download complete
13:54:57:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:13860 run:0 clone:6588 gen:88 core:0xa7 unit:0x000000610d5a98395e731382dc1e09df
13:55:10:FS01:Unpaused
13:55:10:WU01:FS01:Starting
13:55:10:ERROR:WU01:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually                                                                                                       
13:55:11:WU01:FS01:Starting
13:55:11:ERROR:WU01:FS01:Failed to start core: OpenCL device matching slot 1 not found, try setting 'opencl-index' manually                                                                                                       
13:55:14:Removing old file 'configs/config-20200415-003354.xml'
13:55:14:Saving configuration to config.xml
13:55:14:<config>
13:55:14:  <!-- Folding Core -->
13:55:14:  <checkpoint v='5'/>
13:55:14:
13:55:14:  <!-- Network -->
13:55:14:  <proxy v=':8080'/>
13:55:14:
13:55:14:  <!-- Slot Control -->
13:55:14:  <power v='full'/>
13:55:14:
13:55:14:  <!-- User Information -->
13:55:14:  <passkey v='********************************'/>
13:55:14:  <team v='224497'/>
13:55:14:  <user v='kcramakrishna'/>
13:55:14:
13:55:14:  <!-- Folding Slots -->
13:55:14:  <slot id='0' type='CPU'>
13:55:14:    <paused v='true'/>
13:55:14:  </slot>
13:55:14:  <slot id='1' type='GPU'/>
13:55:14:  <slot id='2' type='GPU'>
13:55:14:    <paused v='true'/>
13:55:14:  </slot>
13:55:14:</config>
13:55:19:FS02:Unpaused
13:55:19:WU02:FS02:Starting
13:55:19:ERROR:WU02:FS02:Failed to start core: OpenCL device matching slot 2 not found, try setting 'opencl-index' manually                                                                                                       
13:55:25:FS01:Paused
13:55:25:FS02:Paused
13:56:15:Removing old file 'configs/config-20200415-014753.xml'
13:56:15:Saving configuration to config.xml
13:56:15:<config>
13:56:15:  <!-- Folding Core -->
13:56:15:  <checkpoint v='5'/>
13:56:15:
13:56:15:  <!-- Network -->
13:56:15:  <proxy v=':8080'/>
13:56:15:
13:56:15:  <!-- Slot Control -->
13:56:15:  <power v='full'/>
13:56:15:
13:56:15:  <!-- User Information -->
13:56:15:  <passkey v='********************************'/>
13:56:15:  <team v='224497'/>
13:56:15:  <user v='kcramakrishna'/>
13:56:15:
13:56:15:  <!-- Folding Slots -->
13:56:15:  <slot id='0' type='CPU'>
13:56:15:    <paused v='true'/>
13:56:15:  </slot>
13:56:15:  <slot id='1' type='GPU'>
13:56:15:    <paused v='true'/>
13:56:15:  </slot>
13:56:15:  <slot id='2' type='GPU'>
13:56:15:    <paused v='true'/>
13:56:15:  </slot>
13:56:15:</config>
I rebooted the system and tried folding again. Now both my CPU and nvidia GPU are folding well !!

I am getting the error for my Intel GPU but I have deleted this from my configuration to avoid FAH attempting to use it.

Code: Select all

14:11:46:Saving configuration to config.xml
14:11:46:<config>
14:11:46:  <!-- Folding Core -->
14:11:46:  <checkpoint v='5'/>
14:11:46:
14:11:46:  <!-- Network -->
14:11:46:  <proxy v=':8080'/>
14:11:46:
14:11:46:  <!-- Slot Control -->
14:11:46:  <power v='full'/>
14:11:46:
14:11:46:  <!-- User Information -->
14:11:46:  <passkey v='********************************'/>
14:11:46:  <team v='224497'/>
14:11:46:  <user v='kcramakrishna'/>
14:11:46:
14:11:46:  <!-- Folding Slots -->
14:11:46:  <slot id='0' type='CPU'/>
14:11:46:  <slot id='2' type='GPU'/>
14:11:46:</config>
On deleting the intel GPU, somehow the configuration was screwed up and FAH was again trying to use the intel GPU and nvidia GPU was not showing up (again).

Again followed
FAHControl -> Configure -> Slots -> Add -> (radio button) GPU to add nvidia GPU.

Now paused folding on intel GPU (but did not delete it.)

Started folding again on nvidia. I think we should not delete the intel GPU from the config. Just put the intel GPU on 'Pause' permanently.

So right now everything is working well on my Manjaro installation. If anyone has an idea on why 'deleting' the intel GPU messes up the system, please let me know.
ajm
Posts: 750
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: Client suddenly stopped working

Post by ajm »

I would search on the manufacturer's website if there's an option in the bios (maybe a new one) to properly deactivate the Intel display adaptor.
Then there is a tool that might do the trick: https://help.ubuntu.com/community/HybridGraphics
And this possibly too: https://wiki.archlinux.org/index.php/NV ... phics_only

And someone claims to have solved it with bumblebee here: https://askubuntu.com/questions/477765/ ... screte-gpu
It was fixed by installing nVidia driver version 331:

sudo apt-get purge nvidia*
sudo apt-get install nvidia-331-updates-dev

Then installing bumblebee per the instructions here:

sudo apt-get install bumblebee bumblebee-nvidia primus

After a reboot, I was able to use my nVidia gpu for specific programs by using the 'optirun' command. :)
EDIT: I see that it works now, so my post is a bit deprecated. But glad you had it sorted!
Post Reply