Having more GPU issues; ways to test configurations?

RiaSkies
Posts: 16
Joined: Sun Apr 05, 2020 1:02 am
Hardware configuration: CPU: Ryzen 7 1700 @ 3.2 GHz all core
GPU: Sapphire Pulse RX 5600 XT

Post by RiaSkies »

I've been trying to get my GPU to fold successfully, but I'm just not having any luck with stability. The CPU is folding fine, but the GPU keeps spitting out errors. Usually it's a 'particle coordinate is NaN' error, though occasionally I get 'clCreateCommandQueue (-6)' errors. Sometimes the (F@H) core retries from a saved state; sometimes it just gives up and shuts the core down. From what I have seen, this often indicates some sort of OC issue. However, even after using software to downclock the GPU and VRAM to reference base speed, setting a very aggressive fan curve, slightly increasing the voltage for extra stability, and lowering the power target to near the minimum allowed (-45% or so), I am still unable to fold successfully.

I'm currently checking system memory. There have been no issues with CPU folding, and none on my old GPU, so I am not sure what the source of the instability is. This has happened on enough WUs that the issue is definitely with this graphics card.

I'd like to keep tweaking settings, but I don't want to test my GPU on live WUs until I have some baseline level of stability, so that the science isn't hindered by potentially faulty hardware. Is there any way to get old, already-run, known-good dummy WUs to verify stability as I continue to tweak settings (or decide that the card is just faulty and needs replacing)?
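
While comparing settings, it may help to count how often each failure shows up per configuration. The following is only a rough sketch: it searches log lines for the two error messages quoted above. The function names and the log path argument are hypothetical, and the real log layout of your client may differ. For reference, in the OpenCL headers the code -6 is CL_OUT_OF_HOST_MEMORY.

```python
from collections import Counter

# Substrings taken from the error messages quoted in the post above.
ERROR_PATTERNS = (
    "particle coordinate is NaN",
    "clCreateCommandQueue (-6)",
)


def tally_errors(log_lines):
    """Count how many lines contain each known error substring."""
    counts = Counter({pattern: 0 for pattern in ERROR_PATTERNS})
    for line in log_lines:
        for pattern in ERROR_PATTERNS:
            if pattern in line:
                counts[pattern] += 1
    return dict(counts)


def tally_log_file(path):
    """Tally errors in a log file; `path` is wherever your client writes its log (assumption)."""
    with open(path, encoding="utf-8", errors="replace") as handle:
        return tally_errors(handle)
```

Running `tally_log_file` once per clock/voltage setting and noting the counts gives a crude but repeatable way to compare configurations.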
toTOW
Site Moderator
Posts: 6349
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Post by toTOW »

You should try FAHBench : https://fahbench.github.io/ ;)

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
RiaSkies

Post by RiaSkies »

toTOW wrote:You should try FAHBench : https://fahbench.github.io/ ;)
Will this work on a 5600 XT? I understand that Core 21 doesn't work with Navi cards for architectural reasons.
toTOW

Post by toTOW »

Good point. The current FAHBench version is still based on the latest Core 21 revisions.
Sven
Posts: 71
Joined: Fri Nov 01, 2013 8:12 pm

Post by Sven »

What power supply are you using?

Not only the wattage but also the brand is important.
RiaSkies

Post by RiaSkies »

500 W EVGA power supply.

Given that the CPU folds fine and hasn't had any issue, it could be an issue with the 6+2 pin connector, but my old GTX 970 using 2x 6 pin connectors was folding fine as well. So I do not believe the PSU is the likely culprit.
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Post by PantherX »

EVGA is a good brand. What's its efficiency rating? What's your CPU model? Do you have additional fans, multiple HDDs, etc.? In theory, 500 watts should suffice as long as the PSU isn't damaged. How old is it?
RiaSkies

Post by RiaSkies »

PantherX wrote:EVGA is a good brand. What's its efficiency rating? What's your CPU model? Do you have additional fans, multiple HDDs, etc.? In theory, 500 watts should suffice as long as the PSU isn't damaged. How old is it?
80+ (White); Ryzen 7 1700 @ 3.2 GHz; one CPU fan and one case fan; 2 HDDs and 1 M.2 SATA SSD; about 1.5 years old.
HaloJones
Posts: 906
Joined: Thu Jul 24, 2008 10:16 am

Post by HaloJones »

Try turning off the CPU folding slot and see if the GPU can cope. If it still can't, you've probably ruled out the PSU. If it can, that strongly suggests the PSU may not be enough.
single 1070

jrweiss
Posts: 704
Joined: Tue Dec 04, 2007 6:56 am
Hardware configuration: Ryzen 7 5700G, 22.40.46 VGA driver; 32GB G-Skill Trident DDR4-3200; Samsung 860EVO 1TB Boot SSD; VelociRaptor 1TB; MSI GTX 1050ti, 551.23 studio driver; BeQuiet FM 550 PSU; Lian Li PC-9F; Win11Pro-64, F@H 8.3.5.

[Suspended] Ryzen 7 3700X, MSI X570MPG, 32GB G-Skill Trident Z DDR4-3600; Corsair MP600 M.2 PCIe Gen4 Boot, Samsung 840EVO-250 SSDs; VelociRaptor 1TB, Raptor 150; MSI GTX 1050ti, 526.98 driver; Kingwin Stryker 500 PSU; Lian Li PC-K7B. Win10Pro-64, F@H 8.3.5.
Location: @Home
Contact:

Post by jrweiss »

A good 500W PSU should have plenty of power, and that one has a single 40A 12V rail.

However, you may want to check the voltage stability under high load. Some GPUs are very sensitive to voltage fluctuations, and a PSU that just meets the +/- 5% (0.6V) spec may not be good enough. See if HWMonitor (from CPUID) shows any fluctuations on the 12V rail over time. Mine shows +/- 0.8% (0.096V)...
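
The percentages above are just deviation from the 12 V nominal, so they are easy to compute from a list of sensor readings. A minimal sketch, assuming you have copied some 12 V readings out of a monitoring tool by hand (the function names are made up for illustration, and software sensor readings are only approximate):

```python
NOMINAL_12V = 12.0


def rail_deviation(samples, nominal=NOMINAL_12V):
    """Return (worst_volts, worst_percent): largest absolute deviation from nominal."""
    if not samples:
        raise ValueError("need at least one voltage sample")
    worst = max(abs(v - nominal) for v in samples)
    return worst, 100.0 * worst / nominal


def within_tolerance(samples, tolerance_percent=5.0, nominal=NOMINAL_12V):
    """True if every sample stays inside +/- tolerance_percent of nominal."""
    _, percent = rail_deviation(samples, nominal)
    return percent <= tolerance_percent
```

For example, readings between 11.904 V and 12.096 V work out to about 0.096 V or 0.8%, matching the figure quoted above, while a dip to 11.3 V would be roughly 5.8% and outside the 5% spec.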
Ryzen 7 5700G, 22.40.46 VGA driver; MSI GTX 1050ti, 551.23 studio driver
Ryzen 7 3700X; MSI GTX 1050ti, 551.23 studio driver [Suspended]
RiaSkies

Post by RiaSkies »

Now that there's a new FAHBench on Core 22, I was able to do more GPU testing; still getting NaN errors even with stock configuration, downclocked to factory specs, and with extra fan speed, all while no CPU core is running.
Sven

Post by Sven »

The original 80+ power supplies (without Bronze, Gold, Platinum, etc.) are mostly ATX v2.3 or lower and aren't ready for newer graphics cards because of the strong power fluctuations the new chips produce. The continuous load wouldn't be a problem, but the short spikes can cause brief voltage drops, which can crash the FAH cores.

I would test with a modern power supply from a quality vendor.
jrweiss

Post by jrweiss »

BTW, I just read the ATX12V v2.0 PSU Design Guide, and it allows +/-10% voltage variation on a second 12V rail at peak loading. It also allows 200mV Noise/Ripple on 12V2, but only 120mV on 12V1.

So, you may also have to look at your PSU wiring diagram and ensure your GPU is fully powered by +12V1DC if you have 2 rails. Just another reason to look at manufacturers' specs and trustworthy reviews before buying...
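
To make the per-rail difference concrete, here is a small sketch that compares a peak-to-peak spread against the ripple limits quoted above. The limits are the figures from this post, not re-checked against the guide, and the names are hypothetical. Real ripple has to be measured with an oscilloscope; software sensors poll far too slowly, so treat this as arithmetic only.

```python
# Ripple limits in millivolts, as quoted in the post above (not re-verified here).
RIPPLE_LIMIT_MV = {"12V1": 120.0, "12V2": 200.0}


def ripple_mv(samples_volts):
    """Peak-to-peak spread of the samples, in millivolts."""
    return (max(samples_volts) - min(samples_volts)) * 1000.0


def rail_ok(rail, samples_volts):
    """True if the peak-to-peak spread is within the quoted limit for that rail."""
    return ripple_mv(samples_volts) <= RIPPLE_LIMIT_MV[rail]
```

With samples spanning 11.99 V to 12.15 V the spread is about 160 mV, which would pass the quoted 12V2 limit but fail the tighter 12V1 limit.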