Project: 6801 (Run 8950, Clone 1, Gen 3) EUE @ 1%

Moderators: Site Moderators, FAHC Science Team

CCTCHFUN
Posts: 5
Joined: Mon Apr 25, 2011 11:02 pm

Project: 6801 (Run 8950, Clone 1, Gen 3) EUE @ 1%

Post by CCTCHFUN »

I've been getting this one all day. Has anyone completed this WU? I'm not sure whether this is a bad WU or just my machine. The log is below.

[01:24:54] Project: 6801 (Run 8950, Clone 1, Gen 3)
[01:24:54]
[01:24:54] Assembly optimizations on if available.
[01:24:54] Entering M.D.
[01:24:56] Tpr hash work/wudata_08.tpr: 3607300280 4160975690 466456219 2840422234 798880461
[01:24:56] Working on ALZHEIMER'S DISEASE AMYLOID
[01:24:56] Client config found, loading data.
[01:24:57] Setting checkpoint frequency: 500000
[01:24:57] Setting checkpoint frequency: 500000
[01:24:57] Starting GUI Server
[01:26:15] Completed 500000 out of 50000000 steps (1%).
[01:26:15] mdrun_gpu returned 52
[01:26:15] NANs detected on GPU
[01:26:15]
[01:26:15] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:26:18] CoreStatus = 7A (122)
[01:26:18] Sending work to server
[01:26:18] Project: 6801 (Run 8950, Clone 1, Gen 3)
[01:26:18] - Read packet limit of 540015616... Set to 524286976.
[01:26:18] - Error: Could not get length of results file work/wuresults_08.dat
[01:26:18] - Error: Could not read unit 08 file. Removing from queue.
[01:26:18] EUE limit exceeded. Pausing 24 hours.
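
For anyone who wants to pull the key facts out of a log like the one above before posting, here is a minimal sketch in Python. It is not part of the client; the regular expressions and the `summarize_log` helper are assumptions based only on the log lines quoted in this thread, and other client or core versions may print these lines differently.

```python
import re

# Patterns are assumptions derived from the log lines quoted in this thread.
PRCG_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
STATUS_RE = re.compile(r"CoreStatus = ([0-9A-Fa-f]+) \((\d+)\)")
STEPS_RE = re.compile(r"Completed\s+(\d+) out of (\d+) steps")


def summarize_log(text):
    """Hypothetical helper: collect the WU identity and failure markers."""
    summary = {"prcg": None, "core_status": None, "nans": False, "last_step": None}
    for line in text.splitlines():
        m = PRCG_RE.search(line)
        if m and summary["prcg"] is None:
            # (project, run, clone, gen) as integers
            summary["prcg"] = tuple(int(g) for g in m.groups())
        m = STATUS_RE.search(line)
        if m:
            # The log prints the code in hex first, then decimal in parentheses.
            summary["core_status"] = int(m.group(1), 16)
        m = STEPS_RE.search(line)
        if m:
            summary["last_step"] = (int(m.group(1)), int(m.group(2)))
        if "NANs detected on GPU" in line:
            summary["nans"] = True
    return summary


SAMPLE = """\
[01:24:54] Project: 6801 (Run 8950, Clone 1, Gen 3)
[01:26:15] Completed 500000 out of 50000000 steps (1%).
[01:26:15] NANs detected on GPU
[01:26:18] CoreStatus = 7A (122)
"""

result = summarize_log(SAMPLE)
print(result)
```

Posting the project, run, clone, and gen together with the core status makes it easier for a moderator to look the WU up in the database.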
CCTCHFUN
Posts: 5
Joined: Mon Apr 25, 2011 11:02 pm

Re: Project: 6801 (Run 8950, Clone 1, Gen 3) EUE @ 1%

Post by CCTCHFUN »

Deleting the queue and work folder didn't help either. After changing the client to a different ID number, it picked up a different WU and is folding OK... fingers crossed.
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Project: 6801 (Run 8950, Clone 1, Gen 3) EUE @ 1%

Post by PantherX »

There is a single (failure) report in the WU Database but it doesn't match your Forum username:
Your WU (P6801 R8950 C1 G3) was added to the stats database on 2011-04-12 00:06:34 for 0 points of credit.
I have marked it for a follow-up.
speedy6635
Posts: 1
Joined: Mon May 09, 2011 5:27 am

Re: Project: 6801 (Run 8950, Clone 1, Gen 3) EUE @ 1%

Post by speedy6635 »

I'm getting the same thing here too, with a GTX 470 and a GTX 560 Ti.


Arguments: -oneunit -forcegpu nvidia_fermi -advmethods -gpu 0 
[05:10:41] - Ask before connecting: No
[05:10:41] - User name: Speedy6635 (Team 111065)
[05:10:41] - User ID: 2CBEC193014C91A
[05:10:41] - Machine ID: 3
[05:10:41] 
[05:10:41] Gpu type=3 species=30.
[05:10:41] Work directory not found. Creating...
[05:10:41] Could not open work queue, generating new queue...
[05:10:41] - Preparing to get new work unit...
[05:10:41] Cleaning up work directory
[05:10:41] + Attempting to get work packet
[05:10:41] Passkey found
[05:10:41] Gpu type=3 species=30.
[05:10:41] - Connecting to assignment server
[05:10:42] - Successful: assigned to (171.64.65.64).
[05:10:42] + News From Folding@Home: Welcome to Folding@Home
[05:10:42] Loaded queue successfully.
[05:10:42] Gpu type=3 species=30.
[05:10:42] + Closed connections
[05:10:42] 
[05:10:42] + Processing work unit
[05:10:42] Core required: FahCore_15.exe
[05:10:42] Core found.
[05:10:42] Working on queue slot 01 [May 9 05:10:42 UTC]
[05:10:42] + Working ...
[05:10:42] 
[05:10:42] *------------------------------*
[05:10:42] Folding@Home GPU Core
[05:10:42] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[05:10:42] 
[05:10:42] Build host: SimbiosNvdWin7
[05:10:42] Board Type: NVIDIA/CUDA
[05:10:42] Project: 6801 (Run 8950, Clone 1, Gen 3)
[05:10:42] 
[05:10:42] Assembly optimizations on if available.
[05:10:42] Entering M.D.
[05:10:44] Tpr hash work/wudata_01.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[05:10:44] Working on ALZHEIMER'S DISEASE AMYLOID
[05:10:44] Client config found, loading data.
[05:10:44] Starting GUI Server
[05:10:45] Setting checkpoint frequency: 500000
[05:10:45] Setting checkpoint frequency: 500000
[05:12:05] Completed    500000 out of 50000000 steps (1%).
[05:12:05] mdrun_gpu returned 52
[05:12:05] NANs detected on GPU
[05:12:05] 
[05:12:05] Folding@home Core Shutdown: UNSTABLE_MACHINE
[05:12:07] CoreStatus = 7A (122)
[05:12:07] Sending work to server
[05:12:07] Project: 6801 (Run 8950, Clone 1, Gen 3)
[05:12:07] - Read packet limit of 540015616... Set to 524286976.
[05:12:07] - Error: Could not get length of results file work/wuresults_01.dat
[05:12:07] - Error: Could not read unit 01 file. Removing from queue.
The only way to keep folding is to remove the -advmethods flag.

Mod Edit: Added Code Tags - PantherX
CCTCHFUN
Posts: 5
Joined: Mon Apr 25, 2011 11:02 pm

Re: Project: 6801 (Run 8950, Clone 1, Gen 3) EUE @ 1%

Post by CCTCHFUN »

PantherX wrote:There is a single (failure) report in the WU Database but it doesn't match your Forum username:
Your WU (P6801 R8950 C1 G3) was added to the stats database on 2011-04-12 00:06:34 for 0 points of credit.
I have marked it for a follow-up.
5/7/11 was the first time I got this WU.
Bill1024
Posts: 75
Joined: Mon Jun 30, 2008 2:45 am

Re: Project: 6801 (Run 8950, Clone 1, Gen 3) EUE @ 1%

Post by Bill1024 »

I am getting the same WU and the same result.
I tried deleting the work folder and queue 10 times and keep getting the same WU.
Project: 6801 (Run 8950, Clone 1, Gen 3)
[04:30:03]
[04:30:03] Assembly optimizations on if available.
[04:30:03] Entering M.D.
[04:30:05] Tpr hash work/wudata_01.tpr: 3607300280 4160975690 466456219 2840422234 798880461
[04:30:05] Working on ALZHEIMER'S DISEASE AMYLOID
[04:30:05] Client config found, loading data.
[04:30:05] Starting GUI Server
[04:30:06] Setting checkpoint frequency: 500000
[04:30:06] Setting checkpoint frequency: 500000
[04:32:01] Completed 500000 out of 50000000 steps (1%).
[04:32:02] mdrun_gpu returned 52
[04:32:02] NANs detected on GPU
[04:32:02]
[04:32:02] Folding@home Core Shutdown: UNSTABLE_MACHINE
[04:32:04] CoreStatus = 7A (122)
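
The Tpr hash lines in the three logs posted so far are identical, which suggests everyone was handed the same work unit data. A rough way to check that when comparing logs is sketched below in Python; the `tpr_hash` helper and its regular expression are assumptions based on the lines quoted in this thread, not anything provided by the client.

```python
import re

# Assumed format: "Tpr hash <path>: <space-separated integers>"
TPR_RE = re.compile(r"Tpr hash \S+:\s+((?:\d+\s*)+)")


def tpr_hash(line):
    """Hypothetical helper: return the hash fields as a tuple, or None."""
    m = TPR_RE.search(line)
    return tuple(m.group(1).split()) if m else None


# Tpr hash lines copied from the three logs in this thread.
lines = [
    "[01:24:56] Tpr hash work/wudata_08.tpr: 3607300280 4160975690 466456219 2840422234 798880461",
    "[05:10:44] Tpr hash work/wudata_01.tpr:  3607300280 4160975690 466456219 2840422234 798880461",
    "[04:30:05] Tpr hash work/wudata_01.tpr: 3607300280 4160975690 466456219 2840422234 798880461",
]

hashes = [tpr_hash(line) for line in lines]
same_data = len(set(hashes)) == 1
print("same WU data in all logs:", same_data)
```

The queue slot in the file name differs between machines (08 versus 01), so only the hash fields are compared.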
GreyWhiskers
Posts: 660
Joined: Mon Oct 25, 2010 5:57 am
Hardware configuration: a) Main unit
Sandybridge in HAF922 w/200 mm side fan
--i7 [email protected] GHz
--ASUS P8P67 DeluxeB3
--4GB ADATA 1600 RAM
--750W Corsair PS
--2Seagate Hyb 750&500 GB--WD Caviar Black 1TB
--EVGA 660GTX-Ti FTW - Signature 2 GPU@ 1241 Boost
--MSI GTX560Ti @900MHz
--Win7Home64; FAH V7.3.2; 327.23 drivers

b) 2004 HP a475c desktop, 1 core Pent 4 [email protected] GHz; Mem 2GB;HDD 160 GB;Zotac GT430PCI@900 MHz
WinXP SP3-32 FAH v7.3.6 301.42 drivers - GPU slot only

c) 2005 Toshiba M45-S551 laptop w/2 GB mem, 160GB HDD;Pent M 740 CPU @ 1.73 GHz
WinXP SP3-32 FAH v7.3.6 [Receiving Core A4 work units]
d) 2011 lappy-15.6"-1920x1080;i7-2860QM,2.5;IC Diamond Thermal Compound;GTX 560M 1,536MB u/c@700;16GB-1333MHz RAM;HDD:500GBHyb w/ 4GB SSD;Win7HomePrem64;320.18 drivers FAH 7.4.2ß
Location: Saratoga, California USA

Re: Project: 6801 (Run 8950, Clone 1, Gen 3) EUE @ 1%

Post by GreyWhiskers »

There was a huge thread on the NaNs detected issue that you might want to peruse. I've quoted Bruce's last post on that thread, as edited by PantherX.

Have you tried any of these troubleshooting steps on your hardware or your GPU drivers?

Re: [Please read] NaNs detected on GPU - UNSTABLE_MACHINE er
by bruce » Tue Mar 22, 2011 2:30 pm

I'm closing this topic. It has become a catch-all for several DIFFERENT types of problems, and each one has its own signature even though all may give you the NaNs detected message.

1) It may be a bad WU. The post, above, by "drbricks" is an excellent example of that problem. The same WU failed three times but there's no indication of a problem with other WUs. "HendricksSA" answered that one accurately. I'll copy the information from drbricks' post into a new post in that forum, though an extract from a log would be helpful. (It's not clear if he had an UNSTABLE_MACHINE error or not.)

2) Be sure your software has been upgraded to the latest version. Golden Dragoon's log shows that, and again, HendricksSA suggested the proper next step.

3) There's also the possibility of a hardware problem. GPUs do sometimes fail. They can overheat (particularly the single slot variety which leaves all the heat inside the case). They can be installed in systems which do not provide enough power. All of these options should be considered if you're still seeing UNSTABLE_MACHINE after eliminating (1) and (2).

EDIT By PantherX-> A good place to start troubleshooting is by reading this post -> 15 - Troubleshooting The GPU3 BETA Client
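
The distinction in the quoted post between a bad WU and a hardware problem can be sketched as a simple rule of thumb in Python. This is only an illustration: the `triage` function, the `min_failures` threshold of 2, and the placeholder names in the sample data are assumptions, not an official diagnostic.

```python
from collections import defaultdict


def triage(reports, min_failures=2):
    """Hypothetical rule of thumb.

    reports: iterable of (machine, wu, succeeded) tuples.
    A WU is suspect if it failed on several machines and never succeeded.
    A machine is suspect if it failed on several different WUs.
    """
    wu_fail_machines = defaultdict(set)
    machine_fail_wus = defaultdict(set)
    wu_success = set()
    for machine, wu, ok in reports:
        if ok:
            wu_success.add(wu)
        else:
            wu_fail_machines[wu].add(machine)
            machine_fail_wus[machine].add(wu)
    suspect_wus = sorted(
        wu for wu, machines in wu_fail_machines.items()
        if len(machines) >= min_failures and wu not in wu_success
    )
    suspect_machines = sorted(
        m for m, wus in machine_fail_wus.items() if len(wus) >= min_failures
    )
    return suspect_wus, suspect_machines


# Illustrative data loosely based on this thread; "other-wu" is a placeholder.
reports = [
    ("CCTCHFUN", "P6801 R8950 C1 G3", False),
    ("speedy6635", "P6801 R8950 C1 G3", False),
    ("Bill1024", "P6801 R8950 C1 G3", False),
    ("CCTCHFUN", "other-wu", True),
]

suspect_wus, suspect_machines = triage(reports)
print("suspect WUs:", suspect_wus)
print("suspect machines:", suspect_machines)
```

When one WU fails on several unrelated machines that fold other work without trouble, case (1) is the more likely explanation; a machine that fails across many different WUs points toward cases (2) and (3).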
Bill1024
Posts: 75
Joined: Mon Jun 30, 2008 2:45 am

Re: Project: 6801 (Run 8950, Clone 1, Gen 3) EUE @ 1%

Post by Bill1024 »

I had to run config and say NO to advmethods, then delete the work folder, unit info, etc., to get a new WU.
Now I am back to folding just fine.
Not sure why it would not give me a new WU after trying more than 15 times the normal way.