75XX Project issues (crashes, too many steps etc.)
Moderators: Site Moderators, FAHC Science Team
Re: 75XX Project issues (crashes, too many steps etc.)
So perhaps the best interpretation of this problem is: If you're still on V6, upgrade to V7. It's really quote easy and you'll certainly get better support bu using the improved analytics.
Posting FAH's log:
How to provide enough info to get helpful support.
How to provide enough info to get helpful support.
-
- Posts: 1164
- Joined: Wed Apr 01, 2009 9:22 pm
- Hardware configuration: Asus Z8NA D6C, 2 [email protected] Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)
Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS
Not currently folding
Asus Z9PE- D8 WS, 2 [email protected] Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only) - Location: Jersey, Channel islands
Re: 75XX Project issues (crashes, too many steps etc.)
Sorry, the best interpretation is to fix the issue and let people who want to use 6.34 continue to do so.
I just tried to install v7 on one of my Linux boxes and it was a hopeless waste of 4 hours and a WU, suffice to say that it has been deleted and I have gone back to v6
I just tried to install v7 on one of my Linux boxes and it was a hopeless waste of 4 hours and a WU, suffice to say that it has been deleted and I have gone back to v6
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
- Contact:
Re: 75XX Project issues (crashes, too many steps etc.)
I followed the Linux install guide in Ubuntu and it took me 4 minutes, after I installed debi. What flavor are you running?
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Tell me and I forget. Teach me and I remember. Involve me and I learn.
-
- Posts: 1164
- Joined: Wed Apr 01, 2009 9:22 pm
- Hardware configuration: Asus Z8NA D6C, 2 [email protected] Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)
Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS
Not currently folding
Asus Z9PE- D8 WS, 2 [email protected] Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only) - Location: Jersey, Channel islands
Re: 75XX Project issues (crashes, too many steps etc.)
Ubuntu 12.10 - which I didn't know at the time and realise is part of the issue.
Client installed OK, FAHcontrol was a whole different matter, it didn't install properly, so I reinstalled and it complained about dependancies, followed the wiki steps and it complained about the second set of commands needed, so I reinstalled a 3rd time and this time it wanted to download some updates for gnome, which it couldn't do as its 12.10 not the 12.04 LTS release that I thought it was running.
So moved on to at least monitor and salvage the current WU and webcontrol was only giving 100k PPD as opposed to the 170k I usually get on A3 work, left it to settle for an hour, came back and progress had moved on 25% but PPD was down to 80k, left it another couple of hours and progress was only at 60% so something was slowing everything down, then web control crashed and wouldn't connect to the client, even after rebooting the machine. At this point I vaped everything and went back to v6 and BA units - which I want to move away from, but couldn't because v6 wont connect to a server to get a WU, ARGH!
Client installed OK, FAHcontrol was a whole different matter, it didn't install properly, so I reinstalled and it complained about dependancies, followed the wiki steps and it complained about the second set of commands needed, so I reinstalled a 3rd time and this time it wanted to download some updates for gnome, which it couldn't do as its 12.10 not the 12.04 LTS release that I thought it was running.
So moved on to at least monitor and salvage the current WU and webcontrol was only giving 100k PPD as opposed to the 170k I usually get on A3 work, left it to settle for an hour, came back and progress had moved on 25% but PPD was down to 80k, left it another couple of hours and progress was only at 60% so something was slowing everything down, then web control crashed and wouldn't connect to the client, even after rebooting the machine. At this point I vaped everything and went back to v6 and BA units - which I want to move away from, but couldn't because v6 wont connect to a server to get a WU, ARGH!
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
- Contact:
Re: 75XX Project issues (crashes, too many steps etc.)
Being more of a nix newb, I tend to stick with the LTS versions. Even so, I haven't seen any reports like this before. The client version may have changed, but the FAHCore hasn't, and the core does all the folding. It *should* run at the same speed.
If you feel like experimenting again later, we should look at a Top screen to see what if anything is stealing cycles.
If you feel like experimenting again later, we should look at a Top screen to see what if anything is stealing cycles.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Tell me and I forget. Teach me and I remember. Involve me and I learn.
-
- Posts: 1164
- Joined: Wed Apr 01, 2009 9:22 pm
- Hardware configuration: Asus Z8NA D6C, 2 [email protected] Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)
Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS
Not currently folding
Asus Z9PE- D8 WS, 2 [email protected] Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only) - Location: Jersey, Channel islands
Re: 75XX Project issues (crashes, too many steps etc.)
I'm going to install 12.04 LTS Thursday on my day off, then I can try again and also get the various tweaks that tear and co created installed.
Still would be nice for v6 to work though.......
Still would be nice for v6 to work though.......
-
- Posts: 1003
- Joined: Thu May 02, 2013 8:46 pm
- Hardware configuration: Full Time:
2x NVidia GTX 980
1x NVidia GTX 780 Ti
2x 3GHz Core i5 PC (Linux)
Retired:
3.2GHz Core i5 PC (Linux)
3.2GHz Core i5 iMac
2.8GHz Core i5 iMac
2.16GHz Core 2 Duo iMac
2GHz Core 2 Duo MacBook
1.6GHz Core 2 Duo Acer laptop - Location: Near Oxford, United Kingdom
- Contact:
Re: 75XX Project issues (crashes, too many steps etc.)
I had that trouble with 12.04 and (I think) 14.04- I thought it was just my unfamiliarity with Linux so didn't raise it here.Nathan_P wrote: Client installed OK, FAHcontrol was a whole different matter
In both, FAHContol installed without a hitch if I just right-clicked the file and let the package installer look after it.
I'm using Mint 17.1 now and don't bother with the installation instructions at all, I just double-click the .deb files.
-
- Site Moderator
- Posts: 6349
- Joined: Sun Dec 02, 2007 10:38 am
- Location: Bordeaux, France
- Contact:
Re: 75XX Project issues (crashes, too many steps etc.)
Project: 7521 (Run 0, Clone 43, Gen 263)
[07:30:55] *------------------------------*
[07:30:55] Folding@Home Gromacs GB Core
[07:30:55] Version 2.27 (Dec. 15, 2010)
[07:30:55]
[07:30:55] Preparing to commence simulation
[07:30:55] - Ensuring status. Please wait.
[07:31:05] - Looking at optimizations...
[07:31:05] - Working with standard loops on this execution.
[07:31:05] - Created dyn
[07:31:05] - Files status OK
[07:31:05] - Expanded 2523688 -> 3157328 (decompressed 125.1 percent)
[07:31:05] Called DecompressByteArray: compressed_data_size=2523688 data_size=3157328, decompressed_data_size=3157328 diff=0
[07:31:05] - Digital signature verified
[07:31:05]
[07:31:05] Project: 7521 (Run 0, Clone 43, Gen 263)
[07:31:05]
[07:31:05] Entering M.D.
[07:31:11] CoreStatus = 0 (0)
[07:31:11] Sending work to server
[07:31:11] Project: 7521 (Run 0, Clone 43, Gen 263)
[07:31:11] - Error: Could not get length of results file work/wuresults_00.dat
[07:31:11] - Error: Could not read unit 00 file. Removing from queue.
[07:31:11] Trying to send all finished work units
[07:31:11] + No unsent completed units remaining.
Re: 75XX Project issues (crashes, too many steps etc.)
This WU is still failing immediately after downloading:
7520 (Run 64, Clone 3, Gen 254)
7520 (Run 64, Clone 3, Gen 254)
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
- Contact:
Re: 75XX Project issues (crashes, too many steps etc.)
Most of these failures started as Run 0 problems. But now we're getting higher up in to the R C G numbers and still having the same issues.
This looks more and more like inherently unstable projects than some malformed WUs from a raid crash.
This looks more and more like inherently unstable projects than some malformed WUs from a raid crash.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Tell me and I forget. Teach me and I remember. Involve me and I learn.
7520 (Run 32, Clone 3, Gen 465)
OK, here's another one:
My client downloaded from .97, and attempted to run this one about a dozen times before switching over to .96 and getting an 8828, which is running now.
Is there a way to block the client from going to .97 and using .96 instead, until this all sorts out?
Code: Select all
Reading file work/wudata_00.tpr, VERSION 4.5.5-dev-20120703-fc032f9-dirty (single precision)
[06:14:53] CoreStatus = 0 (0)
[06:14:53] Sending work to server
[06:14:53] Project: 7520 (Run 32, Clone 3, Gen 465)
[06:14:53] - Error: Could not get length of results file work/wuresults_00.dat
[06:14:53] - Error: Could not read unit 00 file. Removing from queue.
[06:14:53] Trying to send all finished work units
[06:14:53] + No unsent completed units remaining.
Is there a way to block the client from going to .97 and using .96 instead, until this all sorts out?
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
- Contact:
Re: 75XX Project issues (crashes, too many steps etc.)
No.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Tell me and I forget. Teach me and I remember. Involve me and I learn.
-
- Posts: 33
- Joined: Sun May 25, 2008 7:40 pm
Re: 75XX Project issues (crashes, too many steps etc.)
I'm about to have to start back on ID one from having done this with these 7520 WU's.7im wrote:You've been running v6 too long to remember to change the Machine ID value in the config after deleting the WU info to force a new WU in Windows. Or to delete the ID .dat file in Linux to affect the same change.
I'm going to start posting the faulty units in this thread. When I started with this problem, there wasn't any info I could find here.
These are bad:
7520 (R 47, C 4, G 282)
7520 (R 59, C 4, G 246)
-
- Site Moderator
- Posts: 6349
- Joined: Sun Dec 02, 2007 10:38 am
- Location: Bordeaux, France
- Contact:
Re: 75XX Project issues (crashes, too many steps etc.)
Two more with are alternately crashing on the same client :
Project: 7516 (Run 0, Clone 181, Gen 0)
Project: 7521 (Run 0, Clone 43, Gen 263)
Project: 7516 (Run 0, Clone 181, Gen 0)
Project: 7521 (Run 0, Clone 43, Gen 263)
Re: 75XX Project issues (crashes, too many steps etc.)
This WU STILL gets downloaded and the client wastes a lot of time retrying before getting a functional WU:
[08:29:26] Project: 7520 (Run 64, Clone 3, Gen 254)
[08:29:26] - Error: Could not get length of results file work/wuresults_06.dat
[08:29:26] - Error: Could not read unit 06 file. Removing from queue.
It needs to be fixed/deleted.
[08:29:26] Project: 7520 (Run 64, Clone 3, Gen 254)
[08:29:26] - Error: Could not get length of results file work/wuresults_06.dat
[08:29:26] - Error: Could not read unit 06 file. Removing from queue.
It needs to be fixed/deleted.