Is GPU folding more valuable than CPU folding scientifically
Moderator: Site Moderators
Forum rules
Please read the forum rules before posting.
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengeance (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
Re: Is GPU folding more valuable than CPU folding scientific
Which fah project number would that be?
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
-
- Posts: 2948
- Joined: Sun Dec 02, 2007 4:36 am
- Hardware configuration: Machine #1:
Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).
Machine #2:
Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.
Machine 3:
Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32
I am currently folding just on the 5x GTX 460's for approx. 70K PPD
- Location: Salem, OR USA
Re: Is GPU folding more valuable than CPU folding scientific
Grandpa_01 wrote: More like triple; the projects I run that use AVX generally run about 3 times as fast as when AVX is not used on the same project.
I didn't know that there were any folding projects/cores using AVX. I think that the newest GROMACS version is now AVX capable, but I didn't know that any of the folding cores had been converted to use it. The NaCl core is by far the newest GROMACS core and I don't know if it even uses the AVX capabilities. Perhaps you are using the term "projects" in the more general (non-folding) definition.
-
- Posts: 1122
- Joined: Wed Mar 04, 2009 7:36 am
- Hardware configuration: 3 - Supermicro H8QGi-F AMD MC 6174=144 cores 2.5Ghz, 96GB G.Skill DDR3 1333Mhz Ubuntu 10.10
2 - Asus P6X58D-E i7 980X 4.4Ghz 6GB DDR3 2000 A-Data 64GB SSD Ubuntu 10.10
1 - Asus Rampage Gene III i7 970 4.3Ghz DDR3 2000 2-500GB Seagate 7200.11 0-Raid Ubuntu 10.10
1 - Asus G73JH Laptop i7 740QM 1.86Ghz ATI 5870M
Re: Is GPU folding more valuable than CPU folding scientific
Grandpa_01 wrote: More like triple; the projects I run that use AVX generally run about 3 times as fast as when AVX is not used on the same project.
P5-133XL wrote: I didn't know that there were any folding projects/cores using AVX. I think that the newest GROMACS version is now AVX capable, but I didn't know that any of the folding cores had been converted to use it. The NaCl core is by far the newest GROMACS core and I don't know if it even uses the AVX capabilities. Perhaps you are using the term "projects" in the more general (non-folding) definition.
I did not say fah projects, I said other projects. I was just stating what the actual difference is with other CPU-intensive crunching work. The speedup with AVX is huge, but it is also limited by memory: the faster the memory, the greater the speedup. AVX floods the memory, and the memory can become a bottleneck. If fah develops an AVX-capable client/core, there is potential for a very large and real speedup. Other projects have already done this and the results are quite impressive.
2 - SM H8QGi-F AMD 6xxx=112 cores @ 3.2 & 3.9Ghz
5 - SM X9QRI-f+ Intel 4650 = 320 cores @ 3.15Ghz
2 - I7 980X 4.4Ghz 2-GTX680
1 - 2700k 4.4Ghz GTX680
Total = 464 cores folding
Re: Is GPU folding more valuable than CPU folding scientific
Both SSE and AVX can provide excellent speed up of single precision floating point math. Most of the benefits of SSE could only be obtained with the help of an outstanding programmer. The current version of Gromacs is based on the work of one such programmer. AVX has more potential and a good compiler might do as well or even better, depending on many factors, but only relatively new processors and relatively new OSs have AVX support. Of course many are going to be comparing the speedup of AVX to ordinary x86 code and they can give more impressive numbers than if they were comparing AVX to code that is already highly optimized for SSE.
As long as FAH has to support Donors who do not have AVX support, the Fah infrastructure support will have to get more complex to support both. I don't have any realistic way to estimate what percentage of FAH donors still depend purely on SSE and how many have both AVX capable hardware running on an AVX capable operating system.
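To illustrate what "supporting both" could look like on the client side, here is a minimal sketch of runtime dispatch on Linux. It is not how FAH or GROMACS actually selects code paths; the kernel labels and the function names `parse_cpu_flags` and `select_kernel` are made up for the example. It only relies on the `flags` line that Linux exposes in /proc/cpuinfo.

```python
from pathlib import Path

# Preference order, widest SIMD first. These labels are hypothetical
# names for alternative builds of the same compute kernel.
KERNEL_PREFERENCE = ["avx2", "avx", "sse2"]


def parse_cpu_flags(cpuinfo_text: str) -> set[str]:
    """Return the feature flags from /proc/cpuinfo-style text."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            return set(value.split())
    return set()


def select_kernel(flags: set[str]) -> str:
    """Pick the widest kernel the CPU reports, else a generic fallback."""
    for name in KERNEL_PREFERENCE:
        if name in flags:
            return name
    return "generic"


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")  # Linux only; absent elsewhere
    text = cpuinfo.read_text() if cpuinfo.exists() else ""
    print(select_kernel(parse_cpu_flags(text)))
```

The cost is exactly the one described above: every extra entry in the preference list is another build that has to be compiled, tested and distributed.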
Posting FAH's log:
How to provide enough info to get helpful support.
Re: Is GPU folding more valuable than CPU folding scientific
NaCl uses a version of GROMACS that supports AVX, however I don't know if it is enabled (I would guess not as it is probably a generic compiled exe). There are also a lot of other improvements and speedups in newer GROMACS that all systems can take advantage of. How much of a boost AVX would bring depends on how cache and memory bound folding is (a linpack would get most of the full theoretical boost). My guess would be a decent jump, which may vary between projects (similar to how the GPU projects seem to be variably performing).
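One rough way to picture the "depends on how cache and memory bound" point is a roofline-style estimate: attainable throughput is the smaller of the compute peak and memory bandwidth times arithmetic intensity. The sketch below uses made-up peak and bandwidth figures, not measurements of any real CPU or FAH core, and the function names are only for illustration.

```python
def attainable_gflops(peak_gflops: float, bandwidth_gbs: float,
                      flops_per_byte: float) -> float:
    """Roofline estimate: limited by compute peak or by memory traffic."""
    return min(peak_gflops, bandwidth_gbs * flops_per_byte)


def simd_gain(peak_narrow: float, peak_wide: float,
              bandwidth_gbs: float, flops_per_byte: float) -> float:
    """Estimated speedup from moving to the wider SIMD unit."""
    wide = attainable_gflops(peak_wide, bandwidth_gbs, flops_per_byte)
    narrow = attainable_gflops(peak_narrow, bandwidth_gbs, flops_per_byte)
    return wide / narrow


if __name__ == "__main__":
    # Hypothetical machine: 50 GFLOPS with SSE, 100 GFLOPS with AVX,
    # 20 GB/s of memory bandwidth.
    for intensity in (10.0, 3.0, 1.0):
        gain = simd_gain(50.0, 100.0, 20.0, intensity)
        print(f"{intensity:4.1f} flops/byte -> {gain:.2f}x")
```

With these assumed numbers, a compute-heavy workload (10 flops/byte) gets the full 2x, a middling one (3 flops/byte) gets about 1.2x, and a memory-bound one (1 flop/byte) gets nothing, which is one possible reason the gain could vary between projects.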
-
- Posts: 1576
- Joined: Tue May 28, 2013 12:14 pm
- Location: Tokyo
Re: Is GPU folding more valuable than CPU folding scientific
From what I have read, most of the relevant OSs support AVX2; as for hardware, it needs Haswell or newer (AVX needs Sandy Bridge or newer).
And the current BA was also made for a minority; my guess is that the base of AVX(2)-capable systems is even bigger. Let's hope a "new" core 0xa7 or 0xa8 (wild guess on names) gives us that power.
Finding a good programmer to harvest that power is a challenge, I can understand.
Please contribute your logs to http://ppd.fahmm.net
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengeance (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
Re: Is GPU folding more valuable than CPU folding scientific
The 2x speed up already mentioned was conservative and specific only to FAH, because FAH is already so highly optimized for SSE.
Any old timers will recall the doubling of fah speed when we moved from the Pentium 4 to the Core 2 Duo. C2Ds could process twice the SSE instructions per clock cycle. AVX does approximately the same thing for fah, eventually.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Re: Is GPU folding more valuable than CPU folding scientific
Old timers will also remember the ~3.5x speed-up when SSE optimizations were optional in Core_78 (out of a 4x theoretical maximum). That's when the real SSE programming work was added to FAH.
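For anyone curious what 3.5x out of a theoretical 4x implies, here is a back-of-the-envelope sketch using Amdahl's law. It assumes the only thing that changes is the SIMD width (4 single-precision lanes for SSE, 8 for AVX) and that the vectorizable fraction of the work stays the same; it ignores memory limits, clock differences and everything else, so treat the output as a rough upper-bound estimate, not a prediction for any real core.

```python
def amdahl_speedup(vector_fraction: float, lanes: int) -> float:
    """Speedup over scalar code when `vector_fraction` of the work
    runs `lanes` times faster and the rest is unchanged."""
    return 1.0 / ((1.0 - vector_fraction) + vector_fraction / lanes)


def vector_fraction_from_speedup(speedup: float, lanes: int) -> float:
    """Invert Amdahl's law: the vectorized fraction implied by an
    observed speedup at a given lane count."""
    return (1.0 - 1.0 / speedup) / (1.0 - 1.0 / lanes)


if __name__ == "__main__":
    # 3.5x observed with 4-wide SSE, as quoted for Core_78.
    p = vector_fraction_from_speedup(3.5, 4)
    avx_vs_scalar = amdahl_speedup(p, 8)
    avx_vs_sse = avx_vs_scalar / amdahl_speedup(p, 4)
    print(f"vectorized fraction: {p:.3f}")
    print(f"AVX vs scalar:       {avx_vs_scalar:.2f}x")
    print(f"AVX vs SSE:          {avx_vs_sse:.2f}x")
```

Under those assumptions the vectorized fraction comes out at about 0.95, AVX versus plain scalar code at about 6x, and AVX versus already SSE-optimized code at only about 1.7x. That gap is why comparisons against unoptimized x86 code look so much more impressive than comparisons against well-tuned SSE code.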
Posting FAH's log:
How to provide enough info to get helpful support.