Comparing performance between folding clients
Moderators: Site Moderators, FAHC Science Team
Comparing performance between folding clients
Hi all - is there an easy way to compare by how much my GPU client is faster than a CPU client? I keep hearing that it is much faster, but it would be nice to know by how much and, for example, how both of them compare to a PS3 client. From what I see from other posts, points and WU seems to be a client type specific, e.g. NVIDIA GPU points can't accurately compare to ATI GPU points. Is that the case? Is there a way to perform an apple to apple comparison between different clients (e.g. using the same WU compiled for different core and clients)?
Last edited by sergd on Wed Apr 08, 2009 4:29 am, edited 1 time in total.
-
- Posts: 823
- Joined: Tue Mar 25, 2008 12:45 am
- Hardware configuration: Core i7 3770K @3.5 GHz (not folding), 8 GB DDR3 @2133 MHz, 2xGTX 780 @1215 MHz, Windows 7 Pro 64-bit running 7.3.6 w/ 1xSMP, 2xGPU
4P E5-4650 @3.1 GHz, 64 GB DDR3 @1333MHz, Ubuntu Desktop 13.10 64-bit
Re: Comparting performance between folding clients
This can't happen because the same WUs are not made for multiple clients.sergd wrote:Hi all - is there an easy way to compare by how much my GPU client is faster than a CPU client? I keep hearing that it is much faster, but it would be nice to know by how much and, for example, how both of them compare to a PS3 client. From what I see from other posts, points and WU seems to be a client type specific, e.g. NVIDIA GPU points can't accurately compare to ATI GPU points. Is that the case? Is there a way to perform an apple to apple comparison between different clients (e.g. using the same WU compiled for different core and clients)?
Are you asking about Points Per Day? If you get a monitoring client like FahMon and point it to the client directories, it'll calculate how many points per day each client produces (doesn't work with PS3s). I don't know the site, but there is a webpage that has a database of PPD for most, if not all, of the clients, with multiple hardware configurations. On the ATI and Nvidia forums there are charts of how many PPD most models get. PS3s get approximately 900 PPD when folding 24/7.
The problem with comparing non-PS3 clients is that it depends a good deal on your hardware. The GPU2 client can do as low as a couple hundred PPD (low-end ATI card) to several thousand (Nvidia GTX 280). The SMP client can range from the hundreds (low-end dual-core) to a couple thousand (Core i7). The uniprocesoor client is the only one, other than the PS3, that doesn't have a huge range (only looking at one instance being run), mainly because the current advances in CPU technology seem to be more in number of cores and less in more efficient architecture.
-
- Posts: 90
- Joined: Wed Mar 04, 2009 4:30 am
- Hardware configuration: L1:Dual Core 1.6ghz, 1GB Ram
L2:
PC1&2:P2 3.2ghz, 1GB Ram, Gforce 6800 :(
PS3 - Location: Australia
Re: Comparting performance between folding clients
Client statistics by OS
The Pande Group from Stanford have done some conversions of the output from each different client type to make them all comparable... how this relates to points per day on each of the clients I've no idea...
If you average it out, each active playstation out there is producing 0.06 [x86]tflops but this is the only client you could possibly hope to come up with that kind of figure for... all the other clients produce a different ppd depending on the system they are being run on.
It'd be nice to have some form of comparitive figure though, would help in determining our personal "ideal" balance for systems which we run.
The Pande Group from Stanford have done some conversions of the output from each different client type to make them all comparable... how this relates to points per day on each of the clients I've no idea...
If you average it out, each active playstation out there is producing 0.06 [x86]tflops but this is the only client you could possibly hope to come up with that kind of figure for... all the other clients produce a different ppd depending on the system they are being run on.
It'd be nice to have some form of comparitive figure though, would help in determining our personal "ideal" balance for systems which we run.
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
- Contact:
Re: Comparting performance between folding clients
While the GPU client may be 20x faster at folding than the CPU client, it only earns 10x the points, or whatever the current values are. This is because the work done by the GPU is of a very limited scope, and not 20x as useful as the CPU client. While nowhere near perfect, the best comparison to make at this time is the rate PPD of the benchmark system for each client. 110 CPU, 900 PS3, 1500 GPU, 1760 SMP.
Do NOT compare flops, as those numbers can be deceptive, especially when you take in to account the relative speeds I mentioned above. 20x the flops does not mean the client is 20x faster nor produces 20x the amount of useful science.
Do NOT compare flops, as those numbers can be deceptive, especially when you take in to account the relative speeds I mentioned above. 20x the flops does not mean the client is 20x faster nor produces 20x the amount of useful science.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Tell me and I forget. Teach me and I remember. Involve me and I learn.
-
- Posts: 30
- Joined: Thu Jan 08, 2009 11:30 am
- Hardware configuration: 3 x Nvidia Geforce GTX 295 cards (total 6 GPU's)
AMD Phenom x4 2600MHz Black Edition
MSI 790FX K9A2 Platinum system board, 4GB RAM
ATX case, 1000 W Chieftec PS
Windows XP 64-bit
More: http://estoniadonates.wordpress.com/ - Location: Tallinn, Estonia, EU
- Contact:
Re: Comparting performance between folding clients
Are Your sure? Running one 1 GFLOPS PC or running 20 PC-s each 1 GFLOPS. I really think that I give 20x more to science???!7im wrote:20x the flops does not mean the client is 20x faster nor produces 20x the amount of useful science.
Jaak
building 32 GPU folding rig: 16 x NVIDIA GeForce GTX 295 cards; dual PSU solution; 4 nodes.
Website: Estonia Donates, ambitious 400 PPD supercomputer project
Sponsored by AdlerFlow ceramic coated Porsche exhausts
Website: Estonia Donates, ambitious 400 PPD supercomputer project
Sponsored by AdlerFlow ceramic coated Porsche exhausts
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
- Contact:
Re: Comparting performance between folding clients
Again, that is not an accurate assumption if you based it on the GFLOPS.
The CPU client is much slower, but can do a wide variety of calculations. The GPU is much faster, but can only do one type of calculation. Read the history section of the GPU FAQ for more info on the nature of GPU folding.
Note, I am not arguing against you. I am not saying the 20 CPUs are better. I am saying you can't make that judgement based on the GFLOPS. The GFLOPS count is deceptive in the GPU client, as it has to repeat some calculations twice because it can't store the results for some simulations. So while 20x faster, it might only be doing 10x the actual work. I don't have exact numbers, but that should demonstrate why you can't compare GFLOPS.
The CPU client is much slower, but can do a wide variety of calculations. The GPU is much faster, but can only do one type of calculation. Read the history section of the GPU FAQ for more info on the nature of GPU folding.
Note, I am not arguing against you. I am not saying the 20 CPUs are better. I am saying you can't make that judgement based on the GFLOPS. The GFLOPS count is deceptive in the GPU client, as it has to repeat some calculations twice because it can't store the results for some simulations. So while 20x faster, it might only be doing 10x the actual work. I don't have exact numbers, but that should demonstrate why you can't compare GFLOPS.
Last edited by 7im on Sat Apr 04, 2009 5:19 am, edited 1 time in total.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Tell me and I forget. Teach me and I remember. Involve me and I learn.
-
- Posts: 90
- Joined: Wed Mar 04, 2009 4:30 am
- Hardware configuration: L1:Dual Core 1.6ghz, 1GB Ram
L2:
PC1&2:P2 3.2ghz, 1GB Ram, Gforce 6800 :(
PS3 - Location: Australia
Re: Comparting performance between folding clients
I thought points per day was based purely as a comparison of your system with the benchmark system for that client type and had no real useable inference to the science being achieved.
It would be nice to be able to compare all the different clients together is all, Granny smith apples to luscious red apples at least, rather than apples with oranges
It would be nice to be able to compare all the different clients together is all, Granny smith apples to luscious red apples at least, rather than apples with oranges
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
- Contact:
Re: Comparting performance between folding clients
Yes, and then no. You have it half right.kelliegang wrote:I thought points per day was based purely as a comparison of your system with the benchmark system for that client type and had no real useable inference to the science being achieved.
It would be nice to be able to compare all the different clients together is all, Granny smith apples to luscious red apples at least, rather than apples with oranges
Because the benchmark PPD value is based primarily on scientific production, and since your system performance is evaluated relative to that benchmark, your system PPD is tied to the science it produces. It's not exactly linear, and science isn't the ONLY criteria, but science is the MAIN consideration in regards to the benchmark.
PPD is in the only apples to apples comparison you can make at this time. But again, it's not exact.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Tell me and I forget. Teach me and I remember. Involve me and I learn.
-
- Posts: 90
- Joined: Wed Mar 04, 2009 4:30 am
- Hardware configuration: L1:Dual Core 1.6ghz, 1GB Ram
L2:
PC1&2:P2 3.2ghz, 1GB Ram, Gforce 6800 :(
PS3 - Location: Australia
Re: Comparting performance between folding clients
Cheers for the clarity.
Though if, in terms of importance, PS3 = CPU = GPU = SMP .... then swouldnt it be a good idea to apply some form of weighting so the ppd be more equal rather than things like 6,000 ppd for nvidia clients compared to 100-150ppd for cpu clients?
Though if, in terms of importance, PS3 = CPU = GPU = SMP .... then swouldnt it be a good idea to apply some form of weighting so the ppd be more equal rather than things like 6,000 ppd for nvidia clients compared to 100-150ppd for cpu clients?
-
- Posts: 823
- Joined: Tue Mar 25, 2008 12:45 am
- Hardware configuration: Core i7 3770K @3.5 GHz (not folding), 8 GB DDR3 @2133 MHz, 2xGTX 780 @1215 MHz, Windows 7 Pro 64-bit running 7.3.6 w/ 1xSMP, 2xGPU
4P E5-4650 @3.1 GHz, 64 GB DDR3 @1333MHz, Ubuntu Desktop 13.10 64-bit
Re: Comparting performance between folding clients
I think the issue is that it's kind of hard to rate the clients in terms of importance because they each have their advantages and weaknesses, and it's kind of hard to average out the benefits to some sort of hardline figure.kelliegang wrote:Cheers for the clarity.
Though if, in terms of importance, PS3 = CPU = GPU = SMP .... then swouldnt it be a good idea to apply some form of weighting so the ppd be more equal rather than things like 6,000 ppd for nvidia clients compared to 100-150ppd for cpu clients?
Maybe the only ones that could be compared are SMP vs. CPU since they both run on the same hardware. I think I once saw that part of the reason why the SMP client gets more PPD vs. running as many instances of the uniprocessor client as you have physical cores is that the SMP has the advantage of inter-core communication, allowing more science to be done (the other parts are that the SMP clients are still not nearly as stable as the CPU client and have much shorter deadlines).
-
- Posts: 30
- Joined: Thu Jan 08, 2009 11:30 am
- Hardware configuration: 3 x Nvidia Geforce GTX 295 cards (total 6 GPU's)
AMD Phenom x4 2600MHz Black Edition
MSI 790FX K9A2 Platinum system board, 4GB RAM
ATX case, 1000 W Chieftec PS
Windows XP 64-bit
More: http://estoniadonates.wordpress.com/ - Location: Tallinn, Estonia, EU
- Contact:
Re: Comparting performance between folding clients
Isn't there now apples-to-apples comparisin x86 FLOPS metrics up and running now?
http://fah-web.stanford.edu/cgi-bin/mai ... pe=osstats
http://fah-web.stanford.edu/cgi-bin/mai ... pe=osstats
building 32 GPU folding rig: 16 x NVIDIA GeForce GTX 295 cards; dual PSU solution; 4 nodes.
Website: Estonia Donates, ambitious 400 PPD supercomputer project
Sponsored by AdlerFlow ceramic coated Porsche exhausts
Website: Estonia Donates, ambitious 400 PPD supercomputer project
Sponsored by AdlerFlow ceramic coated Porsche exhausts
Re: Comparting performance between folding clients
Not if what 7im said is true, in that the GPU clients have to rerun some of those calculations, because they do not retain the results in memory like a CPU would.
Folding@home: 24/7
- RTX 4070 Ti
- RTX 4070 Ti
-
- Posts: 90
- Joined: Wed Mar 04, 2009 4:30 am
- Hardware configuration: L1:Dual Core 1.6ghz, 1GB Ram
L2:
PC1&2:P2 3.2ghz, 1GB Ram, Gforce 6800 :(
PS3 - Location: Australia
Re: Comparting performance between folding clients
Well Jaak, if I understand what is being said, the argument is that flops dont represent the "science" as clearly as points per day does I think? Surely though you could easily give an x86 factor instead of points per day [figure out total flops produced by one client type, divide this by the points contributed by that client type; voila x86 rating per point]
-
- Posts: 85
- Joined: Fri Feb 13, 2009 12:38 pm
- Hardware configuration: Linux & CPUs
- Location: USA
Re: Comparting performance between folding clients
If the only thing the clients did were "flops" , It might be easier to figure. The clients are calculating the path each atom takes as it morphs from one compound into another. It is much more complex than "How many times can I multiply each second".
Re: Comparting performance between folding clients
I don't think that you can say that PS3 = CPU = GPU = SMP. In fact, all are important but it is very difficult to know how to compare them. To make a Waldorf salad, you need a mixture of fruits. If it's all apples or all oranges or all nuts, it's not a good salad. If you have two salads with different mixtures, both can be acceptable.kelliegang wrote:Though if, in terms of importance, PS3 = CPU = GPU = SMP .... then swouldnt it be a good idea to apply some form of weighting so the ppd be more equal rather than things like 6,000 ppd for nvidia clients compared to 100-150ppd for cpu clients?
If there are reasonable variations in the production of the four clients, science will get done. If too many people choose NOT to run, say the SMP client becuase it's harder to make work, then the Pande Group might consider giving it more points to influence your choices, but the generally tend to make no changes as long as the overall mixture is in the reasonable range.
Posting FAH's log:
How to provide enough info to get helpful support.
How to provide enough info to get helpful support.