COVID Moonshot - Unefficient Assignment!!

Posted: Tue Aug 04, 2020 10:05 am
by wuchzael
The Nvidia bias of the folding client will slow this sprint down to a snail's pace. None of my Nvidia GPUs got one of these (unrewarding) 134xx WUs, while my AMD cards get nothing else - without being utilized efficiently. What a mess... the ancient GTX 970 gets giant WUs that take hours to fold while the Vega is not even half utilized with these tiny WUs. Maybe you should rethink the assignment algorithm?

Cheers

Re: COVID Moonshot - Unefficient Assignment!!

Posted: Tue Aug 04, 2020 11:23 am
by Neil-B
What flag for client-type are you running (beta, advanced, "none")? ... I ask as I believe there are beta and advanced GPU projects which are non Covid-19 - the 134** Projects are I believe public (no flag) ... This may mean that GPU slots running beta or advanced flags may get non 134** WUs due to the nature of the flags - a beta flag will get a beta WU in preference to advanced or public - an advanced flag will get an advanced WU in preference to public ... If I am right then removing a beta or advanced flag may push the AS towards the highest priority public WUs, which are I believe the 134** Covid-19 ones.
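
The preference order described above can be sketched in a few lines of Python. This is only an illustration of the reasoning in this post, not the actual Assignment Server logic: the `PREFERENCE` table, the `pick_work_unit` function, and the project names are all hypothetical.

```python
# Hypothetical sketch of the flag preference described above.
# Tiers a slot may draw from, in order of preference, per client-type flag.
PREFERENCE = {
    "beta": ["beta", "advanced", "public"],
    "advanced": ["advanced", "public"],
    "none": ["public"],
}


def pick_work_unit(flag, available):
    """Return the first available project for this flag, or None.

    `available` maps a tier name to a list of project names,
    highest priority first (an assumed data layout).
    """
    for tier in PREFERENCE[flag]:
        projects = available.get(tier, [])
        if projects:
            return projects[0]
    return None


available = {
    "beta": ["some-beta-project"],
    "advanced": ["some-advanced-project"],
    "public": ["134xx"],
}
print(pick_work_unit("beta", available))
print(pick_work_unit("none", available))
```

Under these assumptions, a slot with a beta flag is handed the beta project while one exists, and only a slot with no flag goes straight to the public 134xx work.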

Re: COVID Moonshot - Unefficient Assignment!!

Posted: Tue Aug 04, 2020 1:54 pm
by Joe_H
As written about several times before in connection with these WUs, data from these projects is being collected towards modifying the assignment algorithm. It is not going to happen overnight.

Re: COVID Moonshot - Unefficient Assignment!!

Posted: Wed Aug 05, 2020 12:40 am
by JohnChodera
> The Nvidia bias of the folding client will slow this sprint down to a snail's pace. None of my Nvidia GPUs got one of these (unrewarding) 134xx WUs, while my AMD cards get nothing else - without being utilized efficiently. What a mess... the ancient GTX 970 gets giant WUs that take hours to fold while the Vega is not even half utilized with these tiny WUs. Maybe you should rethink the assignment algorithm?

We have someone digging into benchmark data from 17100 (the benchmarking project) right now! We're actively working on refining the GPUSpecies to use more of the valid 2-255 range so that we can better refine these projects with live data. Right now, the GPUSpecies uses a narrow range (2-7) and manually-defined categories that don't work well, especially for projects like 134xx that have a very different workload than other projects. The benchmark project includes a variety of different workloads, so we'll be better able to cluster GPUs that achieve equivalent performance.
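
As a rough illustration of spreading GPUs over the wider 2-255 range, here is a minimal Python sketch. It assumes each GPU has a single benchmark score and places it on a log scale; the function name, the GPU labels, and the scores are all made up, and the real clustering over several workloads may look nothing like this.

```python
import math

SPECIES_MIN, SPECIES_MAX = 2, 255  # valid GPUSpecies range mentioned above


def assign_species(scores):
    """Map each GPU's benchmark score to a species value in [2, 255].

    Scores are placed on a log scale (an assumption) so that GPUs with
    similar relative performance land on nearby species values.
    """
    logs = {gpu: math.log(score) for gpu, score in scores.items()}
    lo, hi = min(logs.values()), max(logs.values())
    span = hi - lo
    species = {}
    for gpu, value in logs.items():
        frac = 0.0 if span == 0 else (value - lo) / span
        species[gpu] = SPECIES_MIN + round(frac * (SPECIES_MAX - SPECIES_MIN))
    return species


# Hypothetical benchmark scores, not real data.
scores = {"gpu_slow": 100.0, "gpu_mid": 1000.0, "gpu_fast": 10000.0}
species = assign_species(scores)
print(species)
```

The slowest GPU maps to 2 and the fastest to 255, with everything else in between, instead of being squeezed into the six manual categories of the current 2-7 range.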

Longer-term, we are working on a way to ensure your GPU really can deliver the PPD you expect with a more clever approach.

Thanks so much for bearing with us---we're generating a ton of useful data for the COVID Moonshot that will, with some luck, produce a new COVID-19 therapeutic candidate!

~ John Chodera // MSKCC