A Central Database Of Completed Projects
Posted: Sat Jan 04, 2014 1:06 am
I was reading an article about a UBC study recently (about the loss of data from surveys) and it made me ask the question: Is there any way to access the data ( Completed WU ) that the network has created? Is it stored in a central location? Is it just given to the scientists and then forgotten and possibly *gasp* deleted?
Also another question, just for curiosity, what type of file type is the data stored in?
Below is some of my suggestions for how the data could be made available for public "use". (This is assuming it already isn't or that there isn't some legal reason why it can't be public. I was jovial when writing this, so take it that way. feel free to point out any flaws in my suggestions.
My own opinion is that it should be available to the "public" because the data was created by the public. I have a few suggestions for this but I do not actually know how large the data is so it is difficult to speculate on much. The first idea would be as downloads (each protein its own file) on a server. This may not work because there may be a high number of downloads using a large amount of bandwidth and requiring a large server infrastructure. My second idea is that the data for each protein could be stored on a server with a limited bandwidth cap and available as a torrent download. This would put less strain on any server and because there would always be at least one server with the torrent seed the torrent would never be unavailable.
Also, if I'm just blind and can't find it even though it's available, feel free to get mad for wasting your time.
Also another question, just for curiosity, what type of file type is the data stored in?
Below is some of my suggestions for how the data could be made available for public "use". (This is assuming it already isn't or that there isn't some legal reason why it can't be public. I was jovial when writing this, so take it that way. feel free to point out any flaws in my suggestions.
My own opinion is that it should be available to the "public" because the data was created by the public. I have a few suggestions for this but I do not actually know how large the data is so it is difficult to speculate on much. The first idea would be as downloads (each protein its own file) on a server. This may not work because there may be a high number of downloads using a large amount of bandwidth and requiring a large server infrastructure. My second idea is that the data for each protein could be stored on a server with a limited bandwidth cap and available as a torrent download. This would put less strain on any server and because there would always be at least one server with the torrent seed the torrent would never be unavailable.
Also, if I'm just blind and can't find it even though it's available, feel free to get mad for wasting your time.