I have a large GPU job finishing in 20 minutes or so.
The Collection Server address is 0.0.0.0. That's obviously not good.
Will the job result be sent to the Work Server (52.224.109.74)?
https://www.dropbox.com/s/e4ffcgwwbc652 ... er0000.png
Collection Server: 0.0.0.0 (?)
Re: Collection Server: 0.0.0.0 (?)
The collection server is just a backup in case the main server is down; not all servers have a collection server set up.
-
- Posts: 1996
- Joined: Sun Mar 22, 2020 5:52 pm
- Hardware configuration: 1: 2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
- Location: UK
Re: Collection Server: 0.0.0.0 (?)
One could say that, in the normal course of events, needing a CS is a bad thing. A WU always needs to get back to the WS that issued it, so if it has to go to a CS, there is an issue with the WS or with connecting to the WS, and the WU is going to be delayed. When things are more stable, tbh, a CS should be needed much less. It is there as a backup, but in the normal course of events it should rarely, if ever, be needed.
However, most WS currently do have a CS to cover overload and server issues during this period of rapid expansion.
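To make the WS/CS relationship described above concrete, here is a minimal sketch of the fallback order. It is not the actual Folding@home client code: the function names `has_collection_server` and `upload_targets` are hypothetical, the second address in the usage note is made up, and the sketch assumes that a CS shown as 0.0.0.0 simply means no collection server is configured.

```python
import ipaddress
from typing import List, Optional


def has_collection_server(cs_address: Optional[str]) -> bool:
    """Return True if cs_address names a usable collection server.

    Assumption: an unset CS is reported as the unspecified address 0.0.0.0.
    """
    if not cs_address:
        return False
    return not ipaddress.ip_address(cs_address).is_unspecified


def upload_targets(ws_address: str, cs_address: Optional[str]) -> List[str]:
    """List upload targets in the order they would be tried.

    The work server that issued the WU always comes first; the
    collection server is appended only as a backup, if one exists.
    """
    targets = [ws_address]
    if has_collection_server(cs_address):
        targets.append(cs_address)
    return targets


# With the addresses from the original question, only the WS is a target.
print(upload_targets("52.224.109.74", "0.0.0.0"))
```

With a real CS address in place of 0.0.0.0, for example the made-up `10.0.0.5`, the list would hold both servers, with the WS still tried first.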
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070
(Green/Bold = Active)
Re: Collection Server: 0.0.0.0 (?)
It would be nice if they were never needed, but users have lost a lot of points due to down servers. Of course, they have also lost points because the collection server needs to know which units are valid to collect, and collection servers are sometimes out of sync and refuse valid units.
-
- Posts: 12
- Joined: Sun Apr 26, 2020 6:59 pm
Re: Collection Server: 0.0.0.0 (?)
Thanks for the replies. Good to know all that hard work doesn't end up in a bit bucket!
Re: Collection Server: 0.0.0.0 (?)
Being up front and honest, unfortunately sometimes it does (a couple of issues at the moment mean some work is doing just that), but everyone tries really hard to stop this from happening and to rectify the issues that cause it when they arise.
Re: Collection Server: 0.0.0.0 (?)
So in the end it was OK.