Fails to restart after disconnection from internet


jarruza
Posts: 7
Joined: Fri Sep 18, 2020 10:19 pm


Post by jarruza »

F@H was running and working on a WU when I disconnected the internet. Now it is not running. I reconnected the internet, expecting the process to start up automatically. However, F@H is still inactive and has neither restarted nor uploaded the WU it completed. I still see the process in Task Manager, but at 0% CPU.

Has anyone seen this behavior before, i.e., disconnecting the internet while processing a WU causes F@H to eventually stop working, even after reconnecting? I would expect it to periodically check for internet connectivity, then upload the results and pull down the next WU.

Thanks
JimboPalmer
Posts: 2522
Joined: Mon Feb 16, 2009 4:12 am
Location: Greenwood MS USA

Re: Fails to restart after disconnection from internet

Post by JimboPalmer »

Welcome to Folding@Home!

When a Work Unit finishes, the Client starts looking for another WU. At first it tries fairly often, but over time it checks less and less often.

You can speed this up by stopping and restarting the client or just rebooting the PC.
Tsar of all the Rushers
I tried to remain childlike, all I achieved was childish.
A friend to those who want no friends
jarruza
Posts: 7
Joined: Fri Sep 18, 2020 10:19 pm

Re: Fails to restart after disconnection from internet

Post by jarruza »

If I reboot the PC, will I lose the work or is it saved? If it's lost by default, is there a way to preserve the work, e.g., pause the app, reboot, then start (or unpause)?

Thanks
Neil-B
Posts: 1996
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: Fails to restart after disconnection from internet

Post by Neil-B »

If you pause the slots, wait a minute or so, then unpause, you should reset the retry timer.
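The pause, wait, unpause sequence from this reply could be scripted roughly as follows. This is only a sketch: it assumes FAHClient v7 is on the PATH and accepts the `--send-pause` and `--send-unpause` flags, and the helper names `fah_send` and `reset_retry_timer` are made up for illustration.

```python
import subprocess
import time
from typing import Callable


def fah_send(flag: str) -> None:
    """Send a control flag to the running client.

    Assumption: FAHClient v7 is on PATH and supports --send-pause and
    --send-unpause. Raises FileNotFoundError if the executable is missing.
    """
    subprocess.run(["FAHClient", flag], check=False)


def reset_retry_timer(
    pause: Callable[[], None] = lambda: fah_send("--send-pause"),
    unpause: Callable[[], None] = lambda: fah_send("--send-unpause"),
    wait_seconds: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Pause all slots, wait about a minute, then unpause them."""
    pause()
    sleep(wait_seconds)
    unpause()
```

Calling `reset_retry_timer()` with no arguments runs the sequence against the local client. The callables are parameters so the logic can be tried out without a client installed.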
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070

JimboPalmer
Posts: 2522
Joined: Mon Feb 16, 2009 4:12 am
Location: Greenwood MS USA

Re: Fails to restart after disconnection from internet

Post by JimboPalmer »

jarruza wrote: If I reboot the PC, will I lose the work or is it saved?
I seem to have misunderstood your situation.

I thought you were completing Work Units while offline and that, once you were back online, the work was not being uploaded.

Now you ask about work in the middle of being processed, so my image of what is happening is flawed.

Can you explain your setup?
Knish
Posts: 222
Joined: Tue Mar 17, 2020 5:20 am

Re: Fails to restart after disconnection from internet

Post by Knish »

If it is a CPU WU, you won't lose it through a reboot. If it is a GPU WU, there is a chance of losing it: if you have no internet connection up and running when FAHClient starts, and the client tries to update the GPUs.txt file, it will drop the WU data. This only happens about once a month, though.

Example: I have a PC with a Wi-Fi dongle that I have to connect to the internet manually. I left this PC folding for more than a month straight, then needed to reboot. I paused after letting all WUs finish. After the reboot, FAHClient started before I could connect my Wi-Fi, so it tried to update GPUs.txt and failed. It couldn't verify the GPU slot, so it dropped the GPU WU. The CPU WU was fine, though.
jarruza
Posts: 7
Joined: Fri Sep 18, 2020 10:19 pm

Re: Fails to restart after disconnection from internet

Post by jarruza »

JimboPalmer wrote: I seem to have misunderstood your situation. I thought you were completing Work Units while offline and that, once you were back online, the work was not being uploaded.
No misunderstanding: the WU was completed while the device was offline. Once back online, F@H did not upload it.

However, based on the comments in this thread, I think I can work around this shortcoming programmatically: check for internet connectivity and, if it has been down for, say, 15 or 30 minutes, send the pause command to F@H. Then wait for connectivity and, once it is re-established, unpause F@H.
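A rough Python sketch of that idea is below. It is not tested against a real client and rests on several assumptions: that FAHClient v7 is on the PATH and accepts `--send-pause` / `--send-unpause`, that a TCP connection to `1.1.1.1:53` is an acceptable connectivity probe, and that a 60-second poll interval is fine. The names `Watchdog`, `is_online`, `send_to_client` and `run_forever` are invented for this example.

```python
import socket
import subprocess
import time
from typing import Callable, Optional

OFFLINE_GRACE_SECONDS = 15 * 60  # the post suggests 15 or 30 minutes


def is_online(host: str = "1.1.1.1", port: int = 53, timeout: float = 3.0) -> bool:
    """Connectivity probe (assumed target): True if a TCP connect succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def send_to_client(flag: str) -> None:
    # Assumption: FAHClient v7 is on PATH and accepts these flags.
    subprocess.run(["FAHClient", flag], check=False)


class Watchdog:
    """Pause after a sustained outage; unpause once connectivity returns."""

    def __init__(self, pause: Callable[[], None], unpause: Callable[[], None],
                 grace: float = OFFLINE_GRACE_SECONDS) -> None:
        self.pause = pause
        self.unpause = unpause
        self.grace = grace
        self.offline_since: Optional[float] = None
        self.paused = False

    def step(self, online: bool, now: float) -> None:
        if online:
            self.offline_since = None
            if self.paused:          # connection is back: resume folding
                self.unpause()
                self.paused = False
            return
        if self.offline_since is None:
            self.offline_since = now  # start of this outage
        if not self.paused and now - self.offline_since >= self.grace:
            self.pause()              # outage exceeded the grace period
            self.paused = True


def run_forever(poll_seconds: float = 60.0) -> None:
    dog = Watchdog(lambda: send_to_client("--send-pause"),
                   lambda: send_to_client("--send-unpause"))
    while True:
        dog.step(is_online(), time.monotonic())
        time.sleep(poll_seconds)
```

Calling `run_forever()` starts the loop. Note that pausing also stops folding on any WU still in progress, so a longer grace period trades faster recovery for less lost compute time.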

Thanks
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Fails to restart after disconnection from internet

Post by PantherX »

jarruza wrote: ...the WU was completed while the device was offline. Once back online F@H did not upload...
If you look at the log file, you may notice that once the WU was finished, the client tried to upload it and failed, which is expected. However, upload retries use an exponential back-off that caps the interval between retries at 1 hour. The fastest way to recover is to either pause/unpause your slot or restart the client.
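To illustrate why the wait can grow so long, here is a minimal Python sketch of a capped exponential back-off. Only the 1-hour cap comes from this thread; the 60-second starting delay and the doubling factor are assumptions for illustration, not values taken from the client.

```python
from itertools import islice
from typing import Iterator, List

MAX_DELAY_SECONDS = 3600.0  # 1-hour cap mentioned in this thread


def backoff_delays(base_seconds: float = 60.0, factor: float = 2.0,
                   cap_seconds: float = MAX_DELAY_SECONDS) -> Iterator[float]:
    """Yield retry delays that grow geometrically until they hit the cap.

    base_seconds and factor are assumed values, not the client's real ones.
    """
    delay = base_seconds
    while True:
        yield min(delay, cap_seconds)
        delay = min(delay * factor, cap_seconds)


def first_delays(n: int) -> List[float]:
    """Return the first n delays of the default schedule."""
    return list(islice(backoff_delays(), n))
```

Under these assumed parameters, `first_delays(8)` gives 60, 120, 240, 480, 960, 1920, 3600, 3600 seconds. After a long outage the client may therefore sit idle for up to an hour before its next attempt unless the timer is reset.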
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time
