Long Delay Between Done and Shutting Down Core

Moderators: Site Moderators, FAHC Science Team

Post Reply
NickOfTime
Posts: 12
Joined: Thu Nov 08, 2012 1:09 am

Long Delay Between Done and Shutting Down Core

Post by NickOfTime »

I am seeing long delays between when a WU is completed and Done and the Shutdown and FINISHED_UNIT Returned

25m at 16:26 and 13m at 18:11....

Running Client 7.2.9 in Ubuntu 12.10 Desktop in Win8 Hyper-V
On 2P AMD 6234 with SMP 20 and thekraken installed.

Code: Select all

	16:26:47:WU00:FS00:0xa4:  ... Done.
	16:51:26:WU00:FS00:0xa4:- Shutting down core
16:51:26:WU00:FS00:0xa4:
16:51:26:WU00:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
16:54:20:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
16:54:20:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:8050 run:23 clone:3 gen:51 core:0xa4 unit:0x000000476652edcc50133c66063c24f3
16:54:20:WU00:FS00:Uploading 4.06MiB to 171.67.108.60
16:54:20:WU00:FS00:Connecting to 171.67.108.60:8080
16:54:20:WU01:FS00:Starting
16:54:20:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 702 -lifeline 1217 -checkpoint 5 -np 20
16:54:20:WU01:FS00:Started FahCore on PID 11195
16:54:20:Started thread 11 on PID 1217
16:54:20:WU01:FS00:Core PID:11199
16:54:20:WU01:FS00:FahCore 0xa4 started
16:54:21:WU01:FS00:0xa4:
16:54:21:WU01:FS00:0xa4:*------------------------------*
16:54:21:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
16:54:21:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
16:54:21:WU01:FS00:0xa4:
16:54:21:WU01:FS00:0xa4:Preparing to commence simulation
16:54:21:WU01:FS00:0xa4:- Looking at optimizations...
16:54:21:WU01:FS00:0xa4:- Created dyn
16:54:21:WU01:FS00:0xa4:- Files status OK
16:54:21:WU01:FS00:0xa4:- Expanded 965818 -> 2208812 (decompressed 228.6 percent)
16:54:21:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=965818 data_size=2208812, decompressed_data_size=2208812 diff=0
16:54:21:WU01:FS00:0xa4:- Digital signature verified
16:54:21:WU01:FS00:0xa4:
16:54:21:WU01:FS00:0xa4:Project: 8049 (Run 229, Clone 4, Gen 128)
16:54:21:WU01:FS00:0xa4:
16:54:21:WU01:FS00:0xa4:Assembly optimizations on if available.
16:54:21:WU01:FS00:0xa4:Entering M.D.
16:54:26:WU00:FS00:Upload 15.40%
16:54:28:WU01:FS00:0xa4:Completed 0 out of 250000 steps  (0%)
16:54:32:WU00:FS00:Upload 33.88%
16:54:38:WU00:FS00:Upload 52.37%
16:54:44:WU00:FS00:Upload 66.23%
16:54:51:WU00:FS00:Upload 77.01%
16:54:59:WU00:FS00:Upload 86.25%
16:55:06:WU00:FS00:Upload 92.41%
16:55:13:WU00:FS00:Upload 97.03%
16:55:14:WU01:FS00:0xa4:Completed 2500 out of 250000 steps  (1%)
16:55:23:WU00:FS00:Upload complete
16:55:23:WU00:FS00:Server responded WORK_ACK (400)
16:55:23:WU00:FS00:Final credit estimate, 3560.00 points
16:55:23:WU00:FS00:Cleaning up
16:55:59:WU01:FS00:0xa4:Completed 5000 out of 250000 steps  (2%)
16:56:44:WU01:FS00:0xa4:Completed 7500 out of 250000 steps  (3%)
16:57:29:WU01:FS00:0xa4:Completed 10000 out of 250000 steps  (4%)

18:07:19:WU01:FS00:0xa4:Completed 237500 out of 250000 steps  (95%)
18:08:05:WU01:FS00:0xa4:Completed 240000 out of 250000 steps  (96%)
18:08:52:WU01:FS00:0xa4:Completed 242500 out of 250000 steps  (97%)
18:09:40:WU01:FS00:0xa4:Completed 245000 out of 250000 steps  (98%)
18:10:27:WU01:FS00:0xa4:Completed 247500 out of 250000 steps  (99%)
18:11:13:WU01:FS00:0xa4:Completed 250000 out of 250000 steps  (100%)
18:11:15:WU00:FS00:Connecting to assign3.stanford.edu:8080
18:11:15:WU01:FS00:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
18:11:15:WU00:FS00:News: Welcome to Folding@Home
18:11:15:WU00:FS00:Assigned to work server 171.67.108.60
18:11:15:WU00:FS00:Requesting new work unit for slot 00: RUNNING smp:20 from 171.67.108.60
18:11:15:WU00:FS00:Connecting to 171.67.108.60:8080
18:11:16:WU00:FS00:Downloading 946.61KiB
18:11:18:WU00:FS00:Download complete
18:11:18:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:8049 run:341 clone:11 gen:91 core:0xa4 unit:0x000000946652edcc501330d661c76231
18:11:25:WU01:FS00:0xa4:


18:11:25:WU01:FS00:0xa4:Finished Work Unit:
18:11:25:WU01:FS00:0xa4:- Reading up to 1405068 from "01/wudata_01.trr": Read 1405068
18:11:25:WU01:FS00:0xa4:trr file hash check passed.
18:11:25:WU01:FS00:0xa4:- Reading up to 851116 from "01/wudata_01.xtc": Read 851116
18:11:25:WU01:FS00:0xa4:xtc file hash check passed.
18:11:25:WU01:FS00:0xa4:edr file hash check passed.
18:11:25:WU01:FS00:0xa4:logfile size: 23627
18:11:25:WU01:FS00:0xa4:Leaving Run
18:11:28:WU01:FS00:0xa4:- Writing 2285215 bytes of core data to disk...
18:11:28:WU01:FS00:0xa4:Done: 2284703 -> 2183652 (compressed to 95.5 percent)
	18:11:28:WU01:FS00:0xa4:  ... Done.
	18:24:38:WU01:FS00:0xa4:- Shutting down core
18:24:38:WU01:FS00:0xa4:
18:24:38:WU01:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
18:26:12:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
18:26:12:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:8049 run:229 clone:4 gen:128 core:0xa4 unit:0x000000e26652edcc50132e4fcc857b64
18:26:12:WU01:FS00:Uploading 2.08MiB to 171.67.108.60
18:26:12:WU01:FS00:Connecting to 171.67.108.60:8080
18:26:12:WU00:FS00:Starting
18:26:12:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 00 -suffix 01 -version 702 -lifeline 1217 -checkpoint 5 -np 20
18:26:12:WU00:FS00:Started FahCore on PID 11550
18:26:12:Started thread 12 on PID 1217
18:26:12:WU00:FS00:Core PID:11554
18:26:12:WU00:FS00:FahCore 0xa4 started
18:26:13:WU00:FS00:0xa4:
18:26:13:WU00:FS00:0xa4:*------------------------------*
18:26:13:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
18:26:13:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
18:26:13:WU00:FS00:0xa4:
18:26:13:WU00:FS00:0xa4:Preparing to commence simulation
18:26:13:WU00:FS00:0xa4:- Looking at optimizations...
18:26:13:WU00:FS00:0xa4:- Created dyn
18:26:13:WU00:FS00:0xa4:- Files status OK
18:26:13:WU00:FS00:0xa4:- Expanded 968814 -> 2214428 (decompressed 228.5 percent)
18:26:13:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=968814 data_size=2214428, decompressed_data_size=2214428 diff=0
18:26:13:WU00:FS00:0xa4:- Digital signature verified
18:26:13:WU00:FS00:0xa4:
18:26:13:WU00:FS00:0xa4:Project: 8049 (Run 341, Clone 11, Gen 91)
18:26:13:WU00:FS00:0xa4:
18:26:13:WU00:FS00:0xa4:Assembly optimizations on if available.
18:26:13:WU00:FS00:0xa4:Entering M.D.
18:26:18:WU01:FS00:Upload 30.01%
18:26:20:WU00:FS00:0xa4:Completed 0 out of 250000 steps  (0%)
18:26:24:WU01:FS00:Upload 66.01%
18:26:30:WU01:FS00:Upload 99.02%
18:26:32:WU01:FS00:Upload complete
18:26:32:WU01:FS00:Server responded WORK_ACK (400)
18:26:32:WU01:FS00:Final credit estimate, 1693.00 points
18:26:32:WU01:FS00:Cleaning up
18:27:08:WU00:FS00:0xa4:Completed 2500 out of 250000 steps  (1%)
bollix47
Posts: 2963
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Long Delay Between Done and Shutting Down Core

Post by bollix47 »

Welcome to the folding support forum NickOfTime.

You may be experiencing a barrier problem.

Can you please copy/paste /etc/fstab here.

If you don't see barrier=0 there is a fix compliments of Tear that you can copy/paste in a Terminal window after pausing and quitting folding.

Copy and paste the following as one complete line(triple click inside the code box or click on Select All above the code box so that everything in the code box is highlighted and then right-click and select copy ... right click in a Terminal window, select paste and press Enter ... you'll need to enter your Ubuntu password when prompted for it):

Code: Select all

sudo sed -ri '/barrier=/ {p;d}; /ext[34]/ {s/([ \t]ext[34][ \t]+)([^ \t]+)([ \t])/\1\2,barrier=0\3/}' /etc/fstab
If you decide to try this you have to reboot afterwards.
Image
NickOfTime
Posts: 12
Joined: Thu Nov 08, 2012 1:09 am

Re: Long Delay Between Done and Shutting Down Core

Post by NickOfTime »

Code: Select all

UUID=51b13c14-b298-46cb-bf10-571a92bd15b7 /               ext4    errors=remount-ro 0       1
# swap was on /dev/sda5 during installation
UUID=1f0081d3-9c12-4845-a23e-f469ac1ff1da none            swap    sw              0       0
/dev/fd0        /media/floppy0  auto    rw,user,noauto,exec,utf8 0       0
Ok, Will add barrier=0 and reboot and see what happens...
Macaholic
Site Moderator
Posts: 811
Joined: Thu Nov 29, 2007 11:57 pm
Location: 1 Infinite Loop

Re: Long Delay Between Done and Shutting Down Core

Post by Macaholic »

Using ext4. Check the thread here.
Fold! It does a body good!™
NickOfTime
Posts: 12
Joined: Thu Nov 08, 2012 1:09 am

Re: Long Delay Between Done and Shutting Down Core

Post by NickOfTime »

Yep, Changing Barrier=0 Fixes the problem.
Post Reply