Project: 2619 (Run 1, Clone 870, Gen 0)
Posted: Mon Apr 21, 2008 10:43 am
I was issued this WU on April 6.
It was processing on my office PC at work. I was about to take two weeks off, so I decided to shut the PC down for that period. I added -oneunit to fah in the hope that it would complete the unit and then shut down. Unfortunately, it crashed:
It looked like it was about to download a new unit, so I quickly shut it down.
Two weeks later:
The same WU! This time it finished just fine, and I was credited as well. My question is: what happened to this WU during the 10-day period when my PC was off? Since I was unable to complete it the first time around, it should have passed the preferred deadline (April 10) and been assigned to someone else. Yet it was waiting for me on April 17. How is this possible?
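For anyone who wants to check that two sessions really worked on the same WU, comparing the Project/Run/Clone/Gen tuple printed by the core is enough. Here is a minimal sketch in Python, assuming the log line keeps the `Project: P (Run R, Clone C, Gen G)` shape seen in my logs. `wu_ids` is a hypothetical helper name, not something the client provides, and the two sample lines are copied from the April 6 and April 17 sessions.

```python
import re

# Matches the identifier line the core prints, e.g.
# "Project: 2619 (Run 1, Clone 870, Gen 0)"
WU_ID = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")

def wu_ids(log_text):
    """Return every (project, run, clone, gen) tuple found in a log."""
    return [tuple(int(g) for g in m.groups()) for m in WU_ID.finditer(log_text)]

# One identifier line from each session, copied from the logs.
april_6_log = "[02:39:26] Project: 2619 (Run 1, Clone 870, Gen 0)"
april_17_log = "[11:29:00] Project: 2619 (Run 1, Clone 870, Gen 0)"

same = set(wu_ids(april_6_log)) == set(wu_ids(april_17_log))
print("same WU" if same else "different WUs")
```

In practice you would read the whole FAHlog text into the two strings instead of a single line each.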
Code:
[02:39:01] Core required: FahCore_a2.exe
[02:39:01] Core found.
[02:39:01] Working on Unit 06 [April 6 02:39:01]
[02:39:01] + Working ...
[02:39:01] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 06 -checkpoint 15 -forceasm -verbose -lifeline 3951 -version 602'
[02:39:02]
[02:39:02] *------------------------------*
[02:39:02] Folding@Home Gromacs SMP Core
[02:39:02] Version 1.91 (2007)
[02:39:02]
[02:39:02] Preparing to commence simulation
[02:39:02] - Ensuring status. Please wait.
[02:39:19] - Assembly optimizations manually forced on.
[02:39:19] - Not checking prior termination.
[02:39:19] Error: Work unit read from disk is invalid
[02:39:19] Finalizing output
[02:39:24] - Expanded 7865616 -> 48331685 (decompressed 68.4 percent)
[02:39:26]
[02:39:26] Project: 2619 (Run 1, Clone 870, Gen 0)
snip
[10:26:58] Completed 114380 out of 125000 steps (92%)
[10:42:07] Timer requesting checkpoint
[10:47:51] Completed 115630 out of 125000 steps (93%)
[11:03:01] Timer requesting checkpoint
[11:08:44] Completed 116880 out of 125000 steps (94%)
[11:23:54] Timer requesting checkpoint
[11:26:49] ***** Got a SIGTERM signal (15)
[11:26:49] Killing all core threads
Folding@Home Client Shutdown.
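The progress lines at the end of this log give a rough idea of how much work was left when the client was stopped. Below is a minimal sketch in Python, assuming the lines keep the `[HH:MM:SS] Completed N out of M steps` shape shown above and that the timestamps used all fall on the same day. `parse_progress` and `estimate_remaining` are hypothetical helper names, not part of the client.

```python
import re
from datetime import datetime, timedelta

# Last few progress lines from the log above.
LOG = """\
[10:26:58] Completed 114380 out of 125000 steps (92%)
[10:42:07] Timer requesting checkpoint
[10:47:51] Completed 115630 out of 125000 steps (93%)
[11:03:01] Timer requesting checkpoint
[11:08:44] Completed 116880 out of 125000 steps (94%)
"""

PROGRESS = re.compile(r"^\[(\d\d:\d\d:\d\d)\] Completed (\d+) out of (\d+) steps")

def parse_progress(text):
    """Return (time, steps_done, steps_total) for each progress line."""
    rows = []
    for line in text.splitlines():
        m = PROGRESS.match(line)
        if m:
            t = datetime.strptime(m.group(1), "%H:%M:%S")
            rows.append((t, int(m.group(2)), int(m.group(3))))
    return rows

def estimate_remaining(rows):
    """Extrapolate the time left from the first and last progress lines."""
    (t0, s0, total), (t1, s1, _) = rows[0], rows[-1]
    sec_per_step = (t1 - t0).total_seconds() / (s1 - s0)
    return timedelta(seconds=round(sec_per_step * (total - s1)))

rows = parse_progress(LOG)
remaining = estimate_remaining(rows)
print(f"estimated time left after the last progress line: {remaining}")
```

With these three lines the estimate comes out at a little over two hours of work left after the 94% mark, which is why finishing the unit with -oneunit looked feasible.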
Code:
--- Opening Log file [April 7 11:31:20]
# SMP Client ##################################################################
###############################################################################
Folding@Home Client Version 6.02beta
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: /home/r3d/foldingathome/CPU1
Executable: ./fah6
Arguments: -smp -oneunit -verbosity 9
[11:31:20] - Ask before connecting: No
[11:31:20] - User name: Ren02 (Team 385)
[11:31:20] - User ID: 3A14A43C1E9D8348
[11:31:20] - Machine ID: 1
[11:31:20]
[11:31:21] Loaded queue successfully.
[11:31:21]
[11:31:21] + Processing work unit
[11:31:21] Core required: FahCore_a2.exe
[11:31:21] Core found.
[11:31:21] - Autosending finished units...
[11:31:21] Trying to send all finished work units
[11:31:21] + No unsent completed units remaining.
[11:31:21] - Autosend completed
[11:31:21] Working on Unit 06 [April 7 11:31:21]
[11:31:21] + Working ...
[11:31:21] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 06 -checkpoint 15 -verbose -lifeline 879 -version 602'
[11:31:21]
[11:31:21] *------------------------------*
[11:31:21] Folding@Home Gromacs SMP Core
[11:31:21] Version 1.91 (2007)
[11:31:21]
[11:31:21] Preparing to commence simulation
[11:31:21] - Ensuring status. Please wait.
[11:31:38] - Looking at optimizations...
[11:31:38] - Working with standard loops on this execution.
[11:31:38] - Created dyn
[11:31:38] - Files status OK
[11:31:38] Error: Work unit read from disk is invalid
[11:31:38] Finalizing output
[11:31:41] - Expanded 7865616 -> 48331685 (decompressed 68.4 percent)
[11:31:42]
[11:31:42] Project: 2619 (Run 1, Clone 870, Gen 0)
[11:31:42]
[11:31:42] Entering M.D.
[11:31:49] Will resume from checkpoint file
[11:32:20] (0%)
[11:32:25] CoreStatus = FF (255)
[11:32:25] Client-core communications error: ERROR 0xff
[11:32:25] Deleting current work unit & continuing...
[11:34:02] ***** Got an Activate signal (2)
[11:34:02] Killing all core threads
Folding@Home Client Shutdown.
[11:34:02] - Warning: Could not delete all work unit files (6): Core file absent
[11:34:02] Trying to send all finished work units
[11:34:02] + No unsent completed units remaining.
[11:34:02] - Preparing to get new work unit...
[11:34:02] + Attempting to get work packet
[11:34:02] - Will indicate memory of 1003 MB
[11:34:02] - Detect CPU. Vendor: AuthenticAMD, Family: 15, Model: 11, Stepping: 2
[11:34:02] - Connecting to assignment server
[11:34:02] Connecting to http://assign.stanford.edu:8080/
Two weeks later:
Code:
--- Opening Log file [April 17 11:28:09]
# SMP Client ##################################################################
###############################################################################
Folding@Home Client Version 6.02beta
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: /home/r3d/foldingathome/CPU1
Executable: /home/r3d/foldingathome/CPU1/fah6
Arguments: -local -forceasm -smp -verbosity 9
Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.
[11:28:09] - Ask before connecting: No
[11:28:09] - User name: Ren02 (Team 385)
[11:28:09] - User ID: 3A14A43C1E9D8348
[11:28:09] - Machine ID: 1
[11:28:09]
[11:28:09] Loaded queue successfully.
[11:28:09] - Autosending finished units...
[11:28:09] Trying to send all finished work units
[11:28:09] + No unsent completed units remaining.
[11:28:09] - Autosend completed
[11:28:09] - Preparing to get new work unit...
[11:28:09] + Attempting to get work packet
[11:28:09] - Will indicate memory of 1003 MB
[11:28:09] - Detect CPU. Vendor: AuthenticAMD, Family: 15, Model: 11, Stepping: 2
[11:28:09] - Connecting to assignment server
[11:28:09] Connecting to http://assign.stanford.edu:8080/
[11:28:10] Posted data.
[11:28:10] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[11:28:10] + News From Folding@Home: Welcome to Folding@Home
[11:28:10] Loaded queue successfully.
[11:28:10] Connecting to http://171.64.65.56:8080/
[11:28:19] Posted data.
[11:28:19] Initial: 0000; - Receiving payload (expected size: 7866128)
[11:28:34] - Downloaded at ~512 kB/s
[11:28:34] - Averaged speed for that direction ~559 kB/s
[11:28:34] + Received work.
[11:28:34] + Closed connections
[11:28:34]
[11:28:34] + Processing work unit
[11:28:34] Core required: FahCore_a2.exe
[11:28:34] Core found.
[11:28:34] Working on Unit 07 [April 17 11:28:34]
[11:28:34] + Working ...
[11:28:34] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 07 -checkpoint 15 -forceasm -verbose -lifeline 4023 -version 602'
[11:28:35]
[11:28:35] *------------------------------*
[11:28:35] Folding@Home Gromacs SMP Core
[11:28:35] Version 1.91 (2007)
[11:28:35]
[11:28:35] Preparing to commence simulation
[11:28:35] - Ensuring status. Please wait.
[11:28:52] - Assembly optimizations manually forced on.
[11:28:52] - Not checking prior termination.
[11:28:52] Error: Work unit read from disk is invalid
[11:28:52] Finalizing output
[11:28:57] - Expanded 7865616 -> 48331685 (decompressed 68.4 percent)
[11:29:00]
[11:29:00] Project: 2619 (Run 1, Clone 870, Gen 0)
snip
[22:52:46] Completed 123130 out of 125000 steps (99%)
[23:07:56] Timer requesting checkpoint
[23:14:20] Completed 124380 out of 125000 steps (100%)
[23:26:08]
[23:26:08] Finished Work Unit:
[23:26:08] - Reading up to 7393252 from "work/wudata_07.trr": Read 7393252
[23:26:09] - Reading up to 10364620 from "work/wudata_07.xtc": Read 10364620
[23:26:10] logfile size: 63884
[23:26:10] Leaving Run
[23:26:14] - Writing 17907124 bytes of core data to disk...
[23:26:24] Done: 17906612 -> 17383643 (compressed to 97.0 percent)
[23:26:24] ... Done.
[23:26:28] - Shutting down core
[23:28:08] - Autosending finished units...
[23:28:08] Trying to send all finished work units
[23:28:08] + No unsent completed units remaining.
[23:28:08] - Autosend completed
[23:28:28]
[23:28:28] Folding@home Core Shutdown: FINISHED_UNIT
[23:31:29] CoreStatus = 64 (100)
[23:31:29] Unit 7 finished with 62 percent of time to deadline remaining.
[23:31:29] Updated performance fraction: 0.582747
[23:31:29] Sending work to server
[23:31:29] + Attempting to send results
[23:31:29] - Reading file work/wuresults_07.dat from core
[23:31:30] (Read 17384155 bytes from disk)
[23:31:30] Connecting to http://171.64.65.56:8080/
[23:31:53] Posted data.
[23:31:54] Initial: 0000; - Uploaded at ~707 kB/s
[23:31:54] - Averaged speed for that direction ~808 kB/s
[23:31:54] + Results successfully sent
[23:31:54] Thank you for your contribution to Folding@Home.
[23:31:54] + Number of Units Completed: 16
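The "62 percent of time to deadline remaining" line can be cross-checked against the preferred deadline. This is a rough sketch in Python under two assumptions that the log does not state outright: the deadline window is 4 days (issued April 6, preferred deadline April 10), and the unit finished on April 18, the day after it was downloaded (the log only prints clock times, but at roughly 21 minutes per percent the run takes about 35 hours). `percent_remaining` is a hypothetical helper name.

```python
from datetime import datetime, timedelta

# Download time is from the log; the finish DATE (April 18) is an assumption,
# since the log only shows the clock time 23:31:29.
downloaded = datetime(2008, 4, 17, 11, 28, 34)
finished = datetime(2008, 4, 18, 23, 31, 29)

# Assumed deadline window: April 6 issue date to April 10 preferred deadline.
deadline = timedelta(days=4)

def percent_remaining(start, end, allowed):
    """Share of the allowed window still unused when the unit finished."""
    used = (end - start) / allowed  # timedelta / timedelta gives a float
    return (1.0 - used) * 100.0

pct = percent_remaining(downloaded, finished, deadline)
print(f"{pct:.1f}% of the deadline window remaining")
```

Under those assumptions the result is consistent with the 62 percent the client reported, which suggests the deadline clock was counted from the April 17 download rather than from April 6.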