No Bonus for 6900
Posted: Sun Nov 28, 2010 6:32 pm
I complete my first 6900 unit in a similar time to my usual for bigadv units, but received no bonus. I got 8,955 points. My previous bigadv unit was slightly slower and earned 65,372.
Community driven support forum for Folding@home
https://foldingforum.org/
Code: Select all
[20:50:11] Completed 177500 out of 250000 steps (71%)
[21:30:33] Completed 180000 out of 250000 steps (72%)
[21:54:36] ***** Got a SIGTERM signal (15)
[21:54:36] Killing all core threads
Folding@Home Client Shutdown.
--- Opening Log file [November 25 21:55:45 UTC]
# Mac OS X SMP Console Edition ################################################
###############################################################################
Folding@Home Client Version 6.29r3
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: /Users/stephen/Library/Folding@home
Executable: /usr/local/fah/fah6
Arguments: -smp 8 -verbosity 9 -bigadv
[21:55:45] - Ask before connecting: No
[21:55:45] - User name: stephen123 (Team 1971)
[21:55:45] - User ID: XXXXXXXXXX
[21:55:45] - Machine ID: 1
[21:55:45]
[21:55:46] Loaded queue successfully.
[21:55:46]
[21:55:46] - Autosending finished units... [21:55:46][21:55:46] + Processing work unit
Trying to send all finished work units
[21:55:46] Core required: FahCore_a3.exe
[21:55:46] Core found.
[21:55:46] + No unsent completed units remaining.
[21:55:46] - Autosend completed
[21:55:46] Working on queue slot 01 [November 25 21:55:46 UTC]
[21:55:46] + Working ...
[21:55:46] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 01 -np 8 -checkpoint 5 -verbose -lifeline 80 -version 629'
[21:55:46]
[21:55:46] *------------------------------*
[21:55:46] Folding@Home Gromacs SMP Core
[21:55:46] Version 2.22 (May 7 2010)
[21:55:46]
[21:55:46] Preparing to commence simulation
[21:55:46] - Looking at optimizations...
[21:55:46] - Files status OK
[21:55:49] - Expanded 24861359 -> 30796293 (decompressed 123.8 percent)
[21:55:49] Called DecompressByteArray: compressed_data_size=24861359 data_size=30796293, decompressed_data_size=30796293 diff=0
[21:55:49] - Digital signature verified
[21:55:49]
[21:55:49] Project: 6900 (Run 10, Clone 11, Gen 1)
[21:55:49]
[21:55:50] Assembly optimizations on if available.
[21:55:50] Entering M.D.
[21:55:56] Using Gromacs checkpoints
[21:56:06] fcSaveRestoreState: I/O failed dir=0, var=B068FFB4, varsize=20
[21:56:06] fcCheckPointResume: failure in call to fcSaveRestoreState() to restore cpt hash.
[21:56:07] fcSaveRestoreState: I/O failed dir=0, var=B058BFB4, varsize=20
[21:56:07] fcCheckPointResume: failure in call to fcSaveRestoreState() to restore cpt hash.
[21:56:07] fcSaveRestoreState: I/O failed dir=0, var=B060DFB4, varsize=20
[21:56:07] fcCheckPointResume: failure in call to fcSaveRestoreState() to restore cpt hash.
[21:56:07] mdrun returned 3
[21:56:07] Gromacs detected an invalid checkpoint. Restarting...fcSaveRestoreState: I/O failed dir=0, var=B0383FB4, varsize=20
[21:56:08] fcCheckPointResume: failure in call to fcSaveRestoreState() to restore cpt hash.
[21:56:08] fcSaveRestoreState: I/O failed dir=0, var=B0509FB4, varsize=20
[21:56:08] fcCheckPointResume: failure in call to fcSaveRestoreState() to restore cpt hash.
[21:56:09] Can't open checkpoint file
[21:56:09] Can't open checkpoint file
[21:56:09] Resuming from checkpoint
[21:56:09] Can't open checkpoint file
[21:56:32]
[21:56:32] Folding@home Core Shutdown: UNKNOWN_ERROR
[21:56:32] CoreStatus = 62 (98)
[21:56:32] + Restarting core (settings changed)
[21:56:32]
[21:56:32] + Processing work unit
[21:56:32] Core required: FahCore_a3.exe
[21:56:32] Core found.
[21:56:32] Working on queue slot 01 [November 25 21:56:32 UTC]
[21:56:32] + Working ...
[21:56:32] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 01 -np 8 -checkpoint 5 -notermcheck -verbose -lifeline 80 -version 629'
[21:56:33]
[21:56:33] *------------------------------*
[21:56:33] Folding@Home Gromacs SMP Core
[21:56:33] Version 2.22 (May 7 2010)
[21:56:33]
[21:56:33] Preparing to commence simulation
[21:56:33] - Looking at optimizations...
[21:56:33] - Not checking prior termination.
[21:56:35] - Expanded 24861359 -> 30796293 (decompressed 123.8 percent)
[21:56:35] Called DecompressByteArray: compressed_data_size=24861359 data_size=30796293, decompressed_data_size=30796293 diff=0
[21:56:36] - Digital signature verified
[21:56:36]
[21:56:36] Project: 6900 (Run 10, Clone 11, Gen 1)
[21:56:36]
[21:56:36] Assembly optimizations on if available.
[21:56:36] Entering M.D.
[21:56:48] Completed 0 out of 250000 steps (0%)
[22:34:59] Completed 2500 out of 250000 steps (1%)
I've never seen that error either, but it does make sense. While you were upgrading, you probably failed to allow the OS to complete the shutdown process normally and parts of the checkpoint file were still in cache when you killed the power. Whether that's what happened or not, FAH detected an invalid checkpoint and had to start over, just as you said.stephen123 wrote:OK, thanks. That means the unit downloaded, ran, failed, started over from scratch and ran again without acquiring a new unit.
I'm including a part of my log in this post, because the failure mode does not look familiar to me:Code: Select all
[21:56:07] mdrun returned 3 [21:56:07] Gromacs detected an invalid checkpoint. [21:56:09] Can't open checkpoint file [21:56:32] [21:56:32] Folding@home Core Shutdown: UNKNOWN_ERROR [21:56:32] CoreStatus = 62 (98) [21:56:32] + Restarting core (settings changed) [21:56:32] [21:56:32] + Processing work unit [21:56:32] Core required: FahCore_a3.exe [21:56:32] Core found. [21:56:32] Working on queue slot 01 [November 25 21:56:32 UTC] [21:56:32] + Working ... [21:56:32] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 01 -np 8 -checkpoint 5 -notermcheck -verbose -lifeline 80 -version 629' [21:56:33] [21:56:33] *------------------------------* [21:56:33] Folding@Home Gromacs SMP Core [21:56:33] Version 2.22 (May 7 2010) [21:56:33] [21:56:33] Preparing to commence simulation [21:56:33] - Looking at optimizations... [21:56:33] - Not checking prior termination. [21:56:35] - Expanded 24861359 -> 30796293 (decompressed 123.8 percent) [21:56:35] Called DecompressByteArray: compressed_data_size=24861359 data_size=30796293, decompressed_data_size=30796293 diff=0 [21:56:36] - Digital signature verified [21:56:36] [21:56:36] Project: 6900 (Run 10, Clone 11, Gen 1) [21:56:36] [21:56:36] Assembly optimizations on if available. [21:56:36] Entering M.D. [21:56:48] Completed 0 out of 250000 steps (0%) [22:34:59] Completed 2500 out of 250000 steps (1%)