Project: 7703 (Run 2, Clone 5, Gen 7) fails

Moderators: Site Moderators, FAHC Science Team

Jan van de Velde
Posts: 30
Joined: Wed Aug 27, 2008 4:09 pm
Location: The Netherlands

Project: 7703 (Run 2, Clone 5, Gen 7) fails

Post by Jan van de Velde »

For the second time, Project: 7703 (Run 2, Clone 5, Gen 7) failed somewhere past the halfway point on my machine.

This time, for no apparent reason, it started all over again when I restarted my machine:

Code:

[13:21:16] Called DecompressByteArray: compressed_data_size=1004645 data_size=2267148, decompressed_data_size=2267148 diff=0
[13:21:16] - Digital signature verified
[13:21:16] 
[13:21:16] Project: 7703 (Run 2, Clone 5, Gen 7)
[13:21:16] 
[13:21:19] Assembly optimizations on if available.
[13:21:19] Entering M.D.
[13:21:25] Using Gromacs checkpoints
[13:21:26] Mapping NT from 1 to 1 
[13:21:33] Resuming from checkpoint
[13:21:34] Verified work/wudata_01.log
[13:21:34] Verified work/wudata_01.trr
[13:21:34] Verified work/wudata_01.xtc
[13:21:34] Verified work/wudata_01.edr
[13:21:36] Completed 562800 out of 1000000 steps  (56%)
[13:58:57] Completed 570000 out of 1000000 steps  (57%)
[14:51:15] Completed 580000 out of 1000000 steps  (58%)

Folding@Home Client Shutdown.


--- Opening Log file [December 19 15:03:36 UTC] 


# Windows CPU Systray Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.23

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program FilesFolding@home\CORE1\Folding@home\Folding@home-x86


[15:03:36] - Ask before connecting: No
[15:03:36] - User name: Jan_van_de_Velde (Team 48658)
[15:03:36] - User ID: 63E313039CFE675
[15:03:36] - Machine ID: 2
[15:03:36] 
[15:03:37] Loaded queue successfully.
[15:03:37] Initialization complete
[15:03:37] 
[15:03:37] + Processing work unit
[15:03:37] Core required: FahCore_a4.exe
[15:03:37] Core found.
[15:03:38] Working on queue slot 01 [December 19 15:03:38 UTC]
[15:03:38] + Working ...
[15:03:39] 
[15:03:39] *------------------------------*
[15:03:39] Folding@Home Gromacs GB Core
[15:03:39] Version 2.27 (Dec. 15, 2010)
[15:03:39] 
[15:03:39] Preparing to commence simulation
[15:03:39] - Looking at optimizations...
[15:03:39] - Files status OK
[15:03:40] - Expanded 1004645 -> 2267148 (decompressed 225.6 percent)
[15:03:40] Called DecompressByteArray: compressed_data_size=1004645 data_size=2267148, decompressed_data_size=2267148 diff=0
[15:03:40] - Digital signature verified
[15:03:41] 
[15:03:41] Project: 7703 (Run 2, Clone 5, Gen 7)
[15:03:41] 
[15:03:41] Assembly optimizations on if available.
[15:03:41] Entering M.D.
[15:03:47] Mapping NT from 1 to 1 
[15:03:54] Completed 0 out of 1000000 steps  (0%)
[16:00:41] Completed 10000 out of 1000000 steps  (1%)
[16:57:25] Completed 20000 out of 1000000 steps  (2%)
[17:54:19] Completed 30000 out of 1000000 steps  (3%)
[18:57:10] Completed 40000 out of 1000000 steps  (4%)

As this was the second failure on this same WU (the first was about a week ago), I have deleted the entire work folder, and the machine is now working on another project.

Mod Edit: Changed Quote Tags To Code Tags - PantherX
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud

Re: Project: 7703 (Run 2, Clone 5, Gen 7) fails

Post by PantherX »

The WU isn't a bad one as it was completed by another donor:
Your WU (P7703 R2 C5 G7) was added to the stats database on 2011-12-06 02:08:20 for 825 points of credit.
Jan van de Velde
Posts: 30
Joined: Wed Aug 27, 2008 4:09 pm
Location: The Netherlands

Re: Project: 7703 (Run 2, Clone 5, Gen 7) fails

Post by Jan van de Velde »

Can anyone find a reason in those logs why that WU started all over again?
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 7703 (Run 2, Clone 5, Gen 7) fails

Post by bruce »

Jan van de Velde wrote: Can anyone find a reason in those logs why that WU started all over again?
When it restarts correctly, you see these messages:
[13:21:33] Resuming from checkpoint
[13:21:34] Verified work/wudata_01.log
[13:21:34] Verified work/wudata_01.trr
[13:21:34] Verified work/wudata_01.xtc
[13:21:34] Verified work/wudata_01.edr

Whenever you restart, the checkpoint information is verified, and if it is found to be corrupt, the WU is restarted from the beginning.

Data can be corrupted by another program modifying the files (sometimes an antivirus program), but it can also be corrupted if the OS was shut down in a way that prevented it from writing all FAH data from cache to the hard disk (such as a power failure or BSOD).
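
To make the difference between the two start-ups easier to spot in a long log, here is a minimal sketch in Python. It is not part of the client; `classify_restarts` is a hypothetical helper, and it assumes only the message texts visible in the logs above ("Entering M.D.", "Resuming from checkpoint", "Completed N out of M steps"). For each time the core enters the simulation, it records whether a checkpoint was resumed and what the first reported step count was.

```python
import re

# Matches progress lines such as
# "[13:21:36] Completed 562800 out of 1000000 steps  (56%)"
STEP_RE = re.compile(r"Completed (\d+) out of (\d+) steps")


def classify_restarts(log_text):
    """Return one dict per core start-up found in the log text.

    Each dict has:
      "resumed"    - True if "Resuming from checkpoint" was logged
      "first_step" - first step count reported after start-up, or None
    """
    sessions = []
    current = None
    for line in log_text.splitlines():
        if "Entering M.D." in line:
            # A new start-up of the core begins here.
            current = {"resumed": False, "first_step": None}
            sessions.append(current)
        elif current is None:
            # Ignore anything logged before the first start-up.
            continue
        elif "Resuming from checkpoint" in line:
            current["resumed"] = True
        else:
            match = STEP_RE.search(line)
            if match and current["first_step"] is None:
                current["first_step"] = int(match.group(1))
    return sessions
```

A start-up with `resumed` set to False and `first_step` equal to 0 on a WU that had already made progress is the "started from the beginning" case described above. The sketch only reports that it happened; it cannot tell why the checkpoint was rejected.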
Jan van de Velde
Posts: 30
Joined: Wed Aug 27, 2008 4:09 pm
Location: The Netherlands

Re: Project: 7703 (Run 2, Clone 5, Gen 7) fails

Post by Jan van de Velde »

In other words, no clear reason to be found here? In all the years I have been folding I have had the occasional restart, usually traceable to an improper shutdown (e.g. a power failure, indicated in the logs clearly enough that even I could understand that something had gone horribly wrong). So two restarts from scratch on the same WU, without any clear indication of the reason, were a matter of chance, and the chance of it happening a third time would have been extremely slight.

Well, since this mishap I have turned in a few other WUs without a hitch. So let's keep folding. :ewink:
Joe_H
Site Admin
Posts: 7927
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: Project: 7703 (Run 2, Clone 5, Gen 7) fails

Post by Joe_H »

It does look like no clear reason can be determined from the log file. I can add one other possibility: the checkpoint write could have coincided with the shutdown, which can corrupt it. I have tracked a few restarts from the beginning on my folding machines to that. When I can, I check the modified times on the checkpoint files before shutting down and make sure to give the system a minute or two to flush the data all the way to disk.
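
The modified-time check can be scripted. What follows is a rough sketch in Python, not anything the client provides: `file_ages` and `quiet_long_enough` are hypothetical helpers, the `work` directory name is taken from the log lines earlier in the thread, and the 120-second default is just my reading of "a minute or two". It reports how long ago each file in the work directory was last written.

```python
import os
import time


def file_ages(work_dir, now=None):
    """Return {file name: seconds since last modification} for work_dir."""
    now = time.time() if now is None else now
    ages = {}
    for name in os.listdir(work_dir):
        path = os.path.join(work_dir, name)
        if os.path.isfile(path):
            ages[name] = now - os.path.getmtime(path)
    return ages


def quiet_long_enough(work_dir, min_age_seconds=120, now=None):
    """True if every file in work_dir was last written at least
    min_age_seconds ago. An empty directory counts as not ready."""
    ages = file_ages(work_dir, now)
    return bool(ages) and min(ages.values()) >= min_age_seconds


if __name__ == "__main__":
    # Assumed location: a "work" folder inside the client's launch directory.
    for name, age in sorted(file_ages("work").items()):
        print(f"{name}: last written {age:.0f} s ago")
    print("OK to shut down" if quiet_long_enough("work") else "Wait a bit")
```

Note that this looks at timestamps only. It cannot confirm that the OS has actually flushed its cache to disk, so it is a convenience for the manual check, not a guarantee.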
