smp folding is unstable

Posted: Wed Jul 31, 2013 3:11 am
by beer
Hi,
Lately the SMP folding on my i7 has become unstable. Windows shows a dialog saying the program has stopped working, along with an error message. Nothing happens until I manually press the OK button. Then I either get a new WU or it restarts from a checkpoint. Any idea why?
Problem signature:
Problem Event Name: APPCRASH
Application Name: FahCore_a3.exe
Application Version: 0.0.0.0
Application Timestamp: 4d4720af
Fault Module Name: FahCore_a3.exe
Fault Module Version: 0.0.0.0
Fault Module Timestamp: 4d4720af
Exception Code: c0000005
Exception Offset: 002649d0
OS Version: 6.1.7601.2.1.0.256.48
Locale ID: 1030
Additional Information 1: 0a9e
Additional Information 2: 0a9e372d3b4ad19135b953a78882e789
Additional Information 3: 0a9e
Additional Information 4: 0a9e372d3b4ad19135b953a78882e789

Read our privacy statement online:
http://go.microsoft.com/fwlink/?linkid= ... cid=0x0409

If the online privacy statement is not available, please read our privacy statement offline:
C:\Windows\system32\en-US\erofflps.txt

03:04:14:WARNING:WU02:FS01:FahCore returned an unknown error code which probably indicates that it crashed
03:04:14:WARNING:WU02:FS01:FahCore returned: UNKNOWN_ENUM (-1073741819 = 0xc0000005)
03:04:15:WU02:FS01:Starting
03:04:15:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/beer/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 02 -suffix 01 -version 703 -lifeline 2920 -checkpoint 15 -np 6
03:04:15:WU02:FS01:Started FahCore on PID 4316
03:04:15:WU02:FS01:Core PID:668
03:04:15:WU02:FS01:FahCore 0xa3 started
03:04:15:WU02:FS01:0xa3:
03:04:15:WU02:FS01:0xa3:*------------------------------*
03:04:15:WU02:FS01:0xa3:Folding@Home Gromacs SMP Core
03:04:15:WU02:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
03:04:15:WU02:FS01:0xa3:
03:04:15:WU02:FS01:0xa3:Preparing to commence simulation
03:04:15:WU02:FS01:0xa3:- Ensuring status. Please wait.
03:04:24:WU02:FS01:0xa3:- Looking at optimizations...
03:04:24:WU02:FS01:0xa3:- Working with standard loops on this execution.
03:04:24:WU02:FS01:0xa3:- Previous termination of core was improper.
03:04:24:WU02:FS01:0xa3:- Going to use standard loops.
03:04:24:WU02:FS01:0xa3:- Files status OK
03:04:25:WU02:FS01:0xa3:- Expanded 3850183 -> 4393460 (decompressed 114.1 percent)
03:04:25:WU02:FS01:0xa3:Called DecompressByteArray: compressed_data_size=3850183 data_size=4393460, decompressed_data_size=4393460 diff=0
03:04:25:WU02:FS01:0xa3:- Digital signature verified
03:04:25:WU02:FS01:0xa3:
03:04:25:WU02:FS01:0xa3:Project: 8573 (Run 0, Clone 8, Gen 140)
03:04:25:WU02:FS01:0xa3:
03:04:25:WU02:FS01:0xa3:Entering M.D.
03:04:31:WU02:FS01:0xa3:Using Gromacs checkpoints
03:04:31:WU02:FS01:0xa3:Mapping NT from 6 to 6 
03:04:31:WU02:FS01:0xa3:Resuming from checkpoint
03:04:31:WU02:FS01:0xa3:Verified 02/wudata_01.log
03:04:31:WU02:FS01:0xa3:Verified 02/wudata_01.trr
03:04:31:WU02:FS01:0xa3:Verified 02/wudata_01.edr
03:04:32:WU02:FS01:0xa3:Completed 14000 out of 500000 step

*********************** Log Started 2013-07-30T19:37:49Z ***********************
19:37:49:************************* Folding@home Client *************************
19:37:49:      Website: http://folding.stanford.edu/
19:37:49:    Copyright: (c) 2009-2013 Stanford University
19:37:49:       Author: Joseph Coffland <[email protected]>
19:37:49:         Args: 
19:37:49:       Config: C:/Users/beer/AppData/Roaming/FAHClient/config.xml
19:37:49:******************************** Build ********************************
19:37:49:      Version: 7.3.6
19:37:49:         Date: Feb 18 2013
19:37:49:         Time: 15:25:17
19:37:49:      SVN Rev: 3923
19:37:49:       Branch: fah/trunk/client
19:37:49:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
19:37:49:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
19:37:49:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
19:37:49:     Platform: win32 XP
19:37:49:         Bits: 32
19:37:49:         Mode: Release
19:37:49:******************************* System ********************************
19:37:49:          CPU: Intel(R) Core(TM) i7-4770S CPU @ 3.10GHz
19:37:49:       CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
19:37:49:         CPUs: 8
19:37:49:       Memory: 3.94GiB
19:37:49:  Free Memory: 3.19GiB
19:37:49:      Threads: WINDOWS_THREADS
19:37:49:  Has Battery: false
19:37:49:   On Battery: false
19:37:49:   UTC offset: 2
19:37:49:          PID: 2920
19:37:49:          CWD: C:/Users/beer/AppData/Roaming/FAHClient
19:37:49:           OS: Windows 7 Professional
19:37:49:      OS Arch: AMD64
19:37:49:         GPUs: 1
19:37:49:        GPU 0: NVIDIA:3 GK104 [GeForce GTX 660 Ti]
19:37:49:         CUDA: 3.0
19:37:49:  CUDA Driver: 5050
19:37:49:Win32 Service: false
19:37:49:***********************************************************************
19:37:49:<config>
19:37:49:  <!-- Folding Slot Configuration -->
19:37:49:  <power v='full'/>
19:37:49:
19:37:49:  <!-- Network -->
19:37:49:  <proxy v=':8080'/>
19:37:49:
19:37:49:  <!-- User Information -->
19:37:49:  <passkey v='********************************'/>
19:37:49:  <user v='jonasvejlin'/>
19:37:49:
19:37:49:  <!-- Folding Slots -->
19:37:49:  <slot id='0' type='GPU'>
19:37:49:    <client-type v='advanced'/>
19:37:49:  </slot>
19:37:49:  <slot id='1' type='CPU'>
19:37:49:    <cpus v='6'/>
19:37:49:  </slot>
19:37:49:</config>
19:37:49:Trying to access database...
19:37:49:Successfully acquired database lock
19:37:49:Enabled folding slot 00: READY gpu:0:GK104 [GeForce GTX 660 Ti]
19:37:49:Enabled folding slot 01: READY cpu:6
19:37:49:WU00:FS00:Starting
19:37:49:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/beer/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 703 -lifeline 2920 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
19:37:49:WU00:FS00:Started FahCore on PID 3352
19:37:49:WU00:FS00:Core PID:3368
19:37:49:WU00:FS00:FahCore 0x17 started
19:37:49:WU02:FS01:Starting
19:37:49:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/beer/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 02 -suffix 01 -version 703 -lifeline 2920 -checkpoint 15 -np 6
19:37:49:WU02:FS01:Started FahCore on PID 3380
19:37:49:WU02:FS01:Core PID:3400
19:37:49:WU02:FS01:FahCore 0xa3 started
19:37:50:WU00:FS00:0x17:*********************** Log Started 2013-07-30T19:37:50Z ***********************
19:37:50:WU00:FS00:0x17:Project: 7810 (Run 0, Clone 26, Gen 27)
19:37:50:WU00:FS00:0x17:Unit: 0x0000001f0a3b1e8651d2320c5870f43f
19:37:50:WU00:FS00:0x17:CPU: 0x00000000000000000000000000000000
19:37:50:WU00:FS00:0x17:Machine: 0
19:37:50:WU00:FS00:0x17:Digital signatures verified
19:37:50:WU00:FS00:0x17:  Found a checkpoint file
19:37:50:WU02:FS01:0xa3:
19:37:50:WU02:FS01:0xa3:*------------------------------*
19:37:50:WU02:FS01:0xa3:Folding@Home Gromacs SMP Core
19:37:50:WU02:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
19:37:50:WU02:FS01:0xa3:
19:37:50:WU02:FS01:0xa3:Preparing to commence simulation
19:37:50:WU02:FS01:0xa3:- Ensuring status. Please wait.
19:37:59:WU02:FS01:0xa3:- Looking at optimizations...
19:37:59:WU02:FS01:0xa3:- Working with standard loops on this execution.
19:37:59:WU02:FS01:0xa3:- Previous termination of core was improper.
19:37:59:WU02:FS01:0xa3:- Files status OK
19:37:59:WU02:FS01:0xa3:- Expanded 3850183 -> 4393460 (decompressed 114.1 percent)
19:37:59:WU02:FS01:0xa3:Called DecompressByteArray: compressed_data_size=3850183 data_size=4393460, decompressed_data_size=4393460 diff=0
19:37:59:WU02:FS01:0xa3:- Digital signature verified
19:37:59:WU02:FS01:0xa3:
19:37:59:WU02:FS01:0xa3:Project: 8573 (Run 0, Clone 8, Gen 140)
19:37:59:WU02:FS01:0xa3:
19:37:59:WU02:FS01:0xa3:Entering M.D.
19:38:05:WU02:FS01:0xa3:Using Gromacs checkpoints
19:38:05:WU02:FS01:0xa3:Mapping NT from 6 to 6 
19:38:06:WU02:FS01:0xa3:Resuming from checkpoint
19:38:06:WU02:FS01:0xa3:Verified 02/wudata_01.log
19:38:06:WU02:FS01:0xa3:Verified 02/wudata_01.trr
19:38:06:WU02:FS01:0xa3:Verified 02/wudata_01.edr
Mod edit: quote tags changed to code tags
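
For anyone who wants to see how often the core crashes, the warning lines in the client log can be pulled out with a short script. This is only a rough sketch: the regular expression is modeled on the two WARNING lines quoted above, and the names `CRASH_RE` and `find_crashes` are made up for illustration, not part of the FAH client.

```python
import re

# Pattern modeled on the "FahCore returned: ..." warning quoted in this thread.
# Assumption: other client versions may format this line differently.
CRASH_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):WARNING:(?P<wu>WU\d+):(?P<slot>FS\d+):"
    r"FahCore returned: \w+ \((?P<code>-?\d+) = (?P<hex>0x[0-9a-fA-F]+)\)"
)


def find_crashes(lines):
    """Return (time, work unit, slot, exit code) for each crash warning."""
    crashes = []
    for line in lines:
        match = CRASH_RE.match(line.strip())
        if match:
            crashes.append(
                (match["time"], match["wu"], match["slot"], int(match["code"]))
            )
    return crashes


# Sample lines copied from the log in the first post.
sample = [
    "03:04:14:WARNING:WU02:FS01:FahCore returned an unknown error code "
    "which probably indicates that it crashed",
    "03:04:14:WARNING:WU02:FS01:FahCore returned: UNKNOWN_ENUM "
    "(-1073741819 = 0xc0000005)",
    "03:04:15:WU02:FS01:Starting",
]

for crash in find_crashes(sample):
    print(crash)
```

Feeding it the lines of a whole log file instead of `sample` would list every crash with its timestamp, which makes it easier to tell whether the crashes cluster around a particular work unit or time of day.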

Re: smp folding is unstable

Posted: Wed Jul 31, 2013 3:14 am
by 7im
The c0000005 error is typically a Windows crash, memory-related if memory serves. Reduce the overclock (OC) and run Memtest. Search the forum for more examples of the error and solutions.
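
The log in the first post shows the same code twice, as `-1073741819` and as `0xc0000005`. Those are one 32-bit value read as signed and as unsigned. A minimal sketch of the conversion follows; the helper names and the small lookup table are illustrative assumptions, and the table lists only two well-known Windows status codes.

```python
# Hypothetical lookup table: only a couple of well-known Windows status codes.
KNOWN_NTSTATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC00000FD: "STATUS_STACK_OVERFLOW",
}


def to_ntstatus(exit_code: int) -> int:
    """Reinterpret a signed 32-bit exit code as an unsigned 32-bit value."""
    return exit_code & 0xFFFFFFFF


def describe(exit_code: int) -> str:
    """Format the exit code as hex, with a name if it is in the table."""
    status = to_ntstatus(exit_code)
    name = KNOWN_NTSTATUS.get(status, "unknown")
    return f"0x{status:08x} ({name})"


# Exit code taken from the WARNING line in the first post.
print(describe(-1073741819))  # prints "0xc0000005 (STATUS_ACCESS_VIOLATION)"
```

An access violation means the process touched memory it was not allowed to, which is consistent with unstable RAM or an overclock but does not prove it.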

Re: smp folding is unstable

Posted: Wed Jul 31, 2013 3:21 am
by N0OA
Are you over-clocking the machine?

N0OA

Re: smp folding is unstable

Posted: Wed Jul 31, 2013 6:23 am
by beer
I ran memtest for 30 minutes without finding anything wrong.
It might be because the BIOS/UEFI detected the RAM as DDR3-1333 when it is actually DDR3-1066 (i.e. automatic overclocking). I have changed the setting to DDR3-1066 and hope that solves the problem. (I will report back when I come home tonight on whether that was the cause.)

Re: smp folding is unstable

Posted: Wed Jul 31, 2013 12:11 pm
by PantherX
beer wrote:I did run memtest for 30 min without finding anything wrong...
FYI, memtest is usually run overnight, i.e. at least 8 hours for a few passes.

Re: smp folding is unstable

Posted: Wed Jul 31, 2013 4:38 pm
by beer
PantherX: Oh, I did not know that. The last time I suspected bad RAM, I only had to run it for a few minutes before memtest detected errors.

Re: smp folding is unstable

Posted: Wed Jul 31, 2013 6:00 pm
by 7im
Depends on how thoroughly you want to test the memory.

One or two passes are enough to catch serious memory errors. When I build a new system, I test for 24 hours to allow the ambient room temperature to cycle up and down. If I am getting intermittent errors, or errors with inconsistent timing, I let it run for several hours just to be certain.

Sounds like the memory speed OC setting could have been the issue, and not bad memory. But if the FAH problem comes back, do run memtest longer.