Project: 7809 (Run 1, Clone 486, Gen 16)
Posted: Mon Dec 19, 2011 1:09 pm
This SMP work unit stopped early, and I can't work out why. While it was running as Unit 00, V7 appears to have downloaded a GPU work unit using the same number, slot 00 (00:35:51).
I restarted it when I saw the error (perhaps 24 hours later, flagged by a window telling me the core had stopped working). It continued where it left off, then finally gave up with BAD_WORK_UNIT (starting at 11:58:15:Unit 00:mdrun returned 255).
Nothing is overclocked.
Code:
23:00:09:Unit 00:Completed 1170000 out of 1500000 steps (78%)
23:02:02:Unit 02:Completed 39000000 out of 50000000 steps (78%).
23:06:24:Unit 02:Completed 39500000 out of 50000000 steps (79%).
23:10:46:Unit 02:Completed 40000000 out of 50000000 steps (80%).
23:15:08:Unit 02:Completed 40500000 out of 50000000 steps (81%).
23:19:30:Unit 02:Completed 41000000 out of 50000000 steps (82%).
23:23:52:Unit 02:Completed 41500000 out of 50000000 steps (83%).
23:28:14:Unit 02:Completed 42000000 out of 50000000 steps (84%).
23:32:35:Unit 02:Completed 42500000 out of 50000000 steps (85%).
23:36:57:Unit 02:Completed 43000000 out of 50000000 steps (86%).
23:37:56:Unit 00:Completed 1185000 out of 1500000 steps (79%)
23:41:19:Unit 02:Completed 43500000 out of 50000000 steps (87%).
23:45:45:Unit 02:Completed 44000000 out of 50000000 steps (88%).
23:50:18:Unit 02:Completed 44500000 out of 50000000 steps (89%).
23:54:51:Unit 02:Completed 45000000 out of 50000000 steps (90%).
23:59:23:Unit 02:Completed 45500000 out of 50000000 steps (91%).
00:03:56:Unit 02:Completed 46000000 out of 50000000 steps (92%).
00:08:29:Unit 02:Completed 46500000 out of 50000000 steps (93%).
00:13:01:Unit 02:Completed 47000000 out of 50000000 steps (94%).
00:17:34:Unit 02:Completed 47500000 out of 50000000 steps (95%).
00:22:06:Unit 02:Completed 48000000 out of 50000000 steps (96%).
00:26:39:Unit 02:Completed 48500000 out of 50000000 steps (97%).
00:31:13:Unit 02:Completed 49000000 out of 50000000 steps (98%).
00:35:48:Unit 02:Completed 49500000 out of 50000000 steps (99%).
00:35:49:Connecting to assign-GPU.stanford.edu:80
00:35:49:News: Welcome to Folding@Home
00:35:49:Assigned to work server 171.67.108.54
00:35:49:Requesting new work unit for slot 00: RUNNING gpu:0:"GF108 [GeForce GT 430]" from 171.67.108.54
00:35:49:Connecting to 171.67.108.54:8080
00:35:51:Slot 00: Downloading 44.34KiB
00:35:51:Slot 00: Download complete
00:35:51:Received Unit: id:01 state:DOWNLOAD error:OK project:6804 run:7 clone:26 gen:652 core:0x15 unit:0x000002a86652edc64e03a0da2fc5dc2e
00:40:21:Unit 02:Completed 50000000 out of 50000000 steps (100%).
00:40:22:Unit 02:Finished fah_main status=0
00:40:22:Unit 02:Successful run
00:40:22:Unit 02:DynamicWrapper: Finished Work Unit: sleep=10000
00:40:32:Unit 02:Reserved 244640 bytes for xtc file; Cosm status=0
00:40:32:Unit 02:Allocated 244640 bytes for xtc file
00:40:32:Unit 02:- Reading up to 244640 from "02/wudata_01.xtc": Read 244640
00:40:32:Unit 02:Read 244640 bytes from xtc file; available packet space=786185824
00:40:32:Unit 02:xtc file hash check passed.
00:40:32:Unit 02:Reserved 75840 75840 786185824 bytes for arc file=<02/wudata_01.trr> Cosm status=0
00:40:32:Unit 02:Allocated 75840 bytes for arc file
00:40:32:Unit 02:- Reading up to 75840 from "02/wudata_01.trr": Read 75840
00:40:32:Unit 02:Read 75840 bytes from arc file; available packet space=786109984
00:40:32:Unit 02:trr file hash check passed.
00:40:32:Unit 02:Allocated 544 bytes for edr file
00:40:32:Unit 02:Read bedfile
00:40:32:Unit 02:edr file hash check passed.
00:40:32:Unit 02:Allocated 120378 bytes for logfile
00:40:32:Unit 02:Read logfile
00:40:32:Unit 02:GuardedRun: success in DynamicWrapper
00:40:32:Unit 02:GuardedRun: done
00:40:32:Unit 02:Run: GuardedRun completed.
00:40:36:Unit 02:+ Opened results file
00:40:36:Unit 02:- Writing 441914 bytes of core data to disk...
00:40:36:Unit 02:Done: 441402 -> 330656 (compressed to 74.9 percent)
00:40:36:Unit 02: ... Done.
00:40:36:Unit 02:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
00:40:36:Unit 02:Shutting down core
00:40:36:Unit 02:
00:40:36:Unit 02:Folding@home Core Shutdown: FINISHED_UNIT
00:40:37:FahCore, running Unit 02, returned: FINISHED_UNIT (100 = 0x64)
00:40:37:Sending unit results: id:02 state:SEND error:OK project:6800 run:18493 clone:0 gen:500 core:0x15 unit:0x000002470a3b1e644ddbf4cc1db32384
00:40:37:Unit 02: Uploading 323.41KiB to 171.64.65.64
00:40:37:Starting Unit 01
00:40:37:Connecting to 171.64.65.64:8080
00:40:37:Running core: C:/Users/Baz/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -lifeline 4052 -version 701 -checkpoint 6 -gpu 0
00:40:37:Started core on PID 3560
00:40:37:FahCore 0x15 started
00:40:38:Unit 01:
00:40:38:Unit 01:*------------------------------*
00:40:38:Unit 01:Folding@Home GPU Core
00:40:38:Unit 01:Version 2.20 (Tue Aug 2 12:06:37 PDT 2011)
00:40:38:Unit 01:Build host SimbiosNvdWin7
00:40:38:Unit 01:Board Type NVIDIA/CUDA
00:40:38:Unit 01:Core 15
00:40:38:Unit 01:
00:40:38:Unit 01:Window's signal control handler registered.
00:40:38:Unit 01:Preparing to commence simulation
00:40:38:Unit 01:- Looking at optimizations...
00:40:38:Unit 01:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
00:40:38:Unit 01:- Created dyn
00:40:38:Unit 01:- Files status OK
00:40:38:Unit 01:sizeof(CORE_PACKET_HDR) = 512 file=<>
00:40:38:Unit 01:- Expanded 44890 -> 170279 (decompressed 379.3 percent)
00:40:38:Unit 01:Called DecompressByteArray: compressed_data_size=44890 data_size=170279, decompressed_data_size=170279 diff=0
00:40:38:Unit 01:- Digital signature verified
00:40:38:Unit 01:
00:40:38:Unit 01:Project: 6804 (Run 7, Clone 26, Gen 652)
00:40:38:Unit 01:
00:40:38:Unit 01:Assembly optimizations on if available.
00:40:38:Unit 01:Entering M.D.
00:40:39:Unit 01:Tpr hash 01/wudata_01.tpr: 1969052385 98349488 1738012785 4156280720 1892614771
00:40:39:Unit 01:calling fah_main gpuDeviceId=0
00:40:39:Unit 01:Working on ALZHEIMER'S DISEASE AMYLOID
00:40:39:Unit 01:Client config unavailable.
00:40:40:Unit 01:Starting GUI Server
00:40:43:Unit 02: 85.34%
00:40:44:Unit 02: Upload complete
00:40:44:Server responded WORK_ACK (400)
00:40:44:Final credit estimate, 1298.00 points
00:40:45:Cleaning up Unit 02
00:41:43:Unit 01:Setting checkpoint frequency: 500000
00:41:43:Unit 01:Completed 3 out of 50000000 steps (0%).
00:46:14:Unit 01:Completed 500000 out of 50000000 steps (1%).
00:50:47:Unit 01:Completed 1000000 out of 50000000 steps (2%).
00:55:19:Unit 01:Completed 1500000 out of 50000000 steps (3%).
00:59:52:Unit 01:Completed 2000000 out of 50000000 steps (4%).
01:04:08:Unit 01:Completed 2500000 out of 50000000 steps (5%).
01:08:41:Unit 01:Completed 3000000 out of 50000000 steps (6%).
01:13:13:Unit 01:Completed 3500000 out of 50000000 steps (7%).
01:17:45:Unit 01:Completed 4000000 out of 50000000 steps (8%).
01:22:18:Unit 01:Completed 4500000 out of 50000000 steps (9%).
01:26:50:Unit 01:Completed 5000000 out of 50000000 steps (10%).
01:31:23:Unit 01:Completed 5500000 out of 50000000 steps (11%).
01:35:55:Unit 01:Completed 6000000 out of 50000000 steps (12%).
01:40:28:Unit 01:Completed 6500000 out of 50000000 steps (13%).
01:45:00:Unit 01:Completed 7000000 out of 50000000 steps (14%).
01:49:32:Unit 01:Completed 7500000 out of 50000000 steps (15%).
01:54:05:Unit 01:Completed 8000000 out of 50000000 steps (16%).
01:58:37:Unit 01:Completed 8500000 out of 50000000 steps (17%).
02:03:07:Unit 01:Completed 9000000 out of 50000000 steps (18%).
02:07:40:Unit 01:Completed 9500000 out of 50000000 steps (19%).
02:12:12:Unit 01:Completed 10000000 out of 50000000 steps (20%).
02:16:44:Unit 01:Completed 10500000 out of 50000000 steps (21%).
02:21:16:Unit 01:Completed 11000000 out of 50000000 steps (22%).
02:25:49:Unit 01:Completed 11500000 out of 50000000 steps (23%).
02:30:21:Unit 01:Completed 12000000 out of 50000000 steps (24%).
02:34:53:Unit 01:Completed 12500000 out of 50000000 steps (25%).
02:39:25:Unit 01:Completed 13000000 out of 50000000 steps (26%).
02:43:58:Unit 01:Completed 13500000 out of 50000000 steps (27%).
02:48:30:Unit 01:Completed 14000000 out of 50000000 steps (28%).
02:53:01:Unit 01:Completed 14500000 out of 50000000 steps (29%).
02:57:33:Unit 01:Completed 15000000 out of 50000000 steps (30%).
03:02:06:Unit 01:Completed 15500000 out of 50000000 steps (31%).
03:06:38:Unit 01:Completed 16000000 out of 50000000 steps (32%).
03:11:10:Unit 01:Completed 16500000 out of 50000000 steps (33%).
03:15:42:Unit 01:Completed 17000000 out of 50000000 steps (34%).
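Because the Unit 00 and GPU progress lines are interleaved in the log above, it is easy to miss where Unit 00 goes quiet. Here is a rough Python sketch that pulls out the gaps between consecutive "Completed ... steps" lines for one unit. The function name `frame_times` and the regular expression are my own, written against the line format pasted here, not anything FAHClient provides.

```python
import re
from datetime import datetime, timedelta

# Matches lines such as:
# 23:00:09:Unit 00:Completed 1170000 out of 1500000 steps (78%)
PROGRESS = re.compile(
    r"^(\d\d:\d\d:\d\d):Unit (\d\d):Completed (\d+) out of (\d+) steps"
)

def frame_times(lines, unit):
    """Return the time gaps between consecutive progress lines for one unit."""
    stamps = []
    for line in lines:
        m = PROGRESS.match(line)
        if m and m.group(2) == unit:
            stamps.append(datetime.strptime(m.group(1), "%H:%M:%S"))
    gaps = []
    for prev, cur in zip(stamps, stamps[1:]):
        delta = cur - prev
        if delta < timedelta(0):
            # The log only carries time of day, so assume a single
            # midnight rollover when the clock appears to go backwards.
            delta += timedelta(days=1)
        gaps.append(delta)
    return gaps

sample = [
    "23:00:09:Unit 00:Completed 1170000 out of 1500000 steps (78%)",
    "23:02:02:Unit 02:Completed 39000000 out of 50000000 steps (78%).",
    "23:06:24:Unit 02:Completed 39500000 out of 50000000 steps (79%).",
    "23:37:56:Unit 00:Completed 1185000 out of 1500000 steps (79%)",
]

for gap in frame_times(sample, "00"):
    print(gap)
```

On the four sample lines this gives one gap of about 37 minutes for Unit 00. A gap much longer than the usual one would point at a stall or crash, not a real frame time, and a pause of more than a day cannot be seen from time-of-day stamps alone.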
Code:
09:48:28:FahCore, running Unit 00, returned: UNKNOWN_ENUM (-1073741819 = 0xc0000005)
09:48:28:Starting Unit 00
09:48:28:Running core: C:/Users/Baz/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -lifeline 4052 -version 701 -checkpoint 6 -np 2
09:48:30:Started core on PID 5536
09:48:30:FahCore 0xa4 started
09:48:30:Unit 00:
09:48:30:Unit 00:*------------------------------*
09:48:30:Unit 00:Folding@Home Gromacs GB Core
09:48:30:Unit 00:Version 2.27 (Dec. 15, 2010)
09:48:30:Unit 00:
09:48:30:Unit 00:Preparing to commence simulation
09:48:30:Unit 00:- Ensuring status. Please wait.
09:48:39:Unit 00:- Looking at optimizations...
09:48:39:Unit 00:- Working with standard loops on this execution.
09:48:39:Unit 00:- Previous termination of core was improper.
09:48:39:Unit 00:- Files status OK
09:48:40:Unit 00:- Expanded 2079173 -> 5386224 (decompressed 259.0 percent)
09:48:40:Unit 00:Called DecompressByteArray: compressed_data_size=2079173 data_size=5386224, decompressed_data_size=5386224 diff=0
09:48:40:Unit 00:- Digital signature verified
09:48:40:Unit 00:
09:48:40:Unit 00:Project: 7809 (Run 1, Clone 486, Gen 16)
09:48:40:Unit 00:
09:48:40:Unit 00:Entering M.D.
09:48:46:Unit 00:Using Gromacs checkpoints
09:48:46:Unit 00:Mapping NT from 2 to 2
09:48:47:Unit 00:Resuming from checkpoint
09:48:47:Unit 00:Verified 00/wudata_01.log
09:48:47:Unit 00:Verified 00/wudata_01.trr
09:48:47:Unit 00:Verified 00/wudata_01.xtc
09:48:47:Unit 00:Verified 00/wudata_01.edr
09:48:47:Unit 00:Completed 1185560 out of 1500000 steps (79%)
09:50:23:Unit 01:Completed 18500000 out of 50000000 steps (37%).
09:54:48:Unit 01:Completed 19000000 out of 50000000 steps (38%).
09:59:15:Unit 01:Completed 19500000 out of 50000000 steps (39%).
10:03:40:Unit 01:Completed 20000000 out of 50000000 steps (40%).
10:08:06:Unit 01:Completed 20500000 out of 50000000 steps (41%).
10:12:30:Unit 01:Completed 21000000 out of 50000000 steps (42%).
10:16:55:Unit 01:Completed 21500000 out of 50000000 steps (43%).
10:21:19:Unit 01:Completed 22000000 out of 50000000 steps (44%).
10:25:43:Unit 01:Completed 22500000 out of 50000000 steps (45%).
10:30:06:Unit 01:Completed 23000000 out of 50000000 steps (46%).
10:30:13:Unit 00:Completed 1200000 out of 1500000 steps (80%)
10:34:30:Unit 01:Completed 23500000 out of 50000000 steps (47%).
10:38:54:Unit 01:Completed 24000000 out of 50000000 steps (48%).
10:43:21:Unit 01:Completed 24500000 out of 50000000 steps (49%).
10:47:46:Unit 01:Completed 25000000 out of 50000000 steps (50%).
10:52:11:Unit 01:Completed 25500000 out of 50000000 steps (51%).
10:56:35:Unit 01:Completed 26000000 out of 50000000 steps (52%).
11:01:00:Unit 01:Completed 26500000 out of 50000000 steps (53%).
11:05:23:Unit 01:Completed 27000000 out of 50000000 steps (54%).
11:09:47:Unit 01:Completed 27500000 out of 50000000 steps (55%).
11:13:28:Unit 00:Completed 1215000 out of 1500000 steps (81%)
11:14:10:Unit 01:Completed 28000000 out of 50000000 steps (56%).
11:18:35:Unit 01:Completed 28500000 out of 50000000 steps (57%).
11:23:00:Unit 01:Completed 29000000 out of 50000000 steps (58%).
11:27:24:Unit 01:Completed 29500000 out of 50000000 steps (59%).
11:31:50:Unit 01:Completed 30000000 out of 50000000 steps (60%).
11:36:14:Unit 01:Completed 30500000 out of 50000000 steps (61%).
11:40:39:Unit 01:Completed 31000000 out of 50000000 steps (62%).
11:45:02:Unit 01:Completed 31500000 out of 50000000 steps (63%).
11:49:25:Unit 01:Completed 32000000 out of 50000000 steps (64%).
11:53:47:Unit 01:Completed 32500000 out of 50000000 steps (65%).
11:55:24:Unit 00:Completed 1230000 out of 1500000 steps (82%)
11:58:12:Unit 01:Completed 33000000 out of 50000000 steps (66%).
11:58:15:Unit 00:mdrun returned 255
11:58:15:Unit 00:Going to send back what have done -- stepsTotalG=1500000
11:58:15:Unit 00:Work fraction=0.8207 steps=1500000.
11:58:19:Unit 00:logfile size=49580 infoLength=49580 edr=0 trr=25
11:58:19:Unit 00:logfile size: 49580 info=49580 bed=0 hdr=25
11:58:19:Unit 00:- Writing 50118 bytes of core data to disk...
11:58:19:Unit 00:Done: 49606 -> 7937 (compressed to 16.0 percent)
11:58:19:Unit 00: ... Done.
11:58:20:FahCore, running Unit 00, returned: BAD_WORK_UNIT (114 = 0x72)
11:58:20:Sending unit results: id:00 state:SEND error:FAULTY project:7809 run:1 clone:486 gen:16 core:0xa4 unit:0x000000110a3b1e874e3108c65ee81acc
11:58:20:Unit 00: Uploading 8.25KiB to 171.64.65.99
11:58:20:Connecting to 171.64.65.99:8080
11:58:20:Connecting to assign3.stanford.edu:8080
11:58:21:News: Welcome to Folding@Home
11:58:21:Assigned to work server 128.113.12.163
11:58:21:Requesting new work unit for slot 02: READY smp:2 from 128.113.12.163
11:58:21:Connecting to 128.113.12.163:8080
11:58:21:Unit 00: Upload complete
11:58:21:Server responded WORK_ACK (400)
11:58:21:Cleaning up Unit 00
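On the return codes in the log above: the client prints the first one as UNKNOWN_ENUM (-1073741819 = 0xc0000005). That is a negative number only because a 32-bit value is being shown as signed. As an unsigned value, 0xC0000005 is the Windows NTSTATUS code for an access violation, which would fit the "core had stopped working" window. A small Python sketch of the conversion follows. The helper names and the lookup table are mine, and the two FahCore names in it are simply copied from the log lines.

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as an unsigned value."""
    return code & 0xFFFFFFFF

# Codes that appear in this thread only, not a complete list.
# 0x64 and 0x72 are named as the client log names them;
# 0xC0000005 is the Windows NTSTATUS value STATUS_ACCESS_VIOLATION.
KNOWN = {
    0xC0000005: "STATUS_ACCESS_VIOLATION (Windows NTSTATUS)",
    0x64: "FINISHED_UNIT",
    0x72: "BAD_WORK_UNIT",
}

def describe(code: int) -> str:
    """Format an exit code as hex plus a name, if it is one we know."""
    value = to_unsigned32(code)
    return f"0x{value:08x} {KNOWN.get(value, 'unknown')}"

print(describe(-1073741819))
print(describe(114))
```

This only decodes the number. It does not say whether the access violation came from the work unit, the core, or the machine.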
Code:
*********************** Log Started 2011-12-15T14:10:17 ************************
14:10:17:************************* Folding@home Client *************************
14:10:17: Website: http://folding.stanford.edu/
14:10:17: Copyright: (c) 2009-2011 Stanford University
14:10:17: Author: Joseph Coffland <[email protected]>
14:10:17: Args: --lifeline 4656 --command-port=36330
14:10:17: Config: C:/Users/Baz/AppData/Roaming/FAHClient/config.xml
14:10:17:******************************** Build ********************************
14:10:17: Version: 7.1.38
14:10:17: Date: Oct 6 2011
14:10:17: Time: 19:57:04
14:10:17: SVN Rev: 3080
14:10:17: Branch: fah/trunk/client
14:10:17: Compiler: Intel(R) C++ MSVC 1500 mode 1200
14:10:17: Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
14:10:17: /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT
14:10:17: Platform: win32 XP
14:10:17: Bits: 32
14:10:17: Mode: Release
14:10:17:******************************* System ********************************
14:10:17: CPU: AMD Phenom(tm) II X2 545 Processor
14:10:17: CPU ID: AuthenticAMD Family 16 Model 4 Stepping 2
14:10:17: CPUs: 2
14:10:17: Memory: 4.00GiB
14:10:17: Free Memory: 2.94GiB
14:10:17: Threads: WINDOWS_THREADS
14:10:17: On Battery: false
14:10:17: UTC offset: 0
14:10:17: PID: 4052
14:10:17: CWD: C:/Users/Baz/AppData/Roaming/FAHClient
14:10:17: OS: Windows 7 Ultimate
14:10:17: OS Arch: AMD64
14:10:17: GPUs: 1
14:10:17: GPU 0: FERMI:1 GF108 [GeForce GT 430]
14:10:17: CUDA: 2.1
14:10:17: CUDA Driver: 4010
14:10:17:Win32 Service: false
14:10:17:***********************************************************************
14:10:17:<config>
14:10:17: <!-- FahCore Control -->
14:10:17: <checkpoint v='6'/>
14:10:17:
14:10:17: <!-- Folding Slot Configuration -->
14:10:17: <gpu v='true'/>
14:10:17:
14:10:17: <!-- Network -->
14:10:17: <proxy v=':8080'/>
14:10:17:
14:10:17: <!-- User Information -->
14:10:17: <passkey v='********************************'/>
14:10:17: <team v='76486'/>
14:10:17: <user v='baz657'/>
14:10:17:
14:10:17: <!-- Folding Slots -->
14:10:17: <slot id='0' type='GPU'/>
14:10:17: <slot id='2' type='SMP'/>
14:10:17:</config>
14:10:17:Trying to access database...
14:10:17:Successfully acquired database lock
14:10:17:Enabled folding slot 00: READY gpu:0:"GF108 [GeForce GT 430]"
14:10:17:Enabled folding slot 02: READY smp:2