Project: 5767 (Run 0, Clone 185, Gen 793)
Posted: Tue Jun 30, 2009 10:25 pm
I got a "NANs detected on GPU" / UNSTABLE_MACHINE error with this one. Here's part of the log:
[20:26:34]
[20:26:34] *------------------------------*
[20:26:34] Folding@Home GPU Core
[20:26:34] Version 1.27 (Thu Jun 18 14:02:10 PDT 2009)
[20:26:34]
[20:26:34] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[20:26:34] Build host: amoeba
[20:26:34] Board Type: Nvidia
[20:26:34] Core :
[20:26:34] Preparing to commence simulation
[20:26:34] - Looking at optimizations...
[20:26:34] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[20:26:34] - Created dyn
[20:26:34] - Files status OK
[20:26:34] - Expanded 46773 -> 252912 (decompressed 540.7 percent)
[20:26:34] Called DecompressByteArray: compressed_data_size=46773 data_size=252912, decompressed_data_size=252912 diff=0
[20:26:34] - Digital signature verified
[20:26:34]
[20:26:34] Project: 5767 (Run 0, Clone 185, Gen 793)
[20:26:34]
[20:26:34] Assembly optimizations on if available.
[20:26:34] Entering M.D.
[20:26:40] Tpr hash work/wudata_01.tpr: 991414023 1275687679 86440814 1097676542 1318018278
[20:26:40]
[20:26:40] Calling fah_main args: 14 usage=100
[20:26:40]
[20:26:41] Working on Protein
[20:26:41] Client config found, loading data.
[20:26:41] Starting GUI Server
[20:26:41] mdrun_gpu returned
[20:26:41] NANs detected on GPU
[20:26:41]
[20:26:41] Folding@home Core Shutdown: UNSTABLE_MACHINE
[20:26:44] CoreStatus = 7A (122)
[20:26:44] Sending work to server
[20:26:44] Project: 5767 (Run 0, Clone 185, Gen 793)
[20:26:44] - Read packet limit of 540015616... Set to 524286976.
[20:26:44] - Error: Could not get length of results file work/wuresults_01.dat
[20:26:44] - Error: Could not read unit 01 file. Removing from queue.
[20:26:44] Trying to send all finished work units
[20:26:44] + No unsent completed units remaining.
[20:26:44] - Preparing to get new work unit...
[20:26:44] + Attempting to get work packet
[20:26:44] - Will indicate memory of 8188 MB
[20:26:44] - Connecting to assignment server
[20:26:44] Connecting to http://assign-GPU.stanford.edu:8080/
[20:26:45] Posted data.
[20:26:45] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[20:26:45] + News From Folding@Home: Welcome to Folding@Home
[20:26:45] Loaded queue successfully.
[20:26:45] Connecting to http://171.67.108.11:8080/
[20:26:45] Posted data.
[20:26:45] Initial: 0000; - Receiving payload (expected size: 47285)
[20:26:45] Conversation time very short, giving reduced weight in bandwidth avg
[20:26:45] - Downloaded at ~92 kB/s
[20:26:45] - Averaged speed for that direction ~98 kB/s
[20:26:45] + Received work.
[20:26:45] Trying to send all finished work units
[20:26:45] + No unsent completed units remaining.
[20:26:45] + Closed connections
[20:26:50]
[20:26:50] + Processing work unit
[20:26:50] Core required: FahCore_11.exe
[20:26:50] Core found.
[20:26:50] Working on queue slot 02 [June 30 20:26:50 UTC]
[20:26:50] + Working ...
[20:26:50] - Calling '.\FahCore_11.exe -dir work/ -suffix 02 -priority 96 -nocpulock -checkpoint 15 -verbose -lifeline 2648 -version 623'
[20:26:50]
[20:26:50] *------------------------------*
[20:26:50] Folding@Home GPU Core
[20:26:50] Version 1.27 (Thu Jun 18 14:02:10 PDT 2009)
[20:26:50]
[20:26:50] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[20:26:50] Build host: amoeba
[20:26:50] Board Type: Nvidia
[20:26:50] Core :
[20:26:50] Preparing to commence simulation
[20:26:50] - Looking at optimizations...
[20:26:50] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[20:26:50] - Created dyn
[20:26:50] - Files status OK
[20:26:50] - Expanded 46773 -> 252912 (decompressed 540.7 percent)
[20:26:50] Called DecompressByteArray: compressed_data_size=46773 data_size=252912, decompressed_data_size=252912 diff=0
[20:26:50] - Digital signature verified
[20:26:50]
[20:26:50] Project: 5767 (Run 0, Clone 185, Gen 793)
[20:26:50]
[20:26:51] Assembly optimizations on if available.
[20:26:51] Entering M.D.
[20:26:56] Tpr hash work/wudata_02.tpr: 991414023 1275687679 86440814 1097676542 1318018278
[20:26:56]
[20:26:56] Calling fah_main args: 14 usage=100
[20:26:56]
[20:26:57] Working on Protein
[20:26:57] Client config found, loading data.
[20:26:57] mdrun_gpu returned
[20:26:57] NANs detected on GPU
[20:26:57]
[20:26:57] Folding@home Core Shutdown: UNSTABLE_MACHINE
[20:27:01] CoreStatus = 7A (122)
[20:27:01] Sending work to server
[20:27:01] Project: 5767 (Run 0, Clone 185, Gen 793)
[20:27:01] - Read packet limit of 540015616... Set to 524286976.
[20:27:01] - Error: Could not get length of results file work/wuresults_02.dat
[20:27:01] - Error: Could not read unit 02 file. Removing from queue.
[20:27:01] EUE limit exceeded. Pausing 24 hours.
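
In case it helps anyone triaging similar logs, here is a rough Python sketch that pulls out which work unit each core shutdown belongs to. It assumes the log looks like the excerpt above; the function name `summarize_failures` and both regex patterns are my own, not part of the Folding@Home client.

```python
import re

# Both patterns are assumptions based only on the log lines quoted in this post.
PROJECT_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
SHUTDOWN_RE = re.compile(r"Folding@home Core Shutdown: (\w+)")


def summarize_failures(log_text):
    """Return a list of (project, run, clone, gen, shutdown_reason) tuples.

    Each shutdown line is attributed to the most recent "Project:" line
    seen before it (hypothetical helper, not a client API).
    """
    failures = []
    current_unit = None
    for line in log_text.splitlines():
        project = PROJECT_RE.search(line)
        if project:
            # Remember the work unit the following lines refer to.
            current_unit = tuple(int(g) for g in project.groups())
            continue
        shutdown = SHUTDOWN_RE.search(line)
        if shutdown and current_unit is not None:
            failures.append(current_unit + (shutdown.group(1),))
    return failures


# A trimmed copy of the lines from the log above.
sample = """\
[20:26:34] Project: 5767 (Run 0, Clone 185, Gen 793)
[20:26:41] NANs detected on GPU
[20:26:41] Folding@home Core Shutdown: UNSTABLE_MACHINE
[20:26:44] Project: 5767 (Run 0, Clone 185, Gen 793)
[20:26:50] Project: 5767 (Run 0, Clone 185, Gen 793)
[20:26:57] NANs detected on GPU
[20:26:57] Folding@home Core Shutdown: UNSTABLE_MACHINE
"""

for failure in summarize_failures(sample):
    print(failure)
```

On the trimmed sample this reports the same unit (5767, Run 0, Clone 185, Gen 793) twice with UNSTABLE_MACHINE, which matches what the log shows: the unit was reassigned and failed again right away.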