Page 1 of 1

WU 6801 (Run 8950, Clone 1, Gen 3) - Failing mdrun_gpu 52

Posted: Wed Jun 08, 2011 4:02 am
by smcpoland
So in the last log the only WU I was assigned was 6801 (Run 8950, Clone 1, Gen 3) 22 times and it always failed.

I've been folding with the same set up for the last 3-4 weeks with no changes no GPU overclocking etc...everything works

except this one unit.

Been happening for 2 days now. I think this is a bad unit and needs to be disposed of.

Thanks and regards

Code: Select all

                       Folding@Home Client Version 6.41r2

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\Temporary\Documents\Folding@home-gpu
Executable: C:\Users\Temporary\Documents\Folding@home-gpu\[email protected]


[02:41:25] - Ask before connecting: No
[02:41:25] - User name: Sean_McPoland (Team 35947)
[02:41:25] - User ID: 4A580CB52896CB89
[02:41:25] - Machine ID: 1
[02:41:25] 
[02:41:25] Gpu type=3 species=20.
[02:41:25] Loaded queue successfully.
[02:41:25] - Preparing to get new work unit...
[02:41:25] Cleaning up work directory
[02:41:25] + Attempting to get work packet
[02:41:25] Passkey found
[02:41:25] Gpu type=3 species=20.
[02:41:25] - Connecting to assignment server
[02:41:27] - Successful: assigned to (171.64.65.64).
[02:41:27] + News From Folding@Home: Welcome to Folding@Home
[02:41:27] Loaded queue successfully.
[02:41:27] Gpu type=3 species=20.
[02:41:30] + Closed connections
[02:41:30] 
[02:41:30] + Processing work unit
[02:41:30] Core required: FahCore_15.exe
[02:41:30] Core found.
[02:41:30] Working on queue slot 03 [June 8 02:41:30 UTC]
[02:41:30] + Working ...
[02:41:30] 
[02:41:30] *------------------------------*
[02:41:30] Folding@Home GPU Core
[02:41:30] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[02:41:30] 
[02:41:30] Build host: SimbiosNvdWin7
[02:41:30] Board Type: NVIDIA/CUDA
[02:41:30] Core      : x=15
[02:41:30]  Window's signal control handler registered.
[02:41:30] Preparing to commence simulation
[02:41:30] - Looking at optimizations...
[02:41:30] DeleteFrameFiles: successfully deleted file=work/wudata_03.ckp
[02:41:30] - Created dyn
[02:41:30] - Files status OK
[02:41:30] sizeof(CORE_PACKET_HDR) = 512 file=<>
[02:41:30] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[02:41:30] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[02:41:30] - Digital signature verified
[02:41:30] 
[02:41:30] Project: 6801 (Run 8950, Clone 1, Gen 3)
[02:41:30] 
[02:41:30] Assembly optimizations on if available.
[02:41:30] Entering M.D.
[02:41:32] Tpr hash work/wudata_03.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[02:41:32] Working on ALZHEIMER'S DISEASE AMYLOID
[02:41:32] Client config found, loading data.
[02:41:32] Setting checkpoint frequency: 500000
[02:41:32] Setting checkpoint frequency: 500000
[02:41:32] Starting GUI Server
[02:42:44] Completed    500000 out of 50000000 steps (1%).
[02:42:44] mdrun_gpu returned 52
[02:42:44] NANs detected on GPU
[02:42:44] 
[02:42:44] Folding@home Core Shutdown: UNSTABLE_MACHINE
[02:42:48] CoreStatus = 7A (122)
[02:42:48] Sending work to server
[02:42:48] Project: 6801 (Run 8950, Clone 1, Gen 3)
[02:42:48] - Read packet limit of 540015616... Set to 524286976.
[02:42:48] - Error: Could not get length of results file work/wuresults_03.dat
[02:42:48] - Error: Could not read unit 03 file. Removing from queue.
[02:42:48] - Preparing to get new work unit...
[02:42:48] Cleaning up work directory
[02:42:48] + Attempting to get work packet
[02:42:48] Passkey found
[02:42:48] Gpu type=3 species=20.
[02:42:48] - Connecting to assignment server
[02:42:49] - Successful: assigned to (171.64.65.64).
[02:42:49] + News From Folding@Home: Welcome to Folding@Home
[02:42:50] Loaded queue successfully.
[02:42:50] Gpu type=3 species=20.
[02:42:52] + Closed connections
[02:42:57] 
[02:42:57] + Processing work unit
[02:42:57] Core required: FahCore_15.exe
[02:42:57] Core found.
[02:42:57] Working on queue slot 04 [June 8 02:42:57 UTC]
[02:42:57] + Working ...
[02:42:57] 
[02:42:57] *------------------------------*
[02:42:57] Folding@Home GPU Core
[02:42:57] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[02:42:57] 
[02:42:57] Build host: SimbiosNvdWin7
[02:42:57] Board Type: NVIDIA/CUDA
[02:42:57] Core      : x=15
[02:42:57]  Window's signal control handler registered.
[02:42:57] Preparing to commence simulation
[02:42:57] - Looking at optimizations...
[02:42:57] DeleteFrameFiles: successfully deleted file=work/wudata_04.ckp
[02:42:57] - Created dyn
[02:42:57] - Files status OK
[02:42:57] sizeof(CORE_PACKET_HDR) = 512 file=<>
[02:42:57] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[02:42:57] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[02:42:57] - Digital signature verified
[02:42:57] 
[02:42:57] Project: 6801 (Run 8950, Clone 1, Gen 3)
[02:42:57] 
[02:42:57] Assembly optimizations on if available.
[02:42:57] Entering M.D.

Folding@Home Client Shutdown.


--- Opening Log file [June 8 03:06:40 UTC] 


# Windows GPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.41r2

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\Temporary\Documents\Folding@home-gpu
Executable: C:\Users\Temporary\Documents\Folding@home-gpu\[email protected]


[03:06:40] - Ask before connecting: No
[03:06:40] - User name: Sean_McPoland (Team 35947)
[03:06:40] - User ID: 4A580CB52896CB89
[03:06:40] - Machine ID: 1
[03:06:40] 
[03:06:40] Gpu type=3 species=20.
[03:06:40] Loaded queue successfully.
[03:06:40] 
[03:06:40] + Processing work unit
[03:06:40] Core required: FahCore_15.exe
[03:06:40] Core found.
[03:06:40] Working on queue slot 04 [June 8 03:06:40 UTC]
[03:06:40] + Working ...
[03:06:40] 
[03:06:40] *------------------------------*
[03:06:40] Folding@Home GPU Core
[03:06:40] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:06:40] 
[03:06:40] Build host: SimbiosNvdWin7
[03:06:40] Board Type: NVIDIA/CUDA
[03:06:40] Core      : x=15
[03:06:40]  Window's signal control handler registered.
[03:06:40] Preparing to commence simulation
[03:06:40] - Looking at optimizations...
[03:06:40] - Files status OK
[03:06:40] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:06:40] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:06:40] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:06:40] - Digital signature verified
[03:06:40] 
[03:06:40] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:06:40] 
[03:06:40] Assembly optimizations on if available.
[03:06:40] Entering M.D.
[03:06:42] Tpr hash work/wudata_04.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:06:42] Working on ALZHEIMER'S DISEASE AMYLOID
[03:06:42] Client config found, loading data.
[03:06:43] Starting GUI Server
[03:06:43] Setting checkpoint frequency: 500000
[03:06:43] Setting checkpoint frequency: 500000
[03:07:50] Completed    500000 out of 50000000 steps (1%).
[03:07:50] mdrun_gpu returned 52
[03:07:50] NANs detected on GPU
[03:07:50] 
[03:07:50] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:07:54] CoreStatus = 7A (122)
[03:07:54] Sending work to server
[03:07:54] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:07:54] - Read packet limit of 540015616... Set to 524286976.
[03:07:54] - Error: Could not get length of results file work/wuresults_04.dat
[03:07:54] - Error: Could not read unit 04 file. Removing from queue.
[03:07:54] - Preparing to get new work unit...
[03:07:54] Cleaning up work directory
[03:07:54] + Attempting to get work packet
[03:07:54] Passkey found
[03:07:54] Gpu type=3 species=20.
[03:07:54] - Connecting to assignment server
[03:07:56] - Successful: assigned to (171.64.65.64).
[03:07:56] + News From Folding@Home: Welcome to Folding@Home
[03:07:56] Loaded queue successfully.
[03:07:56] Gpu type=3 species=20.
[03:07:59] + Closed connections
[03:08:04] 
[03:08:04] + Processing work unit
[03:08:04] Core required: FahCore_15.exe
[03:08:04] Core found.
[03:08:04] Working on queue slot 05 [June 8 03:08:04 UTC]
[03:08:04] + Working ...
[03:08:04] 
[03:08:04] *------------------------------*
[03:08:04] Folding@Home GPU Core
[03:08:04] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:08:04] 
[03:08:04] Build host: SimbiosNvdWin7
[03:08:04] Board Type: NVIDIA/CUDA
[03:08:04] Core      : x=15
[03:08:04]  Window's signal control handler registered.
[03:08:04] Preparing to commence simulation
[03:08:04] - Looking at optimizations...
[03:08:04] DeleteFrameFiles: successfully deleted file=work/wudata_05.ckp
[03:08:04] - Created dyn
[03:08:04] - Files status OK
[03:08:04] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:08:04] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:08:04] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:08:04] - Digital signature verified
[03:08:04] 
[03:08:04] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:08:04] 
[03:08:04] Assembly optimizations on if available.
[03:08:04] Entering M.D.
[03:08:06] Tpr hash work/wudata_05.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:08:06] Working on ALZHEIMER'S DISEASE AMYLOID
[03:08:06] Client config found, loading data.
[03:08:07] Setting checkpoint frequency: 500000
[03:08:07] Setting checkpoint frequency: 500000
[03:08:07] Starting GUI Server
[03:09:14] Completed    500000 out of 50000000 steps (1%).
[03:09:14] mdrun_gpu returned 52
[03:09:14] NANs detected on GPU
[03:09:14] 
[03:09:14] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:09:16] CoreStatus = 7A (122)
[03:09:16] Sending work to server
[03:09:16] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:09:16] - Read packet limit of 540015616... Set to 524286976.
[03:09:16] - Error: Could not get length of results file work/wuresults_05.dat
[03:09:16] - Error: Could not read unit 05 file. Removing from queue.
[03:09:16] - Preparing to get new work unit...
[03:09:16] Cleaning up work directory
[03:09:16] + Attempting to get work packet
[03:09:16] Passkey found
[03:09:16] Gpu type=3 species=20.
[03:09:16] - Connecting to assignment server
[03:09:21] - Successful: assigned to (171.64.65.64).
[03:09:21] + News From Folding@Home: Welcome to Folding@Home
[03:09:21] Loaded queue successfully.
[03:09:21] Gpu type=3 species=20.
[03:09:24] + Closed connections
[03:09:29] 
[03:09:29] + Processing work unit
[03:09:29] Core required: FahCore_15.exe
[03:09:29] Core found.
[03:09:29] Working on queue slot 06 [June 8 03:09:29 UTC]
[03:09:29] + Working ...
[03:09:29] 
[03:09:29] *------------------------------*
[03:09:29] Folding@Home GPU Core
[03:09:29] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:09:29] 
[03:09:29] Build host: SimbiosNvdWin7
[03:09:29] Board Type: NVIDIA/CUDA
[03:09:29] Core      : x=15
[03:09:29]  Window's signal control handler registered.
[03:09:29] Preparing to commence simulation
[03:09:29] - Looking at optimizations...
[03:09:29] DeleteFrameFiles: successfully deleted file=work/wudata_06.ckp
[03:09:29] - Created dyn
[03:09:29] - Files status OK
[03:09:29] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:09:29] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:09:29] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:09:29] - Digital signature verified
[03:09:29] 
[03:09:29] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:09:29] 
[03:09:29] Assembly optimizations on if available.
[03:09:29] Entering M.D.
[03:09:31] Tpr hash work/wudata_06.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:09:31] Working on ALZHEIMER'S DISEASE AMYLOID
[03:09:31] Client config found, loading data.
[03:09:32] Setting checkpoint frequency: 500000
[03:09:32] Setting checkpoint frequency: 500000
[03:09:32] Starting GUI Server
[03:10:40] Completed    500000 out of 50000000 steps (1%).
[03:10:40] mdrun_gpu returned 52
[03:10:40] NANs detected on GPU
[03:10:40] 
[03:10:40] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:10:43] CoreStatus = 7A (122)
[03:10:43] Sending work to server
[03:10:43] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:10:43] - Read packet limit of 540015616... Set to 524286976.
[03:10:43] - Error: Could not get length of results file work/wuresults_06.dat
[03:10:43] - Error: Could not read unit 06 file. Removing from queue.
[03:10:43] - Preparing to get new work unit...
[03:10:43] Cleaning up work directory
[03:10:43] + Attempting to get work packet
[03:10:43] Passkey found
[03:10:43] Gpu type=3 species=20.
[03:10:43] - Connecting to assignment server
[03:10:47] - Successful: assigned to (171.64.65.64).
[03:10:47] + News From Folding@Home: Welcome to Folding@Home
[03:10:47] Loaded queue successfully.
[03:10:47] Gpu type=3 species=20.
[03:10:50] + Closed connections
[03:10:55] 
[03:10:55] + Processing work unit
[03:10:55] Core required: FahCore_15.exe
[03:10:55] Core found.
[03:10:55] Working on queue slot 07 [June 8 03:10:55 UTC]
[03:10:55] + Working ...
[03:10:55] 
[03:10:55] *------------------------------*
[03:10:55] Folding@Home GPU Core
[03:10:55] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:10:55] 
[03:10:55] Build host: SimbiosNvdWin7
[03:10:55] Board Type: NVIDIA/CUDA
[03:10:55] Core      : x=15
[03:10:55]  Window's signal control handler registered.
[03:10:55] Preparing to commence simulation
[03:10:55] - Looking at optimizations...
[03:10:55] DeleteFrameFiles: successfully deleted file=work/wudata_07.ckp
[03:10:55] - Created dyn
[03:10:55] - Files status OK
[03:10:55] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:10:55] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:10:55] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:10:55] - Digital signature verified
[03:10:55] 
[03:10:55] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:10:55] 
[03:10:55] Assembly optimizations on if available.
[03:10:55] Entering M.D.
[03:10:57] Tpr hash work/wudata_07.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:10:57] Working on ALZHEIMER'S DISEASE AMYLOID
[03:10:57] Client config found, loading data.
[03:10:57] Setting checkpoint frequency: 500000
[03:10:57] Setting checkpoint frequency: 500000
[03:10:57] Starting GUI Server
[03:12:05] Completed    500000 out of 50000000 steps (1%).
[03:12:05] mdrun_gpu returned 52
[03:12:05] NANs detected on GPU
[03:12:05] 
[03:12:05] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:12:07] CoreStatus = 7A (122)
[03:12:07] Sending work to server
[03:12:07] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:12:07] - Read packet limit of 540015616... Set to 524286976.
[03:12:07] - Error: Could not get length of results file work/wuresults_07.dat
[03:12:07] - Error: Could not read unit 07 file. Removing from queue.
[03:12:07] - Preparing to get new work unit...
[03:12:07] Cleaning up work directory
[03:12:07] + Attempting to get work packet
[03:12:07] Passkey found
[03:12:07] Gpu type=3 species=20.
[03:12:07] - Connecting to assignment server
[03:12:09] - Successful: assigned to (171.64.65.64).
[03:12:09] + News From Folding@Home: Welcome to Folding@Home
[03:12:09] Loaded queue successfully.
[03:12:09] Gpu type=3 species=20.
[03:12:12] + Closed connections
[03:12:17] 
[03:12:17] + Processing work unit
[03:12:17] Core required: FahCore_15.exe
[03:12:17] Core found.
[03:12:17] Working on queue slot 08 [June 8 03:12:17 UTC]
[03:12:17] + Working ...
[03:12:17] 
[03:12:17] *------------------------------*
[03:12:17] Folding@Home GPU Core
[03:12:17] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:12:17] 
[03:12:17] Build host: SimbiosNvdWin7
[03:12:17] Board Type: NVIDIA/CUDA
[03:12:17] Core      : x=15
[03:12:17]  Window's signal control handler registered.
[03:12:17] Preparing to commence simulation
[03:12:17] - Looking at optimizations...
[03:12:17] DeleteFrameFiles: successfully deleted file=work/wudata_08.ckp
[03:12:17] - Created dyn
[03:12:17] - Files status OK
[03:12:17] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:12:17] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:12:17] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:12:17] - Digital signature verified
[03:12:17] 
[03:12:17] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:12:17] 
[03:12:17] Assembly optimizations on if available.
[03:12:17] Entering M.D.
[03:12:19] Tpr hash work/wudata_08.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:12:19] Working on ALZHEIMER'S DISEASE AMYLOID
[03:12:19] Client config found, loading data.
[03:12:19] Setting checkpoint frequency: 500000
[03:12:19] Setting checkpoint frequency: 500000
[03:12:19] Starting GUI Server

Folding@Home Client Shutdown.


--- Opening Log file [June 8 03:47:36 UTC] 


# Windows GPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.41r2

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\Temporary\Documents\Folding@home-gpu
Executable: C:\Users\Temporary\Documents\Folding@home-gpu\[email protected]


[03:47:36] - Ask before connecting: No
[03:47:36] - User name: Sean_McPoland (Team 35947)
[03:47:36] - User ID: 4A580CB52896CB89
[03:47:36] - Machine ID: 1
[03:47:36] 
[03:47:36] Gpu type=3 species=20.
[03:47:36] Loaded queue successfully.
[03:47:36] 
[03:47:36] + Processing work unit
[03:47:36] Core required: FahCore_15.exe
[03:47:36] Core found.
[03:47:36] Working on queue slot 08 [June 8 03:47:36 UTC]
[03:47:36] + Working ...
[03:47:36] 
[03:47:36] *------------------------------*
[03:47:36] Folding@Home GPU Core
[03:47:36] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:47:36] 
[03:47:36] Build host: SimbiosNvdWin7
[03:47:36] Board Type: NVIDIA/CUDA
[03:47:36] Core      : x=15
[03:47:36]  Window's signal control handler registered.
[03:47:36] Preparing to commence simulation
[03:47:36] - Looking at optimizations...
[03:47:36] DeleteFrameFiles: successfully deleted file=work/wudata_08.ckp
[03:47:37] - Created dyn
[03:47:37] - Files status OK
[03:47:37] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:47:37] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:47:37] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:47:37] - Digital signature verified
[03:47:37] 
[03:47:37] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:47:37] 
[03:47:37] Assembly optimizations on if available.
[03:47:37] Entering M.D.
[03:47:39] Tpr hash work/wudata_08.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:47:39] Working on ALZHEIMER'S DISEASE AMYLOID
[03:47:39] Client config found, loading data.
[03:47:39] Starting GUI Server
[03:47:39] Setting checkpoint frequency: 500000
[03:47:39] Setting checkpoint frequency: 500000
[03:48:49] Completed    500000 out of 50000000 steps (1%).
[03:48:49] mdrun_gpu returned 52
[03:48:49] NANs detected on GPU
[03:48:49] 
[03:48:49] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:48:53] CoreStatus = 7A (122)
[03:48:53] Sending work to server
[03:48:53] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:48:53] - Read packet limit of 540015616... Set to 524286976.
[03:48:53] - Error: Could not get length of results file work/wuresults_08.dat
[03:48:53] - Error: Could not read unit 08 file. Removing from queue.
[03:48:53] - Preparing to get new work unit...
[03:48:53] Cleaning up work directory
[03:48:53] + Attempting to get work packet
[03:48:53] Passkey found
[03:48:53] Gpu type=3 species=20.
[03:48:53] - Connecting to assignment server
[03:48:55] - Successful: assigned to (171.64.65.64).
[03:48:55] + News From Folding@Home: Welcome to Folding@Home
[03:48:55] Loaded queue successfully.
[03:48:55] Gpu type=3 species=20.
[03:48:57] + Closed connections
[03:49:02] 
[03:49:02] + Processing work unit
[03:49:02] Core required: FahCore_15.exe
[03:49:02] Core found.
[03:49:02] Working on queue slot 09 [June 8 03:49:02 UTC]
[03:49:02] + Working ...
[03:49:03] 
[03:49:03] *------------------------------*
[03:49:03] Folding@Home GPU Core
[03:49:03] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:49:03] 
[03:49:03] Build host: SimbiosNvdWin7
[03:49:03] Board Type: NVIDIA/CUDA
[03:49:03] Core      : x=15
[03:49:03]  Window's signal control handler registered.
[03:49:03] Preparing to commence simulation
[03:49:03] - Looking at optimizations...
[03:49:03] DeleteFrameFiles: successfully deleted file=work/wudata_09.ckp
[03:49:03] - Created dyn
[03:49:03] - Files status OK
[03:49:03] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:49:03] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:49:03] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:49:03] - Digital signature verified
[03:49:03] 
[03:49:03] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:49:03] 
[03:49:03] Assembly optimizations on if available.
[03:49:03] Entering M.D.
[03:49:05] Tpr hash work/wudata_09.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:49:05] Working on ALZHEIMER'S DISEASE AMYLOID
[03:49:05] Client config found, loading data.
[03:49:05] Starting GUI Server
[03:49:05] Setting checkpoint frequency: 500000
[03:49:05] Setting checkpoint frequency: 500000
[03:50:12] Completed    500000 out of 50000000 steps (1%).
[03:50:12] mdrun_gpu returned 52
[03:50:12] NANs detected on GPU
[03:50:12] 
[03:50:12] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:50:15] CoreStatus = 7A (122)
[03:50:15] Sending work to server
[03:50:15] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:50:15] - Read packet limit of 540015616... Set to 524286976.
[03:50:15] - Error: Could not get length of results file work/wuresults_09.dat
[03:50:15] - Error: Could not read unit 09 file. Removing from queue.
[03:50:15] - Preparing to get new work unit...
[03:50:15] Cleaning up work directory
[03:50:15] + Attempting to get work packet
[03:50:15] Passkey found
[03:50:15] Gpu type=3 species=20.
[03:50:15] - Connecting to assignment server
[03:50:17] - Successful: assigned to (171.64.65.64).
[03:50:17] + News From Folding@Home: Welcome to Folding@Home
[03:50:17] Loaded queue successfully.
[03:50:17] Gpu type=3 species=20.
[03:50:19] + Closed connections
[03:50:24] 
[03:50:24] + Processing work unit
[03:50:24] Core required: FahCore_15.exe
[03:50:24] Core found.
[03:50:24] Working on queue slot 00 [June 8 03:50:24 UTC]
[03:50:24] + Working ...
[03:50:24] 
[03:50:24] *------------------------------*
[03:50:24] Folding@Home GPU Core
[03:50:24] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:50:24] 
[03:50:24] Build host: SimbiosNvdWin7
[03:50:24] Board Type: NVIDIA/CUDA
[03:50:24] Core      : x=15
[03:50:24]  Window's signal control handler registered.
[03:50:24] Preparing to commence simulation
[03:50:24] - Looking at optimizations...
[03:50:24] DeleteFrameFiles: successfully deleted file=work/wudata_00.ckp
[03:50:24] - Created dyn
[03:50:24] - Files status OK
[03:50:24] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:50:24] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:50:24] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:50:24] - Digital signature verified
[03:50:24] 
[03:50:24] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:50:24] 
[03:50:24] Assembly optimizations on if available.
[03:50:24] Entering M.D.
[03:50:26] Tpr hash work/wudata_00.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:50:26] Working on ALZHEIMER'S DISEASE AMYLOID
[03:50:26] Client config found, loading data.
[03:50:26] Starting GUI Server
[03:50:27] Setting checkpoint frequency: 500000
[03:50:27] Setting checkpoint frequency: 500000
[03:51:33] Completed    500000 out of 50000000 steps (1%).
[03:51:33] mdrun_gpu returned 52
[03:51:33] NANs detected on GPU
[03:51:33] 
[03:51:33] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:51:37] CoreStatus = 7A (122)
[03:51:37] Sending work to server
[03:51:37] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:51:37] - Read packet limit of 540015616... Set to 524286976.
[03:51:37] - Error: Could not get length of results file work/wuresults_00.dat
[03:51:37] - Error: Could not read unit 00 file. Removing from queue.
[03:51:37] - Preparing to get new work unit...
[03:51:37] Cleaning up work directory
[03:51:37] + Attempting to get work packet
[03:51:37] Passkey found
[03:51:37] Gpu type=3 species=20.
[03:51:37] - Connecting to assignment server
[03:51:39] - Successful: assigned to (171.64.65.64).
[03:51:39] + News From Folding@Home: Welcome to Folding@Home
[03:51:39] Loaded queue successfully.
[03:51:39] Gpu type=3 species=20.
[03:51:42] + Closed connections
[03:51:47] 
[03:51:47] + Processing work unit
[03:51:47] Core required: FahCore_15.exe
[03:51:47] Core found.
[03:51:47] Working on queue slot 01 [June 8 03:51:47 UTC]
[03:51:47] + Working ...
[03:51:47] 
[03:51:47] *------------------------------*
[03:51:47] Folding@Home GPU Core
[03:51:47] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:51:47] 
[03:51:47] Build host: SimbiosNvdWin7
[03:51:47] Board Type: NVIDIA/CUDA
[03:51:47] Core      : x=15
[03:51:47]  Window's signal control handler registered.
[03:51:47] Preparing to commence simulation
[03:51:47] - Looking at optimizations...
[03:51:47] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[03:51:47] - Created dyn
[03:51:47] - Files status OK
[03:51:47] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:51:47] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:51:47] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:51:47] - Digital signature verified
[03:51:47] 
[03:51:47] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:51:47] 
[03:51:47] Assembly optimizations on if available.
[03:51:47] Entering M.D.
[03:51:49] Tpr hash work/wudata_01.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:51:49] Working on ALZHEIMER'S DISEASE AMYLOID
[03:51:49] Client config found, loading data.
[03:51:49] Setting checkpoint frequency: 500000
[03:51:49] Setting checkpoint frequency: 500000
[03:51:49] Starting GUI Server
[03:52:56] Completed    500000 out of 50000000 steps (1%).
[03:52:56] mdrun_gpu returned 52
[03:52:56] NANs detected on GPU
[03:52:56] 
[03:52:56] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:52:59] CoreStatus = 7A (122)
[03:52:59] Sending work to server
[03:52:59] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:52:59] - Read packet limit of 540015616... Set to 524286976.
[03:52:59] - Error: Could not get length of results file work/wuresults_01.dat
[03:52:59] - Error: Could not read unit 01 file. Removing from queue.
[03:52:59] - Preparing to get new work unit...
[03:52:59] Cleaning up work directory
[03:52:59] + Attempting to get work packet
[03:52:59] Passkey found
[03:52:59] Gpu type=3 species=20.
[03:52:59] - Connecting to assignment server
[03:53:01] - Successful: assigned to (171.64.65.64).
[03:53:01] + News From Folding@Home: Welcome to Folding@Home
[03:53:01] Loaded queue successfully.
[03:53:01] Gpu type=3 species=20.
[03:53:04] + Closed connections
[03:53:09] 
[03:53:09] + Processing work unit
[03:53:09] Core required: FahCore_15.exe
[03:53:09] Core found.
[03:53:09] Working on queue slot 02 [June 8 03:53:09 UTC]
[03:53:09] + Working ...
[03:53:09] 
[03:53:09] *------------------------------*
[03:53:09] Folding@Home GPU Core
[03:53:09] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:53:09] 
[03:53:09] Build host: SimbiosNvdWin7
[03:53:09] Board Type: NVIDIA/CUDA
[03:53:09] Core      : x=15
[03:53:09]  Window's signal control handler registered.
[03:53:09] Preparing to commence simulation
[03:53:09] - Looking at optimizations...
[03:53:09] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[03:53:09] - Created dyn
[03:53:09] - Files status OK
[03:53:09] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:53:09] - Expanded 43852 -> 171827 (decompressed 391.8 percent)
[03:53:09] Called DecompressByteArray: compressed_data_size=43852 data_size=171827, decompressed_data_size=171827 diff=0
[03:53:09] - Digital signature verified
[03:53:09] 
[03:53:09] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:53:09] 
[03:53:09] Assembly optimizations on if available.
[03:53:09] Entering M.D.
[03:53:11] Tpr hash work/wudata_02.tpr:  3607300280 4160975690 466456219 2840422234 798880461
[03:53:11] Working on ALZHEIMER'S DISEASE AMYLOID
[03:53:11] Client config found, loading data.
[03:53:11] Setting checkpoint frequency: 500000
[03:53:11] Setting checkpoint frequency: 500000
[03:53:11] Starting GUI Server
[03:54:17] Completed    500000 out of 50000000 steps (1%).
[03:54:18] mdrun_gpu returned 52
[03:54:18] NANs detected on GPU
[03:54:18] 
[03:54:18] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:54:21] CoreStatus = 7A (122)
[03:54:21] Sending work to server
[03:54:21] Project: 6801 (Run 8950, Clone 1, Gen 3)
[03:54:21] - Read packet limit of 540015616... Set to 524286976.
[03:54:21] - Error: Could not get length of results file work/wuresults_02.dat
[03:54:21] - Error: Could not read unit 02 file. Removing from queue.
[03:54:21] EUE limit exceeded. Pausing 24 hours.

Folding@Home Client Shutdown.

Re: WU 6801 (Run 8950, Clone 1, Gen 3) - Failing mdrun_gpu 5

Posted: Wed Jun 08, 2011 4:41 am
by PantherX
Two reports in the WU Database and I am waiting for another report to mark it as a bad one.

Re: WU 6801 (Run 8950, Clone 1, Gen 3) - Failing mdrun_gpu 5

Posted: Wed Jun 08, 2011 6:16 am
by 7im
It helps if you also post hardware configs, overclock speeds, etc.

Re: WU 6801 (Run 8950, Clone 1, Gen 3) - Failing mdrun_gpu 5

Posted: Wed Jun 08, 2011 9:14 am
by smcpoland
Apologies:

ASUS Maximus IV rev 3
Intel 2600K @ at Stock
EVGA 580 GTX at Stock
Corsair Dominator 16GB at Stock
Corsair 300 SSD (x2) in RAID 1

Windows 7 all patches applied
NVidia 266 Drivers

at Stock means no Overclocking applied

After I finished cleaning up like PantherX stated on another post and restarted - WU's started working again...

regards