Preliminary trouble report.
According to popular opinion, the 310.90 drivers provide better FAH stability than previous versions, so I have been running them.
When I downloaded and installed the CUDA TOOLKIT, it wanted to install an earlier driver version, which I declined. I have not rebooted (yet). Since then, I've had two GPU timeout aborts, which had been EXTREMELY rare until now.
02:40:34:WU01:FS01:0x15:Completed 4000000 out of 40000000 steps (10%).
02:49:34:WU01:FS01:0x15:Run: exception thrown in GuardedRun -- cannot continue further.
02:49:34:WU01:FS01:0x15:Going to send back what have done -- stepsTotalG=40000000
02:49:34:WU01:FS01:0x15:Work fraction=0.1038 steps=40000000.
02:49:38:WU01:FS01:0x15:logfile size=7370 infoLength=7370 edr=0 trr=23
02:49:38:WU01:FS01:0x15:+ Opened results file
02:49:38:WU01:FS01:0x15:- Writing 7906 bytes of core data to disk...
02:49:38:WU01:FS01:0x15:Done: 7394 -> 2590 (compressed to 35.0 percent)
02:49:38:WU01:FS01:0x15: ... Done.
02:49:38:WU01:FS01:0x15:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
02:49:38:WU01:FS01:0x15:
02:49:38:WU01:FS01:0x15:Folding@home Core Shutdown: UNSTABLE_MACHINE
02:49:38:WARNING:WU01:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
02:49:38:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:7623 run:276 clone:0 gen:85 core:0x15 unit:0x0000007c664f2dd14fe4fa026a9d395e
02:49:38:WU01:FS01:Uploading 3.03KiB to 171.64.65.105
02:49:38:WU01:FS01:Connecting to 171.64.65.105:8080
02:49:38:WU00:FS01:Connecting to assign-GPU.stanford.edu:80
02:49:39:WU01:FS01:Upload complete
02:49:39:WU01:FS01:Server responded WORK_ACK (400)
22:19:58:WU00:FS01:0x15:Completed 400000 out of 40000000 steps (1%).
22:31:02:WU00:FS01:0x15:Completed 800000 out of 40000000 steps (2%).
22:41:06:WU00:FS01:0x15:Run: exception thrown in GuardedRun -- cannot continue further.
22:41:06:WU00:FS01:0x15:Going to send back what have done -- stepsTotalG=40000000
22:41:07:WU00:FS01:0x15:Work fraction=0.0291 steps=40000000.
22:41:10:WU00:FS01:0x15:logfile size=7371 infoLength=7371 edr=0 trr=23
22:41:10:WU00:FS01:0x15:+ Opened results file
22:41:10:WU00:FS01:0x15:- Writing 7907 bytes of core data to disk...
22:41:10:WU00:FS01:0x15:Done: 7395 -> 2603 (compressed to 35.1 percent)
22:41:10:WU00:FS01:0x15: ... Done.
22:41:10:WU00:FS01:0x15:DeleteFrameFiles: successfully deleted file=00/wudata_01.ckp
22:41:11:WU00:FS01:0x15:
22:41:11:WU00:FS01:0x15:Folding@home Core Shutdown: UNSTABLE_MACHINE
22:41:11:WARNING:WU00:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
22:41:11:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:7626 run:701 clone:0 gen:35 core:0x15 unit:0x0000002a664f2dd14fe61dca458f7883
22:41:11:WU00:FS01:Uploading 3.04KiB to 171.64.65.105
22:41:11:WU00:FS01:Connecting to 171.64.65.105:8080
22:41:11:WU01:FS01:Connecting to assign-GPU.stanford.edu:80
22:41:11:WU01:FS01:News: Welcome to Folding@Home
22:41:11:WU01:FS01:Assigned to work server 171.64.65.105
22:41:11:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:"GK106 [GeForce GTX 650 Ti]" from 171.64.65.105
22:41:11:WU01:FS01:Connecting to 171.64.65.105:8080
22:41:11:WU00:FS01:Upload complete
22:41:11:WU00:FS01:Server responded WORK_ACK (400)
My plan is to reboot to do any cleanup after installing the Toolkit, then reinstall 310.90 to make sure all updates are compatible with the newer drivers. Then I'll try the CUDA benchmarks again.
NOTE: Installing the 310.90 graphics drivers also installs something called NVIDIA Update, which probably revises some of the supporting .dlls that the TOOLKIT delivered.
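
For anyone who wants to check how often this happens in their own log, here is a rough Python sketch that pulls out the "FahCore returned" lines and counts them by status. The line pattern is an assumption based only on the two excerpts above, so other client versions may format these lines differently, and the function names are just illustrative.

```python
import re
from collections import Counter

# Pattern inferred from the excerpts above, e.g.:
# 02:49:38:WARNING:WU01:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
# This is an assumption; other client versions may log differently.
RETURN_RE = re.compile(
    r"^(\d{2}:\d{2}:\d{2}):(?:WARNING:)?(WU\d+):(FS\d+):"
    r"FahCore returned: (\w+) \((\d+) = 0x[0-9a-fA-F]+\)"
)


def find_core_returns(lines):
    """Return (time, work unit, slot, status, code) for each matching line."""
    results = []
    for line in lines:
        m = RETURN_RE.match(line.strip())
        if m:
            time, wu, slot, status, code = m.groups()
            results.append((time, wu, slot, status, int(code)))
    return results


def count_by_status(lines):
    """Count how many times each core return status appears."""
    return Counter(status for _, _, _, status, _ in find_core_returns(lines))


if __name__ == "__main__":
    # Sample lines copied from the log excerpts in this post.
    sample = [
        "02:49:38:WU01:FS01:0x15:Folding@home Core Shutdown: UNSTABLE_MACHINE",
        "02:49:38:WARNING:WU01:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)",
        "22:41:11:WARNING:WU00:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)",
    ]
    for row in find_core_returns(sample):
        print(row)
    print(count_by_status(sample))
```

To run it against a real log, pass an open file object instead of the sample list, since both functions accept any iterable of lines.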