13001 (327, 2, 79) 99.99% and hangs
Posted: Sat Feb 14, 2015 1:42 am
My apologies if I've posted this in the wrong place for help. Feel free to move it if necessary.
I've recently got my GPU folding by upgrading my drivers (currently running 14.12 drivers.) It processed the first project fine, total point count of 17,000+ over 10 days. Second project gave me 8,000 points over 4 days.
I got a new project (13000) about 4 days ago. When it started out, the estimated time for completion was about 10 days, at roughly 17,000 points. The third day (yesterday) into this project , I noticed that it had dropped down to 4 days left for completion. Last night, it really sped up, showing about 2 hours for completion, and total point count started showing 152,000+. I've NEVER seen point counts that high. Well, the two hours pass, progress shows 99.99% and hangs. I let it process for a few more hours last night thinking a collection server might be down etc. But it still showed 99.99% when I shut down my system for the night.
I was hoping that what I thought was an error, would correct itself overnight with a system shutdown. System starts out today, progress starts out at 32-33% with time to finish less than 2 hours. Still shows 152,000+ PPD. Progress gets to 99.99% and hangs again. I assumed it was a bad WU, so I deleted it and retrieved another.
I got 13001 (327, 2, 79) a few hours ago, it starts fine, shows roughly 2 hours to complete, and 152,000+ PPD. Two hours pass, 99.99% and hangs. I can see now in the log that it's processing the WU, and it just passed 3% a short time ago. The GPU is active, but I find it really strange that it zips through progress and then hangs at 99.99%.
Is this an error? Should I be concerned? Am I overreacting?
My system is i5 750, Radeon 5750, Win 7 64bit SP1, 16GB ram. Stock clocks on both CPU and GPU. If any additional information is needed, please let me know.
I've recently got my GPU folding by upgrading my drivers (currently running 14.12 drivers.) It processed the first project fine, total point count of 17,000+ over 10 days. Second project gave me 8,000 points over 4 days.
I got a new project (13000) about 4 days ago. When it started out, the estimated time for completion was about 10 days, at roughly 17,000 points. The third day (yesterday) into this project , I noticed that it had dropped down to 4 days left for completion. Last night, it really sped up, showing about 2 hours for completion, and total point count started showing 152,000+. I've NEVER seen point counts that high. Well, the two hours pass, progress shows 99.99% and hangs. I let it process for a few more hours last night thinking a collection server might be down etc. But it still showed 99.99% when I shut down my system for the night.
I was hoping that what I thought was an error, would correct itself overnight with a system shutdown. System starts out today, progress starts out at 32-33% with time to finish less than 2 hours. Still shows 152,000+ PPD. Progress gets to 99.99% and hangs again. I assumed it was a bad WU, so I deleted it and retrieved another.
I got 13001 (327, 2, 79) a few hours ago, it starts fine, shows roughly 2 hours to complete, and 152,000+ PPD. Two hours pass, 99.99% and hangs. I can see now in the log that it's processing the WU, and it just passed 3% a short time ago. The GPU is active, but I find it really strange that it zips through progress and then hangs at 99.99%.
Is this an error? Should I be concerned? Am I overreacting?
My system is i5 750, Radeon 5750, Win 7 64bit SP1, 16GB ram. Stock clocks on both CPU and GPU. If any additional information is needed, please let me know.
Code: Select all
*********************** Log Started 2015-02-13T21:27:57Z ***********************
21:27:58:WU02:FS01:Cleaning up
21:27:59:WU00:FS01:Connecting to 171.67.108.200:80
21:28:00:WU00:FS01:Assigned to work server 140.163.4.231
21:28:00:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:Juniper [Radeon HD 5700/6750] from 140.163.4.231
21:28:00:WU00:FS01:Connecting to 140.163.4.231:8080
21:28:00:WU00:FS01:Downloading 4.84MiB
21:28:06:WU00:FS01:Download 62.02%
21:28:08:WU00:FS01:Download complete
21:28:08:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13001 run:327 clone:2 gen:79 core:0x17 unit:0x0000009a538b3db75328ac6a663613c8
21:28:08:WU00:FS01:Starting
21:28:08:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 3632 -checkpoint 15 -gpu 0 -gpu-vendor ati
21:28:08:WU00:FS01:Started FahCore on PID 1744
21:28:08:WU00:FS01:Core PID:4504
21:28:08:WU00:FS01:FahCore 0x17 started
21:28:09:WU00:FS01:0x17:*********************** Log Started 2015-02-13T21:28:09Z ***********************
21:28:09:WU00:FS01:0x17:Project: 13001 (Run 327, Clone 2, Gen 79)
21:28:09:WU00:FS01:0x17:Unit: 0x0000009a538b3db75328ac6a663613c8
21:28:09:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
21:28:09:WU00:FS01:0x17:Machine: 1
21:28:09:WU00:FS01:0x17:Reading tar file state.xml
21:28:10:WU00:FS01:0x17:Reading tar file system.xml
21:28:11:WU00:FS01:0x17:Reading tar file integrator.xml
21:28:11:WU00:FS01:0x17:Reading tar file core.xml
21:28:11:WU00:FS01:0x17:Digital signatures verified
21:28:11:WU00:FS01:0x17:Folding@home GPU core17
21:28:11:WU00:FS01:0x17:Version 0.0.52
21:29:05:FS01:Paused
21:29:05:FS01:Shutting core down
21:29:05:WU00:FS01:0x17:WARNING:Console control signal 1 on PID 4504
21:29:05:WU00:FS01:0x17:Exiting, please wait. . .
21:29:48:FS01:Unpaused
21:31:02:FS01:Paused
21:31:02:FS01:Shutting core down
21:32:03:WARNING:FS01:Killing WU00
21:32:03:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
21:41:27:FS01:Unpaused
21:41:27:WU00:FS01:Starting
21:41:27:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 3632 -checkpoint 15 -gpu 0 -gpu-vendor ati
21:41:27:WU00:FS01:Started FahCore on PID 1560
21:41:27:WU00:FS01:Core PID:1332
21:41:27:WU00:FS01:FahCore 0x17 started
21:41:28:WU00:FS01:0x17:*********************** Log Started 2015-02-13T21:41:27Z ***********************
21:41:28:WU00:FS01:0x17:Project: 13001 (Run 327, Clone 2, Gen 79)
21:41:28:WU00:FS01:0x17:Unit: 0x0000009a538b3db75328ac6a663613c8
21:41:28:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
21:41:28:WU00:FS01:0x17:Machine: 1
21:41:28:WU00:FS01:0x17:Reading tar file state.xml
21:41:29:WU00:FS01:0x17:Reading tar file system.xml
21:41:30:WU00:FS01:0x17:Reading tar file integrator.xml
21:41:30:WU00:FS01:0x17:Reading tar file core.xml
21:41:30:WU00:FS01:0x17:Digital signatures verified
21:41:30:WU00:FS01:0x17:Folding@home GPU core17
21:41:30:WU00:FS01:0x17:Version 0.0.52
21:45:28:WU00:FS01:0x17:Completed 0 out of 5000000 steps (0%)
21:45:28:WU00:FS01:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
22:56:07:WU00:FS01:0x17:Completed 50000 out of 5000000 steps (1%)
00:05:30:WU00:FS01:0x17:Completed 100000 out of 5000000 steps (2%)
01:14:08:WU00:FS01:0x17:Completed 150000 out of 5000000 steps (3%)