Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Wed Apr 04, 2012 11:35 am
by Pick2
Instant EUE 10 times in a row
Debian, V6.34 client. The machine was stable before and after.
[08:39:42] Project: 6097 (Run 0, Clone 36, Gen 192)
[08:39:42]
[08:39:42] Assembly optimizations on if available.
[08:39:42] Entering M.D.
[08:39:48] Mapping NT from 4 to 4
[08:39:49] Completed 0 out of 500000 steps (0%)
[08:39:49] CoreStatus = 8B (139)
[08:39:49] Client-core communications error: ERROR 0x8b
[08:39:49]
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Thu Apr 05, 2012 9:09 am
by PantherX
There isn't anything in the WU Database yet, so I have marked it for follow-up.
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Sat Apr 07, 2012 6:02 am
by rab38505
I've been getting core status 0xFF on this same WU for the last day and a half. It appears to have failed thousands of times in a row, immediately each time. All previous WUs worked fine on this machine, and the new WU I got after forcing the client to stop retrying this one is also working fine. Core i7 920, Win7 x64.
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.34
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: E:\programs\folding-smp3
Service: E:\programs\folding-smp3\fah6
Arguments: -svcstart -d E:\programs\folding-smp3 -bigadv -advmethods -smp 8 -forceasm
Launched as a service.
Entered E:\programs\folding-smp3 to do work.
[05:53:41] - Ask before connecting: No
[05:53:41] - User name: rab38505 (Team 111065)
[05:53:41] - User ID: 21C6CA210D361B40
[05:53:41] - Machine ID: 8
[05:53:41]
[05:53:41] Work directory not found. Creating...
[05:53:41] Could not open work queue, generating new queue...
[05:53:41] - Preparing to get new work unit...
[05:53:41] Cleaning up work directory
[05:53:41] + Attempting to get work packet
[05:53:41] Passkey found
[05:53:41] - Connecting to assignment server
[05:53:42] - Successful: assigned to (128.143.231.202).
[05:53:42] + News From Folding@Home: Welcome to Folding@Home
[05:53:42] Loaded queue successfully.
[05:53:46] + Closed connections
[05:53:46]
[05:53:46] + Processing work unit
[05:53:46] Core required: FahCore_a3.exe
[05:53:46] Core found.
[05:53:46] Working on queue slot 01 [April 7 05:53:46 UTC]
[05:53:46] + Working ...
[05:53:46]
[05:53:46] *------------------------------*
[05:53:46] Folding@Home Gromacs SMP Core
[05:53:46] Version 2.27 (Dec. 15, 2010)
[05:53:46]
[05:53:46] Preparing to commence simulation
[05:53:46] - Assembly optimizations manually forced on.
[05:53:46] - Not checking prior termination.
[05:53:47] - Expanded 3811517 -> 4169428 (decompressed 109.3 percent)
[05:53:47] Called DecompressByteArray: compressed_data_size=3811517 data_size=4169428, decompressed_data_size=4169428 diff=0
[05:53:47] - Digital signature verified
[05:53:47]
[05:53:47] Project: 6097 (Run 0, Clone 36, Gen 192)
[05:53:47]
[05:53:47] Assembly optimizations on if available.
[05:53:47] Entering M.D.
[05:53:53] Mapping NT from 8 to 8
[05:53:53] Completed 0 out of 500000 steps (0%)
[05:53:56] CoreStatus = FF (255)
[05:53:56] Sending work to server
[05:53:56] Project: 6097 (Run 0, Clone 36, Gen 192)
[05:53:56] - Error: Could not get length of results file work/wuresults_01.dat
[05:53:56] - Error: Could not read unit 01 file. Removing from queue.
[05:53:56] - Preparing to get new work unit...
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Mon Apr 09, 2012 10:05 pm
by HutchinsonJC
When I turned off -bigadv I stopped getting this WU and was able to crunch others. Someone at anandtech, replying to my post there, mentioned that 6097 isn't even a -bigadv unit, so I'm not sure how that works. With -bigadv on, I had been crunching with no issues until I got this WU, at which point it turned into a seemingly infinite loop of this error.
[01:40:13] Project: 6097 (Run 0, Clone 36, Gen 192)
[01:40:13]
[01:40:13] Assembly optimizations on if available.
[01:40:13] Entering M.D.
[01:40:19] Mapping NT from 12 to 12
[01:40:19] Completed 0 out of 500000 steps (0%)
[01:40:36] CoreStatus = C0000005 (-1073741819)
[01:40:36] Client-core communications error: ERROR 0xc0000005
[01:40:36] Deleting current work unit & continuing...
[01:41:16] - Preparing to get new work unit...
[01:41:16] Cleaning up work directory
[01:41:16] + Attempting to get work packet
[01:41:16] Passkey found
[01:41:16] - Connecting to assignment server
[01:41:18] - Successful: assigned to (128.143.231.202).
[01:41:18] + News From Folding@Home: Welcome to Folding@Home
[01:41:18] Loaded queue successfully.
[01:42:13] + Closed connections
[01:42:18]
[01:42:18] + Processing work unit
[01:42:18] Core required: FahCore_a3.exe
[01:42:18] Core found.
[01:42:18] Working on queue slot 04 [April 9 01:42:18 UTC]
[01:42:18] + Working ...
[01:42:19]
[01:42:19] *------------------------------*
[01:42:19] Folding@Home Gromacs SMP Core
[01:42:19] Version 2.27 (Dec. 15, 2010)
[01:42:19]
[01:42:19] Preparing to commence simulation
[01:42:19] - Assembly optimizations manually forced on.
[01:42:19] - Not checking prior termination.
[01:42:19] - Expanded 3811517 -> 4169428 (decompressed 109.3 percent)
[01:42:19] Called DecompressByteArray: compressed_data_size=3811517 data_size=4169428, decompressed_data_size=4169428 diff=0
[01:42:19] - Digital signature verified
[01:42:19]
[01:42:19] Project: 6097 (Run 0, Clone 36, Gen 192)
[01:42:19]
[01:42:19] Assembly optimizations on if available.
[01:42:19] Entering M.D.
[01:42:25] Mapping NT from 12 to 12
[01:42:26] Completed 0 out of 500000 steps (0%)
[01:45:21] CoreStatus = C0000005 (-1073741819)
[01:45:21] Client-core communications error: ERROR 0xc0000005
[01:45:21] Deleting current work unit & continuing...
Folding@Home Client Shutdown at user request.
Folding@Home Client Shutdown.
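For anyone comparing the CoreStatus values posted so far in this thread (8B, FF, C0000005): each log line prints the same number twice, once in hex and once in decimal. Below is a minimal Python sketch of that relationship. The function names `to_signed32` and `describe` are made up for illustration, and the interpretation notes are assumptions based on general OS conventions (a Unix exit status of 128 + N usually means the process was killed by signal N, and 0xC0000005 is the Windows access-violation status), not on any Folding@Home documentation.

```python
def to_signed32(hex_text):
    """Interpret a hex string as a 32-bit two's-complement integer."""
    value = int(hex_text, 16) & 0xFFFFFFFF
    # Values with the top bit set are negative when read as signed 32-bit.
    return value - 0x100000000 if value & 0x80000000 else value


def describe(hex_text):
    """Return a hedged guess at what a CoreStatus value might indicate.

    These mappings are assumptions drawn from OS conventions, not from
    the client's own documentation.
    """
    unsigned = int(hex_text, 16) & 0xFFFFFFFF
    if unsigned == 0xC0000005:
        return "Windows STATUS_ACCESS_VIOLATION (assumed NTSTATUS code)"
    if 128 < unsigned <= 128 + 64:
        # On Linux, signal 11 is SIGSEGV (segmentation fault).
        return f"possibly killed by signal {unsigned - 128} (Unix 128 + N convention)"
    return "no common convention matched"


if __name__ == "__main__":
    for code in ("8B", "FF", "C0000005"):
        print(code, to_signed32(code), describe(code))
```

Read this way, the Linux report (8B) and the Windows reports (C0000005) would both point to the core crashing on a bad memory access, which fits a bad WU better than it fits a hardware fault on any one machine. That reading is a guess, not a diagnosis.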
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Tue Apr 10, 2012 12:17 am
by sortofageek
There are now three reports of failure for Project: 6097 (Run 0, Clone 36, Gen 192) in this forum and two reports in the database which appear to be additional. At least those two are from folding names different from your forum names. Consequently, I reported it as a bad WU.
Thanks to all of you for your reports.
The WU (P6097,R0,C36,G192) has been reported as a bad WU. Note that WUs on the reported list are stopped once daily, at 8 am Pacific time.
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Tue Apr 10, 2012 11:34 pm
by Leonardo
I had this work unit fail multiple times at startup yesterday. Interestingly enough, the machine that downloaded it is a dedicated bigadv Folder.
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Wed Apr 11, 2012 6:28 pm
by Biffa
I'm not running bigadv and am still getting this WU.
[18:20:03] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 02 -np 4 -checkpoint 15 -verbose -lifeline 2936 -version 634'
[18:20:03]
[18:20:03] *------------------------------*
[18:20:03] Folding@Home Gromacs SMP Core
[18:20:03] Version 2.27 (Dec. 15, 2010)
[18:20:03]
[18:20:03] Preparing to commence simulation
[18:20:03] - Assembly optimizations manually forced on.
[18:20:03] - Not checking prior termination.
[18:20:05] - Expanded 3811517 -> 4169428 (decompressed 109.3 percent)
[18:20:05] Called DecompressByteArray: compressed_data_size=3811517 data_size=4169428, decompressed_data_size=4169428 diff=0
[18:20:05] - Digital signature verified
[18:20:05]
[18:20:05] Project: 6097 (Run 0, Clone 36, Gen 192)
[18:20:05]
[18:20:05] Assembly optimizations on if available.
[18:20:05] Entering M.D.
[18:20:11] Mapping NT from 4 to 4
[18:20:12] Completed 0 out of 500000 steps (0%)
[18:22:17] CoreStatus = C0000005 (-1073741819)
[18:22:17] Client-core communications error: ERROR 0xc0000005
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Thu Apr 12, 2012 1:43 am
by Joe_H
Biffa wrote: I'm not running bigadv and am still getting this WU
Project 6097 is not bigadv; it is a FahCore A3 project. I would like to be inconvenienced by this, but these A3 WUs require a later revision of the core than is available for OS X.
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Thu Apr 12, 2012 5:22 am
by CannonFodder08
This WU has failed for me numerous times today too.
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Thu Apr 12, 2012 10:24 am
by Biffa
Joe_H wrote:
Biffa wrote: I'm not running bigadv and am still getting this WU
Project 6097 is not bigadv; it is a FahCore A3 project. I would like to be inconvenienced by this, but these A3 WUs require a later revision of the core than is available for OS X.
I know, mate, but earlier someone said that he had -bigadv on and was getting it, then turned bigadv off and stopped getting it. I was just pointing out that this was probably a coincidence.
For the record, I had to remove all work data in the FAH folder and change the client ID to stop getting them.
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Thu Apr 12, 2012 12:18 pm
by HutchinsonJC
Last night I got another one, and it started up another seemingly infinite loop of crashes. I think I'm gonna try that client ID change idea.
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Thu Apr 12, 2012 12:59 pm
by sortofageek
I have flagged the problem you are all having with this WU. Apparently it isn't going to be an easy fix, but I wanted to let you know your reports are appreciated and you have the attention of Pande Group. They are working on it.
Thanks for your participation and your patience.
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Thu Apr 12, 2012 2:19 pm
by kasson
This WU should be stopped now. Thanks for the reports (and to our tireless mods).
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Tue Apr 17, 2012 10:32 pm
by Tim_H
I had this unit fail on me several times today. Here are a few.
[21:42:42] *------------------------------*
[21:42:42] Folding@Home Gromacs SMP Core
[21:42:42] Version 2.27 (Dec. 15, 2010)
[21:42:42]
[21:42:42] Preparing to commence simulation
[21:42:42] - Looking at optimizations...
[21:42:42] - Created dyn
[21:42:42] - Files status OK
[21:42:42] - Expanded 3811517 -> 4169428 (decompressed 109.3 percent)
[21:42:42] Called DecompressByteArray: compressed_data_size=3811517 data_size=4169428, decompressed_data_size=4169428 diff=0
[21:42:42] - Digital signature verified
[21:42:42]
[21:42:42] Project: 6097 (Run 0, Clone 36, Gen 192)
[21:42:42]
[21:42:42] Assembly optimizations on if available.
[21:42:42] Entering M.D.
[21:42:48] Mapping NT from 2 to 2
[21:42:49] Completed 0 out of 500000 steps (0%)
[21:42:50] CoreStatus = 8B (139)
[21:42:50] Client-core communications error: ERROR 0x8b
[21:42:50] Deleting current work unit & continuing...
[21:44:09] Trying to send all finished work units
[21:44:09] + No unsent completed units remaining.
[21:44:09] - Preparing to get new work unit...
[21:44:09] Cleaning up work directory
[21:44:09] + Attempting to get work packet
[21:44:09] Passkey found
[21:44:09] - Will indicate memory of 2008 MB
[21:44:09] - Connecting to assignment server
[21:44:09] Connecting to http://assign.stanford.edu:8080/
[21:44:09] Posted data.
[21:44:09] Initial: 8F80; - Successful: assigned to (128.143.231.202).
[21:44:09] + News From Folding@Home: Welcome to Folding@Home
[21:44:10] Loaded queue successfully.
[21:44:10] Sent data
[21:44:10] Connecting to http://128.143.231.202:8080/
[21:44:12] Posted data.
[21:44:12] Initial: 0000; - Receiving payload (expected size: 3812029)
[21:44:15] - Downloaded at ~1240 kB/s
[21:44:15] - Averaged speed for that direction ~1691 kB/s
[21:44:15] + Received work.
[21:44:15] + Closed connections
[21:44:20]
[21:44:20] + Processing work unit
[21:44:20] Core required: FahCore_a3.exe
[21:44:20] Core found.
[21:44:20] Working on queue slot 02 [April 17 21:44:20 UTC]
[21:44:20] + Working ...
[21:44:20] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 02 -np 2 -priority 96 -checkpoint 15 -verbose -lifeline 1561 -version 634'
[21:44:20]
[21:44:20] *------------------------------*
[21:44:20] Folding@Home Gromacs SMP Core
[21:44:20] Version 2.27 (Dec. 15, 2010)
[21:44:20]
[21:44:20] Preparing to commence simulation
[21:44:20] - Looking at optimizations...
[21:44:20] - Created dyn
[21:44:20] - Files status OK
[21:44:20] - Expanded 3811517 -> 4169428 (decompressed 109.3 percent)
[21:44:20] Called DecompressByteArray: compressed_data_size=3811517 data_size=4169428, decompressed_data_size=4169428 diff=0
[21:44:20] - Digital signature verified
[21:44:20]
[21:44:20] Project: 6097 (Run 0, Clone 36, Gen 192)
[21:44:20]
[21:44:20] Assembly optimizations on if available.
[21:44:20] Entering M.D.
[21:44:26] Mapping NT from 2 to 2
[21:44:27] Completed 0 out of 500000 steps (0%)
[21:44:28] CoreStatus = 8B (139)
[21:44:28] Client-core communications error: ERROR 0x8b
[21:44:28] Deleting current work unit & continuing...
[21:45:49] Trying to send all finished work units
[21:45:49] + No unsent completed units remaining.
[21:45:49] - Preparing to get new work unit...
[21:45:49] Cleaning up work directory
[21:45:49] + Attempting to get work packet
[21:45:49] Passkey found
[21:45:49] - Will indicate memory of 2008 MB
[21:45:49] - Connecting to assignment server
[21:45:49] Connecting to http://assign.stanford.edu:8080/
[21:45:49] Posted data.
[21:45:49] Initial: 8F80; - Successful: assigned to (128.143.231.202).
[21:45:49] + News From Folding@Home: Welcome to Folding@Home
[21:45:49] Loaded queue successfully.
[21:45:49] Sent data
[21:45:49] Connecting to http://128.143.231.202:8080/
[21:45:52] Posted data.
[21:45:52] Initial: 0000; - Receiving payload (expected size: 3812029)
[21:45:54] - Downloaded at ~1861 kB/s
[21:45:54] - Averaged speed for that direction ~1725 kB/s
[21:45:54] + Received work.
[21:45:54] + Closed connections
[21:45:59]
[21:45:59] + Processing work unit
[21:45:59] Core required: FahCore_a3.exe
[21:45:59] Core found.
[21:45:59] Working on queue slot 03 [April 17 21:45:59 UTC]
[21:45:59] + Working ...
[21:45:59] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 03 -np 2 -priority 96 -checkpoint 15 -verbose -lifeline 1561 -version 634'
[21:45:59]
[21:45:59] *------------------------------*
[21:45:59] Folding@Home Gromacs SMP Core
[21:45:59] Version 2.27 (Dec. 15, 2010)
[21:45:59]
[21:45:59] Preparing to commence simulation
[21:45:59] - Looking at optimizations...
[21:45:59] - Created dyn
[21:45:59] - Files status OK
[21:45:59] - Expanded 3811517 -> 4169428 (decompressed 109.3 percent)
[21:45:59] Called DecompressByteArray: compressed_data_size=3811517 data_size=4169428, decompressed_data_size=4169428 diff=0
[21:45:59] - Digital signature verified
[21:45:59]
[21:45:59] Project: 6097 (Run 0, Clone 36, Gen 192)
[21:45:59]
[21:45:59] Assembly optimizations on if available.
[21:45:59] Entering M.D.
[21:46:05] Mapping NT from 2 to 2
[21:46:06] Completed 0 out of 500000 steps (0%)
[21:46:07] CoreStatus = 8B (139)
[21:46:07] Client-core communications error: ERROR 0x8b
[21:46:07]
Folding@Home will go to sleep for 1 day as there have been 5 consecutive Cores executed which failed to complete a work unit.
[21:46:07] (To wake it up early, quit the application and restart it.)
[21:46:07] If problems persist, please visit our website at http://folding.stanford.edu for help.
[21:46:07] + Sleeping...
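Logs like the one above make it easy to miss that every failure is the same Project/Run/Clone/Gen. Here is a rough Python sketch that tallies failed CoreStatus lines per WU from a client log. The function name `count_failures` and the `ok_status` parameter are invented for this example. Treating hex 64 as the "finished" status is an assumption about the v6 client, and the sample lines are trimmed from the log in this post.

```python
import re
from collections import Counter

# Patterns match the log line shapes seen in this thread.
PROJECT_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
STATUS_RE = re.compile(r"CoreStatus = ([0-9A-Fa-f]+) \((-?\d+)\)")


def count_failures(log_lines, ok_status="64"):
    """Count non-success CoreStatus lines per (project, run, clone, gen).

    ok_status is the hex value assumed to mean success (an assumption,
    not taken from client documentation).
    """
    failures = Counter()
    current = None  # most recent WU identity seen in the log
    for line in log_lines:
        project = PROJECT_RE.search(line)
        if project:
            current = tuple(int(part) for part in project.groups())
            continue
        status = STATUS_RE.search(line)
        if status and current is not None:
            if status.group(1).upper() != ok_status.upper():
                failures[current] += 1
    return failures


sample = """\
[21:42:42] Project: 6097 (Run 0, Clone 36, Gen 192)
[21:42:49] Completed 0 out of 500000 steps (0%)
[21:42:50] CoreStatus = 8B (139)
[21:44:20] Project: 6097 (Run 0, Clone 36, Gen 192)
[21:44:28] CoreStatus = 8B (139)
[21:45:59] Project: 6097 (Run 0, Clone 36, Gen 192)
[21:46:07] CoreStatus = 8B (139)
""".splitlines()

if __name__ == "__main__":
    for wu, count in count_failures(sample).items():
        print(wu, count)
```

A count above one for a single WU on an otherwise stable machine is the kind of evidence worth including when reporting a suspected bad WU.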
Re: Project: 6097 (Run 0, Clone 36, Gen 192) EUE
Posted: Tue Apr 17, 2012 11:08 pm
by sortofageek
Thanks for the report. I will let them know.