Page 1 of 1

Project: 2606 (Run 9, Clone 167, Gen 58)

Posted: Mon Jun 02, 2008 4:55 pm
by MoneyGuyBK
FYI.... 2605s have been working fine until I got this EUE message that I have never seen before:

Code: Select all

[18:58:36] Completed 490000 out of 500000 steps  (98 percent)
[19:10:34] Writing local files
[19:10:34] Completed 495000 out of 500000 steps  (99 percent)
[19:22:33] Writing local files
[19:22:33] Completed 500000 out of 500000 steps  (100 percent)
[19:22:33] Writing final coordinates.
[19:22:33] Past main M.D. loop
[19:22:34] Will end MPI now
[19:23:33] 
[19:23:33] Finished Work Unit:
[19:23:33] - Reading up to 3723696 from "work/wudata_08.arc": Read 3723696
[19:23:34] - Reading up to 1939276 from "work/wudata_08.xtc": Read 1939276
[19:23:34] goefile size: 0
[19:23:34] logfile size: 24941
[19:23:34] Leaving Run
[19:23:38] - Writing 5692313 bytes of core data to disk...
[19:23:38]   ... Done.
[19:23:39] - Shutting down core
[19:23:39] 
[19:23:39] Folding@home Core Shutdown: FINISHED_UNIT
[19:23:44] CoreStatus = 64 (100)
[19:23:44] Unit 8 finished with 78 percent of time to deadline remaining.
[19:23:44] Updated performance fraction: 0.785595
[19:23:44] Sending work to server


[19:23:44] + Attempting to send results
[19:23:44] - Reading file work/wuresults_08.dat from core
[19:23:44]   (Read 5692313 bytes from disk)
[19:23:44] Connecting to http://171.64.65.56:8080/
[19:25:13] Posted data.
[19:25:13] Initial: 0000; - Uploaded at ~61 kB/s
[19:25:14] - Averaged speed for that direction ~59 kB/s
[19:25:14] + Results successfully sent
[19:25:14] Thank you for your contribution to Folding@Home.
[19:25:14] + Number of Units Completed: 26

[19:29:18] - Warning: Could not delete all work unit files (8): Core returned invalid code
[19:29:18] Trying to send all finished work units
[19:29:18] + No unsent completed units remaining.
[19:29:18] - Preparing to get new work unit...
[19:29:18] + Attempting to get work packet
[19:29:18] - Will indicate memory of 512 MB
[19:29:18] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 8
[19:29:18] - Connecting to assignment server
[19:29:18] Connecting to http://assign.stanford.edu:8080/
[19:29:18] Posted data.
[19:29:18] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[19:29:18] + News From Folding@Home: Welcome to Folding@Home
[19:29:19] Loaded queue successfully.
[19:29:19] Connecting to http://171.64.65.56:8080/
[19:29:21] Posted data.
[19:29:21] Initial: 0000; - Receiving payload (expected size: 2448457)
[19:29:23] - Downloaded at ~1195 kB/s
[19:29:23] - Averaged speed for that direction ~1520 kB/s
[19:29:23] + Received work.
[19:29:23] Trying to send all finished work units
[19:29:23] + No unsent completed units remaining.
[19:29:23] + Closed connections
[19:29:23] 
[19:29:23] + Processing work unit
[19:29:23] Core required: FahCore_a1.exe
[19:29:23] Core found.
[19:29:23] Working on Unit 09 [June 1 19:29:23]
[19:29:23] + Working ...
[19:29:23] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 09 -checkpoint 30 -verbose -lifeline 5586 -version 602'

[19:29:24] 
[19:29:24] *------------------------------*
[19:29:24] Folding@Home Gromacs SMP Core
[19:29:24] Version 1.74 (November 27, 2006)
[19:29:24] 
[19:29:24] Preparing to commence simulation
[19:29:24] - Ensuring status. Please wait.
[19:29:24] - Starting from initial work packet
[19:29:24] 
[19:29:24] Project: 2605 (Run 9, Clone 237, Gen 56)
[19:29:24] 
[19:29:24] Assembly optimizations on if available.
[19:29:24] Entering M.D.
[19:29:41]  percent)
[19:29:41] - Starting from initial work packet
[19:29:41] 
[19:29:41] Project: 2605 (Run 9, Clone 237, Gen 56)
[19:29:41] 
[19:29:41] Entering M.D.
[19:29:48] Protein: Protein in POPC
[19:29:48] Writing local files
[19:29:49] Extra SSE boost OK.
[19:41:53] es
[19:41:53] Completed 5000 out of 500000 steps  (1 percent)
[19:53:51] Writing local files
[19:53:52] Completed 10000 out of 500000 steps  (2 percent)
[20:05:48] Writing local files
* Truncated
[14:55:32] Completed 485000 out of 500000 steps  (97 percent)
[15:07:32] Writing local files
[15:07:32] Completed 490000 out of 500000 steps  (98 percent)
[15:19:35] Writing local files
[15:19:35] Completed 495000 out of 500000 steps  (99 percent)
[15:19:55] - Autosending finished units...
[15:19:55] Trying to send all finished work units
[15:19:55] + No unsent completed units remaining.
[15:19:55] - Autosend completed
[15:31:38] Writing local files
[15:31:38] Completed 500000 out of 500000 steps  (100 percent)
[15:31:38] Writing final coordinates.
[15:31:38] Past main M.D. loop
[15:31:38] Will end MPI now
[15:32:38] 
[15:32:38] Finished Work Unit:
[15:32:38] - Reading up to 3721200 from "work/wudata_09.arc": Read 3721200
[15:32:38] - Reading up to 1774832 from "work/wudata_09.xtc": Read 1774832
[15:32:38] goefile size: 0
[15:32:38] logfile size: 17315
[15:32:38] Leaving Run
[15:32:41] - Writing 5517747 bytes of core data to disk...
[15:32:41]   ... Done.
[15:32:42] - Shutting down core
[15:32:42] 
[15:32:42] Folding@home Core Shutdown: FINISHED_UNIT
[15:32:47] CoreStatus = 64 (100)
[15:32:47] Unit 9 finished with 79 percent of time to deadline remaining.
[15:32:47] Updated performance fraction: 0.786691
[15:32:47] Sending work to server


[15:32:47] + Attempting to send results
[15:32:47] - Reading file work/wuresults_09.dat from core
[15:32:47]   (Read 5517747 bytes from disk)
[15:32:47] Connecting to http://171.64.65.56:8080/
[15:34:12] Posted data.
[15:34:12] Initial: 0000; - Uploaded at ~62 kB/s
[15:34:13] - Averaged speed for that direction ~60 kB/s
[15:34:13] + Results successfully sent
[15:34:13] Thank you for your contribution to Folding@Home.
[15:34:13] + Number of Units Completed: 27

[15:38:18] - Warning: Could not delete all work unit files (9): Core returned invalid code
[15:38:18] Trying to send all finished work units
[15:38:18] + No unsent completed units remaining.
[15:38:18] - Preparing to get new work unit...
[15:38:18] + Attempting to get work packet
[15:38:18] - Will indicate memory of 512 MB
[15:38:18] - Connecting to assignment server
[15:38:18] Connecting to http://assign.stanford.edu:8080/
[15:38:18] Posted data.
[15:38:18] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[15:38:18] + News From Folding@Home: Welcome to Folding@Home
[15:38:18] Loaded queue successfully.
[15:38:18] Connecting to http://171.64.65.56:8080/
[15:38:21] Posted data.
[15:38:21] Initial: 0000; - Receiving payload (expected size: 2437794)
[15:38:22] - Downloaded at ~2380 kB/s
[15:38:22] - Averaged speed for that direction ~1692 kB/s
[15:38:22] + Received work.
[15:38:22] Trying to send all finished work units
[15:38:22] + No unsent completed units remaining.
[15:38:22] + Closed connections
[15:38:22] 
[15:38:22] + Processing work unit
[15:38:22] Core required: FahCore_a1.exe
[15:38:22] Core found.
[15:38:22] Working on Unit 00 [June 2 15:38:22]
[15:38:22] + Working ...
[15:38:22] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 00 -checkpoint 30 -verbose -lifeline 5586 -version 602'

[15:38:22] 
[15:38:22] *------------------------------*
[15:38:22] Folding@Home Gromacs SMP Core
[15:38:22] Version 1.74 (November 27, 2006)
[15:38:22] 
[15:38:22] Preparing to commence simulation
[15:38:22] - Ensuring status. Please wait.
[15:38:22] - Starting from initial work packet
[15:38:22] 
[15:38:22] Project: 2605 (Run 9, Clone 167, Gen 58)
[15:38:22] 
[15:38:22] Assembly optimizations on if available.
[15:38:22] Entering M.D.
[15:38:39]  percent)
[15:38:39] cket
[15:38:39] 
[15:38:39] Project: 2605 (Run 9, Clone 167, Gen 58)
[15:38:39] 
[15:38:39] Entering M.D.
[15:38:40] ne 167, Gen 58)
[15:38:40] 
[15:38:40] Entering M.D.
[15:38:47] Protein: Protein in POPC
[15:38:47] Writing local files
[15:38:47] Extra SSE boost OK.
[15:38:48] 0])
[15:38:48] 
[15:38:48] Simulation instability has been encountereinstability has been encountered. The run has entered a
[15:38:48]   state from which no further progress can be madet of the simulation, however if you
[15:38:48]   often see other project units terminating early like this
[15:38:48]   too, you may wish to check the stabike this
[15:38:48]   too, you may wish to check the stability of your computer (issues
[15:38:48]   such as high temperature, overclocking, etc.).
[15:38:48] Going to send back what have done.
[15:38:48] logfile size: 8121  ... Done.
[15:38:48] Finalizing output
[15:38:48] Gromacs errorFinalizing output
[15:38:48] Gromacs error.
[15:38:48]  Shutdown: EARLY_UNIT_END
[15:38:53] CoreStatus = 72 (114)
[15:38:53] Sending work to server


[15:38:53] + Attempting to send results
[15:38:53] - Reading file work/wuresults_00.dat from core
[15:38:53]   (Read 8670 bytes from disk)
[15:38:53] Connecting to http://171.64.65.56:8080/
[15:38:53] Posted data.
[15:38:53] Initial: 0000; Conversation time very short, giving reduced weight in bandwidth avg
[15:38:53] - Uploaded at ~18 kB/s
[15:38:53] - Averaged speed for that direction ~55 kB/s
[15:38:53] + Results successfully sent
[15:38:53] Thank you for your contribution to Folding@Home.
[15:42:58] - Warning: Could not delete all work unit files (0): Core returned invalid code
[15:42:58] Trying to send all finished work units
[15:42:58] + No unsent completed units remaining.
[15:42:58] - Preparing to get new work unit...
[15:42:58] + Attempting to get work packet
[15:42:58] - Will indicate memory of 512 MB
[15:42:58] - Connecting to assignment server
[15:42:58] Connecting to http://assign.stanford.edu:8080/
[15:42:58] Posted data.
[15:42:58] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[15:42:58] + News From Folding@Home: Welcome to Folding@Home
[15:42:58] Loaded queue successfully.
[15:42:58] Connecting to http://171.64.65.56:8080/
[15:43:01] Posted data.
[15:43:01] Initial: 0000; - Receiving payload (expected size: 2443546)
[15:43:02] - Downloaded at ~2386 kB/s
[15:43:02] - Averaged speed for that direction ~1831 kB/s
[15:43:02] + Received work.
[15:43:02] Trying to send all finished work units
[15:43:02] + No unsent completed units remaining.
[15:43:02] + Closed connections
[15:43:07] 
[15:43:07] + Processing work unit
[15:43:07] Core required: FahCore_a1.exe
[15:43:07] Core found.
[15:43:07] Working on Unit 01 [June 2 15:43:07]
[15:43:07] + Working ...
[15:43:07] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 30 -verbose -lifeline 5586 -version 602'

[15:43:08] 
[15:43:08] *------------------------------*
[15:43:08] Folding@Home Gromacs SMP Core
[15:43:08] Version 1.74 (November 27, 2006)
[15:43:08] 
[15:43:08] Preparing to commence simulation
[15:43:08] - Ensuring status. Please wait.
[15:43:08] - Starting from initial work packet
[15:43:08] 
[15:43:08] Project: 2605 (Run 9, Clone 227, Gen 58)
[15:43:08] 
[15:43:08] Assembly optimizations on if available.
[15:43:08] Entering M.D.
[15:43:25]  percent)
[15:43:25] - Starting from initial work packet
[15:43:25] 
[15:43:25] Project: 2605 (Run 9, Clone 227, Gen 58)
[15:43:25] 
[15:43:25] Entering M.D.
[15:43:32] Protein: Protein in POPC
[15:43:32] Writing local files
[15:43:33] Extra SSE boost OK.
[15:55:39] es
[15:55:39] Completed 5000 out of 500000 steps  (1 percent)
[16:07:41] Writing local files
[16:07:41] Completed 10000 out of 500000 steps  (2 percent)
[16:19:42] Writing local files
[16:19:42] Completed 15000 out of 500000 steps  (3 percent)
[16:31:41] Writing local files
[16:31:41] Completed 20000 out of 500000 steps  (4 percent)
[16:43:39] Writing local files
[16:43:39] Completed 25000 out of 500000 steps  (5 percent)



Peace

Re: Project: 2606 (Run 9, Clone 167, Gen 58)

Posted: Mon Jun 02, 2008 5:14 pm
by 7im
That error language Gromacs specific. IMO, it's what happens when a WU tries to fold in a way that is outside the expected parameters for that protein.

For reference that instability language is found in other EUE types: http://fahwiki.net/index.php?title=EUE_Types