[18:49:21] *------------------------------*
[18:49:21] Folding@Home Gromacs SMP Core
[18:49:21] Version 1.95 (2007)
[18:49:21]
[18:49:21] Preparing to commence simulation
[18:49:21] - Ensuring status. Please wait.
[18:49:38] - Assembly optimizations manually forced on.
[18:49:38] - Not checking prior termination.
[18:49:38] Need version 207
[18:49:38] Error: Work unit read from disk is invalid
[18:49:38] Finalizing output
[18:49:41] - Expanded 4844194 -> 23991465 (decompressed 495.2 percent)
[18:49:42]
[18:49:42] Project: 2669 (Run 4, Clone 19, Gen 111)
[18:49:42]
[18:49:43] Assembly optimizations on if available.
[18:49:43] Entering M.D.
NNODES=4, MYRANK=0, HOSTNAME=Macintosh-3.local
NNODES=4, MYRANK=1, HOSTNAME=Macintosh-3.local
NNODES=4, MYRANK=3, HOSTNAME=Macintosh-3.local
NNODES=4, MYRANK=2, HOSTNAME=Macintosh-3.local
NODEID=0 argc=19
NODEID=1 argc=19
G R O M A C S (-:
Groningen Machine for Chemical Simulation
VERSION 3.3.99_development_20080208 (-:
Written by David van der Spoel, Erik Lindahl, Berk Hess, and others.
Copyright (c) 1991-2000, University of Groningen, The Netherlands.
Copyright (c) 2001-2008, The GROMACS development team,
check out http://www.gromacs.org for more information.
mdrun (-:
NODEID=2 argc=19
Reading file work/wudata_05.tpr, VERSION 3.3.99_development_20070618 (single precision)
NODEID=3 argc=19
Note: tpx file_version 48, software version 54
Making 1D domain decomposition 1 x 1 x 4
starting mdrun '22908 system'
6781886 steps, 13563.8 ps.
[18:49:53] Completed 0 out of 6781886 steps (0 %)
[19:04:58] Timer requesting checkpoint
Writing checkpoint, step 21219500 at Tue Jun 30 04:04:59 2009
Writing checkpoint, step 21220880 at Tue Jun 30 04:19:58 2009
[19:20:05] Timer requesting checkpoint
Writing checkpoint, step 21222260 at Tue Jun 30 04:34:57 2009
[19:35:10] Timer requesting checkpoint
Writing checkpoint, step 21223640 at Tue Jun 30 04:49:56 2009
[19:50:16] Timer requesting checkpoint
Writing checkpoint, step 21225020 at Tue Jun 30 05:04:55 2009
[20:05:22] Timer requesting checkpoint
Writing checkpoint, step 21226400 at Tue Jun 30 05:19:54 2009
[20:20:27] Timer requesting checkpoint
Writing checkpoint, step 21227790 at Tue Jun 30 05:35:00 2009
[20:35:33] Timer requesting checkpoint
Writing checkpoint, step 21229170 at Tue Jun 30 05:49:58 2009
[20:50:38] Timer requesting checkpoint
Writing checkpoint, step 21230550 at Tue Jun 30 06:04:57 2009
[21:05:43] Timer requesting checkpoint
Writing checkpoint, step 21231930 at Tue Jun 30 06:19:56 2009
[21:20:48] Timer requesting checkpoint
Writing checkpoint, step 21233310 at Tue Jun 30 06:34:55 2009
[21:35:54] Timer requesting checkpoint
Writing checkpoint, step 21234690 at Tue Jun 30 06:49:54 2009
[21:50:59] Timer requesting checkpoint
Writing checkpoint, step 21236080 at Tue Jun 30 07:05:00 2009
[22:06:05] Timer requesting checkpoint
Writing checkpoint, step 21237460 at Tue Jun 30 07:20:00 2009
[22:21:10] Timer requesting checkpoint
Project: 2669 (Run 4, Clone 19, Gen 111)
-
- Posts: 4
- Joined: Sat Dec 08, 2007 5:41 am
- Location: Tsuchiura, Japan
This is one of those giant work units. I will delete mine and start again.
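The log for this unit shows the symptom directly: the core reports 6781886 total steps, yet checkpoints are written at steps above 21 million. Below is a minimal Python sketch of that sanity check. It assumes the log lines are worded exactly as quoted in this thread, and the function name `find_step_overruns` is made up for illustration; it is not part of any FAH tool.

```python
import re

# Patterns match the line formats quoted in this thread; other core
# versions may word these lines differently (assumption).
TOTAL_RE = re.compile(r"Completed (\d+) out of (\d+) steps")
CHECKPOINT_RE = re.compile(r"Writing checkpoint, step (\d+)")


def find_step_overruns(log_text):
    """Return (total_steps, [checkpoint steps larger than total_steps]).

    total_steps is None if no "Completed X out of Y steps" line is found.
    """
    total = None
    checkpoints = []
    for line in log_text.splitlines():
        m = TOTAL_RE.search(line)
        if m:
            total = int(m.group(2))
            continue
        m = CHECKPOINT_RE.search(line)
        if m:
            checkpoints.append(int(m.group(1)))
    if total is None:
        return None, []
    return total, [step for step in checkpoints if step > total]


# Lines copied from the log in this thread.
sample = """\
[18:49:53] Completed 0 out of 6781886 steps (0 %)
[19:04:58] Timer requesting checkpoint
Writing checkpoint, step 21219500 at Tue Jun 30 04:04:59 2009
Writing checkpoint, step 21220880 at Tue Jun 30 04:19:58 2009
"""

total, overruns = find_step_overruns(sample)
print(total, overruns)  # 6781886 [21219500, 21220880]
```

Any checkpoint step larger than the stated total suggests the work unit data is inconsistent, which matches the reports in this thread.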
Last edited by susato on Fri Jul 17, 2009 9:21 pm, edited 1 time in total.
Reason: fix formatting in title
-
- Posts: 4
- Joined: Sat Dec 08, 2007 5:41 am
- Location: Tsuchiura, Japan
Re: Project 2669 (Run 4, Clone 19, Gen 111)
Please kill this corrupted unit. I have trashed my work folder and unit info file, but I keep getting the same unit. My computer will remain off until you post that this unit has been killed.
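When reporting a unit that keeps coming back, it helps to quote the exact Project/Run/Clone/Gen values from the log. Here is a small Python sketch that pulls them out and counts how often each one appears. It assumes the "Project: ..." line format shown in this thread, and `count_prcg_lines` is a hypothetical helper name. Note that SMP logs can print the same line more than once per start, so the count is of log lines, not of assignments.

```python
import re
from collections import Counter

# Matches lines such as "Project: 2669 (Run 4, Clone 19, Gen 111)".
PRCG_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")


def count_prcg_lines(log_text):
    """Count occurrences of each (project, run, clone, gen) tuple in a log."""
    return Counter(
        tuple(int(group) for group in match.groups())
        for match in PRCG_RE.finditer(log_text)
    )


sample = """\
[18:49:42] Project: 2669 (Run 4, Clone 19, Gen 111)
[06:05:19] Project: 2669 (Run 4, Clone 19, Gen 111)
"""

counts = count_prcg_lines(sample)
print(counts[(2669, 4, 19, 111)])  # 2
```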
Re: Project 2669 (Run 4, Clone 19, Gen 111)
I notified the appropriate Pande Group member soon after your first post. Until they can take action, there's nothing else that can be done.
If you can't get rid of it by deleting it, change your MachineID to a number you are not using.
Posting FAH's log:
How to provide enough info to get helpful support.
-
- Posts: 4
- Joined: Sat Dec 08, 2007 5:41 am
- Location: Tsuchiura, Japan
Re: Project 2669 (Run 4, Clone 19, Gen 111)
Thanks, Bruce, for alerting the researcher in charge. Also, your trick of changing the MachineID worked, so I am up and running again with a different WU.
-
- Posts: 62
- Joined: Sun Dec 02, 2007 6:02 am
Re: Project 2669 (Run 4, Clone 19, Gen 111)
This bad work unit is still in the wild, as I've just got it again.
Can you put the word out again?
Luck ...............
[06:05:16] Folding@Home Gromacs SMP Core
[06:05:16] Version 2.07 (Sun Apr 19 14:51:09 PDT 2009)
[06:05:16]
[06:05:16] Preparing to commence simulation
[06:05:16] - Ensuring status. Please wait.
[06:05:17] Called DecompressByteArray: compressed_data_size=4844194 data_size=23991465, decompressed_data_size=23991465 diff=0
[06:05:19] - Digital signature verified
[06:05:19]
[06:05:19] Project: 2669 (Run 4, Clone 19, Gen 111)
[06:05:19]
[06:05:19] Assembly optimizations on if available.
[06:05:19] Entering M.D.
[06:05:28] on if available.
[06:05:28] Entering M.D.
[06:05:38] Completed 0 out of 6781886 steps (0%)
Luck ...............
Re: Project 2669 (Run 4, Clone 19, Gen 111)
PM sent. Thanks for the report.
Re: Project 2669 (Run 4, Clone 19, Gen 111)
[20:26:06] - Ask before connecting: No
[20:26:06] - User name: markp1989 (Team 45032)
[20:26:06] - User ID: 3F94ED4322D297FC
[20:26:06] - Machine ID: 1
[20:26:06]
[20:26:06] Loaded queue successfully.
[20:26:06]
[20:26:06] + Processing work unit
[20:26:06] Core required: FahCore_a2.exe
[20:26:06] Core found.
[20:26:06] Working on Unit 01 [July 11 20:26:06]
[20:26:06] + Working ...
[20:26:06]
[20:26:06] *------------------------------*
[20:26:06] Folding@Home Gromacs SMP Core
[20:26:06] Version 2.07 (Sun Apr 19 14:51:09 PDT 2009)
[20:26:06]
[20:26:06] Preparing to commence simulation
[20:26:06] - Ensuring status. Please wait.
[20:26:07] Called DecompressByteArray: compressed_data_size=4844194 data_size=23991465, decompressed_data_size=23991465 diff=0
[20:26:07] Called DecompressByteArray: compressed_data_size=4844194 data_size=23991465, decompressed_data_size=23991465 diff=0
[20:26:07] - Digital signature verified
[20:26:07]
[20:26:07] Project: 2669 (Run 4, Clone 19, Gen 111)
[20:26:07]
[20:26:07] Assembly optimizations on if available.
[20:26:07] Entering M.D.
[20:26:07] - Digital signature verified
[20:26:07]
[20:26:07] Project: 2669 (Run 4, Clone 19, Gen 111)
[20:26:07]
[20:26:07] Assembly optimizations on if available.
[20:26:07] Entering M.D.
[20:26:13] Using Gromacs checkpoints
[20:26:13] Using Gromacs checkpoints
[20:26:17]
[20:26:17] Entering M.D.
[20:26:17]
[20:26:17] Entering M.D.
[20:26:23] Using Gromacs checkpoints
[20:26:23] Using Gromacs checkpoints
[20:26:28] Resuming from checkpoint
[20:26:28] Verified work/wudata_01.log
[20:26:28] Verified work/wudata_01.trr
[20:26:28] Verified work/wudata_01.xtc
[20:26:28] Verified work/wudata_01.edr
[20:26:28] Resuming from checkpoint
[20:26:28] Verified work/wudata_01.log
[20:26:28] Verified work/wudata_01.trr
[20:26:28] Verified work/wudata_01.xtc
[20:26:28] Verified work/wudata_01.edr
[20:26:28] Completed 40496 out of 6781886 steps (0%)
Edit: I just looked in the unit info file. It says I have completed 56117697% of the WU, so I guess I'll delete it and carry on with another WU.
Re: Project 2669 (Run 4, Clone 19, Gen 111)
Mark, it's a known bad WU, so just delete it. If you're using the console client, the best way is to stop folding, then start the client again with the -delete xx flag, where xx is the queue position of the bad unit (in your case, xx = 01). The client will start up, delete the bad WU from the queue, then shut down again. Then you can restart Folding using your usual flags.
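As a rough sketch of the delete step described above, the Python helper below builds the command line for a given queue position. Only the -delete flag and the two-digit position ("01") come from this thread. The client path `./fah6` and the helper name `build_delete_command` are assumptions for illustration, so substitute whatever your console client binary is actually called. The sketch only builds the argument list; it does not run the client.

```python
def build_delete_command(client_path, queue_slot):
    """Build the argv list for a one-off "-delete xx" run of the console client.

    client_path is a placeholder for the user's own client binary (assumption).
    queue_slot is the queue position of the bad unit, e.g. 1 for "Unit 01".
    """
    if isinstance(queue_slot, bool) or not isinstance(queue_slot, int) or queue_slot < 0:
        raise ValueError("queue_slot must be a non-negative integer")
    # The post above writes the position with two digits ("01").
    return [client_path, "-delete", f"{queue_slot:02d}"]


cmd = build_delete_command("./fah6", 1)  # "./fah6" is a hypothetical path
print(" ".join(cmd))  # ./fah6 -delete 01
```

Stop the running client first, run this command once, and then restart with your usual flags, as described above.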
Re: Project 2669 (Run 4, Clone 19, Gen 111)
I just got this one assigned to me this morning... Hmmm... Deleting and moving on.
Re: Project: 2669 (Run 4, Clone 19, Gen 111)
I just got this one again!... Hmmm!!!... Deleting again and moving on.
Re: Project: 2669 (Run 4, Clone 19, Gen 111)
Thanks for the new report, Don. Another PM sent.