OK, I'm trying to launch again from the last backup made before the first crash.
I'll report back in a few minutes.
Project: 3062 (Run 3, Clone 26, Gen 1)
Re: Project: 3062 (Run 3, Clone 26, Gen 1)
Team #35819 P2P-Community
Re: Project: 3062 (Run 3, Clone 26, Gen 1)
It works
ikki@neptune:~/fah/debug/inst1$ ./fah6 -smp -verbosity 9 -configonly
Note: Please read the license agreement (fah6 -license). Further
use of this software requires that you have read and accepted this agreement.
Folding@Home User Configuration
4 cores detected
--- Opening Log file [January 3 20:46:30]
# SMP Client ##################################################################
###############################################################################
Folding@Home Client Version 6.00beta1
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: /home/ikki/fah/debug/inst1
Executable: ./fah6
Arguments: -smp -verbosity 9 -configonly
[20:46:30] - Ask before connecting: No
[20:46:30] - User name: ikkibis (Team 35819)
[20:46:30] - User ID: 6016C7B6674B5FDE
[20:46:30] - Machine ID: 1
[20:46:30]
[20:46:30] Configuring Folding@Home...
User name [ikkibis]?
Team Number [35819]?
Passkey [mypasskey]
Ask before fetching/sending work (no/yes) [no]?
Use proxy (yes/no) [no]?
Acceptable size of work assignment and work result packets (bigger units
may have large memory demands) -- 'small' is <5MB, 'normal' is <10MB, and
'big' is >10MB (small/normal/big) [big]?
Change advanced options (yes/no) [no]? yes
Core Priority (idle/low) [idle]?
Disable highly optimized assembly code (no/yes) [no]?
Interval, in minutes, between checkpoints (3-30) [15]?
Memory, in MB, to indicate (2014 available) [2014]?
Set -advmethods flag always, requesting new advanced
scientific cores and/or work units if available (no/yes) [yes]?
Ignore any deadline information (mainly useful if
system clock frequently has errors) (no/yes) [no]? yes
Machine ID (1-16) [1]? 2
[20:46:58] - Ask before connecting: No
[20:46:58] - User name: ikkibis (Team 35819)
[20:46:58] - User ID: 6016C7B6674B5FDE
[20:46:58] - Machine ID: 2
[20:46:58]
[20:46:58] -configonly flag given, so exiting.
Completed
ikki@neptune:~/fah/debug/inst1$ ./fah6 -smp -verbosity 9
Note: Please read the license agreement (fah6 -license). Further
use of this software requires that you have read and accepted this agreement.
4 cores detected
--- Opening Log file [January 3 20:47:15]
# SMP Client ##################################################################
###############################################################################
Folding@Home Client Version 6.00beta1
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: /home/ikki/fah/debug/inst1
Executable: ./fah6
Arguments: -smp -verbosity 9
[20:47:15] - Ask before connecting: No
[20:47:15] - User name: ikkibis (Team 35819)
[20:47:15] - User ID: 6016C7B6674B5FDE
[20:47:15] - Machine ID: 2
[20:47:15]
[20:47:15] Loaded queue successfully.
[20:47:15] Unit 7's deadline (December 24 20:18) has passed.
[20:47:15]
[20:47:15] + Processing work unit
[20:47:15] Core required: FahCore_a1.exe
[20:47:15] Core found.
[20:47:15] - Autosending finished units...
[20:47:15] Trying to send all finished work units
[20:47:15] + No unsent completed units remaining.
[20:47:15] - Autosend completed
[20:47:15] Working on Unit 07 [January 3 20:47:15]
[20:47:15] + Working ...
[20:47:15] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 07 -checkpoint 15 -verbose -lifeline 16158 -version 600'
[20:47:15]
[20:47:15] *------------------------------*
[20:47:15] Folding@Home Gromacs SMP Core
[20:47:15] Version 1.74 (November 27, 2006)
[20:47:15]
[20:47:15] Preparing to commence simulation
[20:47:15] - Ensuring status. Please wait.
[20:47:32] - Looking at optimizations...
[20:47:32] - Working with standard loops on this execution.
[20:47:32] - Previous termination of core was improper.
[20:47:32] - Going to use standard loops.
[20:47:32] - Files status OK
[20:47:32] - Expanded 607662 -> 3257309 (decompressed 536.0 percent)
[20:47:32]
[20:47:32] Project: 3062 (Run 3, Clone 26, Gen 1)
[20:47:32]
[20:47:32] Entering M.D.
NNODES=4, MYRANK=1, HOSTNAME=neptune
NNODES=4, MYRANK=2, HOSTNAME=neptune
NNODES=4, MYRANK=3, HOSTNAME=neptune
NNODES=4, MYRANK=0, HOSTNAME=neptune
NODEID=0 argc=15
NODEID=3 argc=15
NODEID=1 argc=15
NODEID=2 argc=15
Written by David van der Spoel, Erik Lindahl, Berk Hess, and others.
Copyright (c) 1991-2000, University of Groningen, The Netherlands.
Copyright (c) 2001-2004, The GROMACS development team,
check out http://www.gromacs.org for more information.
This inclusion of Gromacs code in the Folding@Home Core is under
a special license (see http://folding.stanford.edu/gromacs.html)
specially granted to Stanford by the copyright holders. If you
are interested in using Gromacs, visit http://www.gromacs.org where
you can download a free version of Gromacs under
the terms of the GNU General Public License (GPL) as published
by the Free Software Foundation; either version 2 of the License,
or (at your option) any later version.
[20:47:38] Calling FAH init
(single precision)
starting mdrun 'p3062_lambda5_99sb'
5000000 steps, 10000.0 ps.
[20:47:39] mbda5_99sb
[20:47:39] Writing local files
[20:47:39] Completed 2900000 out of 5000000 steps (58 percent)
[20:47:39] Extra SSE boost OK.
[20:47:39]
[20:47:39] Completed 2900000 out of 5000000 steps (58 percent)
[20:47:39] Extra SSE boost OK.
[20:59:45] Writing local files
[20:59:45] Completed 2950000 out of 5000000 steps (59 percent)
[21:12:23] Writing local files
[21:12:23] Completed 3000000 out of 5000000 steps (60 percent)
[21:25:03] Writing local files
[21:25:03] Completed 3050000 out of 5000000 steps (61 percent)
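The progress lines in the log above are enough to estimate how long the rest of the unit will take. The following is only a minimal sketch, assuming the log has been saved as text and that the session does not cross midnight (the client prints times without a date). The names `parse_progress`, `seconds_per_step`, and `remaining` are made up for this illustration and are not part of the FAH client.

```python
import re
from datetime import datetime, timedelta

# Matches client log lines such as:
# [20:59:45] Completed 2950000 out of 5000000 steps (59 percent)
PROGRESS = re.compile(
    r"\[(\d{2}:\d{2}:\d{2})\] Completed (\d+) out of (\d+) steps"
)

def parse_progress(log_text):
    """Return (time, steps_done, steps_total) tuples, dropping repeated lines."""
    entries = []
    for m in PROGRESS.finditer(log_text):
        entry = (
            datetime.strptime(m.group(1), "%H:%M:%S"),
            int(m.group(2)),
            int(m.group(3)),
        )
        if entry not in entries:  # the client prints the resume point twice
            entries.append(entry)
    return entries

def seconds_per_step(entries):
    """Average wall-clock seconds per step between the first and last entry.

    Assumption: all entries come from one session with no midnight rollover.
    """
    (t0, s0, _), (t1, s1, _) = entries[0], entries[-1]
    return (t1 - t0).total_seconds() / (s1 - s0)

def remaining(entries):
    """Estimated time left, extrapolated from the average rate so far."""
    _, done, total = entries[-1]
    return timedelta(seconds=round(seconds_per_step(entries) * (total - done)))

# Progress lines copied from the log in this post.
SAMPLE = """\
[20:47:39] Completed 2900000 out of 5000000 steps (58 percent)
[20:47:39] Extra SSE boost OK.
[20:47:39]
[20:47:39] Completed 2900000 out of 5000000 steps (58 percent)
[20:47:39] Extra SSE boost OK.
[20:59:45] Writing local files
[20:59:45] Completed 2950000 out of 5000000 steps (59 percent)
[21:12:23] Writing local files
[21:12:23] Completed 3000000 out of 5000000 steps (60 percent)
[21:25:03] Writing local files
[21:25:03] Completed 3050000 out of 5000000 steps (61 percent)
"""

entries = parse_progress(SAMPLE)
print("estimated time remaining:", remaining(entries))
```

The estimate is a straight extrapolation, so it will drift if the machine is loaded unevenly or the client is restarted again.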
Team #35819 P2P-Community
Re: Project: 3062 (Run 3, Clone 26, Gen 1)
_ikki_ wrote: It works
OK, so the WU that you published probably won't be as much help as I thought it might.
Likely conclusions:
The EUE was not caused by your hardware or configuration issues.
Since it was repeatable on your machine, it might help if ChelseaOilman forced it to restart from 0 to see if the EUE in the full run is repeatable for him.
Did anybody check carefully for memory leaks / handle leaks?
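On Linux, one rough way to check is to sample the core's resident memory and open file descriptors from `/proc` while the unit runs. This is a minimal sketch under stated assumptions, not a diagnostic tool: `snapshot` and `looks_like_leak` are hypothetical helper names, the 10% growth threshold is arbitrary, and steady growth is only a hint of a leak, not proof.

```python
import os

def snapshot(pid):
    """Return (resident_kib, open_fds) for a process, read from Linux /proc.

    resident_kib is None if the kernel reports no VmRSS line for the process.
    """
    rss_kib = None
    with open(f"/proc/{pid}/status") as f:
        for line in f:
            if line.startswith("VmRSS:"):
                rss_kib = int(line.split()[1])  # value is reported in kB
                break
    open_fds = len(os.listdir(f"/proc/{pid}/fd"))
    return rss_kib, open_fds

def looks_like_leak(samples, min_growth=0.10):
    """Hypothetical heuristic: the series never decreases and its last value
    exceeds the first by at least min_growth (10% by default)."""
    if len(samples) < 3 or samples[0] <= 0:
        return False
    non_decreasing = all(b >= a for a, b in zip(samples, samples[1:]))
    return non_decreasing and samples[-1] >= samples[0] * (1 + min_growth)

# Example use (pid of one FahCore process, e.g. taken from `ps`):
#   rss, fds = snapshot(pid)
# Collect a sample every few minutes, then test each series:
#   looks_like_leak(rss_samples), looks_like_leak(fd_samples)
```

Reading another user's `/proc/<pid>/fd` needs matching permissions, so run it as the same user as the client.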
Posting FAH's log:
How to provide enough info to get helpful support.
Re: Project: 3062 (Run 3, Clone 26, Gen 1)
bruce wrote: Did anybody check carefully for memory leaks / handle leaks?
No, I didn't.
Team #35819 P2P-Community