Page 1 of 1

Project: 2669 (Run 8, Clone 44, Gen 93) : too many steps

Posted: Fri Mar 13, 2009 6:22 am
by Adak
This is my "wild child" WU:

Code: Select all

Current Work Unit
-----------------
Name: Gromacs
Tag: P2669R8C44G93
Download time: March 12 10:56:46
Due time: March 15 10:56:46
[color=#FF0000]Progress: 22884808%  [/color][||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||]
And that is the most progressing WU, I've ever seen! :)

It's also taking 91/2 HOURS on my Quad, per 1% of progress, according to the log:


Each checkpoint is 30 minutes on a 2.5GHz dual Quad Xeon 5420 that does nothing but fold 24/7 atm, with 8.04 Ubuntu Linux.

Code: Select all

[10:56:46] *------------------------------*
[10:56:46] Folding@Home Gromacs SMP Core
[10:56:46] Version 2.02 (Wed Aug 27 13:11:25 PDT 2008)
[10:56:46] 
[10:56:46] Preparing to commence simulation
[10:56:46] - Ensuring status. Please wait.
[10:56:47] Called DecompressByteArray: compressed_data_size=4830446 data_size=23976217, decompressed_data_size=23976217 diff=0
[10:56:47] - Digital signature verified
[10:56:47] 
[10:56:47] Project: 2669 (Run 8, Clone 44, Gen 93)
[10:56:47] 
[10:56:47] Assembly optimizations on if available.
[10:56:47] Entering M.D.
[10:56:57] (Run 8, Clone 44, Gen 93)
[10:56:57] 
[10:56:57] Entering M.D.
NNODES=4, MYRANK=0, HOSTNAME=adak-Rocketfish
NNODES=4, MYRANK=2, HOSTNAME=adak-Rocketfish
NNODES=4, MYRANK=3, HOSTNAME=adak-Rocketfish
NODEID=2 argc=19
NODEID=3 argc=19
NNODES=4, MYRANK=1, HOSTNAME=adak-Rocketfish
NODEID=0 argc=19
                         :-)  G  R  O  M  A  C  S  (-:

                   Groningen Machine for Chemical Simulation

                 :-)  VERSION 3.3.99_development_200800503  (-:


      Written by David van der Spoel, Erik Lindahl, Berk Hess, and others.
       Copyright (c) 1991-2000, University of Groningen, The Netherlands.
             Copyright (c) 2001-2008, The GROMACS development team,
            check out http://www.gromacs.org for more information.


                                :-)  mdrun  (-:

Reading file work/wudata_00.tpr, VERSION 3.3.99_development_20070618 (single precision)
NODEID=1 argc=19
Note: tpx file_version 48, software version 56
Making 1D domain decomposition 1 x 1 x 4
starting mdrun '22860 system'
18749999 steps,  37500.0 ps.

Writing checkpoint, step 4760880 at Thu Mar 12 04:27:04 2009
[11:28:53] - Autosending finished units... [March 12 11:28:53 UTC]
[11:28:53] Trying to send all finished work units
[11:28:53] + No unsent completed units remaining.
[11:28:53] - Autosend completed

Writing checkpoint, step 4771730 at Thu Mar 12 04:57:04 2009

Writing checkpoint, step 4782600 at Thu Mar 12 05:27:05 2009

Writing checkpoint, step 4793450 at Thu Mar 12 05:57:05 2009



Writing checkpoint, step 4934790 at Thu Mar 12 12:27:05 2009
[19:34:34] 1%)

Writing checkpoint, step 4945650 at Thu Mar 12 12:57:04 2009

Writing checkpoint, step 4956530 at Thu Mar 12 13:27:05 2009

Writing checkpoint, step 4967420 at Thu Mar 12 13:57:05 2009

Writing checkpoint, step 4978310 at Thu Mar 12 14:27:04 2009

Writing checkpoint, step 4989180 at Thu Mar 12 14:57:05 2009

Writing checkpoint, step 5000060 at Thu Mar 12 15:27:05 2009

Writing checkpoint, step 5010940 at Thu Mar 12 15:57:05 2009

Writing checkpoint, step 5021820 at Thu Mar 12 16:27:05 2009
[23:28:53] - Autosending finished units... [March 12 23:28:53 UTC]
[23:28:53] Trying to send all finished work units
[23:28:53] + No unsent completed units remaining.
[23:28:53] - Autosend completed

Writing checkpoint, step 5033250 at Thu Mar 12 16:57:05 2009

Writing checkpoint, step 5044370 at Thu Mar 12 17:27:05 2009

Writing checkpoint, step 5055510 at Thu Mar 12 17:57:04 2009

Writing checkpoint, step 5066670 at Thu Mar 12 18:27:05 2009

Writing checkpoint, step 5077830 at Thu Mar 12 18:57:04 2009

Writing checkpoint, step 5088980 at Thu Mar 12 19:27:05 2009

Writing checkpoint, step 5100110 at Thu Mar 12 19:57:04 2009

Writing checkpoint, step 5111260 at Thu Mar 12 20:27:05 2009

Writing checkpoint, step 5122410 at Thu Mar 12 20:57:05 2009
[04:04:05] Completed 375009 out of 18749999 steps  (2%)
The other smp WU is proceeding normally, and this one will be stopped, of course.

Re: Project 2669 - We're making *Progress*!! :)

Posted: Fri Mar 13, 2009 11:19 am
by toTOW
Thanks for the report. I marked this WU as a bad one.

You should upgrade your core to 2.04 to avoid future occurrences of this bug.