Project: 2673 (Run 8, Clone 15, Gen 82) too big & slow

Moderators: Site Moderators, FAHC Science Team

Post Reply
susato
Site Moderator
Posts: 511
Joined: Fri Nov 30, 2007 4:57 am
Location: Team MacOSX
Contact:

Project: 2673 (Run 8, Clone 15, Gen 82) too big & slow

Post by susato »

Reported on another forum: this WU has 23000002 steps and is taking 12 hours for each %.

Code: Select all

--- Opening Log file [February 7 18:45:42 UTC]


# Mac OS X SMP Console Edition ################################################
###############################################################################

                      Folding@Home Client Version 6.20

                         http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /Users/bulldogg6982/Library/InCrease/unit2
Executable: /Users/bulldogg6982/Library/InCrease/unit2/fah6
Arguments: -local -advmethods -forceasm -verbosity 9 -smp -smp 8

Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.

[18:45:42] - Ask before connecting: No
[18:45:42] - User name: Spikeanator6982 (Team 1971)
[18:45:42] - User ID: 50B9E94C7A82C24D
[18:45:42] - Machine ID: 2
[18:45:42]
[18:45:42] Loaded queue successfully.
[18:45:42]
[18:45:42] - Autosending finished units... [February 7 18:45:42 UTC]
[18:45:42] + Processing work unit
[18:45:42] Trying to send all finished work units
[18:45:42] Core required: FahCore_a2.exe
[18:45:42] + No unsent completed units remaining.
[18:45:42] - Autosend completed
[18:45:42] Core found.
[18:45:42] - Using generic ./mpiexec
[18:45:42] Working on queue slot 09 [February 7 18:45:42 UTC]
[18:45:42] + Working ...
[18:45:42] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 09 -checkpoint 15 -forceasm -verbose -lifeline 414 -version 620'

[18:45:42]
[18:45:42] *------------------------------*
[18:45:42] Folding@Home Gromacs SMP Core
[18:45:42] Version 2.01 (Wed Jul 16 08:26:53 PDT 2008)
[18:45:42]
[18:45:42] Preparing to commence simulation
[18:45:42] - Ensuring status. Please wait.
[18:45:52] - Assembly optimizations manually forced on.
[18:45:52] - Not checking prior termination.
[18:45:53] - Expanded 4834529 -> 24039989 (decompressed 497.2 percent)
[18:45:53] Called DecompressByteArray: compressed_data_size=4834529 data_size=24039989, decompressed_data_size=24039989 diff=0
[18:45:53] - Digital signature verified
[18:45:53]
[18:45:53] Project: 2673 (Run 8, Clone 15, Gen 82)
[18:45:53]
[18:45:54] Assembly optimizations on if available.
[18:45:54] Entering M.D.
[18:46:00] Will resume from checkpoint file
[18:46:00] Node 1 initialized
[18:46:03] Resuming from checkpoint
[18:46:03] Verified work/wudata_09.log
[18:46:04] Verified work/wudata_09.trr
[18:46:05] Verified work/wudata_09.xtc
[18:46:05] Verified work/wudata_09.edr
[18:46:05] Completed 230012 out of 23000002 steps  (1%)
[21:44:11] ***** Got a SIGTERM signal (15)
[21:44:11] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [February 8 00:52:15 UTC]


# Mac OS X SMP Console Edition ################################################
###############################################################################

                      Folding@Home Client Version 6.20

                         http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /Users/bulldogg6982/Library/InCrease/unit2
Executable: /Users/bulldogg6982/Library/InCrease/unit2/fah6
Arguments: -local -advmethods -forceasm -verbosity 9 -smp -smp 8

Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.

[00:52:15] - Ask before connecting: No
[00:52:15] - User name: Spikeanator6982 (Team 1971)
[00:52:15] - User ID: 50B9E94C7A82C24D
[00:52:15] - Machine ID: 2
[00:52:15]
[00:52:15] Loaded queue successfully.
[00:52:15]
[00:52:15] + Processing work unit
[00:52:15] Core required: FahCore_a2.exe
[00:52:15] - Autosending finished units... [February 8 00:52:15 UTC]
[00:52:15] Trying to send all finished work units
[00:52:15] + No unsent completed units remaining.
[00:52:15] - Autosend completed
[00:52:15] Core found.
[00:52:15] - Using generic ./mpiexec
[00:52:15] Working on queue slot 09 [February 8 00:52:15 UTC]
[00:52:15] + Working ...
[00:52:15] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 09 -checkpoint 15 -forceasm -verbose -lifeline 818 -version 620'

[00:52:15]
[00:52:15] *------------------------------*
[00:52:15] Folding@Home Gromacs SMP Core
[00:52:15] Version 2.01 (Wed Jul 16 08:26:53 PDT 2008)
[00:52:15]
[00:52:15] Preparing to commence simulation
[00:52:15] - Ensuring status. Please wait.
[00:52:25] - Assembly optimizations manually forced on.
[00:52:25] - Not checking prior termination.
[00:52:26] - Expanded 4834529 -> 24039989 (decompressed 497.2 percent)
[00:52:27] Called DecompressByteArray: compressed_data_size=4834529 data_size=24039989, decompressed_data_size=24039989 diff=0
[00:52:27] - Digital signature verified
[00:52:27]
[00:52:27] Project: 2673 (Run 8, Clone 15, Gen 82)
[00:52:27]
[00:52:27] Assembly optimizations on if available.
[00:52:27] Entering M.D.
[00:52:33] Will resume from checkpoint file
[00:52:34] Node 3 initialized
[00:52:36] Resuming from checkpoint
[00:52:36] Verified work/wudata_09.log
[00:52:37] Verified work/wudata_09.trr
[00:52:38] Verified work/wudata_09.xtc
[00:52:38] Verified work/wudata_09.edr
[00:52:38] Completed 230022 out of 23000002 steps  (1%)
[06:52:16] - Autosending finished units... [February 8 06:52:16 UTC]
[06:52:16] Trying to send all finished work units
[06:52:16] + No unsent completed units remaining.
[06:52:16] - Autosend completed
[12:52:17] - Autosending finished units... [February 8 12:52:17 UTC]
[12:52:17] Trying to send all finished work units
[12:52:17] + No unsent completed units remaining.
[12:52:17] - Autosend completed
[13:56:02] Completed 460002 out of 23000002 steps  (2%)
[18:52:18] - Autosending finished units... [February 8 18:52:18 UTC]
[18:52:18] Trying to send all finished work units
[18:52:18] + No unsent completed units remaining.
[18:52:18] - Autosend completed
[00:52:20] - Autosending finished units... [February 9 00:52:20 UTC]
[00:52:20] Trying to send all finished work units
[00:52:20] + No unsent completed units remaining.
[00:52:20] - Autosend completed
[02:22:51] Completed 690002 out of 23000002 steps  (3%)
toTOW
Site Moderator
Posts: 6433
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 2673 (Run 8, Clone 15, Gen 82) too big & slow

Post by toTOW »

That's an old core ... I think it's time to upgrade to 2.04 ...
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
bollix47
Posts: 2976
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project: 2673 (Run 8, Clone 15, Gen 82) too big & slow

Post by bollix47 »

I don't think there are any WUs with 23,000,000 steps.

Sooo I would say that's a bad WU and should be deleted (queue.dat, unininfo.txt, and work folder contents).

There have been others like this and Peter has pulled them.

And yes, delete your a2 core so you get the newer 2.04 which has a 'fix' in it that's supposed to stop the generating of these long WUs.
Image
susato
Site Moderator
Posts: 511
Joined: Fri Nov 30, 2007 4:57 am
Location: Team MacOSX
Contact:

Re: Project: 2673 (Run 8, Clone 15, Gen 82) too big & slow

Post by susato »

Yes, the WU was reported as bad, and the user was advised to upgrade client and core. Good eye, bollix.
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: Project: 2673 (Run 8, Clone 15, Gen 82) too big & slow

Post by kasson »

Thanks--I stopped it.
Post Reply