Page 1 of 1

2675 (Run 3, Clone 111, Gen 132) - Slow - 17hrs for 1%

Posted: Mon Sep 14, 2009 2:18 pm
by parkut
This WU is defective, with too many steps. Killed it after 17 hours compute
on the first step, as it will never finish in time.

Code: Select all

[14:15:15] Folding@Home Gromacs SMP Core
[14:15:15] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[14:15:15] [14:15:15] Preparing to commence simulation
[14:15:15] - Ensuring status. Please wait.
[14:15:16] Called DecompressByteArray: compressed_data_size=4842538 data_size=23994061, decompressed_data_size=23994061 diff=0
[14:15:16] - Digital signature verified
[14:15:16] [14:15:16] Project: 2675 (Run 3, Clone 111, Gen 132)
[14:15:16] [14:15:16] Assembly optimizations on if available.
[14:15:16] Entering M.D.
[14:15:26] un 3, Clone 111, Gen 132)
[14:15:26] [14:15:26] Entering M.D.
[18:53:18] - Autosending finished units... [September 13 18:53:18 UTC]
[18:53:18] Trying to send all finished work units
[18:53:18] + No unsent completed units remaining.
[18:53:18] - Autosend completed
[00:53:17] - Autosending finished units... [September 14 00:53:17 UTC]
[00:53:17] Trying to send all finished work units
[00:53:17] + No unsent completed units remaining.
[00:53:17] - Autosend completed
[06:53:16] - Autosending finished units... [September 14 06:53:16 UTC]
[06:53:16] Trying to send all finished work units
[06:53:16] + No unsent completed units remaining.
[06:53:16] - Autosend completed
[07:28:32] ompleted 260001 out of 26000002 steps  (1%)
[12:53:15] - Autosending finished units... [September 14 12:53:15 UTC]
[12:53:15] Trying to send all finished work units
[12:53:15] + No unsent completed units remaining.
[12:53:15] - Autosend completed
[13:52:16] ***** Got a SIGTERM signal (15)
[13:52:16] Killing all core threads

Folding@Home Client Shutdown.

Re: 2675 (Run 3, Clone 111, Gen 132) - Slow - 17hrs for 1%

Posted: Wed Sep 30, 2009 12:27 pm
by parkut
Got it again

Code: Select all

model name	: Intel(R) Core(TM)2 CPU          4400  @ 2.00GHz
cpu MHz		: 1999.944
cache size	: 2048 KB
Memory: 1.96 GB physical, 1.94 GB virtual
...
Client Version 6.24R3  
Core: FahCore_a2.exe
Core Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
Current Work Unit
-----------------
Name: p2675_IBX in water
Tag: P2675R3C111G132
Download time: September 28 15:00:04
Due time: October 1 15:00:04
Progress: 1%  [__________]
...
Project: 2675 (Run 3, Clone 111, Gen 132) 1920.00 pts (0.708 pt/hr)
...
[07:31:02] - Autosend completed
[07:31:02] + No unsent completed units remaining.
[07:31:02] Trying to send all finished work units
[07:31:02] - Autosending finished units... [September 30 07:31:02 UTC]
[01:31:02] - Autosend completed
[01:31:02] + No unsent completed units remaining.
[01:31:02] Trying to send all finished work units
[01:31:02] - Autosending finished units... [September 30 01:31:02 UTC]
[19:31:02] - Autosend completed
[19:31:02] + No unsent completed units remaining.
[19:31:02] Trying to send all finished work units
[19:31:02] - Autosending finished units... [September 29 19:31:02 UTC]
[18:06:47] Completed 260001 out of 26000002 steps  (1%)
[13:31:02] - Autosend completed
[13:31:02] + No unsent completed units remaining.
[13:31:02] Trying to send all finished work units
[13:31:02] - Autosending finished units... [September 29 13:31:02 UTC]
[07:31:02] - Autosend completed
[07:31:02] + No unsent completed units remaining.
[07:31:02] Trying to send all finished work units
[07:31:02] - Autosending finished units... [September 29 07:31:02 UTC]
[01:31:03] - Autosend completed
[01:31:03] + No unsent completed units remaining.
[01:31:03] Trying to send all finished work units
[01:31:03] - Autosending finished units... [September 29 01:31:03 UTC]
[19:31:02] - Autosend completed
[19:31:02] + No unsent completed units remaining.
[19:31:02] Trying to send all finished work units
[19:31:02] - Autosending finished units... [September 28 19:31:02 UTC]
[15:00:25] Completed 0 out of 26000002 steps  (0%)
[15:00:15] Entering M.D.

Re: 2675 (Run 3, Clone 111, Gen 132) - Slow - 17hrs for 1%

Posted: Wed Sep 30, 2009 2:49 pm
by jrweiss
If it does the same thing, you may have to shut down the client, delete queue.dat and unitinfo files and /work folder, then restart the client. It may take up to 5 repeats to get a new WU.

26 Million steps?????

Posted: Tue Nov 03, 2009 3:58 am
by Miller855
I just got a 26 million step Wu, that means 1% is 260K steps, this 2Ghz dual core running OS 10.6 Snow Leopard takes about 30 hours to run a 250k to 300k Wu
that means this work unit will take about 3000 hours, that's 125 Days, ouch!

Image

Re: 26 Million steps?????

Posted: Tue Nov 03, 2009 4:23 am
by Miller855
as of 3:21 UTC, just 1 hour ago, here is what InCrease says

Image

Re: 26 Million steps?????

Posted: Tue Nov 03, 2009 6:30 am
by bollix47
Projects in this range usually show 2500000 steps so 26 million is going to be a bad WU. Reporting it in the Problems with a specific WU forum and then deleting it until you get a 'good' WU is the 'normal' procedure. It's obvious that it won't finish before the final deadline. :ewink:

gl

Re: 26 Million steps?????

Posted: Tue Nov 03, 2009 11:40 am
by kiore
Yep 2600002 that's a little specific don't you think..Got to be a bad one.
kiore.

project 2675, 3,111,132, 26 million steps

Posted: Tue Nov 03, 2009 10:10 pm
by Miller855
i just got i Wu with 26,000,002 steps 1% = 260k steps my iMac w/ a 2 Ghz, dual core intel chip and running OSX 10.6 Snow Leopard
takes 30 hours for a 250k to 300k step wu, so this will take about 3000 hours, about 125 days, ouch!

*** I first reported this in the Mac OS X Clients forum they directed me here ****

EDIT by Mod: All three topics merged.
This WU has been reported to the Pande Group.

Re: 2675 (Run 3, Clone 111, Gen 132) - Slow - 17hrs for 1%

Posted: Wed Nov 04, 2009 4:28 am
by Miller855
i stopped it but now i can't get anything to run it says missing work file, i don't understand
here is the screen shot

Image

Re: 2675 (Run 3, Clone 111, Gen 132) - Slow - 17hrs for 1%

Posted: Wed Nov 04, 2009 5:53 am
by Miller855
after reading other threads about problems getting wu's
i disabled folding@home, deleted my A2 core, unitinfo.txt, and queue.dat
reinstalled the software, and after 2 or 3 restarts it downloaded a new core and got a new wu
but all that took an hour, 5:11 UTC, but now i am back folding........

Re: 2675 (Run 3, Clone 111, Gen 132) - Slow - 17hrs for 1%

Posted: Wed Nov 11, 2009 11:08 pm
by Ragnar Dan

Code: Select all

[21:22:51] Project: 2675 (Run 3, Clone 111, Gen 132)
[21:22:51] 
[21:22:51] Assembly optimizations on if available.
[21:22:51] Entering M.D.
[21:22:57] Using Gromacs checkpoints
[21:23:00] Multi-core optimizations on
[21:23:05] Resuming from checkpoint
[21:23:05] Verified work/wudata_03.log
[21:23:05] Verified work/wudata_03.trr
[21:23:05] Verified work/wudata_03.xtc
[21:23:05] Verified work/wudata_03.edr
[21:23:07] Completed 6480 out of 26000002 steps  (0%)
I got one of these, and it appears to be taking too long, and after a restart to make sure it wasn't the machine (a total of about 2 hours 45 minutes folding time), I'm now about to dump it.

Re: 2675 (Run 3, Clone 111, Gen 132) - Slow - 17hrs for 1%

Posted: Sun Nov 15, 2009 6:41 am
by Ragnar Dan
I just got this same WU one more time on the machine where it came last time. This time it's been on the machine for 4 hours and 44 minutes and made no progress.

I thought the purpose of this subforum was to alert Stanford to the existence of such WU's, and that would cause them (after investigation) to remove them from the servers so we wouldn't see them again. ?