Project: 11179 (Run 0, Clone 244, Gen 32)

Posted: Fri Feb 18, 2011 2:41 am
by Slash_2CPU
[20:37:08] + Processing work unit
[20:37:08] Core required: FahCore_15.exe
[20:37:08] Core found.
[20:37:08] Working on queue slot 02 [February 17 20:37:08 UTC]
[20:37:08] + Working ...
[20:37:08]
[20:37:08] *------------------------------*
[20:37:08] Folding@Home GPU Core
[20:37:08] Version 2.15 (Tue Nov 16 08:44:57 PST 2010)
[20:37:08]
[20:37:08] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.42 for 80x86
[20:37:08] Build host: amoeba
[20:37:08] Board Type: NVIDIA/CUDA
[20:37:08] Core : x=15
[20:37:08] Window's signal control handler registered.
[20:37:08] Preparing to commence simulation
[20:37:08] - Looking at optimizations...
[20:37:08] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[20:37:08] - Created dyn
[20:37:08] - Files status OK
[20:37:08] sizeof(CORE_PACKET_HDR) = 512 file=<>
[20:37:08] - Expanded 45489 -> 170279 (decompressed 374.3 percent)
[20:37:08] Called DecompressByteArray: compressed_data_size=45489 data_size=170279, decompressed_data_size=170279 diff=0
[20:37:08] - Digital signature verified
[20:37:08]
[20:37:08] Project: 11179 (Run 0, Clone 244, Gen 32)
[20:37:08]
[20:37:08] Assembly optimizations on if available.
[20:37:08] Entering M.D.
[20:37:10] Tpr hash work/wudata_02.tpr: 4095727081 1621339293 1181440448 1331908370 397717764
[20:37:10] Working on ALZHEIMER'S DISEASE AMYLOID
[20:37:10] Client config found, loading data.
[20:37:10] Starting GUI Server
[20:37:11] Setting checkpoint frequency: 500000
[20:37:11] Setting checkpoint frequency: 500000
[20:39:51] Completed 500000 out of 50000000 steps (1%).
[20:39:51] mdrun_gpu returned 52
[20:39:51] NANs detected on GPU
[20:39:51]
[20:39:51] Folding@home Core Shutdown: UNSTABLE_MACHINE
[20:39:54] CoreStatus = 7A (122)
[20:39:54] Sending work to server
[20:39:54] Project: 11179 (Run 0, Clone 244, Gen 32)
[20:39:54] - Read packet limit of 540015616... Set to 524286976.
[20:39:54] - Error: Could not get length of results file work/wuresults_02.dat
[20:39:54] - Error: Could not read unit 02 file. Removing from queue.
[20:39:54] Trying to send all finished work units
[20:39:54] + No unsent completed units remaining.
[20:39:54] - Preparing to get new work unit...
[20:39:54] Cleaning up work directory
[20:39:54] + Attempting to get work packet
[20:39:54] - Will indicate memory of 6141 MB
[20:39:54] Gpu type=2 species=30.
Is this a known problem, or should I check the card? This card has been in service for over a year, and this is the first time it has EUE'd to sleep. There are two GTX 285s in this box, and the other one is working OK (as is the SMP client), but it has not had one of these WUs.

And yes, I always run -verbosity 7.

Re: Project: 11179 (Run 0, Clone 244, Gen 32)

Posted: Fri Feb 18, 2011 6:05 pm
by Slash_2CPU
Anyone?

Re: Project: 11179 (Run 0, Clone 244, Gen 32)

Posted: Fri Feb 18, 2011 6:23 pm
by 7im
We're all on vacation this morning, sorry, you'll have to wait a bit longer. At the tone, please leave your message.

Re: Project: 11179 (Run 0, Clone 244, Gen 32)

Posted: Fri Feb 18, 2011 7:35 pm
by Slash_2CPU
lol.

./sarcasm
I expect instantaneous service for free, just like all the others. (insert the usual flaming, whining, and crying here)
./end sarcasm

It's still sending me these "bad" WUs.

Re: Project: 11179 (Run 0, Clone 244, Gen 32)

Posted: Fri Feb 18, 2011 7:56 pm
by 7im
Are you getting the same work unit over and over (11179 (Run 0, Clone 244, Gen 32)), or many different WUs from the same project 11179? If it's several WUs, post the PRCG numbers from one or two more so that a Mod can check them once they happen along. That will help to determine whether this is a bunch of bad WUs or a HW problem.
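
If it helps to answer that question from the log itself, here is a rough Python sketch that tallies UNSTABLE_MACHINE shutdowns per PRCG. It is not part of the client or any official tool. The function name `count_failed_prcgs` is made up for illustration, and it assumes the log uses the same "Project: ... (Run ..., Clone ..., Gen ...)" and "UNSTABLE_MACHINE" lines as the log posted above.

```python
import re
from collections import Counter

# Matches the "Project: P (Run R, Clone C, Gen G)" lines seen in the posted log.
PRCG_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")


def count_failed_prcgs(log_text):
    """Count UNSTABLE_MACHINE shutdowns per (project, run, clone, gen).

    Hypothetical helper: it attributes each shutdown to the most recent
    PRCG line that appeared before it in the log.
    """
    failures = Counter()
    current = None  # last PRCG seen so far
    for line in log_text.splitlines():
        match = PRCG_RE.search(line)
        if match:
            current = tuple(int(group) for group in match.groups())
        elif "UNSTABLE_MACHINE" in line and current is not None:
            failures[current] += 1
    return failures
```

Call it on the contents of the client log (for example `count_failed_prcgs(open(path).read())`, where `path` is wherever your log lives). If every failure lands on a single PRCG, that points toward one bad WU; failures spread across several PRCGs would look more like a HW problem.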

Re: Project: 11179 (Run 0, Clone 244, Gen 32)

Posted: Fri Feb 18, 2011 10:14 pm
by Slash_2CPU
It's all the same PRCG. I'm not that unlucky.

Got same project but a different "R" about an hour ago, and all is back up and running fine.

Re: Project: 11179 (Run 0, Clone 244, Gen 32)

Posted: Fri Feb 18, 2011 10:57 pm
by 7im
Just a bad work unit then. Good to hear it's back up and running.

Re: Project: 11179 (Run 0, Clone 244, Gen 32)

Posted: Sat Feb 19, 2011 12:27 am
by toTOW
There's no data for this WU in the stats DB yet :(

Re: Project: 11179 (Run 0, Clone 244, Gen 32)

Posted: Thu Mar 10, 2011 5:24 pm
by sortofageek
FYI, Project: 11179 (Run 0, Clone 244, Gen 32) was added to the stats database on 2011-03-09 01:08:16 for 0 points of credit. It is looking very much like a bad WU. The folder's name was different from your member name here.