10117 (Run 719, Clone 1, Gen 7)
Posted: Thu Mar 10, 2011 2:46 pm
This one's given me fits. I can't rule out machine instability as the WU failed at a different point 12 times in a row. Wiping Work dir and queue.dat didn't cause a different WU to be downloaded, but adding -advmethods did the trick. This is on a GPU that has completed 8000+ WUs, but instability has to start somewhere. I did a clean install of the drivers, 266.58 in the middle of the failures. If the WU has been completed by someone else, at least I'll know to change cards.
[23:39:04] Completed 700000 out of 10000000 steps (7%).
[23:39:05] mdrun_gpu returned 52
[23:39:05] NANs detected on GPU
[23:39:05]
[23:39:05] Folding@home Core Shutdown: UNSTABLE_MACHINE
[23:39:07] CoreStatus = 7A (122)
[23:39:04] Completed 700000 out of 10000000 steps (7%).
[23:39:05] mdrun_gpu returned 52
[23:39:05] NANs detected on GPU
[23:39:05]
[23:39:05] Folding@home Core Shutdown: UNSTABLE_MACHINE
[23:39:07] CoreStatus = 7A (122)