Page 1 of 1
Project: 5014 (Run 0, Clone 437, Gen 95)
Posted: Tue Feb 03, 2009 2:13 pm
by mpapad
Code: Select all
[02:19:13] Project: 5014 (Run 0, Clone 437, Gen 95)
[02:19:13]
[02:19:13] Assembly optimizations on if available.
[02:19:13] Entering M.D.
[02:19:19] Working on 576 p5005_supervillin_e1
[02:19:20] Client config found, loading data.
[02:19:20] mdrun_gpu returned
[02:19:20] NANs detected on GPU
[02:19:20]
[02:19:20] Folding@home Core Shutdown: UNSTABLE_MACHINE
[02:19:23] CoreStatus = 7A (122)
[02:19:23] Sending work to server
[02:19:23] Project: 5014 (Run 0, Clone 437, Gen 95)
Bad WU?
Re: Project: 5014 (Run 0, Clone 437, Gen 95)
Posted: Tue Feb 03, 2009 4:19 pm
by toTOW
There's no data for this WU in the DB yet ...
Re: Project: 5014 (Run 0, Clone 437, Gen 95)
Posted: Fri Feb 06, 2009 4:53 am
by bruce
There's still no data in the db.
THe next few lines after the end of your FAHlog clip would be interesting. My guess is that after CoreStatus = 7A, the WU is deleted and the server gets no indication that anything is wrong -- just that you didn't return anything.
It generally looks like this.
Code: Select all
[..] CoreStatus = 7A (122)
[..] Sending work to server
[..] Project: xxxx (Run xx, Clone xx, Gen xx)
[..] - Read packet limit of 540015616... Set to 524286976.
[..] - Error: Could not get length of results file work/wuresults_05.dat
[..] - Error: Could not read unit 05 file. Removing from queue.
The real conclusion is that there was nothing to send so it was removed from the queue without uploading a report.
I'm going to mark that WU as bad.