Page 1 of 1

Project: 2671 (Run 8, Clone 95, Gen 76) CoreStatus = 1

Posted: Mon Aug 10, 2009 1:30 pm
by toTOW
I don't know what happened on this WU :( :

CoreStatus = 1 @ 100%

Code: Select all

[02:19:35] Project: 2671 (Run 8, Clone 95, Gen 76)
[02:19:35] 
[02:19:35] Entering M.D.
[02:19:41] Using Gromacs checkpoints
[02:19:43] Resuming from checkpoint
[02:19:43] Verified work/wudata_04.log
[02:19:43] Verified work/wudata_04.trr
[02:19:44] Verified work/wudata_04.xtc
[02:19:44] Verified work/wudata_04.edr
[02:19:44] Completed 135008 out of 250000 steps  (54%)
[...]
[06:33:37] Completed 250000 out of 250000 steps  (100%)
[06:33:42] CoreStatus = 1 (1)
[06:33:42] Sending work to server
[06:33:42] Project: 2671 (Run 8, Clone 95, Gen 76)
[06:33:42] - Error: Could not get length of results file work/wuresults_04.dat
[06:33:42] - Error: Could not read unit 04 file. Removing from queue.

Re: Project: 2671 (Run 8, Clone 95, Gen 76) CoreStatus = 1

Posted: Mon Aug 10, 2009 1:34 pm
by MtM
CORESTATUS=1 should be client-core communication error. Did the wuresult.dat file get created succesfully? Did the core's terminate properly?

Re: Project: 2671 (Run 8, Clone 95, Gen 76) CoreStatus = 1

Posted: Mon Aug 10, 2009 1:39 pm
by toTOW
Look at the end of the log ... it lost the WU ... :(

Re: Project: 2671 (Run 8, Clone 95, Gen 76) CoreStatus = 1

Posted: Mon Aug 10, 2009 1:43 pm
by MtM
Sorry you're right, lost it, and there are only five seconds between the 100% entry and the corestatus message, which indicates a> the client did not wait for core's to be terminated, which leads to suspect the core's are not at fault, b> if the core's terminated properly but the results file could not be written it's either the wu itself ( hopefully ) or a client bug?