Project: 4461 (Run 640, Clone 0, Gen 25), Core: 78

Posted: Mon Jun 01, 2009 12:56 pm
by DrBB1
For the second time in a row, a WU has apparently completed successfully but has not been recorded. This time, at least, I have a record from the log. I noticed that it includes the line "goefile size: 0". I had a similar issue with a number of WUs last summer, so I have two questions:

1. What does "goefile size: 0" mean, and does it reflect a cause--or result--of the problem?
2. I believe most--if not all--of my WUs last summer were eventually recorded. Is the log entry below consistent with that happening again?

Thanks in advance for the help.

Code:

Slot 09  Empty/Deleted
Project: 4461 (Run 640, Clone 0, Gen 25), Core: 78
Work server: 171.67.108.13:8080
Collection server: 171.67.108.17
Download date: May 29 06:05:41
Finished date: May 31 10:51:38

Code:

--- Opening Log file [May 30 22:25:25] 


# Windows Graphical Edition ###################################################
###############################################################################

                       Folding@Home Client Version 5.03

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files\Folding@Home


[22:25:25] - Ask before connecting: No
[22:25:25] - User name: DrBB1 (Team 0324498)
[22:25:25] - User ID: 41257AD02312174
[22:25:25] - Machine ID: 1
[22:25:25] 
[22:25:25] Loaded queue successfully.
[22:25:28] Initialization complete
[22:25:28] + Benchmarking ...
[22:25:40] 
[22:25:40] + Processing work unit
[22:25:40] Core required: FahCore_78.exe
[22:25:40] Core found.
[22:25:40] Working on Unit 05 [May 30 22:25:40]
[22:25:40] + Working ...
[22:25:58] 
[22:25:58] *------------------------------*
[22:25:58] Folding@Home Gromacs Core
[22:25:58] Version 1.90 (March 8, 2006)
[22:25:58] 
[22:25:58] Preparing to commence simulation
[22:25:58] - Looking at optimizations...
[22:25:58] - Files status OK
[22:26:00] - Expanded 237763 -> 1167721 (decompressed 491.1 percent)
[22:26:01] 
[22:26:01] Project: 4441 (Run 291, Clone 4, Gen 20)
[22:26:01] 
[22:26:03] Assembly optimizations on if available.
[22:26:03] Entering M.D.
[22:26:26] (Starting from checkpoint)
[22:26:26] Protein: p4441_Seq42_Amber03
[22:26:26] 
[22:26:26] Writing local files
[22:28:38] Completed 1211312 out of 1500000 steps  (81)
[22:28:38] Extra 3DNow boost OK.
[22:58:09] Writing local files
[22:58:09] Completed 1215000 out of 1500000 steps  (81)
[23:07:46] + Screensaver:  engaging
[23:36:08] + Screensaver:  no longer idle, ending
[02:02:22] + Screensaver:  engaging
[02:07:10] Writing local files
[02:07:11] Completed 1230000 out of 1500000 steps  (82)
[03:12:44] Writing local files
[03:12:44] Completed 1245000 out of 1500000 steps  (83)
[04:19:24] Writing local files
[04:19:24] Completed 1260000 out of 1500000 steps  (84)
[04:25:40] + Working...
[05:29:11] Writing local files
[05:29:11] Completed 1275000 out of 1500000 steps  (85)
[08:03:52] Writing local files
[08:03:52] Completed 1290000 out of 1500000 steps  (86)
[10:23:21] Writing local files
[10:23:21] Completed 1305000 out of 1500000 steps  (87)
[10:25:40] + Working...
[11:32:33] Writing local files
[11:32:33] Completed 1320000 out of 1500000 steps  (88)
[12:41:32] Writing local files
[12:41:33] Completed 1335000 out of 1500000 steps  (89)
[13:50:32] Writing local files
[13:50:32] Completed 1350000 out of 1500000 steps  (90)
[14:59:36] Writing local files
[14:59:36] Completed 1365000 out of 1500000 steps  (91)
[16:08:41] Writing local files
[16:08:41] Completed 1380000 out of 1500000 steps  (92)
[16:25:40] + Working...
[17:17:42] Writing local files
[17:17:42] Completed 1395000 out of 1500000 steps  (93)
[18:26:45] Writing local files
[18:26:45] Completed 1410000 out of 1500000 steps  (94)
[19:35:38] Writing local files
[19:35:38] Completed 1425000 out of 1500000 steps  (95)
[20:44:24] Writing local files
[20:44:24] Completed 1440000 out of 1500000 steps  (96)
[21:53:21] Writing local files
[21:53:21] Completed 1455000 out of 1500000 steps  (97)
[22:25:40] + Working...
[23:02:17] Writing local files
[23:02:17] Completed 1470000 out of 1500000 steps  (98)
[00:11:16] Writing local files
[00:11:16] Completed 1485000 out of 1500000 steps  (99)
[01:20:15] Writing local files
[01:20:15] Completed 1500000 out of 1500000 steps  (100)
[01:20:15] Writing final coordinates.
[01:20:16] Past main M.D. loop
[01:21:16] 
[01:21:16] Finished Work Unit:
[01:21:16] - Reading up to 188856 from "work/wudata_05.arc": Read 188856
[01:21:16] - Reading up to 17868 from "work/wudata_05.xtc": Read 17868
[01:21:16] goefile size: 0
[01:21:16] logfile size: 26067
[01:21:16] Leaving Run
[01:21:16] - Writing 239795 bytes of core data to disk...
[01:21:17] Done: 239283 -> 209570 (compressed to 87.5 percent)
[01:21:17]   ... Done.
[01:21:17] - Shutting down core
[01:21:17] 
[01:21:17] Folding@home Core Shutdown: FINISHED_UNIT
[01:21:20] CoreStatus = 64 (100)
[01:21:20] Sending work to server


[01:21:20] + Attempting to send results
[01:21:23] + Results successfully sent
[01:21:23] Thank you for your contribution to Folding@Home.
[01:21:23] + Number of Units Completed: 190

Re: Project: 4461 (Run 640, Clone 0, Gen 25), Core: 78

Posted: Mon Jun 01, 2009 2:36 pm
by susato
Your FAHlog.txt says
[01:21:20] + Attempting to send results
[01:21:23] + Results successfully sent
[01:21:23] Thank you for your contribution to Folding@Home.
This means that the unit has been logged in at Stanford.

In the first code block you posted, the queue entry referred to
Project: 4461 (Run 640, Clone 0, Gen 25)
starting May 29 and ending May 31
which finished for 225 points on a P4 machine at 2009-05-31 04:09:27 (Stanford time) and was credited to team 39340.

but in the second code block the unit was
Project: 4441 (Run 291, Clone 4, Gen 20)
starting May 30 and ending today, June 1,
which finished for 225 points on an Athlon machine at 2009-05-31 20:10:41 (Stanford time) and was credited to team 324498.

They are two different work units. Perhaps you're not seeing the points because they are going to your other team?
Was the other "missing" unit this one: Project: 3863 (Run 366, Clone 9, Gen 7) which finished just after midnight on 5/29 for 604 points? That was also credited to the lower-numbered team.
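
To make the mismatch easy to spot, here is a minimal Python sketch that pulls the Project/Run/Clone/Gen identifier out of each pasted block and compares them. It is not part of the Folding@Home client; the function name `find_prcg` and the variable names are hypothetical, and the two input strings are copied from the queue entry and log above.

```python
import re

# Matches identifiers of the form "Project: 4461 (Run 640, Clone 0, Gen 25)".
PRCG = re.compile(
    r"Project:\s*(\d+)\s*\(Run\s*(\d+),\s*Clone\s*(\d+),\s*Gen\s*(\d+)\)"
)

def find_prcg(text):
    """Return every (project, run, clone, gen) tuple found in text."""
    return [tuple(int(g) for g in m.groups()) for m in PRCG.finditer(text)]

# Lines taken from the two code blocks in the first post.
queue_entry = "Project: 4461 (Run 640, Clone 0, Gen 25), Core: 78"
log_line = "[22:26:01] Project: 4441 (Run 291, Clone 4, Gen 20)"

queue_wu = find_prcg(queue_entry)[0]
log_wu = find_prcg(log_line)[0]

# The two excerpts describe the same work unit only if all four numbers match.
same_unit = queue_wu == log_wu
print("same work unit" if same_unit else "different work units")
```

The same function can be run over a whole FAHlog.txt to list every unit the log mentions before comparing it against a queue dump.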

Re: Project: 4461 (Run 640, Clone 0, Gen 25), Core: 78

Posted: Mon Jun 01, 2009 8:03 pm
by bruce
DrBB1 wrote:What does "goefile size: 0" mean, and does it reflect a cause--or result--of the problem?
As far as I know, this message is neither. It's a cosmetic issue that can safely be ignored.

Re: Project: 4461 (Run 640, Clone 0, Gen 25), Core: 78

Posted: Mon Jun 01, 2009 9:40 pm
by DrBB1
bruce,

Thanks for the clarification. I only seem to notice it when there is a problem, so I apparently incorrectly assumed it was related to a problem.

susato,

Hmm...I'll need to look into this further when I get home. I may have included the wrong log information, though I fold only for team 39340, and always have. I'll see if the client was accidentally changed to include the wrong team number. Could that explain why I'm not getting credit? Even if I had changed teams, wouldn't I still get credit for the WU?

Re: Project: 4461 (Run 640, Clone 0, Gen 25), Core: 78

Posted: Tue Jun 02, 2009 12:27 am
by susato
Yes, I think it's an error.
Navigate to http://fah-web.stanford.edu/cgi-bin/mai ... =userstats and type in your username. It shows you folding for two teams, the second of which hasn't even been established yet. Check your client.cfg (open with a text editor) to see which machine (or machines) are folding for the wrong team. Then stop Folding (ideally 4-5 minutes after a frame end early in a work unit, or between work units) and run it with the -configonly flag to change the team number on that client. When you restart it with the customary flags it will pick up where it left off and all your points will be going to the correct team.
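
If you have several machines to check, the client.cfg inspection can be scripted. The following is a rough Python sketch only. It assumes client.cfg is an INI-style file with a `[settings]` section containing a `team=` key, so open your own file in a text editor first to confirm the layout. The function name `team_from_cfg`, the sample text, and `EXPECTED_TEAM` are hypothetical.

```python
import configparser

def team_from_cfg(cfg_text):
    """Return the team value from INI-style config text, or None if absent.

    Assumes a [settings] section with a team= key (an assumption about
    the client.cfg layout, not something confirmed in this thread).
    """
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(cfg_text)
    return parser.get("settings", "team", fallback=None)

# Hypothetical file contents for illustration only.
sample = """[settings]
username=DrBB1
team=324498
"""

EXPECTED_TEAM = "39340"  # the team you intend to fold for
found = team_from_cfg(sample)
if found != EXPECTED_TEAM:
    print(f"client.cfg has team {found}, expected {EXPECTED_TEAM}")
```

For a real check, pass in the contents of each machine's client.cfg instead of `sample`. The script only reports the value; the change itself should still be made through -configonly as described above.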

One thing I noticed about that machine - on the Team 324498 account you have under 800 points but nearly 1100 WU. Is that machine chronically unstable, or did it have a run of EUEs that is now over? Do keep an eye on that machine - check the queue now and then, run a memory test if you have doubts, and post back here if you see runs of deleted units.

Re: Project: 4461 (Run 640, Clone 0, Gen 25), Core: 78

Posted: Tue Jun 02, 2009 12:50 pm
by DrBB1
Hi susato,

Indeed, my secondary client had the team number changed. I have changed it back and everything should be properly credited (I hope) moving forward. Is there any way to merge the 700+ points back into my team 39340 account?

Also, re:

susato wrote:One thing I noticed about that machine - on the Team 324498 account you have under 800 points but nearly 1100 WU. Is that machine chronically unstable, or did it have a run of EUE's that is now over? Do keep an eye on that machine - check the queue now and then, run a memory test if you have doubts, and post back here if you see runs of deleted units.
See thread viewtopic.php?f=19&t=10113. That problem seems to have resolved itself. My fahlog-prev.txt file got overwritten because the log got so large so fast. However, I am now wondering whether it is possible that problem was caused by the change in team number. What do you think?

Anyway, thanks. If problems continue, I'll post.

Re: Project: 4461 (Run 640, Clone 0, Gen 25), Core: 78

Posted: Tue Jun 02, 2009 3:48 pm
by susato
The problems you had should not be related to the use of a non-existent Team number. That doesn't seem to matter at all.

Thanks for the link to your other topic. I read that one too, when you first posted it, but couldn't improve on Bruce's advice. I'm glad the problem has resolved. Of course, let us know if it happens again.

Sorry, once points are allocated to a certain user or team, they cannot be reallocated. On the bright side, if you ever want to start a team of your own, you know which number to use! :wink: