Page 1 of 1

Project: 6503 (Run 3, Clone 90, Gen 70)

Posted: Mon Oct 25, 2010 6:33 am
by GreyWhiskers
My first post since I started folding back in June 2010 - my first serious WU error.

I've run into a buzzsaw on this WU. The problem is with my wife's old Toshiba laptop I've been folding with since June, and average one WU or so per day - close to 100 PPD. It's on essentially 24/7. This supplements the "heavy duty" folding on my HP Pavillion Pent IV with HT, 3.2 GHz and an overclocked HIS ATI HD4760 1GB mem, AGP bus which averages about 1300 PPD.

I encountered a common problem reported on the forum:
[05:40:55] Folding@home Core Shutdown: UNKNOWN_ERROR
[05:40:59] CoreStatus = 79 (121)
[05:40:59] Client-core communications error: ERROR 0x79
[05:40:59] This is a sign of more serious problems, shutting down.

I deleted everything in the work folder four times over the last hour or so, deleted the queue.dat file, and restarted the Folding@Home application. I seem to be assigned back to the same work unit, with the same results. I even deleted the core 78 app once, which the Folding@home app dutifully re-downloaded.

I've been running with the -advmethods flag for the last few weeks - this was one of the suggestions in earlier posts. I've never seen any new or different WUs because of it.

My HFM log shows that I've successfully processed nine 6503 projects - the earliest being on 25 June, the latest yesterday, Oct 23. None of them were with Run3/Clone90/Gen70.

I'll wait a while then try again to see if I might get a different WU.

Any suggestions?

Code: Select all

[05:40:32] + Processing work unit
[05:40:32] Core required: FahCore_78.exe
[05:40:32] Core found.
[05:40:32] - Autosending finished units... [October 25 05:40:32 UTC]
[05:40:32] Trying to send all finished work units
[05:40:32] + No unsent completed units remaining.
[05:40:32] - Autosend completed
[05:40:32] Working on queue slot 01 [October 25 05:40:32 UTC]
[05:40:32] + Working ...
[05:40:32] - Calling '.\FahCore_78.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 168 -version 623'

[05:40:32] 
[05:40:32] *------------------------------*
[05:40:32] Folding@Home Gromacs Core
[05:40:32] Version 1.90 (March 8, 2006)
[05:40:32] 
[05:40:32] Preparing to commence simulation
[05:40:32] - Looking at optimizations...
[05:40:32] - Created dyn
[05:40:32] - Files status OK
[05:40:32] 
[05:40:32] Folding@home Core Shutdown: MISSING_WORK_FILES
[05:40:37] CoreStatus = 74 (116)
[05:40:37] The core could not find the work files specified. Removing from queue
[05:40:37] Deleting current work unit & continuing...
[05:40:41] Trying to send all finished work units
[05:40:41] + No unsent completed units remaining.
[05:40:41] - Preparing to get new work unit...
[05:40:41] + Attempting to get work packet
[05:40:41] - Will indicate memory of 2039 MB
[05:40:41] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 13, Stepping: 8
[05:40:41] - Connecting to assignment server
[05:40:41] Connecting to http://assign.stanford.edu:8080/
[05:40:41] Posted data.
[05:40:41] Initial: 40AB; - Successful: assigned to (171.64.65.62).
[05:40:41] + News From Folding@Home: Welcome to Folding@Home
[05:40:41] Loaded queue successfully.
[05:40:41] Connecting to http://171.64.65.62:8080/
[05:40:42] Posted data.
[05:40:42] Initial: 0000; - Receiving payload (expected size: 512982)
[05:40:43] - Downloaded at ~500 kB/s
[05:40:43] - Averaged speed for that direction ~375 kB/s
[05:40:43] + Received work.
[05:40:43] + Closed connections

[05:40:48] + Processing work unit
[05:40:48] Core required: FahCore_78.exe
[05:40:48] Core found.
[05:40:48] Working on queue slot 02 [October 25 05:40:48 UTC]
[05:40:48] + Working ...
[05:40:48] - Calling '.\FahCore_78.exe -dir work/ -suffix 02 -checkpoint 15 -verbose -lifeline 168 -version 623'

[05:40:48] 
[05:40:48] *------------------------------*
[05:40:48] Folding@Home Gromacs Core
[05:40:48] Version 1.90 (March 8, 2006)
[05:40:48] 
[05:40:48] Preparing to commence simulation
[05:40:48] - Looking at optimizations...
[05:40:48] - Created dyn
[05:40:48] - Files status OK
[05:40:48] - Expanded 512470 -> 2522873 (decompressed 492.2 percent)
[05:40:48] - Starting from initial work packet
[05:40:48] 
[05:40:48] Project: 6503 (Run 3, Clone 90, Gen 70)
[05:40:48] 
[05:40:48] Assembly optimizations on if available.
[05:40:48] Entering M.D.
[05:40:55] Gromacs error.
[05:40:55] 
[05:40:55] Folding@home Core Shutdown: UNKNOWN_ERROR
[05:40:59] CoreStatus = 79 (121)
[05:40:59] Client-core communications error: ERROR 0x79
[05:40:59] This is a sign of more serious problems, shutting down.
GreyWhiskers (Saratoga, Ca, USA)

Re: Project: 6503 (Run 3, Clone 90, Gen 70)

Posted: Mon Oct 25, 2010 6:50 am
by GreyWhiskers
I looked back at the Work folder and found a new tidbit at the end more explicitly describing the "fatal error"
Fatal error: symtab get_symtab_handle 16783 not found

Code: Select all

Log file opened: nodeid 0, nnodes = 1, host = unknown, process = 5124

  Gromacs is Copyright (c) 1991-2003, University of Groningen, The Netherlands
        This inclusion of Gromacs code in the Folding@Home Core is under
        a special license (see http://folding.stanford.edu/gromacs.html)
         specially granted to Stanford by the copyright holders. If you
          are interested in using Gromacs, visit www.gromacs.org where
                you can download a free version of Gromacs under
         the terms of the GNU General Public License (GPL) as published
       by the Free Software Foundation; either version 2 of the License,
                     or (at your option) any later version.


++++++++ PLEASE CITE THE FOLLOWING REFERENCE ++++++++
E. Lindahl and B. Hess and D. van der Spoel
GROMACS 3.0: A package for molecular simulation and trajectory analysis
J. Mol. Mod. 7 (2001) pp. 306-317
-------- -------- --- Thank You --- -------- --------


++++++++ PLEASE CITE THE FOLLOWING REFERENCE ++++++++
H. J. C. Berendsen, D. van der Spoel and R. van Drunen
GROMACS: A message-passing parallel molecular dynamics implementation
Comp. Phys. Comm. 91 (1995) pp. 43-56
-------- -------- --- Thank You --- -------- --------

Fatal error: symtab get_symtab_handle 16783 not found
GreyWhiskers
GreyWhiskers wrote:My first post since I started folding back in June 2010 - my first serious WU error.

I've run into a buzzsaw on this WU. The problem is with my wife's old Toshiba laptop I've been folding with since June, and average one WU or so per day - close to 100 PPD. It's on essentially 24/7. This supplements the "heavy duty" folding on my HP Pavillion Pent IV with HT, 3.2 GHz and an overclocked HIS ATI HD4760 1GB mem, AGP bus which averages about 1300 PPD.

I encountered a common problem reported on the forum:
[05:40:55] Folding@home Core Shutdown: UNKNOWN_ERROR
[05:40:59] CoreStatus = 79 (121)
[05:40:59] Client-core communications error: ERROR 0x79
[05:40:59] This is a sign of more serious problems, shutting down.

I deleted everything in the work folder four times over the last hour or so, deleted the queue.dat file, and restarted the Folding@Home application. I seem to be assigned back to the same work unit, with the same results. I even deleted the core 78 app once, which the Folding@home app dutifully re-downloaded.

I've been running with the -advmethods flag for the last few weeks - this was one of the suggestions in earlier posts. I've never seen any new or different WUs because of it.

My HFM log shows that I've successfully processed nine 6503 projects - the earliest being on 25 June, the latest yesterday, Oct 23. None of them were with Run3/Clone90/Gen70.

I'll wait a while then try again to see if I might get a different WU.

Any suggestions?

Code: Select all

[05:40:32] + Processing work unit
[05:40:32] Core required: FahCore_78.exe
[05:40:32] Core found.
[05:40:32] - Autosending finished units... [October 25 05:40:32 UTC]
[05:40:32] Trying to send all finished work units
[05:40:32] + No unsent completed units remaining.
[05:40:32] - Autosend completed
[05:40:32] Working on queue slot 01 [October 25 05:40:32 UTC]
[05:40:32] + Working ...
[05:40:32] - Calling '.\FahCore_78.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 168 -version 623'

[05:40:32] 
[05:40:32] *------------------------------*
[05:40:32] Folding@Home Gromacs Core
[05:40:32] Version 1.90 (March 8, 2006)
[05:40:32] 
[05:40:32] Preparing to commence simulation
[05:40:32] - Looking at optimizations...
[05:40:32] - Created dyn
[05:40:32] - Files status OK
[05:40:32] 
[05:40:32] Folding@home Core Shutdown: MISSING_WORK_FILES
[05:40:37] CoreStatus = 74 (116)
[05:40:37] The core could not find the work files specified. Removing from queue
[05:40:37] Deleting current work unit & continuing...
[05:40:41] Trying to send all finished work units
[05:40:41] + No unsent completed units remaining.
[05:40:41] - Preparing to get new work unit...
[05:40:41] + Attempting to get work packet
[05:40:41] - Will indicate memory of 2039 MB
[05:40:41] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 13, Stepping: 8
[05:40:41] - Connecting to assignment server
[05:40:41] Connecting to http://assign.stanford.edu:8080/
[05:40:41] Posted data.
[05:40:41] Initial: 40AB; - Successful: assigned to (171.64.65.62).
[05:40:41] + News From Folding@Home: Welcome to Folding@Home
[05:40:41] Loaded queue successfully.
[05:40:41] Connecting to http://171.64.65.62:8080/
[05:40:42] Posted data.
[05:40:42] Initial: 0000; - Receiving payload (expected size: 512982)
[05:40:43] - Downloaded at ~500 kB/s
[05:40:43] - Averaged speed for that direction ~375 kB/s
[05:40:43] + Received work.
[05:40:43] + Closed connections

[05:40:48] + Processing work unit
[05:40:48] Core required: FahCore_78.exe
[05:40:48] Core found.
[05:40:48] Working on queue slot 02 [October 25 05:40:48 UTC]
[05:40:48] + Working ...
[05:40:48] - Calling '.\FahCore_78.exe -dir work/ -suffix 02 -checkpoint 15 -verbose -lifeline 168 -version 623'

[05:40:48] 
[05:40:48] *------------------------------*
[05:40:48] Folding@Home Gromacs Core
[05:40:48] Version 1.90 (March 8, 2006)
[05:40:48] 
[05:40:48] Preparing to commence simulation
[05:40:48] - Looking at optimizations...
[05:40:48] - Created dyn
[05:40:48] - Files status OK
[05:40:48] - Expanded 512470 -> 2522873 (decompressed 492.2 percent)
[05:40:48] - Starting from initial work packet
[05:40:48] 
[05:40:48] Project: 6503 (Run 3, Clone 90, Gen 70)
[05:40:48] 
[05:40:48] Assembly optimizations on if available.
[05:40:48] Entering M.D.
[05:40:55] Gromacs error.
[05:40:55] 
[05:40:55] Folding@home Core Shutdown: UNKNOWN_ERROR
[05:40:59] CoreStatus = 79 (121)
[05:40:59] Client-core communications error: ERROR 0x79
[05:40:59] This is a sign of more serious problems, shutting down.
GreyWhiskers (Saratoga, Ca, USA)

Re: Project: 6503 (Run 3, Clone 90, Gen 70)

Posted: Mon Oct 25, 2010 6:51 am
by Fireball0236
Just for the record, a new topic is made for each different PRCG (Project, Run, Clone, Gen). But I'm sure a moderator can split out your post, and this one into a topic of its own.

As for your question, delete the queue.dat file, the work folder, and change the Machine ID of your client. Then you should be assigned a different WU.


~ Fireball0236

Re: Project: 6503 (Run 3, Clone 254, Gen 53)

Posted: Mon Oct 25, 2010 7:39 am
by GreyWhiskers
Thanks, Fireball, for your gentle protocol reminder to a first-time poster and for the solution (for me at least) to repeatedly getting a hosed up WU.

The old laptop with a Dothan Pentium M 740 is now happily devoting every single spare cycle to the folding. That's always a good sign.

GreyWhiskers.

Re: Project: 6503 (Run 3, Clone 90, Gen 70)

Posted: Mon Oct 25, 2010 3:41 pm
by sortofageek
Thank you for your report and welcome to the site. :)

I checked the database, but I find no data back on Project: 6503 (Run 3, Clone 90, Gen 70) yet. I'll flag it and we'll let you know if it is a bad WU or if someone else manages to complete it.

Re: Project: 6503 (Run 3, Clone 90, Gen 70)

Posted: Tue Nov 02, 2010 2:02 am
by sortofageek
I have another report on that WU from a team mate.

Code: Select all

----------------------------------------------------------------

*------------------------------*
Folding@Home Gromacs Core
Version 1.90 (March 8, 2006)

Preparing to commence simulation
- Looking at optimizations...
- Created dyn
- Files status OK
- Expanded 512470 -> 2522873 (decompressed 492.2 percent)
- Starting from initial work packet

Project: 6503 (Run 3, Clone 90, Gen 70)

Assembly optimizations on if available.
Entering M.D.
Gromacs error.

Folding@home Core Shutdown: UNKNOWN_ERROR

--------------------------------------------------------------
A window opens with the follow ing messge:

"Folding@home has run into a serious error running the core, and will shutdown."

I don't think we're going to see anything in the database on this one. Nobody seems to be able to get that far, so ... The WU (P6503,R3,C90,G70) has been reported as a bad WU.