Project: 2686 (Run 3, Clone 7, Gen 26)

Moderators: Site Moderators, FAHC Science Team

Tobit
Posts: 342
Joined: Thu Apr 17, 2008 2:35 pm
Location: Manchester, NH USA

Post by Tobit »

(Posted at the request of sortofageek for possible escalation)

This is from a teammate, F@H username = Apollo

Something very odd happened with this WU. Logging stopped at 11% when an already completed WU waiting in the queue was uploaded to the server. After the upload completed, the A3 core crashed (Windows generated an error message), and the user terminated the client in order to restart it. On restart, the A3 core crashed again at 11%, and the cycle repeated until the third time, when the WU became corrupt and a new WU was requested. Unfortunately, because the core was crashing, much of this was not logged properly, and the following is pretty much the only log activity I have.

Edit: System Info: Dual quad-core Xeon (Harpertown) server, Win7 64-bit, 4 GB memory

Code:

[16:22:26] Working on queue slot 06 [October 8 16:22:26 UTC]
[16:22:26] + Working ...
[16:22:26] 
[16:22:26] - Couldn't send HTTP request to server
[16:22:26] *------------------------------*
[16:22:26]   (Got status 503)
[16:22:26] Folding@Home Gromacs SMP Core
[16:22:26] + Could not connect to Work Server (results)
[16:22:26] Version 2.22 (Mar 12, 2010)
[16:22:26]     (171.64.65.56:8080)
[16:22:26] 
[16:22:26] + Retrying using alternative port
[16:22:26] Preparing to commence simulation
[16:22:26] - Couldn't send HTTP request to server
[16:22:26] - Ensuring status. Please wait.
[16:22:26]   (Got status 503)
[16:22:27] + Could not connect to Work Server (results)
[16:22:27]     (171.64.65.56:80)
[16:22:27] - Error: Could not transmit unit 05 (completed October 8) to work server.

[16:22:27] + Attempting to send results [October 8 16:22:27 UTC]
[16:22:36] - Assembly optimizations manually forced on.
[16:22:36] - Not checking prior termination.
[16:22:45] - Expanded 25463101 -> 31941441 (decompressed 125.4 percent)
[16:22:45] Called DecompressByteArray: compressed_data_size=25463101 data_size=31941441, decompressed_data_size=31941441 diff=0
[16:22:45] - Digital signature verified
[16:22:45] 
[16:22:45] Project: 2686 (Run 3, Clone 7, Gen 26)
[16:22:45] 
[16:22:46] Assembly optimizations on if available.
[16:22:46] Entering M.D.
[16:22:52] - Couldn't send HTTP request to server
[16:22:52] + Could not connect to Work Server (results)
[16:22:52]     (171.67.108.25:8080)
[16:22:52] + Retrying using alternative port
[16:22:52] Using Gromacs checkpoints
[16:23:00] Resuming from checkpoint
[16:23:00] Verified work/wudata_06.log
[16:23:00] Verified work/wudata_06.trr
[16:23:00] Verified work/wudata_06.xtc
[16:23:00] Verified work/wudata_06.edr
[16:23:02] Completed 7700 out of 250000 steps  (3%)
[16:23:12] - Couldn't send HTTP request to server
[16:23:12] + Could not connect to Work Server (results)
[16:23:12]     (171.67.108.25:80)
[16:23:12]   Could not transmit unit 05 to Collection server; keeping in queue.
[17:00:27] Completed 10000 out of 250000 steps  (4%)
[17:41:50] Completed 12500 out of 250000 steps  (5%)
[18:22:40] Completed 15000 out of 250000 steps  (6%)
[19:03:58] Completed 17500 out of 250000 steps  (7%)
[19:43:54] Completed 20000 out of 250000 steps  (8%)
[20:23:22] Completed 22500 out of 250000 steps  (9%)
[21:02:47] Completed 25000 out of 250000 steps  (10%)
[21:42:16] Completed 27500 out of 250000 steps  (11%)
[22:23:13] + Attempting to send results [October 8 22:23:13 UTC]
[22:30:02] + Results successfully sent
[22:30:02] Thank you for your contribution to Folding@Home.
[22:30:03] + Number of Units Completed: 70

Folding@Home Client Shutdown at user request.

Folding@Home Client Shutdown.
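
For anyone wanting to compare progress timing across restarts, here is a minimal sketch that pulls the "Completed N out of M steps" lines out of a client log and reports the seconds between consecutive ones. The function name `progress_intervals` is hypothetical, and the line format is assumed to match the log above; this is not part of the F@H client.

```python
import re

# Matches client log lines such as:
# [17:00:27] Completed 10000 out of 250000 steps  (4%)
PROGRESS = re.compile(
    r"\[(\d{2}):(\d{2}):(\d{2})\] Completed (\d+) out of (\d+) steps"
)

def progress_intervals(lines):
    """Return (steps_done, seconds_since_previous_progress_line) pairs."""
    out = []
    prev = None
    for line in lines:
        m = PROGRESS.search(line)
        if not m:
            continue  # skip upload, error, and other non-progress lines
        h, mi, s, done, _total = map(int, m.groups())
        now = h * 3600 + mi * 60 + s
        if prev is not None:
            # Timestamps carry no date, so a backwards jump is treated
            # as a midnight rollover (assumption: gaps are under 24 h).
            out.append((done, (now - prev) % 86400))
        prev = now
    return out

# Three lines copied from the log above.
sample = [
    "[17:00:27] Completed 10000 out of 250000 steps  (4%)",
    "[17:41:50] Completed 12500 out of 250000 steps  (5%)",
    "[18:22:40] Completed 15000 out of 250000 steps  (6%)",
]
for done, secs in progress_intervals(sample):
    print(done, secs)
```

On the three sample lines this gives roughly 41 minutes per 2500 steps, which matches the spacing visible in the rest of the log.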
toTOW
Site Moderator
Posts: 6435
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: Project: 2686 (Run 3, Clone 7, Gen 26)

Post by toTOW »

No data in the DB for this WU yet ... :(

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix

Re: Project: 2686 (Run 3, Clone 7, Gen 26)

Post by sortofageek »

Thanks for posting this. We'll keep an eye on Project: 2686 (Run 3, Clone 7, Gen 26).

As toTOW said, no data has come back on this yet. Last night I was seeing something different, which made me think we might want to escalate. For now it just needs to be watched to see whether we need to mark it bad.
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix

Re: Project: 2686 (Run 3, Clone 7, Gen 26)

Post by sortofageek »

This WU was added to the stats database on 2010-10-14 18:09:38 for 73959.3 points of credit. The user name was not Apollo.