Project: 2671 (Run 2, Clone 22, Gen 76)

Moderators: Site Moderators, FAHC Science Team

Post Reply
Amaruk
Posts: 254
Joined: Fri Jun 20, 2008 3:57 am
Location: Watching from the Woods

Project: 2671 (Run 2, Clone 22, Gen 76)

Post by Amaruk »

It seems I'm running this one twice.

Here is end of the previous log...

Code: Select all

[05:44:19] + No unsent completed units remaining.
[05:44:19] - Preparing to get new work unit...
[05:44:19] + Attempting to get work packet
[05:44:19] - Will indicate memory of 3707 MB
[05:44:19] - Connecting to assignment server
[05:44:19] Connecting to http://assign.stanford.edu:8080/
[05:44:19] Posted data.
[05:44:19] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[05:44:19] + News From Folding@Home: Welcome to Folding@Home
[05:44:19] Loaded queue successfully.
[05:44:19] Connecting to http://171.67.108.24:8080/
[05:44:27] Posted data.
[05:44:27] Initial: 0000; - Receiving payload (expected size: 4844202)
[05:44:32] - Downloaded at ~946 kB/s
[05:44:32] - Averaged speed for that direction ~1021 kB/s
[05:44:32] + Received work.
[05:44:32] Trying to send all finished work units
[05:44:32] + No unsent completed units remaining.
[05:44:32] + Closed connections
[05:44:32] 
[05:44:32] + Processing work unit
[05:44:32] Core required: FahCore_a2.exe
[05:44:32] Core found.
[05:44:32] Working on Unit 04 [January 29 05:44:32]
[05:44:32] + Working ...
[05:44:32] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 17314 -version 602'
[05:44:32] 
[05:44:32] *------------------------------*
[05:44:32] Folding@Home Gromacs SMP Core
[05:44:32] Version 2.01 (Wed Aug 13 13:11:25 PDT 2008)
[05:44:32] 
[05:44:32] Preparing to commence simulation
[05:44:32] - Ensuring status. Please wait.
[05:44:33] Called DecompressByteArray: compressed_data_size=4843690 data_size=24022769, decompressed_data_size=24022769 diff=0
[05:44:33] - Digital signature verified
[05:44:33] 
[05:44:33] Project: 2671 (Run 2, Clone 22, Gen 76)
[05:44:33] 
[05:44:33] Assembly optimizations on if available.
[05:44:33] Entering M.D.
[05:44:42] (Run 2, Clone 22, Gen 76)
[05:44:42] 
[05:44:42] Entering M.D.
[05:57:52] Completed 5008 out of 250000 steps  (2%)
[06:04:22] Completed 7508 out of 250000 steps  (3%)
[06:10:52] Completed 10008 out of 250000 steps  (4%)
[06:17:22] Completed 12508 out of 250000 steps  (5%)
[06:23:51] Completed 15008 out of 250000 steps  (6%)
[06:30:22] Completed 17508 out of 250000 steps  (7%)
[06:36:52] Completed 20008 out of 250000 steps  (8%)
[06:43:22] Completed 22508 out of 250000 steps  (9%)
[06:49:52] Completed 25008 out of 250000 steps  (10%)
[06:56:22] Completed 27508 out of 250000 steps  (11%)
[07:02:52] Completed 30008 out of 250000 steps  (12%)
[07:09:22] Completed 32508 out of 250000 steps  (13%)
[07:15:52] Completed 35008 out of 250000 steps  (14%)
[07:22:22] Completed 37508 out of 250000 steps  (15%)
[07:28:51] Completed 40008 out of 250000 steps  (16%)
[07:35:21] Completed 42508 out of 250000 steps  (17%)
[07:41:51] Completed 45008 out of 250000 steps  (18%)
[07:48:21] Completed 47508 out of 250000 steps  (19%)
[07:54:51] Completed 50008 out of 250000 steps  (20%)
[08:01:21] Completed 52508 out of 250000 steps  (21%)
[08:07:50] Completed 55008 out of 250000 steps  (22%)
[08:14:20] Completed 57508 out of 250000 steps  (23%)
[08:20:48] Completed 60008 out of 250000 steps  (24%)
[08:27:18] Completed 62508 out of 250000 steps  (25%)
[08:33:48] Completed 65008 out of 250000 steps  (26%)
[08:40:18] Completed 67508 out of 250000 steps  (27%)
[08:46:48] Completed 70008 out of 250000 steps  (28%)
[08:53:17] Completed 72508 out of 250000 steps  (29%)
[08:59:47] Completed 75008 out of 250000 steps  (30%)
[09:06:16] Completed 77508 out of 250000 steps  (31%)
[09:12:46] Completed 80008 out of 250000 steps  (32%)
[09:19:15] Completed 82508 out of 250000 steps  (33%)
[09:25:44] Completed 85008 out of 250000 steps  (34%)
[09:32:13] Completed 87508 out of 250000 steps  (35%)
[09:38:42] Completed 90008 out of 250000 steps  (36%)
[09:45:11] Completed 92508 out of 250000 steps  (37%)
[09:51:40] Completed 95008 out of 250000 steps  (38%)
[09:58:09] Completed 97508 out of 250000 steps  (39%)
[10:04:39] Completed 100008 out of 250000 steps  (40%)
[10:11:07] Completed 102508 out of 250000 steps  (41%)
[10:17:36] Completed 105008 out of 250000 steps  (42%)
[10:24:05] Completed 107508 out of 250000 steps  (43%)
[10:30:34] Completed 110008 out of 250000 steps  (44%)
[10:37:02] Completed 112508 out of 250000 steps  (45%)
[10:43:32] Completed 115008 out of 250000 steps  (46%)
[10:46:35] - Autosending finished units...
[10:46:35] Trying to send all finished work units
[10:46:35] + No unsent completed units remaining.
[10:46:35] - Autosend completed
[10:50:00] Completed 117508 out of 250000 steps  (47%)
[10:56:29] Completed 120008 out of 250000 steps  (48%)
[11:02:57] Completed 122508 out of 250000 steps  (49%)
[11:09:26] Completed 125008 out of 250000 steps  (50%)
[11:15:55] Completed 127508 out of 250000 steps  (51%)
[11:22:24] Completed 130008 out of 250000 steps  (52%)
[11:28:53] Completed 132508 out of 250000 steps  (53%)
[11:35:23] Completed 135008 out of 250000 steps  (54%)
[11:41:51] Completed 137508 out of 250000 steps  (55%)
[11:48:20] Completed 140008 out of 250000 steps  (56%)
[11:54:49] Completed 142508 out of 250000 steps  (57%)
[12:01:18] Completed 145008 out of 250000 steps  (58%)
[12:07:47] Completed 147508 out of 250000 steps  (59%)
[12:14:16] Completed 150008 out of 250000 steps  (60%)
[12:20:45] Completed 152508 out of 250000 steps  (61%)
[12:27:14] Completed 155008 out of 250000 steps  (62%)
[12:33:45] Completed 157508 out of 250000 steps  (63%)
[12:40:14] Completed 160008 out of 250000 steps  (64%)
[12:46:44] Completed 162508 out of 250000 steps  (65%)
[12:53:14] Completed 165008 out of 250000 steps  (66%)
[12:59:43] Completed 167508 out of 250000 steps  (67%)
[13:06:12] Completed 170008 out of 250000 steps  (68%)
[13:12:41] Completed 172508 out of 250000 steps  (69%)
[13:19:11] Completed 175008 out of 250000 steps  (70%)
[13:25:41] Completed 177508 out of 250000 steps  (71%)
[13:32:11] Completed 180008 out of 250000 steps  (72%)
[13:38:40] Completed 182508 out of 250000 steps  (73%)
[13:45:09] Completed 185008 out of 250000 steps  (74%)
[13:51:39] Completed 187508 out of 250000 steps  (75%)
[13:58:08] Completed 190008 out of 250000 steps  (76%)
[14:04:38] Completed 192508 out of 250000 steps  (77%)
[14:11:08] Completed 195008 out of 250000 steps  (78%)
[14:17:37] Completed 197508 out of 250000 steps  (79%)
[14:24:06] Completed 200008 out of 250000 steps  (80%)
[14:30:35] Completed 202508 out of 250000 steps  (81%)
[14:37:04] Completed 205008 out of 250000 steps  (82%)
[14:43:32] Completed 207508 out of 250000 steps  (83%)
[14:50:02] Completed 210008 out of 250000 steps  (84%)
[14:56:31] Completed 212508 out of 250000 steps  (85%)
[15:03:00] Completed 215008 out of 250000 steps  (86%)
[15:09:30] Completed 217508 out of 250000 steps  (87%)
[15:15:59] Completed 220008 out of 250000 steps  (88%)
[15:22:29] Completed 222508 out of 250000 steps  (89%)
[15:28:59] Completed 225008 out of 250000 steps  (90%)
[15:35:29] Completed 227508 out of 250000 steps  (91%)
[15:41:59] Completed 230008 out of 250000 steps  (92%)
[15:48:28] Completed 232508 out of 250000 steps  (93%)
[15:54:59] Completed 235008 out of 250000 steps  (94%)
[16:01:29] Completed 237508 out of 250000 steps  (95%)
[16:07:58] Completed 240008 out of 250000 steps  (96%)
[16:14:27] Completed 242508 out of 250000 steps  (97%)
[16:20:56] Completed 245008 out of 250000 steps  (98%)
[16:27:26] Completed 247508 out of 250000 steps  (99%)
[16:34:56] 
[16:34:56] Finished Work Unit:
[16:34:56] - Reading up to 21170016 from "work/wudata_04.trr": Read 21170016
[16:34:56] trr file hash check passed.
[16:34:56] - Reading up to 27135720 from "work/wudata_04.xtc": Read 27135720
[16:34:57] xtc file hash check passed.
[16:34:57] edr file hash check passed.
[16:34:57] logfile size: 178754
[16:34:57] Leaving Run
[16:34:58] - Writing 48697354 bytes of core data to disk...
[16:34:58]   ... Done.
[16:35:02] - Shutting down core
[16:46:35] - Autosending finished units...
[16:46:35] Trying to send all finished work units
[16:46:35] + No unsent completed units remaining.
[16:46:35] - Autosend completed
[22:46:35] - Autosending finished units...
[22:46:35] Trying to send all finished work units
[22:46:35] + No unsent completed units remaining.
[22:46:35] - Autosend completed
...and here is the restart.

Code: Select all

--- Opening Log file [January 30 03:04:40] 


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02

                         http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/hope/folding
Executable: ./fah6
Arguments: -smp -verbosity 9

[03:04:40] - Ask before connecting: No
[03:04:40] - User name: Amaruk (Team 50625)
[03:04:40] - User ID: 967AB2251A7B278
[03:04:40] - Machine ID: 1
[03:04:40] 
[03:04:40] Loaded queue successfully.
[03:04:40] - Autosending finished units...
[03:04:40] Trying to send all finished work units
[03:04:40] + No unsent completed units remaining.
[03:04:40] - Autosend completed
[03:04:40] 
[03:04:40] + Processing work unit
[03:04:40] Core required: FahCore_a2.exe
[03:04:40] Core found.
[03:04:40] Working on Unit 04 [January 30 03:04:40]
[03:04:40] + Working ...
[03:04:40] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 5978 -version 602'
[03:04:40] 
[03:04:40] *------------------------------*
[03:04:40] Folding@Home Gromacs SMP Core
[03:04:40] Version 2.01 (Wed Aug 13 13:11:25 PDT 2008)
[03:04:40] 
[03:04:40] Preparing to commence simulation
[03:04:40] - Ensuring status. Please wait.
[03:04:50] - Looking at optimizations...
[03:04:50] - Working with standard loops on this execution.
[03:04:50] - Files status OK
[03:04:50] Need version 202
[03:04:50] Error: Work unit read from disk is invalid
[03:04:51] - Expanded 4843690 -> 24022769 (decompressed 495.9 percent)
[03:04:51] Called DecompressByteArray: compressed_data_size=4843690 data_size=24022769, decompressed_data_size=24022769 diff=0
[03:04:51] - Digital signature verified
[03:04:51] 
[03:04:51] Project: 2671 (Run 2, Clone 22, Gen 76)
[03:04:51] 
[03:04:51] Entering M.D.
[03:11:48] Completed 2508 out of 250000 steps  (1%)
Curious to see if this got turned in the first time.... :?
Image
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 2671 (Run 2, Clone 22, Gen 76)

Post by bruce »

No, it did not get turned in the first time. There is no message saying "Thank you...."

After the "leaving run" message there should be a "corestatus = ...." message and it's not there. For some reason the FahCore did not shut down causing your client to hang rather than complete the WU. This bug is listed in the known issues and is probably related to networking changes. You might want to use qfix to recover the first WU rather than process it again. viewtopic.php?p=1440

To avoid similar problems, be sure your IP address is not "automatic" (assigned by DHCP). Adding a logical loopback adapter also seems to help. If you're on a wireless connection, do not move out of range while SMP is running.
bollix47
Posts: 2976
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project: 2671 (Run 2, Clone 22, Gen 76)

Post by bollix47 »

It's possible you fell victim to the same problem some of us have experienced in the last couple days:

viewtopic.php?f=44&t=8195
Image
Amaruk
Posts: 254
Joined: Fri Jun 20, 2008 3:57 am
Location: Watching from the Woods

Re: Project: 2671 (Run 2, Clone 22, Gen 76)

Post by Amaruk »

bollix47, I think you're right.

Bruce, I did have some connectivity issues in December and the first week of January due to my ISP 'upgrading' their firmware to supposedly improve my service. This did not cause any problems with any of my folders, and outside of that I've had no issues whatsoever. Thanks for the qfix link - those instructions didn't work but I found another post by uncle_fungus that did (viewtopic.php?f=44&t=3889) and was able to return this WU. Or at least what was left of it... :wink:

Thanks for the help,

Amaruk
Image
Post Reply