Page 1 of 1

Project: 6900 (Run 43, Clone 18, Gen 11)

Posted: Tue Feb 01, 2011 5:42 am
by Amaruk
This was a bit of a curiosity. The core seems to have died around 18:15 or so.

Code: Select all

[09:07:25] + Attempting to get work packet
[09:07:25] Passkey found
[09:07:25] - Will indicate memory of 12279 MB
[09:07:25] - Connecting to assignment server
[09:07:25] Connecting to http://assign.stanford.edu:8080/
[09:07:26] Posted data.
[09:07:26] Initial: ED82; - Successful: assigned to (130.237.232.141).
[09:07:26] + News From Folding@Home: Welcome to Folding@Home
[09:07:26] Loaded queue successfully.
[09:07:26] Sent data
[09:07:26] Connecting to http://130.237.232.141:8080/
[09:07:34] Posted data.
[09:07:34] Initial: 0000; - Receiving payload (expected size: 24867487)
[09:08:56] - Downloaded at ~296 kB/s
[09:08:56] - Averaged speed for that direction ~443 kB/s
[09:08:56] + Received work.
[09:08:57] Trying to send all finished work units
[09:08:57] + No unsent completed units remaining.
[09:08:57] + Closed connections
[09:08:57] 
[09:08:57] + Processing work unit
[09:08:57] Core required: FahCore_a3.exe
[09:08:57] Core found.
[09:08:57] Working on queue slot 09 [January 30 09:08:57 UTC]
[09:08:57] + Working ...
[09:08:57] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 09 -np 16 -checkpoint 15 -verbose -lifeline 184 -version 630'

[09:08:57] 
[09:08:57] *------------------------------*
[09:08:57] Folding@Home Gromacs SMP Core
[09:08:57] Version 2.22 (Mar 12, 2010)
[09:08:57] 
[09:08:57] Preparing to commence simulation
[09:08:57] - Looking at optimizations...
[09:08:57] - Created dyn
[09:08:57] - Files status OK
[09:09:01] - Expanded 24866975 -> 30796293 (decompressed 123.8 percent)
[09:09:01] Called DecompressByteArray: compressed_data_size=24866975 data_size=30796293, decompressed_data_size=30796293 diff=0
[09:09:01] - Digital signature verified
[09:09:01] 
[09:09:01] Project: 6900 (Run 43, Clone 18, Gen 11)
[09:09:01] 
[09:09:02] Assembly optimizations on if available.
[09:09:02] Entering M.D.
[09:09:10] Completed 0 out of 250000 steps  (0%)
[09:27:05] Completed 2500 out of 250000 steps  (1%)
[09:44:56] Completed 5000 out of 250000 steps  (2%)
[10:02:45] Completed 7500 out of 250000 steps  (3%)
[10:20:37] Completed 10000 out of 250000 steps  (4%)
[10:38:29] Completed 12500 out of 250000 steps  (5%)
[10:56:24] Completed 15000 out of 250000 steps  (6%)
[11:14:13] Completed 17500 out of 250000 steps  (7%)
[11:32:04] Completed 20000 out of 250000 steps  (8%)
[11:50:02] Completed 22500 out of 250000 steps  (9%)
[12:08:01] Completed 25000 out of 250000 steps  (10%)
[12:26:01] Completed 27500 out of 250000 steps  (11%)
[12:43:47] Completed 30000 out of 250000 steps  (12%)
[13:01:37] Completed 32500 out of 250000 steps  (13%)
[13:19:28] Completed 35000 out of 250000 steps  (14%)
[13:37:22] Completed 37500 out of 250000 steps  (15%)
[13:55:18] Completed 40000 out of 250000 steps  (16%)
[14:13:05] Completed 42500 out of 250000 steps  (17%)
[14:31:10] Completed 45000 out of 250000 steps  (18%)
[14:37:30] - Autosending finished units... [January 30 14:37:30 UTC]
[14:37:30] Trying to send all finished work units
[14:37:30] + No unsent completed units remaining.
[14:37:30] - Autosend completed
[14:49:09] Completed 47500 out of 250000 steps  (19%)
[15:07:10] Completed 50000 out of 250000 steps  (20%)
[15:25:16] Completed 52500 out of 250000 steps  (21%)
[15:43:17] Completed 55000 out of 250000 steps  (22%)
[16:01:19] Completed 57500 out of 250000 steps  (23%)
[16:19:14] Completed 60000 out of 250000 steps  (24%)
[16:37:12] Completed 62500 out of 250000 steps  (25%)
[16:55:04] Completed 65000 out of 250000 steps  (26%)
[17:13:07] Completed 67500 out of 250000 steps  (27%)
[17:31:03] Completed 70000 out of 250000 steps  (28%)
[17:48:45] Completed 72500 out of 250000 steps  (29%)
[18:06:50] Completed 75000 out of 250000 steps  (30%)
[20:37:30] - Autosending finished units... [January 30 20:37:30 UTC]
[20:37:30] Trying to send all finished work units
[20:37:30] + No unsent completed units remaining.
[20:37:30] - Autosend completed
[02:37:30] - Autosending finished units... [January 31 02:37:30 UTC]
[02:37:30] Trying to send all finished work units
[02:37:30] + No unsent completed units remaining.
[02:37:30] - Autosend completed
[03:43:31] CoreStatus = C0000029 (-1073741783)
[03:43:31] Client-core communications error: ERROR 0xc0000029
[03:43:31] Deleting current work unit & continuing...
[03:43:38] Killing all core threads
[03:43:38] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[03:43:38] ***** Got a SIGTERM signal (2)
[03:43:38] Killing all core threads
[03:43:38] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.
No error messages at the time it seems to have stopped. In spite of [03:43:31] Deleting current work unit & continuing... on shutdown it picked up right where it left off.

Code: Select all


--- Opening Log file [January 31 03:44:03 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.30

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\Blitz\SMP 630
Executable: C:\Users\Blitz\SMP 630\Folding@home-Win32-x86.exe
Arguments: -smp -verbosity 9 -bigadv 

[03:44:03] - Ask before connecting: No
[03:44:03] - User name: Amaruk (Team 50625)
[03:44:03] - User ID: 7DE849C1119CCCE4
[03:44:03] - Machine ID: 1
[03:44:03] 
[03:44:03] Loaded queue successfully.
[03:44:03] 
[03:44:03] - Autosending finished units... [January 31 03:44:03 UTC]
[03:44:03] + Processing work unit
[03:44:03] Trying to send all finished work units
[03:44:03] Core required: FahCore_a3.exe
[03:44:03] + No unsent completed units remaining.
[03:44:03] Core found.
[03:44:03] - Autosend completed
[03:44:03] Working on queue slot 09 [January 31 03:44:03 UTC]
[03:44:03] + Working ...
[03:44:03] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 09 -np 16 -checkpoint 15 -verbose -lifeline 3228 -version 630'

[03:44:04] 
[03:44:04] *------------------------------*
[03:44:04] Folding@Home Gromacs SMP Core
[03:44:04] Version 2.22 (Mar 12, 2010)
[03:44:04] 
[03:44:04] Preparing to commence simulation
[03:44:04] - Looking at optimizations...
[03:44:04] - Files status OK
[03:44:08] - Expanded 24866975 -> 30796293 (decompressed 123.8 percent)
[03:44:08] Called DecompressByteArray: compressed_data_size=24866975 data_size=30796293, decompressed_data_size=30796293 diff=0
[03:44:08] - Digital signature verified
[03:44:08] 
[03:44:08] Project: 6900 (Run 43, Clone 18, Gen 11)
[03:44:08] 
[03:44:09] Assembly optimizations on if available.
[03:44:09] Entering M.D.
[03:44:15] Using Gromacs checkpoints
[03:44:22] Resuming from checkpoint
[03:44:22] Verified work/wudata_09.log
[03:44:22] Verified work/wudata_09.trr
[03:44:22] Verified work/wudata_09.xtc
[03:44:22] Verified work/wudata_09.edr
[03:44:23] Completed 75325 out of 250000 steps  (30%)
[03:59:43] Completed 77500 out of 250000 steps  (31%)
[04:17:11] Completed 80000 out of 250000 steps  (32%)
[04:34:43] Completed 82500 out of 250000 steps  (33%)
[04:52:15] Completed 85000 out of 250000 steps  (34%)
[05:09:43] Completed 87500 out of 250000 steps  (35%)
[05:27:17] Completed 90000 out of 250000 steps  (36%)
[05:44:52] Completed 92500 out of 250000 steps  (37%)
[06:02:47] Completed 95000 out of 250000 steps  (38%)
[06:20:22] Completed 97500 out of 250000 steps  (39%)
[06:37:55] Completed 100000 out of 250000 steps  (40%)
[06:55:27] Completed 102500 out of 250000 steps  (41%)
[07:13:03] Completed 105000 out of 250000 steps  (42%)
[07:30:34] Completed 107500 out of 250000 steps  (43%)
[07:48:06] Completed 110000 out of 250000 steps  (44%)
[08:05:39] Completed 112500 out of 250000 steps  (45%)
[08:23:13] Completed 115000 out of 250000 steps  (46%)
[08:40:44] Completed 117500 out of 250000 steps  (47%)
[08:58:13] Completed 120000 out of 250000 steps  (48%)
[09:15:46] Completed 122500 out of 250000 steps  (49%)
[09:33:15] Completed 125000 out of 250000 steps  (50%)
[09:44:03] - Autosending finished units... [January 31 09:44:03 UTC]
[09:44:03] Trying to send all finished work units
[09:44:03] + No unsent completed units remaining.
[09:44:03] - Autosend completed
[09:50:48] Completed 127500 out of 250000 steps  (51%)
[10:08:20] Completed 130000 out of 250000 steps  (52%)
[10:25:56] Completed 132500 out of 250000 steps  (53%)
[10:43:30] Completed 135000 out of 250000 steps  (54%)
[11:01:07] Completed 137500 out of 250000 steps  (55%)
[11:18:41] Completed 140000 out of 250000 steps  (56%)
[11:36:17] Completed 142500 out of 250000 steps  (57%)
[11:53:52] Completed 145000 out of 250000 steps  (58%)
[12:11:24] Completed 147500 out of 250000 steps  (59%)
[12:28:56] Completed 150000 out of 250000 steps  (60%)
[12:46:37] Completed 152500 out of 250000 steps  (61%)
[13:04:08] Completed 155000 out of 250000 steps  (62%)
[13:21:36] Completed 157500 out of 250000 steps  (63%)
[13:39:11] Completed 160000 out of 250000 steps  (64%)
[13:56:46] Completed 162500 out of 250000 steps  (65%)
[14:14:23] Completed 165000 out of 250000 steps  (66%)
[14:31:55] Completed 167500 out of 250000 steps  (67%)
[14:49:28] Completed 170000 out of 250000 steps  (68%)
[15:07:01] Completed 172500 out of 250000 steps  (69%)
[15:24:34] Completed 175000 out of 250000 steps  (70%)
[15:42:03] Completed 177500 out of 250000 steps  (71%)
[15:44:03] - Autosending finished units... [January 31 15:44:03 UTC]
[15:44:03] Trying to send all finished work units
[15:44:03] + No unsent completed units remaining.
[15:44:03] - Autosend completed
[15:59:31] Completed 180000 out of 250000 steps  (72%)
[16:17:06] Completed 182500 out of 250000 steps  (73%)
[16:34:36] Completed 185000 out of 250000 steps  (74%)
[16:52:08] Completed 187500 out of 250000 steps  (75%)
[17:09:42] Completed 190000 out of 250000 steps  (76%)
[17:27:16] Completed 192500 out of 250000 steps  (77%)
[17:44:52] Completed 195000 out of 250000 steps  (78%)
[18:02:21] Completed 197500 out of 250000 steps  (79%)
[18:19:51] Completed 200000 out of 250000 steps  (80%)
[18:37:25] Completed 202500 out of 250000 steps  (81%)
[18:54:58] Completed 205000 out of 250000 steps  (82%)
[19:12:30] Completed 207500 out of 250000 steps  (83%)
[19:30:04] Completed 210000 out of 250000 steps  (84%)
[19:47:36] Completed 212500 out of 250000 steps  (85%)
[20:05:07] Completed 215000 out of 250000 steps  (86%)
[20:22:37] Completed 217500 out of 250000 steps  (87%)
[20:40:05] Completed 220000 out of 250000 steps  (88%)
[20:57:31] Completed 222500 out of 250000 steps  (89%)
[21:15:09] Completed 225000 out of 250000 steps  (90%)
[21:32:38] Completed 227500 out of 250000 steps  (91%)
[21:44:03] - Autosending finished units... [January 31 21:44:03 UTC]
[21:44:03] Trying to send all finished work units
[21:44:03] + No unsent completed units remaining.
[21:44:03] - Autosend completed
[21:50:11] Completed 230000 out of 250000 steps  (92%)
[22:07:44] Completed 232500 out of 250000 steps  (93%)
[22:25:13] Completed 235000 out of 250000 steps  (94%)
[22:42:41] Completed 237500 out of 250000 steps  (95%)
[23:00:14] Completed 240000 out of 250000 steps  (96%)
[23:17:46] Completed 242500 out of 250000 steps  (97%)
[23:35:13] Completed 245000 out of 250000 steps  (98%)
[23:52:41] Completed 247500 out of 250000 steps  (99%)
[00:10:19] Completed 250000 out of 250000 steps  (100%)
[00:10:33] DynamicWrapper: Finished Work Unit: sleep=10000
[00:10:43] 
[00:10:43] Finished Work Unit:
[00:10:43] - Reading up to 52713120 from "work/wudata_09.trr": Read 52713120
[00:10:44] trr file hash check passed.
[00:10:44] - Reading up to 47028968 from "work/wudata_09.xtc": Read 47028968
[00:10:44] xtc file hash check passed.
[00:10:44] edr file hash check passed.
[00:10:44] logfile size: 201306
[00:10:44] Leaving Run
[00:10:49] - Writing 100111334 bytes of core data to disk...
[00:10:50]   ... Done.
[00:11:20] - Shutting down core
[00:11:20] 
[00:11:20] Folding@home Core Shutdown: FINISHED_UNIT
[00:11:28] CoreStatus = 64 (100)
[00:11:28] Unit 9 finished with 73 percent of time to deadline remaining.
[00:11:28] Updated performance fraction: 0.787644
[00:11:28] Sending work to server
[00:11:28] Project: 6900 (Run 43, Clone 18, Gen 11)


[00:11:28] + Attempting to send results [February 1 00:11:28 UTC]
[00:11:28] - Reading file work/wuresults_09.dat from core
[00:11:28]   (Read 100111334 bytes from disk)
[00:11:28] Connecting to http://130.237.232.141:8080/
[00:18:32] Posted data.
[00:18:32] Initial: 0000; - Uploaded at ~230 kB/s
[00:18:33] - Averaged speed for that direction ~228 kB/s
[00:18:33] + Results successfully sent
[00:18:33] Thank you for your contribution to Folding@Home.
[00:18:33] + Number of Units Completed: 259
Uploaded OK, points credited, everything seems normal except for that 'pause'... :?

Re: Project: 6900 (Run 43, Clone 18, Gen 11)

Posted: Tue Feb 01, 2011 9:10 pm
by toTOW
Maybe a hardware instability ?

Some else has also been able to complete it fine, so it's not a WU problem.

Re: Project: 6900 (Run 43, Clone 18, Gen 11)

Posted: Sun Feb 06, 2011 5:20 am
by Amaruk
This machine has been very reliable, but I did consider the possibility of hardware (specifically memory) instability.

Took it offline this weekend to run Memtest for 24 hours (12 passes) - no errors.

Image

I do find it interesting the WU was issued more than once. My understanding is this happens if there is some issue with either the WU itself or the results.