Project: 2665 (Run 3, Clone 9, Gen 53) long 1-4 NaN
Posted: Wed Oct 01, 2008 5:18 am
Looks like another bad WU. Can someone please check?
P.S. How many different clients is the same WU assigned to?
(A WU being defined by a unique combination of project number and (Run, Clone, Gen).)
[09:34:50] Project: 2665 (Run 3, Clone 9, Gen 53)
...
[02:13:26] Completed 122500 out of 250000 steps (49 percent)
[02:19:52] Warning: long 1-4 interactions
[02:19:53] Quit 101 - NaN detected: (ener[20])
[02:19:53]
[02:19:53] Simulation instability has been encountered. The run has entered a
[02:19:53] state from which no further progress can be made.
[02:19:53] This may be the correct result of the simulation, however if you
[02:19:53] often see other project units terminating early like this
[02:19:53] too, you may wish to check the stability of your computer (issues
[02:19:53] such as high temperature, overclocking, etc.).
[02:19:53] Going to send back what have done.
[02:19:53] logfile size: 100774
[02:19:53] - Writing 101324 bytes of core data to disk...
[02:19:53] ... Done.
[02:19:53] - Failed to delete work/wudata_03.arc
[02:19:53] Warning: check for stray files
[02:21:53]
[02:21:53] Folding@home Core Shutdown: EARLY_UNIT_END
[02:21:53]
[02:21:53] Folding@home Core Shutdown: EARLY_UNIT_END
[02:21:55] CoreStatus = 7B (123)
[02:21:55] Client-core communications error: ERROR 0x7b
[02:21:55] Deleting current work unit & continuing...
[02:21:55] Using generic mpiexec calls
[02:23:58] - Warning: Could not delete all work unit files (3): Core returned invalid code
[02:23:58] Trying to send all finished work units
[02:23:58] + No unsent completed units remaining.
Full log:
Note: Please read the license agreement (Folding@home-Win32-x86.exe -license). Further
use of this software requires that you have read and accepted this agreement.
4 cores detected
If you see this twice, MPI is working
If you see this twice, MPI is working
--- Opening Log file [September 30 09:34:16 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.22 SMP Beta2r3
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\FahSmp2ndClient
Executable: C:\FahSmp2ndClient\Folding@home-Win32-x86.exe
Arguments: -smp -verbosity 9
[09:34:16] - Ask before connecting: No
[09:34:16] - User name: [EV]Solar (Team 104636)
[09:34:16] - User ID: xxxxx
[09:34:16] - Machine ID: 3
[09:34:16]
[09:34:16] Loaded queue successfully.
[09:34:16]
[09:34:16] - Autosending finished units... [September 30 09:34:16 UTC]
[09:34:16] Trying to send all finished work units
[09:34:16] + No unsent completed units remaining.
[09:34:16] + Processing work unit
[09:34:16] - Autosend completed
[09:34:16] Work type a1 not eligible for variable processors
[09:34:16] Core required: FahCore_a1.exe
[09:34:16] Core found.
[09:34:16] Using generic mpiexec calls
[09:34:16] Working on queue slot 03 [September 30 09:34:16 UTC]
[09:34:16] + Working ...
[09:34:16] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 03 -checkpoint 30 -verbose -lifeline 5660 -version 622'
[09:34:16]
[09:34:16] *------------------------------*
[09:34:16] Folding@Home Gromacs SMP Core
[09:34:16] Version 1.74 (March 10, 2007)
[09:34:16]
[09:34:16] Preparing to commence simulation
[09:34:16] - Ensuring status. Please wait.
[09:34:33] - Looking at optimizations...
[09:34:33] - Working with standard loops on this execution.
[09:34:33] - Previous termination of core was improper.
[09:34:33] - Going to use standard loops.
[09:34:33] - Files status OK
[09:34:50] - Expanded 4714990 -> 24426905 (decompressed 518.0 percent)
[09:34:50]
[09:34:50] Project: 2665 (Run 3, Clone 9, Gen 53)
[09:34:50]
[09:34:56] Entering M.D.
[09:35:04] Warning: cannot record process ID
[09:35:05] Calling FAH init
[09:35:08] Read topology
[09:35:08] (Starting from checkpoint)
[09:35:08] Read checkpoint
[09:35:11] Protein: HGG in water
[09:35:11] Writing local files
[09:35:28] Extra SSE boost OK.
[09:35:29] Writing local files
[09:35:29] Completed 0 out of 250000 steps (0 percent)
[09:59:35] Writing local files
[09:59:35] Completed 2500 out of 250000 steps (1 percent)
[10:19:56] Writing local files
[10:19:56] Completed 5000 out of 250000 steps (2 percent)
[10:40:11] Writing local files
[10:40:12] Completed 7500 out of 250000 steps (3 percent)
[11:00:28] Writing local files
[11:00:28] Completed 10000 out of 250000 steps (4 percent)
[11:20:45] Writing local files
[11:20:45] Completed 12500 out of 250000 steps (5 percent)
[11:41:00] Writing local files
[11:41:01] Completed 15000 out of 250000 steps (6 percent)
[12:01:13] Writing local files
[12:01:13] Completed 17500 out of 250000 steps (7 percent)
[12:21:38] Writing local files
[12:21:38] Completed 20000 out of 250000 steps (8 percent)
[12:41:53] Writing local files
[12:41:54] Completed 22500 out of 250000 steps (9 percent)
[13:02:09] Writing local files
[13:02:09] Completed 25000 out of 250000 steps (10 percent)
[13:22:25] Writing local files
[13:22:25] Completed 27500 out of 250000 steps (11 percent)
[13:42:43] Writing local files
[13:42:44] Completed 30000 out of 250000 steps (12 percent)
[14:03:10] Writing local files
[14:03:10] Completed 32500 out of 250000 steps (13 percent)
[14:23:23] Writing local files
[14:23:23] Completed 35000 out of 250000 steps (14 percent)
[14:43:40] Writing local files
[14:43:40] Completed 37500 out of 250000 steps (15 percent)
[15:03:57] Writing local files
[15:03:58] Completed 40000 out of 250000 steps (16 percent)
[15:24:14] Writing local files
[15:24:14] Completed 42500 out of 250000 steps (17 percent)
[15:34:16] - Autosending finished units... [September 30 15:34:16 UTC]
[15:34:16] Trying to send all finished work units
[15:34:16] + No unsent completed units remaining.
[15:34:16] - Autosend completed
[15:44:31] Writing local files
[15:44:31] Completed 45000 out of 250000 steps (18 percent)
[16:04:48] Writing local files
[16:04:48] Completed 47500 out of 250000 steps (19 percent)
[16:25:07] Writing local files
[16:25:07] Completed 50000 out of 250000 steps (20 percent)
[16:45:31] Writing local files
[16:45:31] Completed 52500 out of 250000 steps (21 percent)
[17:05:47] Writing local files
[17:05:47] Completed 55000 out of 250000 steps (22 percent)
[17:26:05] Writing local files
[17:26:05] Completed 57500 out of 250000 steps (23 percent)
[17:46:22] Writing local files
[17:46:22] Completed 60000 out of 250000 steps (24 percent)
[18:06:40] Writing local files
[18:06:41] Completed 62500 out of 250000 steps (25 percent)
[18:26:59] Writing local files
[18:26:59] Completed 65000 out of 250000 steps (26 percent)
[18:47:17] Writing local files
[18:47:17] Completed 67500 out of 250000 steps (27 percent)
[19:07:34] Writing local files
[19:07:34] Completed 70000 out of 250000 steps (28 percent)
[19:27:39] Writing local files
[19:27:39] Completed 72500 out of 250000 steps (29 percent)
[19:47:59] Writing local files
[19:47:59] Completed 75000 out of 250000 steps (30 percent)
[20:08:18] Writing local files
[20:08:18] Completed 77500 out of 250000 steps (31 percent)
[20:28:37] Writing local files
[20:28:37] Completed 80000 out of 250000 steps (32 percent)
[20:48:55] Writing local files
[20:48:55] Completed 82500 out of 250000 steps (33 percent)
[21:09:14] Writing local files
[21:09:14] Completed 85000 out of 250000 steps (34 percent)
[21:29:32] Writing local files
[21:29:33] Completed 87500 out of 250000 steps (35 percent)
[21:34:16] - Autosending finished units... [September 30 21:34:16 UTC]
[21:34:16] Trying to send all finished work units
[21:34:16] + No unsent completed units remaining.
[21:34:16] - Autosend completed
[21:49:48] Writing local files
[21:49:49] Completed 90000 out of 250000 steps (36 percent)
[22:10:08] Writing local files
[22:10:08] Completed 92500 out of 250000 steps (37 percent)
[22:30:27] Writing local files
[22:30:28] Completed 95000 out of 250000 steps (38 percent)
[22:50:46] Writing local files
[22:50:46] Completed 97500 out of 250000 steps (39 percent)
[23:11:06] Writing local files
[23:11:06] Completed 100000 out of 250000 steps (40 percent)
[23:31:25] Writing local files
[23:31:25] Completed 102500 out of 250000 steps (41 percent)
[23:51:44] Writing local files
[23:51:44] Completed 105000 out of 250000 steps (42 percent)
[00:11:57] Writing local files
[00:11:58] Completed 107500 out of 250000 steps (43 percent)
[00:31:45] Writing local files
[00:31:46] Completed 110000 out of 250000 steps (44 percent)
[00:52:17] Writing local files
[00:52:18] Completed 112500 out of 250000 steps (45 percent)
[01:12:34] Writing local files
[01:12:35] Completed 115000 out of 250000 steps (46 percent)
[01:32:52] Writing local files
[01:32:52] Completed 117500 out of 250000 steps (47 percent)
[01:53:08] Writing local files
[01:53:08] Completed 120000 out of 250000 steps (48 percent)
[02:13:26] Writing local files
[02:13:26] Completed 122500 out of 250000 steps (49 percent)
[02:19:52] Warning: long 1-4 interactions
[02:19:53] Quit 101 - NaN detected: (ener[20])
[02:19:53]
[02:19:53] Simulation instability has been encountered. The run has entered a
[02:19:53] state from which no further progress can be made.
[02:19:53] This may be the correct result of the simulation, however if you
[02:19:53] often see other project units terminating early like this
[02:19:53] too, you may wish to check the stability of your computer (issues
[02:19:53] such as high temperature, overclocking, etc.).
[02:19:53] Going to send back what have done.
[02:19:53] logfile size: 100774
[02:19:53] - Writing 101324 bytes of core data to disk...
[02:19:53] ... Done.
[02:19:53] - Failed to delete work/wudata_03.arc
[02:19:53] Warning: check for stray files
[02:21:53]
[02:21:53] Folding@home Core Shutdown: EARLY_UNIT_END
[02:21:53]
[02:21:53] Folding@home Core Shutdown: EARLY_UNIT_END
[02:21:55] CoreStatus = 7B (123)
[02:21:55] Client-core communications error: ERROR 0x7b
[02:21:55] Deleting current work unit & continuing...
[02:21:55] Using generic mpiexec calls
[02:23:58] - Warning: Could not delete all work unit files (3): Core returned invalid code
[02:23:58] Trying to send all finished work units
[02:23:58] + No unsent completed units remaining.
[02:23:58] - Preparing to get new work unit...
[02:23:58] + Attempting to get work packet
[02:23:58] - Will indicate memory of 2046 MB
[02:23:58] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 11
[02:23:58] - Connecting to assignment server
[02:23:58] Connecting to http://assign.stanford.edu:8080/
[02:23:58] Posted data.
[02:23:58] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[02:23:58] + News From Folding@Home: Welcome to Folding@Home
[02:23:58] Loaded queue successfully.
[02:23:58] Connecting to http://171.64.65.64:8080/
[02:23:59] Posted data.
[02:23:59] Initial: 0000; + Could not connect to Work Server
[02:23:59] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
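
In case it helps anyone compare their own logs against this one, here is a rough sketch of how the WU identity (project number plus Run, Clone, Gen) and the point of failure could be pulled out of a client log. It only relies on the line formats visible in the log above. The names `WorkUnit` and `summarize` are made up for this example and are not part of the Folding@home client.

```python
import re
from typing import NamedTuple


class WorkUnit(NamedTuple):
    """Hypothetical key for a WU: project number plus (Run, Clone, Gen)."""
    project: int
    run: int
    clone: int
    gen: int


# Patterns copied from the line formats seen in the log above.
PRCG_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
STEPS_RE = re.compile(r"Completed (\d+) out of (\d+) steps")


def summarize(log_text: str) -> dict:
    """Return the WU key, the last reported step, and whether the unit ended early."""
    wu = None
    last_step = None
    failed = False
    for line in log_text.splitlines():
        m = PRCG_RE.search(line)
        if m:
            wu = WorkUnit(*map(int, m.groups()))
        m = STEPS_RE.search(line)
        if m:
            last_step = (int(m.group(1)), int(m.group(2)))
        # Either marker appears in the log above when the unit is abandoned.
        if "NaN detected" in line or "EARLY_UNIT_END" in line:
            failed = True
    return {"wu": wu, "last_step": last_step, "failed": failed}


sample = """\
[09:34:50] Project: 2665 (Run 3, Clone 9, Gen 53)
[02:13:26] Completed 122500 out of 250000 steps (49 percent)
[02:19:52] Warning: long 1-4 interactions
[02:19:53] Quit 101 - NaN detected: (ener[20])
[02:21:53] Folding@home Core Shutdown: EARLY_UNIT_END
"""

result = summarize(sample)
print(result)
```

Since `WorkUnit` is a plain tuple, two logs refer to the same WU exactly when their keys compare equal, which makes it easy to check whether another client reported the same unit failing.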