Project 2665 (run 3, clone 506, gen 148)

Moderators: Site Moderators, FAHC Science Team

Post Reply
verdeva
Posts: 30
Joined: Mon Dec 03, 2007 1:40 pm
Location: Seattle, WA

Project 2665 (run 3, clone 506, gen 148)

Post by verdeva »

Twice this unit ended at 43 % with long interactions.

Code: Select all

[12:40:55] Timered checkpoint triggered.
[12:45:47] Writing local files
[12:45:48] Completed 102500 out of 250000 steps  (41 percent)
[13:00:47] Timered checkpoint triggered.
[13:05:36] Writing local files
[13:05:37] Completed 105000 out of 250000 steps  (42 percent)
[13:20:36] Timered checkpoint triggered.
[13:25:25] Writing local files
[13:25:26] Completed 107500 out of 250000 steps  (43 percent)
[13:39:51] Warning:  long 1-4 interactions
[13:40:18] - Autosending finished units... [November 26 13:40:18 UTC]
[13:40:18] Trying to send all finished work units
[13:40:18] + No unsent completed units remaining.
[13:40:18] - Autosend completed
[16:25:25] At least 3 hours since checkpoint written...
[16:25:25] 
[16:25:25] Folding@home Core Shutdown: EARLY_UNIT_END
[16:25:25] 
[16:25:25] Folding@home Core Shutdown: EARLY_UNIT_END
[16:25:30] CoreStatus = 7B (123)
[16:25:30] Sending work to server
[16:25:30] Project: 2665 (Run 3, Clone 506, Gen 148)
[16:25:30] - Error: Could not get length of results file work/wuresults_08.dat
[16:25:30] - Error: Could not read unit 08 file. Removing from queue.
[16:25:30] Trying to send all finished work units
[16:25:30] + No unsent completed units remaining.
[16:25:30] - Preparing to get new work unit...
[16:25:30] Cleaning up work directory
[16:25:30] + Attempting to get work packet
[16:25:30] - Will indicate memory of 1023 MB
[16:25:30] - Connecting to assignment server
[16:25:30] Connecting to http://assign.stanford.edu:8080/
[16:25:30] Posted data.
[16:25:30] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[16:25:30] + News From Folding@Home: Welcome to Folding@Home
[16:25:31] Loaded queue successfully.
[16:25:31] Connecting to http://171.64.65.64:8080/
[16:25:36] Posted data.
[16:25:36] Initial: 0000; - Receiving payload (expected size: 4679921)
[16:25:41] - Downloaded at ~914 kB/s
[16:25:41] - Averaged speed for that direction ~1029 kB/s
[16:25:41] + Received work.
[16:25:41] Trying to send all finished work units
[16:25:41] + No unsent completed units remaining.
[16:25:41] + Closed connections
[16:25:46] 
[16:25:46] + Processing work unit
[16:25:46] Work type a1 not eligible for variable processors
[16:25:46] Core required: FahCore_a1.exe
[16:25:46] Core found.
[16:25:46] Using generic mpiexec calls
[16:25:46] Working on queue slot 09 [November 26 16:25:46 UTC]
[16:25:46] + Working ...
[16:25:46] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 09 -checkpoint 15 -forceasm -verbose -lifeline 3312 -version 624'

[16:25:48] 
[16:25:48] *------------------------------*
[16:25:48] Folding@Home Gromacs SMP Core
[16:25:48] Version 1.74 (March 10, 2007)
[16:25:48] 
[16:25:48] Preparing to commence simulation
[16:25:48] - Assembly optimizations manually forced on.
[16:25:48] - Not checking prior termination.
[16:25:55] - Expanded 4679409 -> 24426905 (decompressed 522.0 percent)
[16:25:55] - Starting from initial work packet
[16:25:55] 
[16:25:55] Project: 2665 (Run 3, Clone 506, Gen 148)
[16:25:55] 
[16:25:55] Assembly optimizations on if available.
[16:25:55] Entering M.D.
[16:26:26]  on if available.
[16:26:26] Entering M.D.
[16:26:32] Rejecting checkpoint
[16:26:34] Protein: HGG in water
[16:26:34] Writing local files
[16:26:44] Extra SSE boost OK.
[16:26:45] Writing local files
[16:26:45] Completed 0 out of 250000 steps  (0 percent)
[16:41:46] Timered checkpoint triggered.
[16:46:38] Writing local files
[16:46:38] Completed 2500 out of 250000 steps  (1 percent)
[17:01:38] Timered checkpoint triggered.
[17:06:26] Writing local files
[17:06:27] Completed 5000 out of 250000 steps  (2 percent)
[17:21:27] Timered checkpoint triggered.
[17:26:14] Writing local files
[17:26:14] Completed 7500 out of 250000 steps  (3 percent)
Edit by Mod: Added [code][/code] tags.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project 2665 (run 3, clone 506, gen 148)

Post by bruce »

Please post enough of the log so we can see the second failure.
verdeva
Posts: 30
Joined: Mon Dec 03, 2007 1:40 pm
Location: Seattle, WA

Re: Project 2665 (run 3, clone 506, gen 148)

Post by verdeva »

Sorry its gone to the bit-bucket; but trust me, it was exactly the same as you see above.
Post Reply