Project: 2665 (Run 2, Clone 179, Gen 62) Sigterm/Hang
Posted: Fri Oct 31, 2008 6:12 pm
Found this WU hung at 90%. After a restart it ran for about a minute,
then got a SIGTERM. I deleted queue.dat and the work folder and was
assigned a new WU.
Normally when the client hangs, processor load drops to zero and
my monitoring script restarts the client. This time the cores stayed
busy doing something. I only discovered the problem by looking at the log
file and noticing that no progress was being made.
[08:59:28] Timered checkpoint triggered.
[09:01:03] Warning: long 1-4 interactions
[10:46:25] - Autosending finished units... [October 31 10:46:25 UTC]
[10:46:25] Trying to send all finished work units
[10:46:25] + No unsent completed units remaining.
[10:46:25] - Autosend completed
[16:46:25] - Autosending finished units... [October 31 16:46:25 UTC]
[16:46:25] Trying to send all finished work units
[16:46:25] + No unsent completed units remaining.
[16:46:25] - Autosend completed
Client Version 6.23 Beta R1
Current Work Unit
-----------------
Name: p2665_IBX in water
Tag: P2665R2C179G62
Download time: October 30 03:59:24
Due time: November 5 03:59:24
Progress: 90% [|||||||||_]
[17:59:47] Completed 226961 out of 250000 steps (90 percent)
[17:59:47] d 226961 out of 250000 steps (90 percent)
[17:59:48] Extra SSE boost OK.
[18:01:58] Finalizing output
[18:02:02] CoreStatus = 66 (102)
[18:02:02] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[18:02:02] Killing all core threads
Folding@Home Client Shutdown.
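Since the CPU-load check missed this hang, a log-based check might catch it. Below is a rough Python sketch of the idea, not my actual monitoring script: it looks for the last "Completed N out of M steps" line in the log and reports a stall if that line is older than some threshold. The function names and the one-hour default are my own assumptions, and it assumes the log timestamps are time-of-day only (as in the excerpts above), so gaps of 24 hours or more cannot be detected.

```python
import re

# Matches progress lines in the form seen in the log above, e.g.
# "[17:59:47] Completed 226961 out of 250000 steps (90 percent)"
PROGRESS_RE = re.compile(
    r"^\[(\d{2}):(\d{2}):(\d{2})\] Completed (\d+) out of (\d+) steps"
)


def last_progress(lines):
    """Return (seconds_since_midnight, steps_done) for the last progress
    line found, or None if the log has no progress lines."""
    result = None
    for line in lines:
        m = PROGRESS_RE.match(line)
        if m:
            hours, minutes, seconds, done, _total = map(int, m.groups())
            result = (hours * 3600 + minutes * 60 + seconds, done)
    return result


def seconds_since(then, now):
    """Elapsed seconds between two times of day, wrapping past midnight.
    Assumes the two times are less than 24 hours apart."""
    return (now - then) % 86400


def is_stalled(lines, now_seconds, max_idle_seconds=3600):
    """True if the last progress line is older than max_idle_seconds.
    max_idle_seconds is a hypothetical threshold; tune it to the WU's
    normal time per percent."""
    progress = last_progress(lines)
    if progress is None:
        return False  # no progress lines yet, so nothing to judge
    return seconds_since(progress[0], now_seconds) > max_idle_seconds
```

A monitoring script could read the client log, pass the lines along with the current UTC time of day in seconds, and restart the client when is_stalled returns True, regardless of what the processor load looks like.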