Project: 2668 (Run 1, Clone 443, Gen 54) hanging

Moderators: Site Moderators, FAHC Science Team

Post Reply
parkut
Posts: 365
Joined: Tue Feb 12, 2008 7:33 am
Hardware configuration: Running exclusively Linux headless blades. All are dedicated crunching machines.
Location: SE Michigan, USA

Project: 2668 (Run 1, Clone 443, Gen 54) hanging

Post by parkut »

This particular WU seems to be hanging. My "watchdog" script notices
the system load drops to zero and restarts it. So far it seems to be
progressing between "hangs", but I will be going away for the new year
holiday and it may fail.

model name : Intel(R) Core(TM)2 Duo CPU E6750 @ 2.66GHz
cpu MHz : 1998.000
cache size : 4096 KB
Memory: 976.11 MB physical, 1.94 GB virtual
...
Current Work Unit
-----------------
Name: p2668_IBX in water
Tag: P2668R1C443G54
Download time: December 31 09:02:47
Due time: January 3 09:02:47
Progress: 31% [|||_______]

[13:06:23] Completed 55000 out of 250000 steps (22%)
hung
[14:13:10] Completed 57500 out of 250000 steps (23%)
hung
[15:16:36] Completed 67500 out of 250000 steps (27%)
hung
[16:16:17] Completed 77500 out of 250000 steps (31%)
hung
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: Project: 2668 (Run 1, Clone 443, Gen 54) hanging

Post by kasson »

Thanks for the report. We're not certain what causes these "hangs" or whether it's WU-specific. Let us know how this one turns out.
Post Reply