Moderators: Site Moderators , FAHC Science Team
Foxery
Posts: 118 Joined: Mon Mar 03, 2008 3:11 am
Hardware configuration: Intel Core2 Quad Q9300 (Intel P35 chipset) Radeon 3850, 512MB model (Catalyst 8.10) Windows XP, SP2
Location: Syracuse, NY
Post
by Foxery » Wed Oct 01, 2008 2:13 am
"Gromacs cannot continue further." - Failed multiple times,
always at 6%, and attempted to re-download FahCore_a1 several times before finally getting a new WU. Partial results sent using qfix.
Log snippet:
Code: Select all
[19:22:14] + Processing work unit
[19:22:14] Work type a1 not eligible for variable processors
[19:22:14] Core required: FahCore_a1.exe
[19:22:14] Core found.
[19:22:14] Working on queue slot 02 [September 30 19:22:14 UTC]
[19:22:14] + Working ...
[19:22:14] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 02 -checkpoint 16 -verbose -lifeline 4272 -version 622'
[19:22:15]
[19:22:15] *------------------------------*
[19:22:15] Folding@Home Gromacs SMP Core
[19:22:15] Version 1.76 (February 23, 2008)
[19:22:15]
[19:22:15] Preparing to commence simulation
[19:22:15] - Ensuring status. Please wait.
[19:22:32] - Looking at optimizations...
[19:22:32] - Working with standard loops on this execution.
[19:22:32] - Created dyn
[19:22:32] - Files status OK
[19:22:41] - Expanded 4740740 -> 24426905 (decompressed 515.2 percent)
[19:22:42] - Starting from initial work packet
[19:22:42]
[19:22:42] Project: 2665 (Run 0, Clone 897, Gen 42)
[19:22:42]
[19:22:44] Entering M.D.
[19:22:50] Rejecting checkpoint
[19:22:52] Protein: HGG in water
[19:22:52] Writing local files
[19:22:59] Extra SSE boost OK.
[19:22:59] Writing local files
[19:22:59] Completed 0 out of 250000 steps (0 percent)
[19:36:55] Writing local files
[19:36:55] Completed 2500 out of 250000 steps (1 percent)
[19:50:50] Writing local files
[19:50:50] Completed 5000 out of 250000 steps (2 percent)
[20:04:44] Writing local files
[20:04:44] Completed 7500 out of 250000 steps (3 percent)
[20:18:37] Writing local files
[20:18:37] Completed 10000 out of 250000 steps (4 percent)
[20:32:30] Writing local files
[20:32:31] Completed 12500 out of 250000 steps (5 percent)
[20:46:24] Writing local files
[20:46:24] Completed 15000 out of 250000 steps (6 percent)
[20:53:35] Gromacs cannot continue further.
[20:53:35] Going to send back what have done.
[20:53:35] logfile size: 20430
[20:53:35] - Writing 20966 bytes of core data to disk...
[20:53:35] ... Done.
[20:53:35] - Failed to delete work/wudata_02.sas
[20:53:35] - Failed to delete work/wudata_02.goe
[20:53:35] Warning: check for stray files
[20:55:35]
[20:55:35] Folding@home Core Shutdown: EARLY_UNIT_END
[20:55:35]
[20:55:35] Folding@home Core Shutdown: EARLY_UNIT_END
[20:55:39] CoreStatus = 63 (99)
[20:55:39] + Error starting Folding@Home core.
[20:55:44]
[20:55:44] + Processing work unit
[20:55:44] Work type a1 not eligible for variable processors
[20:55:44] Core required: FahCore_a1.exe
[20:55:44] Core found.
[20:55:44] Working on queue slot 02 [September 30 20:55:44 UTC]
[20:55:44] + Working ...
[20:55:44] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 02 -checkpoint 16 -verbose -lifeline 4272 -version 622'
[20:55:44]
[20:55:44] *------------------------------*
[20:55:44] Folding@Home Gromacs SMP Core
[20:55:44] Version 1.76 (February 23, 2008)
[20:55:44]
[20:55:44] Preparing to commence simulation
[20:55:44] - Ensuring status. Please wait.
[20:55:44] Created dyn
[20:55:44] - Files status OK
[20:55:44]
[20:55:44] Folding@home Core Shutdown: MISSING_WORK_FILES
[20:55:44] Finalizing output
[20:56:01] ation of core was improper.
[20:56:01] - Going to use OK
[20:56:01] ndard loops.
[20:56:01] - Files status OK
[20:58:01] SSING_WORK_FILES
[20:58:01] Finalizing output
[20:58:01] G_WORK_FILES
[20:58:01] Finalizing output
[20:58:04] CoreStatus = 1 (1)
[20:58:04] Client-core communications error: ERROR 0x1
[20:58:04] - Attempting to download new core...
[20:58:04] + Downloading new core: FahCore_a1.exe
[20:58:04] Downloading core (/~pande/Win32/x86_Deino/Core_a1.fah from www.stanford.edu)
[20:58:05] Initial: AFDE; + 10240 bytes downloaded
(Repeat...)
Core2 Quad/Q9300, Radeon 3850/512MB (WinXP SP2)
toTOW
Site Moderator
Posts: 6429 Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:
Post
by toTOW » Wed Oct 01, 2008 10:21 am
For the time being, you're the only one to have reported this WU.
Hi Foxery (team 198),
Your WU (P2665 R0 C897 G42) was added to the stats database on 2008-09-30 16:55:55 for 91.61 points of credit.
Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
adisor19
Posts: 2 Joined: Mon Dec 03, 2007 4:27 pm
Post
by adisor19 » Wed Oct 01, 2008 1:38 pm
Whoa, i thought i was the only one !
Yes, i've been having the EXACT same problem. My WU always crashes around 6% - 7%. Thank GOD for Google pointing me to this
Code: Select all
--- Opening Log file [September 30 15:40:38 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.22 SMP Beta2
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Documents and Settings\astancescu\Desktop\FAH6.22beta2-win32-SMP-deino
Executable: Folding@home-Win32-x86.exe
Arguments: -smp -deino
[15:40:38] - Ask before connecting: No
[15:40:38] - User name: adisor19 (Team 2630)
[15:40:38] - User ID: 6C9C3C900A93EFD2
[15:40:38] - Machine ID: 1
[15:40:38]
[15:40:38] Loaded queue successfully.
[15:40:38]
[15:40:38] + Processing work unit
[15:40:38] Work type a1 not eligible for variable processors
[15:40:38] Core required: FahCore_a1.exe
[15:40:38] Core found.
[15:40:38] Working on queue slot 01 [September 30 15:40:38 UTC]
[15:40:38] + Working ...
[15:40:40]
[15:40:40] *------------------------------*
[15:40:40] Folding@Home Gromacs SMP Core
[15:40:40] Version 1.76 (February 23, 2008)
[15:40:40]
[15:40:40] Preparing to commence simulation
[15:40:40] - Ensuring status. Please wait.
[15:40:57] - Looking at optimizations...
[15:40:57] - Working with standard loops on this execution.
[15:40:57] Examination of work files indicates 8 consecutive improper terminations of core.
[15:41:08] - Expanded 4727617 -> 24426905 (decompressed 516.6 percent)
[15:41:09]
[15:41:09] Project: 2665 (Run 0, Clone 530, Gen 47)
[15:41:09]
[15:41:22] Entering M.D.
[15:41:30] Calling FAH init
[15:41:33] Read topology
[15:41:33] (Starting from checkpoint)
[15:41:33] Read checkpoint
[15:41:34] Protein: HGG in water
[15:41:34] Writing local files
[15:41:46] Extra SSE boost OK.
[15:41:47] Writing local files
[15:41:47] Completed 0 out of 250000 steps (0 percent)
[16:25:22] Writing local files
[16:25:22] Completed 2500 out of 250000 steps (1 percent)
[17:09:07] Writing local files
[17:09:07] Completed 5000 out of 250000 steps (2 percent)
[17:52:50] Writing local files
[17:52:50] Completed 7500 out of 250000 steps (3 percent)
[18:36:32] Writing local files
[18:36:33] Completed 10000 out of 250000 steps (4 percent)
[19:20:19] Writing local files
[19:20:19] Completed 12500 out of 250000 steps (5 percent)
[20:04:12] Writing local files
[20:04:12] Completed 15000 out of 250000 steps (6 percent)
[20:48:03] Writing local files
[20:48:03] Completed 17500 out of 250000 steps (7 percent)
[21:08:54] Gromacs cannot continue further.
[21:08:54] Going to send back what have done.
[21:08:54] logfile size: 32739
[21:08:54] - Writing 33275 bytes of core data to disk...
[21:08:54] ... Done.
[21:08:54] No C.P. to delete.
[21:08:54] - Failed to delete work/wudata_01.bed
[21:08:54] - Failed to delete work/wudata_01.chk
[21:08:54] - Failed to delete work/wudata_01.sas
[21:08:54] - Failed to delete work/wudata_01.goe
[21:08:54] Warning: check for stray files
[21:10:54]
[21:10:54] Folding@home Core Shutdown: EARLY_UNIT_END
[21:10:54]
[21:10:54] Folding@home Core Shutdown: EARLY_UNIT_END
[21:10:59] CoreStatus = 63 (99)
[21:10:59] + Error starting Folding@Home core.
[21:11:04]
[21:11:04] + Processing work unit
[21:11:04] Work type a1 not eligible for variable processors
[21:11:04] Core required: FahCore_a1.exe
[21:11:04] Core found.
[21:11:04] Working on queue slot 01 [September 30 21:11:04 UTC]
[21:11:04] + Working ...
[21:11:05]
[21:11:05] *------------------------------*
[21:11:05] Folding@Home Gromacs SMP Core
[21:11:05] Version 1.76 (February 23, 2008)
[21:11:05]
[21:11:05] Preparing to commence simulation
[21:11:05] - Ensuring status. Please wait.
[21:11:22] - Looking at optimizations...
[21:11:22] - Working with standard loops on this execution.
[21:11:22] - Previous termination of core was improper.
[21:11:22] - Going to use standard loops.
[21:11:22] - Files status OK
[21:13:22]
[21:13:22] Folding@home Core Shutdown: MISSING_WORK_FILES
[21:13:22] Finalizing output
[21:13:26] CoreStatus = 1 (1)
[21:13:26] Client-core communications error: ERROR 0x1
[21:13:26] This is a sign of more serious problems, shutting down.
What's going on here ? It's been doing this for a few days now non stop
Adi
toTOW
Site Moderator
Posts: 6429 Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:
Post
by toTOW » Wed Oct 01, 2008 2:08 pm
First, update your client : viewtopic.php?f=46&t=4913
Then, you can try qfix : viewtopic.php?f=8&t=191
Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
adisor19
Posts: 2 Joined: Mon Dec 03, 2007 4:27 pm
Post
by adisor19 » Wed Oct 01, 2008 2:16 pm
toTOW wrote: First, update your client : viewtopic.php?f=46&t=4913
Then, you can try qfix : viewtopic.php?f=8&t=191
OK thanks for the info. I have updated the client and i've started a new run. We'll see how this goes, and I'll post an update to confirm if everything is back to normal.
Thanks again,
Adi