Page 1 of 1

Project: 2669 (Run 2, Clone 60, Gen 88)

Posted: Wed Jun 24, 2009 2:27 am
by geokilla
Before I post the log, I'd just like to point out that I'm using the Notfred client and that I understand it's not widely supported. There's also that the error is fairly new to Folding@Home. I'm posting the log so hopefully that the mods and admin and folding team can determine the cause of the error and a solution to it. Google says that it seems to be based on a bad checkpoint though.

Code: Select all

--- Opening Log file [June 23 12:20:34] 


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /etc/folding/1
Executable: ./fah6
Arguments: -local -forceasm -smp 4 

Warning:
 By using the -forceasm flag, you are overriding
 safeguards in the program. If you did not intend to
 do this, please restart the program without -forceasm.
 If work units are not completing fully (and particularly
 if your machine is overclocked), then please discontinue
 use of the flag.

[12:20:34] - Ask before connecting: No
[12:20:34] - User name: geokilla (Team 38296)
[12:20:34] - User ID: 51BF9B7E31882B4A
[12:20:34] - Machine ID: 1
[12:20:34] 
[12:20:34] Loaded queue successfully.
[12:20:34] 
[12:20:34] + Processing work unit
[12:20:34] At least 4 processors must be requested.Core required: FahCore_a2.exe
[12:20:34] Core not found.
[12:20:34] - Core is not present or corrupted.
[12:20:34] - Attempting to download new core...
[12:20:34] + Downloading new core: FahCore_a2.exe
[12:20:37] + 10240 bytes downloaded
. . .
. . . 
[12:20:46] + 1770268 bytes downloaded
[12:20:46] Verifying core Core_a2.fah...
[12:20:46] Signature is VALID
[12:20:46] 
[12:20:46] Trying to unzip core FahCore_a2.exe
[12:20:46] Decompressed FahCore_a2.exe (4341288 bytes) successfully
[12:20:46] + Core successfully engaged
[12:20:51] 
[12:20:51] + Processing work unit
[12:20:51] At least 4 processors must be requested.Core required: FahCore_a2.exe
[12:20:51] Core found.
[12:20:51] Working on Unit 02 [June 23 12:20:51]
[12:20:51] + Working ...
[12:20:51] 
[12:20:51] *------------------------------*
[12:20:51] Folding@Home Gromacs SMP Core
[12:20:51] Version 2.07 (Sun Apr 19 14:51:09 PDT 2009)
[12:20:51] 
[12:20:51] Preparing to commence simulation
[12:20:51] - Ensuring status. Please wait.
[12:21:01] - Assembly optimizations manually forced on.
[12:21:01] - Not checking prior termination.
[12:21:09] - Expanded 4836049 -> 23977273 (decompressed 495.8 percent)
[12:21:15] Called DecompressByteArray: compressed_data_size=4836049 data_size=23977273, decompressed_data_size=23977273 diff=0
[12:21:15] - Digital signature verified
[12:21:15] 
[12:21:15] Project: 2669 (Run 2, Clone 60, Gen 88)
[12:21:15] 
[12:21:15] Assembly optimizations on if available.
[12:21:15] Entering M.D.
[12:21:21] Using Gromacs checkpoints
[12:21:23] Multi-core optimizations on
[12:21:28] Resuming from checkpoint
[12:21:29] Verified work/wudata_02.log
[12:21:29] Verified work/wudata_02.trr
[12:21:29] Verified work/wudata_02.xtc
[12:21:29] Verified work/wudata_02.edr
[12:21:32] Completed 152510 out of 250000 steps  (61%)
[12:48:39] Completed 155000 out of 250000 steps  (62%)
[13:06:37] Completed 157500 out of 250000 steps  (63%)
[13:23:54] Completed 160000 out of 250000 steps  (64%)
[13:41:13] Completed 162500 out of 250000 steps  (65%)
[13:45:50] CoreStatus = FF (255)
[13:45:50] Client-core communications error: ERROR 0xff
[13:45:50] Deleting current work unit & continuing...
[13:46:04] - Preparing to get new work unit...
[13:46:04] + Attempting to get work packet
[13:46:04] - Connecting to assignment server
[13:46:05] - Successful: assigned to (171.64.65.56).
[13:46:05] + News From Folding@Home: Welcome to Folding@Home
[13:46:05] Loaded queue successfully.
[13:46:45] + Closed connections
[13:46:50] 
[13:46:50] + Processing work unit
[13:46:50] At least 4 processors must be requested.Core required: FahCore_a2.exe
[13:46:50] Core found.
[13:46:50] Working on Unit 03 [June 23 13:46:50]
[13:46:50] + Working ...
[13:46:50] 
[13:46:50] *------------------------------*
[13:46:50] Folding@Home Gromacs SMP Core
[13:46:50] Version 2.07 (Sun Apr 19 14:51:09 PDT 2009)
[13:46:50] 
[13:46:50] Preparing to commence simulation
[13:46:50] - Ensuring status. Please wait.
[13:46:59] - Assembly optimizations manually forced on.
[13:46:59] - Not checking prior termination.
[13:47:03] - Expanded 4836049 -> 23977273 (decompressed 495.8 percent)
[13:47:04] Called DecompressByteArray: compressed_data_size=4836049 data_size=23977273, decompressed_data_size=23977273 diff=0
[13:47:04] - Digital signature verified
[13:47:04] 
[13:47:04] Project: 2669 (Run 2, Clone 60, Gen 88)
[13:47:04] 
[13:47:04] Assembly optimizations on if available.
[13:47:04] Entering M.D.
[13:47:14] Multi-core optimizations on
[13:47:20] Completed 0 out of 250000 steps  (0%)
[14:25:37] Completed 2500 out of 250000 steps  (1%)
[15:08:49] Completed 5000 out of 250000 steps  (2%)
[15:47:57] Completed 7500 out of 250000 steps  (3%)
[16:29:58] Completed 10000 out of 250000 steps  (4%)
[17:13:55] Completed 12500 out of 250000 steps  (5%)
[17:53:34] Completed 15000 out of 250000 steps  (6%)
[18:14:51] Completed 17500 out of 250000 steps  (7%)
[18:32:35] Completed 20000 out of 250000 steps  (8%)
[18:49:41] Completed 22500 out of 250000 steps  (9%)

Re: Project: 2669 (Run 2, Clone 60, Gen 88)

Posted: Wed Jun 24, 2009 5:26 am
by bruce
I'm not sure what Error 0xff is, but if you're right about it being a bad checkpoint, I'd guess that whatever storage media that the files are being written to is too small. How much free space is there in the partition in which FAH stores its work files?

Re: Project: 2669 (Run 2, Clone 60, Gen 88)

Posted: Wed Jun 24, 2009 5:51 pm
by geokilla
Um I run the Notfred client, so it's a diskless folding client. The partition where the VMX files are located is over 80GB big. As for the place where it stores the F@H files, I have no idea. Diskless folding has its problems I guess.

Re: Project: 2669 (Run 2, Clone 60, Gen 88)

Posted: Sat Jun 27, 2009 12:22 pm
by susato
This post and the few after it in the Third Party Applications section of the forum may help to solve your problem. I too have run out of memory in the VM when running notfred's on a Windows host machine. it's easy to increase the VM's RAM but not obvious how to increase the size of the VM's virtual hard disk. I get the impression that most people using notfred's app (note that it ISN'T a client; the client is Stanford's) use the USB stick version.

To stay on topic, let's save this thread for other reports of trouble with the actual work unit, and redirect discussion of the notfred app back to the appropriate thread in the Third Party Apps forum area. Many thanks!