Project: 6041 (Run 0, Clone 14, Gen 35)

Moderators: Site Moderators, FAHC Science Team

Post Reply
chrisretusn
Posts: 101
Joined: Sat Feb 02, 2008 10:12 am
Hardware configuration: AMD Athlon(tm) 64 X2 Dual Core Processor 4000+
AMD Athlon(tm) XP 2600+
Location: Philippines

Project: 6041 (Run 0, Clone 14, Gen 35)

Post by chrisretusn »

Code: Select all

# Linux SMP Console Edition ###################################################
                       Folding@Home Client Version 6.29
Launch directory: /home/folding/fah
Executable: ./fah6
Arguments: -verbosity 9 -smp 

[21:20:01] + Attempting to get work packet
[21:20:01] Passkey found
[21:20:01] - Will indicate memory of 3832 MB
[21:20:01] - Connecting to assignment server
[21:20:01] Connecting to http://assign.stanford.edu:8080/
[21:20:02] Posted data.
[21:20:02] Initial: 40AB; - Successful: assigned to (171.64.65.54).
[21:20:02] + News From Folding@Home: Welcome to Folding@Home
[21:20:02] Loaded queue successfully.
[21:20:02] Connecting to http://171.64.65.54:8080/
[21:20:05] Posted data.
[21:20:05] Initial: 0000; - Receiving payload (expected size: 7884767)
[21:22:30] - Downloaded at ~53 kB/s
[21:22:30] - Averaged speed for that direction ~50 kB/s
[21:22:30] + Received work.
[21:22:30] Trying to send all finished work units
[21:22:30] + No unsent completed units remaining.
[21:22:30] + Closed connections
[21:22:30] 
[21:22:30] + Processing work unit
[21:22:30] Core required: FahCore_a3.exe
[21:22:30] Core found.
[21:22:30] Working on queue slot 03 [August 14 21:22:30 UTC]
[21:22:30] + Working ...
[21:22:30] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 03 -np 2 -checkpoint 15 -verbose -lifeline 1570 -version 629'

[21:22:30] 
[21:22:30] *------------------------------*
[21:22:30] Folding@Home Gromacs SMP Core
[21:22:30] Version 2.22 (June 10, 2010)
[21:22:30] 
[21:22:30] Preparing to commence simulation
[21:22:30] - Looking at optimizations...
[21:22:30] - Created dyn
[21:22:30] - Files status OK
[21:22:31] - Expanded 7884255 -> 10126021 (decompressed 128.4 percent)
[21:22:31] Called DecompressByteArray: compressed_data_size=7884255 data_size=10126021, decompressed_data_size=10126021 diff=0
[21:22:31] - Digital signature verified
[21:22:31] 
[21:22:31] Project: 6041 (Run 0, Clone 14, Gen 35)
[21:22:31] 
[21:22:32] Assembly optimizations on if available.
[21:22:32] Entering M.D.
Starting 2 threads
NNODES=2, MYRANK=0, HOSTNAME=thread #0
NNODES=2, MYRANK=1, HOSTNAME=thread #1
Reading file work/wudata_03.tpr, VERSION 4.0.99_development_20090605 (single precision)
Making 1D domain decomposition 2 x 1 x 1
starting mdrun '8817 system'
9000001 steps,  36000.0 ps (continuing from step 8750001,  35000.0 ps).
[21:22:44] Completed 0 out of 250000 steps  (0%)

step 8750002: Water molecule starting at atom 282568 can not be settled.
Check for bad contacts and/or reduce the timestep if appropriate.

Step 8750003  Warning: pressure scaling more than 1%, mu: 1.01826 1.01826 1.01826

Step 8750003  Warning: pressure scaling more than 1%, mu: 1.01826 1.01826 1.01826

step 8750003: Water molecule starting at atom 195802 can not be settled.
Check for bad contacts and/or reduce the timestep if appropriate.

Step 8750004:
The charge group starting at atom 22219 moved than the distance allowed by the domain decomposition (7.699520) in direction X
distance out of cell -17.413467
Old coordinates:    8.137    0.128    0.728
New coordinates:   -9.714   10.687    3.879
Old cell boundaries in direction X:    7.561   15.123
New cell boundaries in direction X:    7.700   15.399

-------------------------------------------------------
Program mdrun, VERSION 4.0.99-dev-20100610-b6a86-dirty
Source code file: /data0/FAHdev/a3_development/gromacs/src/mdlib/domdec.c, line: 4081

Fatal error:
A charge group moved too far between two domain decomposition steps
This usually means that your system is not well equilibrated
For more information and tips for troubleshooting, please check the GROMACS
website at http://www.gromacs.org/Documentation/Errors
-------------------------------------------------------

Thanx for Using GROMACS - Have a Nice Day

[21:22:47] mdrun returned 255
[21:22:47] Going to send back what have done -- stepsTotalG=250000
[21:22:47] Work fraction=4374873.0000 steps=250000.
[21:22:51] logfile size=11815 infoLength=11815 edr=25 trr=1
[21:22:51] logfile size: 11815 info=11815 bed=25 hdr=1
[21:22:51] - Writing 12353 bytes of core data to disk...
[21:22:51]   ... Done.
[21:23:23] 
[21:23:23] Folding@home Core Shutdown: UNSTABLE_MACHINE
[21:23:23] CoreStatus = 7A (122)
[21:23:23] Sending work to server
[21:23:23] Project: 6041 (Run 0, Clone 14, Gen 35)


[21:23:23] + Attempting to send results [August 14 21:23:23 UTC]
[21:23:23] - Reading file work/wuresults_03.dat from core
[21:23:23]   (Read 12353 bytes from disk)
[21:23:23] Connecting to http://171.64.65.54:8080/
[21:23:25] Posted data.
[21:23:25] Initial: 0000; - Uploaded at ~6 kB/s
[21:23:25] - Averaged speed for that direction ~15 kB/s
[21:23:25] + Results successfully sent
[21:23:25] Thank you for your contribution to Folding@Home.
I noticed this in my log today. I would have never noticed it had I not looked. The WU before this one processed and finished normally, the current WU is also processing nornally at 66%. Both the previous WU and current WU are different projects. This is my first Project 6041.

I looked up CoreStatus = 7A (122) in the Wiki, says it's related to GPU, as you can see this is not a GPU. This is a AMD Athlon(tm) 64 X2 Dual Core Processor 4000+ running under Slackware 13.1

A search revealed other WU's with the same "CoreStatus = 7A (122)" and the "mdrun returned 255" in logs but all slight different.

Figured the log (information maybe useful to someone so I am reporting it.
Image
Folding on Slackware Linux.
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix
Contact:

Re: Project: 6041 (Run 0, Clone 14, Gen 35)

Post by sortofageek »

Project: 6041 (Run 0, Clone 14, Gen 35) was successfully completed by another folder, so it is not a bad WU.

Hi chrisretusn (team 2291),
Your WU (P6041 R0 C14 G35) was added to the stats database on 2010-08-14 15:09:25 for 0.54 points of credit.

From what I could see, the error reported for what you returned may indicate an instability of some sort. If you see repeats of this, you may want to check for hardware instability of some sort.
chrisretusn
Posts: 101
Joined: Sat Feb 02, 2008 10:12 am
Hardware configuration: AMD Athlon(tm) 64 X2 Dual Core Processor 4000+
AMD Athlon(tm) XP 2600+
Location: Philippines

Re: Project: 6041 (Run 0, Clone 14, Gen 35)

Post by chrisretusn »

Thanks sortofageek. Hopefully it was a fluke. I will keep an eye out for this.
Image
Folding on Slackware Linux.
Post Reply