Two identical servers, different performance

Moderators: Site Moderators, FAHC Science Team

grizli
Posts: 8
Joined: Tue Sep 20, 2011 11:15 am

Two identical servers, different performance

Post by grizli »

Hello everyone,
So I have 2 boxes that have been folding for a while now. Since day one - machine A has been producing more PPD than second box.
They have been ordered together, identical everything.
Machine A PPD - 36k
Machine B PPD - 25k

The difference is significant, so I decided to see if someone can suggest how to resolve this and make machine B perform the same as A.

Specs on both machines are:
2xE5620, 12Gb RAM.

Thanks
Macaholic
Site Moderator
Posts: 811
Joined: Thu Nov 29, 2007 11:57 pm
Location: 1 Infinite Loop

Re: Two identical servers, different performance

Post by Macaholic »

Welcome to the forums. Have you checked your log files? It is highly unlikely that both machines have worked on the same project work units. Various units are worth varying points depending on several factors. That would explain the difference in points you are seeing. :)
Fold! It does a body good!™
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: Two identical servers, different performance

Post by 7im »

Hello grizli, welcome to the forum.

As another helpful diagnostic step, please post the first 50 lines of each of the fahlog.txt files, starting with the ############### Client Version Here ###############
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
spitter3
Posts: 7
Joined: Wed Jun 23, 2010 12:27 am

Re: Two identical servers, different performance

Post by spitter3 »

Switch hard drives, if the machines are the same its the quickest way to find out if its a programming issue or a hardware issue!!
Nathan_P
Posts: 1180
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 x5670@3.2 Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 E5-2665@2.3 Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: Two identical servers, different performance

Post by Nathan_P »

spitter3 wrote:Switch hard drives, if the machines are the same its the quickest way to find out if its a programming issue or a hardware issue!!
Why would he need to do that. HDD performance is virtually meaningless for folding.

Client configuration - only viewable from the asked for log files, incorrect cpu/ram set up, heat throttling and different PPD from different projects are all far more likely problems.
Image
Grandpa_01
Posts: 1122
Joined: Wed Mar 04, 2009 7:36 am
Hardware configuration: 3 - Supermicro H8QGi-F AMD MC 6174=144 cores 2.5Ghz, 96GB G.Skill DDR3 1333Mhz Ubuntu 10.10
2 - Asus P6X58D-E i7 980X 4.4Ghz 6GB DDR3 2000 A-Data 64GB SSD Ubuntu 10.10
1 - Asus Rampage Gene III 17 970 4.3Ghz DDR3 2000 2-500GB Segate 7200.11 0-Raid Ubuntu 10.10
1 - Asus G73JH Laptop i7 740QM 1.86Ghz ATI 5870M

Re: Two identical servers, different performance

Post by Grandpa_01 »

Swapping hard drives could rule out client configuration. If you swap the drives and machine A drops to 25K and machine B goes up to 35K then it is definitely client configuration. But if all remains the same that only leaves one alternative. Hardware which is highly Likely since no to CPU's, sets of ram etc. that I have ever seen preformed exactly the same.
Image
2 - SM H8QGi-F AMD 6xxx=112 cores @ 3.2 & 3.9Ghz
5 - SM X9QRI-f+ Intel 4650 = 320 cores @ 3.15Ghz
2 - I7 980X 4.4Ghz 2-GTX680
1 - 2700k 4.4Ghz GTX680
Total = 464 cores folding
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: Two identical servers, different performance

Post by 7im »

Might be easier just to swap the FAH folders. ;)
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
k1wi
Posts: 910
Joined: Tue Sep 22, 2009 10:48 pm

Re: Two identical servers, different performance

Post by k1wi »

Swapping folders might be a good idea, because it could be something as simple as a heatsink being incorrectly mounted, or dodgy ram. I note that it's a dual-cpu set up which I imagine could add complexity somewhat considerably.
Mactin
Posts: 222
Joined: Sun Dec 02, 2007 1:08 pm
Location: Côte-des-Neiges, Montréal, Québec

Re: Two identical servers, different performance

Post by Mactin »

Hello Grizli,
Welcome to the forums.
For How long ? The gap is big but might not me statisticaly significant if the time frame is not long enogh.
H
Image
grizli
Posts: 8
Joined: Tue Sep 20, 2011 11:15 am

Re: Two identical servers, different performance

Post by grizli »

Here are 2 latest starts of each machine from FAHlog.txt:
Machine A: (faster one)

Code: Select all

# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\temp\fa
Executable: FAH6.34-win32-SMP.exe
Arguments: -smp 16 -bigadv 

[14:32:08] - Ask before connecting: No
[14:32:08] - User name: grizli 
[14:32:08] - User ID: xxxxxxxxxxxxxxxxxxx
[14:32:08] - Machine ID: 1
[14:32:08] 
[14:32:08] Loaded queue successfully.
[14:32:08] - Preparing to get new work unit...
[14:32:08] Project: 6900 (Run 13, Clone 10, Gen 34)
[14:32:08] Cleaning up work directory


[14:32:08] + Attempting to send results [September 5 14:32:08 UTC]
[14:32:09] + Attempting to get work packet
[14:32:09] Passkey found
[14:32:09] - Connecting to assignment server
[14:32:09] - Successful: assigned to (171.67.108.22).
[14:32:09] + News From Folding@Home: Welcome to Folding@Home
[14:32:10] Loaded queue successfully.
[14:32:58] + Closed connections
[14:32:58] 
[14:32:58] + Processing work unit
[14:32:58] Core required: FahCore_a5.exe
[14:32:58] Core found.
[14:32:58] Working on queue slot 06 [September 5 14:32:58 UTC]
[14:32:58] + Working ...
[14:32:58] 
[14:32:58] *------------------------------*
[14:32:58] Folding@Home Gromacs SMP Core
[14:32:58] Version 2.27 (Mar 12, 2010)
[14:32:58] 
[14:32:58] Preparing to commence simulation
[14:32:58] - Looking at optimizations...
[14:32:58] - Created dyn
[14:32:58] - Files status OK
[14:33:05] - Expanded 25468947 -> 31941441 (decompressed 125.4 percent)
[14:33:05] Called DecompressByteArray: compressed_data_size=25468947 data_size=31941441, decompressed_data_size=31941441 diff=0
[14:33:05] - Digital signature verified
[14:33:05] 
[14:33:05] Project: 2686 (Run 6, Clone 9, Gen 150)
[14:33:05] 
[14:33:05] Assembly optimizations on if available.
[14:33:05] Entering M.D.
[14:33:12] Mapping NT from 16 to 16 
[14:33:16] Completed 0 out of 250000 steps  (0%)
[15:00:33] Completed 2500 out of 250000 steps  (1%)
Machine B:(slower one)

Code: Select all

# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\temp\fa
Executable: FAH6.34-win32-SMP.exe
Arguments: -smp 16 -bigadv 

[12:25:24] - Ask before connecting: No
[12:25:24] - User name: grizli
[12:25:24] - User ID not found locally
[12:25:24] + Requesting User ID from server
[12:25:25] - Machine ID: 1
[12:25:25] 
[12:25:25] Loaded queue successfully.
[12:25:25] 
[12:25:25] + Processing work unit
[12:25:25] Core required: FahCore_a5.exe
[12:25:25] Core found.
[12:25:25] Working on queue slot 01 [September 20 12:25:25 UTC]
[12:25:25] + Working ...
[12:25:25] 
[12:25:25] *------------------------------*
[12:25:25] Folding@Home Gromacs SMP Core
[12:25:25] Version 2.27 (Mar 12, 2010)
[12:25:25] 
[12:25:25] Preparing to commence simulation
[12:25:25] - Ensuring status. Please wait.
[12:25:35] - Looking at optimizations...
[12:25:35] - Working with standard loops on this execution.
[12:25:35] - Previous termination of core was improper.
[12:25:35] - Files status OK
[12:25:41] - Expanded 24869644 -> 30796292 (decompressed 123.8 percent)
[12:25:41] Called DecompressByteArray: compressed_data_size=24869644 data_size=30796292, decompressed_data_size=30796292 diff=0
[12:25:42] - Digital signature verified
[12:25:42] 
[12:25:42] Project: 6900 (Run 3, Clone 24, Gen 57)
[12:25:42] 
[12:25:42] Entering M.D.
[12:25:48] Using Gromacs checkpoints
[12:25:49] Mapping NT from 16 to 16 
[12:26:00] Resuming from checkpoint
[12:26:01] Verified work/wudata_01.log
[12:26:03] Verified work/wudata_01.trr
[12:26:03] Verified work/wudata_01.xtc
[12:26:03] Verified work/wudata_01.edr
[12:26:04] Completed 192860 out of 250000 steps  (77%)
[13:11:06] Completed 195000 out of 250000 steps  (78%)
One thing I noticed - Machine B could not find UserID??? Why is that? Both have Passkey in client.cfg files.

Machines have been folding for over a month now.

Another thing to consider - I have another 2 boxes that are the same:
2xE5335 with 8Gb of RAM. The same story - 11K PPD vs 8K PPD.

Mod Edit: Added Code Tags - PantherX
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: Two identical servers, different performance

Post by 7im »

[12:25:24] + Requesting User ID from server

The above message should only happen once, when the client is first installed.

Since the client has been running for a month, that indicates a problem. The client needs to be run with a user account that has accesss to write to the Windows Registry.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Magic Michael
Posts: 22
Joined: Tue Dec 04, 2007 8:09 am
Hardware configuration: 1. 2x E5520 @2.27 GHz with 6x 4GB Reg ECC RAM and 1x Geforce GTX 660ti. Windoze -smp 12 and -gpu
2. 2x E5620 @2.53 GHz 16GB Reg ECC RAM. Debian Linux with -smp 16
3. the occasional borg
Location: Berlin / Deutschland

Re: Two identical servers, different performance

Post by Magic Michael »

Have you checked if all cores are used ? Sometimes my Xeons have one core stuck in C1/Halt mode until I restart the folding client. CentOS though, not Windoze.
Image
grizli
Posts: 8
Joined: Tue Sep 20, 2011 11:15 am

Re: Two identical servers, different performance

Post by grizli »

7im wrote:[12:25:24] + Requesting User ID from server

The above message should only happen once, when the client is first installed.

Since the client has been running for a month, that indicates a problem. The client needs to be run with a user account that has accesss to write to the Windows Registry.
I just restarted the client - went through fine this time. It was running under administrator account. Not sure what that was all about, but PPD still hasn't changed :(
grizli
Posts: 8
Joined: Tue Sep 20, 2011 11:15 am

Re: Two identical servers, different performance

Post by grizli »

Magic Michael wrote:Have you checked if all cores are used ? Sometimes my Xeons have one core stuck in C1/Halt mode until I restart the folding client. CentOS though, not Windoze.
I checked via Task manager - all 100%
i7GTX550Ti8Gb
Posts: 11
Joined: Wed Sep 14, 2011 2:11 pm

Re: Two identical servers, different performance

Post by i7GTX550Ti8Gb »

seems 2 different projects to me, and what i noticed is that voth your machines are configured as machine number 1?

Not sure wether it helped, not a server expert :)

me
Post Reply