P18308 (GRO_A8) won't run

Moderators: Site Moderators, FAHC Science Team

JimF
Posts: 652
Joined: Thu Jan 21, 2010 2:03 pm

P18308 (GRO_A8) won't run

Post by JimF »

It runs for a few minutes with low output (initially 3,3800 PPD, rising to 200 kPPD, then falling back to a low figure), and then starts over.
This is on Ubuntu 20.04.4.


17:07:54:WU01:FS00:0xa8:*********************** Log Started 2022-03-16T17:07:54Z ***********************
17:07:54:WU01:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
17:07:54:WU01:FS00:0xa8:       Core: Gromacs
17:07:54:WU01:FS00:0xa8:       Type: 0xa8
17:07:54:WU01:FS00:0xa8:    Version: 0.0.12
17:07:54:WU01:FS00:0xa8:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
17:07:54:WU01:FS00:0xa8:  Copyright: 2020 foldingathome.org
17:07:54:WU01:FS00:0xa8:   Homepage: https://foldingathome.org/
17:07:54:WU01:FS00:0xa8:       Date: Jan 16 2021
17:07:54:WU01:FS00:0xa8:       Time: 19:24:44
17:07:54:WU01:FS00:0xa8:   Compiler: GNU 8.3.0
17:07:54:WU01:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
17:07:54:WU01:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
17:07:54:WU01:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
17:07:54:WU01:FS00:0xa8:       Bits: 64
17:07:54:WU01:FS00:0xa8:       Mode: Release
17:07:54:WU01:FS00:0xa8:       SIMD: avx2_256
17:07:54:WU01:FS00:0xa8:     OpenMP: ON
17:07:54:WU01:FS00:0xa8:       CUDA: OFF
17:07:54:WU01:FS00:0xa8:       Args: -dir 01 -suffix 01 -version 706 -lifeline 340123 -checkpoint 15 -np
17:07:54:WU01:FS00:0xa8:             23
17:07:54:WU01:FS00:0xa8:************************************ libFAH ************************************
17:07:54:WU01:FS00:0xa8:       Date: Jan 16 2021
17:07:54:WU01:FS00:0xa8:       Time: 19:21:38
17:07:54:WU01:FS00:0xa8:   Compiler: GNU 8.3.0
17:07:54:WU01:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
17:07:54:WU01:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
17:07:54:WU01:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
17:07:54:WU01:FS00:0xa8:       Bits: 64
17:07:54:WU01:FS00:0xa8:       Mode: Release
17:07:54:WU01:FS00:0xa8:************************************ CBang *************************************
17:07:54:WU01:FS00:0xa8:       Date: Jan 16 2021
17:07:54:WU01:FS00:0xa8:       Time: 19:21:24
17:07:54:WU01:FS00:0xa8:   Compiler: GNU 8.3.0
17:07:54:WU01:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
17:07:54:WU01:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
17:07:54:WU01:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
17:07:54:WU01:FS00:0xa8:       Bits: 64
17:07:54:WU01:FS00:0xa8:       Mode: Release
17:07:54:WU01:FS00:0xa8:************************************ System ************************************
17:07:54:WU01:FS00:0xa8:        CPU: AMD Ryzen 9 5900X 12-Core Processor
17:07:54:WU01:FS00:0xa8:     CPU ID: AuthenticAMD Family 25 Model 33 Stepping 2
17:07:54:WU01:FS00:0xa8:       CPUs: 24
17:07:54:WU01:FS00:0xa8:     Memory: 15.55GiB
17:07:54:WU01:FS00:0xa8:Free Memory: 11.33GiB
17:07:54:WU01:FS00:0xa8:    Threads: POSIX_THREADS
17:07:54:WU01:FS00:0xa8: OS Version: 5.13
17:07:54:WU01:FS00:0xa8:Has Battery: false
17:07:54:WU01:FS00:0xa8: On Battery: false
17:07:54:WU01:FS00:0xa8: UTC Offset: -4
17:07:54:WU01:FS00:0xa8:        PID: 340127
17:07:54:WU01:FS00:0xa8:        CWD: /var/snap/folding-at-home-fcole90/common/work
17:07:54:WU01:FS00:0xa8:********************************************************************************
17:07:54:WU01:FS00:0xa8:Project: 18308 (Run 195, Clone 1, Gen 65)
17:07:54:WU01:FS00:0xa8:Unit: 0x00000000000000000000000000000000
17:07:54:WU01:FS00:0xa8:Digital signatures verified
17:07:54:WU01:FS00:0xa8:Calling: mdrun -c frame65.gro -s frame65.tpr -x frame65.xtc -cpt 15 -nt 23 -ntmpi 1
17:07:54:WU01:FS00:0xa8:Steps: first=16250000 total=16500000
17:07:55:WU01:FS00:0xa8:Completed 1 out of 250000 steps (0%)
17:08:13:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
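For anyone comparing restarts across different thread counts, here is a rough sketch that pulls the mdrun thread count and the FahCore return status out of log text shaped like the lines above. The regular expressions are only inferred from this one log, and `summarize` is a made-up helper name, not part of the FAH client.

```python
import re

# Patterns inferred from the log lines in this post; other client or core
# versions may format these lines differently (assumption).
THREADS_RE = re.compile(r"Calling: mdrun .*-nt\s+(\d+)")
RETURN_RE = re.compile(r"FahCore returned: (\w+) \((\d+) = (0x[0-9a-fA-F]+)\)")


def summarize(log_text):
    """Return the thread count and core return status found in log_text.

    Missing fields are reported as None. Hypothetical helper for illustration.
    """
    threads = THREADS_RE.search(log_text)
    returned = RETURN_RE.search(log_text)
    return {
        "threads": int(threads.group(1)) if threads else None,
        "status": returned.group(1) if returned else None,
        "code": int(returned.group(2)) if returned else None,
    }


sample = (
    "17:07:54:WU01:FS00:0xa8:Calling: mdrun -c frame65.gro -s frame65.tpr "
    "-x frame65.xtc -cpt 15 -nt 23 -ntmpi 1\n"
    "17:08:13:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)\n"
)
print(summarize(sample))
```

Running it over each restart in the log gives a quick view of whether the status changes when the thread count does.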
Joe_H
Site Admin
Posts: 7856
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: P18308 (GRO_A8) won't run

Post by Joe_H »

Core_A* handles threads a bit differently, so you can often get away with thread counts that are large primes, but 23 may either be too large or be triggering an error with this WU. Try stopping the client and restarting with a lower thread count such as 16, 18 or 20.
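As a quick illustration of why 23 stands out from the suggested alternatives, this small sketch lists the prime factors of each candidate thread count. It is plain arithmetic only: `prime_factors` is an illustrative name, and nothing here reflects how the core actually decides whether a thread count is acceptable.

```python
def prime_factors(n):
    """Return the prime factors of n in ascending order (trial division)."""
    factors = []
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors.append(d)
            n //= d
        d += 1
    if n > 1:
        # Whatever remains after trial division is itself prime.
        factors.append(n)
    return factors


# 23 is the count from the log; 16, 18 and 20 are the suggested alternatives.
for threads in (23, 20, 18, 16):
    print(threads, prime_factors(threads))
```

Of these, 23 is the only count that is itself prime; the others break down into factors of 2, 3 and 5.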

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
JimF
Posts: 652
Joined: Thu Jan 21, 2010 2:03 pm

Re: P18308 (GRO_A8) won't run

Post by JimF »

Thanks. I have tried it on 22 and 20 cores with about the same result.
But this is a new machine, and I am using Precision Boost Overdrive (PBO) with a negative offset to cool the CPU a bit.
Normally, it sets the voltage and clock automatically without problems, but I expect it has introduced a bit of instability here.

I will fix it eventually.

PS - It is OK now. If you are familiar with PBO, I disabled "Curve Optimizer" and just set the Power Limit to 105 watts.
The CPU is now running at a very reasonable 75C at normal clock speed, and I am getting about 400 kPPD on core P18412.
This is with a single-fan 120 mm Arctic Liquid Freezer II cooler.

Previously, it was hitting 91C, which I think caused the problem. Each Ryzen is a little different, and you have to optimize them differently.