I seem to have received a bad _a8 core update [OS X]


garaden
Posts: 2
Joined: Thu Sep 24, 2020 2:11 pm

FahCore returned: INTERRUPTED loop on core a8-0.0.7

Post by garaden »

My client downloaded a new a8 core (0.0.7) last night and it's constantly outputting FahCore returned: INTERRUPTED (102 = 0x66).

Code:

06:14:19:WU00:FS00:Connecting to assign1.foldingathome.org:80
06:14:19:WU00:FS00:Assigned to work server 178.174.196.138
06:14:19:WU00:FS00:Requesting new work unit for slot 00: READY cpu:3 from 178.174.196.138
06:14:19:WU00:FS00:Connecting to 178.174.196.138:8080
06:14:20:WU00:FS00:Downloading 1.88MiB
06:14:21:WU00:FS00:Download complete
06:14:21:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:16810 run:6 clone:548 gen:26 core:0xa8 unit:0x0000001db2aec48a5eff2cc166d781d9
06:14:21:WU00:FS00:Downloading core from http://cores.foldingathome.org/osx/64bit-avx2-256/a8-0.0.7/Core_a8.fah
06:14:21:WU00:FS00:Connecting to cores.foldingathome.org:80
06:14:21:WU00:FS00:FahCore a8: Downloading 6.22MiB
06:14:22:WU00:FS00:FahCore a8: Download complete
06:14:23:WU00:FS00:Valid core signature
06:14:23:WU00:FS00:Unpacked 15.63MiB to cores/cores.foldingathome.org/osx/64bit-avx2-256/a8-0.0.7/Core_a8.fah/FahCore_a8
06:14:23:WU00:FS00:Starting
06:14:23:WU00:FS00:Running FahCore: /usr/local/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/cores.foldingathome.org/osx/64bit-avx2-256/a8-0.0.7/Core_a8.fah/FahCore_a8" -dir 00 -suffix 01 -version 706 -lifeline 150 -checkpoint 15 -np 3
06:14:23:WU00:FS00:Started FahCore on PID 23323
06:14:23:WU00:FS00:Core PID:23324
06:14:23:WU00:FS00:FahCore 0xa8 started
06:14:24:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
06:14:24:WU00:FS00:Starting
06:14:24:WU00:FS00:Running FahCore: /usr/local/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/cores.foldingathome.org/osx/64bit-avx2-256/a8-0.0.7/Core_a8.fah/FahCore_a8" -dir 00 -suffix 01 -version 706 -lifeline 150 -checkpoint 15 -np 3
06:14:24:WU00:FS00:Started FahCore on PID 23325
06:14:24:WU00:FS00:Core PID:23326
06:14:24:WU00:FS00:FahCore 0xa8 started
06:14:25:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
06:15:24:WU00:FS00:Starting
06:15:24:WU00:FS00:Running FahCore: /usr/local/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/cores.foldingathome.org/osx/64bit-avx2-256/a8-0.0.7/Core_a8.fah/FahCore_a8" -dir 00 -suffix 01 -version 706 -lifeline 150 -checkpoint 15 -np 3
06:15:24:WU00:FS00:Started FahCore on PID 23366
06:15:24:WU00:FS00:Core PID:23367
06:15:24:WU00:FS00:FahCore 0xa8 started
06:15:25:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
06:16:25:WU00:FS00:Starting
06:16:25:WU00:FS00:Running FahCore: /usr/local/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/cores.foldingathome.org/osx/64bit-avx2-256/a8-0.0.7/Core_a8.fah/FahCore_a8" -dir 00 -suffix 01 -version 706 -lifeline 150 -checkpoint 15 -np 3
06:16:25:WU00:FS00:Started FahCore on PID 23402
06:16:25:WU00:FS00:Core PID:23403
06:16:25:WU00:FS00:FahCore 0xa8 started
06:16:25:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)

My client is saying I'm currently on attempt 467 :( It's been like this for about 8 hours.
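
For anyone who wants to quantify a loop like the one above, here is a minimal Python sketch that tallies FahCore exit statuses per core version from a client log. The function name `summarize_core_exits` and both regular expressions are my own assumptions, based only on the line formats quoted in this post; they are not part of FAHClient.

```python
import re
from collections import Counter

# Assumed pattern: captures the core version directory (e.g. "a8-0.0.7")
# from the quoted core path on a "Running FahCore" line.
RUN_RE = re.compile(r'Running FahCore: .*?/(a[0-9a-f]-\d+\.\d+\.\d+)/')
# Assumed pattern: captures the status name on a "FahCore returned" line.
RET_RE = re.compile(r'FahCore returned: (\w+) \((\d+) = 0x[0-9a-fA-F]+\)')

# Four lines copied verbatim from the log above (two failed attempts).
SAMPLE = '''06:14:23:WU00:FS00:Running FahCore: /usr/local/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/cores.foldingathome.org/osx/64bit-avx2-256/a8-0.0.7/Core_a8.fah/FahCore_a8" -dir 00 -suffix 01 -version 706 -lifeline 150 -checkpoint 15 -np 3
06:14:24:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
06:14:24:WU00:FS00:Running FahCore: /usr/local/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/cores.foldingathome.org/osx/64bit-avx2-256/a8-0.0.7/Core_a8.fah/FahCore_a8" -dir 00 -suffix 01 -version 706 -lifeline 150 -checkpoint 15 -np 3
06:14:25:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
'''


def summarize_core_exits(log_text):
    """Count FahCore exit statuses, keyed by (core version, status name)."""
    counts = Counter()
    current_core = None  # version from the most recent "Running FahCore" line
    for line in log_text.splitlines():
        run = RUN_RE.search(line)
        if run:
            current_core = run.group(1)
            continue
        ret = RET_RE.search(line)
        if ret:
            counts[(current_core, ret.group(1))] += 1
    return counts


for (core, status), n in summarize_core_exits(SAMPLE).items():
    print(core, status, n)
```

Feeding it the whole log file instead of `SAMPLE` should make it easy to see whether every failure belongs to a single core version.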

The work unit before that one completed fine, same project, using a8-0.0.6.

Code:

00:07:43:WU01:FS00:Connecting to assign1.foldingathome.org:80
00:07:43:WU01:FS00:Assigned to work server 178.174.196.138
00:07:43:WU01:FS00:Requesting new work unit for slot 00: READY cpu:3 from 178.174.196.138
00:07:43:WU01:FS00:Connecting to 178.174.196.138:8080
00:07:43:WU01:FS00:Downloading 1.88MiB
00:07:46:WU01:FS00:Download complete
00:07:46:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:16810 run:89 clone:956 gen:4 core:0xa8 unit:0x00000004b2aec48a5eff208e7cc8d572
00:07:46:WU01:FS00:Starting
00:07:46:WU01:FS00:Running FahCore: /usr/local/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/cores.foldingathome.org/osx/64bit-avx2-256/a8-0.0.6/Core_a8.fah/FahCore_a8" -dir 01 -suffix 01 -version 706 -lifeline 150 -checkpoint 15 -np 3
00:07:46:WU01:FS00:Started FahCore on PID 9606
00:07:46:WU01:FS00:Core PID:9607
00:07:46:WU01:FS00:FahCore 0xa8 started
00:07:47:WU01:FS00:0xa8:*********************** Log Started 2020-09-24T00:07:46Z ***********************
00:07:47:WU01:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
00:07:47:WU01:FS00:0xa8:       Core: Gromacs
00:07:47:WU01:FS00:0xa8:       Type: 0xa8
00:07:47:WU01:FS00:0xa8:    Version: 0.0.6
00:07:47:WU01:FS00:0xa8:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
00:07:47:WU01:FS00:0xa8:  Copyright: 2020 foldingathome.org
00:07:47:WU01:FS00:0xa8:   Homepage: https://foldingathome.org/
00:07:47:WU01:FS00:0xa8:       Date: Jul 25 2020
00:07:47:WU01:FS00:0xa8:       Time: 17:01:41
00:07:47:WU01:FS00:0xa8:   Revision: bb73a0bb4b90096d0c4f404ba28d26418056a589
00:07:47:WU01:FS00:0xa8:     Branch: master
00:07:47:WU01:FS00:0xa8:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.3 (clang-1103.0.32.62)
00:07:47:WU01:FS00:0xa8:    Options: -std=c++14 -fsigned-char -stdlib=libc++ -O3 -funroll-loops -fno-pie
00:07:47:WU01:FS00:0xa8:             -mmacosx-version-min=10.9
00:07:47:WU01:FS00:0xa8:   Platform: darwin 19.5.0
00:07:47:WU01:FS00:0xa8:       Bits: 64
00:07:47:WU01:FS00:0xa8:       Mode: Release
00:07:47:WU01:FS00:0xa8:       SIMD: avx2_256
00:07:47:WU01:FS00:0xa8:     OpenMP: ON
00:07:47:WU01:FS00:0xa8:       CUDA: OFF
00:07:47:WU01:FS00:0xa8:       Args: -dir 01 -suffix 01 -version 706 -lifeline 9606 -checkpoint 15 -np 3
00:07:47:WU01:FS00:0xa8:************************************ libFAH ************************************
00:07:47:WU01:FS00:0xa8:       Date: Jul 25 2020
00:07:47:WU01:FS00:0xa8:       Time: 16:58:36
00:07:47:WU01:FS00:0xa8:   Revision: bb73a0bb4b90096d0c4f404ba28d26418056a589
00:07:47:WU01:FS00:0xa8:     Branch: master
00:07:47:WU01:FS00:0xa8:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.3 (clang-1103.0.32.62)
00:07:47:WU01:FS00:0xa8:    Options: -std=c++14 -fsigned-char -stdlib=libc++ -O3 -funroll-loops -fno-pie
00:07:47:WU01:FS00:0xa8:             -mmacosx-version-min=10.9
00:07:47:WU01:FS00:0xa8:   Platform: darwin 19.5.0
00:07:47:WU01:FS00:0xa8:       Bits: 64
00:07:47:WU01:FS00:0xa8:       Mode: Release
00:07:47:WU01:FS00:0xa8:************************************ CBang *************************************
00:07:47:WU01:FS00:0xa8:       Date: Jul 25 2020
00:07:47:WU01:FS00:0xa8:       Time: 16:58:19
00:07:47:WU01:FS00:0xa8:   Revision: bb73a0bb4b90096d0c4f404ba28d26418056a589
00:07:47:WU01:FS00:0xa8:     Branch: master
00:07:47:WU01:FS00:0xa8:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.3 (clang-1103.0.32.62)
00:07:47:WU01:FS00:0xa8:    Options: -std=c++14 -fsigned-char -stdlib=libc++ -O3 -funroll-loops -fno-pie
00:07:47:WU01:FS00:0xa8:             -mmacosx-version-min=10.9 -fPIC
00:07:47:WU01:FS00:0xa8:   Platform: darwin 19.5.0
00:07:47:WU01:FS00:0xa8:       Bits: 64
00:07:47:WU01:FS00:0xa8:       Mode: Release
00:07:47:WU01:FS00:0xa8:************************************ System ************************************
00:07:47:WU01:FS00:0xa8:        CPU: Intel(R) Core(TM) i7-7700HQ CPU @ 2.80GHz
00:07:47:WU01:FS00:0xa8:     CPU ID: GenuineIntel Family 6 Model 158 Stepping 9
00:07:47:WU01:FS00:0xa8:       CPUs: 8
00:07:47:WU01:FS00:0xa8:     Memory: 16.00GiB
00:07:47:WU01:FS00:0xa8:Free Memory: 1.97GiB
00:07:47:WU01:FS00:0xa8:    Threads: POSIX_THREADS
00:07:47:WU01:FS00:0xa8: OS Version: 10.15
00:07:47:WU01:FS00:0xa8:Has Battery: true
00:07:47:WU01:FS00:0xa8: On Battery: false
00:07:47:WU01:FS00:0xa8: UTC Offset: -4
00:07:47:WU01:FS00:0xa8:        PID: 9607
00:07:47:WU01:FS00:0xa8:        CWD: /Library/Application Support/FAHClient/work
00:07:47:WU01:FS00:0xa8:********************************************************************************
00:07:47:WU01:FS00:0xa8:Project: 16810 (Run 89, Clone 956, Gen 4)
00:07:47:WU01:FS00:0xa8:Unit: 0x00000004b2aec48a5eff208e7cc8d572
00:07:47:WU01:FS00:0xa8:Reading tar file core.xml
00:07:47:WU01:FS00:0xa8:Reading tar file frame4.tpr
00:07:47:WU01:FS00:0xa8:Digital signatures verified
00:07:47:WU01:FS00:0xa8:Calling: mdrun -c frame4.gro -s frame4.tpr -x frame4.xtc -cpt 15 -nt 3 -ntmpi 1
00:07:47:WU01:FS00:0xa8:Steps: first=2000000 total=2500000
00:07:49:WU01:FS00:0xa8:Completed 1 out of 500000 steps (0%)
00:11:23:WU01:FS00:0xa8:Completed 5000 out of 500000 steps (1%)
...
06:04:48:WU01:FS00:0xa8:Completed 495000 out of 500000 steps (99%)
06:04:49:WU00:FS00:Connecting to assign1.foldingathome.org:80
06:04:49:WARNING:WU00:FS00:Failed to get assignment from 'assign1.foldingathome.org:80': No WUs available for this configuration
06:04:49:WU00:FS00:Connecting to assign2.foldingathome.org:80
06:04:50:WARNING:WU00:FS00:Failed to get assignment from 'assign2.foldingathome.org:80': No WUs available for this configuration
06:04:50:WU00:FS00:Connecting to assign3.foldingathome.org:80
06:04:50:WARNING:WU00:FS00:Failed to get assignment from 'assign3.foldingathome.org:80': No WUs available for this configuration
06:04:50:WU00:FS00:Connecting to assign4.foldingathome.org:80
06:04:50:WARNING:WU00:FS00:Failed to get assignment from 'assign4.foldingathome.org:80': No WUs available for this configuration
06:04:50:ERROR:WU00:FS00:Exception: Could not get an assignment
06:04:50:WU00:FS00:Connecting to assign1.foldingathome.org:80
06:04:51:WARNING:WU00:FS00:Failed to get assignment from 'assign1.foldingathome.org:80': No WUs available for this configuration
06:04:51:WU00:FS00:Connecting to assign2.foldingathome.org:80
06:04:51:WU00:FS00:Assigned to work server 143.89.243.111
06:04:51:WU00:FS00:Requesting new work unit for slot 00: RUNNING cpu:3 from 143.89.243.111
06:04:51:WU00:FS00:Connecting to 143.89.243.111:8080
06:04:52:ERROR:WU00:FS00:Exception: Server did not assign work unit
06:05:50:WU00:FS00:Connecting to assign1.foldingathome.org:80
06:05:51:WARNING:WU00:FS00:Failed to get assignment from 'assign1.foldingathome.org:80': No WUs available for this configuration
06:05:51:WU00:FS00:Connecting to assign2.foldingathome.org:80
06:05:51:WU00:FS00:Assigned to work server 143.89.243.111
06:05:51:WU00:FS00:Requesting new work unit for slot 00: RUNNING cpu:3 from 143.89.243.111
06:05:51:WU00:FS00:Connecting to 143.89.243.111:8080
06:05:52:ERROR:WU00:FS00:Exception: Server did not assign work unit
06:07:28:WU00:FS00:Connecting to assign1.foldingathome.org:80
06:07:28:WARNING:WU00:FS00:Failed to get assignment from 'assign1.foldingathome.org:80': No WUs available for this configuration
06:07:28:WU00:FS00:Connecting to assign2.foldingathome.org:80
06:07:28:WARNING:WU00:FS00:Failed to get assignment from 'assign2.foldingathome.org:80': No WUs available for this configuration
06:07:28:WU00:FS00:Connecting to assign3.foldingathome.org:80
06:07:29:WARNING:WU00:FS00:Failed to get assignment from 'assign3.foldingathome.org:80': No WUs available for this configuration
06:07:29:WU00:FS00:Connecting to assign4.foldingathome.org:80
06:07:29:WARNING:WU00:FS00:Failed to get assignment from 'assign4.foldingathome.org:80': No WUs available for this configuration
06:07:29:ERROR:WU00:FS00:Exception: Could not get an assignment
06:09:08:WU01:FS00:0xa8:Completed 500000 out of 500000 steps (100%)
06:09:11:WU01:FS00:0xa8:Saving result file ../logfile_01.txt
06:09:11:WU01:FS00:0xa8:Saving result file ener.edr
06:09:11:WU01:FS00:0xa8:Saving result file frame4.gro
06:09:11:WU01:FS00:0xa8:Saving result file frame4.xtc
06:09:11:WU01:FS00:0xa8:Saving result file md.log
06:09:11:WU01:FS00:0xa8:Saving result file science.log
06:09:11:WU01:FS00:0xa8:Saving result file state.cpt
06:09:11:WU01:FS00:0xa8:Folding@home Core Shutdown: FINISHED_UNIT
06:09:11:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
06:09:11:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:16810 run:89 clone:956 gen:4 core:0xa8 unit:0x00000004b2aec48a5eff208e7cc8d572
06:09:11:WU01:FS00:Uploading 8.01MiB to 178.174.196.138
06:09:11:WU01:FS00:Connecting to 178.174.196.138:8080
06:09:17:WU01:FS00:Upload 32.79%
06:09:23:WU01:FS00:Upload 73.39%
06:09:27:WU01:FS00:Upload complete
06:09:27:WU01:FS00:Server responded WORK_ACK (400)
06:09:27:WU01:FS00:Final credit estimate, 9019.00 points
06:09:27:WU01:FS00:Cleaning up

Worth noting that it took about 5 minutes to find a new work unit, as you can see from the timestamps.
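
To put numbers on that gap, here is a small Python sketch that measures the time between two of the `HH:MM:SS` timestamps in the logs. The helper name `seconds_between` is hypothetical, and which pair of timestamps best represents "finding a new work unit" is my assumption; I show both the span from upload complete to the next unit being received and the span from the first assignment attempt.

```python
from datetime import datetime, timedelta


def seconds_between(start, end):
    """Return seconds from `start` to `end`, both "HH:MM:SS" log timestamps.

    The client log prints time of day only, so an `end` earlier than `start`
    is assumed to mean the clock rolled past midnight exactly once.
    """
    fmt = "%H:%M:%S"
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    if delta < timedelta(0):
        delta += timedelta(days=1)
    return int(delta.total_seconds())


# Timestamps taken from the logs in this post.
# Upload complete (06:09:27) -> next unit received (06:14:21)
print(seconds_between("06:09:27", "06:14:21"))
# First assignment attempt (06:04:49) -> next unit received (06:14:21)
print(seconds_between("06:04:49", "06:14:21"))
```

The first span is just under five minutes, which matches the estimate above. The second is longer because the client started asking for work while the previous unit was still at 99%.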

My startup log:

Code:

21:19:22:Trying to access database...
21:19:24:Successfully acquired database lock
21:19:25:Downloading GPUs.txt from assign1.foldingathome.org:80
21:19:25:Connecting to assign1.foldingathome.org:80
21:19:25:Read GPUs.txt
21:19:26:Enabled folding slot 00: PAUSED cpu:3 (by user)
21:19:26:****************************** FAHClient ******************************
21:19:26:    Version: 7.6.13
21:19:26:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
21:19:26:  Copyright: 2020 foldingathome.org
21:19:26:   Homepage: https://foldingathome.org/
21:19:26:       Date: Apr 27 2020
21:19:26:       Time: 21:20:45
21:19:26:   Revision: 5a652817f46116b6e135503af97f18e094414e3b
21:19:26:     Branch: master
21:19:26:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.0 (clang-1100.0.33.8)
21:19:26:    Options: -std=c++11 -O3 -funroll-loops -mmacosx-version-min=10.7
21:19:26:             -Wno-unused-local-typedefs -stdlib=libc++
21:19:26:   Platform: darwin 19.2.0
21:19:26:       Bits: 64
21:19:26:       Mode: Release
21:19:26:     Config: /Library/Application Support/FAHClient/config.xml
21:19:26:******************************** CBang ********************************
21:19:26:       Date: Apr 24 2020
21:19:26:       Time: 17:07:50
21:19:26:   Revision: ea081a3b3b0f4a37c4d0440b4f1bc184197c7797
21:19:26:     Branch: master
21:19:26:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.0 (clang-1100.0.33.8)
21:19:26:    Options: -std=c++11 -O3 -funroll-loops -mmacosx-version-min=10.7
21:19:26:             -Wno-unused-local-typedefs -stdlib=libc++ -fPIC
21:19:26:   Platform: darwin 19.2.0
21:19:26:       Bits: 64
21:19:26:       Mode: Release
21:19:26:******************************* System ********************************
21:19:26:        CPU: Intel(R) Core(TM) i7-7700HQ CPU @ 2.80GHz
21:19:26:     CPU ID: GenuineIntel Family 6 Model 158 Stepping 9
21:19:26:       CPUs: 8
21:19:26:     Memory: 16.00GiB
21:19:26:Free Memory: 14.17GiB
21:19:26:    Threads: POSIX_THREADS
21:19:26: OS Version: 10.15
21:19:26:Has Battery: true
21:19:26: On Battery: true
21:19:26: UTC Offset: -4
21:19:26:        PID: 150
21:19:26:        CWD: /Library/Application Support/FAHClient
21:19:26:         OS: Darwin 19.6.0 x86_64
21:19:26:    OS Arch: AMD64
21:19:26:       GPUs: 1
21:19:26:      GPU 0: Bus:1 Slot:0 Func:0 AMD:5 Baffin XT [Radeon RX 460]
21:19:26:       CUDA: Not detected: Failed to open dynamic library 'libcuda.dylib':
21:19:26:             dlopen(libcuda.dylib, 1): image not found
21:19:26:     OpenCL: Not detected: Failed to open dynamic library 'libOpenCL.dylib':
21:19:26:             dlopen(libOpenCL.dylib, 1): image not found
21:19:26:******************************* libFAH ********************************
21:19:26:       Date: Apr 15 2020
21:19:26:       Time: 14:43:28
21:19:26:   Revision: 216968bc7025029c841ed6e36e81a03a316890d3
21:19:26:     Branch: master
21:19:26:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.0 (clang-1100.0.33.8)
21:19:26:    Options: -std=c++11 -O3 -funroll-loops -mmacosx-version-min=10.7
21:19:26:             -Wno-unused-local-typedefs -stdlib=libc++
21:19:26:   Platform: darwin 19.2.0
21:19:26:       Bits: 64
21:19:26:       Mode: Release
21:19:26:***********************************************************************
21:19:26:<config>
21:19:26:  <!-- Folding Slot Configuration -->
21:19:26:  <cause v='COVID_19'/>
21:19:26:
21:19:26:  <!-- Network -->
21:19:26:  <proxy v=':8080'/>
21:19:26:
21:19:26:  <!-- Slot Control -->
21:19:26:  <pause-on-start v='true'/>
21:19:26:  <power v='LIGHT'/>
21:19:26:
21:19:26:  <!-- User Information -->
21:19:26:  <passkey v='*****'/>
21:19:26:  <team v='3213'/>
21:19:26:  <user v='garaden'/>
21:19:26:
21:19:26:  <!-- Folding Slots -->
21:19:26:  <slot id='0' type='CPU'/>
21:19:26:</config>

Other context:

Computer Specs: Quad-Core Intel Core i7 2.8 GHz. 16 GB RAM. Hyper-Threading enabled.
Network Connection: Starry Internet. Basically cable. 200 Mbits up and down.
Operating System: macOS Catalina version 10.15.6.
Overclocked?: None
Stable?: Haven't run any stability testing software, but this is my first time having a problem. This is my main work & personal computer.
Software: Client version 7.6.13. Core version a8-0.0.7, AVX2-256.
WU details: Gromacs avx2_256. PRCG 16810 (6, 548, 26)

Is there some way for me to roll back to a8-0.0.6? I could delete this WU and get a new one, but I'd prefer to complete this one if possible.
Last edited by Joe_H on Thu Sep 24, 2020 2:47 pm, edited 1 time in total.
Reason: merged with existing topic

MacBook Pro Quad-Core i7 2.8 GHz AVX2-256, 16 GB RAM, 3 threads
garaden
Posts: 2
Joined: Thu Sep 24, 2020 2:11 pm

Re: I seem to have received a bad _a8 core update

Post by garaden »

Same for me (INTERRUPTED loop, a8-0.0.7, project 16810, OS X). Sorry I'm only finding this topic now; I posted a new one in the "CPU Projects - released FAHCores _a4 & _a7" board because there was another a8 topic in there. I'll try deleting the core, and pausing if that doesn't work.

MacBook Pro Quad-Core i7 2.8 GHz AVX2-256, 16 GB RAM, 3 threads
Joe_H
Site Admin
Posts: 7856
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: I seem to have received a bad _a8 core update

Post by Joe_H »

Neil-B wrote: Hmm ... both cases osx ... maybe a bit early to call it a pattern but? ... Joe_H have you had any of this project run through your kit recently?
Just checking after waking up this morning: one of my Macs did pick up a 16810 WU overnight and updated the folding core. It fails at startup, not even getting as far as the point where the version is identified. The client is set for Beta, but had fallen back to a non-Beta project on assignment.

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
aetch
Posts: 447
Joined: Thu Jun 25, 2020 3:04 pm
Location: Between chair and keyboard

Re: I seem to have received a bad _a8 core update

Post by aetch »

Just as a counterpoint, I'm running a Windows 10 machine with a Ryzen 3900x.
My system downloaded a8 0.0.7 about 12 hours ago.
It has since run three a7 projects and is currently on its 7th a8 project.
Although there were periods where there was a shortage of WUs I couldn't find any errors in the logs.
I am not in the Beta program.
Folding Rigs - None (25-Jun-2022)

ShootThePicture
Posts: 10
Joined: Thu Sep 24, 2020 4:39 am

Re: I seem to have received a bad _a8 core update

Post by ShootThePicture »

Okay. I unchecked follow and refreshed the log file. I have included the beginning below. I'm still not sure if this is what you are looking for. By the way, I woke up this morning to find my two other computers have also hit the same issue. They are both Macs too. One is an iMac and the other is a MacBook Pro. If this isn't what you are looking for, I can simply restart the computer. It's worth trying out in my book anyways.

Code:

*********************** Log Started 2020-09-12T22:40:32Z ***********************
22:40:32:Trying to access database...
22:40:40:Successfully acquired database lock
22:40:42:ERROR:Exception: Could not get IP address for assign1.foldingathome.org: nodename nor servname provided, or not known
22:40:42:ERROR:Exception: Could not get IP address for assign2.foldingathome.org: nodename nor servname provided, or not known
22:40:42:ERROR:Exception: Could not get IP address for assign3.foldingathome.org: nodename nor servname provided, or not known
22:40:42:ERROR:Exception: Could not get IP address for assign4.foldingathome.org: nodename nor servname provided, or not known
22:40:42:ERROR:Exception: Failed to find any IP addresses for assignment servers
22:40:42:Enabled folding slot 00: READY cpu:7
22:40:47:****************************** FAHClient ******************************
22:40:47:    Version: 7.6.13
22:40:47:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
22:40:47:  Copyright: 2020 foldingathome.org
22:40:47:   Homepage: https://foldingathome.org/
22:40:47:       Date: Apr 27 2020
22:40:47:       Time: 21:20:45
22:40:47:   Revision: 5a652817f46116b6e135503af97f18e094414e3b
22:40:47:     Branch: master
22:40:47:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.0 (clang-1100.0.33.8)
22:40:47:    Options: -std=c++11 -O3 -funroll-loops -mmacosx-version-min=10.7
22:40:47:             -Wno-unused-local-typedefs -stdlib=libc++
22:40:47:   Platform: darwin 19.2.0
22:40:47:       Bits: 64
22:40:47:       Mode: Release
22:40:47:     Config: /Library/Application Support/FAHClient/config.xml
22:40:47:******************************** CBang ********************************
22:40:47:       Date: Apr 24 2020
22:40:47:       Time: 17:07:50
22:40:47:   Revision: ea081a3b3b0f4a37c4d0440b4f1bc184197c7797
22:40:47:     Branch: master
22:40:47:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.0 (clang-1100.0.33.8)
22:40:47:    Options: -std=c++11 -O3 -funroll-loops -mmacosx-version-min=10.7
22:40:47:             -Wno-unused-local-typedefs -stdlib=libc++ -fPIC
22:40:47:   Platform: darwin 19.2.0
22:40:47:       Bits: 64
22:40:47:       Mode: Release
22:40:47:******************************* System ********************************
22:40:47:        CPU: Intel(R) Core(TM) i7-4790K CPU @ 4.00GHz
22:40:47:     CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
22:40:47:       CPUs: 8
22:40:47:     Memory: 32.00GiB
22:40:47:Free Memory: 28.89GiB
22:40:47:    Threads: POSIX_THREADS
22:40:47: OS Version: 10.14
22:40:47:Has Battery: false
22:40:47: On Battery: false
22:40:47: UTC Offset: -7
22:40:47:        PID: 48
22:40:47:        CWD: /Library/Application Support/FAHClient
22:40:47:         OS: Darwin 18.7.0 x86_64
22:40:47:    OS Arch: AMD64
22:40:47:       GPUs: 0
22:40:47:       CUDA: Not detected: Failed to open dynamic library 'libcuda.dylib':
22:40:47:             dlopen(libcuda.dylib, 1): image not found
22:40:47:     OpenCL: Not detected: Failed to open dynamic library 'libOpenCL.dylib':
22:40:47:             dlopen(libOpenCL.dylib, 1): image not found
22:40:47:******************************* libFAH ********************************
22:40:47:       Date: Apr 15 2020
22:40:47:       Time: 14:43:28
22:40:47:   Revision: 216968bc7025029c841ed6e36e81a03a316890d3
22:40:47:     Branch: master
22:40:47:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.0 (clang-1100.0.33.8)
22:40:47:    Options: -std=c++11 -O3 -funroll-loops -mmacosx-version-min=10.7
22:40:47:             -Wno-unused-local-typedefs -stdlib=libc++
22:40:47:   Platform: darwin 19.2.0
22:40:47:       Bits: 64
22:40:47:       Mode: Release
22:40:47:***********************************************************************
22:40:47:<config>
22:40:47:  <!-- Network -->
22:40:47:  <proxy v=':8080'/>
22:40:47:
22:40:47:  <!-- User Information -->
22:40:47:  <user v='JumpRaven'/>
22:40:47:
22:40:47:  <!-- Folding Slots -->
22:40:47:  <slot id='0' type='CPU'/>
22:40:47:</config>
22:40:48:WU01:FS00:Starting
22:40:53:WU01:FS00:Running FahCore: /usr/local/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/cores.foldingathome.org/osx/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7" -dir 01 -suffix 01 -version 706 -lifeline 48 -checkpoint 15 -np 7
22:40:53:WU01:FS00:Started FahCore on PID 273
22:40:57:WU01:FS00:Core PID:274
22:40:57:WU01:FS00:FahCore 0xa7 started
22:40:59:WU01:FS00:0xa7:*********************** Log Started 2020-09-12T22:40:58Z ***********************
22:40:59:WU01:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
22:40:59:WU01:FS00:0xa7:       Type: 0xa7
22:40:59:WU01:FS00:0xa7:       Core: Gromacs
22:40:59:WU01:FS00:0xa7:       Args: -dir 01 -suffix 01 -version 706 -lifeline 273 -checkpoint 15 -np 7
22:40:59:WU01:FS00:0xa7:************************************ CBang *************************************
22:40:59:WU01:FS00:0xa7:       Date: Nov 27 2019
22:40:59:WU01:FS00:0xa7:       Time: 03:27:01
22:40:59:WU01:FS00:0xa7:   Revision: d25803215b59272441049dfa05a0a9bf7a6e3c48
22:40:59:WU01:FS00:0xa7:     Branch: master
22:40:59:WU01:FS00:0xa7:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.0 (clang-1100.0.33.8)
22:40:59:WU01:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -mmacosx-version-min=10.7
22:40:59:WU01:FS00:0xa7:             -Wno-unused-local-typedefs -stdlib=libc++ -fPIC
22:40:59:WU01:FS00:0xa7:   Platform: darwin 19.0.0
22:40:59:WU01:FS00:0xa7:       Bits: 64
22:40:59:WU01:FS00:0xa7:       Mode: Release
22:40:59:WU01:FS00:0xa7:************************************ System ************************************
22:40:59:WU01:FS00:0xa7:        CPU: Intel(R) Core(TM) i7-4790K CPU @ 4.00GHz
22:40:59:WU01:FS00:0xa7:     CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
22:40:59:WU01:FS00:0xa7:       CPUs: 8
22:40:59:WU01:FS00:0xa7:     Memory: 32.00GiB
22:40:59:WU01:FS00:0xa7:Free Memory: 28.66GiB
22:40:59:WU01:FS00:0xa7:    Threads: POSIX_THREADS
22:40:59:WU01:FS00:0xa7: OS Version: 10.14
22:40:59:WU01:FS00:0xa7:Has Battery: false
22:40:59:WU01:FS00:0xa7: On Battery: false
22:40:59:WU01:FS00:0xa7: UTC Offset: -7
22:40:59:WU01:FS00:0xa7:        PID: 274
22:40:59:WU01:FS00:0xa7:        CWD: /Library/Application Support/FAHClient/work
22:40:59:WU01:FS00:0xa7:******************************** Build - libFAH ********************************
22:40:59:WU01:FS00:0xa7:    Version: 0.0.19
22:40:59:WU01:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
22:40:59:WU01:FS00:0xa7:  Copyright: 2019 foldingathome.org
22:40:59:WU01:FS00:0xa7:   Homepage: https://foldingathome.org/
22:40:59:WU01:FS00:0xa7:       Date: Nov 25 2019
22:40:59:WU01:FS00:0xa7:       Time: 16:41:59
22:40:59:WU01:FS00:0xa7:   Revision: d5b5c747532224f986b7cd02c968ed9a20c16d6e
22:40:59:WU01:FS00:0xa7:     Branch: master
22:40:59:WU01:FS00:0xa7:   Compiler: GNU 4.2.1 Compatible Apple LLVM 11.0.0 (clang-1100.0.33.8)
22:40:59:WU01:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -mmacosx-version-min=10.7
22:40:59:WU01:FS00:0xa7:             -Wno-unused-local-typedefs -stdlib=libc++
22:40:59:WU01:FS00:0xa7:   Platform: darwin 19.0.0
22:40:59:WU01:FS00:0xa7:       Bits: 64
22:40:59:WU01:FS00:0xa7:       Mode: Release
22:40:59:WU01:FS00:0xa7:************************************ Build *************************************
22:40:59:WU01:FS00:0xa7:       SIMD: avx_256
22:40:59:WU01:FS00:0xa7:********************************************************************************
22:40:59:WU01:FS00:0xa7:Project: 16423 (Run 874, Clone 3, Gen 105)
22:40:59:WU01:FS00:0xa7:Unit: 0x0000007796880e6e5e8a8efb8f021f74
22:40:59:WU01:FS00:0xa7
Joe_H
Site Admin
Posts: 7856
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: I seem to have received a bad _a8 core update

Post by Joe_H »

The problem with the OS X version of the 0.0.7 core has been identified, and they are working on a fix.

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
ShootThePicture
Posts: 10
Joined: Thu Sep 24, 2020 4:39 am

Re: I seem to have received a bad _a8 core update

Post by ShootThePicture »

Good. Do you happen to know if I should leave the clients paused or if keeping them running will allow the fix to get pushed out to them?
Joe_H
Site Admin
Posts: 7856
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: I seem to have received a bad _a8 core update

Post by Joe_H »

Leave the client paused for now. I'm not sure exactly how the fix will be set up yet; I will post when that information is available.

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
belloq
Posts: 40
Joined: Thu Sep 24, 2020 12:58 pm

Re: I seem to have received a bad _a8 core update [OS X]

Post by belloq »

I've been able to get my systems up and running with WUs for other projects for now.
SilvioMartin
Posts: 30
Joined: Thu Sep 24, 2020 6:06 pm
Hardware configuration: iMac 2017 Intel Quad-Core i5 3,4 GHz, 8 GB RAM, Radeon Pro 560 4 GB, typically with the latest macOS update. 5 Raspberry Pi 4B (2 GB).
Location: Oberhausen, Germany

Re: I seem to have received a bad _a8 core update [OS X]

Post by SilvioMartin »

@belloq: I was just about to report the same issue. Here is how I got my client going:

1. Pause all slots
2. Close FAHControl
3. Delete the contents of /Library/Application Support/FAHClient/work (you need admin rights for that)
4. Reboot your Mac
5. Launch FAHControl
6. Fold

This is not a permanent cure. Your FAH client will get stuck again the next time a work unit of project 16810 is assigned to your computer. I tried to reduce the chance by setting the cause preference to something other than COVID-19 (Configure button at top left, then the Advanced tab). I'm not sure this has a big effect: after setting it to Cancer this morning, my FAHClient got stuck twice more.
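
If you would rather script step 3 than do it by hand, here is a minimal Python sketch. The helper name `clear_directory_contents` and its `dry_run` switch are my own invention, not anything shipped with FAHClient; the only thing taken from the steps is the work directory path. It assumes the slots are already paused and FAHControl is closed (steps 1 and 2), and it defaults to a dry run that deletes nothing.

```python
import shutil
from pathlib import Path

# Path quoted in step 3; adjust it if your install differs.
WORK_DIR = Path("/Library/Application Support/FAHClient/work")


def clear_directory_contents(directory, dry_run=True):
    """Delete everything inside `directory` but keep the directory itself.

    With dry_run=True (the default) nothing is deleted; the function only
    returns the sorted list of entries it would remove.
    """
    directory = Path(directory)
    entries = sorted(directory.iterdir())
    if not dry_run:
        for entry in entries:
            if entry.is_dir() and not entry.is_symlink():
                shutil.rmtree(entry)  # real subdirectory: remove recursively
            else:
                entry.unlink()  # regular file or symlink
    return entries
```

Calling `clear_directory_contents(WORK_DIR)` only lists what would go. Passing `dry_run=False` actually deletes, and since the directory needs admin rights the script would have to be run with `sudo`. Deleting the work directory discards any in-progress work units, so only do this for a unit that is already stuck.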
My Raspberry Pi folding rack: http://www.anne-emscher.net/fah/
psaam0001
Posts: 383
Joined: Mon May 18, 2020 2:02 am
Location: Ruckersville, Virginia, USA

Re: I seem to have received a bad _a8 core update [OS X]

Post by psaam0001 »

I am not on the beta team, but I am spot-checking my Fedora and Windows systems to see whether I accidentally got the a8 update that you are all talking about (so I can monitor those systems and report issues via PM), since my systems are getting a number of 16810 WUs at the moment.

Edit: Windows 7 and Fedora 32 systems are running the last public a8 core release. Will check my Windows 10 system ASAP. Though I don't expect to see it there either.

My apologies for being concerned without cause.

Paul
Last edited by psaam0001 on Thu Sep 24, 2020 7:39 pm, edited 1 time in total.
SilvioMartin
Posts: 30
Joined: Thu Sep 24, 2020 6:06 pm
Hardware configuration: iMac 2017 Intel Quad-Core i5 3,4 GHz, 8 GB RAM, Radeon Pro 560 4 GB, typically with the latest macOS update. 5 Raspberry Pi 4B (2 GB).
Location: Oberhausen, Germany

Re: I seem to have received a bad _a8 core update [OS X]

Post by SilvioMartin »

I just noticed that this thread is in the beta FAHCores board and is about the _a8 core. I have the same issue with project 16810 work units using the _a7 core on macOS 10.15.6, and so far only with work units of this particular project. So it is not specific to the _a8 core.
My Raspberry Pi folding rack: http://www.anne-emscher.net/fah/
Joe_H
Site Admin
Posts: 7856
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: I seem to have received a bad _a8 core update [OS X]

Post by Joe_H »

Assignments of WUs from Project 16810 have been temporarily stopped, so clearing out the WU and restarting should get a WU from a different project.

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
ShootThePicture
Posts: 10
Joined: Thu Sep 24, 2020 4:39 am

Re: I seem to have received a bad _a8 core update [OS X]

Post by ShootThePicture »

Okay. Then I'm going to unpause my machines. It'll drop the WUs after a day or so based on the Timeout and Expiration dates.
Neil-B
Posts: 2027
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: I seem to have received a bad _a8 core update [OS X]

Post by Neil-B »

I'll let Joe_H explain which method is best for clearing this more quickly - I could advise for Windows but I may misstate things for Mac ... but you shouldn't need to wait until expiration to get folding again
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070

(Green/Bold = Active)