Folding unable to start folding on Linux Docker Container

Moderators: Site Moderators, FAHC Science Team

Post Reply
seasonalskier
Posts: 4
Joined: Mon Aug 03, 2020 6:21 pm

Folding unable to start folding on Linux Docker Container

Post by seasonalskier »

Hello,

I have recently tried to install FAH on my home server (openmediavault build 5.3.9) inside docker. I am seeing it attempt to start and the immediately shut itself down with this error code: [93m17:58:17:WARNING:WU01:FS00:FahCore returned: FAILED_3 (255 = 0xff)[0m 17:58:19:Caught signal SIGTERM(15) on PID 267. I am not exactly sure what these error messages are trying to indicate is going wrong. Any help would be appreciated. I have included the log file as well as the script used to create the container.


F@H log:

Code: Select all

*********************** Log Started 2020-09-14T17:52:53Z ***********************
17:52:53:Trying to access database...
17:52:53:Successfully acquired database lock
17:52:53:Downloading GPUs.txt from assign1.foldingathome.org:80
17:52:53:Connecting to assign1.foldingathome.org:80
17:52:54:Read GPUs.txt
17:52:54:Enabled folding slot 00: PAUSED cpu:7 (not configured)
17:52:54:****************************** FAHClient ******************************
17:52:54:    Version: 7.6.13
17:52:54:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
17:52:54:  Copyright: 2020 foldingathome.org
17:52:54:   Homepage: https://foldingathome.org/
17:52:54:       Date: Apr 28 2020
17:52:54:       Time: 04:20:16
17:52:54:   Revision: 5a652817f46116b6e135503af97f18e094414e3b
17:52:54:     Branch: master
17:52:54:   Compiler: GNU 8.3.0
17:52:54:    Options: -std=c++11 -ffunction-sections -fdata-sections -O3 -funroll-loops
17:52:54:             -fno-pie
17:52:54:   Platform: linux2 4.19.0-5-amd64
17:52:54:       Bits: 64
17:52:54:       Mode: Release
17:52:54:       Args: --http-addresses 0.0.0.0:7396 --allow 0/0 --web-allow 0/0
17:52:54:             --command-allow-no-pass 0/0
17:52:54:******************************** CBang ********************************
17:52:54:       Date: Apr 25 2020
17:52:54:       Time: 00:07:53
17:52:54:   Revision: ea081a3b3b0f4a37c4d0440b4f1bc184197c7797
17:52:54:     Branch: master
17:52:54:   Compiler: GNU 8.3.0
17:52:54:    Options: -std=c++11 -ffunction-sections -fdata-sections -O3 -funroll-loops
17:52:54:             -fno-pie -fPIC
17:52:54:   Platform: linux2 4.19.0-5-amd64
17:52:54:       Bits: 64
17:52:54:       Mode: Release
17:52:54:******************************* System ********************************
17:52:54:        CPU: Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz
17:52:54:     CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
17:52:54:       CPUs: 8
17:52:54:     Memory: 7.70GiB
17:52:54:Free Memory: 213.05MiB
17:52:54:    Threads: POSIX_THREADS
17:52:54: OS Version: 5.7
17:52:54:Has Battery: false
17:52:54: On Battery: false
17:52:54: UTC Offset: -4
17:52:54:        PID: 267
17:52:54:        CWD: /config
17:52:54:         OS: Linux 5.7.0-0.bpo.2-amd64 x86_64
17:52:54:    OS Arch: AMD64
17:52:54:       GPUs: 0
17:52:54:       CUDA: Not detected: cuInit() returned 100
17:52:54:     OpenCL: Not detected: clGetPlatformIDs() returned -1001
17:52:54:******************************* libFAH ********************************
17:52:54:       Date: Apr 15 2020
17:52:54:       Time: 21:43:24
17:52:54:   Revision: 216968bc7025029c841ed6e36e81a03a316890d3
17:52:54:     Branch: master
17:52:54:   Compiler: GNU 8.3.0
17:52:54:    Options: -std=c++11 -ffunction-sections -fdata-sections -O3 -funroll-loops
17:52:54:             -fno-pie
17:52:54:   Platform: linux2 4.19.0-5-amd64
17:52:54:       Bits: 64
17:52:54:       Mode: Release
17:52:54:***********************************************************************
17:52:54:<config>
17:52:54:  <!-- Folding Slots -->
17:52:54:  <slot id='0' type='CPU'/>
17:52:54:</config>
17:53:14:16:192.168.1.105:New Web session
17:53:54:Saving configuration to config.xml
17:53:54:<config>
17:53:54:  <!-- Folding Slots -->
17:53:54:  <slot id='0' type='CPU'/>
17:53:54:</config>
17:53:54:Set client configured
17:53:54:WU00:FS00:Connecting to assign1.foldingathome.org:80
17:53:54:WU00:FS00:Connecting to assign1.foldingathome.org:80
17:53:55:WU00:FS00:Assigned to work server 69.94.66.7
17:53:55:WU00:FS00:Requesting new work unit for slot 00: READY cpu:7 from 69.94.66.7
17:53:55:WU00:FS00:Connecting to 69.94.66.7:8080
17:53:55:WU00:FS00:Downloading 2.83MiB
17:54:01:WU00:FS00:Download 99.36%
17:54:01:WU00:FS00:Download complete
17:54:01:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:14379 run:689 clone:1 gen:478 core:0xa7 unit:0x00000221455e42075e933047e36fc018
17:54:01:WU00:FS00:Downloading core from http://cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah
17:54:01:WU00:FS00:Connecting to cores.foldingathome.org:80
17:54:01:WU00:FS00:FahCore a7: Downloading 8.17MiB
17:54:07:WU00:FS00:FahCore a7: 65.79%
17:54:09:WU00:FS00:FahCore a7: Download complete
17:54:09:WU00:FS00:Valid core signature
17:54:09:WU00:FS00:Unpacked 18.52MiB to cores/cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7
17:54:10:WU00:FS00:Starting
17:54:10:WU00:FS00:Running FahCore: /app/usr/bin/FAHCoreWrapper /config/cores/cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 706 -lifeline 267 -checkpoint 15 -np 7
17:54:10:WU00:FS00:Started FahCore on PID 290
17:54:10:WU00:FS00:Core PID:294
17:54:10:WU00:FS00:FahCore 0xa7 started
[93m17:54:10:WARNING:WU00:FS00:FahCore returned: FAILED_3 (255 = 0xff)[0m
17:54:11:WU00:FS00:Starting
17:54:11:WU00:FS00:Running FahCore: /app/usr/bin/FAHCoreWrapper /config/cores/cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 706 -lifeline 267 -checkpoint 15 -np 7
17:54:11:WU00:FS00:Started FahCore on PID 295
17:54:11:WU00:FS00:Core PID:299
17:54:11:WU00:FS00:FahCore 0xa7 started
[93m17:54:11:WARNING:WU00:FS00:FahCore returned: FAILED_3 (255 = 0xff)[0m
17:55:11:WU00:FS00:Starting
17:55:11:WU00:FS00:Running FahCore: /app/usr/bin/FAHCoreWrapper /config/cores/cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 706 -lifeline 267 -checkpoint 15 -np 7
17:55:11:WU00:FS00:Started FahCore on PID 300
17:55:11:WU00:FS00:Core PID:304
17:55:11:WU00:FS00:FahCore 0xa7 started
[93m17:55:11:WARNING:WU00:FS00:FahCore returned: FAILED_3 (255 = 0xff)[0m
17:55:56:Saving configuration to config.xml
17:55:56:<config>
17:55:56:  <!-- Slot Control -->
17:55:56:  <power v='LIGHT'/>
17:55:56:
17:55:56:  <!-- User Information -->
17:55:56:  <passkey v='*****'/>
17:55:56:  <team v='223518'/>
17:55:56:  <user v='Seasonalskier'/>
17:55:56:
17:55:56:  <!-- Folding Slots -->
17:55:56:  <slot id='0' type='CPU'/>
17:55:56:</config>
17:56:11:WU00:FS00:Starting
[93m17:56:11:WARNING:WU00:FS00:Changed SMP threads from 7 to 3 this can cause some work units to fail[0m
17:56:11:WU00:FS00:Running FahCore: /app/usr/bin/FAHCoreWrapper /config/cores/cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 706 -lifeline 267 -checkpoint 15 -np 3
17:56:11:WU00:FS00:Started FahCore on PID 305
17:56:11:WU00:FS00:Core PID:309
17:56:11:WU00:FS00:FahCore 0xa7 started
[93m17:56:11:WARNING:WU00:FS00:FahCore returned: FAILED_3 (255 = 0xff)[0m
17:57:11:WU00:FS00:Starting
17:57:11:WU00:FS00:Running FahCore: /app/usr/bin/FAHCoreWrapper /config/cores/cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 706 -lifeline 267 -checkpoint 15 -np 3
17:57:11:WU00:FS00:Started FahCore on PID 310
17:57:11:WU00:FS00:Core PID:314
17:57:11:WU00:FS00:FahCore 0xa7 started
[93m17:57:11:WARNING:WU00:FS00:FahCore returned: FAILED_3 (255 = 0xff)[0m
[93m17:57:11:WARNING:WU00:FS00:Too many errors, failing[0m
17:57:12:WU00:FS00:Sending unit results: id:00 state:SEND error:FAILED project:14379 run:689 clone:1 gen:478 core:0xa7 unit:0x00000221455e42075e933047e36fc018
17:57:12:WU00:FS00:Connecting to 69.94.66.7:8080
17:57:12:WU01:FS00:Connecting to assign1.foldingathome.org:80
17:57:12:WU01:FS00:Assigned to work server 128.252.203.10
17:57:12:WU01:FS00:Requesting new work unit for slot 00: READY cpu:3 from 128.252.203.10
17:57:12:WU01:FS00:Connecting to 128.252.203.10:8080
17:57:12:WU00:FS00:Server responded WORK_ACK (400)
17:57:12:WU00:FS00:Cleaning up
17:57:13:WU01:FS00:Downloading 2.22MiB
17:57:15:WU01:FS00:Download complete
17:57:15:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:17201 run:0 clone:492 gen:160 core:0xa7 unit:0x000000b680fccb0a5f32ff1dc6ee3c48
17:57:16:WU01:FS00:Starting
17:57:16:WU01:FS00:Running FahCore: /app/usr/bin/FAHCoreWrapper /config/cores/cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7 -dir 01 -suffix 01 -version 706 -lifeline 267 -checkpoint 15 -np 3
17:57:16:WU01:FS00:Started FahCore on PID 315
17:57:16:WU01:FS00:Core PID:319
17:57:16:WU01:FS00:FahCore 0xa7 started
[93m17:57:16:WARNING:WU01:FS00:FahCore returned: FAILED_3 (255 = 0xff)[0m
17:57:17:WU01:FS00:Starting
17:57:17:WU01:FS00:Running FahCore: /app/usr/bin/FAHCoreWrapper /config/cores/cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7 -dir 01 -suffix 01 -version 706 -lifeline 267 -checkpoint 15 -np 3
17:57:17:WU01:FS00:Started FahCore on PID 320
17:57:17:WU01:FS00:Core PID:324
17:57:17:WU01:FS00:FahCore 0xa7 started
[93m17:57:17:WARNING:WU01:FS00:FahCore returned: FAILED_3 (255 = 0xff)[0m
17:57:58:Saving configuration to config.xml
17:57:58:<config>
17:57:58:  <!-- Slot Control -->
17:57:58:  <power v='MEDIUM'/>
17:57:58:
17:57:58:  <!-- User Information -->
17:57:58:  <passkey v='*****'/>
17:57:58:  <team v='223518'/>
17:57:58:  <user v='Seasonalskier'/>
17:57:58:
17:57:58:  <!-- Folding Slots -->
17:57:58:  <slot id='0' type='CPU'/>
17:57:58:</config>
17:58:17:WU01:FS00:Starting
[93m17:58:17:WARNING:WU01:FS00:Changed SMP threads from 3 to 7 this can cause some work units to fail[0m
[93m17:58:17:WARNING:WU01:FS00:AS lowered CPUs from 7 to 3[0m
17:58:17:WU01:FS00:Running FahCore: /app/usr/bin/FAHCoreWrapper /config/cores/cores.foldingathome.org/lin/64bit-avx-256/a7-0.0.19/Core_a7.fah/FahCore_a7 -dir 01 -suffix 01 -version 706 -lifeline 267 -checkpoint 15 -np 3
17:58:17:WU01:FS00:Started FahCore on PID 325
17:58:17:WU01:FS00:Core PID:329
17:58:17:WU01:FS00:FahCore 0xa7 started
[93m17:58:17:WARNING:WU01:FS00:FahCore returned: FAILED_3 (255 = 0xff)[0m
17:58:19:Caught signal SIGTERM(15) on PID 267
17:58:19:Exiting, please wait. . .
17:58:20:Clean exit
Docker Configuration:

Code: Select all

#!/bin/bash
docker run -d --restart="always" --name=foldingathome -e PUID=1000 -e PGID=100 -v /etc/localtime:/etc/localtime:ro -v /srv/dev-disk-by-label-Files/Docker-configs/Folding-At-Home:/config -p 7396:7396 -p 36330:36330 linuxserver/foldingathome
JohnChodera
Pande Group Member
Posts: 470
Joined: Fri Feb 22, 2013 9:59 pm

Re: Folding unable to start folding on Linux Docker Containe

Post by JohnChodera »

Have you checked out the official Docker containers we created with NVIDIA's help?
https://github.com/foldingathome/containers

There are some important lessons we learned in constructing those!

~ John Chodera // MSKCC
seasonalskier
Posts: 4
Joined: Mon Aug 03, 2020 6:21 pm

Re: Folding unable to start folding on Linux Docker Containe

Post by seasonalskier »

This is a headless server, so no GPU installed. Should I be attempting to install a GPU container for a CPU only computer?
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Folding unable to start folding on Linux Docker Containe

Post by PantherX »

Welcome to the F@H Forum seasonalskier,

I am pretty sure that CPU folding is supported but I don't have any personal experience. However, I might be able to ask someone who might have similar experience so let's see what happens.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Jesse_V
Site Moderator
Posts: 2851
Joined: Mon Jul 18, 2011 4:44 am
Hardware configuration: OS: Windows 10, Kubuntu 19.04
CPU: i7-6700k
GPU: GTX 970, GTX 1080 TI
RAM: 24 GB DDR4
Location: Western Washington

Re: Folding unable to start folding on Linux Docker Containe

Post by Jesse_V »

Hi seasonalskier,

Is /srv/dev-disk-by-label-Files/Docker-configs/ persistent storage? If you look in the Docker-configs/Folding-At-Home directory in there, are the files owned by 1000:100, as specified in your Docker setup? I'm wondering if the core simply didn't have the right permissions to execute or the files could not be read.

I'd recommend checking out https://github.com/FoldingAtHome/containers or https://github.com/yurinnick/folding-at-home-docker. I'm not familiar with linuxserver/foldingathome or how its Docker file is set up, but I believe those other instances should be fairly reliable. I tested them out several weeks ago and successfully got the CPU to work, so I believe they should work for you too.
F@h is now the top computing platform on the planet and nothing unites people like a dedicated fight against a common enemy. This virus affects all of us. Lets end it together.
seasonalskier
Posts: 4
Joined: Mon Aug 03, 2020 6:21 pm

Re: Folding unable to start folding on Linux Docker Containe

Post by seasonalskier »

The location is persistent storage and the docker user and group both have R/W access at the locations.

I am not sure what is wrong with the linuxserver repo or how its supposed to be set up, but I was able to remove that repo and install the yurinnick repo and that is folding great now! :) Thank you for the help!
Post Reply