Page 1 of 7

155.247.166.220 downloads stalled

Posted: Fri May 01, 2020 4:03 pm
by CKWarner
For about 24 hours now this server has been failing to give WUs. It says that it's going to, and sometimes (but not that often) the download will start, but it stalls out. The WUs are small - 2.8 MB - and they just don't download. All the time that the client thinks it's going to be getting a WU from this server - any moment now, for sure - it doesn't try to get WUs from any other server, so the work just gets frozen.

Re: 155.247.166.220 downloads stalled

Posted: Fri May 01, 2020 5:50 pm
by HaloJones
likewise. I've blocked it on my firewall but the clients once they've been directed to that server seem unable to go anywhere else.

Re: 155.247.166.220 downloads stalled

Posted: Fri May 01, 2020 7:23 pm
by CKWarner
I managed to unstick the client by deleting the GPU slot and restarting, and it got WUs from elsewhere after I added the GPU slot back. Hopefully whatever's happening server side can get fixed.

Re: 155.247.166.220 downloads stalled

Posted: Fri May 01, 2020 9:26 pm
by info2x
Add me to the list

Code: Select all

21:12:53:WU00:FS01:Connecting to 65.254.110.245:8080
21:12:57:WU00:FS01:Assigned to work server 155.247.166.220
21:12:57:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GP104 [GeForce GTX 1070 Ti] 8186 from 155.247.166.220
21:12:57:WU00:FS01:Connecting to 155.247.166.220:8080
21:12:58:WU00:FS01:Downloading 11.56MiB
21:13:06:WU00:FS01:Download 2.16%
21:13:14:WU00:FS01:Download 3.24%
21:13:23:WU00:FS01:Download 4.33%
21:14:02:WU00:FS01:Download 4.87%
21:14:14:WU00:FS01:Download 5.95%
21:15:17:WU00:FS01:Download 6.49%
21:15:23:WU00:FS01:Download 7.44%
21:15:23:ERROR:WU00:FS01:Exception: Transfer failed
21:15:23:WU00:FS01:Connecting to 65.254.110.245:8080
21:15:24:WU00:FS01:Assigned to work server 155.247.166.220
21:15:24:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GP104 [GeForce GTX 1070 Ti] 8186 from 155.247.166.220
21:15:24:WU00:FS01:Connecting to 155.247.166.220:8080
21:15:24:WU00:FS01:Downloading 11.56MiB
21:15:33:WU00:FS01:Download 2.70%
21:15:40:WU00:FS01:Download 4.32%
21:15:47:WU00:FS01:Download 5.41%
21:15:53:WU00:FS01:Download 7.03%
21:16:07:WU00:FS01:Download 8.11%
21:16:17:WU00:FS01:Download 8.65%
21:16:25:WU00:FS01:Download 9.73%
21:16:32:WU00:FS01:Download 11.35%
21:16:40:WU00:FS01:Download 14.59%
21:16:48:WU00:FS01:Download 16.76%
Clock says 21:26 at this point.

Re: 155.247.166.220 downloads stalled

Posted: Fri May 01, 2020 10:48 pm
by BillAnderson
Me too:
13:48:07:WU01:FS01:Cleaning up
13:50:36:WU00:FS01:Connecting to 65.254.110.245:8080
13:50:36:WU00:FS01:Assigned to work server 155.247.166.220
13:50:36:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM206 [GeForce GTX 950] 1572 from 155.247.166.220
13:50:36:WU00:FS01:Connecting to 155.247.166.220:8080
13:50:36:WU00:FS01:Downloading 2.83MiB
13:50:46:WU00:FS01:Download 11.02%
13:50:56:WU00:FS01:Download 15.43%
13:51:11:WU00:FS01:Download 24.25%
13:51:20:WU00:FS01:Download 26.46%
******************************* Date: 2020-05-01 *******************************
22:41:37:FS01:Paused
22:41:45:FS01:Unpaused

Image

Re: 155.247.166.220 downloads stalled

Posted: Fri May 01, 2020 11:07 pm
by BillAnderson
CKWarner wrote:I managed to unstick the client by deleting the GPU slot and restarting, and it got WUs from elsewhere after I added the GPU slot back. Hopefully whatever's happening server side can get fixed.
So silly me tried the same thing. In STATUS > WORKCUE > I deleted the GPU slot and then tried to add another GPU slot. It didn't seem to fix anything. I made one other change (which I immediately forgot what it was and now:
on STATUS > FOLDING SLOTS the list is empty
When I start FAHControl I get a popup that says

"On Client "local" 127.0.0.1:36330: Option gpu-index has no default value and is not set.
iter should be a GtkTreeIter"

(that is not a typo that is what the screenpopup says)

on FAHControl the STATUS, SYSTEM INFO and LOG tabs are all greyed out in fact the whole right side of the page is greyed out.

Suggestions about what I can do now?

Image

Re: 155.247.166.220 downloads stalled

Posted: Fri May 01, 2020 11:21 pm
by CKWarner
BillAnderson wrote:Suggestions about what I can do now?
I think that what you actually did was delete the local control connection. If that's the case, you should be able to add back in the localhost one with the +Add button near the bottom of that pane.

You can add/remove slots from the Slots tab of the Configure window.

Re: 155.247.166.220 downloads stalled

Posted: Sat May 02, 2020 12:26 am
by info2x
I'm just going to use this time to upgrade my client since they put out a new one. May as well right?

Re: 155.247.166.220 downloads stalled

Posted: Sat May 02, 2020 12:46 am
by PantherX
You can also clean your system, perform OS updates if you feel like it. Once all the maintenance work is done, you can hopefully start folding for a long time :)

Re: 155.247.166.220 downloads stalled

Posted: Sat May 02, 2020 1:15 am
by BillAnderson
info2x wrote:I'm just going to use this time to upgrade my client since they put out a new one. May as well right?
I ran the upgrade to 7.6.9 but it would not set up my GPU. I was told it is a known issue so I uninstalled 7.6.9 and went back to 7.5.1
(Although I might add PantherX did provide me a workaround to get GPU working on 7.6.9 that I will try over the weekend)

Image

Re: 155.247.166.220 downloads stalled

Posted: Sat May 02, 2020 1:30 am
by info2x
BillAnderson wrote:
info2x wrote:I'm just going to use this time to upgrade my client since they put out a new one. May as well right?
I ran the upgrade to 7.6.9 but it would not set up my GPU. I was told it is a known issue so I uninstalled 7.6.9 and went back to 7.5.1
(Although I might add PantherX did provide me a workaround to get GPU working on 7.6.9 that I will try over the weekend)

Image
Well then I'm going to consider myself super lucky. CPU/GPU both were set up and my GPU was able to download a work unit from 155.247.166.220. It was very slow, but it did download.

Only issue I noticed was when I went to install the new version it hung up on shutting down the existing clients which was odd because I quit everything manually before I started the install. Oh well a restart later everything installed.

Re: 155.247.166.220 downloads stalled

Posted: Sat May 02, 2020 11:02 pm
by info2x
Spoke too soon. Back to stalled downloads from this server again.

Re: 155.247.166.220 downloads stalled

Posted: Sun May 03, 2020 2:26 am
by vvoelz
We're working on the problem. We've seen similar problems before -- they might arise from how the server code deals with stale connections, compounded with network issues on campus. We have restarted the server code; let us know if the problem persists

Re: 155.247.166.220 downloads stalled

Posted: Sun May 03, 2020 8:54 am
by HaloJones
HI, it's not giving out units this morning (UTC 09:45) although the serverstats page says it's up.

Re: 155.247.166.220 downloads stalled

Posted: Sun May 03, 2020 3:34 pm
by info2x
I had issues at 0315 and 0932 per the time in the logs today. Had to restart both systems to get work.