130.237.232.237 going down for maintenance

Moderators: Site Moderators, FAHC Science Team

csm725
Posts: 7
Joined: Tue Feb 14, 2012 5:38 pm

Re: 130.237.232.237 going down for maintenance

Post by csm725 »

I use Langouste and am experiencing this issue with a 6903. Should I leave the machine idling? Will the WU upload (and will I download a new one) automatically when the server works again?
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: 130.237.232.237 going down for maintenance

Post by kasson »

The server had an issue this morning, compounding the diagnostic uncertainty for these other questions. We anticipate it should be back up within an hour.
csm725
Posts: 7
Joined: Tue Feb 14, 2012 5:38 pm

Re: 130.237.232.237 going down for maintenance

Post by csm725 »

Thanks for the prompt response, Kasson. Appreciated.
Nathan_P
Posts: 1180
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 x5670@3.2 Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 E5-2665@2.3 Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: 130.237.232.237 going down for maintenance

Post by Nathan_P »

csm725 wrote:I use Langouste and am experiencing this issue with a 6903. Should I leave the machine idling? Will the WU upload (and will I download a new one) automatically when the server works again?
Possibly - possibly not, personally i would go into the Langouste folder and copy the file back to your main work directory,stop the client and restart it with the -sendall switch, that should do it. This is one of those times where langouste isn't much help ( it cost me a WU once so i stopped using it and take the small ppd hit instead). I would wait though until kasson confirms that the server is back up
Image
csm725
Posts: 7
Joined: Tue Feb 14, 2012 5:38 pm

Re: 130.237.232.237 going down for maintenance

Post by csm725 »

After 4 tries of connecting to 130.237.232.237, I downloaded a 6097. I will let the rig fold overnight and I assume it'll pick a bigadv WU up when the server is back up.
Biffa
Posts: 69
Joined: Sun Nov 16, 2008 11:40 pm
Hardware configuration: RTX2080Ti
Threadripper 1950X
Location: UK
Contact:

Re: 130.237.232.237 going down for maintenance

Post by Biffa »

Is this server still having upload issues? I'm having trouble with a single 6903 :(

Also noted that the server client type is set to classic from here http://fah-web.stanford.edu/serverstat.html was that the issue before?

FWIW this is direct from client with no proxy (langouste or otherwise)

Have tried changing MAchine ID

Code: Select all

--- Opening Log file [April 26 08:46:02 UTC] 


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/Biffa/fah
Executable: ./fah6
Arguments: -send all -verbosity 9 -smp 48 -bigadv -verbosity 9 -forceasm 

[08:46:02] - Ask before connecting: No
[08:46:02] - User name: Biffa (Team 10)
[08:46:02] - User ID: 1080DCA737FA22D9
[08:46:02] - Machine ID: 1
[08:46:02] 
[08:46:02] Loaded queue successfully.
[08:46:02] Attempting to return result(s) to server...
[08:46:02] Trying to send all finished work units
[08:46:02] Project: 6903 (Run 1, Clone 14, Gen 84)


[08:46:02] + Attempting to send results [April 26 08:46:02 UTC]
[08:46:02] - Reading file work/wuresults_02.dat from core
[08:46:03]   (Read 222415734 bytes from disk)
[08:46:03] Connecting to http://130.237.232.237:8080/
[08:53:17] - Couldn't send HTTP request to server
[08:53:17] + Could not connect to Work Server (results)
[08:53:17]     (130.237.232.237:8080)
[08:53:17] + Retrying using alternative port
[08:53:17] Connecting to http://130.237.232.237:80/
[08:56:37] - Couldn't send HTTP request to server
[08:56:37] + Could not connect to Work Server (results)
[08:56:37]     (130.237.232.237:80)
[08:56:37] - Error: Could not transmit unit 02 (completed April 25) to work server.
[08:56:37] - 7 failed uploads of this unit.
[08:56:37]   Keeping unit 02 in queue.
[08:56:37] + Sent 0 of 1 completed units to the server
[08:56:37] - Failed to send all units to server
[08:56:37] ***** Got a SIGTERM signal (15)
[08:56:37] Killing all core threads

Folding@Home Client Shutdown.
Image
Biffa
Posts: 69
Joined: Sun Nov 16, 2008 11:40 pm
Hardware configuration: RTX2080Ti
Threadripper 1950X
Location: UK
Contact:

Re: 130.237.232.237 going down for maintenance

Post by Biffa »

Got another one that won't upload to this server now :(

Gonna miss the deadline for these Wu's at this stage.
Image
Biffa
Posts: 69
Joined: Sun Nov 16, 2008 11:40 pm
Hardware configuration: RTX2080Ti
Threadripper 1950X
Location: UK
Contact:

Re: 130.237.232.237 going down for maintenance

Post by Biffa »

Uploads working again.
Image
dubzhouse
Posts: 9
Joined: Fri Sep 09, 2011 1:07 am

Re: 130.237.232.237 going down for maintenance

Post by dubzhouse »

I am still having issues uploading a wu that was completed four days ago to this server. I am able to get a new wu but unable to upload to this server or another server.
Any ideas on what could be causing this. I keep getting unable to connect to server.
Grandpa_01
Posts: 1122
Joined: Wed Mar 04, 2009 7:36 am
Hardware configuration: 3 - Supermicro H8QGi-F AMD MC 6174=144 cores 2.5Ghz, 96GB G.Skill DDR3 1333Mhz Ubuntu 10.10
2 - Asus P6X58D-E i7 980X 4.4Ghz 6GB DDR3 2000 A-Data 64GB SSD Ubuntu 10.10
1 - Asus Rampage Gene III 17 970 4.3Ghz DDR3 2000 2-500GB Segate 7200.11 0-Raid Ubuntu 10.10
1 - Asus G73JH Laptop i7 740QM 1.86Ghz ATI 5870M

Re: 130.237.232.237 going down for maintenance

Post by Grandpa_01 »

Somebody need to go kick this server it has a 94% net load and is handing out the 511 byte WU's 1 right after the other. My Gulfies are having a gay old time trying to send and receive work. :wink:
Image
2 - SM H8QGi-F AMD 6xxx=112 cores @ 3.2 & 3.9Ghz
5 - SM X9QRI-f+ Intel 4650 = 320 cores @ 3.15Ghz
2 - I7 980X 4.4Ghz 2-GTX680
1 - 2700k 4.4Ghz GTX680
Total = 464 cores folding
KMac
Posts: 31
Joined: Thu Feb 17, 2011 6:50 pm

Re: 130.237.232.237 going down for maintenance

Post by KMac »

It will also not successfully accept a completed 8101 work unit if you manage to avoid a 512 byte WU.
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: 130.237.232.237 going down for maintenance

Post by kasson »

This server has been having issues. 8101 is on a different and more reliable server; we will direct most traffic to that instead.
ei57
Posts: 64
Joined: Thu Jun 12, 2008 10:23 am

Re: 130.237.232.237 going down for maintenance

Post by ei57 »

Any idea when these issues will be adressed?
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 130.237.232.237 going down for maintenance

Post by bruce »

ei57 wrote:Any idea when these issues will be adressed?
You've been around the forum long enough to know Stanford doesn't make predictions. The standard answer to a "when" question is one of three possible answers: 1) "Not soon" or 2) "Soon" or 3) It's already fixed.

You already know they're working on the problem. If this is like most problems, almost all of the time to fix it is spent diagnosing the exact nature of the problem; once they know exactly what is causing the problem, fixing it is probably quite easy. If the exact cause of the problem is not known yet, no estimate can be given. (Yes, there are some exceptions).

If you read back through this topic, you already know that several potential fixes have already been tried and the problem is still there, so it's not going to be easy to fix.

Putting 8101 is on a different and more reliable server and redirecting assignment requests to that server is at least a case of "it's already fixed" though there's still work to needs to be done that's no longer critical.
Post Reply