Page 1 of 1

171.67.108 and 171.64.65.65 Problems?

Posted: Wed Nov 05, 2008 2:45 pm
by DrBB1
Are there new problems with either or both of these two servers receiving work? I have included both log information and queue information (see slot 2 and slot 5). [BTW: Did I lose the work from slot 2?

I have only completed a couple of WUs with Windows 6.20 client so I don't have a lot of data, but my folding has been incredibly slow since I upgraded from 5.02. Are these two issues (inability to upload completed work and slow folding) completely unrelated and, if so, is the slow folding just a coincidence (the projects are new for me as well) or is this a problem with 6.20? If the latter, I'll consider going back to 5.02.

Thanks for the help. :e?:

Code: Select all

[12:15:46] Project: 2611 (Run 0, Clone 285, Gen 104)


[12:15:46] + Attempting to send results [November 5 12:15:46 UTC]
[12:15:48] - Couldn't send HTTP request to server
[12:15:48] + Could not connect to Work Server (results)
[12:15:48]     (171.64.65.65:8080)
[12:15:48] + Retrying using alternative port
[12:15:48] - Couldn't send HTTP request to server
[12:15:48]   (Got status 503)


[12:15:48] + Could not connect to Work Server (results)
[12:15:48]     (171.64.65.65:80)
[12:15:48] - Error: Could not transmit unit 05 (completed November 5) to work server.


[12:15:48] + Attempting to send results [November 5 12:15:48 UTC]
[12:15:48] - Couldn't send HTTP request to server
[12:15:48]   (Got status 503)
[12:15:48] + Could not connect to Work Server (results)
[12:15:48]     (171.67.108.25:8080)
[12:15:48] + Retrying using alternative port
[12:15:49] - Couldn't send HTTP request to server
[12:15:49]   (Got status 503)
[12:15:49] + Could not connect to Work Server (results)
[12:15:49]     (171.67.108.25:80)
[12:15:49]   Could not transmit unit 05 to Collection server; keeping in queue.

Code: Select all

[14:22:23] Printing Queue Information
Current Queue: 
Slot 07  Empty/Deleted

Slot 08  Empty/Deleted

Slot 09  Empty/Deleted

Slot 00  Empty/Deleted

Slot 01  Empty/Deleted
Project: 4419 (Run 8, Clone 2, Gen 98), Core: 81
Work server: 171.64.122.72:8080
Collection server: 171.67.108.17
Download date: October 28 19:40:34
Finished date: October 28 23:12:08

Slot 02  Empty/Deleted
Project: 4111 (Run 9, Clone 4, Gen 13), Core: 81
Work server: 171.64.65.111:8080
Collection server: 171.67.108.17
Download date: October 28 23:18:44
Finished date: October 30 19:10:39
Failed uploads: 10

Slot 03  Empty/Deleted
Project: 2611 (Run 0, Clone 285, Gen 104), Core: 78
Work server: 171.64.65.65:8080
Collection server: 171.67.108.25
Download date: October 31 15:05:45
Finished date: January 1 00:00:00

Slot 04  Empty/Deleted
Project: 2611 (Run 0, Clone 285, Gen 104), Core: 78
Work server: 171.64.65.65:8080
Collection server: 171.67.108.25
Download date: October 31 16:54:32
Finished date: January 1 00:00:00

Slot 05  Done     
Project: 2611 (Run 0, Clone 285, Gen 104), Core: 78
Work server: 171.64.65.65:8080
Collection server: 171.67.108.25
Download date: October 31 18:32:42
Finished date: November 5 00:53:57
Failed uploads: 5

Slot 06 *Ready    
Project: 2526 (Run 41, Clone 86, Gen 1), Core: 78
Work server: 171.64.122.136:8080
Collection server: 171.67.108.17
Download date: November 5 00:54:42
Deadline date: December 28 00:54:42

PF: 0.946319 based on last 3 slot(s)

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Thu Nov 06, 2008 1:14 am
by zxy
STU's servers keep downing,times never been easy for us FAHers~~

[00:54:59] Folding@home Core Shutdown: FINISHED_UNIT
[00:55:03] CoreStatus = 64 (100)
[00:55:03] Sending work to server
[00:55:03] Project: 5015 (Run 1, Clone 614, Gen 253)


[00:55:03] + Attempting to send results [November 6 00:55:03 UTC]
[00:55:04] - Couldn't send HTTP request to server
[00:55:04] + Could not connect to Work Server (results)
[00:55:04] (171.64.65.20:8080)
[00:55:04] + Retrying using alternative port
[00:55:06] - Couldn't send HTTP request to server
[00:55:06] + Could not connect to Work Server (results)
[00:55:06] (171.64.65.20:80)
[00:55:06] - Error: Could not transmit unit 02 (completed November 6) to work server.
[00:55:06] Keeping unit 02 in queue.
[00:55:06] Project: 5015 (Run 1, Clone 614, Gen 253)


[00:55:06] + Attempting to send results [November 6 00:55:06 UTC]
[00:55:07] - Couldn't send HTTP request to server
[00:55:07] + Could not connect to Work Server (results)
[00:55:07] (171.64.65.20:8080)
[00:55:07] + Retrying using alternative port
[00:55:09] - Couldn't send HTTP request to server
[00:55:09] + Could not connect to Work Server (results)
[00:55:09] (171.64.65.20:80)
[00:55:09] - Error: Could not transmit unit 02 (completed November 6) to work server.


[00:55:09] + Attempting to send results [November 6 00:55:09 UTC]
[00:55:10] - Couldn't send HTTP request to server
[00:55:10] + Could not connect to Work Server (results)
[00:55:10] (171.67.108.25:8080)
[00:55:10] + Retrying using alternative port
[00:55:11] - Couldn't send HTTP request to server
[00:55:11] + Could not connect to Work Server (results)
[00:55:11] (171.67.108.25:80)
[00:55:11] Could not transmit unit 02 to Collection server; keeping in queue.
[00:55:11] - Preparing to get new work unit...
[00:55:11] + Attempting to get work packet
[00:55:11] - Connecting to assignment server
[00:55:12] - Successful: assigned to (171.64.65.106).
[00:55:12] + News From Folding@Home: GPU folding beta
[00:55:12] Loaded queue successfully.
[00:55:15] Project: 5015 (Run 1, Clone 614, Gen 253)


[00:55:15] + Attempting to send results [November 6 00:55:15 UTC]
[00:55:16] - Couldn't send HTTP request to server
[00:55:16] + Could not connect to Work Server (results)
[00:55:16] (171.64.65.20:8080)
[00:55:16] + Retrying using alternative port
[00:55:18] - Couldn't send HTTP request to server
[00:55:18] + Could not connect to Work Server (results)
[00:55:18] (171.64.65.20:80)
[00:55:18] - Error: Could not transmit unit 02 (completed November 6) to work server.


[00:55:18] + Attempting to send results [November 6 00:55:18 UTC]
[00:55:19] - Couldn't send HTTP request to server
[00:55:19] + Could not connect to Work Server (results)
[00:55:19] (171.67.108.25:8080)
[00:55:19] + Retrying using alternative port
[00:55:21] - Couldn't send HTTP request to server
[00:55:21] + Could not connect to Work Server (results)
[00:55:21] (171.67.108.25:80)
[00:55:21] Could not transmit unit 02 to Collection server; keeping in queue.
[00:55:21] + Closed connections
[00:55:21]
[00:55:21] + Processing work unit
[00:55:21] Core required: FahCore_11.exe
[00:55:21] Core found.
[00:55:21] Working on queue slot 03 [November 6 00:55:21 UTC]
[00:55:21] + Working ...

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Thu Nov 06, 2008 2:52 am
by 3dski
+1

My log file looks very much like the one above, including those specific IP addresses, starting around 09:00 GMT, 11/5.

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Thu Nov 06, 2008 3:02 am
by ppetrone
Well, let me check with Edgar and Peter to see if this means a problem.

Thanks,
Paula

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Thu Nov 06, 2008 3:19 am
by lobuxracer
All of my GPU clients - 7 of them - are unable to upload (some have 3 WUs in queue), but keep getting work to do. I get no response from 171.67.108.25. Any idea when it will be up?

Also - server has no record of this unit - what does this mean?

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Thu Nov 06, 2008 8:27 am
by Rahade
Yep, I also have problem with sending results to 171.67.108.25. My client started to try send them around 10 hours ago. Still no joy.

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Thu Nov 06, 2008 10:29 am
by zxy
Rahade wrote:Yep, I also have problem with sending results to 171.67.108.25. My client started to try send them around 10 hours ago. Still no joy.
the servers suck,i just want to show my finger to them


[09:21:11] Completed 96%
[09:22:47] Completed 97%
[09:24:23] Completed 98%
[09:25:58] Completed 99%
[09:27:34] Completed 100%
[09:27:34] Successful run
[09:27:34] DynamicWrapper: Finished Work Unit: sleep=10000
[09:27:44] Reserved 1127156 bytes for xtc file; Cosm status=0
[09:27:44] Allocated 1127156 bytes for xtc file
[09:27:44] - Reading up to 1127156 from "work/wudata_09.xtc": Read 1127156
[09:27:44] Read 1127156 bytes from xtc file; available packet space=261016332
[09:27:44] xtc file hash check passed.
[09:27:44] Reserved 34800 34800 261016332 bytes for arc file=<work/wudata_09.trr> Cosm status=0
[09:27:44] Allocated 34800 bytes for arc file
[09:27:44] - Reading up to 34800 from "work/wudata_09.trr": Read 34800
[09:27:44] Read 34800 bytes from arc file; available packet space=260981532
[09:27:44] trr file hash check passed.
[09:27:44] Allocated 560 bytes for edr file
[09:27:44] Read bedfile
[09:27:44] edr file hash check passed.
[09:27:44] Allocated 130928 bytes for logfile
[09:27:44] Read logfile
[09:27:44] GuardedRun: success in DynamicWrapper
[09:27:44] GuardedRun: done
[09:27:44] Run: GuardedRun completed.
[09:27:46] - Writing 1293956 bytes of core data to disk...
[09:27:46] ... Done.
[09:27:47] - Shutting down core
[09:27:47]
[09:27:47] Folding@home Core Shutdown: FINISHED_UNIT
[09:27:50] CoreStatus = 64 (100)
[09:27:50] Sending work to server
[09:27:50] Project: 5506 (Run 5, Clone 677, Gen 254)


[09:27:50] + Attempting to send results [November 6 09:27:50 UTC]
[09:27:51] - Couldn't send HTTP request to server
[09:27:51] + Could not connect to Work Server (results)
[09:27:51] (171.64.65.106:8080)
[09:27:51] + Retrying using alternative port
[09:27:53] - Couldn't send HTTP request to server
[09:27:53] + Could not connect to Work Server (results)
[09:27:53] (171.64.65.106:80)
[09:27:53] - Error: Could not transmit unit 09 (completed November 6) to work server.
[09:27:53] Keeping unit 09 in queue.
[09:27:53] Project: 5506 (Run 5, Clone 677, Gen 254)


[09:27:53] + Attempting to send results [November 6 09:27:53 UTC]
[09:27:55] - Couldn't send HTTP request to server
[09:27:55] + Could not connect to Work Server (results)
[09:27:55] (171.64.65.106:8080)
[09:27:55] + Retrying using alternative port
[09:27:56] - Couldn't send HTTP request to server
[09:27:56] + Could not connect to Work Server (results)
[09:27:56] (171.64.65.106:80)
[09:27:56] - Error: Could not transmit unit 09 (completed November 6) to work server.


[09:27:56] + Attempting to send results [November 6 09:27:56 UTC]
[09:27:57] - Couldn't send HTTP request to server
[09:27:57] (Got status 503)
[09:27:57] + Could not connect to Work Server (results)
[09:27:57] (171.67.108.25:8080)
[09:27:57] + Retrying using alternative port
[09:27:57] - Couldn't send HTTP request to server
[09:27:57] (Got status 503)
[09:27:57] + Could not connect to Work Server (results)
[09:27:57] (171.67.108.25:80)
[09:27:57] Could not transmit unit 09 to Collection server; keeping in queue.
[09:27:57] - Preparing to get new work unit...
[09:27:57] + Attempting to get work packet
[09:27:57] - Connecting to assignment server
[09:27:59] - Successful: assigned to (171.64.65.20).
[09:27:59] + News From Folding@Home: GPU folding beta
[09:27:59] Loaded queue successfully.
[09:28:01] Project: 5506 (Run 5, Clone 677, Gen 254)


[09:28:01] + Attempting to send results [November 6 09:28:01 UTC]
[09:28:03] - Couldn't send HTTP request to server
[09:28:03] + Could not connect to Work Server (results)
[09:28:03] (171.64.65.106:8080)
[09:28:03] + Retrying using alternative port
[09:28:04] - Couldn't send HTTP request to server
[09:28:04] + Could not connect to Work Server (results)
[09:28:04] (171.64.65.106:80)
[09:28:04] - Error: Could not transmit unit 09 (completed November 6) to work server.


[09:28:04] + Attempting to send results [November 6 09:28:04 UTC]
[09:28:05] - Couldn't send HTTP request to server
[09:28:05] (Got status 503)
[09:28:05] + Could not connect to Work Server (results)
[09:28:05] (171.67.108.25:8080)
[09:28:05] + Retrying using alternative port
[09:28:05] - Couldn't send HTTP request to server
[09:28:05] (Got status 503)
[09:28:05] + Could not connect to Work Server (results)
[09:28:05] (171.67.108.25:80)
[09:28:05] Could not transmit unit 09 to Collection server; keeping in queue.
[09:28:05] + Closed connections

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Thu Nov 06, 2008 4:11 pm
by kasson
Since different people keep track of different servers, it's probably most effective to keep each server in its own thread. I can only speak for 65.65 here; it is accepting work units, but because the work units are on the large side it's limited to accepting 100 transactions at a time right now. So it may take some trying to get through. (We had more simultaneous transactions, but the server binary was using more than the 8G of physical memory on the machine and getting slow.)

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Thu Nov 06, 2008 8:17 pm
by DrBB1

Code: Select all

[23:47:23] + Attempting to send results [November 5 23:47:23 UTC]
[23:47:24] - Couldn't send HTTP request to server
[23:47:24] + Could not connect to Work Server (results)
[23:47:24]     (171.64.65.65:8080)
[23:47:24] + Retrying using alternative port
[23:48:02] + Results successfully sent
[23:48:02] Thank you for your contribution to Folding@Home.
[23:48:02] + Number of Units Completed: 3

[23:48:03] + Working...
Just wanted to report 65.65 finally accepted the WU that started this thread....Image

Seriously, I do appreciate the vast complexity of the enterprise and the extreme effort it takes to to make an operation like FAH work--even at a world-class institution like Stanford there are never enough resources to keep things running smoothly 24/7, and when something goes awry, it can be something obvious or it may take an indefinite amount of time to diagnose and fix. It helps me to remember that I'm not really folding for the points. I'm folding for my kids and future generations to have a better life. Thanks to all--including the volunteers--who are providing us the opportunity to help and supporting the enterprise.

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Sat Nov 08, 2008 4:44 am
by G-Byte
I am having alot of trouble with this one too. 12 failed uploads and I don't know if it is too late to send the results in. So what did I do? Waste whatever time it took and all the time the upload failed? I do this for personal reasons but....

___________________________________________________________
Error: Could not transmit unit 05 (completed November 8) to work server.

Slot 05 Done
Project: 5506 (Run 5, Clone 413, Gen 236), Core: 11
Work server: 171.64.65.106:8080
Collection server: 171.67.108.25
Download date: November 7 22:31:40
Finished date: November 8 01:38:24
Failed uploads: 12

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Sat Nov 08, 2008 6:18 am
by lobuxracer
This is getting ridiculous. 7 of 9 GPU clients have "+ Sent 0 of 1 completed units to the server." I wish I could say it's not been an issue, but it's ON GOING. Not only that, but I keep getting fahcore_13 projects from the assignment server when 7im tells us these should not be showing up. It's been well over 24 hours since this problem popped up and still I get these lame WUs.

Is the network so completely undersized it just can't handle the load? How frustrating is it for PG? It's certainly frustrating for those of us dealing with EUEs and Beta testing cores when we've not been told we're true beta testers. How much longer until this gets sorted out?

Re: 171.67.108 and 171.64.65.65 Problems?

Posted: Sat Nov 08, 2008 7:44 am
by mikeb12
me too, failed sends across all gpu's since early this morning..
all waiting in queue...
Error: Could not transmit unit
this morning.... just sprouted overnight...

9600gso

Code: Select all

[07:03:05] + Attempting to send results [November 8 07:03:05 UTC]
[07:03:05] - Successful: assigned to (171.64.65.106).
[07:03:05] + News From Folding@Home: GPU folding beta
[07:03:05] Loaded queue successfully.
[07:03:06] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[07:03:06] - Couldn't send HTTP request to server
[07:03:06] + Could not connect to Work Server (results)
[07:03:06]     (171.64.65.20:8080)
[07:03:06] + Retrying using alternative port
[07:03:07] - Couldn't send HTTP request to server
[07:03:07] + Could not connect to Work Server (results)
[07:03:07]     (171.64.65.20:80)
[07:03:07] - Error: Could not transmit unit 09 (completed November 8) to work server.
[07:03:07] - Read packet limit of 540015616... Set to 524286976.


[07:03:07] + Attempting to send results [November 8 07:03:07 UTC]
[07:03:07] - Couldn't send HTTP request to server
[07:03:07]   (Got status 503)
[07:03:07] + Could not connect to Work Server (results)
[07:03:07]     (171.67.108.25:8080)
[07:03:07] + Retrying using alternative port
[07:03:08] - Couldn't send HTTP request to server
[07:03:08]   (Got status 503)
[07:03:08] + Could not connect to Work Server (results)
[07:03:08]     (171.67.108.25:80)
[07:03:08]   Could not transmit unit 09 to Collection server; keeping in queue.

9800gt

Code: Select all

[07:43:42] + Attempting to send results [November 8 07:43:42 UTC]
[07:43:44] - Couldn't send HTTP request to server
[07:43:44] + Could not connect to Work Server (results)
[07:43:44]     (171.64.65.20:8080)
[07:43:44] + Retrying using alternative port
[07:43:45] - Couldn't send HTTP request to server
[07:43:45] + Could not connect to Work Server (results)
[07:43:45]     (171.64.65.20:80)
[07:43:45] - Error: Could not transmit unit 08 (completed November 8) to work server.
[07:43:45] - Read packet limit of 540015616... Set to 524286976.


[07:43:45] + Attempting to send results [November 8 07:43:45 UTC]
[07:43:45] - Couldn't send HTTP request to server
[07:43:45]   (Got status 503)
[07:43:45] + Could not connect to Work Server (results)
[07:43:45]     (171.67.108.25:8080)
[07:43:45] + Retrying using alternative port
[07:43:45] - Couldn't send HTTP request to server
[07:43:45]   (Got status 503)
[07:43:45] + Could not connect to Work Server (results)
[07:43:45]     (171.67.108.25:80)
[07:43:45]   Could not transmit unit 08 to Collection server; keeping in queue.

8800gt

Code: Select all

[06:50:24] + Attempting to send results [November 8 06:50:24 UTC]
[06:50:25] - Couldn't send HTTP request to server
[06:50:25] + Could not connect to Work Server (results)
[06:50:25]     (171.64.65.20:8080)
[06:50:25] + Retrying using alternative port
[06:50:26] - Couldn't send HTTP request to server
[06:50:26] + Could not connect to Work Server (results)
[06:50:26]     (171.64.65.20:80)
[06:50:26] - Error: Could not transmit unit 01 (completed November 8) to work server.
[06:50:26] - Read packet limit of 540015616... Set to 524286976.


[06:50:26] + Attempting to send results [November 8 06:50:26 UTC]
[06:50:26] - Couldn't send HTTP request to server
[06:50:26]   (Got status 503)
[06:50:26] + Could not connect to Work Server (results)
[06:50:26]     (171.67.108.25:8080)
[06:50:26] + Retrying using alternative port
[06:50:26] - Couldn't send HTTP request to server
[06:50:26]   (Got status 503)
[06:50:26] + Could not connect to Work Server (results)
[06:50:26]     (171.67.108.25:80)
[06:50:26]   Could not transmit unit 01 to Collection server; keeping in queue.
[06:50:26] + Closed connections

8800gt

Code: Select all

[07:07:30] + Attempting to send results [November 8 07:07:30 UTC]
[07:07:32] - Couldn't send HTTP request to server
[07:07:32] + Could not connect to Work Server (results)
[07:07:32]     (171.64.65.20:8080)
[07:07:32] + Retrying using alternative port
[07:07:33] - Couldn't send HTTP request to server
[07:07:33] + Could not connect to Work Server (results)
[07:07:33]     (171.64.65.20:80)
[07:07:33] - Error: Could not transmit unit 01 (completed November 8) to work server.


[07:07:33] + Attempting to send results [November 8 07:07:33 UTC]
[07:07:33] - Couldn't send HTTP request to server
[07:07:33]   (Got status 503)
[07:07:33] + Could not connect to Work Server (results)
[07:07:33]     (171.67.108.25:8080)
[07:07:33] + Retrying using alternative port
[07:07:33] - Couldn't send HTTP request to server
[07:07:33]   (Got status 503)
[07:07:33] + Could not connect to Work Server (results)
[07:07:33]     (171.67.108.25:80)
[07:07:33]   Could not transmit unit 01 to Collection server; keeping in queue.