171.67.108 and 171.64.65.65 Problems?


DrBB1
Posts: 136
Joined: Wed Mar 26, 2008 12:30 am
Location: SE PA

Post by DrBB1 »

Are there new problems with either or both of these two servers receiving work? I have included both log information and queue information (see slot 2 and slot 5). (BTW: Did I lose the work from slot 2?)

I have only completed a couple of WUs with Windows 6.20 client so I don't have a lot of data, but my folding has been incredibly slow since I upgraded from 5.02. Are these two issues (inability to upload completed work and slow folding) completely unrelated and, if so, is the slow folding just a coincidence (the projects are new for me as well) or is this a problem with 6.20? If the latter, I'll consider going back to 5.02.

Thanks for the help.

Code: Select all

[12:15:46] Project: 2611 (Run 0, Clone 285, Gen 104)


[12:15:46] + Attempting to send results [November 5 12:15:46 UTC]
[12:15:48] - Couldn't send HTTP request to server
[12:15:48] + Could not connect to Work Server (results)
[12:15:48]     (171.64.65.65:8080)
[12:15:48] + Retrying using alternative port
[12:15:48] - Couldn't send HTTP request to server
[12:15:48]   (Got status 503)


[12:15:48] + Could not connect to Work Server (results)
[12:15:48]     (171.64.65.65:80)
[12:15:48] - Error: Could not transmit unit 05 (completed November 5) to work server.


[12:15:48] + Attempting to send results [November 5 12:15:48 UTC]
[12:15:48] - Couldn't send HTTP request to server
[12:15:48]   (Got status 503)
[12:15:48] + Could not connect to Work Server (results)
[12:15:48]     (171.67.108.25:8080)
[12:15:48] + Retrying using alternative port
[12:15:49] - Couldn't send HTTP request to server
[12:15:49]   (Got status 503)
[12:15:49] + Could not connect to Work Server (results)
[12:15:49]     (171.67.108.25:80)
[12:15:49]   Could not transmit unit 05 to Collection server; keeping in queue.
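
For anyone comparing logs like the one above, here is a minimal sketch of tallying which server and port each failed attempt hit, and which HTTP status codes came back. It assumes the client prints the endpoint as a `(host:port)` line after a failed connection, as in this log; the names `tally_failures`, `ENDPOINT` and `STATUS` are made up for the example and are not part of the FAH client.

```python
import re
from collections import Counter

# "(171.64.65.65:8080)" style lines; the assignment line "(171.64.65.106)."
# has no port and is deliberately not matched.
ENDPOINT = re.compile(r"\((\d{1,3}(?:\.\d{1,3}){3}):(\d+)\)")
STATUS = re.compile(r"\(Got status (\d+)\)")

def tally_failures(log_text):
    """Count failed attempts per (host, port) and per HTTP status code."""
    endpoints = Counter()
    statuses = Counter()
    for line in log_text.splitlines():
        endpoint = ENDPOINT.search(line)
        if endpoint:
            endpoints[(endpoint.group(1), int(endpoint.group(2)))] += 1
        status = STATUS.search(line)
        if status:
            statuses[int(status.group(1))] += 1
    return endpoints, statuses

# A few lines copied from the log above.
sample = """\
[12:15:48] - Couldn't send HTTP request to server
[12:15:48] + Could not connect to Work Server (results)
[12:15:48]     (171.64.65.65:8080)
[12:15:48] + Retrying using alternative port
[12:15:48] - Couldn't send HTTP request to server
[12:15:48]   (Got status 503)
[12:15:48] + Could not connect to Work Server (results)
[12:15:48]     (171.64.65.65:80)
"""

endpoints, statuses = tally_failures(sample)
for (host, port), count in sorted(endpoints.items()):
    print(host, port, count)
print(dict(statuses))
```

Running it over a full FAHlog.txt would show at a glance whether failures cluster on one server or are spread across several.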

Code: Select all

[14:22:23] Printing Queue Information
Current Queue: 
Slot 07  Empty/Deleted

Slot 08  Empty/Deleted

Slot 09  Empty/Deleted

Slot 00  Empty/Deleted

Slot 01  Empty/Deleted
Project: 4419 (Run 8, Clone 2, Gen 98), Core: 81
Work server: 171.64.122.72:8080
Collection server: 171.67.108.17
Download date: October 28 19:40:34
Finished date: October 28 23:12:08

Slot 02  Empty/Deleted
Project: 4111 (Run 9, Clone 4, Gen 13), Core: 81
Work server: 171.64.65.111:8080
Collection server: 171.67.108.17
Download date: October 28 23:18:44
Finished date: October 30 19:10:39
Failed uploads: 10

Slot 03  Empty/Deleted
Project: 2611 (Run 0, Clone 285, Gen 104), Core: 78
Work server: 171.64.65.65:8080
Collection server: 171.67.108.25
Download date: October 31 15:05:45
Finished date: January 1 00:00:00

Slot 04  Empty/Deleted
Project: 2611 (Run 0, Clone 285, Gen 104), Core: 78
Work server: 171.64.65.65:8080
Collection server: 171.67.108.25
Download date: October 31 16:54:32
Finished date: January 1 00:00:00

Slot 05  Done     
Project: 2611 (Run 0, Clone 285, Gen 104), Core: 78
Work server: 171.64.65.65:8080
Collection server: 171.67.108.25
Download date: October 31 18:32:42
Finished date: November 5 00:53:57
Failed uploads: 5

Slot 06 *Ready    
Project: 2526 (Run 41, Clone 86, Gen 1), Core: 78
Work server: 171.64.122.136:8080
Collection server: 171.67.108.17
Download date: November 5 00:54:42
Deadline date: December 28 00:54:42

PF: 0.946319 based on last 3 slot(s)
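
A rough sketch of picking out the slots with failed uploads from a queue dump like the one above. It assumes the layout shown here (a `Slot NN  Status` line followed by `Name: value` lines); `parse_queue` and `stuck_uploads` are hypothetical helper names, not part of any FAH tool.

```python
import re

# "Slot 06 *Ready" -> slot 6, status "Ready" (the "*" marks the current slot).
SLOT = re.compile(r"^Slot (\d+)\s+\*?(.+?)\s*$")

def parse_queue(text):
    """Split a queue dump into one dict per slot."""
    slots = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        match = SLOT.match(line)
        if match:
            current = {"slot": int(match.group(1)), "status": match.group(2)}
            slots.append(current)
        elif line.startswith("PF:"):
            current = None  # summary line, not part of any slot
        elif current is not None and ": " in line:
            key, value = line.split(": ", 1)
            current[key] = value
    return slots

def stuck_uploads(slots):
    """Return (slot, status, work server, failed uploads) for affected slots."""
    return [
        (s["slot"], s["status"], s.get("Work server"), int(s["Failed uploads"]))
        for s in slots
        if "Failed uploads" in s
    ]

# Abbreviated copy of slots 02, 05 and 06 from the dump above.
sample = """\
Slot 02  Empty/Deleted
Project: 4111 (Run 9, Clone 4, Gen 13), Core: 81
Work server: 171.64.65.111:8080
Collection server: 171.67.108.17
Failed uploads: 10

Slot 05  Done
Project: 2611 (Run 0, Clone 285, Gen 104), Core: 78
Work server: 171.64.65.65:8080
Collection server: 171.67.108.25
Failed uploads: 5

Slot 06 *Ready
Project: 2526 (Run 41, Clone 86, Gen 1), Core: 78
Work server: 171.64.122.136:8080

PF: 0.946319 based on last 3 slot(s)
"""

for entry in stuck_uploads(parse_queue(sample)):
    print(entry)
```

On this dump it lists slot 02 (Empty/Deleted, 10 failed uploads) and slot 05 (Done, 5 failed uploads), which are the two slots the question is about.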
zxy
Posts: 10
Joined: Sat Aug 30, 2008 12:58 pm

Re: 171.67.108 and 171.64.65.65 Problems?

Post by zxy »

STU's servers keep going down. Times have never been easy for us FAHers.

[00:54:59] Folding@home Core Shutdown: FINISHED_UNIT
[00:55:03] CoreStatus = 64 (100)
[00:55:03] Sending work to server
[00:55:03] Project: 5015 (Run 1, Clone 614, Gen 253)


[00:55:03] + Attempting to send results [November 6 00:55:03 UTC]
[00:55:04] - Couldn't send HTTP request to server
[00:55:04] + Could not connect to Work Server (results)
[00:55:04] (171.64.65.20:8080)
[00:55:04] + Retrying using alternative port
[00:55:06] - Couldn't send HTTP request to server
[00:55:06] + Could not connect to Work Server (results)
[00:55:06] (171.64.65.20:80)
[00:55:06] - Error: Could not transmit unit 02 (completed November 6) to work server.
[00:55:06] Keeping unit 02 in queue.
[00:55:06] Project: 5015 (Run 1, Clone 614, Gen 253)


[00:55:06] + Attempting to send results [November 6 00:55:06 UTC]
[00:55:07] - Couldn't send HTTP request to server
[00:55:07] + Could not connect to Work Server (results)
[00:55:07] (171.64.65.20:8080)
[00:55:07] + Retrying using alternative port
[00:55:09] - Couldn't send HTTP request to server
[00:55:09] + Could not connect to Work Server (results)
[00:55:09] (171.64.65.20:80)
[00:55:09] - Error: Could not transmit unit 02 (completed November 6) to work server.


[00:55:09] + Attempting to send results [November 6 00:55:09 UTC]
[00:55:10] - Couldn't send HTTP request to server
[00:55:10] + Could not connect to Work Server (results)
[00:55:10] (171.67.108.25:8080)
[00:55:10] + Retrying using alternative port
[00:55:11] - Couldn't send HTTP request to server
[00:55:11] + Could not connect to Work Server (results)
[00:55:11] (171.67.108.25:80)
[00:55:11] Could not transmit unit 02 to Collection server; keeping in queue.
[00:55:11] - Preparing to get new work unit...
[00:55:11] + Attempting to get work packet
[00:55:11] - Connecting to assignment server
[00:55:12] - Successful: assigned to (171.64.65.106).
[00:55:12] + News From Folding@Home: GPU folding beta
[00:55:12] Loaded queue successfully.
[00:55:15] Project: 5015 (Run 1, Clone 614, Gen 253)


[00:55:15] + Attempting to send results [November 6 00:55:15 UTC]
[00:55:16] - Couldn't send HTTP request to server
[00:55:16] + Could not connect to Work Server (results)
[00:55:16] (171.64.65.20:8080)
[00:55:16] + Retrying using alternative port
[00:55:18] - Couldn't send HTTP request to server
[00:55:18] + Could not connect to Work Server (results)
[00:55:18] (171.64.65.20:80)
[00:55:18] - Error: Could not transmit unit 02 (completed November 6) to work server.


[00:55:18] + Attempting to send results [November 6 00:55:18 UTC]
[00:55:19] - Couldn't send HTTP request to server
[00:55:19] + Could not connect to Work Server (results)
[00:55:19] (171.67.108.25:8080)
[00:55:19] + Retrying using alternative port
[00:55:21] - Couldn't send HTTP request to server
[00:55:21] + Could not connect to Work Server (results)
[00:55:21] (171.67.108.25:80)
[00:55:21] Could not transmit unit 02 to Collection server; keeping in queue.
[00:55:21] + Closed connections
[00:55:21]
[00:55:21] + Processing work unit
[00:55:21] Core required: FahCore_11.exe
[00:55:21] Core found.
[00:55:21] Working on queue slot 03 [November 6 00:55:21 UTC]
[00:55:21] + Working ...
3dski
Posts: 13
Joined: Fri Aug 01, 2008 3:56 pm

Re: 171.67.108 and 171.64.65.65 Problems?

Post by 3dski »

+1

My log file looks very much like the one above, including those specific IP addresses, starting around 09:00 GMT, 11/5.
ppetrone
Pande Group Member
Posts: 115
Joined: Wed Dec 12, 2007 6:20 pm
Location: Stanford

Re: 171.67.108 and 171.64.65.65 Problems?

Post by ppetrone »

Well, let me check with Edgar and Peter to see whether this indicates a problem.

Thanks,
Paula
lobuxracer
Posts: 18
Joined: Mon Aug 18, 2008 5:49 am

Re: 171.67.108 and 171.64.65.65 Problems?

Post by lobuxracer »

All of my GPU clients - 7 of them - are unable to upload (some have 3 WUs in queue), but keep getting work to do. I get no response from 171.67.108.25. Any idea when it will be up?

Also, I'm seeing "server has no record of this unit". What does this mean?
Rahade
Posts: 1
Joined: Wed Nov 05, 2008 11:49 am

Re: 171.67.108 and 171.64.65.65 Problems?

Post by Rahade »

Yep, I also have a problem sending results to 171.67.108.25. My client started trying to send them around 10 hours ago. Still no joy.
zxy
Posts: 10
Joined: Sat Aug 30, 2008 12:58 pm

Re: 171.67.108 and 171.64.65.65 Problems?

Post by zxy »

Rahade wrote: Yep, I also have a problem sending results to 171.67.108.25. My client started trying to send them around 10 hours ago. Still no joy.
The servers suck. I just want to show them my finger.


[09:21:11] Completed 96%
[09:22:47] Completed 97%
[09:24:23] Completed 98%
[09:25:58] Completed 99%
[09:27:34] Completed 100%
[09:27:34] Successful run
[09:27:34] DynamicWrapper: Finished Work Unit: sleep=10000
[09:27:44] Reserved 1127156 bytes for xtc file; Cosm status=0
[09:27:44] Allocated 1127156 bytes for xtc file
[09:27:44] - Reading up to 1127156 from "work/wudata_09.xtc": Read 1127156
[09:27:44] Read 1127156 bytes from xtc file; available packet space=261016332
[09:27:44] xtc file hash check passed.
[09:27:44] Reserved 34800 34800 261016332 bytes for arc file=<work/wudata_09.trr> Cosm status=0
[09:27:44] Allocated 34800 bytes for arc file
[09:27:44] - Reading up to 34800 from "work/wudata_09.trr": Read 34800
[09:27:44] Read 34800 bytes from arc file; available packet space=260981532
[09:27:44] trr file hash check passed.
[09:27:44] Allocated 560 bytes for edr file
[09:27:44] Read bedfile
[09:27:44] edr file hash check passed.
[09:27:44] Allocated 130928 bytes for logfile
[09:27:44] Read logfile
[09:27:44] GuardedRun: success in DynamicWrapper
[09:27:44] GuardedRun: done
[09:27:44] Run: GuardedRun completed.
[09:27:46] - Writing 1293956 bytes of core data to disk...
[09:27:46] ... Done.
[09:27:47] - Shutting down core
[09:27:47]
[09:27:47] Folding@home Core Shutdown: FINISHED_UNIT
[09:27:50] CoreStatus = 64 (100)
[09:27:50] Sending work to server
[09:27:50] Project: 5506 (Run 5, Clone 677, Gen 254)


[09:27:50] + Attempting to send results [November 6 09:27:50 UTC]
[09:27:51] - Couldn't send HTTP request to server
[09:27:51] + Could not connect to Work Server (results)
[09:27:51] (171.64.65.106:8080)
[09:27:51] + Retrying using alternative port
[09:27:53] - Couldn't send HTTP request to server
[09:27:53] + Could not connect to Work Server (results)
[09:27:53] (171.64.65.106:80)
[09:27:53] - Error: Could not transmit unit 09 (completed November 6) to work server.
[09:27:53] Keeping unit 09 in queue.
[09:27:53] Project: 5506 (Run 5, Clone 677, Gen 254)


[09:27:53] + Attempting to send results [November 6 09:27:53 UTC]
[09:27:55] - Couldn't send HTTP request to server
[09:27:55] + Could not connect to Work Server (results)
[09:27:55] (171.64.65.106:8080)
[09:27:55] + Retrying using alternative port
[09:27:56] - Couldn't send HTTP request to server
[09:27:56] + Could not connect to Work Server (results)
[09:27:56] (171.64.65.106:80)
[09:27:56] - Error: Could not transmit unit 09 (completed November 6) to work server.


[09:27:56] + Attempting to send results [November 6 09:27:56 UTC]
[09:27:57] - Couldn't send HTTP request to server
[09:27:57] (Got status 503)
[09:27:57] + Could not connect to Work Server (results)
[09:27:57] (171.67.108.25:8080)
[09:27:57] + Retrying using alternative port
[09:27:57] - Couldn't send HTTP request to server
[09:27:57] (Got status 503)
[09:27:57] + Could not connect to Work Server (results)
[09:27:57] (171.67.108.25:80)
[09:27:57] Could not transmit unit 09 to Collection server; keeping in queue.
[09:27:57] - Preparing to get new work unit...
[09:27:57] + Attempting to get work packet
[09:27:57] - Connecting to assignment server
[09:27:59] - Successful: assigned to (171.64.65.20).
[09:27:59] + News From Folding@Home: GPU folding beta
[09:27:59] Loaded queue successfully.
[09:28:01] Project: 5506 (Run 5, Clone 677, Gen 254)


[09:28:01] + Attempting to send results [November 6 09:28:01 UTC]
[09:28:03] - Couldn't send HTTP request to server
[09:28:03] + Could not connect to Work Server (results)
[09:28:03] (171.64.65.106:8080)
[09:28:03] + Retrying using alternative port
[09:28:04] - Couldn't send HTTP request to server
[09:28:04] + Could not connect to Work Server (results)
[09:28:04] (171.64.65.106:80)
[09:28:04] - Error: Could not transmit unit 09 (completed November 6) to work server.


[09:28:04] + Attempting to send results [November 6 09:28:04 UTC]
[09:28:05] - Couldn't send HTTP request to server
[09:28:05] (Got status 503)
[09:28:05] + Could not connect to Work Server (results)
[09:28:05] (171.67.108.25:8080)
[09:28:05] + Retrying using alternative port
[09:28:05] - Couldn't send HTTP request to server
[09:28:05] (Got status 503)
[09:28:05] + Could not connect to Work Server (results)
[09:28:05] (171.67.108.25:80)
[09:28:05] Could not transmit unit 09 to Collection server; keeping in queue.
[09:28:05] + Closed connections
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: 171.67.108 and 171.64.65.65 Problems?

Post by kasson »

Since different people keep track of different servers, it's probably most effective to keep each server in its own thread. I can only speak for 65.65 here; it is accepting work units, but because the work units are on the large side it's limited to accepting 100 transactions at a time right now. So it may take some trying to get through. (We had more simultaneous transactions, but the server binary was using more than the 8G of physical memory on the machine and getting slow.)
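
To illustrate the kind of cap described above, here is a minimal sketch of a gate that admits a fixed number of simultaneous transactions and turns the rest away. It is not the actual work server code: `TransactionGate` and `handle_upload` are invented names, and answering a full gate with HTTP 503 is an assumption based on the status codes in the logs earlier in this thread.

```python
import threading

class TransactionGate:
    """Admit at most `limit` simultaneous transactions; reject the rest."""

    def __init__(self, limit=100):
        self._slots = threading.BoundedSemaphore(limit)

    def try_begin(self):
        # Non-blocking acquire: True if a slot was free, False if the gate is full.
        return self._slots.acquire(blocking=False)

    def end(self):
        # Give the slot back once the transaction is finished.
        self._slots.release()

def handle_upload(gate):
    """Return an HTTP-style status: 200 if admitted, 503 if the gate is full."""
    if not gate.try_begin():
        return 503  # assumed response when busy; the client is expected to retry
    try:
        return 200  # a real server would receive and store the result here
    finally:
        gate.end()

# Small demonstration with a limit of 2 instead of 100.
gate = TransactionGate(limit=2)
gate.try_begin()
gate.try_begin()            # both slots are now held by in-flight uploads
print(handle_upload(gate))  # gate is full
gate.end()                  # one in-flight upload finishes
print(handle_upload(gate))  # a slot is free again
```

With a cap like this, a client that is rejected simply has to keep retrying until it lands in a free slot, which matches the "it may take some trying to get through" advice.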
DrBB1
Posts: 136
Joined: Wed Mar 26, 2008 12:30 am
Location: SE PA

Re: 171.67.108 and 171.64.65.65 Problems?

Post by DrBB1 »

Code: Select all

[23:47:23] + Attempting to send results [November 5 23:47:23 UTC]
[23:47:24] - Couldn't send HTTP request to server
[23:47:24] + Could not connect to Work Server (results)
[23:47:24]     (171.64.65.65:8080)
[23:47:24] + Retrying using alternative port
[23:48:02] + Results successfully sent
[23:48:02] Thank you for your contribution to Folding@Home.
[23:48:02] + Number of Units Completed: 3

[23:48:03] + Working...
Just wanted to report that 65.65 finally accepted the WU that started this thread.

Seriously, I do appreciate the vast complexity of the enterprise and the extreme effort it takes to make an operation like FAH work--even at a world-class institution like Stanford there are never enough resources to keep things running smoothly 24/7, and when something goes awry, it can be something obvious or it may take an indefinite amount of time to diagnose and fix. It helps me to remember that I'm not really folding for the points. I'm folding for my kids and future generations to have a better life. Thanks to all--including the volunteers--who are providing us the opportunity to help and supporting the enterprise.
G-Byte
Posts: 8
Joined: Sat Nov 08, 2008 4:39 am
Hardware configuration: MaxCore 555(216) @1100m/1566s - M2N32-SLI Phenom 9850 @2860
MaxCore 555(216) @000m/1512s " "
VMware/Ubuntu64 x2
Folding since October 21, 2008
G-Byte: Overclocker.net, Team 37726

Re: 171.67.108 and 171.64.65.65 Problems?

Post by G-Byte »

I am having a lot of trouble with this one too. 12 failed uploads and I don't know if it is too late to send the results in. So what did I do? Waste whatever time it took and all the time the upload failed? I do this for personal reasons but....

___________________________________________________________
Error: Could not transmit unit 05 (completed November 8) to work server.

Slot 05 Done
Project: 5506 (Run 5, Clone 413, Gen 236), Core: 11
Work server: 171.64.65.106:8080
Collection server: 171.67.108.25
Download date: November 7 22:31:40
Finished date: November 8 01:38:24
Failed uploads: 12
lobuxracer
Posts: 18
Joined: Mon Aug 18, 2008 5:49 am

Re: 171.67.108 and 171.64.65.65 Problems?

Post by lobuxracer »

This is getting ridiculous. 7 of 9 GPU clients have "+ Sent 0 of 1 completed units to the server." I wish I could say it's not been an issue, but it's ONGOING. Not only that, but I keep getting fahcore_13 projects from the assignment server when 7im tells us these should not be showing up. It's been well over 24 hours since this problem popped up and still I get these lame WUs.

Is the network so completely undersized it just can't handle the load? How frustrating is it for PG? It's certainly frustrating for those of us dealing with EUEs and Beta testing cores when we've not been told we're true beta testers. How much longer until this gets sorted out?
mikeb12
Posts: 28
Joined: Tue Feb 12, 2008 11:51 am
Location: South Carolina USA

Re: 171.67.108 and 171.64.65.65 Problems?

Post by mikeb12 »

Me too: failed sends across all GPUs since early this morning.
Everything is waiting in the queue with
Error: Could not transmit unit
This just sprouted overnight.

9600gso

Code: Select all

[07:03:05] + Attempting to send results [November 8 07:03:05 UTC]
[07:03:05] - Successful: assigned to (171.64.65.106).
[07:03:05] + News From Folding@Home: GPU folding beta
[07:03:05] Loaded queue successfully.
[07:03:06] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[07:03:06] - Couldn't send HTTP request to server
[07:03:06] + Could not connect to Work Server (results)
[07:03:06]     (171.64.65.20:8080)
[07:03:06] + Retrying using alternative port
[07:03:07] - Couldn't send HTTP request to server
[07:03:07] + Could not connect to Work Server (results)
[07:03:07]     (171.64.65.20:80)
[07:03:07] - Error: Could not transmit unit 09 (completed November 8) to work server.
[07:03:07] - Read packet limit of 540015616... Set to 524286976.


[07:03:07] + Attempting to send results [November 8 07:03:07 UTC]
[07:03:07] - Couldn't send HTTP request to server
[07:03:07]   (Got status 503)
[07:03:07] + Could not connect to Work Server (results)
[07:03:07]     (171.67.108.25:8080)
[07:03:07] + Retrying using alternative port
[07:03:08] - Couldn't send HTTP request to server
[07:03:08]   (Got status 503)
[07:03:08] + Could not connect to Work Server (results)
[07:03:08]     (171.67.108.25:80)
[07:03:08]   Could not transmit unit 09 to Collection server; keeping in queue.

9800gt

Code: Select all

[07:43:42] + Attempting to send results [November 8 07:43:42 UTC]
[07:43:44] - Couldn't send HTTP request to server
[07:43:44] + Could not connect to Work Server (results)
[07:43:44]     (171.64.65.20:8080)
[07:43:44] + Retrying using alternative port
[07:43:45] - Couldn't send HTTP request to server
[07:43:45] + Could not connect to Work Server (results)
[07:43:45]     (171.64.65.20:80)
[07:43:45] - Error: Could not transmit unit 08 (completed November 8) to work server.
[07:43:45] - Read packet limit of 540015616... Set to 524286976.


[07:43:45] + Attempting to send results [November 8 07:43:45 UTC]
[07:43:45] - Couldn't send HTTP request to server
[07:43:45]   (Got status 503)
[07:43:45] + Could not connect to Work Server (results)
[07:43:45]     (171.67.108.25:8080)
[07:43:45] + Retrying using alternative port
[07:43:45] - Couldn't send HTTP request to server
[07:43:45]   (Got status 503)
[07:43:45] + Could not connect to Work Server (results)
[07:43:45]     (171.67.108.25:80)
[07:43:45]   Could not transmit unit 08 to Collection server; keeping in queue.

8800gt

Code: Select all

[06:50:24] + Attempting to send results [November 8 06:50:24 UTC]
[06:50:25] - Couldn't send HTTP request to server
[06:50:25] + Could not connect to Work Server (results)
[06:50:25]     (171.64.65.20:8080)
[06:50:25] + Retrying using alternative port
[06:50:26] - Couldn't send HTTP request to server
[06:50:26] + Could not connect to Work Server (results)
[06:50:26]     (171.64.65.20:80)
[06:50:26] - Error: Could not transmit unit 01 (completed November 8) to work server.
[06:50:26] - Read packet limit of 540015616... Set to 524286976.


[06:50:26] + Attempting to send results [November 8 06:50:26 UTC]
[06:50:26] - Couldn't send HTTP request to server
[06:50:26]   (Got status 503)
[06:50:26] + Could not connect to Work Server (results)
[06:50:26]     (171.67.108.25:8080)
[06:50:26] + Retrying using alternative port
[06:50:26] - Couldn't send HTTP request to server
[06:50:26]   (Got status 503)
[06:50:26] + Could not connect to Work Server (results)
[06:50:26]     (171.67.108.25:80)
[06:50:26]   Could not transmit unit 01 to Collection server; keeping in queue.
[06:50:26] + Closed connections

8800gt

Code: Select all

[07:07:30] + Attempting to send results [November 8 07:07:30 UTC]
[07:07:32] - Couldn't send HTTP request to server
[07:07:32] + Could not connect to Work Server (results)
[07:07:32]     (171.64.65.20:8080)
[07:07:32] + Retrying using alternative port
[07:07:33] - Couldn't send HTTP request to server
[07:07:33] + Could not connect to Work Server (results)
[07:07:33]     (171.64.65.20:80)
[07:07:33] - Error: Could not transmit unit 01 (completed November 8) to work server.


[07:07:33] + Attempting to send results [November 8 07:07:33 UTC]
[07:07:33] - Couldn't send HTTP request to server
[07:07:33]   (Got status 503)
[07:07:33] + Could not connect to Work Server (results)
[07:07:33]     (171.67.108.25:8080)
[07:07:33] + Retrying using alternative port
[07:07:33] - Couldn't send HTTP request to server
[07:07:33]   (Got status 503)
[07:07:33] + Could not connect to Work Server (results)
[07:07:33]     (171.67.108.25:80)
[07:07:33]   Could not transmit unit 01 to Collection server; keeping in queue.