GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Moderators: Site Moderators, FAHC Science Team

7im
Posts: 10179
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: What do we do with all of the unsent workunits?

Post by 7im »

Isn't this the same issue? http://foldingforum.org/viewtopic.php?f ... &start=210

Edit by Mod:
Topics merged.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
weedacres
Posts: 138
Joined: Mon Dec 24, 2007 11:18 pm
Hardware configuration: UserNames: weedacres_gpu ...
Location: Eastern Washington

Re: What do we do with all of the unsent workunits?

Post by weedacres »

7im wrote:Isn't this the same issue? http://foldingforum.org/viewtopic.php?f ... &start=210
It started with all of the server problems but If you'll look at the log in the first post you'll see that the message I get back when trying to do a manual send is:

Code: Select all

- Warning: Asked to send unfinished unit to server
So far this has been ignored in the other thread.
Image
ArVee
Posts: 121
Joined: Sun Dec 02, 2007 9:25 am

Re: What do we do with all of the unsent workunits?

Post by ArVee »

There's no official announcement re a fix yet, but I just had 5 of 6 backlogged WU's go up under an autosend. That was for one gpu on one machine, so far nothing autosent from either of two on another machine. Hope others see something similar. :)
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: What do we do with all of the unsent workunits?

Post by bruce »

weedacres wrote:. . .If you'll look at the log in the first post you'll see that the message I get back when trying to do a manual send is:

Code: Select all

- Warning: Asked to send unfinished unit to server
So far this has been ignored in the other thread.
That's because in this context, the message is virtually new and you're probably the first one to notice it, not because it has been ignored. It means they're one step closer to fixing the problem.

Since this is just another symptom of the same problem, the topics should be merged.
weedacres
Posts: 138
Joined: Mon Dec 24, 2007 11:18 pm
Hardware configuration: UserNames: weedacres_gpu ...
Location: Eastern Washington

Re: What do we do with all of the unsent workunits?

Post by weedacres »

bruce wrote:
That's because in this context, the message is virtually new and you're probably the first one to notice it, not because it has been ignored. It means they're one step closer to fixing the problem.

Since this is just another symptom of the same problem, the topics should be merged.
Ok, if you say so.
I currently have 78 completed workunits dated 2/14 that were moved into backup so I could continue to operate. Plus 1 from 2/16 and 2 from 2/17 that are in the active folders but won't upload.
Image
Flathead74
Posts: 266
Joined: Sun Dec 02, 2007 6:08 pm
Location: Central New York
Contact:

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Flathead74 »

Code: Select all

- Warning: Asked to send unfinished unit to server
Perhaps weedacres was the first person to notice this message, but not the only one.

I had also noticed this message on one of my systems,
but since weedacres had already posted about it, I did not chime in with a "me too".
This was on February 15 05:49:05 UTC.

It occurred when I attempted to manually upload "finished" work.
I tried with two different WUs, then gave up.

[05:48:16] Project: 10503 (Run 134, Clone 0, Gen 0)
[05:48:16] - Warning: Asked to send unfinished unit to server

[05:49:05] Project: 10xxx (Run 246, Clone 0, Gen 0)
[05:49:05] - Warning: Asked to send unfinished unit to server

I still have 119 WUs that were moved to backup so that I, too, could continue to operate.
Jolly-Swagman
Posts: 11
Joined: Tue Jul 01, 2008 9:18 am

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Jolly-Swagman »

Still getting issues with 171.67.108.21, gets new work on occasions but still have WU,s in Queue from back as far as Feb 04 2010 with hundreds of attenpts to upload ,

So may as well just shut the GPU,s down ,,,, after all I am the bunny paying for electricity to run these for what!! Science!!

That,s not happening if they dont get sent then is it!!
Image
Tobit
Posts: 342
Joined: Thu Apr 17, 2008 2:35 pm
Location: Manchester, NH USA

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Tobit »

Holy cow, CPU and net-load are through the roof right now on .17 .25 and .26 collection servers. I have one client that refuses to time out and move on. heh.

[02:41:21] Connecting to http://171.67.108.26:8080/
[02:51:36] Posted data.

it is now 02:58 and it is still sitting here. I guess it is time to manually intervene.
Amaruk
Posts: 254
Joined: Fri Jun 20, 2008 3:57 am
Location: Watching from the Woods

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Amaruk »

Flathead74 wrote:

Code: Select all

- Warning: Asked to send unfinished unit to server
Perhaps weedacres was the first person to notice this message, but not the only one.

I had also noticed this message on one of my systems,
but since weedacres had already posted about it, I did not chime in with a "me too".
This was on February 15 05:49:05 UTC.

It occurred when I attempted to manually upload "finished" work.
I tried with two different WUs, then gave up.

[05:48:16] Project: 10503 (Run 134, Clone 0, Gen 0)
[05:48:16] - Warning: Asked to send unfinished unit to server

[05:49:05] Project: 10xxx (Run 246, Clone 0, Gen 0)
[05:49:05] - Warning: Asked to send unfinished unit to server

I still have 119 WUs that were moved to backup so that I, too, could continue to operate.
Me too. :wink:
Image
weedacres
Posts: 138
Joined: Mon Dec 24, 2007 11:18 pm
Hardware configuration: UserNames: weedacres_gpu ...
Location: Eastern Washington

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by weedacres »

Perhaps weedacres was the first person to notice this message, but not the only one.
Me too. :wink:
Thanks guys, I was beginning to think I was the only one and couldn't figure out why.
:e(
Image
BSW_rama
Posts: 1
Joined: Wed Feb 25, 2009 3:49 pm
Hardware configuration: Celeron [email protected], 4GB RAM Transcend 800,
8800GTS 600/1750/2000 8600GS 300/1500/800,
P5B. HiPro 400W.
Location: HoBocu6upck
Contact:

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by BSW_rama »

[06:34:37] + Attempting to send results [February 18 06:34:37 UTC]
[06:34:39] (171.64.65.71:8080)
[06:34:42] (171.64.65.71:80)
[06:34:42] - Error: Could not transmit unit 03 (completed February 17) to work server.
in - ok
out - error
last time for my results 20 february.
Project: 10102 (Run 684, Clone 7, Gen 9) aka 548
i not will do 548 units.
Tobit
Posts: 342
Joined: Thu Apr 17, 2008 2:35 pm
Location: Manchester, NH USA

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Tobit »

.108.21 -- it is very weird how some WUs to this server upload just fine and others do not. I have had a WU in my queue for a few hours now that has been unable to transmit. However, some work I received from this server after this one have uploaded fine. My clients still hang very frequently when trying to connect to the 108.26 CS. At this moment, CPU load is acceptable and net-load is extremely low on .108.21-- however, CPU and net-load are still through the roof on all CS.

Code: Select all

Slot 09  Done
Project: 5781 (Run 10, Clone 80, Gen 4), Core: 11
Work server: 171.67.108.21:8080
Collection server: 171.67.108.26
Download date: February 17 22:09:32
Finished date: February 18 02:39:04
Failed uploads: 9

Code: Select all

Launch directory: C:\fah\gpu1
Executable: [email protected]
Arguments: -send all -verbosity 9 

[12:58:10] - Ask before connecting: No
[12:58:10] - User name: Tobit (Team 33)
[12:58:10] - User ID: 1FA10FEE5F260BA4
[12:58:10] - Machine ID: 3
[12:58:10] 
[12:58:10] Loaded queue successfully.
[12:58:10] Attempting to return result(s) to server...
[12:58:10] Trying to send all finished work units
[12:58:10] Project: 5781 (Run 10, Clone 80, Gen 4)
[12:58:10] - Read packet limit of 540015616... Set to 524286976.

[12:58:10] + Attempting to send results [February 18 12:58:10 UTC]
[12:58:10] - Reading file work/wuresults_09.dat from core
[12:58:10]   (Read 168749 bytes from disk)
[12:58:10] Connecting to http://171.67.108.21:8080/
[12:58:11] - Couldn't send HTTP request to server
[12:58:11] + Could not connect to Work Server (results)
[12:58:11]     (171.67.108.21:8080)
[12:58:11] + Retrying using alternative port
[12:58:11] Connecting to http://171.67.108.21:80/
[12:58:32] - Couldn't send HTTP request to server
[12:58:32] + Could not connect to Work Server (results)
[12:58:32]     (171.67.108.21:80)
[12:58:32] - Error: Could not transmit unit 09 (completed February 18) to work server.
[12:58:32] - 9 failed uploads of this unit.
[12:58:32] - Read packet limit of 540015616... Set to 524286976.

[12:58:32] + Attempting to send results [February 18 12:58:32 UTC]
[12:58:32] - Reading file work/wuresults_09.dat from core
[12:58:32]   (Read 168749 bytes from disk)
[12:58:32] Connecting to http://171.67.108.26:8080/
Marine Iguana
Posts: 3
Joined: Mon Jan 25, 2010 5:19 am

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by Marine Iguana »

Thought i would add mine here too even though it's the same as everyone else only been able to send a few GPU WU's as far as i know and worried about the deadline for the rest of them :(

Code: Select all

[04:40:08] + Attempting to send results [February 18 04:40:08 UTC]
[04:40:10] + Results successfully sent
[04:40:10] Thank you for your contribution to Folding@Home.
[04:40:10] + Number of Units Completed: 175

[04:40:14] Project: 5786 (Run 2, Clone 39, Gen 32)


[04:40:14] + Attempting to send results [February 18 04:40:14 UTC]
[04:40:19] - Couldn't send HTTP request to server
[04:40:19] + Could not connect to Work Server (results)
[04:40:19]     (171.67.108.21:8080)
[04:40:19] + Retrying using alternative port
[04:40:40] - Couldn't send HTTP request to server
[04:40:40] + Could not connect to Work Server (results)
[04:40:40]     (171.67.108.21:80)
[04:40:40] - Error: Could not transmit unit 01 (completed February 17) to work server.


[04:40:40] + Attempting to send results [February 18 04:40:40 UTC]
[04:41:19] - Couldn't send HTTP request to server
[04:41:19] + Could not connect to Work Server (results)
[04:41:19]     (171.67.108.26:8080)
[04:41:19] + Retrying using alternative port
[04:41:19] - Couldn't send HTTP request to server
[04:41:19]   (Got status 503)
[04:41:19] + Could not connect to Work Server (results)
[04:41:19]     (171.67.108.26:80)
[04:41:19]   Could not transmit unit 01 to Collection server; keeping in queue.
Image
HaloJones
Posts: 906
Joined: Thu Jul 24, 2008 10:16 am

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by HaloJones »

You know, it was bad enough to constantly deal with crap data causing 24 hour pauses in my folding through claiming that the problems were all at my end but now?

Code: Select all

[13:46:06] + Attempting to send results [February 18 13:46:06 UTC]
[13:46:08] - Couldn't send HTTP request to server
[13:46:08] + Could not connect to Work Server (results)
[13:46:08]     (171.67.108.26:8080)
[13:46:08] + Retrying using alternative port
[13:46:08] - Couldn't send HTTP request to server
[13:46:08]   (Got status 503)
[13:46:08] + Could not connect to Work Server (results)
[13:46:08]     (171.67.108.26:80)
[13:46:08]   Could not transmit unit 03 to Collection server; keeping in queue.
So a four hour "pause" while the client does nothing?! I didn't buy expensive hardware for a system that can't deliver simple data for me to work on. :(

Seriously peed off.
single 1070

Image
JimF
Posts: 651
Joined: Thu Jan 21, 2010 2:03 pm

Re: GPU server status 171.67.108.21, 171.64.65.71,171.67.108.26

Post by JimF »

I also have a status 503 failure report on 171.67.108.26. Whether that is unusual or not is beyond me.

Code: Select all

[19:54:33] + Attempting to send results [February 18 19:54:33 UTC]
[19:54:35] - Server does not have record of this unit. Will try again later.
[19:54:35] - Error: Could not transmit unit 03 (completed February 18) to work server.
[19:54:35]   Keeping unit 03 in queue.
[19:54:35] Project: 10105 (Run 428, Clone 0, Gen 11)
[19:54:35] - Read packet limit of 540015616... Set to 524286976.


[19:54:35] + Attempting to send results [February 18 19:54:35 UTC]
[19:54:37] - Server does not have record of this unit. Will try again later.
[19:54:37] - Error: Could not transmit unit 03 (completed February 18) to work server.
[19:54:37] - Read packet limit of 540015616... Set to 524286976.


[19:54:37] + Attempting to send results [February 18 19:54:37 UTC]
[19:54:42] - Server does not have record of this unit. Will try again later.
[19:54:42]   Could not transmit unit 03 to Collection server; keeping in queue.
[19:54:42] - Preparing to get new work unit...
[19:54:42] + Attempting to get work packet
[19:54:42] - Connecting to assignment server
[19:54:42] - Successful: assigned to (171.67.108.21).
[19:54:42] + News From Folding@Home: Welcome to Folding@Home
[19:54:43] Loaded queue successfully.
[19:54:44] Project: 10105 (Run 428, Clone 0, Gen 11)
[19:54:44] - Read packet limit of 540015616... Set to 524286976.


[19:54:44] + Attempting to send results [February 18 19:54:44 UTC]
[19:54:45] - Server does not have record of this unit. Will try again later.
[19:54:45] - Error: Could not transmit unit 03 (completed February 18) to work server.
[19:54:45] - Read packet limit of 540015616... Set to 524286976.


[19:54:45] + Attempting to send results [February 18 19:54:45 UTC]
[20:09:51] + Could not connect to Work Server (results)
[20:09:51]     (171.67.108.26:8080)
[20:09:51] + Retrying using alternative port
[20:09:51] - Couldn't send HTTP request to server
[20:09:51]   (Got status 503)
[20:09:51] + Could not connect to Work Server (results)
[20:09:51]     (171.67.108.26:80)
[20:09:51]   Could not transmit unit 03 to Collection server; keeping in queue.
[20:09:51] + Closed connections
Post Reply