Page 1 of 7

130.237.232.141

Posted: Sun Nov 21, 2010 9:13 am
by Dave_Goodchild
130.237.232.141 appears to be down, it's pingable but doesn't respond on port 80, I've got work queued up waiting to be sent back since yesterday(no bonus for me), collection server has no record of the WU's either.

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 2:44 pm
by Dave_Goodchild
Hi,

Any news on this please?

Even if it's just a we're looking into it.

The clients currently just retrying to sent the WU's every few hours and at 100Mb + per WU it's wasting lots of bandwidth.

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 3:43 pm
by Tobit
According to the server status page, it appears to be up and functioning properly. Maybe it is something on the net between you and the server. In any event, I will ask Dr. Kasson to check it.

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 3:55 pm
by Dave_Goodchild
Thanks for the reply, it doesn't appear to be an issue between me and the server as the connection is refused at the server side, I have tried from both connections at home and at work with the same issue.

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 3:59 pm
by Tobit
Yup, I see that now. I am unable to connect now myself. The Doctor has been informed.

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 4:17 pm
by Dave_Goodchild
Thanks

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 5:50 pm
by kasson
This has never been up--it's not yet supported but on our to-do list. We hope to have it working soon but no ETA (probably days, not hours and not weeks).

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 5:54 pm
by Tobit
kasson wrote:This has never been up--it's not yet supported but on our to-do list. We hope to have it working soon but no ETA (probably days, not hours and not weeks).
I wonder why he has work waiting to be uploaded to that server then.

Dave, could you please post your logfile, surrounded by

Code: Select all

 tags so we can have a better look at what is going on?

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 7:00 pm
by bruce
Tobit wrote:Dave, could you please post your logfile, surrounded by

Code: Select all

 tags so we can have a better look at what is going on?[/quote]

We need to see what happened when the WU was originally downloaded.  Is there any chance that you've moved the WU from one computer to another?

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 7:03 pm
by Dave_Goodchild
Hmmm sounds very odd I wonder how I got assigned to it, I've copied the section of the log for this work unit, hopefully this will help solve the mystery.

There's no chance the WU was moved between computers.

Code: Select all

 08:24:11] Trying to send all finished work units
[08:24:11] + No unsent completed units remaining.
[08:24:11] - Preparing to get new work unit...
[08:24:11] Cleaning up work directory
[08:24:11] + Attempting to get work packet
[08:24:11] Passkey found
[08:24:11] - Will indicate memory of 12279 MB
[08:24:11] - Connecting to assignment server
[08:24:11] Connecting to http://assign.stanford.edu:8080/
[08:24:12] Posted data.
[08:24:12] Initial: ED82; - Successful: assigned to (130.237.232.141).
[08:24:12] + News From Folding@Home: Welcome to Folding@Home
[08:24:12] Loaded queue successfully.
[08:24:12] Sent data
[08:24:12] Connecting to http://130.237.232.141:8080/
[08:24:20] Posted data.
[08:24:20] Initial: 0000; - Receiving payload (expected size: 24860597)
[08:24:44] - Downloaded at ~1011 kB/s
[08:24:44] - Averaged speed for that direction ~339 kB/s
[08:24:44] + Received work.
[08:24:44] Trying to send all finished work units
[08:24:44] + No unsent completed units remaining.
[08:24:44] + Closed connections
[08:24:44] 
[08:24:44] + Processing work unit
[08:24:44] Core required: FahCore_a3.exe
[08:24:44] Core found.
[08:24:44] Working on queue slot 07 [November 19 08:24:44 UTC]
[08:24:44] + Working ...
[08:24:44] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 07 -np 21 -checkpoint 15 -verbose -lifeline 5680 -version 630'

[08:24:44] 
[08:24:44] *------------------------------*
[08:24:44] Folding@Home Gromacs SMP Core
[08:24:44] Version 2.22 (Mar 12, 2010)
[08:24:44] 
[08:24:44] Preparing to commence simulation
[08:24:44] - Looking at optimizations...
[08:24:44] - Created dyn
[08:24:44] - Files status OK
[08:24:49] - Expanded 24860085 -> 30796293 (decompressed 123.8 percent)
[08:24:49] Called DecompressByteArray: compressed_data_size=24860085 data_size=30796293, decompressed_data_size=30796293 diff=0
[08:24:49] - Digital signature verified
[08:24:49] 
[08:24:49] Project: 6900 (Run 10, Clone 7, Gen 2)
[08:24:49] 
[08:24:49] Assembly optimizations on if available.
[08:24:49] Entering M.D.
[08:24:57] Completed 0 out of 250000 steps  (0%)
[08:38:00] Completed 2500 out of 250000 steps  (1%)
[08:51:02] Completed 5000 out of 250000 steps  (2%)
[09:04:04] Completed 7500 out of 250000 steps  (3%)
[09:17:06] Completed 10000 out of 250000 steps  (4%)
[09:30:10] Completed 12500 out of 250000 steps  (5%)
[09:43:12] Completed 15000 out of 250000 steps  (6%)
[09:56:13] Completed 17500 out of 250000 steps  (7%)
[10:09:11] Completed 20000 out of 250000 steps  (8%)
[10:22:12] Completed 22500 out of 250000 steps  (9%)
[10:35:11] Completed 25000 out of 250000 steps  (10%)
[10:48:11] Completed 27500 out of 250000 steps  (11%)
[11:01:12] Completed 30000 out of 250000 steps  (12%)
[11:06:24] - Autosending finished units... [November 19 11:06:24 UTC]
[11:06:24] Trying to send all finished work units
[11:06:24] + No unsent completed units remaining.
[11:06:24] - Autosend completed
[11:14:14] Completed 32500 out of 250000 steps  (13%)
[11:27:16] Completed 35000 out of 250000 steps  (14%)
[11:40:18] Completed 37500 out of 250000 steps  (15%)
[11:53:17] Completed 40000 out of 250000 steps  (16%)
[12:06:19] Completed 42500 out of 250000 steps  (17%)
[12:19:19] Completed 45000 out of 250000 steps  (18%)
[12:32:19] Completed 47500 out of 250000 steps  (19%)
[12:45:19] Completed 50000 out of 250000 steps  (20%)
[12:58:18] Completed 52500 out of 250000 steps  (21%)
[13:11:19] Completed 55000 out of 250000 steps  (22%)
[13:24:17] Completed 57500 out of 250000 steps  (23%)
[13:37:17] Completed 60000 out of 250000 steps  (24%)
[13:50:18] Completed 62500 out of 250000 steps  (25%)
[14:03:18] Completed 65000 out of 250000 steps  (26%)
[14:16:18] Completed 67500 out of 250000 steps  (27%)
[14:29:18] Completed 70000 out of 250000 steps  (28%)
[14:42:17] Completed 72500 out of 250000 steps  (29%)
[14:55:17] Completed 75000 out of 250000 steps  (30%)
[15:08:15] Completed 77500 out of 250000 steps  (31%)
[15:21:15] Completed 80000 out of 250000 steps  (32%)
[15:34:15] Completed 82500 out of 250000 steps  (33%)
[15:47:14] Completed 85000 out of 250000 steps  (34%)
[16:00:13] Completed 87500 out of 250000 steps  (35%)
[16:13:12] Completed 90000 out of 250000 steps  (36%)
[16:26:11] Completed 92500 out of 250000 steps  (37%)
[16:39:09] Completed 95000 out of 250000 steps  (38%)
[16:52:09] Completed 97500 out of 250000 steps  (39%)
[17:05:09] Completed 100000 out of 250000 steps  (40%)
[17:06:24] - Autosending finished units... [November 19 17:06:24 UTC]
[17:06:24] Trying to send all finished work units
[17:06:24] + No unsent completed units remaining.
[17:06:24] - Autosend completed
[17:18:10] Completed 102500 out of 250000 steps  (41%)
[17:31:10] Completed 105000 out of 250000 steps  (42%)
[17:44:11] Completed 107500 out of 250000 steps  (43%)
[17:57:12] Completed 110000 out of 250000 steps  (44%)
[18:10:12] Completed 112500 out of 250000 steps  (45%)
[18:23:09] Completed 115000 out of 250000 steps  (46%)
[18:36:08] Completed 117500 out of 250000 steps  (47%)
[18:49:07] Completed 120000 out of 250000 steps  (48%)
[19:02:07] Completed 122500 out of 250000 steps  (49%)
[19:15:06] Completed 125000 out of 250000 steps  (50%)
[19:28:05] Completed 127500 out of 250000 steps  (51%)
[19:41:07] Completed 130000 out of 250000 steps  (52%)
[19:54:04] Completed 132500 out of 250000 steps  (53%)
[20:07:03] Completed 135000 out of 250000 steps  (54%)
[20:20:02] Completed 137500 out of 250000 steps  (55%)
[20:33:02] Completed 140000 out of 250000 steps  (56%)
[20:46:02] Completed 142500 out of 250000 steps  (57%)
[20:59:01] Completed 145000 out of 250000 steps  (58%)
[21:12:01] Completed 147500 out of 250000 steps  (59%)
[21:24:58] Completed 150000 out of 250000 steps  (60%)
[21:37:58] Completed 152500 out of 250000 steps  (61%)
[21:50:56] Completed 155000 out of 250000 steps  (62%)
[22:03:56] Completed 157500 out of 250000 steps  (63%)
[22:16:55] Completed 160000 out of 250000 steps  (64%)
[22:29:54] Completed 162500 out of 250000 steps  (65%)
[22:42:54] Completed 165000 out of 250000 steps  (66%)
[22:55:53] Completed 167500 out of 250000 steps  (67%)
[23:06:24] - Autosending finished units... [November 19 23:06:24 UTC]
[23:06:24] Trying to send all finished work units
[23:06:24] + No unsent completed units remaining.
[23:06:24] - Autosend completed
[23:08:50] Completed 170000 out of 250000 steps  (68%)
[23:21:50] Completed 172500 out of 250000 steps  (69%)
[23:34:49] Completed 175000 out of 250000 steps  (70%)
[23:47:48] Completed 177500 out of 250000 steps  (71%)
[00:00:47] Completed 180000 out of 250000 steps  (72%)
[00:13:46] Completed 182500 out of 250000 steps  (73%)
[00:26:45] Completed 185000 out of 250000 steps  (74%)
[00:39:42] Completed 187500 out of 250000 steps  (75%)
[00:52:41] Completed 190000 out of 250000 steps  (76%)
[01:05:40] Completed 192500 out of 250000 steps  (77%)
[01:18:39] Completed 195000 out of 250000 steps  (78%)
[01:31:37] Completed 197500 out of 250000 steps  (79%)
[01:44:37] Completed 200000 out of 250000 steps  (80%)
[01:57:35] Completed 202500 out of 250000 steps  (81%)
[02:10:34] Completed 205000 out of 250000 steps  (82%)
[02:23:30] Completed 207500 out of 250000 steps  (83%)
[02:36:29] Completed 210000 out of 250000 steps  (84%)
[02:49:27] Completed 212500 out of 250000 steps  (85%)
[03:02:26] Completed 215000 out of 250000 steps  (86%)
[03:15:24] Completed 217500 out of 250000 steps  (87%)
[05:06:24] - Autosending finished units... [November 20 05:06:24 UTC]
[05:06:24] Trying to send all finished work units
[05:06:24] + No unsent completed units remaining.
[05:06:24] - Autosend completed
[08:54:12] Killing all core threads
[08:54:12] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[08:54:12] ***** Got a SIGTERM signal (2)
[08:54:12] Killing all core threads
[08:54:12] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.


--- Opening Log file [November 20 08:54:37 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.30

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\SMP
Executable: C:\SMP\[email protected]
Arguments: -smp 21 -bigadv -verbosity 9 

[08:54:37] - Ask before connecting: No
[08:54:37] - User name: Dave_Goodchild (Team 35947)
[08:54:37] - User ID: 4D2F7CD11E7E8458
[08:54:37] - Machine ID: 1
[08:54:37] 
[08:54:37] Loaded queue successfully.
[08:54:37] 
[08:54:37] - Autosending finished units... [November 20 08:54:37 UTC]
[08:54:37] + Processing work unit
[08:54:37] Trying to send all finished work units
[08:54:37] Core required: FahCore_a3.exe
[08:54:37] + No unsent completed units remaining.
[08:54:37] Core found.
[08:54:37] - Autosend completed
[08:54:37] Working on queue slot 07 [November 20 08:54:37 UTC]
[08:54:37] + Working ...
[08:54:37] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 07 -np 21 -checkpoint 15 -verbose -lifeline 5960 -version 630'

[08:54:37] 
[08:54:37] *------------------------------*
[08:54:37] Folding@Home Gromacs SMP Core
[08:54:37] Version 2.22 (Mar 12, 2010)
[08:54:37] 
[08:54:37] Preparing to commence simulation
[08:54:37] - Ensuring status. Please wait.
[08:54:46] - Looking at optimizations...
[08:54:46] - Working with standard loops on this execution.
[08:54:46] - Previous termination of core was improper.
[08:54:46] - Files status OK
[08:54:51] - Expanded 24860085 -> 30796293 (decompressed 123.8 percent)
[08:54:51] Called DecompressByteArray: compressed_data_size=24860085 data_size=30796293, decompressed_data_size=30796293 diff=0
[08:54:51] - Digital signature verified
[08:54:51] 
[08:54:51] Project: 6900 (Run 10, Clone 7, Gen 2)
[08:54:51] 
[08:54:51] Entering M.D.
[08:54:57] Using Gromacs checkpoints
[08:55:05] Resuming from checkpoint
[08:55:05] Verified work/wudata_07.log
[08:55:06] Verified work/wudata_07.trr
[08:55:06] Verified work/wudata_07.xtc
[08:55:06] Verified work/wudata_07.edr
[08:55:06] Completed 216460 out of 250000 steps  (86%)
[09:00:33] Completed 217500 out of 250000 steps  (87%)
[09:13:36] Completed 220000 out of 250000 steps  (88%)
[09:26:39] Completed 222500 out of 250000 steps  (89%)
[09:39:41] Completed 225000 out of 250000 steps  (90%)
[09:52:45] Completed 227500 out of 250000 steps  (91%)
[10:05:50] Completed 230000 out of 250000 steps  (92%)
[10:18:54] Completed 232500 out of 250000 steps  (93%)
[10:31:58] Completed 235000 out of 250000 steps  (94%)
[10:45:03] Completed 237500 out of 250000 steps  (95%)
[10:58:07] Completed 240000 out of 250000 steps  (96%)
[11:11:12] Completed 242500 out of 250000 steps  (97%)
[11:24:15] Completed 245000 out of 250000 steps  (98%)
[11:37:19] Completed 247500 out of 250000 steps  (99%)
[11:50:23] Completed 250000 out of 250000 steps  (100%)
[11:50:34] DynamicWrapper: Finished Work Unit: sleep=10000
[11:50:44] 
[11:50:44] Finished Work Unit:
[11:50:44] - Reading up to 52713120 from "work/wudata_07.trr": Read 52713120
[11:50:45] trr file hash check passed.
[11:50:45] - Reading up to 47030228 from "work/wudata_07.xtc": Read 47030228
[11:50:45] xtc file hash check passed.
[11:50:45] edr file hash check passed.
[11:50:45] logfile size: 200414
[11:50:45] Leaving Run
[11:50:46] - Writing 100111702 bytes of core data to disk...
[11:50:48]   ... Done.
[11:50:58] - Shutting down core
[11:50:58] 
[11:50:58] Folding@home Core Shutdown: FINISHED_UNIT
[11:51:03] CoreStatus = 64 (100)
[11:51:03] Unit 7 finished with 81 percent of time to deadline remaining.
[11:51:03] Updated performance fraction: 0.874810
[11:51:03] Sending work to server
[11:51:03] Project: 6900 (Run 10, Clone 7, Gen 2)


[11:51:03] + Attempting to send results [November 20 11:51:03 UTC]
[11:51:03] - Reading file work/wuresults_07.dat from core
[11:51:03]   (Read 100111702 bytes from disk)
[11:51:03] Connecting to http://130.237.232.141:8080/
[11:51:03] - Couldn't send HTTP request to server
[11:51:03] + Could not connect to Work Server (results)
[11:51:03]     (130.237.232.141:8080)
[11:51:03] + Retrying using alternative port
[11:51:03] Connecting to http://130.237.232.141:80/
[11:51:22] - Couldn't send HTTP request to server
[11:51:22] + Could not connect to Work Server (results)
[11:51:22]     (130.237.232.141:80)
[11:51:22] - Error: Could not transmit unit 07 (completed November 20) to work server.
[11:51:22] - 1 failed uploads of this unit.
[11:51:22]   Keeping unit 07 in queue.
[11:51:22] Trying to send all finished work units
[11:51:22] Project: 6900 (Run 10, Clone 7, Gen 2)


[11:51:22] + Attempting to send results [November 20 11:51:22 UTC]
[11:51:22] - Reading file work/wuresults_07.dat from core
[11:51:22]   (Read 100111702 bytes from disk)
[11:51:22] Connecting to http://130.237.232.141:8080/
[11:51:22] - Couldn't send HTTP request to server
[11:51:22] + Could not connect to Work Server (results)
[11:51:22]     (130.237.232.141:8080)
[11:51:22] + Retrying using alternative port
[11:51:22] Connecting to http://130.237.232.141:80/
[11:51:41] - Couldn't send HTTP request to server
[11:51:41] + Could not connect to Work Server (results)
[11:51:41]     (130.237.232.141:80)
[11:51:41] - Error: Could not transmit unit 07 (completed November 20) to work server.
[11:51:41] - 2 failed uploads of this unit.


[11:51:41] + Attempting to send results [November 20 11:51:41 UTC]
[11:51:41] - Reading file work/wuresults_07.dat from core
[11:51:41]   (Read 100111702 bytes from disk)
[11:51:41] Connecting to http://130.237.165.141:8080/
[11:51:41] - Couldn't send HTTP request to server
[11:51:41] + Could not connect to Work Server (results)
[11:51:41]     (130.237.165.141:8080)
[11:51:41] + Retrying using alternative port
[11:51:41] Connecting to http://130.237.165.141:80/
[12:06:11] Posted data.
[12:06:12] Initial: 0000; - Server does not have record of this unit. Will try again later.
[12:06:12]   Could not transmit unit 07 to Collection server; keeping in queue.
[12:06:12] + Sent 0 of 1 completed units to the server
[12:06:12] - Preparing to get new work unit...
[12:06:12] Cleaning up work directory
Let me know what you want me to do with the waiting WU.

Re: 130.237.232.141:80

Posted: Mon Nov 22, 2010 7:10 pm
by Tobit
Kasson: According to psummary, p6900 is hosted from 130.237.232.141 so something is indeed using this IP address and he definitely received work from that IP unless there is some IP masquerading going on.

Re: 130.237.232.141

Posted: Mon Nov 22, 2010 7:27 pm
by bruce
Yes, he did download the WU from that server, but it was port 8080, which means it has to be returned to port 8080. The original title threw me off because it specified ...141:80 but I've removed those last three characters.

The client is trying to return it to the right port and failing, which is a different issue.

Code: Select all

[11:51:03] Connecting to http://130.237.232.141:8080/
[11:51:03] - Couldn't send HTTP request to server
We need to ignore all the error messages and problems associated with 130.237.232.141:80 and only look at what's going on with 130.237.232.141:8080.

@Dave_Goodchild:
Please post the recent segments of FAHlog that show upload attempts to 130.237.232.141:8080. Please confirm that you can open http://130.237.232.141:8080 in your browser. (It does work from here.)

Re: 130.237.232.141

Posted: Mon Nov 22, 2010 7:35 pm
by Tobit
bruce wrote:and only look at what's going on with 130.237.232.141:8080.
Yup, that port is working for me. Dave, what happens when you put http://130.237.232.141:8080 into your browser? Do you get "OK"?

Re: 130.237.232.141

Posted: Mon Nov 22, 2010 8:16 pm
by Dave_Goodchild
Guys, sorry for the misleading port number I pasted the wrong line out of the log.

I did test both 80 & 8080 in a browser though with the same result.

Code: Select all

[10:40:30] - Autosending finished units... [November 22 10:40:30 UTC]
[10:40:30] Trying to send all finished work units
[10:40:30] Project: 6900 (Run 10, Clone 7, Gen 2)


[10:40:30] + Attempting to send results [November 22 10:40:30 UTC]
[10:40:30] - Reading file work/wuresults_07.dat from core
[10:40:30]   (Read 100111702 bytes from disk)
[10:40:30] Connecting to http://130.237.232.141:8080/
[10:40:31] - Couldn't send HTTP request to server
[10:40:31] + Could not connect to Work Server (results)
[10:40:31]     (130.237.232.141:8080)
[10:40:31] + Retrying using alternative port
[10:40:31] Connecting to http://130.237.232.141:80/
[10:40:50] - Couldn't send HTTP request to server
[10:40:50] + Could not connect to Work Server (results)
[10:40:50]     (130.237.232.141:80)
[10:40:50] - Error: Could not transmit unit 07 (completed November 20) to work server.
[10:40:50] - 13 failed uploads of this unit.
After seeing your post I tried connecting again and now get OK, I checked the logs and the WU has now sent sucessfully.

Seeing as I tested on multiple machines over different internet connections I can only assume someone has fixed something.

There is still the mystery that this machine shouldn't be giving out WU's?

Thanks for your help, I'm guessing the bonus points are long gone :(

Re: 130.237.232.141

Posted: Mon Nov 22, 2010 10:07 pm
by bruce
Well, based on your original report I did ping the Pande Group and they did look into it, deciding that port 80 has never worked (for either download or upload, so that's not a problem). Something else might have gotten fixed in the process.

As the official documentation says, they can't promise a bonus (or 100% server reliability) but they'll do their best. Overall, I think they do a pretty good job, but that's not particularly reassuring when you're the one that didn't get the bonus.