Page 2 of 7

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 3:26 pm
by Jester
Still no good here.......

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 3:34 pm
by sbinh
kasson wrote:We tweaked a few settings; this may help.

We "ain't" see any help yet, but it got me in trouble ... :lol: :lol: .. Caught one of my system has this issue and thought it was OS corruption (due to multiple power outtages) . Re-installing OS now .... :| :| :| :| :|

Issue still persists on other systems.

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 3:52 pm
by ZeroHero
ZeroHero wrote:I'm doing well again - after the above mentioned tweaks. :D
Forgot to tell, that - at 23:33:25 UTC - I was assigned to different server than 130.237.232.141 - namely 171.67.108.22.
From then on everything was fine.
I'm still rolling, OS = XP/Pro/64, SMP Client Version 6.34 - doing Project: 2685 (Run 2, Clone 16, Gen 129) :D

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 3:55 pm
by slugbug
Turns out something corrupted my folding directory. Did a quick re-install and everything is working again. Strange how others experienced the same issue at the same time.

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 4:07 pm
by ZeroHero
slugbug wrote:Turns out something corrupted my folding directory. Did a quick re-install and everything is working again. Strange how others experienced the same issue at the same time.
I tried that too - but no luck at all. Not until I was assigned another server!

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 4:08 pm
by sbinh
That's not true .. I didn't re-install .. just removed -bigadv tag and it works fine. It must be something to do with -bigadv.

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 4:13 pm
by sick willie
I've had this problem on 3 different machines now.

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 4:23 pm
by HaloJones
Same here. Gone to simple smp for now.

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 7:01 pm
by kasson
Is anyone successfully getting work from this server? If not, I'll take it off the AS. The downside of that is that we'll probably run out of bigadv work (except for the 12+ core server), although that may get hit harder as well.

Re: Problems receiving work from 130.237.232.141

Posted: Sat Sep 03, 2011 7:57 pm
by firedfly
I just uploaded a WU and was assigned 130.237.232.141 for the next WU. I'm now receiving the FILE_IO_ERROR.

Re: Problems receiving work from 130.237.232.141

Posted: Sun Sep 04, 2011 12:09 am
by KMac
kasson wrote:Is anyone successfully getting work from this server? If not, I'll take it off the AS...
No IO Error anymore, but the client sits idle without intervention. Remove the bigadv flag and it runs again.

Code: Select all

--- Opening Log file [September 3 22:24:32 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\F@H\SMP
Executable: C:\Program Files (x86)\F@H\FAH.exe
Arguments: -oneunit -forceasm -smp -bigadv -verbosity 9 

[22:24:32] - Ask before connecting: No
[22:24:32] - User name: KMac (Team 33)
[22:24:32] - User ID: 7F87BEF1323CE7CF
[22:24:32] - Machine ID: 2
[22:24:32] 
[22:24:32] Work directory not found. Creating...
[22:24:32] Could not open work queue, generating new queue...
[22:24:32] - Preparing to get new work unit...
[22:24:32] - Autosending finished units... [September 3 22:24:32 UTC]
[22:24:32] Cleaning up work directory
[22:24:32] Trying to send all finished work units
[22:24:32] + No unsent completed units remaining.
[22:24:32] - Autosend completed
[22:24:32] + Attempting to get work packet
[22:24:32] Passkey found
[22:24:32] - Will indicate memory of 16359 MB
[22:24:32] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 10, Stepping: 7
[22:24:32] - Connecting to assignment server
[22:24:32] Connecting to http://assign.stanford.edu:8080/
[22:24:33] Posted data.
[22:24:33] Initial: ED82; - Successful: assigned to (130.237.232.141).
[22:24:33] + News From Folding@Home: Welcome to Folding@Home
[22:24:33] Loaded queue successfully.
[22:24:33] Sent data
[22:24:33] Connecting to http://130.237.232.141:8080/
[22:26:33] Posted data.
[22:46:33] Initial: 00DA; + Could not connect to Work Server
[23:06:33] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[23:06:43] + Attempting to get work packet
[23:06:43] Passkey found
[23:06:43] - Will indicate memory of 16359 MB
[23:06:43] - Connecting to assignment server
[23:06:43] Connecting to http://assign.stanford.edu:8080/
[23:06:43] Posted data.
[23:06:43] Initial: ED82; - Successful: assigned to (130.237.232.141).
[23:06:43] + News From Folding@Home: Welcome to Folding@Home
[23:06:43] Loaded queue successfully.
[23:06:43] Sent data
[23:06:43] Connecting to http://130.237.232.141:8080/
[23:08:44] Posted data.
[23:28:44] Initial: 00DA; + Could not connect to Work Server
[23:48:44] - Attempt #2  to get work failed, and no other work to do.
Waiting before retry.
[23:49:06] + Attempting to get work packet
[23:49:06] Passkey found
[23:49:06] - Will indicate memory of 16359 MB
[23:49:06] - Connecting to assignment server
[23:49:06] Connecting to http://assign.stanford.edu:8080/
[23:49:06] Posted data.
[23:49:06] Initial: ED82; - Successful: assigned to (130.237.232.141).
[23:49:06] + News From Folding@Home: Welcome to Folding@Home
[23:49:07] Loaded queue successfully.
[23:49:07] Sent data
[23:49:07] Connecting to http://130.237.232.141:8080/
[23:51:07] Posted data.
[23:57:35] Killing all core threads
[23:57:35] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[23:57:35] ***** Got a SIGTERM signal (2)
[23:57:35] Killing all core threads
[23:57:35] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.


--- Opening Log file [September 3 23:57:38 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\F@H\SMP
Executable: C:\Program Files (x86)\F@H\FAH.exe
Arguments: -oneunit -forceasm -smp -bigadv -verbosity 9 

[23:57:38] - Ask before connecting: No
[23:57:38] - User name: KMac (Team 33)
[23:57:38] - User ID: 7F87BEF1323CE7CF
[23:57:38] - Machine ID: 2
[23:57:38] 
[23:57:38] Loaded queue successfully.
[23:57:38] - Preparing to get new work unit...
[23:57:38] - Autosending finished units... [September 3 23:57:38 UTC]
[23:57:38] Cleaning up work directory
[23:57:38] Trying to send all finished work units
[23:57:38] + Attempting to get work packet
[23:57:38] + No unsent completed units remaining.
[23:57:38] Passkey found
[23:57:38] - Autosend completed
[23:57:38] - Will indicate memory of 16359 MB
[23:57:38] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 10, Stepping: 7
[23:57:38] - Connecting to assignment server
[23:57:38] Connecting to http://assign.stanford.edu:8080/
[23:57:39] Posted data.
[23:57:39] Initial: ED82; - Successful: assigned to (130.237.232.141).
[23:57:39] + News From Folding@Home: Welcome to Folding@Home
[23:57:39] Loaded queue successfully.
[23:57:39] Sent data
[23:57:39] Connecting to http://130.237.232.141:8080/
[23:58:47] Killing all core threads
[23:58:47] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[23:58:47] ***** Got a SIGTERM signal (2)
[23:58:47] Killing all core threads
[23:58:47] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.


--- Opening Log file [September 3 23:59:37 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\F@H\SMP
Executable: C:\Program Files (x86)\F@H\FAH.exe
Arguments: -oneunit -forceasm -smp -bigadv -verbosity 9 

[23:59:37] - Ask before connecting: No
[23:59:37] - User name: KMac (Team 33)
[23:59:37] - User ID: 7F87BEF1323CE7CF
[23:59:37] - Machine ID: 2
[23:59:37] 
[23:59:38] Loaded queue successfully.
[23:59:38] - Preparing to get new work unit...
[23:59:38] - Autosending finished units... [September 3 23:59:38 UTC]
[23:59:38] Cleaning up work directory
[23:59:38] Trying to send all finished work units
[23:59:38] + Attempting to get work packet
[23:59:38] + No unsent completed units remaining.
[23:59:38] Passkey found
[23:59:38] - Autosend completed
[23:59:38] - Will indicate memory of 16359 MB
[23:59:38] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 10, Stepping: 7
[23:59:38] - Connecting to assignment server
[23:59:38] Connecting to http://assign.stanford.edu:8080/
[23:59:38] Posted data.
[23:59:38] Initial: ED82; - Successful: assigned to (130.237.232.141).
[23:59:38] + News From Folding@Home: Welcome to Folding@Home
[23:59:38] Loaded queue successfully.
[23:59:38] Sent data
[23:59:38] Connecting to http://130.237.232.141:8080/
[00:01:38] Posted data.
[00:03:39] Killing all core threads
[00:03:39] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[00:03:39] ***** Got a SIGTERM signal (2)
[00:03:39] Killing all core threads
[00:03:39] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.


--- Opening Log file [September 4 00:03:43 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\F@H\SMP
Executable: C:\Program Files (x86)\F@H\FAH.exe
Arguments: -oneunit -forceasm -smp -verbosity 9 

[00:03:43] - Ask before connecting: No
[00:03:43] - User name: KMac (Team 33)
[00:03:43] - User ID: 7F87BEF1323CE7CF
[00:03:43] - Machine ID: 2
[00:03:43] 
[00:03:44] Loaded queue successfully.
[00:03:44] - Preparing to get new work unit...
[00:03:44] - Autosending finished units... [September 4 00:03:44 UTC]
[00:03:44] Cleaning up work directory
[00:03:44] Trying to send all finished work units
[00:03:44] + No unsent completed units remaining.
[00:03:44] + Attempting to get work packet
[00:03:44] - Autosend completed
[00:03:44] Passkey found
[00:03:44] - Will indicate memory of 16359 MB
[00:03:44] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 10, Stepping: 7
[00:03:44] - Connecting to assignment server
[00:03:44] Connecting to http://assign.stanford.edu:8080/
[00:03:44] Posted data.
[00:03:44] Initial: 8F80; - Successful: assigned to (128.143.199.96).
[00:03:44] + News From Folding@Home: Welcome to Folding@Home
[00:03:44] Loaded queue successfully.
[00:03:44] Sent data
[00:03:44] Connecting to http://128.143.199.96:8080/
[00:03:45] Posted data.
[00:03:45] Initial: 0000; - Receiving payload (expected size: 1766997)
[00:03:48] - Downloaded at ~575 kB/s
[00:03:48] - Averaged speed for that direction ~575 kB/s
[00:03:48] + Received work.
[00:03:48] + Closed connections
[00:03:48] 
[00:03:48] + Processing work unit
[00:03:48] Core required: FahCore_a3.exe
[00:03:48] Core found.
[00:03:48] Working on queue slot 01 [September 4 00:03:48 UTC]
[00:03:48] + Working ...
[00:03:48] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 01 -np 8 -nocpulock -checkpoint 3 -forceasm -verbose -lifeline 7532 -version 634'

[00:03:48] 
[00:03:48] *------------------------------*
[00:03:48] Folding@Home Gromacs SMP Core
[00:03:48] Version 2.27 (Dec. 15, 2010)
[00:03:48] 
[00:03:48] Preparing to commence simulation
[00:03:48] - Assembly optimizations manually forced on.
[00:03:48] - Not checking prior termination.
[00:03:48] - Expanded 1766485 -> 2257064 (decompressed 127.7 percent)
[00:03:48] Called DecompressByteArray: compressed_data_size=1766485 data_size=2257064, decompressed_data_size=2257064 diff=0
[00:03:48] - Digital signature verified
[00:03:48] 
[00:03:48] Project: 6950 (Run 0, Clone 36, Gen 309)
[00:03:48] 
[00:03:48] Assembly optimizations on if available.
[00:03:48] Entering M.D.
[00:03:54] Mapping NT from 8 to 8 
[00:03:54] Completed 0 out of 500000 steps  (0%)

Re: Problems receiving work from 130.237.232.141

Posted: Sun Sep 04, 2011 4:45 am
by stevew
Same hang under Wine. No response from SMP work server. After about an hour FAH6.34 fetched FahCore_a3.exe and is folding small stuff. Added -oneunit to client.cfg and will try for -bigadv later.

Re: Problems receiving work from 130.237.232.141

Posted: Sun Sep 04, 2011 9:31 am
by bollix47
As can be seen in the log when the client tried to return Project: 6900 (Run 59, Clone 19, Gen 22) the client hung. After a restart it did try to send again but there's no indication that it sent. Normal upload time on my setup for these WUs is 13min30sec.

Code: Select all

[05:24:18] Completed 250000 out of 250000 steps  (100%)
[05:24:28] DynamicWrapper: Finished Work Unit: sleep=10000
[05:24:38] 
[05:24:38] Finished Work Unit:
[05:24:38] - Reading up to 52713120 from "work/wudata_01.trr": Read 52713120
[05:24:38] trr file hash check passed.
[05:24:38] - Reading up to 47030384 from "work/wudata_01.xtc": Read 47030384
[05:24:38] xtc file hash check passed.
[05:24:38] edr file hash check passed.
[05:24:38] logfile size: 195491
[05:24:38] Leaving Run
[05:24:40] - Writing 100106935 bytes of core data to disk...
[05:24:42]   ... Done.
[05:24:55] - Shutting down core
[05:24:55] 
[05:24:55] Folding@home Core Shutdown: FINISHED_UNIT
[05:25:06] CoreStatus = 64 (100)
[05:25:06] Unit 1 finished with 69 percent of time to deadline remaining.
[05:25:06] Updated performance fraction: 0.761142
[05:25:06] Sending work to server
[05:25:06] Project: 6900 (Run 59, Clone 19, Gen 22)


[05:25:06] + Attempting to send results [September 4 05:25:06 UTC]
[05:25:06] - Reading file work/wuresults_01.dat from core
[05:25:06]   (Read 100106935 bytes from disk)
[05:25:06] Connecting to http://130.237.232.141:8080/  <<<<<<<<<<<<<<<<<<<<<<<<<<
[08:09:18] - Autosending finished units... [September 4 08:09:18 UTC]
[08:09:18] Trying to send all finished work units
[08:09:18] - Already sending work
[08:09:18] + Sent 0 of 1 completed units to the server
[08:09:18] - Autosend completed
[08:42:45] Killing all core threads
[08:42:45] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[08:42:45] ***** Got a SIGTERM signal (2)
[08:42:45] Killing all core threads
[08:42:45] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.


--- Opening Log file [September 4 08:42:53 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\fah\smp
Executable: FAH6.34-win32-SMP.exe
Arguments: -verbosity 9 -smp 8 -bigadv 

[08:42:53] - Ask before connecting: No
[08:42:53] - User name: bollix47 (Team 39340)
[08:42:53] - User ID: XXXXXXXXXXX65DC
[08:42:53] - Machine ID: 4
[08:42:53] 
[08:42:53] Loaded queue successfully.
[08:42:53] - Preparing to get new work unit...
[08:42:53] - Autosending finished units... [September 4 08:42:53 UTC]
[08:42:53] Cleaning up work directory
[08:42:53] Trying to send all finished work units
[08:42:53] Project: 6900 (Run 59, Clone 19, Gen 22)


[08:42:53] + Attempting to send results [September 4 08:42:53 UTC]
[08:42:53] - Reading file work/wuresults_01.dat from core
[08:42:53]   (Read 100106935 bytes from disk)
[08:42:53] Connecting to http://130.237.232.141:8080/  <<<<<<<<<<<<<<<<<<<<<<<
[08:42:53] + Attempting to get work packet
[08:42:53] Passkey found
[08:42:53] - Will indicate memory of 8168 MB
[08:42:53] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 10, Stepping: 7
[08:42:53] - Connecting to assignment server
[08:42:53] Connecting to http://assign.stanford.edu:8080/
[08:42:54] Posted data.
[08:42:54] Initial: 8F80; - Successful: assigned to (128.143.199.97).
[08:42:54] + News From Folding@Home: Welcome to Folding@Home
[08:42:54] Loaded queue successfully.
[08:42:54] Sent data
[08:42:54] Connecting to http://128.143.199.97:8080/
[08:42:57] Posted data.
[08:42:57] Initial: 0000; - Receiving payload (expected size: 2112415)
[08:42:59] - Downloaded at ~1031 kB/s
[08:42:59] - Averaged speed for that direction ~752 kB/s
[08:42:59] + Received work.
[08:42:59] + Closed connections
[08:42:59] 
[08:42:59] + Processing work unit
[08:42:59] Core required: FahCore_a3.exe
[08:42:59] Core found.
[08:42:59] Working on queue slot 02 [September 4 08:42:59 UTC]
[08:42:59] + Working ...
[08:42:59] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 02 -np 8 -priority 96 -checkpoint 30 -verbose -lifeline 3064 -version 634'

[08:42:59] 
[08:42:59] *------------------------------*
[08:42:59] Folding@Home Gromacs SMP Core
[08:42:59] Version 2.27 (Dec. 15, 2010)
[08:42:59] 
[08:42:59] Preparing to commence simulation
[08:42:59] - Looking at optimizations...
[08:42:59] - Created dyn
[08:42:59] - Files status OK
[08:43:00] - Expanded 2111903 -> 3093280 (decompressed 146.4 percent)
[08:43:00] Called DecompressByteArray: compressed_data_size=2111903 data_size=3093280, decompressed_data_size=3093280 diff=0
[08:43:00] - Digital signature verified
[08:43:00] 
[08:43:00] Project: 7508 (Run 0, Clone 10, Gen 24)
[08:43:00] 
[08:43:00] Assembly optimizations on if available.
[08:43:00] Entering M.D.
[08:43:06] Mapping NT from 8 to 8 
[08:43:06] Completed 0 out of 500000 steps  (0%)
[08:46:37] Completed 5000 out of 500000 steps  (1%)
[08:50:08] Completed 10000 out of 500000 steps  (2%)
[08:53:39] Completed 15000 out of 500000 steps  (3%)
[08:57:09] Completed 20000 out of 500000 steps  (4%)
[09:00:41] Completed 25000 out of 500000 steps  (5%)
[09:04:11] Completed 30000 out of 500000 steps  (6%)
[09:07:42] Completed 35000 out of 500000 steps  (7%)
[09:11:13] Completed 40000 out of 500000 steps  (8%)
[09:14:44] Completed 45000 out of 500000 steps  (9%)
[09:18:17] Completed 50000 out of 500000 steps  (10%)

Re: Problems receiving work from 130.237.232.141

Posted: Sun Sep 04, 2011 10:51 am
by Dave_Goodchild
Yep got issues uploading to this server here as well, got two machines trying to upload one has been trying for nearly two hours, upload time is normally 7-15 mins depending on size of WU.

Re: Problems receiving work from 130.237.232.141

Posted: Sun Sep 04, 2011 11:24 am
by ei57
Same issues here, no upload. Tried one client with -send all and saw normal activity for about 15-20 seconds, then the networking went idle. Usually these WU's are uploaded in less than 2 minutes.

Project: 6900 (Run 45, Clone 3, Gen 41)