128.143.231.201 (BA) acting up?

Moderators: Site Moderators, FAHC Science Team

PinHead
Posts: 285
Joined: Tue Jan 24, 2012 3:43 am
Hardware configuration: Quad Q9550 2.83 contains the GPU 57xx - running SMP and GPU
Quad Q6700 2.66 running just SMP
2P 32core Interlagos SMP on linux

Re: 128.143.231.201 (BA) acting up?

Post by PinHead »

Finished my SMP WU from a few hours ago, but when I switch back to BA this is all that I get ( over and over ):

Code: Select all

[23:49:18] Trying to send all finished work units
[23:49:18] + No unsent completed units remaining.
[23:49:18] - Preparing to get new work unit...
[23:49:18] Cleaning up work directory
[23:49:18] + Attempting to get work packet
[23:49:18] Passkey found
[23:49:18] - Will indicate memory of 16078 MB
[23:49:18] - Connecting to assignment server
[23:49:18] Connecting to http://assign.stanford.edu:8080/
[23:49:19] Posted data.
[23:49:19] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[23:49:19] + News From Folding@Home: Welcome to Folding@Home
[23:49:19] Loaded queue successfully.
[23:49:19] Sent data
[23:49:19] Connecting to http://128.143.231.201:8080/
[23:49:19] Posted data.
[23:49:19] Initial: 0000; - Receiving payload (expected size: 512)
[23:49:19] Conversation time very short, giving reduced weight in bandwidth avg
[23:49:19] - Downloaded at ~1 kB/s
[23:49:19] - Averaged speed for that direction ~17 kB/s
[23:49:19] + Received work.
[23:49:19] + Closed connections
[23:49:24]
[23:49:24] + Processing work unit
[23:49:24] Core required: FahCore_a5.exe
[23:49:24] Core found.
[23:49:24] Working on queue slot 00 [March 30 23:49:24 UTC]
[23:49:24] + Working ...
[23:49:24] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 00 -np 24 -checkpoint 15 -verbose -lifeline 1984 -version 634'

[23:49:24]
[23:49:24] *------------------------------*
[23:49:24] Folding@Home Gromacs SMP Core
[23:49:24] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[23:49:24]
[23:49:24] Preparing to commence simulation
[23:49:24] - Looking at optimizations...
[23:49:24] - Created dyn
[23:49:24] - Files status OK
[23:49:24] Couldn't Decompress
[23:49:24] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[23:49:24] -Error: Couldn't update checksum variables
[23:49:24] Error: Could not open work file
[23:49:24]
[23:49:24] Folding@home Core Shutdown: FILE_IO_ERROR
[23:49:24] CoreStatus = 75 (117)
[23:49:24] Error opening or reading from a file.
[23:49:24] Deleting current work unit & continuing...
[23:49:24] Trying to send all finished work units

Nathan_P
Posts: 1164
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 [email protected] Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 [email protected] Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: 128.143.231.201 (BA) acting up?

Post by Nathan_P »

still playing up, I've switched to smp until further notice for this rig - lets see what happens with my other rig in about 5 hours
Image
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for aprox. 70K PPD
Location: Salem. OR USA

Re: 128.143.231.201 (BA) acting up?

Post by P5-133XL »

Just as a side question, has anyone experimented/explored running the NaCl client on one of these BA machines. Is there a CPU limit to the NaCl? Does the network speed limit PPD more than the CPU? I'm curious as to the PPD when compared to SMP/BA?
Image
Nathan_P
Posts: 1164
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 [email protected] Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 [email protected] Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: 128.143.231.201 (BA) acting up?

Post by Nathan_P »

I tried to get the NaCl client to work on my daily rig but it wouldn't start folding, followed the FAQ and still couldn't get it to fold. I might have another go this weekend. The rig in question is at the lower end of the BA spectrum (dual x5670) but its the only one that has windows and chrome on it.

As for my earlier post, my 2nd rig is just starting a 8104.
Image
PinHead
Posts: 285
Joined: Tue Jan 24, 2012 3:43 am
Hardware configuration: Quad Q9550 2.83 contains the GPU 57xx - running SMP and GPU
Quad Q6700 2.66 running just SMP
2P 32core Interlagos SMP on linux

Re: 128.143.231.201 (BA) acting up?

Post by PinHead »

I think the problem still exists.

Earlier today, my 64 core boxes picked up SMP's but are now working on BA WU. The 24 core box can't seem to get a BA WU. Still trying to download 512 byte 8105, over and over.
PinHead
Posts: 285
Joined: Tue Jan 24, 2012 3:43 am
Hardware configuration: Quad Q9550 2.83 contains the GPU 57xx - running SMP and GPU
Quad Q6700 2.66 running just SMP
2P 32core Interlagos SMP on linux

Re: 128.143.231.201 (BA) acting up?

Post by PinHead »

So is there anything else I can try?

There haven't been any software changes on this box and now it can't seem to pull a BA WU. This has been going on for a couple of days. Server still says 16 cores but can't seem to give my 24 core box a correct assignment and delivery. My 64 core boxes seem to work fine, only one short glitch 1 day ago and back to work on BA units.

Here is the start up:

Code: Select all

Note: Please read the license agreement (fah6 -license). Further 
use of this software requires that you have read and accepted this agreement.

24 cores detected


--- Opening Log file [April 1 22:13:56 UTC] 


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /media/Fold/FAH
Executable: ./fah6
Arguments: -smp -bigadv -verbosity 9 

[22:13:56] - Ask before connecting: No
[22:13:56] - User name: PinHead (Team 4)
[22:13:56] - User ID: XXXXXXXXXXXXXX
[22:13:56] - Machine ID: 1
[22:13:56] 
[22:13:56] Loaded queue successfully.
[A2:13:56] 
[22:13:56] - Autosending finished units... [A2:13:1 22:13:56 UTC]
[22:13:56] + Processing work unit
[22:13:56] Trying to send all finished work units
[22:13:56] Core required: FahCore_a5.exe
[22:13:56] + No unsent completed units remaining.
[22:13:56] Core found.
[22:13:56] - Autosend completed
[22:13:56] Working on queue slot 03 [April 1 22:13:56 UTC]
[22:13:56] + Working ...
[22:13:56] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 03 -np 24 -checkpoint 15 -verbose -lifeline 11393 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11398
thekraken: Logging to thekraken.log
[22:13:56] 
[22:13:56] *------------------------------*
[22:13:56] Folding@Home Gromacs SMP Core
[22:13:56] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:13:56] 
[22:13:56] Preparing to commence simulation
[22:13:56] - Looking at optimizations...
[22:13:56] - Created dyn
[22:13:56] - Files status OK
[22:13:56] Couldn't Decompress
[22:13:56] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:13:56] -Error: Couldn't update checksum variables
[22:13:56] Error: Could not open work file
[22:13:56] 
[22:13:56] Folding@home Core Shutdown: FILE_IO_ERROR
[22:13:56] CoreStatus = 75 (117)
[22:13:56] Error opening or reading from a file.
[22:13:56] Deleting current work unit & continuing...
thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11401
thekraken: Logging to thekraken.log
[22:13:56] Trying to send all finished work units
[22:13:56] + No unsent completed units remaining.
[22:13:56] - Preparing to get new work unit...
[22:13:56] Cleaning up work directory
[22:13:56] + Attempting to get work packet
[22:13:56] Passkey found
[22:13:56] - Will indicate memory of 16078 MB
[22:13:56] - Connecting to assignment server
[22:13:56] Connecting to http://assign.stanford.edu:8080/
[22:13:57] Posted data.
[22:13:57] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[22:13:57] + News From Folding@Home: Welcome to Folding@Home
[22:13:57] Loaded queue successfully.
[22:13:57] Sent data
[22:13:57] Connecting to http://128.143.231.201:8080/
[22:13:57] Posted data.
[22:13:57] Initial: 0000; - Receiving payload (expected size: 512)
[22:13:57] Conversation time very short, giving reduced weight in bandwidth avg
[22:13:57] - Downloaded at ~1 kB/s
[22:13:57] - Averaged speed for that direction ~29 kB/s
[22:13:57] + Received work.
[22:13:57] + Closed connections
[22:14:02] 
[22:14:02] + Processing work unit
[22:14:02] Core required: FahCore_a5.exe
[22:14:02] Core found.
[22:14:02] Working on queue slot 04 [April 1 22:14:02 UTC]
[22:14:02] + Working ...
[22:14:02] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 04 -np 24 -checkpoint 15 -verbose -lifeline 11393 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11410
thekraken: Logging to thekraken.log
[22:14:02] 
[22:14:02] *------------------------------*
[22:14:02] Folding@Home Gromacs SMP Core
[22:14:02] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:14:02] 
[22:14:02] Preparing to commence simulation
[22:14:02] - Looking at optimizations...
[22:14:02] - Created dyn
[22:14:02] - Files status OK
[22:14:02] Couldn't Decompress
[22:14:02] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:14:02] -Error: Couldn't update checksum variables
[22:14:02] Error: Could not open work file
[22:14:02] 
[22:14:02] Folding@home Core Shutdown: FILE_IO_ERROR
[22:14:03] CoreStatus = 75 (117)
[22:14:03] Error opening or reading from a file.
[22:14:03] Deleting current work unit & continuing...
thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11413
thekraken: Logging to thekraken.log
[22:14:03] Trying to send all finished work units
[22:14:03] + No unsent completed units remaining.
[22:14:03] - Preparing to get new work unit...
[22:14:03] Cleaning up work directory
[22:14:03] + Attempting to get work packet
[22:14:03] Passkey found
[22:14:03] - Will indicate memory of 16078 MB
[22:14:03] - Connecting to assignment server
[22:14:03] Connecting to http://assign.stanford.edu:8080/
[22:14:03] Posted data.
[22:14:03] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[22:14:03] + News From Folding@Home: Welcome to Folding@Home
[22:14:03] Loaded queue successfully.
[22:14:03] Sent data
[22:14:03] Connecting to http://128.143.231.201:8080/
[22:14:03] Posted data.
[22:14:03] Initial: 0000; - Receiving payload (expected size: 512)
[22:14:03] Conversation time very short, giving reduced weight in bandwidth avg
[22:14:03] - Downloaded at ~1 kB/s
[22:14:03] - Averaged speed for that direction ~26 kB/s
[22:14:03] + Received work.
[22:14:03] + Closed connections
[22:14:08] 
[22:14:08] + Processing work unit
[22:14:08] Core required: FahCore_a5.exe
[22:14:08] Core found.
[22:14:08] Working on queue slot 05 [April 1 22:14:08 UTC]
[22:14:08] + Working ...
[22:14:08] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 05 -np 24 -checkpoint 15 -verbose -lifeline 11393 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11424
thekraken: Logging to thekraken.log
[22:14:08] 
[22:14:08] *------------------------------*
[22:14:08] Folding@Home Gromacs SMP Core
[22:14:08] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:14:08] 
[22:14:08] Preparing to commence simulation
[22:14:08] - Looking at optimizations...
[22:14:08] - Created dyn
[22:14:08] - Files status OK
[22:14:08] Couldn't Decompress
[22:14:08] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:14:08] -Error: Couldn't update checksum variables
[22:14:08] Error: Could not open work file
[22:14:08] 
[22:14:08] Folding@home Core Shutdown: FILE_IO_ERROR
[22:14:09] CoreStatus = 75 (117)
[22:14:09] Error opening or reading from a file.
[22:14:09] Deleting current work unit & continuing...
thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11427
thekraken: Logging to thekraken.log
[22:14:09] Trying to send all finished work units
[22:14:09] + No unsent completed units remaining.
[22:14:09] - Preparing to get new work unit...
[22:14:09] Cleaning up work directory
[22:14:09] + Attempting to get work packet
[22:14:09] Passkey found
[22:14:09] - Will indicate memory of 16078 MB
[22:14:09] - Connecting to assignment server
[22:14:09] Connecting to http://assign.stanford.edu:8080/
[22:14:09] Posted data.
[22:14:09] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[22:14:09] + News From Folding@Home: Welcome to Folding@Home
[22:14:09] Loaded queue successfully.
[22:14:09] Sent data
[22:14:09] Connecting to http://128.143.231.201:8080/
[22:14:09] Posted data.
[22:14:09] Initial: 0000; - Receiving payload (expected size: 512)
[22:14:09] Conversation time very short, giving reduced weight in bandwidth avg
[22:14:09] - Downloaded at ~1 kB/s
[22:14:09] - Averaged speed for that direction ~23 kB/s
[22:14:09] + Received work.
[22:14:09] + Closed connections
[22:14:14] 
[22:14:14] + Processing work unit
[22:14:14] Core required: FahCore_a5.exe
[22:14:14] Core found.
[22:14:14] Working on queue slot 06 [April 1 22:14:14 UTC]
[22:14:14] + Working ...
[22:14:14] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 06 -np 24 -checkpoint 15 -verbose -lifeline 11393 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11436
thekraken: Logging to thekraken.log
[22:14:15] 
[22:14:15] *------------------------------*
[22:14:15] Folding@Home Gromacs SMP Core
[22:14:15] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:14:15] 
[22:14:15] Preparing to commence simulation
[22:14:15] - Looking at optimizations...
[22:14:15] - Created dyn
[22:14:15] - Files status OK
[22:14:15] Couldn't Decompress
[22:14:15] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:14:15] -Error: Couldn't update checksum variables
[22:14:15] Error: Could not open work file
[22:14:15] 
[22:14:15] Folding@home Core Shutdown: FILE_IO_ERROR
[22:14:15] CoreStatus = 75 (117)
[22:14:15] Error opening or reading from a file.
[22:14:15] Deleting current work unit & continuing...
thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11439
thekraken: Logging to thekraken.log
[22:14:15] Trying to send all finished work units
[22:14:15] + No unsent completed units remaining.
[22:14:15] - Preparing to get new work unit...
[22:14:15] Cleaning up work directory
[22:14:15] + Attempting to get work packet
[22:14:15] Passkey found
[22:14:15] - Will indicate memory of 16078 MB
[22:14:15] - Connecting to assignment server
[22:14:15] Connecting to http://assign.stanford.edu:8080/
[22:14:15] Posted data.
[22:14:15] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[22:14:15] + News From Folding@Home: Welcome to Folding@Home
[22:14:15] Loaded queue successfully.
[22:14:15] Sent data
[22:14:15] Connecting to http://128.143.231.201:8080/
[22:14:16] Posted data.
[22:14:16] Initial: 0000; - Receiving payload (expected size: 512)
[22:14:16] Conversation time very short, giving reduced weight in bandwidth avg
[22:14:16] - Downloaded at ~1 kB/s
[22:14:16] - Averaged speed for that direction ~20 kB/s
[22:14:16] + Received work.
[22:14:16] + Closed connections
^C[22:14:16] ***** Got an Activate signal (2)
[22:14:16] Killing all core threads

The queueinfo indicates that it is getting assigned a BA unit, it just seems to get bad data to retrieve it:

Code: Select all

[22:23:14] Loaded queue successfully.
[22:23:14] Printing Queue Information
Current Queue: 
Slot 08  Empty/Deleted
Project: 8571 (Run 0, Clone 5, Gen 499), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 1 01:56:55
Finished date: April 1 08:46:10

Slot 09  Empty/Deleted
Project: 8568 (Run 1, Clone 3, Gen 328), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 1 08:50:21
Finished date: April 1 15:24:53

Slot 00  Empty/Deleted
Project: 8577 (Run 1, Clone 7, Gen 325), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 1 15:35:24
Finished date: April 1 22:07:36

Slot 01  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:13:34
Finished date: January 1 00:00:00

Slot 02  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:13:35
Finished date: January 1 00:00:00

Slot 03  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:13:41
Finished date: January 1 00:00:00

Slot 04  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:13:57
Finished date: January 1 00:00:00

Slot 05  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:14:03
Finished date: January 1 00:00:00

Slot 06  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:14:09
Finished date: January 1 00:00:00

Slot 07 *Ready    
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:14:16
Deadline date: April 5 22:14:16

PF: 0.979206 based on last 4 slot(s)
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 128.143.231.201 (BA) acting up?

Post by bruce »

I was told that a problem with bad WUs was fixed a few hours ago. Have you rebooted/reset everything and let the client start fresh? After that, if the problem persists, let us know.

The server 128.143.231.201 has very few WUs. Shouldn't you be redirected to another BA server?
PinHead
Posts: 285
Joined: Tue Jan 24, 2012 3:43 am
Hardware configuration: Quad Q9550 2.83 contains the GPU 57xx - running SMP and GPU
Quad Q6700 2.66 running just SMP
2P 32core Interlagos SMP on linux

Re: 128.143.231.201 (BA) acting up?

Post by PinHead »

Ok, I powered off, switched the power supply off and let all residual energy drain. After restart I removed the work folder, deleted the queue.dat, machinedependent.dat and unitinfo.txt file.

This time, the steps seemed to have worked and I am getting an expected download size of more than 512.

Thanks bruce!
Macaholic
Site Moderator
Posts: 811
Joined: Thu Nov 29, 2007 11:57 pm
Location: 1 Infinite Loop

Re: 128.143.231.201 (BA) acting up?

Post by Macaholic »

bruce wrote:I was told that a problem with bad WUs was fixed a few hours ago. Have you rebooted/reset everything and let the client start fresh? After that, if the problem persists, let us know.

The server 128.143.231.201 has very few WUs. Shouldn't you be redirected to another BA server?
Not fixed. Units are still there.

Code: Select all

[15:19:51] + Attempting to send results [April 2 15:19:51 UTC]
[15:19:51] - Reading file work/wuresults_05.dat from core
[15:19:51]   (Read 91401212 bytes from disk)
[15:19:51] Connecting to http://128.143.231.201:8080/
[15:36:56] Posted data.
[15:36:56] Initial: 0000; - Uploaded at ~87 kB/s
[15:36:56] - Averaged speed for that direction ~86 kB/s
[15:36:56] + Results successfully sent
[15:36:56] Thank you for your contribution to Folding@Home.
[15:36:56] + Number of Units Completed: 652

[15:53:59] Trying to send all finished work units
[15:53:59] + No unsent completed units remaining.
[15:53:59] - Preparing to get new work unit...
[15:53:59] Cleaning up work directory
[15:59:59] + Attempting to get work packet
[15:59:59] Passkey found
[15:59:59] - Will indicate memory of 32233 MB
[15:59:59] - Connecting to assignment server
[15:59:59] Connecting to http://assign.stanford.edu:8080/
[16:00:22] Posted data.
[16:00:22] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[16:00:22] + News From Folding@Home: Welcome to Folding@Home
[16:00:22] Loaded queue successfully.
[16:00:22] Sent data
[16:00:22] Connecting to http://128.143.231.201:8080/
[16:00:31] Posted data.
[16:00:31] Initial: 0000; - Receiving payload (expected size: 512)
[16:00:31] Conversation time very short, giving reduced weight in bandwidth avg
[16:00:31] - Downloaded at ~1 kB/s
[16:00:31] - Averaged speed for that direction ~360 kB/s
[16:00:31] + Received work.
[16:00:31] Trying to send all finished work units
[16:00:31] + No unsent completed units remaining.
[16:00:31] + Closed connections
[16:00:31] 
[16:00:31] + Processing work unit
[16:00:31] Core required: FahCore_a5.exe
[16:00:31] Core found.
[16:00:31] Working on queue slot 06 [April 2 16:00:31 UTC]
[16:00:31] + Working ...
[16:00:31] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 06 -np 48 -checkpoint 15 -forceasm -verbose -lifeline 2762 -version 634'

[16:00:32] 
[16:00:32] *------------------------------*
[16:00:32] Folding@Home Gromacs SMP Core
[16:00:32] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[16:00:32] 
[16:00:32] Preparing to commence simulation
[16:00:32] - Assembly optimizations manually forced on.
[16:00:32] - Not checking prior termination.
[16:00:32] Couldn't Decompress
[16:00:32] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[16:00:32] -Error: Couldn't update checksum variables
[16:00:32] Error: Could not open work file
[16:00:32] 
[16:00:32] Folding@home Core Shutdown: FILE_IO_ERROR
[16:00:32] CoreStatus = 75 (117)
[16:00:32] Error opening or reading from a file.
[16:00:32] Deleting current work unit & continuing...
[16:00:32] Trying to send all finished work units
[16:00:32] + No unsent completed units remaining.
[16:00:32] - Preparing to get new work unit...
[16:00:32] Cleaning up work directory
[16:06:30] + Attempting to get work packet
[16:06:30] Passkey found
[16:06:30] - Will indicate memory of 32233 MB
[16:06:30] - Connecting to assignment server
[16:06:30] Connecting to http://assign.stanford.edu:8080/
[16:06:31] Posted data.
[16:06:31] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[16:06:31] + News From Folding@Home: Welcome to Folding@Home
[16:06:31] Loaded queue successfully.
[16:06:31] Sent data
[16:06:31] Connecting to http://128.143.231.201:8080/
[16:06:31] Posted data.
[16:06:31] Initial: 0000; - Receiving payload (expected size: 512)
[16:06:31] Conversation time very short, giving reduced weight in bandwidth avg
[16:06:31] - Downloaded at ~1 kB/s
[16:06:31] - Averaged speed for that direction ~320 kB/s
[16:06:31] + Received work.
[16:06:31] + Closed connections
[16:06:36] 
[16:06:36] + Processing work unit
[16:06:36] Core required: FahCore_a5.exe
[16:06:36] Core found.
[16:06:36] Working on queue slot 07 [April 2 16:06:36 UTC]
[16:06:36] + Working ...
[16:06:36] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 07 -np 48 -checkpoint 15 -forceasm -verbose -lifeline 2762 -version 634'

[16:06:36] 
[16:06:36] *------------------------------*
[16:06:36] Folding@Home Gromacs SMP Core
[16:06:36] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[16:06:36] 
[16:06:36] Preparing to commence simulation
[16:06:36] - Assembly optimizations manually forced on.
[16:06:36] - Not checking prior termination.
[16:06:36] Couldn't Decompress
[16:06:36] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[16:06:36] -Error: Couldn't update checksum variables
[16:06:36] Error: Could not open work file
[16:06:36] 
[16:06:36] Folding@home Core Shutdown: FILE_IO_ERROR
[16:06:36] CoreStatus = 75 (117)
[16:06:36] Error opening or reading from a file.
[16:06:36] Deleting current work unit & continuing...
[16:06:36] Trying to send all finished work units
[16:06:36] + No unsent completed units remaining.
[16:06:36] - Preparing to get new work unit...
[16:06:36] Cleaning up work directory
[16:07:46] ***** Got an Activate signal (2)
[16:07:46] Killing all core threads

Folding@Home Client Shutdown.
Fold! It does a body good!™
-alias-
Posts: 121
Joined: Sun Feb 22, 2009 1:20 pm

Re: 128.143.231.201 (BA) acting up?

Post by -alias- »

I have the same problem randomly with all my 6 servers, and it have been going on for over a week now. This is what happend. When this this incident occurs I delete everything but not the config, and starting a new fresh fah, and this can run ok for several WUs before it happend again. Clip from the latest log:

Code: Select all

[06:42:53] Trying to send all finished work units
[06:42:53] + No unsent completed units remaining.
[06:42:53] - Preparing to get new work unit...
[06:42:53] Cleaning up work directory
[06:42:53] + Attempting to get work packet
[06:42:53] Passkey found
[06:42:53] - Will indicate memory of 32217 MB
[06:42:53] - Connecting to assignment server
[06:42:53] Connecting to http://assign.stanford.edu:8080/
[06:42:54] Posted data.
[06:42:54] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[06:42:54] + News From Folding@Home: Welcome to Folding@Home
[06:42:54] Loaded queue successfully.
[06:42:54] Sent data
[06:42:54] Connecting to http://128.143.231.201:8080/
[06:42:55] Posted data.
[06:42:55] Initial: 0000; - Receiving payload (expected size: 512)
[06:42:55] Conversation time very short, giving reduced weight in bandwidth avg
[06:42:55] - Downloaded at ~1 kB/s
[06:42:55] - Averaged speed for that direction ~1 kB/s
[06:42:55] + Received work.
[06:42:55] + Closed connections
[06:43:00] 
[06:43:00] + Processing work unit
[06:43:00] Core required: FahCore_a5.exe
[06:43:00] Core found.
[06:43:00] Working on queue slot 07 [April 3 06:43:00 UTC]
[06:43:00] + Working ...
[06:43:00] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 07 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[06:43:00] 
[06:43:00] *------------------------------*
[06:43:00] Folding@Home Gromacs SMP Core
[06:43:00] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[06:43:00] 
[06:43:00] Preparing to commence simulation
[06:43:00] - Looking at optimizations...
[06:43:00] - Created dyn
[06:43:00] - Files status OK
[06:43:00] Couldn't Decompress
[06:43:00] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[06:43:00] -Error: Couldn't update checksum variables
[06:43:00] Error: Could not open work file
[06:43:00] 
[06:43:00] Folding@home Core Shutdown: FILE_IO_ERROR
[06:43:00] CoreStatus = 75 (117)
[06:43:00] Error opening or reading from a file.
[06:43:00] Deleting current work unit & continuing...
[06:43:00] Trying to send all finished work units
[06:43:00] + No unsent completed units remaining.
[06:43:00] - Preparing to get new work unit...
If I am not there to stop it, this could go on for several hours, until I detects it, deletes it and start a new fresch fah again!

To me, this look like bad WUs over and over again from server http://128.143.231.201

To show my point, I print a clip from the same log that shows that Project: 8583 (Run 0, Clone 1, Gen 477) is downloaded from server http://128.143.231.202 and folding normal before the bad WU occors again, as before P8583 came down.

Code: Select all

[01:28:17] + News From Folding@Home: Welcome to Folding@Home
[01:28:17] Loaded queue successfully.
[01:28:17] Sent data
[01:28:17] Connecting to http://128.143.231.201:8080/
[01:28:18] Posted data.
[01:28:18] Initial: 0000; - Receiving payload (expected size: 512)
[01:28:18] Conversation time very short, giving reduced weight in bandwidth avg
[01:28:18] - Downloaded at ~1 kB/s
[01:28:18] - Averaged speed for that direction ~1 kB/s
[01:28:18] + Received work.
[01:28:18] + Closed connections
[01:28:23] 
[01:28:23] + Processing work unit
[01:28:23] Core required: FahCore_a5.exe
[01:28:23] Core found.
[01:28:23] Working on queue slot 01 [April 3 01:28:23 UTC]
[01:28:23] + Working ...
[01:28:23] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 01 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:28:23] 
[01:28:23] *------------------------------*
[01:28:23] Folding@Home Gromacs SMP Core
[01:28:23] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[01:28:23] 
[01:28:23] Preparing to commence simulation
[01:28:23] - Looking at optimizations...
[01:28:23] - Created dyn
[01:28:23] - Files status OK
[01:28:23] Couldn't Decompress
[01:28:23] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[01:28:23] -Error: Couldn't update checksum variables
[01:28:23] Error: Could not open work file
[01:28:23] 
[01:28:23] Folding@home Core Shutdown: FILE_IO_ERROR
[01:28:23] CoreStatus = 75 (117)
[01:28:23] Error opening or reading from a file.
[01:28:23] Deleting current work unit & continuing...
[01:28:23] Trying to send all finished work units
[01:28:23] + No unsent completed units remaining.
[01:28:23] - Preparing to get new work unit...
[01:28:23] Cleaning up work directory
[01:28:23] + Attempting to get work packet
[01:28:23] Passkey found
[01:28:23] - Will indicate memory of 32217 MB
[01:28:23] - Connecting to assignment server
[01:28:23] Connecting to http://assign.stanford.edu:8080/
[01:28:24] Posted data.
[01:28:24] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[01:28:24] + News From Folding@Home: Welcome to Folding@Home
[01:28:24] Loaded queue successfully.
[01:28:24] Sent data
[01:28:24] Connecting to http://128.143.231.201:8080/
[01:28:25] Posted data.
[01:28:25] Initial: 0000; - Receiving payload (expected size: 512)
[01:28:25] Conversation time very short, giving reduced weight in bandwidth avg
[01:28:25] - Downloaded at ~1 kB/s
[01:28:25] - Averaged speed for that direction ~1 kB/s
[01:28:25] + Received work.
[01:28:25] + Closed connections
[01:28:30] 
[01:28:30] + Processing work unit
[01:28:30] Core required: FahCore_a5.exe
[01:28:30] Core found.
[01:28:30] Working on queue slot 02 [April 3 01:28:30 UTC]
[01:28:30] + Working ...
[01:28:30] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 02 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:28:30] 
[01:28:30] *------------------------------*
[01:28:30] Folding@Home Gromacs SMP Core
[01:28:30] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[01:28:30] 
[01:28:30] Preparing to commence simulation
[01:28:30] - Looking at optimizations...
[01:28:30] - Created dyn
[01:28:30] - Files status OK
[01:28:30] Couldn't Decompress
[01:28:30] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[01:28:30] -Error: Couldn't update checksum variables
[01:28:30] Error: Could not open work file
[01:28:30] 
[01:28:30] Folding@home Core Shutdown: FILE_IO_ERROR
[01:28:30] CoreStatus = 75 (117)
[01:28:30] Error opening or reading from a file.
[01:28:30] Deleting current work unit & continuing...
[01:28:30] Trying to send all finished work units
[01:28:30] + No unsent completed units remaining.
[01:28:30] - Preparing to get new work unit...
[01:28:30] Cleaning up work directory
[01:28:30] + Attempting to get work packet
[01:28:30] Passkey found
[01:28:30] - Will indicate memory of 32217 MB
[01:28:30] - Connecting to assignment server
[01:28:30] Connecting to http://assign.stanford.edu:8080/
[01:28:31] Posted data.
[01:28:31] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[01:28:31] + News From Folding@Home: Welcome to Folding@Home
[01:28:31] Loaded queue successfully.
[01:28:31] Sent data
[01:28:31] Connecting to http://128.143.231.201:8080/
[01:28:32] Posted data.
[01:28:32] Initial: 0000; - Receiving payload (expected size: 512)
[01:28:32] Conversation time very short, giving reduced weight in bandwidth avg
[01:28:32] - Downloaded at ~1 kB/s
[01:28:32] - Averaged speed for that direction ~1 kB/s
[01:28:32] + Received work.
[01:28:32] + Closed connections
[01:28:37] 
[01:28:37] + Processing work unit
[01:28:37] Core required: FahCore_a5.exe
[01:28:37] Core found.
[01:28:37] Working on queue slot 03 [April 3 01:28:37 UTC]
[01:28:37] + Working ...
[01:28:37] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 03 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:28:37] 
[01:28:37] *------------------------------*
[01:28:37] Folding@Home Gromacs SMP Core
[01:28:37] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[01:28:37] 
[01:28:37] Preparing to commence simulation
[01:28:37] - Looking at optimizations...
[01:28:37] - Created dyn
[01:28:37] - Files status OK
[01:28:37] Couldn't Decompress
[01:28:37] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[01:28:37] -Error: Couldn't update checksum variables
[01:28:37] Error: Could not open work file
[01:28:37] 
[01:28:37] Folding@home Core Shutdown: FILE_IO_ERROR
[01:28:37] CoreStatus = 75 (117)
[01:28:37] Error opening or reading from a file.
[01:28:37] Deleting current work unit & continuing...
[01:28:37] Trying to send all finished work units
[01:28:37] + No unsent completed units remaining.
[01:28:37] - Preparing to get new work unit...
[01:28:37] Cleaning up work directory
[01:28:37] + Attempting to get work packet
[01:28:37] Passkey found
[01:28:37] - Will indicate memory of 32217 MB
[01:28:37] - Connecting to assignment server
[01:28:37] Connecting to http://assign.stanford.edu:8080/
[01:28:38] Posted data.
[01:28:38] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[01:28:38] + News From Folding@Home: Welcome to Folding@Home
[01:28:38] Loaded queue successfully.
[01:28:38] Sent data
[01:28:38] Connecting to http://128.143.231.201:8080/
[01:28:39] Posted data.
[01:28:39] Initial: 0000; - Receiving payload (expected size: 512)
[01:28:39] Conversation time very short, giving reduced weight in bandwidth avg
[01:28:39] - Downloaded at ~1 kB/s
[01:28:39] - Averaged speed for that direction ~1 kB/s
[01:28:39] + Received work.
[01:28:39] + Closed connections
[01:28:44] 
[01:28:44] + Processing work unit
[01:28:44] Core required: FahCore_a5.exe
[01:28:44] Core found.
[01:28:44] Working on queue slot 04 [April 3 01:28:44 UTC]
[01:28:44] + Working ...
[01:28:44] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 04 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:28:44] 
[01:28:44] *------------------------------*
[01:28:44] Folding@Home Gromacs SMP Core
[01:28:44] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[01:28:44] 
[01:28:44] Preparing to commence simulation
[01:28:44] - Looking at optimizations...
[01:28:44] - Created dyn
[01:28:44] - Files status OK
[01:28:44] Couldn't Decompress
[01:28:44] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[01:28:44] -Error: Couldn't update checksum variables
[01:28:44] Error: Could not open work file
[01:28:44] 
[01:28:44] Folding@home Core Shutdown: FILE_IO_ERROR
[01:28:44] CoreStatus = 75 (117)
[01:28:44] Error opening or reading from a file.
[01:28:44] Deleting current work unit & continuing...
[01:28:44] Trying to send all finished work units
[01:28:44] + No unsent completed units remaining.
[01:28:44] - Preparing to get new work unit...
[01:28:44] Cleaning up work directory
[01:28:44] + Attempting to get work packet
[01:28:44] Passkey found
[01:28:44] - Will indicate memory of 32217 MB
[01:28:44] - Connecting to assignment server
[01:28:44] Connecting to http://assign.stanford.edu:8080/
[01:40:00] - Couldn't send HTTP request to server
[01:40:00] + Could not connect to Assignment Server
[01:40:00] Connecting to http://assign2.stanford.edu:80/
[01:40:02] Posted data.
[01:40:02] Initial: 8F80; - Successful: assigned to (128.143.231.202).
[01:40:02] + News From Folding@Home: Welcome to Folding@Home
[01:40:02] Loaded queue successfully.
[01:40:02] Sent data
[01:40:02] Connecting to http://128.143.231.202:80/
[01:40:03] Posted data.
[01:40:03] Initial: 0000; - Receiving payload (expected size: 3848746)
[01:40:12] - Downloaded at ~417 kB/s
[01:40:12] - Averaged speed for that direction ~84 kB/s
[01:40:12] + Received work.
[01:40:12] + Closed connections
[01:40:17] 
[01:40:17] + Processing work unit
[01:40:17] Core required: FahCore_a3.exe
[01:40:17] Core found.
[01:40:17] Working on queue slot 05 [April 3 01:40:17 UTC]
[01:40:17] + Working ...
[01:40:17] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 05 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:40:17] 
[01:40:17] *------------------------------*
[01:40:17] Folding@Home Gromacs SMP Core
[01:40:17] Version 2.27 (Dec. 15, 2010)
[01:40:17] 
[01:40:17] Preparing to commence simulation
[01:40:17] - Looking at optimizations...
[01:40:17] - Created dyn
[01:40:17] - Files status OK
[01:40:18] - Expanded 3848234 -> 4382484 (decompressed 113.8 percent)
[01:40:18] Called DecompressByteArray: compressed_data_size=3848234 data_size=4382484, decompressed_data_size=4382484 diff=0
[01:40:18] - Digital signature verified
[01:40:18] 
[01:40:18] Project: 8583 (Run 0, Clone 1, Gen 477)
[01:40:18] 
[01:40:18] Assembly optimizations on if available.
[01:40:18] Entering M.D.
[01:40:24] Mapping NT from 64 to 64 
[01:40:25] Completed 0 out of 500000 steps  (0%)
[01:42:30] Completed 5000 out of 500000 steps  (1%)
[01:44:44] Completed 10000 out of 500000 steps  (2%)
[01:46:52] Completed 15000 out of 500000 steps  (3%)
[01:48:59] Completed 20000 out of 500000 steps  (4%)
[01:51:00] Completed 25000 out of 500000 steps  (5%)
[01:52:59] Completed 30000 out of 500000 steps  (6%)
[01:54:56] Completed 35000 out of 500000 steps  (7%)
[01:56:57] Completed 40000 out of 500000 steps  (8%)
[01:58:57] Completed 45000 out of 500000 steps  (9%)
[02:00:56] Completed 50000 out of 500000 steps  (10%)
[02:02:57] Completed 55000 out of 500000 steps  (11%)
[02:04:56] Completed 60000 out of 500000 steps  (12%)
[02:06:58] Completed 65000 out of 500000 steps  (13%)
[02:08:59] Completed 70000 out of 500000 steps  (14%)
[02:11:01] Completed 75000 out of 500000 steps  (15%)
[02:13:00] Completed 80000 out of 500000 steps  (16%)
[02:14:58] Completed 85000 out of 500000 steps  (17%)
[02:16:56] Completed 90000 out of 500000 steps  (18%)
[02:18:55] Completed 95000 out of 500000 steps  (19%)
[02:20:53] Completed 100000 out of 500000 steps  (20%)
[02:22:57] Completed 105000 out of 500000 steps  (21%)
[02:25:01] Completed 110000 out of 500000 steps  (22%)
[02:27:02] Completed 115000 out of 500000 steps  (23%)
[02:29:03] Completed 120000 out of 500000 steps  (24%)
[02:31:03] Completed 125000 out of 500000 steps  (25%)
[02:33:02] Completed 130000 out of 500000 steps  (26%)
[02:35:02] Completed 135000 out of 500000 steps  (27%)
[02:37:01] Completed 140000 out of 500000 steps  (28%)
[02:39:02] Completed 145000 out of 500000 steps  (29%)
[02:41:01] Completed 150000 out of 500000 steps  (30%)
[02:43:00] Completed 155000 out of 500000 steps  (31%)
[02:44:57] Completed 160000 out of 500000 steps  (32%)
[02:46:55] Completed 165000 out of 500000 steps  (33%)
[02:48:54] Completed 170000 out of 500000 steps  (34%)
[02:51:21] Completed 175000 out of 500000 steps  (35%)
[02:53:20] Completed 180000 out of 500000 steps  (36%)
[02:55:20] Completed 185000 out of 500000 steps  (37%)
[02:57:29] Completed 190000 out of 500000 steps  (38%)
[02:59:28] Completed 195000 out of 500000 steps  (39%)
[03:01:29] Completed 200000 out of 500000 steps  (40%)
[03:03:30] Completed 205000 out of 500000 steps  (41%)
[03:05:35] Completed 210000 out of 500000 steps  (42%)
[03:07:37] Completed 215000 out of 500000 steps  (43%)
[03:09:38] Completed 220000 out of 500000 steps  (44%)
[03:11:36] Completed 225000 out of 500000 steps  (45%)
[03:13:35] Completed 230000 out of 500000 steps  (46%)
[03:15:36] Completed 235000 out of 500000 steps  (47%)
[03:17:36] Completed 240000 out of 500000 steps  (48%)
[03:19:36] Completed 245000 out of 500000 steps  (49%)
[03:21:36] Completed 250000 out of 500000 steps  (50%)
[03:23:42] Completed 255000 out of 500000 steps  (51%)
[03:25:41] Completed 260000 out of 500000 steps  (52%)
[03:27:42] Completed 265000 out of 500000 steps  (53%)
[03:29:43] Completed 270000 out of 500000 steps  (54%)
[03:31:41] Completed 275000 out of 500000 steps  (55%)
[03:33:42] Completed 280000 out of 500000 steps  (56%)
[03:35:41] Completed 285000 out of 500000 steps  (57%)
[03:37:40] Completed 290000 out of 500000 steps  (58%)
[03:39:41] Completed 295000 out of 500000 steps  (59%)
[03:41:40] Completed 300000 out of 500000 steps  (60%)
[03:43:44] Completed 305000 out of 500000 steps  (61%)
[03:45:46] Completed 310000 out of 500000 steps  (62%)
[03:47:51] Completed 315000 out of 500000 steps  (63%)
[03:49:52] Completed 320000 out of 500000 steps  (64%)
[03:51:54] Completed 325000 out of 500000 steps  (65%)
[03:53:53] Completed 330000 out of 500000 steps  (66%)
[03:55:52] Completed 335000 out of 500000 steps  (67%)
[03:57:58] Completed 340000 out of 500000 steps  (68%)
[04:00:35] Completed 345000 out of 500000 steps  (69%)
[04:02:35] Completed 350000 out of 500000 steps  (70%)
[04:04:36] Completed 355000 out of 500000 steps  (71%)
[04:06:35] Completed 360000 out of 500000 steps  (72%)
[04:08:39] Completed 365000 out of 500000 steps  (73%)
[04:10:37] Completed 370000 out of 500000 steps  (74%)
[04:12:37] Completed 375000 out of 500000 steps  (75%)
[04:14:39] Completed 380000 out of 500000 steps  (76%)
[04:17:02] Completed 385000 out of 500000 steps  (77%)
[04:19:01] Completed 390000 out of 500000 steps  (78%)
[04:21:03] Completed 395000 out of 500000 steps  (79%)
[04:23:05] Completed 400000 out of 500000 steps  (80%)
[04:25:07] Completed 405000 out of 500000 steps  (81%)
[04:27:09] Completed 410000 out of 500000 steps  (82%)
[04:29:09] Completed 415000 out of 500000 steps  (83%)
[04:31:09] Completed 420000 out of 500000 steps  (84%)
[04:33:10] Completed 425000 out of 500000 steps  (85%)
[04:35:16] Completed 430000 out of 500000 steps  (86%)
[04:37:17] Completed 435000 out of 500000 steps  (87%)
[04:39:19] Completed 440000 out of 500000 steps  (88%)
[04:41:31] Completed 445000 out of 500000 steps  (89%)
[04:43:35] Completed 450000 out of 500000 steps  (90%)
[04:46:00] Completed 455000 out of 500000 steps  (91%)
[04:48:01] Completed 460000 out of 500000 steps  (92%)
[04:50:00] Completed 465000 out of 500000 steps  (93%)
[04:52:00] Completed 470000 out of 500000 steps  (94%)
[04:53:59] Completed 475000 out of 500000 steps  (95%)
[04:55:59] Completed 480000 out of 500000 steps  (96%)
[04:58:00] Completed 485000 out of 500000 steps  (97%)
[05:00:00] Completed 490000 out of 500000 steps  (98%)
[05:02:01] Completed 495000 out of 500000 steps  (99%)
[05:04:04] Completed 500000 out of 500000 steps  (100%)
[05:04:06] DynamicWrapper: Finished Work Unit: sleep=10000
[05:04:16] 
[05:04:16] Finished Work Unit:
[05:04:16] - Reading up to 8055024 from "work/wudata_05.trr": Read 8055024
[05:04:16] trr file hash check passed.
[05:04:16] edr file hash check passed.
[05:04:16] logfile size: 61112
[05:04:16] Leaving Run
[05:04:18] - Writing 8152968 bytes of core data to disk...
[05:04:19] Done: 8152456 -> 7529428 (compressed to 92.3 percent)
[05:04:19]   ... Done.
[05:04:20] - Shutting down core
[05:04:20] 
[05:04:20] Folding@home Core Shutdown: FINISHED_UNIT
[05:04:20] CoreStatus = 64 (100)
[05:04:20] Unit 5 finished with 99 percent of time to deadline remaining.
[05:04:20] Updated performance fraction: 0.848119
[05:04:20] Sending work to server
[05:04:20] Project: 8583 (Run 0, Clone 1, Gen 477)


[05:04:20] + Attempting to send results [April 3 05:04:20 UTC]
[05:04:20] - Reading file work/wuresults_05.dat from core
[05:04:20]   (Read 7529940 bytes from disk)
[05:04:20] Connecting to http://128.143.231.202:8080/
[05:04:34] Posted data.
[05:04:34] Initial: 0000; - Uploaded at ~525 kB/s
[05:04:34] - Averaged speed for that direction ~556 kB/s
[05:04:34] + Results successfully sent
[05:04:34] Thank you for your contribution to Folding@Home.
[05:04:34] + Number of Units Completed: 857

[05:04:34] Trying to send all finished work units
[05:04:34] + No unsent completed units remaining.
[05:04:34] - Preparing to get new work unit...
[05:04:34] Cleaning up work directory
[05:04:34] + Attempting to get work packet
[05:04:34] Passkey found
[05:04:34] - Will indicate memory of 32217 MB
[05:04:34] - Connecting to assignment server
[05:04:34] Connecting to http://assign.stanford.edu:8080/
[05:04:36] Posted data.
[05:04:36] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[05:04:36] + News From Folding@Home: Welcome to Folding@Home
[05:04:36] Loaded queue successfully.
[05:04:36] Sent data
[05:04:36] Connecting to http://128.143.231.201:8080/
[05:04:36] Posted data.
[05:04:36] Initial: 0000; - Receiving payload (expected size: 512)
[05:04:36] Conversation time very short, giving reduced weight in bandwidth avg
[05:04:36] - Downloaded at ~1 kB/s
[05:04:36] - Averaged speed for that direction ~75 kB/s
[05:04:36] + Received work.
[05:04:36] Trying to send all finished work units
[05:04:36] + No unsent completed units remaining.
[05:04:36] + Closed connections
[05:04:36] 
[05:04:36] + Processing work unit
[05:04:36] Core required: FahCore_a5.exe
[05:04:36] Core found.
[05:04:36] Working on queue slot 06 [April 3 05:04:36 UTC]
[05:04:36] + Working ...
[05:04:36] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 06 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[05:04:36] 
[05:04:36] *------------------------------*
[05:04:36] Folding@Home Gromacs SMP Core
[05:04:36] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[05:04:36] 
[05:04:36] Preparing to commence simulation
[05:04:36] - Looking at optimizations...
[05:04:36] - Created dyn
[05:04:36] - Files status OK
[05:04:36] Couldn't Decompress
[05:04:36] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[05:04:36] -Error: Couldn't update checksum variables
[05:04:36] Error: Could not open work file
[05:04:36] 
[05:04:36] Folding@home Core Shutdown: FILE_IO_ERROR
[05:04:37] CoreStatus = 75 (117)
[05:04:37] Error opening or reading from a file.
[05:04:37] Deleting current work unit & continuing...
[05:04:37] Trying to send all finished work units
[05:04:37] + No unsent completed units remaining.
[05:04:37] - Preparing to get new work unit...
[05:04:37] Cleaning up work directory
[05:04:37] + Attempting to get work packet
EXT64
Posts: 323
Joined: Mon Apr 09, 2012 11:54 pm

Re: 128.143.231.201 (BA) acting up?

Post by EXT64 »

I am also getting this randomly as well. Also seemingly random, sometimes after some failures it will pickup an SMP, then start failing again. Not a big deal as the SMP run pretty fast, so I just wait until they are completed and delete the previously mentioned files.
Nathan_P
Posts: 1164
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 [email protected] Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 [email protected] Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: 128.143.231.201 (BA) acting up?

Post by Nathan_P »

I've given up, until its fixed both my machines are on SMP, which is still nice at 300k PPD and a lot less strain on the net connection
Image
-alias-
Posts: 121
Joined: Sun Feb 22, 2009 1:20 pm

Re: 128.143.231.201 (BA) acting up?

Post by -alias- »

It does not seem to be a priority to fix this problem at PG, when no one takes the time to comment on the issue properly? I think I give up also, but I choose to shut them all down if this does not stop very soon, or maybe this is a way to get rid of us BA-folders sooner.
bollix47
Posts: 2959
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: 128.143.231.201 (BA) acting up?

Post by bollix47 »

I've had no problems with my bigadv setup but I am using v7. No bad WUs and no switching to SMP. Is anyone that is using v7 having a problem? It might help if we can narrow the focus for PG.
Image
Nicolas_orleans
Posts: 114
Joined: Wed Aug 08, 2012 3:08 am

Re: 128.143.231.201 (BA) acting up?

Post by Nicolas_orleans »

(until now) no issues with v6 + Langouste
MSI Z77A-GD55 - Core i5-3550 - PNY RTX 4080 Super @ 2715 MHz - Ubuntu 24.04 - 6.8 kernel
MSI MPG B550 - Ryzen 5 5600X - EVGA GTX 980 Ti Hybrid @ 1366 MHz - Ubuntu 24.04 - 6.8 kernel
Post Reply