130.237.232.237 going down for maintenance

Moderators: Site Moderators, FAHC Science Team

Macaholic
Site Moderator
Posts: 811
Joined: Thu Nov 29, 2007 11:57 pm
Location: 1 Infinite Loop

Re: 130.237.232.237 going down for maintenance

Post by Macaholic »

kasson wrote: It looks like we might have a problem on 130.237.232.237; we're taking it down in order to investigate.
Further investigation might be needed:

Code:

[16:49:53] Completed 245000 out of 250000 steps  (98%)
[17:05:16] Completed 247500 out of 250000 steps  (99%)
[17:20:38] Completed 250000 out of 250000 steps  (100%)
[17:21:08] DynamicWrapper: Finished Work Unit: sleep=10000
[17:21:18] 
[17:21:18] Finished Work Unit:
[17:21:18] - Reading up to 121622496 from "work/wudata_04.trr": Read 121622496
[17:21:20] trr file hash check passed.
[17:21:20] - Reading up to 108761364 from "work/wudata_04.xtc": Read 108761364
[17:21:21] xtc file hash check passed.
[17:21:21] edr file hash check passed.
[17:21:21] logfile size: 209481
[17:21:21] Leaving Run
[17:21:24] - Writing 230766333 bytes of core data to disk...
[17:22:35] Done: 230765821 -> 222401801 (compressed to 3.3 percent)
[17:22:35]   ... Done.
[19:11:08] - Shutting down core
[19:11:08] 
[19:11:08] Folding@home Core Shutdown: FINISHED_UNIT
[19:22:42] CoreStatus = 64 (100)
[19:22:42] Unit 4 finished with 90 percent of time to deadline remaining.
[19:22:42] Updated performance fraction: 0.907647
[19:22:42] Sending work to server
[19:22:42] Project: 6903 (Run 11, Clone 12, Gen 91)


[19:22:42] + Attempting to send results [April 13 19:22:42 UTC]
[19:22:42] - Reading file work/wuresults_04.dat from core
[19:22:42]   (Read 222402313 bytes from disk)
[19:22:42] Connecting to http://130.237.232.237:8080/
[20:05:08] Posted data.
[20:05:08] Initial: 0000; - Uploaded at ~85 kB/s
[20:05:08] - Averaged speed for that direction ~81 kB/s
[20:05:08] + Results successfully sent
[20:05:08] Thank you for your contribution to Folding@Home.
[20:05:08] + Number of Units Completed: 115

[20:21:09] - Autosending finished units... [April 13 20:21:09 UTC]
[20:21:09] Trying to send all finished work units
[20:21:09] + No unsent completed units remaining.
[20:21:09] - Autosend completed
[20:49:59] Trying to send all finished work units
[20:49:59] + No unsent completed units remaining.
[20:49:59] - Preparing to get new work unit...
[20:49:59] Cleaning up work directory
[21:02:15] + Attempting to get work packet
[21:02:15] Passkey found
[21:02:15] - Will indicate memory of 48395 MB
[21:02:15] - Connecting to assignment server
[21:02:15] Connecting to http://assign.stanford.edu:8080/
[21:02:16] Posted data.
[21:02:16] Initial: ED82; - Successful: assigned to (130.237.232.237).
[21:02:16] + News From Folding@Home: Welcome to Folding@Home
[21:02:16] Loaded queue successfully.
[21:02:16] Sent data
[21:02:16] Connecting to http://130.237.232.237:8080/
[21:02:16] Posted data.
[21:02:16] Initial: 0000; - Receiving payload (expected size: 512)
[21:02:16] Conversation time very short, giving reduced weight in bandwidth avg
[21:02:16] - Downloaded at ~1 kB/s
[21:02:16] - Averaged speed for that direction ~91 kB/s
[21:02:16] + Received work.
[21:02:16] Trying to send all finished work units
[21:02:16] + No unsent completed units remaining.
[21:02:16] + Closed connections
[21:02:16] 
[21:02:16] + Processing work unit
[21:02:16] Core required: FahCore_a5.exe
[21:02:16] Core found.
[21:02:16] Working on queue slot 05 [April 13 21:02:16 UTC]
[21:02:16] + Working ...
[21:02:16] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 05 -np 64 -checkpoint 15 -forceasm -verbose -lifeline 1900 -version 634'

[21:02:16] 
[21:02:16] *------------------------------*
[21:02:16] Folding@Home Gromacs SMP Core
[21:02:16] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[21:02:16] 
[21:02:16] Preparing to commence simulation
[21:02:16] - Assembly optimizations manually forced on.
[21:02:16] - Not checking prior termination.
[21:02:16] Couldn't Decompress
[21:02:16] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[21:02:16] -Error: Couldn't update checksum variables
[21:02:16] Error: Could not open work file
[21:02:16] 
[21:02:16] Folding@home Core Shutdown: FILE_IO_ERROR
[21:02:17] CoreStatus = 75 (117)
[21:02:17] Error opening or reading from a file.
[21:02:17] Deleting current work unit & continuing...
[21:02:17] Trying to send all finished work units
[21:02:17] + No unsent completed units remaining.
[21:02:17] - Preparing to get new work unit...
[21:02:17] Cleaning up work directory
[21:07:34] + Attempting to get work packet
[21:07:34] Passkey found
[21:07:34] - Will indicate memory of 48395 MB
[21:07:34] - Connecting to assignment server
[21:07:34] Connecting to http://assign.stanford.edu:8080/
[21:07:34] Posted data.
[21:07:34] Initial: ED82; - Successful: assigned to (130.237.232.237).
[21:07:34] + News From Folding@Home: Welcome to Folding@Home
[21:07:34] Loaded queue successfully.
[21:07:34] Sent data
[21:07:34] Connecting to http://130.237.232.237:8080/
[21:07:35] Posted data.
[21:07:35] Initial: 0000; - Receiving payload (expected size: 512)
[21:07:35] Conversation time very short, giving reduced weight in bandwidth avg
[21:07:35] - Downloaded at ~1 kB/s
[21:07:35] - Averaged speed for that direction ~81 kB/s
[21:07:35] + Received work.
[21:07:35] + Closed connections
[21:07:40] 
[21:07:40] + Processing work unit
[21:07:40] Core required: FahCore_a5.exe
[21:07:40] Core found.
[21:07:40] Working on queue slot 06 [April 13 21:07:40 UTC]
[21:07:40] + Working ...
[21:07:40] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 06 -np 64 -checkpoint 15 -forceasm -verbose -lifeline 1900 -version 634'

[21:07:40] 
[21:07:40] *------------------------------*
[21:07:40] Folding@Home Gromacs SMP Core
[21:07:40] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[21:07:40] 
[21:07:40] Preparing to commence simulation
[21:07:40] - Assembly optimizations manually forced on.
[21:07:40] - Not checking prior termination.
[21:07:40] Couldn't Decompress
[21:07:40] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[21:07:40] -Error: Couldn't update checksum variables
[21:07:40] Error: Could not open work file
[21:07:40] 
[21:07:40] Folding@home Core Shutdown: FILE_IO_ERROR
[21:07:40] CoreStatus = 75 (117)
[21:07:40] Error opening or reading from a file.
[21:07:40] Deleting current work unit & continuing...
[21:07:40] Trying to send all finished work units
[21:07:40] + No unsent completed units remaining.
[21:07:40] - Preparing to get new work unit...
[21:07:40] Cleaning up work directory
[21:19:57] + Attempting to get work packet
[21:19:57] Passkey found
[21:19:57] - Will indicate memory of 48395 MB
[21:19:57] - Connecting to assignment server
[21:19:57] Connecting to http://assign.stanford.edu:8080/
[21:19:57] Posted data.
[21:19:57] Initial: ED82; - Successful: assigned to (130.237.232.237).
[21:19:57] + News From Folding@Home: Welcome to Folding@Home
[21:19:58] Loaded queue successfully.
[21:19:58] Sent data
[21:19:58] Connecting to http://130.237.232.237:8080/
[21:19:58] Posted data.
[21:19:58] Initial: 0000; - Receiving payload (expected size: 512)
[21:19:58] Conversation time very short, giving reduced weight in bandwidth avg
[21:19:58] - Downloaded at ~1 kB/s
[21:19:58] - Averaged speed for that direction ~72 kB/s
[21:19:58] + Received work.
[21:19:58] + Closed connections
[21:20:03] 
[21:20:03] + Processing work unit
[21:20:03] Core required: FahCore_a5.exe
[21:20:03] Core found.
[21:20:03] Working on queue slot 07 [April 13 21:20:03 UTC]
[21:20:03] + Working ...
[21:20:03] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 07 -np 64 -checkpoint 15 -forceasm -verbose -lifeline 1900 -version 634'

[21:20:03] 
[21:20:03] *------------------------------*
[21:20:03] Folding@Home Gromacs SMP Core
[21:20:03] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[21:20:03] 
[21:20:03] Preparing to commence simulation
[21:20:03] - Assembly optimizations manually forced on.
[21:20:03] - Not checking prior termination.
[21:20:03] Couldn't Decompress
[21:20:03] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[21:20:03] -Error: Couldn't update checksum variables
[21:20:03] Error: Could not open work file
[21:20:03] 
[21:20:03] Folding@home Core Shutdown: FILE_IO_ERROR
[21:20:03] CoreStatus = 75 (117)
[21:20:03] Error opening or reading from a file.
[21:20:03] Deleting current work unit & continuing...
[21:20:04] Trying to send all finished work units
[21:20:04] + No unsent completed units remaining.
[21:20:04] - Preparing to get new work unit...
[21:20:04] Cleaning up work directory
[21:32:12] + Attempting to get work packet
[21:32:12] Passkey found
[21:32:12] - Will indicate memory of 48395 MB
[21:32:12] - Connecting to assignment server
[21:32:12] Connecting to http://assign.stanford.edu:8080/
[21:32:13] Posted data.
[21:32:13] Initial: ED82; - Successful: assigned to (130.237.232.237).
[21:32:13] + News From Folding@Home: Welcome to Folding@Home
[21:32:13] Loaded queue successfully.
[21:32:13] Sent data
[21:32:13] Connecting to http://130.237.232.237:8080/
[21:32:13] Posted data.
[21:32:13] Initial: 0000; - Receiving payload (expected size: 512)
[21:32:13] Conversation time very short, giving reduced weight in bandwidth avg
[21:32:13] - Downloaded at ~1 kB/s
[21:32:13] - Averaged speed for that direction ~64 kB/s
[21:32:13] + Received work.
[21:32:13] + Closed connections
[21:32:18] 
[21:32:18] + Processing work unit
[21:32:18] Core required: FahCore_a5.exe
[21:32:18] Core found.
[21:32:18] Working on queue slot 08 [April 13 21:32:18 UTC]
[21:32:18] + Working ...
[21:32:18] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 08 -np 64 -checkpoint 15 -forceasm -verbose -lifeline 1900 -version 634'

[21:32:19] 
[21:32:19] *------------------------------*
[21:32:19] Folding@Home Gromacs SMP Core
[21:32:19] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[21:32:19] 
[21:32:19] Preparing to commence simulation
[21:32:19] - Assembly optimizations manually forced on.
[21:32:19] - Not checking prior termination.
[21:32:19] Couldn't Decompress
[21:32:19] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[21:32:19] -Error: Couldn't update checksum variables
[21:32:19] Error: Could not open work file
[21:32:19] 
[21:32:19] Folding@home Core Shutdown: FILE_IO_ERROR
[21:32:19] CoreStatus = 75 (117)
[21:32:19] Error opening or reading from a file.
[21:32:19] Deleting current work unit & continuing...
[21:32:19] Trying to send all finished work units
[21:32:19] + No unsent completed units remaining.
[21:32:19] - Preparing to get new work unit...
[21:32:19] Cleaning up work directory
[21:44:34] + Attempting to get work packet
[21:44:34] Passkey found
[21:44:34] - Will indicate memory of 48395 MB
[21:44:34] - Connecting to assignment server
[21:44:34] Connecting to http://assign.stanford.edu:8080/
[21:44:34] Posted data.
[21:44:34] Initial: ED82; - Successful: assigned to (130.237.232.237).
[21:44:34] + News From Folding@Home: Welcome to Folding@Home
[21:44:34] Loaded queue successfully.
[21:44:34] Sent data
[21:44:34] Connecting to http://130.237.232.237:8080/
[21:44:35] Posted data.
[21:44:35] Initial: 0000; - Receiving payload (expected size: 512)
[21:44:35] Conversation time very short, giving reduced weight in bandwidth avg
[21:44:35] - Downloaded at ~1 kB/s
[21:44:35] - Averaged speed for that direction ~57 kB/s
[21:44:35] + Received work.
[21:44:35] + Closed connections
[21:44:40] 
[21:44:40] + Processing work unit
[21:44:40] Core required: FahCore_a5.exe
[21:44:40] Core found.
[21:44:40] Working on queue slot 09 [April 13 21:44:40 UTC]
[21:44:40] + Working ...
[21:44:40] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 09 -np 64 -checkpoint 15 -forceasm -verbose -lifeline 1900 -version 634'

[21:44:40] 
[21:44:40] *------------------------------*
[21:44:40] Folding@Home Gromacs SMP Core
[21:44:40] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[21:44:40] 
[21:44:40] Preparing to commence simulation
[21:44:40] - Assembly optimizations manually forced on.
[21:44:40] - Not checking prior termination.
[21:44:40] Couldn't Decompress
[21:44:40] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[21:44:40] -Error: Couldn't update checksum variables
[21:44:40] Error: Could not open work file
[21:44:40] 
[21:44:40] Folding@home Core Shutdown: FILE_IO_ERROR
[21:44:40] CoreStatus = 75 (117)
[21:44:40] Error opening or reading from a file.
[21:44:40] Deleting current work unit & continuing...
[21:44:40] Trying to send all finished work units
[21:44:40] + No unsent completed units remaining.
[21:44:40] - Preparing to get new work unit...
[21:44:40] Cleaning up work directory
[21:49:58] + Attempting to get work packet
[21:49:58] Passkey found
[21:49:58] - Will indicate memory of 48395 MB
[21:49:58] - Connecting to assignment server
[21:49:58] Connecting to http://assign.stanford.edu:8080/
[21:49:59] Posted data.
[21:49:59] Initial: ED82; - Successful: assigned to (130.237.232.237).
[21:49:59] + News From Folding@Home: Welcome to Folding@Home
[21:49:59] Loaded queue successfully.
[21:49:59] Sent data
[21:49:59] Connecting to http://130.237.232.237:8080/
[21:50:00] Posted data.
[21:50:00] Initial: 0000; - Receiving payload (expected size: 512)
[21:50:00] Conversation time very short, giving reduced weight in bandwidth avg
[21:50:00] - Downloaded at ~1 kB/s
[21:50:00] - Averaged speed for that direction ~51 kB/s
[21:50:00] + Received work.
[21:50:00] + Closed connections
[21:50:05] 
[21:50:05] + Processing work unit
[21:50:05] Core required: FahCore_a5.exe
[21:50:05] Core found.
[21:50:05] Working on queue slot 00 [April 13 21:50:05 UTC]
[21:50:05] + Working ...
[21:50:05] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 00 -np 64 -checkpoint 15 -forceasm -verbose -lifeline 1900 -version 634'

[21:50:05] 
[21:50:05] *------------------------------*
[21:50:05] Folding@Home Gromacs SMP Core
[21:50:05] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[21:50:05] 
[21:50:05] Preparing to commence simulation
[21:50:05] - Assembly optimizations manually forced on.
[21:50:05] - Not checking prior termination.
[21:50:05] Couldn't Decompress
[21:50:05] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[21:50:05] -Error: Couldn't update checksum variables
[21:50:05] Error: Could not open work file
[21:50:05] 
[21:50:05] Folding@home Core Shutdown: FILE_IO_ERROR
[21:50:05] CoreStatus = 75 (117)
[21:50:05] Error opening or reading from a file.
[21:50:05] Deleting current work unit & continuing...
[21:50:05] Trying to send all finished work units
[21:50:05] + No unsent completed units remaining.
[21:50:05] - Preparing to get new work unit...
[21:50:05] Cleaning up work directory
[21:55:24] + Attempting to get work packet
[21:55:24] Passkey found
[21:55:24] - Will indicate memory of 48395 MB
[21:55:24] - Connecting to assignment server
[21:55:24] Connecting to http://assign.stanford.edu:8080/
[21:55:24] Posted data.
[21:55:24] Initial: ED82; - Successful: assigned to (130.237.232.237).
[21:55:24] + News From Folding@Home: Welcome to Folding@Home
[21:55:25] Loaded queue successfully.
[21:55:25] Sent data
[21:55:25] Connecting to http://130.237.232.237:8080/
[21:55:25] Posted data.
[21:55:25] Initial: 0000; - Receiving payload (expected size: 512)
[21:55:25] Conversation time very short, giving reduced weight in bandwidth avg
[21:55:25] - Downloaded at ~1 kB/s
[21:55:25] - Averaged speed for that direction ~45 kB/s
[21:55:25] + Received work.
[21:55:25] + Closed connections
[21:55:30] 
[21:55:30] + Processing work unit
[21:55:30] Core required: FahCore_a5.exe
[21:55:30] Core found.
[21:55:30] Working on queue slot 01 [April 13 21:55:30 UTC]
[21:55:30] + Working ...
[21:55:30] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 01 -np 64 -checkpoint 15 -forceasm -verbose -lifeline 1900 -version 634'

[21:55:30] 
[21:55:30] *------------------------------*
[21:55:30] Folding@Home Gromacs SMP Core
[21:55:30] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[21:55:30] 
[21:55:30] Preparing to commence simulation
[21:55:30] - Assembly optimizations manually forced on.
[21:55:30] - Not checking prior termination.
[21:55:30] Couldn't Decompress
[21:55:30] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[21:55:30] -Error: Couldn't update checksum variables
[21:55:30] Error: Could not open work file
[21:55:30] 
[21:55:30] Folding@home Core Shutdown: FILE_IO_ERROR
[21:55:30] CoreStatus = 75 (117)
[21:55:30] Error opening or reading from a file.
[21:55:30] Deleting current work unit & continuing...
[21:55:31] Trying to send all finished work units
[21:55:31] + No unsent completed units remaining.
[21:55:31] - Preparing to get new work unit...
[21:55:31] Cleaning up work directory
[22:00:49] + Attempting to get work packet
[22:00:49] Passkey found
[22:00:49] - Will indicate memory of 48395 MB
[22:00:49] - Connecting to assignment server
[22:00:49] Connecting to http://assign.stanford.edu:8080/
[22:00:49] Posted data.
[22:00:49] Initial: ED82; - Successful: assigned to (130.237.232.237).
[22:00:49] + News From Folding@Home: Welcome to Folding@Home
[22:00:49] Loaded queue successfully.
[22:00:49] Sent data
[22:00:49] Connecting to http://130.237.232.237:8080/
[22:00:50] Posted data.
[22:00:50] Initial: 0000; - Receiving payload (expected size: 512)
[22:00:50] Conversation time very short, giving reduced weight in bandwidth avg
[22:00:50] - Downloaded at ~1 kB/s
[22:00:50] - Averaged speed for that direction ~40 kB/s
[22:00:50] + Received work.
[22:00:50] + Closed connections
[22:00:55] 
[22:00:55] + Processing work unit
[22:00:55] Core required: FahCore_a5.exe
[22:00:55] Core found.
[22:00:55] Working on queue slot 02 [April 13 22:00:55 UTC]
[22:00:55] + Working ...
[22:00:55] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 02 -np 64 -checkpoint 15 -forceasm -verbose -lifeline 1900 -version 634'

[22:00:55] 
[22:00:55] *------------------------------*
[22:00:55] Folding@Home Gromacs SMP Core
[22:00:55] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:00:55] 
[22:00:55] Preparing to commence simulation
[22:00:55] - Assembly optimizations manually forced on.
[22:00:55] - Not checking prior termination.
[22:00:55] Couldn't Decompress
[22:00:55] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:00:55] -Error: Couldn't update checksum variables
[22:00:55] Error: Could not open work file
[22:00:55] 
[22:00:55] Folding@home Core Shutdown: FILE_IO_ERROR
[22:00:55] CoreStatus = 75 (117)
[22:00:55] Error opening or reading from a file.
[22:00:55] Deleting current work unit & continuing...
[22:00:55] Trying to send all finished work units
[22:00:55] + No unsent completed units remaining.
[22:00:55] - Preparing to get new work unit...
[22:00:55] Cleaning up work directory
[22:06:14] + Attempting to get work packet
[22:06:14] Passkey found
[22:06:14] - Will indicate memory of 48395 MB
[22:06:14] - Connecting to assignment server
[22:06:14] Connecting to http://assign.stanford.edu:8080/
[22:06:19] Posted data.
[22:06:19] Initial: ED82; - Successful: assigned to (130.237.232.237).
[22:06:19] + News From Folding@Home: Welcome to Folding@Home
[22:06:19] Loaded queue successfully.
[22:06:19] Sent data
[22:06:19] Connecting to http://130.237.232.237:8080/
[22:06:20] Posted data.
[22:06:20] Initial: 0000; - Receiving payload (expected size: 512)
[22:06:20] Conversation time very short, giving reduced weight in bandwidth avg
[22:06:20] - Downloaded at ~1 kB/s
[22:06:20] - Averaged speed for that direction ~36 kB/s
[22:06:20] + Received work.
[22:06:20] + Closed connections
[22:06:25] 
[22:06:25] + Processing work unit
[22:06:25] Core required: FahCore_a5.exe
[22:06:25] Core found.
[22:06:25] Working on queue slot 03 [April 13 22:06:25 UTC]
[22:06:25] + Working ...
[22:06:25] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 03 -np 64 -checkpoint 15 -forceasm -verbose -lifeline 1900 -version 634'

[22:06:25] 
[22:06:25] *------------------------------*
[22:06:25] Folding@Home Gromacs SMP Core
[22:06:25] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:06:25] 
[22:06:25] Preparing to commence simulation
[22:06:25] - Assembly optimizations manually forced on.
[22:06:25] - Not checking prior termination.
[22:06:25] Couldn't Decompress
[22:06:25] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:06:25] -Error: Couldn't update checksum variables
[22:06:25] Error: Could not open work file
[22:06:25] 
[22:06:25] Folding@home Core Shutdown: FILE_IO_ERROR
[22:06:25] CoreStatus = 75 (117)
[22:06:25] Error opening or reading from a file.
[22:06:25] Deleting current work unit & continuing...
[22:06:25] Trying to send all finished work units
[22:06:25] + No unsent completed units remaining.
[22:06:25] - Preparing to get new work unit...
[22:06:25] Cleaning up work directory
[22:12:15] ***** Got an Activate signal (2)
[22:12:15] Killing all core threads

Folding@Home Client Shutdown.
Thanks.
Fold! It does a body good!™
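The loop in the log above (a 512-byte payload followed by FILE_IO_ERROR on every attempt) can be picked out of a long log mechanically. Here is a rough Python sketch, assuming the log lines look like the v6.34 ones quoted above; the function name `summarize_attempts` and both regexes are illustrative and not part of the client.

```python
import re

# Patterns modeled on the client log lines quoted above (an assumption;
# other client versions may word these lines differently).
PAYLOAD_RE = re.compile(r"Receiving payload \(expected size: (\d+)\)")
SHUTDOWN_RE = re.compile(r"Folding@home Core Shutdown: (\w+)")


def summarize_attempts(log_text):
    """Pair each downloaded payload size with the core shutdown status that follows it."""
    attempts = []
    pending_size = None
    for line in log_text.splitlines():
        payload = PAYLOAD_RE.search(line)
        if payload:
            pending_size = int(payload.group(1))
            continue
        shutdown = SHUTDOWN_RE.search(line)
        # Ignore shutdowns that were not preceded by a download in this log.
        if shutdown and pending_size is not None:
            attempts.append((pending_size, shutdown.group(1)))
            pending_size = None
    return attempts


sample = """\
[21:02:16] Initial: 0000; - Receiving payload (expected size: 512)
[21:02:16] Folding@home Core Shutdown: FILE_IO_ERROR
[21:07:35] Initial: 0000; - Receiving payload (expected size: 512)
[21:07:40] Folding@home Core Shutdown: FILE_IO_ERROR
"""

for size, status in summarize_attempts(sample):
    print(size, status)
```

A run of tiny payloads that all end in FILE_IO_ERROR suggests the client is being handed an empty work unit rather than failing on local disk access, though that reading is only an inference from the log.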
slider11
Posts: 5
Joined: Sat Mar 03, 2012 12:57 am

Re: Failed uploads: 9 cannot ping 130.237.232.237:8080

Post by slider11 »

I'm having the same issue with the same server. It sits there attempting to connect for 2 hours, then times out at its max time. tofu, did yours finally send?
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Failed uploads: 9 cannot ping 130.237.232.237:8080

Post by bruce »

tofuwombat wrote: Cannot upload, cannot get an "OK" with Firefox.
We need to change our expectations. The web page containing "OK" is no longer being installed on Stanford's servers, so except for a few servers that still have that page, one of two things happens: you get a 404 error or you get a blank page with no indication of error.

Both http://130.237.232.237/ and http://130.237.232.237:8080/ seem to be working correctly, as far as testing for an active web server is concerned. It's somewhat more rigorous a test than a simple ping, but it still doesn't guarantee that the server is fully functional for either uploading or downloading.

According to serverstat, it is available for either uploading or downloading, although it has very few WUs to be distributed. Most of the projects on that server have large results, so you can expect them to take a long time to upload.

By the way, you cannot ping 130.237.232.238:8080. The ping command does not accept a port designation such as the ":8080" notation. At the present time, I can ping 130.237.232.238.
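Since ping cannot target a port, a plain TCP connection attempt is one way to check whether something is listening on port 8080 or 80. The following is a minimal Python sketch; the helper name `tcp_port_open` and the 5-second default timeout are illustrative choices. Like the browser test, a successful connection only shows that the port accepts connections, not that the work server can actually take an upload.

```python
import socket


def tcp_port_open(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

For example, `tcp_port_open("130.237.232.237", 8080)` checks the port the client tries first, and `tcp_port_open("130.237.232.237", 80)` checks the alternative port it retries on.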
slider11
Posts: 5
Joined: Sat Mar 03, 2012 12:57 am

Re: Failed uploads: 9 cannot ping 130.237.232.237:8080

Post by slider11 »

So bruce, when it gives the "[10:28:05] + Attempting to send results [April 5 10:28:05 UTC]" and then sits there for 2 hours until it hits its max time, what is that signifying? I know I have crappy internet where I live, and for the most part I've blamed that, but I have a feeling that's not the reason. I've checked the stats page and have seen that the server I'm attempting to connect to is up and running. I'm running a new 2P system and this is only the 2nd WU I have finished. I accidentally deleted the first one after some issues, but that one wouldn't send either.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Failed uploads: 9 cannot ping 130.237.232.237:8080

Post by bruce »

I'm not sure what upload rate is required for those projects since none of my machines will run them :( but Stanford does have reasonable expectations for internet connections.

These projects have a really large number of atoms, and I'd expect the results to be correspondingly large, but like I said, I don't know how large. I'm also not able to tell if the problem is primarily at your end or at the server's end. (Sorry). Hopefully, somebody with a little more knowledge of this server and these projects will be along soon and offer more useful information.
bollix47
Posts: 2958
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Failed uploads: 9 cannot ping 130.237.232.237:8080

Post by bollix47 »

FWIW

The files that upload for P6903 and P6904 are ~222meg. My 1 meg upload connection takes ~30 minutes to send those up to the server.
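As a rough check on those numbers, here is a small Python sketch of the arithmetic. It assumes "1 meg" means a 1 Mbit/s uplink and that the client's "kB" means 1024 bytes; both are assumptions, and the function name `upload_minutes` is only illustrative. The file size is the one the client reported for a P6903 result earlier in this thread.

```python
def upload_minutes(size_bytes, rate_bytes_per_s):
    """Best-case transfer time in minutes, ignoring protocol overhead and retries."""
    return size_bytes / rate_bytes_per_s / 60


RESULT_BYTES = 222_402_313  # "Read 222402313 bytes from disk" in the earlier log

# 1 Mbit/s is at most 125,000 bytes/s.
print(round(upload_minutes(RESULT_BYTES, 1_000_000 / 8)))  # 30

# The ~85 kB/s upload speed the client logged, taking kB as 1024 bytes.
print(round(upload_minutes(RESULT_BYTES, 85 * 1024)))  # 43
```

The second figure is close to the roughly 42 minutes between "Connecting" and "Posted data" in that log, so the estimate seems to hold up for a working upload.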
slider11
Posts: 5
Joined: Sat Mar 03, 2012 12:57 am

Re: Failed uploads: 9 cannot ping 130.237.232.237:8080

Post by slider11 »

Oh boy. My connection is 382 kb/s meaning I'm lucky if I upload at 50-60 kb/s
snikygel
Posts: 9
Joined: Sat Feb 18, 2012 4:21 pm

Re: 130.237.232.237 going down for maintenance

Post by snikygel »

Hi,

I have a problem getting a WU from the said server. Here's the log. Please advise. Thanks.

Code:

# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/folding/fah
Executable: ./fah6
Arguments: -smp -bigadv -verbosity 9 

[01:31:25] - Ask before connecting: No
[01:31:25] - User name: closetminer (Team 24)
[01:31:25] - User ID: 227C19787D89DA40
[01:31:25] - Machine ID: 1
[01:31:25] 
[01:31:25] Loaded queue successfully.
[01:31:25] - Preparing to get new work unit...
[01:31:25] - Autosending finished units... [April 16 01:31:25 UTC]
[01:31:25] Cleaning up work directory
[01:31:25] Trying to send all finished work units
[01:31:25] + No unsent completed units remaining.
[01:31:25] - Autosend completed
[01:31:25] + Attempting to get work packet
[01:31:25] Passkey found
[01:31:25] - Will indicate memory of 32235 MB
[01:31:25] - Connecting to assignment server
[01:31:25] Connecting to http://assign.stanford.edu:8080/
[01:31:26] Posted data.
[01:31:26] Initial: ED82; - Successful: assigned to (130.237.232.237).
[01:31:26] + News From Folding@Home: Welcome to Folding@Home
[01:31:26] Loaded queue successfully.
[01:31:26] Sent data
[01:31:26] Connecting to http://130.237.232.237:8080/
[01:31:27] Posted data.
[01:31:27] Initial: 0000; - Receiving payload (expected size: 512)
[01:31:27] Conversation time very short, giving reduced weight in bandwidth avg
[01:31:27] - Downloaded at ~1 kB/s
[01:31:27] - Averaged speed for that direction ~1 kB/s
[01:31:27] + Received work.
[01:31:27] + Closed connections
[01:31:27] 
[01:31:27] + Processing work unit
[01:31:27] Core required: FahCore_a5.exe
[01:31:27] Core found.
[01:31:27] Working on queue slot 03 [April 16 01:31:27 UTC]
[01:31:27] + Working ...
[01:31:27] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 03 -np 48 -checkpoint 15 -verbose -lifeline 3508 -version 634'

thekraken: The Kraken 0.6 (compiled Sat Apr 14 15:50:54 WST 2012 by folding@H8QGi)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 3513
thekraken: Logging to thekraken.log
[01:31:27] 
[01:31:27] *------------------------------*
[01:31:27] Folding@Home Gromacs SMP Core
[01:31:27] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[01:31:27] 
[01:31:27] Preparing to commence simulation
[01:31:27] - Looking at optimizations...
[01:31:27] - Created dyn
[01:31:27] - Files status OK
[01:31:27] Couldn't Decompress
[01:31:27] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[01:31:27] -Error: Couldn't update checksum variables
[01:31:27] Error: Could not open work file
[01:31:27] 
[01:31:27] Folding@home Core Shutdown: FILE_IO_ERROR
[01:31:28] CoreStatus = 75 (117)
[01:31:28] Error opening or reading from a file.
[01:31:28] Deleting current work unit & continuing...
snikygel
Posts: 9
Joined: Sat Feb 18, 2012 4:21 pm

Re: 130.237.232.237 going down for maintenance

Post by snikygel »

Problem solved after deleting the file machinedependent.dat.
tofuwombat
Posts: 19
Joined: Mon Nov 22, 2010 4:06 pm

Re: Failed uploads: 9 cannot ping 130.237.232.237:8080

Post by tofuwombat »

slider11 wrote: I'm having the same issue with the same server. It sits there attempting to connect for 2 hours, then times out at its max time. tofu, did yours finally send?
Nope, it's still here (240 tries currently). My network is fast. The machine turns in four SMP units a day. The initial stuck WU (slot #5) is still here. Another that got stuck just after it (an EARLIER slot #6) did go after over a hundred tries.
I gave up on it. One day it was gone. This machine is hit/miss with HTTP connections in Firefox. One in three fail ("taking too long"). I realize the CPU is at 100%, but the browser bails in two seconds or so . . .
Is there a setting to change so that ~5 min (or whatever) wouldn't be "too long"?

Thanks for the "plus one" slider11

Code:

[05:37:45] - Connecting to assignment server
[05:37:45] Connecting to http://assign.stanford.edu:8080/
[05:37:45] Posted data.
[05:37:45] Initial: 8F80; - Successful: assigned to (128.143.231.202).
[05:37:45] + News From Folding@Home: Welcome to Folding@Home
[05:37:46] Loaded queue successfully.
[05:37:46] Sent data
[05:37:46] Connecting to http://128.143.231.202:8080/
[05:37:47] Posted data.
[05:37:47] Initial: 0000; - Receiving payload (expected size: 3813232)
[05:37:49] - Downloaded at ~1861 kB/s
[05:37:49] - Averaged speed for that direction ~1692 kB/s
[05:37:49] + Received work.
[05:37:49] Trying to send all finished work units
[05:37:49] Project: 6903 (Run 5, Clone 7, Gen 74)


[05:37:49] + Attempting to send results [April 16 05:37:49 UTC]
[05:37:49] - Reading file work/wuresults_05.dat from core
[05:37:49]   (Read 222418260 bytes from disk)
[05:37:49] Connecting to http://130.237.232.237:8080/
[05:39:26] - Couldn't send HTTP request to server
[05:39:26] + Could not connect to Work Server (results)
[05:39:26]     (130.237.232.237:8080)
[05:39:26] + Retrying using alternative port
[05:39:26] Connecting to http://130.237.232.237:80/
[05:41:08] - Couldn't send HTTP request to server
[05:41:08] + Could not connect to Work Server (results)
[05:41:08]     (130.237.232.237:80)
[05:41:08] - Error: Could not transmit unit 05 (completed April 5) to work server.
[05:41:08] - 236 failed uploads of this unit.
[05:41:08]   Keeping unit 05 in queue.
[05:41:08] + Sent 0 of 1 completed units to the server
[05:41:08] + Closed connections
[05:41:08]
[05:41:08] + Processing work unit
[05:41:08] Core required: FahCore_a3.exe
[05:41:08] Core found.
[05:41:08] Working on queue slot 06 [April 16 05:41:08 UTC]
[05:41:08] + Working ...
[05:41:08] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 06 -np 64 -priority 96 -checkpoint 30 -forceasm -verbose -lifeline 4155 -version 634'

thekraken: The Kraken 0.6 (compiled Sat Mar 24 01:56:45 PDT 2012 by tofuwombat@schnellzug)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 10858
thekraken: Logging to thekraken.log
[05:41:08]
[05:41:08] *------------------------------*
[05:41:08] Folding@Home Gromacs SMP Core
[05:41:08] Version 2.27 (Dec. 15, 2010)
[05:41:08]
[05:41:08] Preparing to commence simulation
[05:41:08] - Assembly optimizations manually forced on.
[05:41:08] - Not checking prior termination.
[05:41:08] - Expanded 3812720 -> 4136808 (decompressed 108.5 percent)
[05:41:08] Called DecompressByteArray: compressed_data_size=3812720 data_size=4136808, decompressed_data_size=4136808 diff=0
[05:41:08] - Digital signature verified
[05:41:08]
[05:41:08] Project: 6098 (Run 9, Clone 80, Gen 132)
[05:41:08]
[05:41:08] Assembly optimizations on if available.
[05:41:08] Entering M.D.
                         :-)  G  R  O  M  A  C  S  (-:

                   Groningen Machine for Chemical Simulation

                            :-)  VERSION 4.5.3  (-:

        Written by Emile Apol, Rossen Apostolov, Herman J.C. Berendsen,
      Aldert van Buuren, Pär Bjelkmar, Rudi van Drunen, Anton Feenstra,
        Gerrit Groenhof, Peter Kasson, Per Larsson, Pieter Meulenhoff,
           Teemu Murtola, Szilard Pall, Sander Pronk, Roland Schulz,
                Michael Shirts, Alfons Sijbers, Peter Tieleman,

               Berk Hess, David van der Spoel, and Erik Lindahl.

       Copyright (c) 1991-2000, University of Groningen, The Netherlands.
            Copyright (c) 2001-2010, The GROMACS development team at
        Uppsala University & The Royal Institute of Technology, Sweden.
            check out http://www.gromacs.org for more information.


                               :-)  Gromacs  (-:

Reading file work/wudata_06.tpr, VERSION 4.5.1-dev-20100930-afd66-dirty (single precision)
[05:41:14] Mapping NT from 64 to 64
Starting 64 threads
Making 3D domain decomposition 4 x 4 x 4
starting mdrun 'Solvated system'
66500000 steps, 266000.0 ps (continuing from step 66000000, 264000.0 ps).
[05:41:16] Completed 0 out of 500000 steps  (0%)
[05:45:47] Completed 5000 out of 500000 steps  (1%)
[05:50:24] Completed 10000 out of 500000 steps  (2%)

NOTE: Turning on dynamic load balancing

[05:55:04] Completed 15000 out of 500000 steps  (3%)
[05:59:50] Completed 20000 out of 500000 steps  (4%)
[06:04:35] Completed 25000 out of 500000 steps  (5%)
[06:09:22] Completed 30000 out of 500000 steps  (6%)
**********************************************************
./fah6 -verbosity 9 -queueinfo

Note: Please read the license agreement (fah6 -license). Further
use of this software requires that you have read and accepted this
agreement.



--- Opening Log file [April 16 18:42:29 UTC]


# Linux Console Edition
#######################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/tofuwombat/fah
Executable: ./fah6
Arguments: -verbosity 9 -queueinfo

[18:42:29] - Ask before connecting: No
[18:42:29] - User name: tofuwombat (Team 155278)
[18:42:29] - User ID: XXXXXXXXXXXXXXXXXXXX
[18:42:29] - Machine ID: 16
[18:42:29]
[18:42:29] Loaded queue successfully.
[18:42:29] Printing Queue Information
Current Queue:
Slot 08  Empty/Deleted
Project: 6097 (Run 0, Clone 23, Gen 220), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 13 20:22:39
Finished date: April 14 04:27:08

Slot 09  Empty/Deleted
Project: 6099 (Run 4, Clone 75, Gen 151), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 14 04:30:37
Finished date: April 14 12:34:22

Slot 00  Empty/Deleted
Project: 6098 (Run 6, Clone 10, Gen 131), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 14 12:39:32
Finished date: April 14 20:47:07

Slot 01  Empty/Deleted
Project: 6099 (Run 7, Clone 12, Gen 163), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 14 20:50:37
Finished date: April 15 04:51:24

Slot 02  Empty/Deleted
Project: 6099 (Run 1, Clone 3, Gen 145), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 15 04:56:19
Finished date: April 15 13:06:54

Slot 03  Empty/Deleted
Project: 6098 (Run 7, Clone 80, Gen 127), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 15 13:10:44
Finished date: April 15 21:23:44

Slot 04  Empty/Deleted
Project: 6097 (Run 0, Clone 38, Gen 210), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 15 21:29:29
Finished date: April 16 05:32:43

Slot 05  Done
Project: 6903 (Run 5, Clone 7, Gen 74), Core: a5
Work server: 130.237.232.237:8080
Collection server: 0.0.0.0
Download date: April 1 23:58:50
Finished date: April 5 01:30:24
Failed uploads: 240

Slot 06  Empty/Deleted
Project: 6098 (Run 9, Clone 80, Gen 132), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 16 05:37:49
Finished date: April 16 13:42:03

Slot 07 *Ready
Project: 6097 (Run 0, Clone 55, Gen 210), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 16 13:50:21
Deadline date: April 29 09:02:21

PF: 0.973364 based on last 4 slot(s)
[18:42:29] ***** Got a SIGTERM signal (15)
[18:42:29] Killing all core threads

Folding@Home Client Shutdown.
tofuwombat@schnellzug:~/fah$
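A `-queueinfo` dump like the one above can be scanned for slots that are stuck with failed uploads. The following is only a rough sketch: `parse_queue` and `SAMPLE` are hypothetical names, and the parsing assumes the plain `Key: value` layout seen in this dump rather than any documented output format.

```python
import re

# Two slots copied from the queue dump above (sample data only).
SAMPLE = """\
Slot 04  Empty/Deleted
Project: 6097 (Run 0, Clone 38, Gen 210), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 15 21:29:29
Finished date: April 16 05:32:43

Slot 05  Done
Project: 6903 (Run 5, Clone 7, Gen 74), Core: a5
Work server: 130.237.232.237:8080
Collection server: 0.0.0.0
Download date: April 1 23:58:50
Finished date: April 5 01:30:24
Failed uploads: 240
"""

def parse_queue(text):
    """Split the dump into one dict per slot (hypothetical helper)."""
    slots = []
    current = None
    for line in text.splitlines():
        header = re.match(r"Slot (\d+)\s+\*?(.+)", line)
        if header:
            # A "Slot NN  Status" line starts a new record; "*" marks the active slot.
            current = {"slot": header.group(1), "status": header.group(2).strip()}
            slots.append(current)
        elif current is not None and ":" in line:
            # Split on the first colon only, so "host:port" values stay intact.
            key, value = line.split(":", 1)
            current[key.strip()] = value.strip()
    return slots

slots = parse_queue(SAMPLE)
stuck = [s for s in slots if int(s.get("Failed uploads", 0)) > 0]
for s in stuck:
    print(s["slot"], s["Work server"], s["Failed uploads"])
```

On the sample, only slot 05 (the unit assigned to 130.237.232.237) is reported.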
Nathan_P
Posts: 1164
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 [email protected] Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 [email protected] Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: Failed uploads: 9 cannot ping 130.237.232.237:8080

Post by Nathan_P »

slider11 wrote:Oh boy. My connection is 382 kb/s meaning I'm lucky if I upload at 50-60 kb/s
You will still make it. I have the same connection speed and it manages the upload in about 1 hour 30 minutes. That is, however, running full tilt with no interruptions.
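As a back-of-the-envelope check, the upload time can be estimated from the result size and the sustained rate. This is a sketch under assumptions: `upload_minutes` is a hypothetical helper, the size is the 222418260-byte result file from the log earlier in the thread, 1 kB is taken as 1024 bytes, and protocol overhead and retries are ignored.

```python
def upload_minutes(size_bytes, rate_kb_per_s, bytes_per_kb=1024):
    """Estimated upload time in minutes at a constant rate (no overhead)."""
    seconds = size_bytes / (rate_kb_per_s * bytes_per_kb)
    return seconds / 60

RESULT_SIZE = 222418260  # bytes, from "(Read 222418260 bytes from disk)"

for rate in (50, 60):
    print(f"{rate} kB/s: about {upload_minutes(RESULT_SIZE, rate):.0f} minutes")
```

At 50 to 60 kB/s this works out to roughly 60 to 72 minutes of pure transfer time, so a real-world figure somewhat above that is plausible.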
slider11
Posts: 5
Joined: Sat Mar 03, 2012 12:57 am

Re: 130.237.232.237 going down for maintenance

Post by slider11 »

snikygel wrote:Problem solved after deleting the file machinedependent.dat
I hadn't thought about doing that. I had to do that before to get it to actually download a WU. I just deleted it and I'm going to see if it sends now. Thanks for sparking the lightbulb on that one, sniky.
bollix47
Posts: 2958
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: 130.237.232.237 going down for maintenance

Post by bollix47 »

@EXT64

I appear to be having the exact same problem with 12.04 (ext3) using v7 and bigadv. WUs will download but not upload when complete. Although FahCore_a5 still shows in the system monitor, its CPU usage indicates that it is not actually running, yet it doesn't shut down. If I kill it, the new WU starts up but the old one doesn't upload. I've gone back to v6 for now but would rather use v7.

Code: Select all

*********************** Log Started 2012-04-19T02:00:53Z ***********************
02:00:53:************************* Folding@home Client *************************
02:00:53:    Website: http://folding.stanford.edu/
02:00:53:  Copyright: (c) 2009-2012 Stanford University
02:00:53:     Author: Joseph Coffland <[email protected]>
02:00:53:       Args: 
02:00:53:     Config: /home/bollix/config.xml
02:00:53:******************************** Build ********************************
02:00:53:    Version: 7.1.52
02:00:53:       Date: Mar 20 2012
02:00:53:       Time: 13:19:11
02:00:53:    SVN Rev: 3515
02:00:53:     Branch: fah/trunk/client
02:00:53:   Compiler: GNU 4.6.2
02:00:53:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
02:00:53:             -fno-unsafe-math-optimizations -msse2
02:00:53:   Platform: linux2 3.2.0-1-amd64
02:00:53:       Bits: 64
02:00:53:       Mode: Release
02:00:53:******************************* System ********************************
02:00:53:        CPU: AMD Opteron(TM) Processor 6274
02:00:53:     CPU ID: AuthenticAMD Family 21 Model 1 Stepping 2
02:00:53:       CPUs: 64
02:00:53:     Memory: 31.42GiB
02:00:53:Free Memory: 30.22GiB
02:00:53:    Threads: POSIX_THREADS
02:00:53: On Battery: false
02:00:53: UTC offset: -4
02:00:53:        PID: 2772
02:00:53:        CWD: /home/bollix
02:00:53:         OS: Linux 3.2.0-23-generic x86_64
02:00:53:    OS Arch: AMD64
02:00:53:       GPUs: 1
02:00:53:      GPU 0: FERMI:1 GF108 [Quadro 600]
02:00:53:       CUDA: 2.1
02:00:53:CUDA Driver: 4020
02:00:53:***********************************************************************
02:00:53:<config>
02:00:53:  <!-- FahCore Control -->
02:00:53:  <checkpoint v='30'/>
02:00:53:  <core-priority v='low'/>
02:00:53:
02:00:53:  <!-- Folding Slot Configuration -->
02:00:53:  <max-packet-size v='big'/>
02:00:53:
02:00:53:  <!-- Network -->
02:00:53:  <proxy v=':8080'/>
02:00:53:
02:00:53:  <!-- Remote Command Server -->
02:00:53:  <command-allow v='127.0.0.1,192.168.2.100-192.168.2.149'/>
02:00:53:  <command-allow-no-pass v='127.0.0.1,192.168.2.100-192.168.2.149'/>
02:00:53:
02:00:53:  <!-- User Information -->
02:00:53:  <passkey v='********************************'/>
02:00:53:  <team v='39340'/>
02:00:53:  <user v='bollix47'/>
02:00:53:
02:00:53:  <!-- Folding Slots -->
02:00:53:  <slot id='0' type='SMP'>
02:00:53:    <client-type v='bigadv'/>
02:00:53:    <cpus v='-1'/>
02:00:53:  </slot>
02:00:53:</config>
02:00:53:Trying to access database...
02:00:53:Successfully acquired database lock
02:00:53:Enabled folding slot 00: READY smp:64
02:00:53:WU00:FS00:Starting
02:00:53:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /home/bollix/cores/www.stanford.edu/~pande/Linux/AMD64/Core_a5.fah/FahCore_a5 -dir 00 -suffix 01 -version 701 -lifeline 2772 -checkpoint 30 -np 64
02:00:53:WU00:FS00:Started FahCore on PID 2780
02:00:53:WU00:FS00:Core PID:2784
02:00:53:WU00:FS00:FahCore 0xa5 started
02:00:54:WU00:FS00:0xa5:
02:00:54:WU00:FS00:0xa5:*------------------------------*
02:00:54:WU00:FS00:0xa5:Folding@Home Gromacs SMP Core
02:00:54:WU00:FS00:0xa5:Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
02:00:54:WU00:FS00:0xa5:
02:00:54:WU00:FS00:0xa5:Preparing to commence simulation
02:00:54:WU00:FS00:0xa5:- Looking at optimizations...
02:00:54:WU00:FS00:0xa5:- Files status OK
02:00:57:Server connection id=1 on 0.0.0.0:36330 from 192.168.2.105
02:01:02:WU00:FS00:0xa5:- Expanded 57210877 -> 71843392 (decompressed 50.5 percent)
02:01:02:WU00:FS00:0xa5:Called DecompressByteArray: compressed_data_size=57210877 data_size=71843392, decompressed_data_size=71843392 diff=0
02:01:03:WU00:FS00:0xa5:- Digital signature verified
02:01:03:WU00:FS00:0xa5:
02:01:03:WU00:FS00:0xa5:Project: 6904 (Run 2, Clone 36, Gen 109)
02:01:03:WU00:FS00:0xa5:
02:01:03:WU00:FS00:0xa5:Assembly optimizations on if available.
02:01:03:WU00:FS00:0xa5:Entering M.D.
02:01:12:WU00:FS00:0xa5:Mapping NT from 64 to 64 
02:01:22:WU00:FS00:0xa5:Completed 0 out of 250000 steps  (0%)
02:01:51:Server connection id=2 on 0.0.0.0:36330 from 192.168.2.105
02:01:54:Server connection id=1 ended
02:04:24:Server connection id=3 on 0.0.0.0:36330 from 192.168.2.105
02:04:34:Server connection id=4 on 0.0.0.0:36330 from 192.168.2.105
02:19:24:Server connection id=5 on 0.0.0.0:36330 from 192.168.2.105
02:24:05:WU00:FS00:0xa5:Completed 2500 out of 250000 steps  (1%)
02:46:46:WU00:FS00:0xa5:Completed 5000 out of 250000 steps  (2%)
03:09:43:WU00:FS00:0xa5:Completed 7500 out of 250000 steps  (3%)
03:32:44:WU00:FS00:0xa5:Completed 10000 out of 250000 steps  (4%)
03:55:41:WU00:FS00:0xa5:Completed 12500 out of 250000 steps  (5%)
04:19:12:WU00:FS00:0xa5:Completed 15000 out of 250000 steps  (6%)
04:42:23:WU00:FS00:0xa5:Completed 17500 out of 250000 steps  (7%)
05:06:05:WU00:FS00:0xa5:Completed 20000 out of 250000 steps  (8%)
05:29:07:WU00:FS00:0xa5:Completed 22500 out of 250000 steps  (9%)
05:52:36:WU00:FS00:0xa5:Completed 25000 out of 250000 steps  (10%)
06:16:40:WU00:FS00:0xa5:Completed 27500 out of 250000 steps  (11%)
06:40:33:WU00:FS00:0xa5:Completed 30000 out of 250000 steps  (12%)
07:04:17:WU00:FS00:0xa5:Completed 32500 out of 250000 steps  (13%)
07:27:31:WU00:FS00:0xa5:Completed 35000 out of 250000 steps  (14%)
07:50:53:WU00:FS00:0xa5:Completed 37500 out of 250000 steps  (15%)
******************************** Date: 19/04/12 ********************************
08:14:23:WU00:FS00:0xa5:Completed 40000 out of 250000 steps  (16%)
08:37:54:WU00:FS00:0xa5:Completed 42500 out of 250000 steps  (17%)
09:01:35:WU00:FS00:0xa5:Completed 45000 out of 250000 steps  (18%)
09:24:57:WU00:FS00:0xa5:Completed 47500 out of 250000 steps  (19%)
09:48:25:WU00:FS00:0xa5:Completed 50000 out of 250000 steps  (20%)
10:11:49:WU00:FS00:0xa5:Completed 52500 out of 250000 steps  (21%)
10:35:11:WU00:FS00:0xa5:Completed 55000 out of 250000 steps  (22%)
10:58:34:WU00:FS00:0xa5:Completed 57500 out of 250000 steps  (23%)
11:22:02:WU00:FS00:0xa5:Completed 60000 out of 250000 steps  (24%)
11:45:34:WU00:FS00:0xa5:Completed 62500 out of 250000 steps  (25%)
12:09:34:WU00:FS00:0xa5:Completed 65000 out of 250000 steps  (26%)
12:32:52:WU00:FS00:0xa5:Completed 67500 out of 250000 steps  (27%)
12:56:12:WU00:FS00:0xa5:Completed 70000 out of 250000 steps  (28%)
13:19:41:WU00:FS00:0xa5:Completed 72500 out of 250000 steps  (29%)
13:43:01:WU00:FS00:0xa5:Completed 75000 out of 250000 steps  (30%)
14:06:18:WU00:FS00:0xa5:Completed 77500 out of 250000 steps  (31%)
******************************** Date: 19/04/12 ********************************
14:29:15:WU00:FS00:0xa5:Completed 80000 out of 250000 steps  (32%)
14:52:35:WU00:FS00:0xa5:Completed 82500 out of 250000 steps  (33%)
15:15:47:WU00:FS00:0xa5:Completed 85000 out of 250000 steps  (34%)
15:39:29:WU00:FS00:0xa5:Completed 87500 out of 250000 steps  (35%)
16:03:10:WU00:FS00:0xa5:Completed 90000 out of 250000 steps  (36%)
16:26:30:WU00:FS00:0xa5:Completed 92500 out of 250000 steps  (37%)
16:49:56:WU00:FS00:0xa5:Completed 95000 out of 250000 steps  (38%)
17:13:24:WU00:FS00:0xa5:Completed 97500 out of 250000 steps  (39%)
17:36:50:WU00:FS00:0xa5:Completed 100000 out of 250000 steps  (40%)
17:59:55:WU00:FS00:0xa5:Completed 102500 out of 250000 steps  (41%)
18:23:17:WU00:FS00:0xa5:Completed 105000 out of 250000 steps  (42%)
18:46:41:WU00:FS00:0xa5:Completed 107500 out of 250000 steps  (43%)
19:09:59:WU00:FS00:0xa5:Completed 110000 out of 250000 steps  (44%)
19:33:31:WU00:FS00:0xa5:Completed 112500 out of 250000 steps  (45%)
19:56:39:WU00:FS00:0xa5:Completed 115000 out of 250000 steps  (46%)
20:19:56:WU00:FS00:0xa5:Completed 117500 out of 250000 steps  (47%)
******************************** Date: 19/04/12 ********************************
20:43:21:WU00:FS00:0xa5:Completed 120000 out of 250000 steps  (48%)
21:06:45:WU00:FS00:0xa5:Completed 122500 out of 250000 steps  (49%)
21:30:03:WU00:FS00:0xa5:Completed 125000 out of 250000 steps  (50%)
21:53:41:WU00:FS00:0xa5:Completed 127500 out of 250000 steps  (51%)
22:17:24:WU00:FS00:0xa5:Completed 130000 out of 250000 steps  (52%)
22:40:58:WU00:FS00:0xa5:Completed 132500 out of 250000 steps  (53%)
23:04:43:WU00:FS00:0xa5:Completed 135000 out of 250000 steps  (54%)
23:28:41:WU00:FS00:0xa5:Completed 137500 out of 250000 steps  (55%)
23:52:30:WU00:FS00:0xa5:Completed 140000 out of 250000 steps  (56%)
00:15:57:WU00:FS00:0xa5:Completed 142500 out of 250000 steps  (57%)
00:39:16:WU00:FS00:0xa5:Completed 145000 out of 250000 steps  (58%)
01:02:55:WU00:FS00:0xa5:Completed 147500 out of 250000 steps  (59%)
01:26:16:WU00:FS00:0xa5:Completed 150000 out of 250000 steps  (60%)
01:49:35:WU00:FS00:0xa5:Completed 152500 out of 250000 steps  (61%)
02:13:07:WU00:FS00:0xa5:Completed 155000 out of 250000 steps  (62%)
02:36:34:WU00:FS00:0xa5:Completed 157500 out of 250000 steps  (63%)
******************************** Date: 20/04/12 ********************************
02:59:44:WU00:FS00:0xa5:Completed 160000 out of 250000 steps  (64%)
03:23:03:WU00:FS00:0xa5:Completed 162500 out of 250000 steps  (65%)
03:46:23:WU00:FS00:0xa5:Completed 165000 out of 250000 steps  (66%)
04:09:44:WU00:FS00:0xa5:Completed 167500 out of 250000 steps  (67%)
04:32:56:WU00:FS00:0xa5:Completed 170000 out of 250000 steps  (68%)
04:56:48:WU00:FS00:0xa5:Completed 172500 out of 250000 steps  (69%)
05:20:27:WU00:FS00:0xa5:Completed 175000 out of 250000 steps  (70%)
05:43:56:WU00:FS00:0xa5:Completed 177500 out of 250000 steps  (71%)
06:07:45:WU00:FS00:0xa5:Completed 180000 out of 250000 steps  (72%)
06:31:15:WU00:FS00:0xa5:Completed 182500 out of 250000 steps  (73%)
06:54:11:WU00:FS00:0xa5:Completed 185000 out of 250000 steps  (74%)
07:17:35:WU00:FS00:0xa5:Completed 187500 out of 250000 steps  (75%)
07:41:13:WU00:FS00:0xa5:Completed 190000 out of 250000 steps  (76%)
08:04:40:WU00:FS00:0xa5:Completed 192500 out of 250000 steps  (77%)
08:27:55:WU00:FS00:0xa5:Completed 195000 out of 250000 steps  (78%)
08:51:22:WU00:FS00:0xa5:Completed 197500 out of 250000 steps  (79%)
******************************** Date: 20/04/12 ********************************
09:14:55:WU00:FS00:0xa5:Completed 200000 out of 250000 steps  (80%)
09:38:09:WU00:FS00:0xa5:Completed 202500 out of 250000 steps  (81%)
10:01:26:WU00:FS00:0xa5:Completed 205000 out of 250000 steps  (82%)
10:24:44:WU00:FS00:0xa5:Completed 207500 out of 250000 steps  (83%)
10:48:11:WU00:FS00:0xa5:Completed 210000 out of 250000 steps  (84%)
11:11:37:WU00:FS00:0xa5:Completed 212500 out of 250000 steps  (85%)
11:35:03:WU00:FS00:0xa5:Completed 215000 out of 250000 steps  (86%)
11:58:41:WU00:FS00:0xa5:Completed 217500 out of 250000 steps  (87%)
12:21:59:WU00:FS00:0xa5:Completed 220000 out of 250000 steps  (88%)
12:29:59:Server connection id=6 on 0.0.0.0:36330 from 192.168.2.106
12:45:33:WU00:FS00:0xa5:Completed 222500 out of 250000 steps  (89%)
13:09:20:WU00:FS00:0xa5:Completed 225000 out of 250000 steps  (90%)
13:18:01:Server connection id=6 ended
13:32:56:WU00:FS00:0xa5:Completed 227500 out of 250000 steps  (91%)
13:56:29:WU00:FS00:0xa5:Completed 230000 out of 250000 steps  (92%)
14:19:58:WU00:FS00:0xa5:Completed 232500 out of 250000 steps  (93%)
14:43:40:WU00:FS00:0xa5:Completed 235000 out of 250000 steps  (94%)
15:07:40:WU00:FS00:0xa5:Completed 237500 out of 250000 steps  (95%)
******************************** Date: 20/04/12 ********************************
15:31:19:WU00:FS00:0xa5:Completed 240000 out of 250000 steps  (96%)
15:55:02:WU00:FS00:0xa5:Completed 242500 out of 250000 steps  (97%)
16:19:06:WU00:FS00:0xa5:Completed 245000 out of 250000 steps  (98%)
16:42:46:WU00:FS00:0xa5:Completed 247500 out of 250000 steps  (99%)
17:06:16:WU00:FS00:0xa5:Completed 250000 out of 250000 steps  (100%)
17:06:17:WU01:FS00:Connecting to assign3.stanford.edu:8080
17:06:17:WU01:FS00:News: Welcome to Folding@Home
17:06:17:WU01:FS00:Assigned to work server 130.237.232.237
17:06:17:WU01:FS00:Requesting new work unit for slot 00: RUNNING smp:64 from 130.237.232.237
17:06:17:WU01:FS00:Connecting to 130.237.232.237:8080
17:06:30:WU01:FS00:Downloading 54.59MiB
17:06:36:WU01:FS00:Download 7.44%
17:06:42:WU01:FS00:Download 20.15%
17:06:48:WU01:FS00:Download 34.00%
17:06:48:WU00:FS00:0xa5:DynamicWrapper: Finished Work Unit: sleep=10000
17:06:54:WU01:FS00:Download 46.71%
17:06:58:WU00:FS00:0xa5:
17:06:58:WU00:FS00:0xa5:Finished Work Unit:
17:06:58:WU00:FS00:0xa5:- Reading up to 121544064 from "00/wudata_01.trr": Read 121544064
17:06:59:WU00:FS00:0xa5:trr file hash check passed.
17:07:00:WU01:FS00:Download 56.67%
17:07:00:WU00:FS00:0xa5:- Reading up to 108720488 from "00/wudata_01.xtc": Read 108720488
17:07:01:WU00:FS00:0xa5:xtc file hash check passed.
17:07:01:WU00:FS00:0xa5:edr file hash check passed.
17:07:01:WU00:FS00:0xa5:logfile size: 201744
17:07:01:WU00:FS00:0xa5:Leaving Run
17:07:01:WU00:FS00:0xa5:- Writing 230639288 bytes of core data to disk...
17:07:06:WU01:FS00:Download 66.74%
17:07:12:WU01:FS00:Download 77.27%
17:07:18:WU01:FS00:Download 87.35%
17:07:24:WU01:FS00:Download 97.31%
17:07:25:WU01:FS00:Download complete
17:07:25:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:OK project:6903 run:1 clone:29 gen:8 core:0xa5 unit:0x0000000c52be746d4e15c6b25ccf3d63
17:07:25:WU01:FS00:Downloading project 6903 description
17:07:25:WU01:FS00:Connecting to fah-web.stanford.edu:80
17:07:26:WU01:FS00:Project 6903 description downloaded successfully
17:08:17:WU00:FS00:0xa5:Done: 230638776 -> 222320979 (compressed to 3.2 percent)
17:08:18:WU00:FS00:0xa5:  ... Done.

at this point everything appeared to be frozen and fahcore_a5 was showing in system monitor but not running so I killed it  -  I've had a few do this to me lately and waited much longer than 5 minutes but the core never shut down


17:13:02:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
17:13:02:WU01:FS00:Starting
17:13:02:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /home/bollix/cores/www.stanford.edu/~pande/Linux/AMD64/Core_a5.fah/FahCore_a5 -dir 01 -suffix 01 -version 701 -lifeline 2772 -checkpoint 30 -np 64
17:13:02:WU01:FS00:Started FahCore on PID 5444
17:13:02:WU01:FS00:Core PID:5448
17:13:02:WU01:FS00:FahCore 0xa5 started
17:13:02:WU01:FS00:0xa5:
17:13:02:WU01:FS00:0xa5:*------------------------------*
17:13:02:WU01:FS00:0xa5:Folding@Home Gromacs SMP Core
17:13:02:WU01:FS00:0xa5:Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
17:13:02:WU01:FS00:0xa5:
17:13:02:WU01:FS00:0xa5:Preparing to commence simulation
17:13:02:WU01:FS00:0xa5:- Looking at optimizations...
17:13:02:WU01:FS00:0xa5:- Created dyn
17:13:02:WU01:FS00:0xa5:- Files status OK
17:13:10:WU01:FS00:0xa5:- Expanded 57245886 -> 71846524 (decompressed 50.4 percent)
17:13:10:WU01:FS00:0xa5:Called DecompressByteArray: compressed_data_size=57245886 data_size=71846524, decompressed_data_size=71846524 diff=0
17:13:11:WU01:FS00:0xa5:- Digital signature verified
17:13:11:WU01:FS00:0xa5:
17:13:11:WU01:FS00:0xa5:Project: 6903 (Run 1, Clone 29, Gen 8)
17:13:11:WU01:FS00:0xa5:
17:13:11:WU01:FS00:0xa5:Assembly optimizations on if available.
17:13:11:WU01:FS00:0xa5:Entering M.D.
17:13:20:WU01:FS00:0xa5:Mapping NT from 64 to 64 
17:13:28:WU01:FS00:0xa5:Completed 0 out of 250000 steps  (0%)

no upload of wu00 so I paused it 


17:16:21:FS00:Paused
17:16:21:FS00:Shutting core down
17:16:26:WU01:FS00:0xa5:Client no longer detected. Shutting down core.
17:16:26:WU01:FS00:0xa5:
17:16:26:WU01:FS00:0xa5:Folding@home Core Shutdown: CLIENT_DIED
17:16:27:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
17:16:44:WARNING:Caught signal SIGINT(2) on PID 2772
17:16:44:Exiting, please wait. . .
17:16:46:Clean exit
After a restart the client returned to ws00 and started to process the 6904 from the beginning again even though the results were in work/00

Code: Select all

bollix@Gemini:~/work/00$ ls -l
total 858644
-rwxr-x--- 1 bollix bollix       659 Apr 18 21:47 logfile_01-20120419-020053.txt
-rwxr-x--- 1 bollix bollix      5344 Apr 20 13:08 logfile_01-20120420-173338.txt
-rwxr-x--- 1 bollix bollix       667 Apr 20 13:34 logfile_01.txt
-rw-r--r-- 1 bollix bollix      1895 Apr 20 13:18 log.txt
drwxrwxr-x 2 bollix bollix      4096 Apr 20 13:18 work
-rw-rw-r-- 1 bollix bollix  60776500 Apr 20 13:06 wudata_01.cpt
-rw-r--r-- 1 bollix bollix  57211389 Apr 18 21:44 wudata_01.dat
-rw-rw-r-- 1 bollix bollix         0 Apr 20 13:34 wudata_01.dyn
-rw-rw-r-- 1 bollix bollix    172480 Apr 20 13:06 wudata_01.edr
-rw-rw-r-- 1 bollix bollix 174719354 Apr 20 13:06 wudata_01.gro
-rw-rw-r-- 1 bollix bollix    201744 Apr 20 13:06 wudata_01.log
-rw-rw-r-- 1 bollix bollix  60776500 Apr 20 13:01 wudata_01_prev.cpt
-rw-rw-r-- 1 bollix bollix  71843392 Apr 20 13:33 wudata_01.tpr
-rw-rw-r-- 1 bollix bollix 121544064 Apr 20 13:13 wudata_01.trr
-rw-rw-r-- 1 bollix bollix 108720488 Apr 20 13:06 wudata_01.xtc
-rwxr-x--- 1 bollix bollix       512 Apr 20 13:33 wuinfo_01.dat
-rw-rw-r-- 1 bollix bollix 222321491 Apr 20 13:08 wuresults_01.dat
bollix@Gemini:~/work/00$ cd ../01
bollix@Gemini:~/work/01$ ls -l
total 195356
-rwxr-x--- 1 bollix bollix      657 Apr 20 13:16 logfile_01.txt
-rw-r--r-- 1 bollix bollix 57246398 Apr 20 13:07 wudata_01.dat
-rw-rw-r-- 1 bollix bollix        0 Apr 20 13:16 wudata_01.dyn
-rw-rw-r-- 1 bollix bollix     1480 Apr 20 13:13 wudata_01.edr
-rw-rw-r-- 1 bollix bollix    13565 Apr 20 13:16 wudata_01.log
-rw-rw-r-- 1 bollix bollix 71846524 Apr 20 13:13 wudata_01.tpr
-rw-rw-r-- 1 bollix bollix 60811248 Apr 20 13:13 wudata_01.trr
-rw-rw-r-- 1 bollix bollix  9883596 Apr 20 13:13 wudata_01.xtc
-rwxr-x--- 1 bollix bollix      512 Apr 20 13:13 wuinfo_01.dat
bollix@Gemini:~/work/01$ 
I'm just guessing, but the problem appears to be that the a5 core doesn't shut down properly.
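The pattern in the log above (results written, then the core killed by hand) can be spotted automatically. The sketch below is an assumption-laden illustration, not part of the client: `find_lost_units` is a hypothetical helper, and it relies only on the v7 line layout `HH:MM:SS:WUnn:FSnn:message` as it appears in this post.

```python
import re

# Matches v7 client lines such as "17:06:58:WU00:FS00:0xa5:Finished Work Unit:".
LINE = re.compile(r"^(\d\d:\d\d:\d\d):(WU\d+):FS\d+:(.*)$")

def find_lost_units(log_text):
    """Return (wu, finished_at, returned_at) for units that finished
    writing results but whose core then returned INTERRUPTED."""
    finished = {}
    lost = []
    for raw in log_text.splitlines():
        m = LINE.match(raw)
        if not m:
            continue
        stamp, wu, msg = m.groups()
        msg = msg.rstrip()
        if msg.endswith("Finished Work Unit:"):
            finished[wu] = stamp
        elif msg.startswith("FahCore returned:") and "INTERRUPTED" in msg:
            if wu in finished:
                lost.append((wu, finished[wu], stamp))
    return lost

# A few lines copied from the log above (sample data only).
SAMPLE_LOG = """\
17:06:48:WU00:FS00:0xa5:DynamicWrapper: Finished Work Unit: sleep=10000
17:06:58:WU00:FS00:0xa5:Finished Work Unit:
17:08:18:WU00:FS00:0xa5:  ... Done.
17:13:02:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
17:13:28:WU01:FS00:0xa5:Completed 0 out of 250000 steps  (0%)
17:16:27:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
"""

print(find_lost_units(SAMPLE_LOG))
```

On the sample, only WU00 is flagged; WU01 was also interrupted but had not finished, so it is not counted as lost work.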
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: 130.237.232.237 going down for maintenance

Post by kasson »

Hmm--you reproducibly see this problem with v7 but not with v6? That is helpful information indeed. Can you confirm? Anyone else notice this trend?
bollix47
Posts: 2958
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: 130.237.232.237 going down for maintenance

Post by bollix47 »

I changed to v6 regular smp and the first one uploaded no problem. I'm doing 1 more regular then I'll switch to bigadv. It might be a day or so depending on which project I get but I will let you know as soon as I can.

And yes, this 'problem' has happened to me a few times in the last week. The core didn't shut down properly in all cases. Probably 4 or 5 WUs lost to science, not to mention well over 1 million points for the team. :( Obviously the science loss is more important.