Project 2662 (Run 2, Clone 353, Gen 10) Arc too large
Posted: Fri Aug 22, 2008 8:17 pm
Found this error in my logs: Arc too large at 2953318120
Meanwhile the workunit has been automatically deleted, also without credit.
Don't know what is wrong here, there was enough diskspace.
[10:34:31] Folding@Home Gromacs SMP Core
[10:34:31] Version 1.91 (2007)
[10:34:31]
[10:34:31] Preparing to commence simulation
[10:34:31] - Looking at optimizations...
[10:34:31] - Working with standa status OK
[10:34:31] Error: Work unit read from disk is invalid
[10:34:31] Finalizing output
[10:34:31]
[10:34:31] - Files status OK
[10:34:31] Error: Work unit read from disk is invalid
[10:34:31] Finalizing output
[10:34:33] - Expanded 4920986 -> 24360573 (decompressed 495.0 percent)
[10:34:33]
[10:34:33] Project: 2662 (Run 2, Clone 353, Gen 10)
[10:34:33]
[10:34:34] Entering M.D.
[10:34:50]
[10:34:50] Entering M.D.
[10:35:02] Completed 0 out of 250000 steps (0%)
[10:50:28] equesting checkpoint
[10:55:35] Timer requesting checkpoint
[10:56:24] Completed 1250 out of 250000 steps (1%)
[11:44:28] Timer requesting checkpoint
[11:49:35] Timer requesting checkpoint
...
[14:06:38] Timer requesting checkpoint
[14:11:47] Timer requesting checkpoint
[14:13:25] Completed 248750 out of 250000 steps (100%)
[14:18:35] Timer requesting checkpoint
[14:23:39] Timer requesting checkpoint
[14:28:47] Timer requesting checkpoint
[14:33:57] Timer requesting checkpoint
[14:36:32]
[14:36:32] Finished Work Unit:
[14:36:32] Arc too large at 2953318120
[14:36:32] - Reading up to 262143488 from "work/wudata_01.trr": Could not open file
[14:36:32] Error: could not open arcfile. Exiting.
[14:36:36] - Shutting down core
[14:38:36]
[14:38:36] Folding@home Core Shutdown: FILE_IO_ERROR
[14:41:55] CoreStatus = 75 (117)
[14:41:55] Error opening or reading from a file.
[14:41:55] Deleting current work unit & continuing...
[14:46:11] - Preparing to get new work unit...
[14:46:11] + Attempting to get work packet
[14:46:11] - Connecting to assignment server
[14:46:12] - Successful: assigned to (171.64.65.56).
In another slot I had one more workunit which I was never able to send back:
[10:04:55] + Attempting to send results [August 19 10:04:55 UTC]
[10:11:22] - Server does not have record of this unit. Will try again later.
[10:11:22] Could not transmit unit 00 to Collection server; keeping in queue.
[10:11:22] Project: 2662 (Run 0, Clone 106, Gen 13)
[10:11:22] + Attempting to send results [August 19 10:11:22 UTC]
[10:11:54] - Couldn't send HTTP request to server
[10:11:54] + Could not connect to Work Server (results)
[10:11:54] (171.64.65.56:8080)
[10:11:54] + Retrying using alternative port
[10:12:27] - Couldn't send HTTP request to server
[10:12:27] + Could not connect to Work Server (results)
[10:12:27] (171.64.65.56:80)
[10:12:27] - Error: Could not transmit unit 00 (completed August 19) to work server.
Meanwhile the workunit has been automatically deleted, also without credit.
Don't know what is wrong here, there was enough diskspace.
[10:34:31] Folding@Home Gromacs SMP Core
[10:34:31] Version 1.91 (2007)
[10:34:31]
[10:34:31] Preparing to commence simulation
[10:34:31] - Looking at optimizations...
[10:34:31] - Working with standa status OK
[10:34:31] Error: Work unit read from disk is invalid
[10:34:31] Finalizing output
[10:34:31]
[10:34:31] - Files status OK
[10:34:31] Error: Work unit read from disk is invalid
[10:34:31] Finalizing output
[10:34:33] - Expanded 4920986 -> 24360573 (decompressed 495.0 percent)
[10:34:33]
[10:34:33] Project: 2662 (Run 2, Clone 353, Gen 10)
[10:34:33]
[10:34:34] Entering M.D.
[10:34:50]
[10:34:50] Entering M.D.
[10:35:02] Completed 0 out of 250000 steps (0%)
[10:50:28] equesting checkpoint
[10:55:35] Timer requesting checkpoint
[10:56:24] Completed 1250 out of 250000 steps (1%)
[11:44:28] Timer requesting checkpoint
[11:49:35] Timer requesting checkpoint
...
[14:06:38] Timer requesting checkpoint
[14:11:47] Timer requesting checkpoint
[14:13:25] Completed 248750 out of 250000 steps (100%)
[14:18:35] Timer requesting checkpoint
[14:23:39] Timer requesting checkpoint
[14:28:47] Timer requesting checkpoint
[14:33:57] Timer requesting checkpoint
[14:36:32]
[14:36:32] Finished Work Unit:
[14:36:32] Arc too large at 2953318120
[14:36:32] - Reading up to 262143488 from "work/wudata_01.trr": Could not open file
[14:36:32] Error: could not open arcfile. Exiting.
[14:36:36] - Shutting down core
[14:38:36]
[14:38:36] Folding@home Core Shutdown: FILE_IO_ERROR
[14:41:55] CoreStatus = 75 (117)
[14:41:55] Error opening or reading from a file.
[14:41:55] Deleting current work unit & continuing...
[14:46:11] - Preparing to get new work unit...
[14:46:11] + Attempting to get work packet
[14:46:11] - Connecting to assignment server
[14:46:12] - Successful: assigned to (171.64.65.56).
In another slot I had one more workunit which I was never able to send back:
[10:04:55] + Attempting to send results [August 19 10:04:55 UTC]
[10:11:22] - Server does not have record of this unit. Will try again later.
[10:11:22] Could not transmit unit 00 to Collection server; keeping in queue.
[10:11:22] Project: 2662 (Run 0, Clone 106, Gen 13)
[10:11:22] + Attempting to send results [August 19 10:11:22 UTC]
[10:11:54] - Couldn't send HTTP request to server
[10:11:54] + Could not connect to Work Server (results)
[10:11:54] (171.64.65.56:8080)
[10:11:54] + Retrying using alternative port
[10:12:27] - Couldn't send HTTP request to server
[10:12:27] + Could not connect to Work Server (results)
[10:12:27] (171.64.65.56:80)
[10:12:27] - Error: Could not transmit unit 00 (completed August 19) to work server.