WUs not beint sent in

If you're new to FAH and need help getting started or you have very basic questions, start here.

Moderators: Site Moderators, FAHC Science Team

Post Reply
RISKinator
Posts: 2
Joined: Thu Feb 22, 2018 10:25 pm

WUs not beint sent in

Post by RISKinator »

I have 3 100% completed work units and they're not being sent in. The status for each says "Send" but they've been sitting here for days. I've already successfully completed and sent in 4 WUs, so I don't know why these ones are failing. Please help.
SteveWillis
Posts: 389
Joined: Fri Apr 15, 2016 12:42 am
Hardware configuration: PC 1:
Linux Mint 17.3
three gtx 1080 GPUs One on a powered header
Motherboard = [MB-AM3-AS-SB-990FXR2] qty 1 Asus Sabertooth 990FX(+59.99)
CPU = [CPU-AM3-FX-8320BR] qty 1 AMD FX 8320 Eight Core 3.5GHz(+41.99)

PC2:
Linux Mint 18
Open air case
Motherboard: ASUS Crosshair V Formula-Z AM3+ AMD 990FX SATA 6Gb/s USB 3.0 ATX AMD
AMD FD6300WMHKBOX FX-6300 6-Core Processor Black Edition with Cooler Master Hyper 212 EVO - CPU Cooler with 120mm PWM Fan
three gtx 1080,
one gtx 1080 TI on a powered header

Re: WUs not beint sent in

Post by SteveWillis »

Have you tried rebooting?
Image

1080 and 1080TI GPUs on Linux Mint
RISKinator
Posts: 2
Joined: Thu Feb 22, 2018 10:25 pm

Re: WUs not beint sent in

Post by RISKinator »

Yes I have. Upon restart it says that it's attempting to send the results and that it's on it's 6th attempt.
SteveWillis
Posts: 389
Joined: Fri Apr 15, 2016 12:42 am
Hardware configuration: PC 1:
Linux Mint 17.3
three gtx 1080 GPUs One on a powered header
Motherboard = [MB-AM3-AS-SB-990FXR2] qty 1 Asus Sabertooth 990FX(+59.99)
CPU = [CPU-AM3-FX-8320BR] qty 1 AMD FX 8320 Eight Core 3.5GHz(+41.99)

PC2:
Linux Mint 18
Open air case
Motherboard: ASUS Crosshair V Formula-Z AM3+ AMD 990FX SATA 6Gb/s USB 3.0 ATX AMD
AMD FD6300WMHKBOX FX-6300 6-Core Processor Black Edition with Cooler Master Hyper 212 EVO - CPU Cooler with 120mm PWM Fan
three gtx 1080,
one gtx 1080 TI on a powered header

Re: WUs not beint sent in

Post by SteveWillis »

Don't guess I can help then. I've never seen that error, and I checked to make sure. Sorry.
Image

1080 and 1080TI GPUs on Linux Mint
Joe_H
Site Admin
Posts: 7937
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: WUs not beint sent in

Post by Joe_H »

The Welcome topic includes directions on how to find and post your log file. If you could post the beginning section which gives system and configuration information, and enough more to show an upload attempt for a WU, that would be helpful in trying to diagnose the problem.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
HRD
Posts: 5
Joined: Sun Jan 07, 2018 12:48 am

Re: WUs not beint sent in

Post by HRD »

I have the same problem. A WU is 100% complete but send attempts repeatedly fail. Here's the log below.

I can't delete it; I've rebooted, run Onyx to tweak everything, cleared caches, etc. No apparent way to find this file or to possibly fix the connection it needs.
Any help much appreciated!


Code: Select all

*********************** Log Started 2018-03-05T15:24:55Z ***********************
15:24:55:************************* Folding@home Client *************************
15:24:55:    Website: http://folding.stanford.edu/
15:24:55:  Copyright: (c) 2009-2014 Stanford University
15:24:55:     Author: Joseph Coffland <[email protected]>
15:24:55:       Args: --child --lifeline 1804 --respawn
15:24:55:     Config: /Library/Application Support/FAHClient/config.xml
15:24:55:******************************** Build ********************************
15:24:55:    Version: 7.4.4
15:24:55:       Date: Mar 4 2014
15:24:55:       Time: 20:27:54
15:24:55:    SVN Rev: 4130
15:24:55:     Branch: fah/trunk/client
15:24:55:   Compiler: GNU 4.2.1 (Apple Inc. build 5666) (dot 3)
15:24:55:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
15:24:55:             -fno-unsafe-math-optimizations -msse3 -arch x86_64
15:24:55:             -mmacosx-version-min=10.6
15:24:55:   Platform: darwin 10.8.0
15:24:55:       Bits: 64
15:24:55:       Mode: Release
15:24:55:******************************* System ********************************
15:24:55:        CPU: Intel(R) Xeon(R) W-2140B CPU @ 3.20GHz
15:24:55:     CPU ID: GenuineIntel Family 6 Model 85 Stepping 4
15:24:55:       CPUs: 16
15:24:55:     Memory: 64.00GiB
15:24:55:Free Memory: 52.20GiB
15:24:55:    Threads: POSIX_THREADS
15:24:55: OS Version: 10.13
15:24:55:Has Battery: false
15:24:55: On Battery: false
15:24:55: UTC Offset: -8
15:24:55:        PID: 1813
15:24:55:        CWD: /Library/Application Support/FAHClient
15:24:55:         OS: Darwin 17.4.0 x86_64
15:24:55:    OS Arch: AMD64
15:24:55:       GPUs: 0
15:24:55:       CUDA: Not detected
15:24:55:***********************************************************************
15:24:55:<config>
15:24:55:  <!-- Folding Slot Configuration -->
15:24:55:  <cause v='CANCER'/>
15:24:55:
15:24:55:  <!-- Network -->
15:24:55:  <proxy v=':8080'/>
15:24:55:
15:24:55:  <!-- Slot Control -->
15:24:55:  <power v='full'/>
15:24:55:
15:24:55:  <!-- User Information -->
15:24:55:  <passkey v='********************************'/>
15:24:55:  <user v='HRD-1'/>
15:24:55:
15:24:55:  <!-- Folding Slots -->
15:24:55:  <slot id='0' type='CPU'/>
15:24:55:</config>
15:24:55:Trying to access database...
15:24:55:Successfully acquired database lock
15:24:55:Enabled folding slot 00: READY cpu:16
15:24:56:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:13744 run:148 clone:23 gen:37 core:0xa7 unit:0x000000250002894b59d589cff5e77264
15:24:56:WU01:FS00:Uploading 1.66MiB to 155.247.166.219
15:24:56:WU00:FS00:Starting
15:24:56:WU01:FS00:Connecting to 155.247.166.219:8080
15:24:56:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/fahwebx.stanford.edu/cores/OSX/AMD64/Core_a4.fah/FahCore_a4" -dir 00 -suffix 01 -version 704 -lifeline 1813 -checkpoint 15 -np 16
15:24:56:WU00:FS00:Started FahCore on PID 1820
15:24:56:WU00:FS00:Core PID:1826
15:24:56:WU00:FS00:FahCore 0xa4 started
15:24:56:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
15:24:56:WU01:FS00:Connecting to 155.247.166.219:80
15:24:56:WU00:FS00:0xa4:
15:24:56:WU00:FS00:0xa4:*------------------------------*
15:24:56:WU00:FS00:0xa4:Folding@Home Gromacs Core
15:24:56:WU00:FS00:0xa4:Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
15:24:56:WU00:FS00:0xa4:
15:24:56:WU00:FS00:0xa4:Preparing to commence simulation
15:24:56:WU00:FS00:0xa4:- Looking at optimizations...
15:24:56:WU00:FS00:0xa4:- Files status OK
15:24:56:WU00:FS00:0xa4:- Expanded 825169 -> 1398040 (decompressed 169.4 percent)
15:24:56:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=825169 data_size=1398040, decompressed_data_size=1398040 diff=0
15:24:56:WU00:FS00:0xa4:- Digital signature verified
15:24:56:WU00:FS00:0xa4:
15:24:56:WU00:FS00:0xa4:Project: 9039 (Run 61, Clone 3, Gen 1520)
15:24:56:WU00:FS00:0xa4:
15:24:56:WU00:FS00:0xa4:Assembly optimizations on if available.
15:24:56:WU00:FS00:0xa4:Entering M.D.
15:25:02:WU00:FS00:0xa4:Using Gromacs checkpoints
15:25:02:WU00:FS00:0xa4:Mapping NT from 16 to 16 
15:25:02:WU00:FS00:0xa4:Resuming from checkpoint
15:25:02:WU00:FS00:0xa4:Verified 00/wudata_01.log
15:25:02:WU00:FS00:0xa4:Verified 00/wudata_01.trr
15:25:02:WU00:FS00:0xa4:Verified 00/wudata_01.xtc
15:25:02:WU00:FS00:0xa4:Verified 00/wudata_01.edr
15:25:02:WU00:FS00:0xa4:Completed 68235 out of 250000 steps  (27%)
15:25:07:8:127.0.0.1:New Web connection
15:25:27:WU00:FS00:0xa4:Completed 70000 out of 250000 steps  (28%)
15:26:00:WU00:FS00:0xa4:Completed 72500 out of 250000 steps  (29%)
15:26:11:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 155.247.166.219:80: Operation timed out
15:26:11:WU01:FS00:Trying to send results to collection server
15:26:11:WU01:FS00:Uploading 1.66MiB to 155.247.166.220
15:26:11:WU01:FS00:Connecting to 155.247.166.220:8080
15:26:11:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
15:26:11:WU01:FS00:Connecting to 155.247.166.220:80
15:26:33:WU00:FS00:0xa4:Completed 75000 out of 250000 steps  (30%)
15:27:04:Caught signal SIGPIPE(13) on PID 1813
15:27:05:Caught signal SIGPIPE(13) on PID 1813
15:27:05:WU00:FS00:0xa4:Completed 77500 out of 250000 steps  (31%)
15:27:27:ERROR:WU01:FS00:Exception: Failed to connect to 155.247.166.220:80: Operation timed out
15:27:27:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:13744 run:148 clone:23 gen:37 core:0xa7 unit:0x000000250002894b59d589cff5e77264
15:27:27:WU01:FS00:Uploading 1.66MiB to 155.247.166.219
15:27:27:WU01:FS00:Connecting to 155.247.166.219:8080
15:27:27:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
15:27:27:WU01:FS00:Connecting to 155.247.166.219:80
15:27:41:WU00:FS00:0xa4:Completed 80000 out of 250000 steps  (32%)
15:28:19:WU00:FS00:0xa4:Completed 82500 out of 250000 steps  (33%)
15:28:43:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 155.247.166.219:80: Operation timed out
15:28:43:WU01:FS00:Trying to send results to collection server
15:28:43:WU01:FS00:Uploading 1.66MiB to 155.247.166.220
15:28:43:WU01:FS00:Connecting to 155.247.166.220:8080
15:28:43:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
15:28:43:WU01:FS00:Connecting to 155.247.166.220:80
15:28:54:WU00:FS00:0xa4:Completed 85000 out of 250000 steps  (34%)
15:29:31:WU00:FS00:0xa4:Completed 87500 out of 250000 steps  (35%)
15:29:58:ERROR:WU01:FS00:Exception: Failed to connect to 155.247.166.220:80: Operation timed out
15:29:59:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:13744 run:148 clone:23 gen:37 core:0xa7 unit:0x000000250002894b59d589cff5e77264
15:29:59:WU01:FS00:Uploading 1.66MiB to 155.247.166.219
15:29:59:WU01:FS00:Connecting to 155.247.166.219:8080
15:29:59:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
15:29:59:WU01:FS00:Connecting to 155.247.166.219:80
15:30:09:WU00:FS00:0xa4:Completed 90000 out of 250000 steps  (36%)
15:30:48:WU00:FS00:0xa4:Completed 92500 out of 250000 steps  (37%)
15:31:14:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 155.247.166.219:80: Operation timed out
15:31:14:WU01:FS00:Trying to send results to collection server
15:31:14:WU01:FS00:Uploading 1.66MiB to 155.247.166.220
15:31:14:WU01:FS00:Connecting to 155.247.166.220:8080
15:31:15:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
15:31:15:WU01:FS00:Connecting to 155.247.166.220:80
15:31:26:WU00:FS00:0xa4:Completed 95000 out of 250000 steps  (38%)
15:32:05:WU00:FS00:0xa4:Completed 97500 out of 250000 steps  (39%)
15:32:30:ERROR:WU01:FS00:Exception: Failed to connect to 155.247.166.220:80: Operation timed out
15:32:31:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:13744 run:148 clone:23 gen:37 core:0xa7 unit:0x000000250002894b59d589cff5e77264
15:32:31:WU01:FS00:Uploading 1.66MiB to 155.247.166.219
15:32:31:WU01:FS00:Connecting to 155.247.166.219:8080
15:32:31:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
15:32:31:WU01:FS00:Connecting to 155.247.166.219:80
15:32:41:WU00:FS00:0xa4:Completed 100000 out of 250000 steps  (40%)
15:33:17:WU00:FS00:0xa4:Completed 102500 out of 250000 steps  (41%)
15:33:46:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 155.247.166.219:80: Operation timed out
15:33:46:WU01:FS00:Trying to send results to collection server
15:33:46:WU01:FS00:Uploading 1.66MiB to 155.247.166.220
15:33:46:WU01:FS00:Connecting to 155.247.166.220:8080
15:33:46:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
15:33:46:WU01:FS00:Connecting to 155.247.166.220:80
15:33:53:WU00:FS00:0xa4:Completed 105000 out of 250000 steps  (42%)
15:34:31:WU00:FS00:0xa4:Completed 107500 out of 250000 steps  (43%)
15:35:01:ERROR:WU01:FS00:Exception: Failed to connect to 155.247.166.220:80: Operation timed out
15:35:08:WU00:FS00:0xa4:Completed 110000 out of 250000 steps  (44%)
15:35:08:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:13744 run:148 clone:23 gen:37 core:0xa7 unit:0x000000250002894b59d589cff5e77264
15:35:08:WU01:FS00:Uploading 1.66MiB to 155.247.166.219
15:35:08:WU01:FS00:Connecting to 155.247.166.219:8080
15:35:08:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
15:35:08:WU01:FS00:Connecting to 155.247.166.219:80
Leonardo
Posts: 260
Joined: Tue Dec 04, 2007 5:09 am
Hardware configuration: GPU slots on home-built, purpose-built PCs.
Location: Eagle River, Alaska

Re: WUs not beint sent in

Post by Leonardo »

Same problem. I'm fairly sure this is associated with the problems Stanford Folding servers are experiencing presently. I've been folding for 15 years and have never seen completed WUs sitting for so long without sending. I'm sure Pande Lab/Stanford doesn't enjoy this any more than we, the donors do. I wish them success getting things sorted out.
Last edited by Leonardo on Mon Mar 05, 2018 4:43 pm, edited 1 time in total.
Image
HRD
Posts: 5
Joined: Sun Jan 07, 2018 12:48 am

Re: WUs not beint sent in

Post by HRD »

Hey thanks for responding. I kind of suspected it could be their servers.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: WUs not beint sent in

Post by bruce »

Server 155.247.166.219 doesn't happen to be at Stanford, but the problem has been reported, the servers restarted, and were down again within an hour. :(
HRD
Posts: 5
Joined: Sun Jan 07, 2018 12:48 am

Re: WUs not beint sent in

Post by HRD »

FYI: mine has gone, guess they fixed the server?
Post Reply