Search found 7 matches
- Tue Jun 09, 2020 11:19 pm
- Forum: Issues with a specific WU
- Topic: Project: 16438 Decomposition Fail
- Replies: 8
- Views: 2430
Re: Project: 16438 Decomposition Fail
Here is the log from this morning, including the download of Project: 16438 (Run 0, Clone 1034, Gen 125), along with the first five failed attempts at running the work unit. 07:34:06:WU01:FS00:Connecting to 65.254.110.245:8080 07:34:06:WU01:FS00:Assigned to work server 128.252.203.1 07:34:06:WU01:FS...
- Tue Jun 09, 2020 4:48 pm
- Forum: Issues with a specific WU
- Topic: Project: 16438 Decomposition Fail
- Replies: 8
- Views: 2430
Re: Project: 16438 Decomposition Fail
An update. As of this morning, this FAH client is still getting newly assigned WorkUnits from Project 16438, which will not start on a 15 core machine: 16:33:32:WU01:FS00:0xa7:*********************** Log Started 2020-06-09T16:33:32Z *********************** 16:33:32:WU01:FS00:0xa7:*******************...
- Sun Jun 07, 2020 7:16 pm
- Forum: Issues with a specific WU
- Topic: Project: 16438 Decomposition Fail
- Replies: 8
- Views: 2430
Re: Project: 16438 Decomposition Fail
Did you just add a GPU? No. I built these machines late March just for FAH. The FAH configuration has been running as-is (15 CPU only cores, 1 GPU+1 CPU core) since then. The only software change since then 21.May, when I did an apt-get upgrade. The nvidia driver changed from 418.74 to 418.113, alo...
- Fri Jun 05, 2020 1:46 am
- Forum: Issues with a specific WU
- Topic: Project: 16438 Decomposition Fail
- Replies: 8
- Views: 2430
Project: 16438 Decomposition Fail
As requested, here is my log showing Project: 16438 (Run 0, Clone 253, Gen 148) trying to run on a 15 core client. 21:57:12:WU00:FS00:Connecting to 65.254.110.245:8080 21:57:12:WU00:FS00:Assigned to work server 128.252.203.1 21:57:12:WU00:FS00:Requesting new work unit for slot 00: RUNNING cpu:15 fro...
- Thu Jun 04, 2020 4:33 pm
- Forum: Issues with a specific WU
- Topic: Project: 14524 (Run 373, Clone 0, Gen 44) Decomposition Fail
- Replies: 5
- Views: 1409
Re: Project: 14524 (Run 373, Clone 0, Gen 44) Decomposition
Same problem with Project: 16438 (Run 0, Clone 2880, Gen 98)
(Unable to decompose on a 15 core machine, works fine with 12 cores.)
(Unable to decompose on a 15 core machine, works fine with 12 cores.)
- Sun May 31, 2020 7:10 pm
- Forum: Issues with a specific WU
- Topic: Project: 14524 (Run 373, Clone 0, Gen 44) Decomposition Fail
- Replies: 5
- Views: 1409
Project: 14524 (Run 373, Clone 0, Gen 44) Decomposition Fail
Project: 14524 (Run 373, Clone 0, Gen 44) cannot be decomposed with a 15 core machine. Runs fine with 12 cores. I see this Project had similar issues last month. Looks like it still has work units that cannot be decomposed by a factor of 5? viewtopic.php?f=108&t=34821&p=330033&hilit=1452...
- Mon Mar 30, 2020 11:46 pm
- Forum: Issues with a specific WU
- Topic: PRCG 14580 (Run 0, Clone 424, Gen 1) Fatal Error, core count
- Replies: 1
- Views: 432
PRCG 14580 (Run 0, Clone 424, Gen 1) Fatal Error, core count
It appears PRCG 14580 cannot decompose the domain across a 15 core client, and is not configured to automatically reduce core count to compensate. Log Snippet: 17:06:55:WU02:FS00:0xa7:Project: 14580 (Run 0, Clone 424, Gen 1) 17:06:55:WU02:FS00:0xa7:Reading tar file core.xml 17:06:55:WU02:FS00:0xa7:R...