Search found 7 matches

by RunningTurtle
Tue Jun 09, 2020 11:19 pm
Forum: Issues with a specific WU
Topic: Project: 16438 Decomposition Fail
Replies: 8
Views: 2430

Re: Project: 16438 Decomposition Fail

Here is the log from this morning, including the download of Project: 16438 (Run 0, Clone 1034, Gen 125), along with the first five failed attempts at running the work unit. 07:34:06:WU01:FS00:Connecting to 65.254.110.245:8080 07:34:06:WU01:FS00:Assigned to work server 128.252.203.1 07:34:06:WU01:FS...
by RunningTurtle
Tue Jun 09, 2020 4:48 pm
Forum: Issues with a specific WU
Topic: Project: 16438 Decomposition Fail
Replies: 8
Views: 2430

Re: Project: 16438 Decomposition Fail

An update. As of this morning, this FAH client is still getting newly assigned WorkUnits from Project 16438, which will not start on a 15 core machine: 16:33:32:WU01:FS00:0xa7:*********************** Log Started 2020-06-09T16:33:32Z *********************** 16:33:32:WU01:FS00:0xa7:*******************...
by RunningTurtle
Sun Jun 07, 2020 7:16 pm
Forum: Issues with a specific WU
Topic: Project: 16438 Decomposition Fail
Replies: 8
Views: 2430

Re: Project: 16438 Decomposition Fail

Did you just add a GPU? No. I built these machines late March just for FAH. The FAH configuration has been running as-is (15 CPU only cores, 1 GPU+1 CPU core) since then. The only software change since then 21.May, when I did an apt-get upgrade. The nvidia driver changed from 418.74 to 418.113, alo...
by RunningTurtle
Fri Jun 05, 2020 1:46 am
Forum: Issues with a specific WU
Topic: Project: 16438 Decomposition Fail
Replies: 8
Views: 2430

Project: 16438 Decomposition Fail

As requested, here is my log showing Project: 16438 (Run 0, Clone 253, Gen 148) trying to run on a 15 core client. 21:57:12:WU00:FS00:Connecting to 65.254.110.245:8080 21:57:12:WU00:FS00:Assigned to work server 128.252.203.1 21:57:12:WU00:FS00:Requesting new work unit for slot 00: RUNNING cpu:15 fro...
by RunningTurtle
Thu Jun 04, 2020 4:33 pm
Forum: Issues with a specific WU
Topic: Project: 14524 (Run 373, Clone 0, Gen 44) Decomposition Fail
Replies: 5
Views: 1409

Re: Project: 14524 (Run 373, Clone 0, Gen 44) Decomposition

Same problem with Project: 16438 (Run 0, Clone 2880, Gen 98)

(Unable to decompose on a 15 core machine, works fine with 12 cores.)
by RunningTurtle
Sun May 31, 2020 7:10 pm
Forum: Issues with a specific WU
Topic: Project: 14524 (Run 373, Clone 0, Gen 44) Decomposition Fail
Replies: 5
Views: 1409

Project: 14524 (Run 373, Clone 0, Gen 44) Decomposition Fail

Project: 14524 (Run 373, Clone 0, Gen 44) cannot be decomposed with a 15 core machine. Runs fine with 12 cores. I see this Project had similar issues last month. Looks like it still has work units that cannot be decomposed by a factor of 5? viewtopic.php?f=108&t=34821&p=330033&hilit=1452...
by RunningTurtle
Mon Mar 30, 2020 11:46 pm
Forum: Issues with a specific WU
Topic: PRCG 14580 (Run 0, Clone 424, Gen 1) Fatal Error, core count
Replies: 1
Views: 432

PRCG 14580 (Run 0, Clone 424, Gen 1) Fatal Error, core count

It appears PRCG 14580 cannot decompose the domain across a 15 core client, and is not configured to automatically reduce core count to compensate. Log Snippet: 17:06:55:WU02:FS00:0xa7:Project: 14580 (Run 0, Clone 424, Gen 1) 17:06:55:WU02:FS00:0xa7:Reading tar file core.xml 17:06:55:WU02:FS00:0xa7:R...