Search found 5 matches

by Zzyzx
Sun Jul 05, 2020 2:49 am
Forum: CPU Projects - released FAHCores _a7 & _a8 (a4 retired)
Topic: _a7 core crashing in Gromacs
Replies: 31
Views: 56288

Re: _a7 core crashing in Gromacs (p16403)

FYI, the researcher has decided to err on the side of caution and have prevented 24 CPUs from receiving Project 16403. Still getting this assigned on 48 CPUs: 02:46:05:WU01:FS00:Starting 02:46:05:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/...
by Zzyzx
Wed May 13, 2020 10:00 pm
Forum: Issues with a specific WU
Topic: Project: 14576 (Run 0, Clone 2096, Gen 48)
Replies: 20
Views: 11397

Re: Project: 14576 (Run 0, Clone 2096, Gen 48)

Ran into one of these today. Folding on 45 until it passed. Project: 14576 (Run 0, Clone 3581, Gen 171). I do wonder if PantherX could work with the project owner to get this excluded from problematic core counts like with Project 16417.
by Zzyzx
Wed Apr 22, 2020 6:16 am
Forum: Issues with a specific WU
Topic: Project 16417 fails on high core count machines
Replies: 14
Views: 7156

Re: Project 16417 fails on high core count machines

Sorry, was responding to Zzyzx post which had log showing was running 48 then 47 threads … and I guess there may be a number between 32 and 47/48 that works as well … My bad. Yeah, I was getting the error with 48 threads. I found by turning it down to 47 (which actually decayed down to 45 because o...
by Zzyzx
Mon Apr 20, 2020 1:56 pm
Forum: Issues with a specific WU
Topic: Project 16417 fails on high core count machines
Replies: 14
Views: 7156

Re: Project 16417 fails on high core count machines

FYI, I have confirmation from the Project owner that Project 16417 will no longer be assigned to 24 CPUs. Thanks all for your report :) Hey there! I got assigned 16417 on a 24c/48t machine today and had the same issue: 13:47:45:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/c...
by Zzyzx
Sun Apr 19, 2020 2:57 am
Forum: Issues with a specific WU
Topic: Project 13862 (Run 0, Clone 263, Gen 106) WORK_QUIT (404)
Replies: 0
Views: 637

Project 13862 (Run 0, Clone 263, Gen 106) WORK_QUIT (404)

Greetings! This had a very long upload time and then the server responded with WORK_QUIT (404). I know things are busy lately, so I'm not sure if it's just an issue with network traffic getting to the server or what. 22:18:21:WU01:FS00:Connecting to 65.254.110.245:8080 22:18:21:WU01:FS00:Assigned to...