Search found 10 matches

by m1geo
Wed Apr 15, 2020 6:16 pm
Forum: CPU Projects - released FAHCores _a7 & _a8 (a4 retired)
Topic: _a7 core crashing in Gromacs
Replies: 31
Views: 39515

Re: _a7 core crashing in Gromacs

I'm not the OP. I was just reporting this was still ongoing, the previous last post said it was resolved.

I switched it down to 23 CPUs for the time being and that's working fine.

Keep up the good work.
by m1geo
Wed Apr 15, 2020 4:12 pm
Forum: CPU Projects - released FAHCores _a7 & _a8 (a4 retired)
Topic: _a7 core crashing in Gromacs
Replies: 31
Views: 39515

Re: _a7 core crashing in Gromacs

FYI, I have confirmation from the Project owner that Project 16417 will no longer be assigned to 24 CPUs. Thanks all for your report :) Just received the same from project 16403. 24 CPUs. I've changed to 23 now, and it's working. I guess I need to put a GPU in here to make use of the "spare&qu...
by m1geo
Fri Apr 03, 2020 1:16 pm
Forum: Problems with NVidia drivers
Topic: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_UNIT
Replies: 33
Views: 8057

Re: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_U

I finally found my issue. It's bizarre! One of the fan bearings has failed. When the controller tried to spin the fans up, the fan would spin a bit, then jam, then drag the 12V rail down on the GPU. That caused all kinds of weirdness. Simply unplugging the one fan and the card works fine. I have ord...
by m1geo
Thu Apr 02, 2020 12:50 pm
Forum: Problems with NVidia drivers
Topic: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_UNIT
Replies: 33
Views: 8057

Re: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_U

Thanks for the heads up. Using Uengine Heaven https://benchmark.unigine.com/heaven looping, I am able to get the GPU performance at [*]Graphics: -70 [*]Memory: +500 However, a 10 minute FAHBench session doesn't like that. For FAHBench, I need to run: [*]Graphics: -90 [*]Memory: +300 The card is an A...
by m1geo
Thu Apr 02, 2020 1:01 am
Forum: Problems with NVidia drivers
Topic: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_UNIT
Replies: 33
Views: 8057

Re: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_U

That's what I thought. I'm new to GPUs. I'm an electronic engineer, so I don't know the details of GPUs, Nvidia settings, parameters, etc., but I have a pragmatic considered approach. What I have learned this evening is that reducing the power limit down makes things behave, and I can complete the t...
by m1geo
Wed Apr 01, 2020 5:46 pm
Forum: Problems with NVidia drivers
Topic: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_UNIT
Replies: 33
Views: 8057

Re: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_U

Some more debugging with FAHBench https://fahbench.github.io/ ... The GPU benchmark runs to about 10% before falling over with either "NaN" error or some random exception (usually clEnqueueMapBuffer). https://www.george-smart.co.uk/wordpress/wp-content/uploads/2020/04/image-4.png https://w...
by m1geo
Wed Apr 01, 2020 1:20 pm
Forum: Problems with NVidia drivers
Topic: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_UNIT
Replies: 33
Views: 8057

Re: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_U

Yeah, I realise this. It was more a proof of concept that the GPU is there, will talk to the PC, and will run something without crashing.
by m1geo
Wed Apr 01, 2020 3:24 am
Forum: Problems with NVidia drivers
Topic: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_UNIT
Replies: 33
Views: 8057

Re: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_U

Hey, thanks for the confirmation. I've been doing some digging, and I notice something weird. As the GPU changes up through the power levels, the FAHBench https://fahbench.github.io/ benchmarker falls over as soon as the GPU enters power level 3 (P3). Now, I haven't ever overclocked the GPU (GTX 107...
by m1geo
Wed Apr 01, 2020 1:53 am
Forum: Problems with NVidia drivers
Topic: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_UNIT
Replies: 33
Views: 8057

Re: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_U

As an update to my post yesterday, I can compile CUDA applications and they work. bandwidthTest: george@ryzen:~/cuda-samples/Samples/bandwidthTest$ make /usr/bin/nvcc -ccbin g++ -I../../Common -m64 -gencode arch=compute_30,code=sm_30 -gencode arch=compute_35,code=sm_35 -gencode arch=compute_37,code=...
by m1geo
Tue Mar 31, 2020 2:26 am
Forum: Problems with NVidia drivers
Topic: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_UNIT
Replies: 33
Views: 8057

Re: GTX1060 Linux drv v430 Error compiling kernel BAD_WORK_U

I, too, have exactly this issue: Xubuntu 19.10 GeForce GTX 1070 8GB Nvidia Driver 440.64 (also tried 435, 430). I have nvidia-opencl-dev, ocl-icd-opencl-dev, etc installed. Machine info: 01:22:59:************************* Folding@home Client ************************* 01:22:59: Website: https://foldi...