Search found 339 matches

by JohnChodera
Tue Oct 13, 2020 4:56 am
Forum: Problems with NVidia drivers
Topic: Nvidia GPU error INTERRUPTED (102 = 0x66)
Replies: 29
Views: 37575

Re: Nvidia GPU error INTERRUPTED (102 = 0x66)

@pannu: Would you be willing to be pulled into the core dev testing slack? If so, DM me your email address and we'll invite you to help track this down.

Thanks so much for bearing with us!

~ John Chodera // MSKCC
by JohnChodera
Sat Oct 03, 2020 1:10 am
Forum: FAH Hardware
Topic: CUDA Update to FAHCore_22
Replies: 98
Views: 426879

Re: CUDA Update to FAHCore_22

> It would be nice if FAH would show "Please update nvidia drivers to >= v4xx.yy" instead of CUDA_ERROR_INVALID_PTX only.

We weren't sure this was the cause of the error until recently, but we can add this message in the next core release!

Thanks!

~ John Chodera // MSKCC
by JohnChodera
Sat Oct 03, 2020 1:05 am
Forum: Issues with a specific WU
Topic: Bad work unit??
Replies: 36
Views: 5999

Re: Bad work unit??

> When did the cuda update in folding drop again?? T

core22 0.0.13 with CUDA support rolled out on Mon 28 Sep for most folks (though BETA users had it more than a week earlier).

Is just this project giving you trouble, or all of them?

~ John Chodera // MSKCC
by JohnChodera
Fri Sep 25, 2020 9:39 pm
Forum: Issues with a specific WU
Topic: UNKNOWN_ENUM (-1073740791 = 0xc0000409)
Replies: 4
Views: 1244

Re: UNKNOWN_ENUM (-1073740791 = 0xc0000409)

Thanks for the update, and let us know if you end up with something like this again!

~ John Chodera // MSKCC
by JohnChodera
Fri Sep 25, 2020 9:36 pm
Forum: GPU Projects and FahCores
Topic: Core22 v0.0.13 return CUDA_ERROR_INVALID_PTX (218)
Replies: 9
Views: 9518

Re: Core22 v0.0.13 return CUDA_ERROR_INVALID_PTX (218)

> Core 22 v. 0.0.13 is apparently released to non-beta users too.

Whoops! This made it out of the testing lab accidentally. We're still doing a lot of testing, so nothing official yet...

~ John Chodera // MSKCC
by JohnChodera
Tue Sep 22, 2020 1:50 am
Forum: Problems with AMD/ATI drivers
Topic: Problems obtaining GPU WU
Replies: 42
Views: 54098

Re: Problems obtaining GPU WU

Just wanted to chime in here: We hope to be able to proceed with this refinement of GPUSpecies soon, but don't have a concrete timetable. Getting good benchmark data out for the comprehensive 1710x benchmark projects is our first priority! After that, it should be straightforward to begin to automat...
by JohnChodera
Sun Sep 20, 2020 6:50 pm
Forum: Discussions of General-FAH topics
Topic: BAD_WORK_UNIT (114 = 0x72)
Replies: 9
Views: 1413

Re: BAD_WORK_UNIT (114 = 0x72)

We have some bugfixes for AMD Cards coming in the next core release that may help here!

~ John Chodera // MSKCC
by JohnChodera
Sun Sep 20, 2020 4:01 pm
Forum: Discussions of General-FAH topics
Topic: Need option to limit PC resources usage
Replies: 17
Views: 2796

Re: Need option to limit PC resources usage

> Than why FAH doesn't have option to send data in bursts? As even amateurs could make simple utility to prevent gpu from heating down and heating up! As that was the problem! We are not talking here about anything overly complicated. And problem would be solved... We've explored this in the past, a...
by JohnChodera
Sun Sep 20, 2020 3:57 pm
Forum: Issues with a specific WU
Topic: Getting bad work unit (Unreleased driver issue)
Replies: 27
Views: 4766

Re: Getting bad work unit (Unreleased driver issue)

@silverpulser: If you want to experiment with this, it would be better to do it in the #core22 development slack I invited you to since we have an NVIDIA rep working with the driver team who can help guide you through specific tests!

~ John Chodera // MSKCC
by JohnChodera
Sun Sep 20, 2020 12:31 am
Forum: Issues with a specific WU
Topic: Getting bad work unit (Unreleased driver issue)
Replies: 27
Views: 4766

Re: Getting bad work unit (Unreleased driver issue)

Glad the driver rollback worked!

We've been working with some good folks at NVIDIA to better understand these failures, so if you have complete details on how to reproduce the issue, that would be super helpful!

~ John Chodera // MSKCC
by JohnChodera
Thu Sep 17, 2020 3:36 am
Forum: Issues with a specific WU
Topic: Getting bad work unit (Unreleased driver issue)
Replies: 27
Views: 4766

Re: Getting bad work unit

I'm glad you posted! We've been seeing a few cases of this error: clCreateContext (222) This isn't a valid OpenCL error code, and likely means there is something going wrong with the NVIDIA driver. Which driver version do you have installed? Is there any chance this was updated recently? Does reboot...
by JohnChodera
Wed Sep 16, 2020 6:08 pm
Forum: GPU Projects and FahCores
Topic: covid moonshot bad wu setup
Replies: 52
Views: 37491

Re: covid moonshot bad wu setup

I'm showing the stats credit for 13433 (SARS-CoV-2 Mpro monomer) is 70660, and for 13737 (Mpro dimer) is 118000. Are you seeing the base credit is _lower_ for 13437 than 13433?
by JohnChodera
Wed Sep 16, 2020 6:07 pm
Forum: GPU Projects and FahCores
Topic: covid moonshot bad wu setup
Replies: 52
Views: 37491

Re: covid moonshot bad wu setup

> The base points is actually lower for 13437 than 13433 so logically it should be a little bit quicker ... with these PRCGs someone will be able to see if anything obviously odd is going on.

Whoa, that may be a mistake. Looking into this.
by JohnChodera
Wed Sep 16, 2020 5:04 am
Forum: New Donors start here
Topic: anyone else getting less Moonshot projects?
Replies: 5
Views: 1242

Re: anyone else getting less Moonshot projects?

We're up to ~2/3 of the core22 assignments now, so I think this is helping.

Thanks again for the heads up!

~ John Chodera // MSKCC
by JohnChodera
Wed Sep 16, 2020 4:00 am
Forum: GPU Projects and FahCores
Topic: covid moonshot bad wu setup
Replies: 52
Views: 37491

Re: covid moonshot bad wu setup

> In the past few days I’ve been getting some Moonshot WUs that take over 4 hours to complete on either of my GPUs (one RTX 2060, one RTX 2060 KO, no overclocking) — an hour or more longer than what was usual earlier. I’ll take that as a sign that WUs can and will be adjusted for the new GPUs due to...