Created attachment 131503 [details] Transcription of kernel panic Since e4311ee51d1e2676001b2d8fcefd92bdd79aad85 "drm/nouveau/therm: remove ineffective workarounds for alarm bugs", my machine with a GTX 650 panics on boot. I will attach lspci output and a transcription of the panic, and I can provide more information or test patches as needed.
Created attachment 131504 [details] output of `lspci -nnnnnnn -vvvvvvvvv'
That's, uh, rather strange. Is this fully reproducible? Could I also see a kernel log with "nouveau.debug=debug" from a working kernel?
It certainly feels reproducible. I've tested ~20 boots past the listed commit, and ~10 from very close to before it, and if that commit isn't the dividing line, it's doing a very good job of pretending to be. (I agree with you that it looks pretty harmless, though.) I'm about to attach a dmesg from a working kernel (4.11.0-rc4) with "nouveau.modeset=1 nouveau.config=NvGrUseFW=1 nouveau.debug=debug" (my standard command line has "nouveau.modeset=1 nouveau.config=NvGrUseFW=1" if it matters).
Created attachment 131565 [details] output of dmesg on (working) 4.11.0-rc4 with nouveau.debug=debug
I just reverted e4311ee51d1e2676001b2d8fcefd92bdd79aad85 on -mainline, and the resulting kernel doesn't panic. It doesn't receive input from any USB devices, but I think that's probably unrelated. :) Following that, I reset to -mainline, then went through the four changes of the commit and tried reverting them individually. Reverting three of them does nothing, but reverting the change to drivers/gpu/drm/nouveau/nvkm/subdev/therm/fan.c does prevent the panic.
Created attachment 131704 [details] [review] test fix Can you give this patch a try please? I'm not 100% convinced this is the issue here, but I'd like to rule it in/out. I have identical hardware to a couple of the other reporters of this issue, but have been completely unable to reproduce for unknown reasons.
(In reply to Ben Skeggs from comment #6) > Created attachment 131704 [details] [review] [review] > test fix > > Can you give this patch a try please? > > I'm not 100% convinced this is the issue here, but I'd like to rule it > in/out. I have identical hardware to a couple of the other reporters of > this issue, but have been completely unable to reproduce for unknown reasons. I can reproduce the issue and I never got the crash with this patch, but I also only tried a few times. Maybe if you ask others with that issue to try it out you get enough confirmations?
*** Bug 101273 has been marked as a duplicate of this bug. ***
(In reply to Ben Skeggs from comment #6) > Created attachment 131704 [details] [review] [review] > test fix > > Can you give this patch a try please? I built against -mainline just now. Without the patch, I get the panic, and with the patch, I get no panic. I'll call that successful.
Hello, I have test the patch for linux 4.11.3 with the driver xf86-video-nouveau 1.0.15 and now my system starts without a kernel panic. Without the patch only linux < 4.11.3 start correct.
I can also confirm that it fixed the kernel panics for me with the patch applied.
Using kernel 4.11.5 fixed this.
It appears the fix has been merged into mainline as b4e382ca7586a63b6c1e5221ce0863ff867c2df6 "drm/nouveau/tmr: fully separate alarm execution/pending lists". I can also confirm that unpatched -mainline now boots - thank you for the fix!
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.