|Summary:||[regression] eaa2c988033b breaks GTX1050TI (GP107 / NV137)|
|Product:||xorg||Reporter:||Carlo Caione <carlo>|
|Component:||Driver/nouveau||Assignee:||Nouveau Project <nouveau>|
|Status:||RESOLVED DUPLICATE||QA Contact:||Xorg Project Team <xorg-team>|
Description Carlo Caione 2017-07-12 10:07:03 UTC
Created attachment 132629
GP107_regression_oops_log

Hi,
we have verified that commit eaa2c988033b ("drm/nouveau/gr/gp107: initial support") is actually breaking the GTX1050TI on an Acer laptop.
It's not possible to reach Xorg anymore and the journal is filled with Oops as shown in the attachment.

Thanks,
Comment 1 Karol Herbst 2017-07-12 10:22:05 UTC
(In reply to Carlo Caione from comment #0)
> Created attachment 132629
> GP107_regression_oops_log
>
> Hi,
> we have verified that commit eaa2c988033b ("drm/nouveau/gr/gp107: initial
> support") is actually breaking the GTX1050TI on an Acer laptop.
> It's not possible to reach Xorg anymore and the journal is filled with Oops
> as shown in the attachment.
>
> Thanks,

That commit added initial acceleration support for GP107 GPUs, which means there are more places where things can go wrong. As a workaround, you can boot with nouveau.modeset=0 or nouveau.noaccel=1. The real issue is something else, but this should at least give you a working setup again.
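To confirm that one of the workaround options from this comment actually reached the kernel, the boot command line can be inspected. The following is only a minimal sketch: the helper name `nouveau_workaround_flags` is hypothetical, and it assumes a Linux system where the running kernel's command line is exposed at /proc/cmdline.

```python
# Option strings taken from the comment above.
WORKAROUND_OPTIONS = ("nouveau.modeset=0", "nouveau.noaccel=1")


def nouveau_workaround_flags(cmdline: str) -> dict:
    """Report which workaround options appear as whole tokens in a kernel command line.

    Hypothetical helper for illustration; it does not query the nouveau module itself.
    """
    tokens = cmdline.split()
    return {option: option in tokens for option in WORKAROUND_OPTIONS}


if __name__ == "__main__":
    # Assumption: /proc/cmdline is readable (true on typical Linux systems).
    try:
        with open("/proc/cmdline", encoding="utf-8") as handle:
            print(nouveau_workaround_flags(handle.read()))
    except OSError as error:
        print(f"could not read /proc/cmdline: {error}")
```

Matching on whole tokens avoids treating a similar-looking option such as nouveau.modeset=01 as a hit.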
Comment 2 Carlo Caione 2017-07-12 10:31:25 UTC
> The real issue is something else, but this should give you at least
> a working setup again.

Thank you, but since I have the hardware available for the next week or so, I would really like to help debug this issue, if possible, and maybe come up with a proper fix :)

Let me know if there is anything else I can provide that would help.

Thanks.
Comment 3 Ilia Mirkin 2017-07-12 14:05:58 UTC
Jul 11 16:14:59 endless kernel: nouveau 0000:01:00.0: bus: MMIO read of 00000000 FAULT at 409800 [ TIMEOUT ]
Jul 11 16:15:01 endless kernel: ------------[ cut here ]------------
Jul 11 16:15:01 endless kernel: WARNING: CPU: 1 PID: 11134 at /usr/src/packages/BUILD/linux-4.11.0/drivers/gpu/drm/nouveau/nvkm/engine/gr/gf100.c:1499 gf100_gr_init_ctxctl_ext+0x3cc/0x7f0 [nouveau]
Jul 11 16:15:01 endless kernel: nouveau 0000:01:00.0: timeout

So it's having trouble loading the firmware. Please confirm that you have the latest firmware for gp107 (as there was a window of time when linux-firmware contained bad firmware for gp107 -- some time ago now already, but you could have a bad snapshot).

I believe I've seen this before though, also on a laptop. I see that it's using the fancy new thing instead of ACPI DSM for powering it on:

Jul 11 16:14:58 endless kernel: VGA switcheroo: detected Optimus DSM method \_SB_.PCI0.PEG0.PEGP handle
Jul 11 16:14:58 endless kernel: nouveau: detected PR support, will not use DSM

Can you try to boot in a way that forces the ACPI stuff to be used? I think pcie_port_pm=off is the thing. And nouveau.runpm=0 for good measure.
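When a journal contains many of these messages, it can help to pull out just the MMIO fault address and reason. The following is a minimal sketch, not part of nouveau or any existing tool: the function name `parse_mmio_faults` is hypothetical, and the regular expression assumes the message shape seen in the log lines above (with "write" faults assumed to follow the same shape as "read").

```python
import re

# Pattern modelled on the log line quoted above; the "write" alternative and
# the optional reason bracket are assumptions about other messages.
MMIO_FAULT = re.compile(
    r"nouveau (?P<pci>[0-9a-f:.]+): bus: MMIO (?P<op>read|write) of "
    r"(?P<value>[0-9a-f]{8}) FAULT at (?P<addr>[0-9a-f]+)"
    r"(?: \[ (?P<reason>[^\]]*?) \])?"
)


def parse_mmio_faults(log_text: str) -> list:
    """Return one dict per MMIO fault message found in the given log text."""
    faults = []
    for match in MMIO_FAULT.finditer(log_text):
        faults.append(
            {
                "pci": match.group("pci"),
                "op": match.group("op"),
                "addr": int(match.group("addr"), 16),  # register offset as an integer
                "reason": match.group("reason"),  # None if no bracketed reason is present
            }
        )
    return faults


if __name__ == "__main__":
    sample = (
        "Jul 11 16:14:59 endless kernel: nouveau 0000:01:00.0: bus: "
        "MMIO read of 00000000 FAULT at 409800 [ TIMEOUT ]"
    )
    for fault in parse_mmio_faults(sample):
        print(f"{fault['pci']} {fault['op']} fault at {fault['addr']:#x}: {fault['reason']}")
```

Grouping the results by address or reason makes it easier to see whether different boot options change which registers fault.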
Comment 4 Ilia Mirkin 2017-07-12 14:06:32 UTC
Comment 5 Carlo Caione 2017-07-12 15:44:42 UTC
> So it's having trouble loading the firmware. Please confirm that you have the
> latest firmware for gp107 (as there was a window of time when linux-firmware
> contained bad firmware for gp107 -- some time ago now already, but you could
> have a bad snapshot).

The firmware is the latest one shipped in the linux-firmware git repo.

>>> nouveau.noaccel=1 (see attachment)
In this case the nouveau module is probed correctly and I do not get any WARNING in the journal (even though it is a bit noisy on the PM part).

>>> only pcie_port_pm=off
The system hangs before reaching Xorg.

>>> nouveau.runpm=0 (with or without pcie_port_pm=off) (see attachment)
Interestingly, I can still reach Xorg, but the journal contains a lot of WARNINGs.
Comment 8 Rhys Kidd 2017-07-12 16:56:23 UTC
Looks to be a duplicate of https://bugs.freedesktop.org/show_bug.cgi?id=100228

Was going to mark as such, but wanted to hear the thoughts of imirkin or others more experienced with nouveau first.
Comment 9 Carlo Caione 2017-07-17 09:52:04 UTC
In the log I have:

nouveau 0000:01:00.0: bus: MMIO read of 00000000 FAULT at 10f200 [ IBUS ]