Bug 100758

Summary: Kernel Oops when starting X nv50_dma_push
Product: xorg Reporter: Kevin Winchester <kjwinchester>
Component: Driver/nouveauAssignee: Nouveau Project <nouveau>
Status: RESOLVED MOVED QA Contact: Xorg Project Team <xorg-team>
Severity: normal    
Priority: medium CC: kjwinchester
Version: unspecified   
Hardware: x86-64 (AMD64)   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:
Attachments:
Description Flags
Complete boot log
none
Xorg log none

Description Kevin Winchester 2017-04-22 14:58:04 UTC
Created attachment 130981 [details]
Complete boot log

Apr 22 11:38:11 karpov kernel: BUG: unable to handle kernel paging request at 00000000ffffffff
Apr 22 11:38:11 karpov kernel: IP: kmem_cache_alloc_trace+0x78/0x1b0
Apr 22 11:38:11 karpov kernel: PGD 1ac726067
Apr 22 11:38:12 karpov kernel: PUD 0
Apr 22 11:38:12 karpov kernel:
Apr 22 11:38:12 karpov kernel: Oops: 0000 [#1] PREEMPT SMP
Apr 22 11:38:12 karpov kernel: Modules linked in: ax25 arc4 snd_hda_codec_hdmi brcmsmac cordic brcmutil b43 mac80211 intel_powerclamp cfg80211 coretemp ssb kvm_intel snd_hda_codec_idt rng_core pcmcia snd_hda_codec_generic pcmcia_core uvcvideo kvm snd_hda_intel irqb
ypass videobuf2_vmalloc iTCO_wdt videobuf2_memops iTCO_vendor_support videobuf2_v4l2 i7core_edac snd_hda_codec videobuf2_core ir_lirc_codec r8169 edac_core crc32c_intel lirc_dev psmouse ir_rc6_decoder hp_wmi input_leds sparse_keymap mii snd_hda_core snd_hwdep rfkil
l snd_pcm videodev snd_timer snd pcspkr joydev intel_cstate bcma rc_rc6_mce i2c_i801 mousedev media jmb38x_ms memstick fjes soundcore lpc_ich acpi_cpufreq shpchp ene_ir tpm_tis tpm_tis_core ac tpm rc_core battery thermal evdev hp_accel lis3lv02d input_polldev mac_h
id sch_fq_codel ip_tables x_tables
Apr 22 11:38:12 karpov kernel:  hid_generic ext4 crc16 jbd2 fscrypto usbhid mbcache hid sr_mod cdrom sd_mod ahci libahci sdhci_pci ehci_pci serio_raw libata sdhci ehci_hcd atkbd libps2 firewire_ohci scsi_mod firewire_core usbcore mmc_core crc_itu_t usb_common i8042
 serio nouveau button video led_class mxm_wmi wmi i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm
Apr 22 11:38:12 karpov kernel: CPU: 3 PID: 593 Comm: Xorg Not tainted 4.10.11-1-ARCH #1
Apr 22 11:38:12 karpov kernel: Hardware name: Hewlett-Packard HP Pavilion dv7 Notebook PC/365C, BIOS F.1D 10/05/2010
Apr 22 11:38:12 karpov kernel: task: ffff8801b7f8b400 task.stack: ffffc900015b4000
Apr 22 11:38:12 karpov kernel: RIP: 0010:kmem_cache_alloc_trace+0x78/0x1b0
Apr 22 11:38:12 karpov kernel: RSP: 0018:ffffc900015b7b98 EFLAGS: 00010206
Apr 22 11:38:12 karpov kernel: RAX: 0000000000000000 RBX: ffff8801b7087400 RCX: 000000000002d783
Apr 22 11:38:12 karpov kernel: RDX: 000000000002d703 RSI: 000000000002d703 RDI: 000000000001b660
Apr 22 11:38:12 karpov kernel: RBP: ffffc900015b7bd0 R08: 00000000ffffffff R09: ffffffffa01d4c0f
Apr 22 11:38:12 karpov kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 00000000014080c0
Apr 22 11:38:12 karpov kernel: R13: ffff8801b9003540 R14: ffff8801b7087400 R15: ffff8801b9003540
Apr 22 11:38:12 karpov kernel: FS:  00007f3a101b9940(0000) GS:ffff8801bfcc0000(0000) knlGS:0000000000000000
Apr 22 11:38:12 karpov kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 22 11:38:12 karpov kernel: CR2: 00000000ffffffff CR3: 00000001aa1e1000 CR4: 00000000000006e0
Apr 22 11:38:12 karpov kernel: Call Trace:
Apr 22 11:38:12 karpov kernel:  nouveau_fence_new+0x4f/0xb0 [nouveau]
Apr 22 11:38:12 karpov kernel:  ? nv50_dma_push+0xc4/0x100 [nouveau]
Apr 22 11:38:12 karpov kernel:  nouveau_gem_ioctl_pushbuf+0xe7e/0x1610 [nouveau]
Apr 22 11:38:12 karpov kernel:  drm_ioctl+0x21b/0x4c0 [drm]
Apr 22 11:38:12 karpov kernel:  ? ___sys_recvmsg+0x160/0x1f0
Apr 22 11:38:12 karpov kernel:  ? nouveau_gem_ioctl_new+0x150/0x150 [nouveau]
Apr 22 11:38:12 karpov kernel:  ? do_readv_writev+0x151/0x240
Apr 22 11:38:12 karpov kernel:  nouveau_drm_ioctl+0x74/0xc0 [nouveau]
Apr 22 11:38:12 karpov kernel:  do_vfs_ioctl+0xa3/0x5f0
Apr 22 11:38:12 karpov kernel:  ? __fget+0x77/0xb0
Apr 22 11:38:12 karpov kernel:  SyS_ioctl+0x79/0x90
Apr 22 11:38:12 karpov kernel:  entry_SYSCALL_64_fastpath+0x1a/0xa9
Apr 22 11:38:12 karpov kernel: RIP: 0033:0x7f3a0e00f0d7
Apr 22 11:38:12 karpov kernel: RSP: 002b:00007ffc7aeb7728 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Apr 22 11:38:12 karpov kernel: RAX: ffffffffffffffda RBX: 0000000002627330 RCX: 00007f3a0e00f0d7
Apr 22 11:38:12 karpov kernel: RDX: 00007ffc7aeb7790 RSI: 00000000c0406481 RDI: 000000000000000b
Apr 22 11:38:12 karpov kernel: RBP: 00007ffc7aeb7460 R08: 0000000000000000 R09: 000000000000001c
Apr 22 11:38:12 karpov kernel: R10: 000000000265d820 R11: 0000000000000246 R12: 0000000040086409
Apr 22 11:38:12 karpov kernel: R13: 000000000000000b R14: 0000000002bc5ff0 R15: 00000000026331e0
Apr 22 11:38:12 karpov kernel: Code: 48 39 f2 75 e7 48 83 78 10 00 4c 8b 00 0f 84 02 01 00 00 4d 85 c0 0f 84 f9 00 00 00 49 63 45 20 49 8b 7d 00 48 8d 8a 80 00 00 00 <49> 8b 1c 00 4c 89 c0 65 48 0f c7 0f 0f 94 c0 84 c0 74 aa 49 63
Apr 22 11:38:12 karpov kernel: RIP: kmem_cache_alloc_trace+0x78/0x1b0 RSP: ffffc900015b7b98
Apr 22 11:38:12 karpov kernel: CR2: 00000000ffffffff
Apr 22 11:38:13 karpov kernel: ---[ end trace 153927933e455cb4 ]---
Comment 1 Kevin Winchester 2017-04-22 15:01:50 UTC
Versions:

Linux karpov 4.10.11-1-ARCH #1 SMP PREEMPT Tue Apr 18 08:39:42 CEST 2017 x86_64 GNU/Linux


core/linux 4.10.11-1
extra/xorg-server 1.19.3-2
extra/xf86-video-nouveau 1.0.15-1
extra/libdrm 2.4.80-1
extra/mesa 17.0.4-2
Comment 2 Kevin Winchester 2017-04-22 15:05:25 UTC
Created attachment 130982 [details]
Xorg log
Comment 3 Kevin Winchester 2017-04-22 15:06:51 UTC
01:00.0 VGA compatible controller: NVIDIA Corporation GT216M [GeForce GT 230M] (rev a2) (prog-if 00 [VGA controller])
        Subsystem: Hewlett-Packard Company Device 365c
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 29
        Region 0: Memory at d2000000 (32-bit, non-prefetchable) [size=16M]
        Region 1: Memory at c0000000 (64-bit, prefetchable) [size=256M]
        Region 3: Memory at d0000000 (64-bit, prefetchable) [size=32M]
        Region 5: I/O ports at 6000 [size=128]
        Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: <access denied>
        Kernel driver in use: nouveau
        Kernel modules: nouveau
Comment 4 Kevin Winchester 2017-04-22 15:08:37 UTC
[kevin@karpov ~]$ cat /etc/X11/xorg.conf.d/20-nouveau.conf

Section "Monitor"
          Identifier   "LVDS"
          Option "PreferredMode" "1600x900_60.0"
          Option "RightOf" "VGA"
EndSection

Section "Monitor"
          Identifier   "VGA"
          Option "PreferredMode" "1680x1050_59.9"
EndSection

Section "Device"
    Identifier "nvidia"
    Driver "nouveau"
    Option  "Monitor-LVDS-1" "LVDS"
    Option  "Monitor-VGA-1" "VGA"
EndSection
Comment 5 Kevin Winchester 2017-04-22 15:14:28 UTC
After this crash, I rebooted and I'm currently running sway on wayland and things seem to work well, but I've been having trouble with X for weeks now.

I first noticed issues when the following messages started appearing in my boot logs:

nouveau 0000:01:00.0: bios: OOB 1 015f1901 015f1901
nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8
nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8
nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8

I have a dual monitor setup and I was always running with no specific xorg.config files, just the following in my .xinitrc file:

xrandr --output LVDS-1 --preferred
xrandr --output HDMI-1 --preferred --primary --left-of LVDS-1

But suddenly after some updates, that was resulting in the same content being displayed on both screens, but I can still move my mouse cursor over onto the second screen.

I put in the xorg.conf file that I used to use (already pasted in) and it seemed to improve the situation, but there were still issues. The same content would still appear on both screens, but once I let the xfce power management shut off the monitors and then woke the computer back up, the screens would work correctly again.

Then this I installed a new kernel and nouveau driver, and after starting X everything froze and I found the Oops in my logs.
Comment 6 Ilia Mirkin 2017-04-22 15:45:33 UTC
(In reply to Kevin Winchester from comment #5)
> After this crash, I rebooted and I'm currently running sway on wayland and
> things seem to work well, but I've been having trouble with X for weeks now.
> 
> I first noticed issues when the following messages started appearing in my
> boot logs:
> 
> nouveau 0000:01:00.0: bios: OOB 1 015f1901 015f1901
> nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8
> nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8
> nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8

Those are "normal". By which I mean harmless. Should go away in some updated kernel.

> 
> I have a dual monitor setup and I was always running with no specific
> xorg.config files, just the following in my .xinitrc file:
> 
> xrandr --output LVDS-1 --preferred
> xrandr --output HDMI-1 --preferred --primary --left-of LVDS-1
> 
> But suddenly after some updates, that was resulting in the same content
> being displayed on both screens, but I can still move my mouse cursor over
> onto the second screen.
> 
> I put in the xorg.conf file that I used to use (already pasted in) and it
> seemed to improve the situation, but there were still issues. The same
> content would still appear on both screens, but once I let the xfce power
> management shut off the monitors and then woke the computer back up, the
> screens would work correctly again.
> 
> Then this I installed a new kernel and nouveau driver, and after starting X
> everything froze and I found the Oops in my logs.

Looks like your crash happened with 4.10.11. 4.10.12 includes a backport of a very important fix, https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable.git/commit/?id=2efa4bd3b644f115befbd225f45fce936e19cf2c , which I believe accounts for the crash that you saw.
Comment 7 Kevin Winchester 2017-04-22 16:11:06 UTC
Thank you, I'm going to try 4.11-rc7 which is available in the Arch AUR.
Comment 8 Kevin Winchester 2017-04-23 19:10:43 UTC
The updated kernel fixed the Oops, but I still see the incorrect dual screen behaviour until I wait for the screens to enter power saving mode and then wake them up.

Is there any additional debugging info I can provide that might help?
Comment 9 Martin Peres 2019-12-04 09:27:17 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/xorg/driver/xf86-video-nouveau/issues/344.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.