Bug 100758 - Kernel Oops when starting X nv50_dma_push
Summary: Kernel Oops when starting X nv50_dma_push
Status: NEW
Alias: None
Product: xorg
Classification: Unclassified
Component: Driver/nouveau (show other bugs)
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Nouveau Project
QA Contact: Xorg Project Team
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-04-22 14:58 UTC by Kevin Winchester
Modified: 2017-04-23 19:10 UTC (History)
1 user (show)

See Also:
i915 platform:
i915 features:


Attachments
Complete boot log (91.63 KB, text/plain)
2017-04-22 14:58 UTC, Kevin Winchester
no flags Details
Xorg log (41.62 KB, text/x-log)
2017-04-22 15:05 UTC, Kevin Winchester
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Kevin Winchester 2017-04-22 14:58:04 UTC
Created attachment 130981 [details]
Complete boot log

Apr 22 11:38:11 karpov kernel: BUG: unable to handle kernel paging request at 00000000ffffffff
Apr 22 11:38:11 karpov kernel: IP: kmem_cache_alloc_trace+0x78/0x1b0
Apr 22 11:38:11 karpov kernel: PGD 1ac726067
Apr 22 11:38:12 karpov kernel: PUD 0
Apr 22 11:38:12 karpov kernel:
Apr 22 11:38:12 karpov kernel: Oops: 0000 [#1] PREEMPT SMP
Apr 22 11:38:12 karpov kernel: Modules linked in: ax25 arc4 snd_hda_codec_hdmi brcmsmac cordic brcmutil b43 mac80211 intel_powerclamp cfg80211 coretemp ssb kvm_intel snd_hda_codec_idt rng_core pcmcia snd_hda_codec_generic pcmcia_core uvcvideo kvm snd_hda_intel irqb
ypass videobuf2_vmalloc iTCO_wdt videobuf2_memops iTCO_vendor_support videobuf2_v4l2 i7core_edac snd_hda_codec videobuf2_core ir_lirc_codec r8169 edac_core crc32c_intel lirc_dev psmouse ir_rc6_decoder hp_wmi input_leds sparse_keymap mii snd_hda_core snd_hwdep rfkil
l snd_pcm videodev snd_timer snd pcspkr joydev intel_cstate bcma rc_rc6_mce i2c_i801 mousedev media jmb38x_ms memstick fjes soundcore lpc_ich acpi_cpufreq shpchp ene_ir tpm_tis tpm_tis_core ac tpm rc_core battery thermal evdev hp_accel lis3lv02d input_polldev mac_h
id sch_fq_codel ip_tables x_tables
Apr 22 11:38:12 karpov kernel:  hid_generic ext4 crc16 jbd2 fscrypto usbhid mbcache hid sr_mod cdrom sd_mod ahci libahci sdhci_pci ehci_pci serio_raw libata sdhci ehci_hcd atkbd libps2 firewire_ohci scsi_mod firewire_core usbcore mmc_core crc_itu_t usb_common i8042
 serio nouveau button video led_class mxm_wmi wmi i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm
Apr 22 11:38:12 karpov kernel: CPU: 3 PID: 593 Comm: Xorg Not tainted 4.10.11-1-ARCH #1
Apr 22 11:38:12 karpov kernel: Hardware name: Hewlett-Packard HP Pavilion dv7 Notebook PC/365C, BIOS F.1D 10/05/2010
Apr 22 11:38:12 karpov kernel: task: ffff8801b7f8b400 task.stack: ffffc900015b4000
Apr 22 11:38:12 karpov kernel: RIP: 0010:kmem_cache_alloc_trace+0x78/0x1b0
Apr 22 11:38:12 karpov kernel: RSP: 0018:ffffc900015b7b98 EFLAGS: 00010206
Apr 22 11:38:12 karpov kernel: RAX: 0000000000000000 RBX: ffff8801b7087400 RCX: 000000000002d783
Apr 22 11:38:12 karpov kernel: RDX: 000000000002d703 RSI: 000000000002d703 RDI: 000000000001b660
Apr 22 11:38:12 karpov kernel: RBP: ffffc900015b7bd0 R08: 00000000ffffffff R09: ffffffffa01d4c0f
Apr 22 11:38:12 karpov kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 00000000014080c0
Apr 22 11:38:12 karpov kernel: R13: ffff8801b9003540 R14: ffff8801b7087400 R15: ffff8801b9003540
Apr 22 11:38:12 karpov kernel: FS:  00007f3a101b9940(0000) GS:ffff8801bfcc0000(0000) knlGS:0000000000000000
Apr 22 11:38:12 karpov kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 22 11:38:12 karpov kernel: CR2: 00000000ffffffff CR3: 00000001aa1e1000 CR4: 00000000000006e0
Apr 22 11:38:12 karpov kernel: Call Trace:
Apr 22 11:38:12 karpov kernel:  nouveau_fence_new+0x4f/0xb0 [nouveau]
Apr 22 11:38:12 karpov kernel:  ? nv50_dma_push+0xc4/0x100 [nouveau]
Apr 22 11:38:12 karpov kernel:  nouveau_gem_ioctl_pushbuf+0xe7e/0x1610 [nouveau]
Apr 22 11:38:12 karpov kernel:  drm_ioctl+0x21b/0x4c0 [drm]
Apr 22 11:38:12 karpov kernel:  ? ___sys_recvmsg+0x160/0x1f0
Apr 22 11:38:12 karpov kernel:  ? nouveau_gem_ioctl_new+0x150/0x150 [nouveau]
Apr 22 11:38:12 karpov kernel:  ? do_readv_writev+0x151/0x240
Apr 22 11:38:12 karpov kernel:  nouveau_drm_ioctl+0x74/0xc0 [nouveau]
Apr 22 11:38:12 karpov kernel:  do_vfs_ioctl+0xa3/0x5f0
Apr 22 11:38:12 karpov kernel:  ? __fget+0x77/0xb0
Apr 22 11:38:12 karpov kernel:  SyS_ioctl+0x79/0x90
Apr 22 11:38:12 karpov kernel:  entry_SYSCALL_64_fastpath+0x1a/0xa9
Apr 22 11:38:12 karpov kernel: RIP: 0033:0x7f3a0e00f0d7
Apr 22 11:38:12 karpov kernel: RSP: 002b:00007ffc7aeb7728 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Apr 22 11:38:12 karpov kernel: RAX: ffffffffffffffda RBX: 0000000002627330 RCX: 00007f3a0e00f0d7
Apr 22 11:38:12 karpov kernel: RDX: 00007ffc7aeb7790 RSI: 00000000c0406481 RDI: 000000000000000b
Apr 22 11:38:12 karpov kernel: RBP: 00007ffc7aeb7460 R08: 0000000000000000 R09: 000000000000001c
Apr 22 11:38:12 karpov kernel: R10: 000000000265d820 R11: 0000000000000246 R12: 0000000040086409
Apr 22 11:38:12 karpov kernel: R13: 000000000000000b R14: 0000000002bc5ff0 R15: 00000000026331e0
Apr 22 11:38:12 karpov kernel: Code: 48 39 f2 75 e7 48 83 78 10 00 4c 8b 00 0f 84 02 01 00 00 4d 85 c0 0f 84 f9 00 00 00 49 63 45 20 49 8b 7d 00 48 8d 8a 80 00 00 00 <49> 8b 1c 00 4c 89 c0 65 48 0f c7 0f 0f 94 c0 84 c0 74 aa 49 63
Apr 22 11:38:12 karpov kernel: RIP: kmem_cache_alloc_trace+0x78/0x1b0 RSP: ffffc900015b7b98
Apr 22 11:38:12 karpov kernel: CR2: 00000000ffffffff
Apr 22 11:38:13 karpov kernel: ---[ end trace 153927933e455cb4 ]---
Comment 1 Kevin Winchester 2017-04-22 15:01:50 UTC
Versions:

Linux karpov 4.10.11-1-ARCH #1 SMP PREEMPT Tue Apr 18 08:39:42 CEST 2017 x86_64 GNU/Linux


core/linux 4.10.11-1
extra/xorg-server 1.19.3-2
extra/xf86-video-nouveau 1.0.15-1
extra/libdrm 2.4.80-1
extra/mesa 17.0.4-2
Comment 2 Kevin Winchester 2017-04-22 15:05:25 UTC
Created attachment 130982 [details]
Xorg log
Comment 3 Kevin Winchester 2017-04-22 15:06:51 UTC
01:00.0 VGA compatible controller: NVIDIA Corporation GT216M [GeForce GT 230M] (rev a2) (prog-if 00 [VGA controller])
        Subsystem: Hewlett-Packard Company Device 365c
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 29
        Region 0: Memory at d2000000 (32-bit, non-prefetchable) [size=16M]
        Region 1: Memory at c0000000 (64-bit, prefetchable) [size=256M]
        Region 3: Memory at d0000000 (64-bit, prefetchable) [size=32M]
        Region 5: I/O ports at 6000 [size=128]
        Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: <access denied>
        Kernel driver in use: nouveau
        Kernel modules: nouveau
Comment 4 Kevin Winchester 2017-04-22 15:08:37 UTC
[kevin@karpov ~]$ cat /etc/X11/xorg.conf.d/20-nouveau.conf

Section "Monitor"
          Identifier   "LVDS"
          Option "PreferredMode" "1600x900_60.0"
          Option "RightOf" "VGA"
EndSection

Section "Monitor"
          Identifier   "VGA"
          Option "PreferredMode" "1680x1050_59.9"
EndSection

Section "Device"
    Identifier "nvidia"
    Driver "nouveau"
    Option  "Monitor-LVDS-1" "LVDS"
    Option  "Monitor-VGA-1" "VGA"
EndSection
Comment 5 Kevin Winchester 2017-04-22 15:14:28 UTC
After this crash, I rebooted and I'm currently running sway on wayland and things seem to work well, but I've been having trouble with X for weeks now.

I first noticed issues when the following messages started appearing in my boot logs:

nouveau 0000:01:00.0: bios: OOB 1 015f1901 015f1901
nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8
nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8
nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8

I have a dual monitor setup and I was always running with no specific xorg.config files, just the following in my .xinitrc file:

xrandr --output LVDS-1 --preferred
xrandr --output HDMI-1 --preferred --primary --left-of LVDS-1

But suddenly after some updates, that was resulting in the same content being displayed on both screens, but I can still move my mouse cursor over onto the second screen.

I put in the xorg.conf file that I used to use (already pasted in) and it seemed to improve the situation, but there were still issues. The same content would still appear on both screens, but once I let the xfce power management shut off the monitors and then woke the computer back up, the screens would work correctly again.

Then this I installed a new kernel and nouveau driver, and after starting X everything froze and I found the Oops in my logs.
Comment 6 Ilia Mirkin 2017-04-22 15:45:33 UTC
(In reply to Kevin Winchester from comment #5)
> After this crash, I rebooted and I'm currently running sway on wayland and
> things seem to work well, but I've been having trouble with X for weeks now.
> 
> I first noticed issues when the following messages started appearing in my
> boot logs:
> 
> nouveau 0000:01:00.0: bios: OOB 1 015f1901 015f1901
> nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8
> nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8
> nouveau 0000:01:00.0: bios: OOB 1 b90900b8 b90900b8

Those are "normal". By which I mean harmless. Should go away in some updated kernel.

> 
> I have a dual monitor setup and I was always running with no specific
> xorg.config files, just the following in my .xinitrc file:
> 
> xrandr --output LVDS-1 --preferred
> xrandr --output HDMI-1 --preferred --primary --left-of LVDS-1
> 
> But suddenly after some updates, that was resulting in the same content
> being displayed on both screens, but I can still move my mouse cursor over
> onto the second screen.
> 
> I put in the xorg.conf file that I used to use (already pasted in) and it
> seemed to improve the situation, but there were still issues. The same
> content would still appear on both screens, but once I let the xfce power
> management shut off the monitors and then woke the computer back up, the
> screens would work correctly again.
> 
> Then this I installed a new kernel and nouveau driver, and after starting X
> everything froze and I found the Oops in my logs.

Looks like your crash happened with 4.10.11. 4.10.12 includes a backport of a very important fix, https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable.git/commit/?id=2efa4bd3b644f115befbd225f45fce936e19cf2c , which I believe accounts for the crash that you saw.
Comment 7 Kevin Winchester 2017-04-22 16:11:06 UTC
Thank you, I'm going to try 4.11-rc7 which is available in the Arch AUR.
Comment 8 Kevin Winchester 2017-04-23 19:10:43 UTC
The updated kernel fixed the Oops, but I still see the incorrect dual screen behaviour until I wait for the screens to enter power saving mode and then wake them up.

Is there any additional debugging info I can provide that might help?


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.