Summary: | kernel panic via the chromium browser | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | xorg | Reporter: | akayn <yena0xc5> | ||||||
Component: | Driver/nouveau | Assignee: | Nouveau Project <nouveau> | ||||||
Status: | RESOLVED MOVED | QA Contact: | Xorg Project Team <xorg-team> | ||||||
Severity: | critical | ||||||||
Priority: | medium | CC: | fdsfgs, mchandras, nouveau, yena0xc5 | ||||||
Version: | unspecified | ||||||||
Hardware: | Other | ||||||||
OS: | All | ||||||||
Whiteboard: | |||||||||
i915 platform: | i915 features: | ||||||||
Attachments: |
|
Description
akayn
2017-11-11 22:40:21 UTC
Which kernel version does your Ubuntu installation have? And with which GPU was it? I tried opening that page on my computer with Chromium, but it didn’t trigger a page fault. This is what I got instead: [Nov12 12:40] nouveau 0000:04:00.0: Xorg[394]: fail ttm_validate [ +0.000006] nouveau 0000:04:00.0: Xorg[394]: validating bo list [ +0.000006] nouveau 0000:04:00.0: Xorg[394]: validate: -12 [ +1.124277] nouveau 0000:04:00.0: gr: TRAP ch 10 [007fa04000 Xorg[394]] [ +0.000014] nouveau 0000:04:00.0: gr: SHADER a244020e, sph: 0x44020e, stage: 0x22 [ +5.515843] nouveau 0000:04:00.0: gr: TRAP ch 10 [007fa04000 Xorg[394]] [ +0.000016] nouveau 0000:04:00.0: gr: SHADER a244020e, sph: 0x44020e, stage: 0x22 [Nov12 12:41] nouveau 0000:04:00.0: Xorg[394]: fail ttm_validate [ +0.000005] nouveau 0000:04:00.0: Xorg[394]: validating bo list [ +0.000005] nouveau 0000:04:00.0: Xorg[394]: validate: -12 [Nov12 12:42] nouveau 0000:04:00.0: gr: TRAP ch 10 [007fa04000 Xorg[394]] [ +0.000010] nouveau 0000:04:00.0: gr: SHADER a244020e, sph: 0x44020e, stage: 0x22 [Nov12 12:43] nouveau 0000:04:00.0: gr: TRAP ch 10 [007fa04000 Xorg[394]] [ +0.000009] nouveau 0000:04:00.0: gr: SHADER a244020e, sph: 0x44020e, stage: 0x22 [ +1.273289] nouveau 0000:04:00.0: gr: TRAP ch 10 [007fa04000 Xorg[394]] [ +0.000015] nouveau 0000:04:00.0: gr: SHADER a244020e, sph: 0x44020e, stage: 0x22 [ +0.455464] nouveau 0000:04:00.0: gr: TRAP ch 10 [007fa04000 Xorg[394]] [ +0.000013] nouveau 0000:04:00.0: gr: SHADER a244020e, sph: 0x44020e, stage: 0x22 [ +0.518625] nouveau 0000:04:00.0: gr: TRAP ch 10 [007fa04000 Xorg[394]] [ +0.000014] nouveau 0000:04:00.0: gr: SHADER a244020e, sph: 0x44020e, stage: 0x22 GPU: GM206 Kernel: 4.13.12 Xorg: 1.19.5 If you're using anything but the latest software, you need to report this issue to the people relesing the software. This is the upstream bugtracker, which only deals in latest versions. 4.10 was released 6+ months ago, and who knows what horrid things are in the Ubuntu tree. Similarly, you should ensure you have the very latest mesa. Lastly, you have provided no evidence of a page fault, just a WARN somewhere in the code. (Which isn't great, but it's quite different.) hello look, the complete details about this incident including the kernel ver etc, are given. this is the kernel log first it warns and then it will panic the entire system, this is the fault: Nov 10 11:24:37 nitro kernel: [ 196.887267] nouveau 0000:01:00.0: fifo: read fault at 002b8c0000 engine 00 [PGRAPH] client 10 [] reason 02 [PAGE_NOT_PRESENT] on channel 8 [003f986000 chromium-browse[2658]] its clear that the driver try to read a non existing page. i can reproduce it every single time. if you cannot reproduce, then i would be very happy, becouse this would mean that it is only my own personal problem and not anything generic. i have tested it on latest ubuntu installation with your drivers, i dont know who should i submit this bug to if not you. what additional information should i give you? if you cannot reproduce with chromium, you may reproduce with chromium-asan that can be downloaded from here: https://commondatastorage.googleapis.com/chromium-browser-asan/index.html?prefix=linux-release/ download the latest. regards. It's a PTE error on the GPU (which has its own MMU), not on the CPU. It's trivial for (buggy) userspace to trigger these. additionally i will provide a better reproducer. Created attachment 135431 [details]
better reprodction
Nov 10 12:17:28 nitro kernel: [ 1612.249746] nouveau 0000:01:00.0:
chromium-browse[2557]:
fail ttm_validate
Nov 10 12:17:28 nitro kernel: [ 1612.249752] nouveau 0000:01:00.0:
chromium-browse[2557]: validating bo list
Nov
10 12:17:28 nitro kernel: [ 1612.249756] nouveau 0000:01:00.0:
chromium-browse[2557]: validate: -12
Nov 10 12:17:28 nitro kernel:
[ 1612.263405] nouveau 0000:01:00.0: gr: TRAP ch 9
[003f8f8000 chromium-browse[2557]]
Nov 10 12:17:28 nitro kernel:
[ 1612.263418] nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX:
80000049
Nov 10 12:17:28 nitro kernel: [ 1612.263423]
nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000049
Nov 10 12:17:28 nitro kernel:
[ 1612.263435] nouveau 0000:01:00.0: fifo: read fault at 004f4a4000 engine 00 [PGRAPH]
client 04 [GPC0/] reason 02 [PAGE_NOT_PRESENT] on channel 9 [003f8f8000 chromium-browse[2557]]
I also have this problem on openSUSE tumbleweed using the following components kernel 4.16.5 nouveau 1.0.15 gnome 3.28 May 03 08:13:52 aegean kernel: WARNING: CPU: 7 PID: 24573 at ../drivers/gpu/drm/nouveau/nouveau_bo.c:1291 nouveau_bo_move_ntfy+0xc9/0xd0 [nouveau] May 03 08:13:52 aegean kernel: Modules linked in: fuse nfsv3 nfs_acl nfs lockd grace fscache af_packet xt_CHECKSUM tun devlink ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype ip6t_rpfilter ip6t_R May 03 08:13:52 aegean kernel: snd_hda_codec_generic hid_generic usbhid intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm snd_hda_intel snd_hda_codec irqbypass crct10dif_pclmul crc32_pclmul crc32c_intel snd_ May 03 08:13:52 aegean kernel: vboxnetadp(O) vboxdrv(O) May 03 08:13:52 aegean kernel: CPU: 7 PID: 24573 Comm: kworker/u16:20 Tainted: G O 4.16.4-1-default #1 openSUSE Tumbleweed (unreleased) May 03 08:13:52 aegean kernel: Hardware name: Dell Inc. Precision Tower 5810/0HHV7N, BIOS A25 02/02/2018 May 03 08:13:52 aegean kernel: Workqueue: events_unbound async_run_entry_fn May 03 08:13:52 aegean kernel: RIP: 0010:nouveau_bo_move_ntfy+0xc9/0xd0 [nouveau] May 03 08:13:52 aegean kernel: RSP: 0018:ffffafdd41fd7b50 EFLAGS: 00010286 May 03 08:13:52 aegean kernel: RAX: 00000000fffffff0 RBX: ffff9da45e156d00 RCX: 0000000000000000 May 03 08:13:52 aegean kernel: RDX: ffff9da17788a3a8 RSI: 0000000000000296 RDI: 0000000000000296 May 03 08:13:52 aegean kernel: RBP: ffff9da424ba1000 R08: 00004f6784b8e400 R09: 0000000000000000 May 03 08:13:52 aegean kernel: R10: 0000000000000000 R11: 00000000003d0900 R12: ffff9da424ba12f0 May 03 08:13:52 aegean kernel: R13: ffff9da14eb2de80 R14: ffffafdd41fd7d28 R15: ffffafdd41fd7c40 May 03 08:13:52 aegean kernel: FS: 0000000000000000(0000) GS:ffff9da46fdc0000(0000) knlGS:0000000000000000 May 03 08:13:52 aegean kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 May 03 08:13:52 aegean kernel: CR2: 000055a9ca2c2328 CR3: 000000010600a006 CR4: 00000000001606e0 May 03 08:13:52 aegean kernel: Call Trace: May 03 08:13:52 aegean kernel: ttm_bo_handle_move_mem+0x248/0x5b0 [ttm] May 03 08:13:52 aegean kernel: ttm_bo_evict+0x115/0x2f0 [ttm] May 03 08:13:52 aegean kernel: ? nv50_disp_atomic_commit_tail+0x94c/0x1150 [nouveau] May 03 08:13:52 aegean kernel: ? kmem_cache_alloc_node_trace+0x19c/0x580 May 03 08:13:52 aegean kernel: ttm_mem_evict_first+0x190/0x200 [ttm] May 03 08:13:52 aegean kernel: ttm_bo_force_list_clean+0x7e/0x140 [ttm] May 03 08:13:52 aegean kernel: ? pci_pm_thaw+0x80/0x80 May 03 08:13:52 aegean kernel: nouveau_do_suspend+0x7b/0x2a0 [nouveau] May 03 08:13:52 aegean kernel: pci_pm_freeze+0x55/0xc0 May 03 08:13:52 aegean kernel: dpm_run_callback+0x4d/0x170 May 03 08:13:52 aegean kernel: __device_suspend+0x12a/0x4a0 May 03 08:13:52 aegean kernel: ? dpm_show_time+0xd0/0xd0 May 03 08:13:52 aegean kernel: async_suspend+0x1a/0x90 May 03 08:13:52 aegean kernel: async_run_entry_fn+0x37/0x140 May 03 08:13:52 aegean kernel: process_one_work+0x1d4/0x3f0 May 03 08:13:52 aegean kernel: worker_thread+0x2b/0x3d0 May 03 08:13:52 aegean kernel: ? process_one_work+0x3f0/0x3f0 May 03 08:13:52 aegean kernel: kthread+0x113/0x130 May 03 08:13:52 aegean kernel: ? kthread_create_worker_on_cpu+0x50/0x50 May 03 08:13:52 aegean kernel: ret_from_fork+0x3a/0x50 May 03 08:13:52 aegean kernel: Code: f0 49 39 c4 75 db e9 7a ff ff ff 48 3d 90 2a 69 c0 0f 85 6e ff ff ff 48 8b 87 f0 02 00 00 4c 8d a7 f0 02 00 00 48 8d 58 f0 eb d6 <0f> 0b eb c2 0f 1f 00 0f 1f 44 00 00 41 57 41 56 49 89 ce 41 55 May 03 08:13:52 aegean kernel: ---[ end trace 0f74df35408381b1 ]--- and the logs are then filled with May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: TRAP ch 15 [007e922000 gnome-shell[5215]] May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: SHADER a204020e, sph: 0x04020e, stage: 0x22 May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: TRAP ch 15 [007e922000 gnome-shell[5215]] May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: SHADER a204020e, sph: 0x04020e, stage: 0x22 May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: TRAP ch 15 [007e922000 gnome-shell[5215]] May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: SHADER a2040a0e, sph: 0x040a0e, stage: 0x22 May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: TRAP ch 15 [007e922000 gnome-shell[5215]] May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: SHADER a2040a0e, sph: 0x040a0e, stage: 0x22 May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: TRAP ch 15 [007e922000 gnome-shell[5215]] May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: SHADER a2040a0e, sph: 0x040a0e, stage: 0x22 May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: TRAP ch 15 [007e922000 gnome-shell[5215]] May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: SHADER a204020e, sph: 0x04020e, stage: 0x22 May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: TRAP ch 15 [007e922000 gnome-shell[5215]] May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: SHADER a2040a0e, sph: 0x040a0e, stage: 0x22 May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: TRAP ch 15 [007e922000 gnome-shell[5215]] May 03 08:13:52 aegean kernel: nouveau 0000:03:00.0: gr: SHADER a204020e, sph: 0x04020e, stage: 0x22 -- GitLab Migration Automatic Message -- This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity. You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/xorg/driver/xf86-video-nouveau/issues/380. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.