Bug 107979 - [CI][SHARDS] igt@drv_selftest@live_contexts - dmesg-fail - Failed to fill dword 0 [0/512] with gpu (rcs0) in ctx 0 > [full-ppgtt? no], err=-28
Status: NEW
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: DRI git
Hardware: Other All
Importance: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-09-18 14:00 UTC by Martin Peres
Modified: 2018-11-30 13:22 UTC

See Also:
i915 platform: GLK
i915 features: GEM/Other


Description Martin Peres 2018-09-18 14:00:27 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/IGT_4646/shard-glk6/igt@drv_selftest@live_contexts.html

<7> [169.042421] [drm:intel_power_well_enable [i915]] enabling AUX C
<4> [169.080730] ------------[ cut here ]------------
<4> [169.080734] WARN_ON(dev_priv->mm.object_count)
<4> [169.080861] WARNING: CPU: 0 PID: 5381 at drivers/gpu/drm/i915/i915_gem.c:5826 i915_gem_cleanup_early+0x162/0x190 [i915]
<4> [169.080863] Modules linked in: i915(+) snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal coretemp crct10dif_pclmul crc32_pclmul btusb btrtl btbcm btintel ghash_clmulni_intel bluetooth snd_hda_codec ecdh_generic snd_hwdep snd_hda_core snd_pcm r8169 mei_me mei prime_numbers i2c_hid pinctrl_geminilake pinctrl_intel [last unloaded: i915]
<4> [169.080934] CPU: 0 PID: 5381 Comm: drv_selftest Tainted: G     U            4.19.0-rc4-CI-CI_DRM_4836+ #1
<4> [169.080937] Hardware name: Intel Corporation NUC7CJYH/NUC7JYB, BIOS JYGLKCPX.86A.0027.2018.0125.1347 01/25/2018
<4> [169.080999] RIP: 0010:i915_gem_cleanup_early+0x162/0x190 [i915]
<4> [169.081002] Code: 00 00 48 c7 c2 30 53 3e a0 48 c7 c7 48 b8 2e a0 e8 f3 75 e7 e0 0f 0b 48 c7 c6 98 7d 41 a0 48 c7 c7 7a f0 3f a0 e8 7e 91 d9 e0 <0f> 0b e9 ea fe ff ff 48 c7 c6 c0 7d 41 a0 48 c7 c7 7a f0 3f a0 e8
<4> [169.081004] RSP: 0018:ffffc90023b4fb18 EFLAGS: 00010286
<4> [169.081009] RAX: 0000000000000000 RBX: ffff88026ae40000 RCX: 0000000000000001
<4> [169.081011] RDX: 0000000080000001 RSI: ffffffff820c2be6 RDI: 00000000ffffffff
<4> [169.081013] RBP: ffff88026ae477d8 R08: 00000000581395cb R09: 0000000000000000
<4> [169.081015] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffa03da620
<4> [169.081018] R13: ffffffffa04a38d0 R14: ffffffffa04a3860 R15: ffffc90023b4fe98
<4> [169.081020] FS:  00007f634e1f5980(0000) GS:ffff880277e00000(0000) knlGS:0000000000000000
<4> [169.081023] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [169.081025] CR2: 000055dc6d2e5fb0 CR3: 0000000271e16000 CR4: 0000000000340ef0
<4> [169.081027] Call Trace:
<4> [169.081086]  i915_driver_cleanup_early+0x30/0x70 [i915]
<4> [169.081142]  i915_driver_release+0xa/0x30 [i915]
<4> [169.081199]  i915_pci_remove+0x21/0x30 [i915]
<4> [169.081255]  i915_pci_probe+0x60/0xa0 [i915]
<4> [169.081265]  pci_device_probe+0xa1/0x130
<4> [169.081273]  really_probe+0x25d/0x3c0
<4> [169.081278]  driver_probe_device+0x10a/0x120
<4> [169.081282]  __driver_attach+0xdb/0x100
<4> [169.081287]  ? driver_probe_device+0x120/0x120
<4> [169.081290]  bus_for_each_dev+0x74/0xc0
<4> [169.081296]  bus_add_driver+0x15f/0x250
<4> [169.081300]  ? 0xffffffffa07bb000
<4> [169.081304]  driver_register+0x56/0xe0
<4> [169.081307]  ? 0xffffffffa07bb000
<4> [169.081311]  do_one_initcall+0x58/0x2e0
<4> [169.081316]  ? rcu_lockdep_current_cpu_online+0x8f/0xd0
<4> [169.081320]  ? do_init_module+0x1d/0x1ea
<4> [169.081324]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [169.081329]  ? kmem_cache_alloc_trace+0x264/0x290
<4> [169.081335]  do_init_module+0x56/0x1ea
<4> [169.081341]  load_module+0x26ba/0x29a0
<4> [169.081351]  ? vfs_read+0x122/0x140
<4> [169.081364]  ? __se_sys_finit_module+0xd3/0xf0
<4> [169.081367]  __se_sys_finit_module+0xd3/0xf0
<4> [169.081380]  do_syscall_64+0x55/0x190
<4> [169.081386]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [169.081389] RIP: 0033:0x7f634dac0839
<4> [169.081392] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [169.081395] RSP: 002b:00007ffe64b4f528 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [169.081399] RAX: ffffffffffffffda RBX: 00005608b13d8f90 RCX: 00007f634dac0839
<4> [169.081401] RDX: 0000000000000000 RSI: 00005608b13d9c80 RDI: 0000000000000006
<4> [169.081403] RBP: 00005608b13d9c80 R08: 65735f6576696c20 R09: 0000000000000000
<4> [169.081406] R10: 313d73747865746e R11: 0000000000000246 R12: 0000000000000000
<4> [169.081408] R13: 00005608b13d2920 R14: 0000000000000020 R15: 000000000000003c
<4> [169.081419] irq event stamp: 304080
<4> [169.081425] hardirqs last  enabled at (304079): [<ffffffff810f81ca>] console_unlock+0x3fa/0x5f0
<4> [169.081427] hardirqs last disabled at (304080): [<ffffffff81001910>] trace_hardirqs_off_thunk+0x1a/0x1c
<4> [169.081432] softirqs last  enabled at (303496): [<ffffffff81c0031d>] __do_softirq+0x31d/0x483
<4> [169.081436] softirqs last disabled at (303483): [<ffffffff8108c4c9>] irq_exit+0xa9/0xc0
<4> [169.081497] WARNING: CPU: 0 PID: 5381 at drivers/gpu/drm/i915/i915_gem.c:5826 i915_gem_cleanup_early+0x162/0x190 [i915]
<4> [169.081499] ---[ end trace be855b764a536956 ]---
<4> [169.083817] 
<4> [169.083824] ============================================
<4> [169.083827] WARNING: possible recursive locking detected
<4> [169.083832] 4.19.0-rc4-CI-CI_DRM_4836+ #1 Tainted: G     U  W        
<4> [169.083835] --------------------------------------------
<4> [169.083839] drv_selftest/5381 is trying to acquire lock:
<4> [169.083843] 000000004c4b7679 (&(&n->list_lock)->rlock){-.-.}, at: get_partial_node.isra.29+0x56/0x460
<4> [169.083853] \x0abut task is already holding lock:
<4> [169.083857] 000000004af7068d (&(&n->list_lock)->rlock){-.-.}, at: __kmem_cache_shutdown+0xa4/0x400
<4> [169.083867] \x0aother info that might help us debug this:
<4> [169.083871]  Possible unsafe locking scenario:\x0a
<4> [169.083875]        CPU0
<4> [169.083877]        ----
<4> [169.083880]   lock(&(&n->list_lock)->rlock);
<4> [169.083885]   lock(&(&n->list_lock)->rlock);
<4> [169.083889] \x0a *** DEADLOCK ***\x0a
<4> [169.083894]  May be due to missing lock nesting notation\x0a
<4> [169.083898] 4 locks held by drv_selftest/5381:
<4> [169.083901]  #0: 0000000091026b7b (&dev->mutex){....}, at: __driver_attach+0x55/0x100
<4> [169.083910]  #1: 0000000085b214d5 (cpu_hotplug_lock.rw_sem){++++}, at: kmem_cache_destroy+0x5d/0x2a0
<4> [169.083920]  #2: 0000000077b1e6f0 (slab_mutex){+.+.}, at: kmem_cache_destroy+0x6b/0x2a0
<4> [169.083928]  #3: 000000004af7068d (&(&n->list_lock)->rlock){-.-.}, at: __kmem_cache_shutdown+0xa4/0x400
<4> [169.083939] \x0astack backtrace:
<4> [169.083944] CPU: 0 PID: 5381 Comm: drv_selftest Tainted: G     U  W         4.19.0-rc4-CI-CI_DRM_4836+ #1
<4> [169.083949] Hardware name: Intel Corporation NUC7CJYH/NUC7JYB, BIOS JYGLKCPX.86A.0027.2018.0125.1347 01/25/2018
<4> [169.083955] Call Trace:
<4> [169.083961]  dump_stack+0x67/0x9b
<4> [169.083967]  __lock_acquire+0xc67/0x1b50
<4> [169.084039]  ? i915_pci_probe+0x60/0xa0 [i915]
<4> [169.084045]  ? rcu_lockdep_current_cpu_online+0x8f/0xd0
<4> [169.084101]  ? i915_pci_probe+0x5f/0xa0 [i915]
<4> [169.084158]  ? i915_pci_probe+0x5f/0xa0 [i915]
<4> [169.084164]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [169.084171]  ? module_assert_mutex_or_preempt+0xf/0x30
<4> [169.084176]  ? lock_acquire+0xa6/0x1c0
<4> [169.084179]  lock_acquire+0xa6/0x1c0
<4> [169.084184]  ? get_partial_node.isra.29+0x56/0x460
<4> [169.084191]  _raw_spin_lock+0x2a/0x40
<4> [169.084195]  ? get_partial_node.isra.29+0x56/0x460
<4> [169.084200]  get_partial_node.isra.29+0x56/0x460
<4> [169.084205]  ? __lock_acquire+0x3c8/0x1b50
<4> [169.084210]  ? ___slab_alloc.constprop.34+0x1af/0x390
<4> [169.084215]  ___slab_alloc.constprop.34+0x1af/0x390
<4> [169.084220]  ? __kmem_cache_shutdown+0x1b0/0x400
<4> [169.084225]  ? __kmem_cache_shutdown+0x1b0/0x400
<4> [169.084229]  ? __kmem_cache_shutdown+0x1b0/0x400
<4> [169.084233]  ? __slab_alloc.isra.27.constprop.33+0x3d/0x70
<4> [169.084237]  __slab_alloc.isra.27.constprop.33+0x3d/0x70
<4> [169.084243]  __kmalloc+0x29f/0x2e0
<4> [169.084247]  __kmem_cache_shutdown+0x1b0/0x400
<4> [169.084253]  shutdown_cache+0x10/0x1d0
<4> [169.084257]  kmem_cache_destroy+0x281/0x2a0
<4> [169.084318]  i915_gem_cleanup_early+0xa6/0x190 [i915]
<4> [169.084375]  i915_driver_cleanup_early+0x30/0x70 [i915]
<4> [169.084431]  i915_driver_release+0xa/0x30 [i915]
<4> [169.084487]  i915_pci_remove+0x21/0x30 [i915]
<4> [169.084548]  i915_pci_probe+0x60/0xa0 [i915]
<4> [169.084557]  pci_device_probe+0xa1/0x130
<4> [169.084562]  really_probe+0x25d/0x3c0
<4> [169.084566]  driver_probe_device+0x10a/0x120
<4> [169.084570]  __driver_attach+0xdb/0x100
<4> [169.084575]  ? driver_probe_device+0x120/0x120
<4> [169.084579]  bus_for_each_dev+0x74/0xc0
<4> [169.084583]  bus_add_driver+0x15f/0x250
<4> [169.084587]  ? 0xffffffffa07bb000
<4> [169.084590]  driver_register+0x56/0xe0
<4> [169.084594]  ? 0xffffffffa07bb000
<4> [169.084598]  do_one_initcall+0x58/0x2e0
<4> [169.084602]  ? rcu_lockdep_current_cpu_online+0x8f/0xd0
<4> [169.084606]  ? do_init_module+0x1d/0x1ea
<4> [169.084610]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [169.084614]  ? kmem_cache_alloc_trace+0x264/0x290
<4> [169.084618]  do_init_module+0x56/0x1ea
<4> [169.084623]  load_module+0x26ba/0x29a0
<4> [169.084629]  ? vfs_read+0x122/0x140
<4> [169.084635]  ? __se_sys_finit_module+0xd3/0xf0
<4> [169.084639]  __se_sys_finit_module+0xd3/0xf0
<4> [169.084645]  do_syscall_64+0x55/0x190
<4> [169.084650]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [169.084654] RIP: 0033:0x7f634dac0839
<4> [169.084658] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [169.084667] RSP: 002b:00007ffe64b4f528 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [169.084673] RAX: ffffffffffffffda RBX: 00005608b13d8f90 RCX: 00007f634dac0839
<4> [169.084677] RDX: 0000000000000000 RSI: 00005608b13d9c80 RDI: 0000000000000006
<4> [169.084681] RBP: 00005608b13d9c80 R08: 65735f6576696c20 R09: 0000000000000000
<4> [169.084685] R10: 313d73747865746e R11: 0000000000000246 R12: 0000000000000000
<4> [169.084690] R13: 00005608b13d2920 R14: 0000000000000020 R15: 000000000000003c
<3> [169.084701] =============================================================================
<3> [169.084715] BUG i915_vma (Tainted: G     U  W        ): Objects remaining in i915_vma on __kmem_cache_shutdown()
<3> [169.084730] -----------------------------------------------------------------------------\x0a
<3> [169.084745] INFO: Slab 0x000000005ae51cf7 objects=17 used=1 fp=0x00000000dfffdb1d flags=0x8000000000008100
<4> [169.084760] CPU: 0 PID: 5381 Comm: drv_selftest Tainted: G    BU  W         4.19.0-rc4-CI-CI_DRM_4836+ #1
<4> [169.084775] Hardware name: Intel Corporation NUC7CJYH/NUC7JYB, BIOS JYGLKCPX.86A.0027.2018.0125.1347 01/25/2018
<4> [169.084790] Call Trace:
<4> [169.084797]  dump_stack+0x67/0x9b
<4> [169.084806]  slab_err+0xa8/0xd0
<4> [169.084816]  ? __slab_alloc.isra.27.constprop.33+0x62/0x70
<4> [169.084828]  ? __kmalloc+0x1ac/0x2e0
<4> [169.084836]  __kmem_cache_shutdown+0x1ce/0x400
<4> [169.084848]  shutdown_cache+0x10/0x1d0
<4> [169.084856]  kmem_cache_destroy+0x281/0x2a0
<4> [169.084923]  i915_gem_cleanup_early+0xa6/0x190 [i915]
<4> [169.084985]  i915_driver_cleanup_early+0x30/0x70 [i915]
<4> [169.085045]  i915_driver_release+0xa/0x30 [i915]
<4> [169.085106]  i915_pci_remove+0x21/0x30 [i915]
<4> [169.085171]  i915_pci_probe+0x60/0xa0 [i915]
<4> [169.085182]  pci_device_probe+0xa1/0x130
<4> [169.085191]  really_probe+0x25d/0x3c0
<4> [169.085201]  driver_probe_device+0x10a/0x120
<4> [169.085210]  __driver_attach+0xdb/0x100
<4> [169.085218]  ? driver_probe_device+0x120/0x120
<4> [169.085226]  bus_for_each_dev+0x74/0xc0
<4> [169.085235]  bus_add_driver+0x15f/0x250
<4> [169.085243]  ? 0xffffffffa07bb000
<4> [169.085250]  driver_register+0x56/0xe0
<4> [169.085258]  ? 0xffffffffa07bb000
<4> [169.085265]  do_one_initcall+0x58/0x2e0
<4> [169.085272]  ? rcu_lockdep_current_cpu_online+0x8f/0xd0
<4> [169.085282]  ? do_init_module+0x1d/0x1ea
<4> [169.085290]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [169.085299]  ? kmem_cache_alloc_trace+0x264/0x290
<4> [169.085309]  do_init_module+0x56/0x1ea
<4> [169.085317]  load_module+0x26ba/0x29a0
<4> [169.085328]  ? vfs_read+0x122/0x140
<4> [169.085341]  ? __se_sys_finit_module+0xd3/0xf0
<4> [169.085350]  __se_sys_finit_module+0xd3/0xf0
<4> [169.085362]  do_syscall_64+0x55/0x190
<4> [169.085371]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [169.085379] RIP: 0033:0x7f634dac0839
<4> [169.085386] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [169.085412] RSP: 002b:00007ffe64b4f528 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [169.085425] RAX: ffffffffffffffda RBX: 00005608b13d8f90 RCX: 00007f634dac0839
<4> [169.085436] RDX: 0000000000000000 RSI: 00005608b13d9c80 RDI: 0000000000000006
<4> [169.085447] RBP: 00005608b13d9c80 R08: 65735f6576696c20 R09: 0000000000000000
<4> [169.085457] R10: 313d73747865746e R11: 0000000000000246 R12: 0000000000000000
<4> [169.085468] R13: 00005608b13d2920 R14: 0000000000000020 R15: 000000000000003c
<3> [169.085487] INFO: Object 0x00000000f26955e2 @offset=1024
<3> [169.085556] INFO: Allocated in i915_vma_instance+0x11c/0x8a0 [i915] age=63 cpu=0 pid=5381
<3> [169.085574] \x09kmem_cache_alloc+0x21c/0x280
<3> [169.085639] \x09i915_vma_instance+0x11c/0x8a0 [i915]
<3> [169.085702] \x09gpu_fill+0x3ff/0x9e0 [i915]
<3> [169.085763] \x09igt_ctx_exec+0x159/0x380 [i915]
<3> [169.085833] \x09__i915_subtests+0x5e/0xf0 [i915]
<3> [169.085896] \x09i915_gem_context_live_selftests+0xf3/0x180 [i915]
<3> [169.085968] \x09__run_selftests+0x10b/0x190 [i915]
<3> [169.086038] \x09i915_live_selftests+0x2c/0x60 [i915]
<3> [169.086098] \x09i915_pci_probe+0x50/0xa0 [i915]
<3> [169.086107] \x09pci_device_probe+0xa1/0x130
<3> [169.086115] \x09really_probe+0x25d/0x3c0
<3> [169.086122] \x09driver_probe_device+0x10a/0x120
<3> [169.086130] \x09__driver_attach+0xdb/0x100
<3> [169.086137] \x09bus_for_each_dev+0x74/0xc0
<3> [169.086145] \x09bus_add_driver+0x15f/0x250
<3> [169.086152] \x09driver_register+0x56/0xe0
<3> [169.087284] kmem_cache_destroy i915_vma: Slab cache still has objects
<4> [169.087299] CPU: 0 PID: 5381 Comm: drv_selftest Tainted: G    BU  W         4.19.0-rc4-CI-CI_DRM_4836+ #1
<4> [169.087316] Hardware name: Intel Corporation NUC7CJYH/NUC7JYB, BIOS JYGLKCPX.86A.0027.2018.0125.1347 01/25/2018
<4> [169.087332] Call Trace:
<4> [169.087341]  dump_stack+0x67/0x9b
<4> [169.087350]  kmem_cache_destroy+0x20e/0x2a0
<4> [169.087445]  i915_gem_cleanup_early+0xa6/0x190 [i915]
<4> [169.087515]  i915_driver_cleanup_early+0x30/0x70 [i915]
<4> [169.087583]  i915_driver_release+0xa/0x30 [i915]
<4> [169.087652]  i915_pci_remove+0x21/0x30 [i915]
<4> [169.087718]  i915_pci_probe+0x60/0xa0 [i915]
<4> [169.087729]  pci_device_probe+0xa1/0x130
<4> [169.087740]  really_probe+0x25d/0x3c0
<4> [169.087750]  driver_probe_device+0x10a/0x120
<4> [169.087760]  __driver_attach+0xdb/0x100
<4> [169.087770]  ? driver_probe_device+0x120/0x120
<4> [169.087779]  bus_for_each_dev+0x74/0xc0
<4> [169.087790]  bus_add_driver+0x15f/0x250
<4> [169.087798]  ? 0xffffffffa07bb000
<4> [169.087807]  driver_register+0x56/0xe0
<4> [169.087815]  ? 0xffffffffa07bb000
<4> [169.087824]  do_one_initcall+0x58/0x2e0
<4> [169.087833]  ? rcu_lockdep_current_cpu_online+0x8f/0xd0
<4> [169.087844]  ? do_init_module+0x1d/0x1ea
<4> [169.087854]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [169.087864]  ? kmem_cache_alloc_trace+0x264/0x290
<4> [169.087875]  do_init_module+0x56/0x1ea
<4> [169.087886]  load_module+0x26ba/0x29a0
<4> [169.087899]  ? vfs_read+0x122/0x140
<4> [169.087913]  ? __se_sys_finit_module+0xd3/0xf0
<4> [169.087923]  __se_sys_finit_module+0xd3/0xf0
<4> [169.087937]  do_syscall_64+0x55/0x190
<4> [169.087948]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [169.087958] RIP: 0033:0x7f634dac0839
<4> [169.087967] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [169.087995] RSP: 002b:00007ffe64b4f528 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [169.088010] RAX: ffffffffffffffda RBX: 00005608b13d8f90 RCX: 00007f634dac0839
<4> [169.088023] RDX: 0000000000000000 RSI: 00005608b13d9c80 RDI: 0000000000000006
<4> [169.088035] RBP: 00005608b13d9c80 R08: 65735f6576696c20 R09: 0000000000000000
<4> [169.088047] R10: 313d73747865746e R11: 0000000000000246 R12: 0000000000000000
<4> [169.088059] R13: 00005608b13d2920 R14: 0000000000000020 R15: 000000000000003c
<3> [169.088357] =============================================================================
<3> [169.088373] BUG drm_i915_gem_object (Tainted: G    BU  W        ): Objects remaining in drm_i915_gem_object on __kmem_cache_shutdown()
<3> [169.088393] -----------------------------------------------------------------------------\x0a
<3> [169.088411] INFO: Slab 0x0000000060cd326e objects=20 used=1 fp=0x000000007f449105 flags=0x8000000000008100
<4> [169.088428] CPU: 0 PID: 5381 Comm: drv_selftest Tainted: G    BU  W         4.19.0-rc4-CI-CI_DRM_4836+ #1
<4> [169.088444] Hardware name: Intel Corporation NUC7CJYH/NUC7JYB, BIOS JYGLKCPX.86A.0027.2018.0125.1347 01/25/2018
<4> [169.088460] Call Trace:
<4> [169.088469]  dump_stack+0x67/0x9b
<4> [169.088477]  slab_err+0xa8/0xd0
<4> [169.088489]  ? __slab_alloc.isra.27.constprop.33+0x62/0x70
<4> [169.088501]  ? __kmalloc+0x1ac/0x2e0
<4> [169.088511]  __kmem_cache_shutdown+0x1ce/0x400
<4> [169.088522]  ? lock_acquire+0xa6/0x1c0
<4> [169.088533]  shutdown_cache+0x10/0x1d0
<4> [169.088543]  kmem_cache_destroy+0x281/0x2a0
<4> [169.088623]  i915_gem_cleanup_early+0xb2/0x190 [i915]
<4> [169.088692]  i915_driver_cleanup_early+0x30/0x70 [i915]
<4> [169.088760]  i915_driver_release+0xa/0x30 [i915]
<4> [169.088828]  i915_pci_remove+0x21/0x30 [i915]
<4> [169.088895]  i915_pci_probe+0x60/0xa0 [i915]
<4> [169.088906]  pci_device_probe+0xa1/0x130
<4> [169.088916]  really_probe+0x25d/0x3c0
<4> [169.088926]  driver_probe_device+0x10a/0x120
<4> [169.088936]  __driver_attach+0xdb/0x100
<4> [169.088945]  ? driver_probe_device+0x120/0x120
<4> [169.088955]  bus_for_each_dev+0x74/0xc0
<4> [169.088965]  bus_add_driver+0x15f/0x250
<4> [169.088973]  ? 0xffffffffa07bb000
<4> [169.088982]  driver_register+0x56/0xe0
<4> [169.088990]  ? 0xffffffffa07bb000
<4> [169.088998]  do_one_initcall+0x58/0x2e0
<4> [169.089007]  ? rcu_lockdep_current_cpu_online+0x8f/0xd0
<4> [169.089018]  ? do_init_module+0x1d/0x1ea
<4> [169.089027]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [169.089037]  ? kmem_cache_alloc_trace+0x264/0x290
<4> [169.089048]  do_init_module+0x56/0x1ea
<4> [169.089058]  load_module+0x26ba/0x29a0
<4> [169.089070]  ? vfs_read+0x122/0x140
<4> [169.089084]  ? __se_sys_finit_module+0xd3/0xf0
<4> [169.089094]  __se_sys_finit_module+0xd3/0xf0
<4> [169.089108]  do_syscall_64+0x55/0x190
<4> [169.089118]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [169.089128] RIP: 0033:0x7f634dac0839
<4> [169.089136] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [169.089165] RSP: 002b:00007ffe64b4f528 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [169.089179] RAX: ffffffffffffffda RBX: 00005608b13d8f90 RCX: 00007f634dac0839
<4> [169.089192] RDX: 0000000000000000 RSI: 00005608b13d9c80 RDI: 0000000000000006
<4> [169.089204] RBP: 00005608b13d9c80 R08: 65735f6576696c20 R09: 0000000000000000
<4> [169.089216] R10: 313d73747865746e R11: 0000000000000246 R12: 0000000000000000
<4> [169.089228] R13: 00005608b13d2920 R14: 0000000000000020 R15: 000000000000003c
<3> [169.089249] INFO: Object 0x00000000e50b13a9 @offset=14464
<3> [169.089322] INFO: Allocated in i915_gem_object_create_internal+0x20/0x110 [i915] age=67 cpu=0 pid=5381
<3> [169.089339] \x09kmem_cache_alloc+0x21c/0x280
<3> [169.089410] \x09i915_gem_object_create_internal+0x20/0x110 [i915]
<3> [169.089481] \x09gpu_fill+0x126/0x9e0 [i915]
<3> [169.089548] \x09igt_ctx_exec+0x159/0x380 [i915]
<3> [169.089637] \x09__i915_subtests+0x5e/0xf0 [i915]
<3> [169.089699] \x09i915_gem_context_live_selftests+0xf3/0x180 [i915]
<3> [169.089770] \x09__run_selftests+0x10b/0x190 [i915]
<3> [169.089838] \x09i915_live_selftests+0x2c/0x60 [i915]
<3> [169.089899] \x09i915_pci_probe+0x50/0xa0 [i915]
<3> [169.089907] \x09pci_device_probe+0xa1/0x130
<3> [169.089915] \x09really_probe+0x25d/0x3c0
<3> [169.089922] \x09driver_probe_device+0x10a/0x120
<3> [169.089930] \x09__driver_attach+0xdb/0x100
<3> [169.089938] \x09bus_for_each_dev+0x74/0xc0
<3> [169.089945] \x09bus_add_driver+0x15f/0x250
<3> [169.089952] \x09driver_register+0x56/0xe0
<3> [169.090015] INFO: Freed in __i915_gem_free_objects+0x3ff/0x730 [i915] age=95 cpu=0 pid=7
<3> [169.090083] \x09__i915_gem_free_objects+0x3ff/0x730 [i915]
<3> [169.090147] \x09__i915_gem_free_work+0x5a/0x90 [i915]
<3> [169.090157] \x09process_one_work+0x245/0x610
<3> [169.090165] \x09worker_thread+0x37/0x380
<3> [169.090172] \x09kthread+0x119/0x130
<3> [169.090179] \x09ret_from_fork+0x24/0x50
<3> [169.092508] kmem_cache_destroy drm_i915_gem_object: Slab cache still has objects
<4> [169.092526] CPU: 0 PID: 5381 Comm: drv_selftest Tainted: G    BU  W         4.19.0-rc4-CI-CI_DRM_4836+ #1
<4> [169.092540] Hardware name: Intel Corporation NUC7CJYH/NUC7JYB, BIOS JYGLKCPX.86A.0027.2018.0125.1347 01/25/2018
<4> [169.092555] Call Trace:
<4> [169.092564]  dump_stack+0x67/0x9b
<4> [169.092574]  kmem_cache_destroy+0x20e/0x2a0
<4> [169.092670]  i915_gem_cleanup_early+0xb2/0x190 [i915]
<4> [169.092733]  i915_driver_cleanup_early+0x30/0x70 [i915]
<4> [169.092795]  i915_driver_release+0xa/0x30 [i915]
<4> [169.092856]  i915_pci_remove+0x21/0x30 [i915]
<4> [169.092921]  i915_pci_probe+0x60/0xa0 [i915]
<4> [169.092933]  pci_device_probe+0xa1/0x130
<4> [169.092943]  really_probe+0x25d/0x3c0
<4> [169.092953]  driver_probe_device+0x10a/0x120
<4> [169.092962]  __driver_attach+0xdb/0x100
<4> [169.092971]  ? driver_probe_device+0x120/0x120
<4> [169.092979]  bus_for_each_dev+0x74/0xc0
<4> [169.092988]  bus_add_driver+0x15f/0x250
<4> [169.092996]  ? 0xffffffffa07bb000
<4> [169.093004]  driver_register+0x56/0xe0
<4> [169.093012]  ? 0xffffffffa07bb000
<4> [169.093019]  do_one_initcall+0x58/0x2e0
<4> [169.093028]  ? rcu_lockdep_current_cpu_online+0x8f/0xd0
<4> [169.093039]  ? do_init_module+0x1d/0x1ea
<4> [169.093047]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [169.093057]  ? kmem_cache_alloc_trace+0x264/0x290
<4> [169.093067]  do_init_module+0x56/0x1ea
<4> [169.093078]  load_module+0x26ba/0x29a0
<4> [169.093089]  ? vfs_read+0x122/0x140
<4> [169.093103]  ? __se_sys_finit_module+0xd3/0xf0
<4> [169.093112]  __se_sys_finit_module+0xd3/0xf0
<4> [169.093125]  do_syscall_64+0x55/0x190
<4> [169.093134]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [169.093143] RIP: 0033:0x7f634dac0839
<4> [169.093151] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [169.093177] RSP: 002b:00007ffe64b4f528 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [169.093190] RAX: ffffffffffffffda RBX: 00005608b13d8f90 RCX: 00007f634dac0839
<4> [169.093201] RDX: 0000000000000000 RSI: 00005608b13d9c80 RDI: 0000000000000006
<4> [169.093212] RBP: 00005608b13d9c80 R08: 65735f6576696c20 R09: 0000000000000000
<4> [169.093223] R10: 313d73747865746e R11: 0000000000000246 R12: 0000000000000000
<4> [169.093234] R13: 00005608b13d2920 R14: 0000000000000020 R15: 000000000000003c
<4> [169.103425] i915: probe of 0000:00:02.0 failed with error -28
<6> [169.287672] [IGT] drv_selftest: exiting, ret=99
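
When triaging a dump like this, it can help to filter by the printk level prefix (<3> is error, <4> is warning, <6> is info, <7> is debug) and look at the earliest error-level line rather than the first WARN. A minimal sketch, assuming the "<level> [timestamp] message" layout used by this CI log; the helper names are hypothetical and the sample lines are copied from the log above:

```python
import re

# Matches "<level> [timestamp] message", with or without a space after "<level>".
LINE_RE = re.compile(r"^<(\d)>\s*\[\s*(\d+\.\d+)\]\s?(.*)$")


def parse(line):
    """Return (level, timestamp, message), or None if the line has another shape."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    return int(m.group(1)), float(m.group(2)), m.group(3)


def first_at_or_below(lines, max_level=3):
    """Earliest record whose level is max_level or more severe (lower number)."""
    records = [r for r in map(parse, lines) if r is not None]
    severe = [r for r in records if r[0] <= max_level]
    return min(severe, key=lambda r: r[1]) if severe else None


sample = [
    "<7> [169.042421] [drm:intel_power_well_enable [i915]] enabling AUX C",
    "<4> [169.080734] WARN_ON(dev_priv->mm.object_count)",
    "<3> [169.084715] BUG i915_vma (Tainted: G     U  W        ): Objects remaining in i915_vma on __kmem_cache_shutdown()",
    "<3> [169.088373] BUG drm_i915_gem_object (Tainted: G    BU  W        ): Objects remaining in drm_i915_gem_object on __kmem_cache_shutdown()",
]

hit = first_at_or_below(sample)
print(hit)
```

Run against the full dmesg rather than an excerpt, the same filter would also pick up error-level lines logged before the first WARN.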
Comment 1 Chris Wilson 2018-09-18 14:25:31 UTC
The WARN is secondary, the root is
<3>[  169.023281] Failed to fill dword 0 [0/512] with gpu (rcs0) in ctx 0 [full-ppgtt? no], err=-28
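
For reference, err=-28 is -ENOSPC in Linux errno numbering, the same code the probe fails with at the end of the log ("failed with error -28"). A minimal sketch to decode such values, assuming it runs on a host whose errno table matches Linux; decode_kernel_err is a hypothetical helper name:

```python
import errno
import os


def decode_kernel_err(err):
    """Map a negative kernel-style return code to (symbolic name, message)."""
    code = -err if err < 0 else err
    name = errno.errorcode.get(code, "UNKNOWN")
    return name, os.strerror(code)


name, message = decode_kernel_err(-28)
print(name, "-", message)
```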
Comment 2 Chris Wilson 2018-09-20 09:55:09 UTC
This should remove the distracting WARN:

commit 82c7c4fcbf84a0943b92050e08daec85f1d9670f (HEAD -> drm-intel-next-queued, drm-intel/drm-intel-next-queued)
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Wed Sep 19 20:55:13 2018 +0100

    drm/i915/selftests: Free the batch along the contexts error path
    
    Remember to release the batch bo reference if we hit an error trying to
    submit our MI_STORE_DWORD_IMM.
    
    References: https://bugs.freedesktop.org/show_bug.cgi?id=107979
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Reviewed-by: Mika Kuoppala <mika.kuoppala@linux.intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20180919195544.1511-9-chris@chris-wilson.co.uk
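
To illustrate the class of bug the commit message describes, here is a toy model of a reference that is dropped on the success path but not on the error path. This is not the actual patch: RefCounted, fill_leaky and fill_fixed are hypothetical names, and the live counter only loosely stands in for the "used=1" objects reported in the slab dumps above.

```python
class RefCounted:
    """Toy stand-in for a reference-counted buffer object (hypothetical)."""

    live = 0  # objects not yet released, loosely like a slab cache's "used" count

    def __init__(self):
        self.refs = 1
        RefCounted.live += 1

    def put(self):
        """Drop one reference; the object stops counting as live at zero."""
        self.refs -= 1
        if self.refs == 0:
            RefCounted.live -= 1


def fill_leaky(submit):
    batch = RefCounted()
    err = submit(batch)
    if err:
        return err  # bug: the batch reference is never dropped on this path
    batch.put()
    return 0


def fill_fixed(submit):
    batch = RefCounted()
    try:
        return submit(batch)
    finally:
        batch.put()  # released on success and on error alike


def failing_submit(batch):
    return -28  # simulate the submission failing with -ENOSPC


leaky_result = fill_leaky(failing_submit)
leaked_after_leaky = RefCounted.live

RefCounted.live = 0
fixed_result = fill_fixed(failing_submit)
leaked_after_fixed = RefCounted.live
print(leaked_after_leaky, leaked_after_fixed)
```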
Comment 3 Lakshmi 2018-10-12 10:09:44 UTC
This issue occurred only once, 3 weeks and 2 days ago.
Comment 4 Francesco Balestrieri 2018-11-28 08:21:36 UTC
Only seen once in more than a month; can we close it as non-reproducible?
Comment 5 Martin Peres 2018-11-30 13:22:59 UTC
(In reply to Chris Wilson from comment #1)
> The WARN is secondary, the root is
> <3>[  169.023281] Failed to fill dword 0 [0/512] with gpu (rcs0) in ctx 0
> [full-ppgtt? no], err=-28

(In reply to Chris Wilson from comment #2)
> This should remove the distracting WARN:
> 
> commit 82c7c4fcbf84a0943b92050e08daec85f1d9670f (HEAD ->
> drm-intel-next-queued, drm-intel/drm-intel-next-queued)
> Author: Chris Wilson <chris@chris-wilson.co.uk>
> Date:   Wed Sep 19 20:55:13 2018 +0100
> 
>     drm/i915/selftests: Free the batch along the contexts error path
>     
>     Remember to release the batch bo reference if we hit an error trying to
>     submit our MI_STORE_DWORD_IMM.
>     
>     References: https://bugs.freedesktop.org/show_bug.cgi?id=107979
>     Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
>     Reviewed-by: Mika Kuoppala <mika.kuoppala@linux.intel.com>
>     Link: https://patchwork.freedesktop.org/patch/msgid/20180919195544.1511-9-chris@chris-wilson.co.uk

Thanks! I updated the filter in cibuglog.

We'll keep the bug open for another couple of months, waiting for it to be reproduced.

