Bug 107120 - [CI] drv_selftests fail with: list_del corruption. next->prev should be ...
Summary: [CI] drv_selftests fail with: list_del corruption. next->prev should be ...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: Other All
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-07-05 09:40 UTC by Tomi Sarvela
Modified: 2018-11-19 19:47 UTC (History)
1 user (show)

See Also:
i915 platform: BXT, HSW, KBL, SNB
i915 features:


Attachments

Description Tomi Sarvela 2018-07-05 09:40:06 UTC
DRM-Tip build CI_DRM_4427 was ok:
aae8cb6bbd49dbadce13baff5077a7b025efd362
2018y-07m-04d-20h-04m-08s UTC integration manifest

DRM-Tip build CI_DRM_4428 was not ok:
7bc1be8128e30b4d581b913feff5c78909c00945
2018y-07m-04d-20h-25m-35s UTC integration manifest

Example trace:
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4429/shard-snb7/igt@drv_selftest@live_gtt.html

[   85.779906] Setting dangerous option live_selftests - tainting kernel
[   87.223007] drunk_hole timed out after 60722/130915
[   88.224006] walk_hole timed out at 4e87b000
[   90.043314] fill_hole timed out (npages=1, prime=379)
[   90.098792] ------------[ cut here ]------------
[   90.098803] list_del corruption. next->prev should be ffff88020bbf43a8, but was 6b6b6b6b6b6b6b6b
[   90.098825] WARNING: CPU: 4 PID: 5736 at lib/list_debug.c:56 __list_del_entry_valid+0x8a/0x90
[   90.098834] Modules linked in: i915(+) snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_codec snd_hwdep snd_hda_core snd_pcm broadcom bcm_phy_lib tg3 mei_me mei lpc_ich prime_numbers [last unloaded: i915]
[   90.098881] CPU: 4 PID: 5736 Comm: drv_selftest Tainted: G     U            4.18.0-rc3-CI-CI_DRM_4429+ #1
[   90.098891] Hardware name: Dell Inc. XPS 8300  /0Y2MRG, BIOS A06 10/17/2011
[   90.098900] RIP: 0010:__list_del_entry_valid+0x8a/0x90
[   90.098906] Code: be ff 0f 0b 31 c0 c3 48 89 f2 48 89 fe 48 c7 c7 d8 6e 0d 82 e8 87 b1 be ff 0f 0b 31 c0 c3 48 c7 c7 18 6f 0d 82 e8 76 b1 be ff <0f> 0b 31 c0 c3 90 53 48 89 fb 48 c7 c7 60 ec 28 82 e8 80 45 4a 00 
[   90.098970] RSP: 0018:ffffc90000323aa8 EFLAGS: 00010282
[   90.098977] RAX: 0000000000000000 RBX: ffff88020bbf42a8 RCX: 0000000000000001
[   90.098985] RDX: 0000000080000001 RSI: ffffffff820c638c RDI: 00000000ffffffff
[   90.098993] RBP: ffff88020bbf4430 R08: 0000000000000000 R09: 0000000000000000
[   90.099000] R10: ffffc90000323820 R11: ffffffff82244838 R12: 0000000000000000
[   90.099008] R13: ffff88020bbf42a8 R14: ffff88020bbf4600 R15: ffff88020bbf41c0
[   90.099016] FS:  00007fba67c3b980(0000) GS:ffff88022fb00000(0000) knlGS:0000000000000000
[   90.099025] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   90.099032] CR2: 0000000000a75500 CR3: 000000022250c004 CR4: 00000000000606e0
[   90.099040] Call Trace:
[   90.099105]  i915_address_space_fini+0x39/0xd0 [i915]
[   90.099151]  i915_ppgtt_release+0x12d/0x340 [i915]
[   90.099196]  i915_gem_fini_aliasing_ppgtt+0x4e/0x60 [i915]
[   90.099238]  i915_ggtt_cleanup_hw+0x28/0x1a0 [i915]
[   90.099281]  i915_driver_unload+0xea/0x110 [i915]
[   90.099322]  i915_pci_probe+0x52/0x90 [i915]
[   90.099330]  pci_device_probe+0xa1/0x130
[   90.099337]  driver_probe_device+0x306/0x480
[   90.099344]  __driver_attach+0xdb/0x100
[   90.099350]  ? driver_probe_device+0x480/0x480
[   90.099356]  ? driver_probe_device+0x480/0x480
[   90.099363]  bus_for_each_dev+0x74/0xc0
[   90.099370]  bus_add_driver+0x15f/0x250
[   90.099376]  ? 0xffffffffa06b2000
[   90.099382]  driver_register+0x56/0xe0
[   90.099387]  ? 0xffffffffa06b2000
[   90.099393]  do_one_initcall+0x58/0x370
[   90.099401]  ? kmem_cache_alloc_trace+0x209/0x2e0
[   90.099410]  do_init_module+0x56/0x1ea
[   90.099416]  load_module+0x2435/0x2b20
[   90.099428]  ? __se_sys_finit_module+0xd3/0xf0
[   90.099435]  __se_sys_finit_module+0xd3/0xf0
[   90.099444]  do_syscall_64+0x55/0x190
[   90.099452]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
[   90.099458] RIP: 0033:0x7fba67507839
[   90.099463] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48 
[   90.099527] RSP: 002b:00007ffd83732428 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[   90.099537] RAX: ffffffffffffffda RBX: 0000557c60156e30 RCX: 00007fba67507839
[   90.099545] RDX: 0000000000000000 RSI: 0000557c60157c20 RDI: 0000000000000004
[   90.099552] RBP: 0000557c60157c20 R08: 0000000000000004 R09: 0000000000000000
[   90.099560] R10: 00007ffd83732590 R11: 0000000000000246 R12: 0000000000000000
[   90.099568] R13: 0000557c6014dc00 R14: 0000000000000000 R15: 0000000000000037
[   90.099579] irq event stamp: 0
[   90.099583] hardirqs last  enabled at (0): [<0000000000000000>]           (null)
[   90.099593] hardirqs last disabled at (0): [<ffffffff81082192>] copy_process.part.7+0x322/0x1db0
[   90.099603] softirqs last  enabled at (0): [<ffffffff81082192>] copy_process.part.7+0x322/0x1db0
[   90.099612] softirqs last disabled at (0): [<0000000000000000>]           (null)
[   90.099622] WARNING: CPU: 4 PID: 5736 at lib/list_debug.c:56 __list_del_entry_valid+0x8a/0x90
[   90.099631] ---[ end trace 3cd3dca51543650f ]---
[   90.099659] ------------[ cut here ]------------
[   90.099669] list_del corruption. next->prev should be ffff88021db58ba0, but was 6b6b6b6b6b6b6b6b
[   90.099691] WARNING: CPU: 4 PID: 5736 at lib/list_debug.c:56 __list_del_entry_valid+0x8a/0x90
[   90.099708] Modules linked in: i915(+) snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_codec snd_hwdep snd_hda_core snd_pcm broadcom bcm_phy_lib tg3 mei_me mei lpc_ich prime_numbers [last unloaded: i915]
[   90.099804] CPU: 4 PID: 5736 Comm: drv_selftest Tainted: G     U  W         4.18.0-rc3-CI-CI_DRM_4429+ #1
[   90.099821] Hardware name: Dell Inc. XPS 8300  /0Y2MRG, BIOS A06 10/17/2011
[   90.099836] RIP: 0010:__list_del_entry_valid+0x8a/0x90
[   90.099846] Code: be ff 0f 0b 31 c0 c3 48 89 f2 48 89 fe 48 c7 c7 d8 6e 0d 82 e8 87 b1 be ff 0f 0b 31 c0 c3 48 c7 c7 18 6f 0d 82 e8 76 b1 be ff <0f> 0b 31 c0 c3 90 53 48 89 fb 48 c7 c7 60 ec 28 82 e8 80 45 4a 00 
[   90.099950] RSP: 0018:ffffc90000323b30 EFLAGS: 00010282
[   90.099961] RAX: 0000000000000000 RBX: ffff88021db58aa0 RCX: 0000000000000001
[   90.099975] RDX: 0000000080000001 RSI: 0000000000000001 RDI: 00000000ffffffff
[   90.099987] RBP: ffff88021db58c28 R08: 0000000000000000 R09: 0000000000000000
[   90.100000] R10: ffffc90000323ab8 R11: ffffffff82244838 R12: ffff88021db58c08
[   90.100013] R13: ffff88021db50068 R14: ffffffffa03627e0 R15: ffffc90000323ea0
[   90.100026] FS:  00007fba67c3b980(0000) GS:ffff88022fb00000(0000) knlGS:0000000000000000
[   90.100042] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   90.100054] CR2: 0000000000a75500 CR3: 000000022250c004 CR4: 00000000000606e0
[   90.100067] Call Trace:
[   90.100127]  i915_address_space_fini+0x39/0xd0 [i915]
[   90.100191]  i915_ggtt_cleanup_hw+0xc2/0x1a0 [i915]
[   90.100252]  i915_driver_unload+0xea/0x110 [i915]
[   90.100313]  i915_pci_probe+0x52/0x90 [i915]
[   90.100325]  pci_device_probe+0xa1/0x130
[   90.100335]  driver_probe_device+0x306/0x480
[   90.100347]  __driver_attach+0xdb/0x100
[   90.100357]  ? driver_probe_device+0x480/0x480
[   90.100368]  ? driver_probe_device+0x480/0x480
[   90.100379]  bus_for_each_dev+0x74/0xc0
[   90.100390]  bus_add_driver+0x15f/0x250
[   90.100400]  ? 0xffffffffa06b2000
[   90.100409]  driver_register+0x56/0xe0
[   90.100418]  ? 0xffffffffa06b2000
[   90.100427]  do_one_initcall+0x58/0x370
[   90.100438]  ? kmem_cache_alloc_trace+0x209/0x2e0
[   90.100452]  do_init_module+0x56/0x1ea
[   90.100463]  load_module+0x2435/0x2b20
[   90.100482]  ? __se_sys_finit_module+0xd3/0xf0
[   90.100493]  __se_sys_finit_module+0xd3/0xf0
[   90.100509]  do_syscall_64+0x55/0x190
[   90.100519]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
[   90.100531] RIP: 0033:0x7fba67507839
[   90.100538] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48 
[   90.100643] RSP: 002b:00007ffd83732428 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[   90.100660] RAX: ffffffffffffffda RBX: 0000557c60156e30 RCX: 00007fba67507839
[   90.100673] RDX: 0000000000000000 RSI: 0000557c60157c20 RDI: 0000000000000004
[   90.100686] RBP: 0000557c60157c20 R08: 0000000000000004 R09: 0000000000000000
[   90.100698] R10: 00007ffd83732590 R11: 0000000000000246 R12: 0000000000000000
[   90.100711] R13: 0000557c6014dc00 R14: 0000000000000000 R15: 0000000000000037
[   90.100729] irq event stamp: 0
[   90.100737] hardirqs last  enabled at (0): [<0000000000000000>]           (null)
[   90.100752] hardirqs last disabled at (0): [<ffffffff81082192>] copy_process.part.7+0x322/0x1db0
[   90.100769] softirqs last  enabled at (0): [<ffffffff81082192>] copy_process.part.7+0x322/0x1db0
[   90.100785] softirqs last disabled at (0): [<0000000000000000>]           (null)
[   90.100801] WARNING: CPU: 4 PID: 5736 at lib/list_debug.c:56 __list_del_entry_valid+0x8a/0x90
[   90.100816] ---[ end trace 3cd3dca515436510 ]---
[   90.128205] i915: probe of 0000:00:02.0 failed with error -25
Comment 1 Chris Wilson 2018-07-05 09:41:23 UTC
See https://patchwork.freedesktop.org/series/45958/
Comment 2 Chris Wilson 2018-07-05 11:10:06 UTC
commit eae4c9445354602926ff7cc4702c2516e2485a94
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Thu Jul 5 07:56:52 2018 +0100

    drm/i915/selftests: Use full release for local ppgtt allocation
    
    We can now use the full release mechanism (i915_ppgtt_put) for our local
    ppgtt allocation in igt_ppgtt_alloc.
    
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20180705065653.20449-2-chris@chris-wilson.co.uk
Comment 3 Martin Peres 2018-07-10 11:09:48 UTC
Thanks a lot, this is indeed fixed!


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.