Bug 108770 - [CI][SHARDS] igt@perf@oa-formats - dmesg-warn - BUG radix_tree_node (Tainted: G U ): Redzone overwritten
Summary: [CI][SHARDS] igt@perf@oa-formats - dmesg-warn - BUG radix_tree_node (Tainted:...
Status: REOPENED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: Other All
: high normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-11-16 16:01 UTC by Martin Peres
Modified: 2019-01-19 16:03 UTC (History)
1 user (show)

See Also:
i915 platform: HSW
i915 features: Perf/OA


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Martin Peres 2018-11-16 16:01:09 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/IGT_4717/shard-hsw1/igt@perf@oa-formats.html

<7> [85.709152] [drm:i915_perf_open_ioctl [i915]] OA Buffer initialized, gtt offset = 0x2000000, vaddr = 00000000eeaa2c80, size = 16777216
<5> [85.734947] [drm] Skipping spurious, invalid OA report
<5> [85.734951] [drm] Skipping spurious, invalid OA report
<5> [85.734953] [drm] Skipping spurious, invalid OA report
<5> [85.734956] [drm] Skipping spurious, invalid OA report
<5> [85.734958] [drm] Skipping spurious, invalid OA report
<5> [85.734961] [drm] Skipping spurious, invalid OA report
<5> [85.734963] [drm] Skipping spurious, invalid OA report
<5> [85.734966] [drm] Skipping spurious, invalid OA report
<5> [85.734968] [drm] Skipping spurious, invalid OA report
<5> [85.734971] [drm] Skipping spurious, invalid OA report
<5> [85.779969] [drm] 218754 spurious OA report notices suppressed due to ratelimiting
<3> [85.780219] =============================================================================
<3> [85.780296] BUG radix_tree_node (Tainted: G     U           ): Redzone overwritten
<3> [85.780309] -----------------------------------------------------------------------------\x0a
<4> [85.780327] Disabling lock debugging due to kernel taint
<3> [85.780328] INFO: 0x00000000b25c168a-0x0000000094600793. First byte 0x0 instead of 0xbb
<3> [85.780333] INFO: Slab 0x00000000578ecb81 objects=35 used=35 fp=0x          (null) flags=0x8000000000010200
<3> [85.780339] INFO: Object 0x00000000cbb9ce62 @offset=23720 fp=0x0000000089925685\x0a
<3> [85.780346] Redzone 00000000b25c168a: 00 00 00 00 90 13 01 00                          ........
<3> [85.780351] Object 00000000cbb9ce62: 1d 50 71 00 00 00 00 00 00 00 00 00 00 00 00 00  .Pq.............
<3> [85.780356] Object 0000000080babb74: 18 00 00 00 f2 13 01 00 00 00 00 00 00 00 00 00  ................
<3> [85.780362] Object 00000000eca98709: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780368] Object 000000000948b4a3: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780373] Object 000000002d65e963: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780379] Object 0000000013715f91: 00 00 00 00 00 00 00 00 00 04 05 00 00 70 0d 4f  .............p.O
<3> [85.780385] Object 0000000037a9287d: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780390] Object 00000000ad634fab: 00 00 00 00 00 00 00 00 00 00 00 00 50 22 01 00  ............P"..
<3> [85.780396] Object 000000009c9e32c5: 1d 50 73 00 00 00 00 00 00 00 00 00 00 00 00 00  .Ps.............
<3> [85.780401] Object 00000000c68c35ac: 18 00 00 00 b2 22 01 00 00 00 00 00 00 00 00 00  ....."..........
<3> [85.780407] Object 000000000ae7cd5c: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780413] Object 00000000ade7853d: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780418] Object 00000000c2746754: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780424] Object 00000000ff41d3fe: 00 00 00 00 00 00 00 00 00 04 05 00 00 90 0d 4f  ...............O
<3> [85.780430] Object 0000000048a9b412: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780435] Object 00000000ad96283f: 00 00 00 00 00 00 00 00 00 00 00 00 a1 34 01 00  .............4..
<3> [85.780441] Object 00000000510b6a2c: 1d 50 75 00 00 00 00 00 00 00 00 00 00 00 00 00  .Pu.............
<3> [85.780447] Object 0000000048f60cfd: 18 00 00 00 03 35 01 00 00 00 00 00 00 00 00 00  .....5..........
<3> [85.780452] Object 00000000f3b857aa: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780458] Object 000000006bedd180: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780464] Object 0000000020ad1c0c: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780469] Object 000000004a5170a0: 00 00 00 00 00 00 00 00 00 04 05 00 00 b0 0d 4f  ...............O
<3> [85.780475] Object 00000000cf77617a: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780480] Object 0000000024bd8113: 00 00 00 00 00 00 00 00 00 00 00 00 22 4c 01 00  ............"L..
<3> [85.780486] Object 000000005809578d: 1d 50 77 00 00 00 00 00 00 00 00 00 00 00 00 00  .Pw.............
<3> [85.780492] Object 00000000101a7c35: 18 00 00 00 84 4c 01 00 00 00 00 00 00 00 00 00  .....L..........
<3> [85.780497] Object 00000000105b7e9d: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780503] Object 00000000bf617479: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780509] Object 0000000058378890: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780514] Object 0000000046179e28: 00 00 00 00 00 00 00 00 00 04 05 00 00 d0 0d 4f  ...............O
<3> [85.780520] Object 00000000b1deea66: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780526] Object 0000000060bdb138: 00 00 00 00 00 00 00 00 00 00 00 00 42 65 01 00  ............Be..
<3> [85.780531] Object 00000000ac63510e: 1d 50 79 00 00 00 00 00 00 00 00 00 00 00 00 00  .Py.............
<3> [85.780537] Object 0000000054046933: 18 00 00 00 a4 65 01 00 00 00 00 00 00 00 00 00  .....e..........
<3> [85.780542] Object 00000000426692c8: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780548] Object 00000000bacb6880: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
<3> [85.780554] Redzone 00000000b00402e1: 00 00 00 00 00 00 00 00                          ........
<3> [85.780559] Padding 000000004c8d8d12: 5a 5a 5a 5a 5a 5a 5a 5a                          ZZZZZZZZ
<4> [85.780566] CPU: 1 PID: 2229 Comm: perf Tainted: G    BU            4.20.0-rc2-CI-CI_DRM_5150+ #1
<4> [85.780566] Hardware name: MSI MS-7924/Z97M-G43(MS-7924), BIOS V1.12 02/15/2016
<4> [85.780567] Call Trace:
<4> [85.780570]  dump_stack+0x67/0x9b
<4> [85.780574]  check_bytes_and_report+0xbd/0x100
<4> [85.780576]  check_object+0x184/0x280
<4> [85.780578]  ? xas_alloc+0x7e/0xa0
<4> [85.780579]  alloc_debug_processing+0x183/0x190
<4> [85.780581]  ___slab_alloc.constprop.34+0x355/0x380
<4> [85.780582]  ? xas_alloc+0x7e/0xa0
<4> [85.780585]  ? __lock_acquire+0x3c7/0x1b00
<4> [85.780586]  ? xas_alloc+0x7e/0xa0
<4> [85.780588]  ? __slab_alloc.isra.27.constprop.33+0x3d/0x70
<4> [85.780589]  __slab_alloc.isra.27.constprop.33+0x3d/0x70
<4> [85.780590]  ? xas_alloc+0x7e/0xa0
<4> [85.780591]  kmem_cache_alloc+0x21c/0x280
<4> [85.780593]  xas_alloc+0x7e/0xa0
<4> [85.780594]  xas_create+0x1cc/0x4a0
<4> [85.780596]  xas_create_range+0x61/0x150
<4> [85.780599]  shmem_add_to_page_cache+0x154/0x320
<4> [85.780601]  shmem_getpage_gfp.isra.8+0x519/0xd90
<4> [85.780604]  shmem_read_mapping_page_gfp+0x3e/0x70
<4> [85.780635]  i915_gem_object_get_pages_gtt+0x212/0x6d0 [i915]
<4> [85.780638]  ? ___slab_alloc.constprop.34+0x21c/0x380
<4> [85.780639]  ? ___slab_alloc.constprop.34+0x21c/0x380
<4> [85.780664]  ? i915_vma_instance+0x11c/0x8a0 [i915]
<4> [85.780686]  ? __i915_gem_object_get_pages+0x18/0xb0 [i915]
<4> [85.780688]  ? lock_acquire+0xa6/0x1c0
<4> [85.780711]  ____i915_gem_object_get_pages+0x1d/0xa0 [i915]
<4> [85.780732]  __i915_gem_object_get_pages+0x59/0xb0 [i915]
<4> [85.780754]  __i915_vma_do_pin+0x97f/0xd60 [i915]
<4> [85.780777]  i915_gem_object_ggtt_pin+0x208/0x3f0 [i915]
<4> [85.780801]  i915_perf_open_ioctl+0xd80/0x11d0 [i915]
<4> [85.780825]  ? i915_oa_init_reg_state+0x30/0x30 [i915]
<4> [85.780828]  drm_ioctl_kernel+0x81/0xf0
<4> [85.780830]  drm_ioctl+0x2de/0x390
<4> [85.780853]  ? i915_oa_init_reg_state+0x30/0x30 [i915]
<4> [85.780856]  ? _raw_spin_unlock+0x29/0x40
<4> [85.780857]  ? __handle_mm_fault+0x81b/0xfc0
<4> [85.780860]  do_vfs_ioctl+0xa0/0x6e0
<4> [85.780861]  ? lock_acquire+0xa6/0x1c0
<4> [85.780863]  ? __task_pid_nr_ns+0xb9/0x1f0
<4> [85.780865]  ksys_ioctl+0x35/0x60
<4> [85.780867]  __x64_sys_ioctl+0x11/0x20
<4> [85.780868]  do_syscall_64+0x55/0x190
<4> [85.780870]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [85.780871] RIP: 0033:0x7f38b50925d7
<4> [85.780872] Code: b3 66 90 48 8b 05 b1 48 2d 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 81 48 2d 00 f7 d8 64 89 01 48
<4> [85.780873] RSP: 002b:00007ffc454db1c8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
<4> [85.780874] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f38b50925d7
<4> [85.780875] RDX: 00007ffc454db260 RSI: 0000000040106476 RDI: 0000000000000005
<4> [85.780876] RBP: 00007ffc454db260 R08: 0000000000000000 R09: 0000000000000019
<4> [85.780876] R10: 00000000ffffffe7 R11: 0000000000000246 R12: 0000000040106476
<4> [85.780877] R13: 0000000000000005 R14: 0000000000000000 R15: 0000000000000000
<3> [85.780880] FIX radix_tree_node: Restoring 0x00000000b25c168a-0x0000000094600793=0xbb\x0a
<3> [85.780887] FIX radix_tree_node: Marking all objects used
Comment 1 Lionel Landwerlin 2019-01-19 15:58:44 UTC
I think this was fixed in :

commit e896d29a548d04371ce746f7d02a8488ff93d812
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Fri May 11 14:52:07 2018 +0100

    drm/i915/oa: Check that OA is disabled before unpinning

Unfortunately not tagged for stable...
Comment 2 Chris Wilson 2019-01-19 16:02:31 UTC
This is testing the tree months after that patch landed.
Comment 3 Chris Wilson 2019-01-19 16:03:40 UTC
Hmm, fwiw, it does look like a set of samples, as it looks to include an incrementing counter.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.