Bug 110326 - [CI][BAT] igt@i915_selftest@live_hugepages - dmesg-warn - WARN_ON(!list_empty(&dev_priv->contexts.list))
Summary: [CI][BAT] igt@i915_selftest@live_hugepages - dmesg-warn - WARN_ON(!list_empty...
Status: RESOLVED WORKSFORME
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: DRI git
Hardware: Other All
Importance: high normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2019-04-04 14:58 UTC by Lakshmi
Modified: 2019-06-03 05:25 UTC

See Also:
i915 platform: ICL
i915 features: GEM/Other


Description Lakshmi 2019-04-04 14:58:04 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/IGT_4927/fi-icl-u2/igt@i915_selftest@live_hugepages.html

<6> [464.014592] [IGT] i915_selftest: executing
<6> [464.019508] [IGT] i915_selftest: starting subtest live_hugepages
<5> [464.063710] Setting dangerous option live_selftests - tainting kernel
<7> [464.082363] [drm:intel_pch_type [i915]] Found Ice Lake PCH
<7> [464.082396] [drm:i915_driver_load [i915]] WOPCM size: 1024KiB
<7> [464.082445] [drm:intel_uc_init_early [i915]] enable_guc=0 (submission:no huc:no)
<7> [464.082489] [drm:intel_uc_init_early [i915]] guc_log_level=0 (enabled:no, verbose:no, verbosity:0)
<7> [464.082523] [drm:intel_power_domains_init [i915]] Allowed DC state mask 0b
<7> [464.082858] [drm:intel_device_info_init_mmio [i915]] vcs2 fused off
<7> [464.082889] [drm:intel_device_info_init_mmio [i915]] vdbox enable: 0001, instances: 0001
<7> [464.082919] [drm:intel_device_info_init_mmio [i915]] vebox enable: 0001, instances: 0001
<6> [464.083058] [drm] Display disabled (module parameter)
<7> [464.083170] [drm:i915_ggtt_probe_hw [i915]] GGTT size = 4096M
<7> [464.083204] [drm:i915_ggtt_probe_hw [i915]] GMADR size = 256M
<7> [464.083259] [drm:i915_ggtt_probe_hw [i915]] DSM size = 60M
<6> [464.083274] i915 0000:00:02.0: vgaarb: deactivate vga console
<7> [464.083493] [drm:i915_gem_init_stolen [i915]] GEN6_STOLEN_RESERVED = 0x000000004fa000c7
<7> [464.083528] [drm:i915_gem_init_stolen [i915]] Memory reserved for graphics device: 61440K, usable: 59392K
<7> [464.083632] [drm:i915_driver_load [i915]] Initialized 7 GT workarounds
<7> [464.083837] [drm:intel_gvt_init [i915]] GVT-g is disabled by kernel params
<7> [464.083883] [drm:intel_opregion_setup [i915]] graphic opregion physical addr: 0x44f0e018
<7> [464.083953] [drm:intel_opregion_setup [i915]] ACPI OpRegion version 2.1.0
<7> [464.083993] [drm:intel_opregion_setup [i915]] Public ACPI methods supported
<7> [464.084033] [drm:intel_opregion_setup [i915]] SWSCI supported
<7> [464.094253] [drm:intel_opregion_setup [i915]] SWSCI GBDA callbacks 00000cb3, SBCB callbacks 00300583
<7> [464.094308] [drm:intel_opregion_setup [i915]] ASLE supported
<7> [464.094350] [drm:intel_opregion_setup [i915]] ASLE extension supported
<7> [464.094427] [drm:intel_opregion_setup [i915]] Found valid VBT in ACPI OpRegion (RVDA)
<7> [464.094461] [drm:i915_driver_load [i915]] DRAM type: DDR4
<7> [464.094492] [drm:skl_dram_get_dimm_info [i915]] CH0 DIMM L size: 8 GB, width: X8, ranks: 1, 16Gb DIMMs: no
<7> [464.094520] [drm:skl_dram_get_dimm_info [i915]] CH0 DIMM S size: 0 GB, width: X0, ranks: 0, 16Gb DIMMs: no
<7> [464.094546] [drm:skl_dram_get_channel_info [i915]] CH0 ranks: 1, 16Gb DIMMs: no
<7> [464.094573] [drm:skl_dram_get_dimm_info [i915]] CH1 DIMM L size: 8 GB, width: X8, ranks: 1, 16Gb DIMMs: no
<7> [464.094598] [drm:skl_dram_get_dimm_info [i915]] CH1 DIMM S size: 0 GB, width: X0, ranks: 0, 16Gb DIMMs: no
<7> [464.094626] [drm:skl_dram_get_channel_info [i915]] CH1 ranks: 1, 16Gb DIMMs: no
<7> [464.094651] [drm:i915_driver_load [i915]] Memory configuration is symmetric? yes
<7> [464.094676] [drm:i915_driver_load [i915]] DRAM bandwidth: 8533344 kBps, channels: 2
<7> [464.094699] [drm:i915_driver_load [i915]] DRAM ranks: 1, 16Gb DIMMs: no
<7> [464.094751] [drm:intel_bios_init [i915]] Skipping VBT init due to disabled display.
<7> [464.095058] [drm:intel_dsm_detect [i915]] no _DSM method for intel device
<7> [464.095117] [drm:i915_driver_load [i915]] rawclk rate: 19200 kHz
<7> [464.095151] [drm:gen9_set_dc_state [i915]] Setting DC state from 00 to 00
<7> [464.095219] [drm:icl_combo_phys_init [i915]] Port A combo PHY already enabled, won't reprogram it.
<7> [464.095289] [drm:icl_combo_phys_init [i915]] Port B combo PHY already enabled, won't reprogram it.
<7> [464.095383] [drm:intel_power_well_enable [i915]] enabling power well 1
<7> [464.095446] [drm:intel_dump_cdclk_state [i915]] Current CDCLK 307200 kHz, VCO 614400 kHz, ref 38400 kHz, bypass 50000 kHz, voltage level 0
<7> [464.095495] [drm:intel_power_well_enable [i915]] enabling always-on
<7> [464.095523] [drm:intel_power_well_enable [i915]] enabling DC off
<7> [464.095551] [drm:gen9_set_dc_state [i915]] Setting DC state from 00 to 00
<7> [464.095615] [drm:icl_combo_phys_init [i915]] Port A combo PHY already enabled, won't reprogram it.
<7> [464.095672] [drm:icl_combo_phys_init [i915]] Port B combo PHY already enabled, won't reprogram it.
<7> [464.095702] [drm:intel_power_well_enable [i915]] enabling power well 2
<7> [464.095731] [drm:intel_power_well_enable [i915]] enabling power well 3
<7> [464.095771] [drm:intel_power_well_enable [i915]] enabling power well 4
<7> [464.095826] [drm:intel_csr_ucode_init [i915]] Loading i915/icl_dmc_ver1_07.bin
<7> [464.096295] [drm:intel_fbc_init [i915]] Sanitized enable_fbc value: 1
<7> [464.096349] [drm:intel_print_wm_latency [i915]] Gen9 Plane WM0 latency 2 (2.0 usec)
<7> [464.096376] [drm:intel_print_wm_latency [i915]] Gen9 Plane WM1 latency 22 (22.0 usec)
<7> [464.096399] [drm:intel_print_wm_latency [i915]] Gen9 Plane WM2 latency 22 (22.0 usec)
<7> [464.096423] [drm:intel_print_wm_latency [i915]] Gen9 Plane WM3 latency 22 (22.0 usec)
<7> [464.096445] [drm:intel_print_wm_latency [i915]] Gen9 Plane WM4 latency 32 (32.0 usec)
<7> [464.096467] [drm:intel_print_wm_latency [i915]] Gen9 Plane WM5 latency 52 (52.0 usec)
<7> [464.096488] [drm:intel_print_wm_latency [i915]] Gen9 Plane WM6 latency 87 (87.0 usec)
<7> [464.096510] [drm:intel_print_wm_latency [i915]] Gen9 Plane WM7 latency 92 (92.0 usec)
<7> [464.096554] [drm:intel_modeset_init [i915]] 0 display pipe available.
<7> [464.096600] [drm:intel_dump_cdclk_state [i915]] Current CDCLK 307200 kHz, VCO 614400 kHz, ref 38400 kHz, bypass 50000 kHz, voltage level 0
<6> [464.096752] mei_hdcp mei::b638ab7e-94e2-4ea2-a552-d1c54b627f04:01: bound 0000:00:02.0 (ops i915_hdcp_component_ops [i915])
<7> [464.096796] [drm:intel_update_max_cdclk [i915]] Max CD clock rate: 652800 kHz
<7> [464.096839] [drm:intel_modeset_init [i915]] Max dotclock rate: 1305600 kHz
<7> [464.097262] [drm:intel_modeset_setup_hw_state [i915]] DPLL 0 hw state readout: crtc_mask 0x00000000, on 0
<7> [464.097309] [drm:intel_modeset_setup_hw_state [i915]] DPLL 1 hw state readout: crtc_mask 0x00000000, on 0
<7> [464.097351] [drm:intel_modeset_setup_hw_state [i915]] TBT PLL hw state readout: crtc_mask 0x00000000, on 0
<7> [464.097392] [drm:intel_modeset_setup_hw_state [i915]] MG PLL 1 hw state readout: crtc_mask 0x00000000, on 0
<7> [464.097431] [drm:intel_modeset_setup_hw_state [i915]] MG PLL 2 hw state readout: crtc_mask 0x00000000, on 0
<7> [464.097469] [drm:intel_modeset_setup_hw_state [i915]] MG PLL 3 hw state readout: crtc_mask 0x00000000, on 0
<7> [464.097505] [drm:intel_modeset_setup_hw_state [i915]] MG PLL 4 hw state readout: crtc_mask 0x00000000, on 0
<6> [464.098390] [drm] Finished loading DMC firmware i915/icl_dmc_ver1_07.bin (v1.7)
<7> [464.098498] [drm:i915_gem_init_ggtt [i915]] clearing unused GTT space: [1000, 100000000]
<7> [464.098798] [drm:intel_engine_init_ctx_wa [i915]] Initialized 4 context workarounds
<7> [464.099062] [drm:i915_gem_contexts_init [i915]] logical context support initialized
<7> [464.099986] [drm:logical_ring_init [i915]] Initialized 8 rcs0 workarounds
<7> [464.100039] [drm:logical_render_ring_init [i915]] Initialized 2 whitelist workarounds
<6> [464.104703] [drm] Initialized i915 1.6.0 20190328 for 0000:00:02.0 on minor 0
<7> [464.106131] [drm:intel_power_well_disable [i915]] disabling power well 4
<7> [464.106222] [drm:intel_power_well_disable [i915]] disabling power well 3
<7> [464.106287] [drm:intel_power_well_disable [i915]] disabling power well 2
<7> [464.106333] [drm:intel_power_well_disable [i915]] disabling DC off
<7> [464.106379] [drm:skl_enable_dc6 [i915]] Enabling DC6
<7> [464.106421] [drm:gen9_set_dc_state [i915]] Setting DC state from 00 to 02
<7> [464.106950] [drm:intel_power_well_disable [i915]] disabling always-on
<7> [464.106973] i915 device info: pciid=0x8a56 rev=0x07 platform=ICELAKE (subplatform=0x1) gen=11
<7> [464.106975] i915 device info: is_mobile: no
<7> [464.106978] i915 device info: is_lp: no
<7> [464.106980] i915 device info: is_alpha_support: no
<7> [464.106982] i915 device info: has_64bit_reloc: yes
<7> [464.106984] i915 device info: gpu_reset_clobbers_display: no
<7> [464.106986] i915 device info: has_reset_engine: yes
<7> [464.106988] i915 device info: has_fpga_dbg: yes
<7> [464.106990] i915 device info: has_guc: yes
<7> [464.106992] i915 device info: has_guc_ct: no
<7> [464.106994] i915 device info: has_l3_dpf: no
<7> [464.106996] i915 device info: has_llc: yes
<7> [464.106998] i915 device info: has_logical_ring_contexts: yes
<7> [464.107000] i915 device info: has_logical_ring_elsq: yes
<7> [464.107003] i915 device info: has_logical_ring_preemption: yes
<7> [464.107004] i915 device info: has_pooled_eu: no
<7> [464.107006] i915 device info: has_rc6: yes
<7> [464.107008] i915 device info: has_rc6p: no
<7> [464.107010] i915 device info: has_runtime_pm: yes
<7> [464.107013] i915 device info: has_snoop: no
<7> [464.107015] i915 device info: has_coherent_ggtt: no
<7> [464.107017] i915 device info: unfenced_needs_alignment: no
<7> [464.107019] i915 device info: hws_needs_physical: no
<7> [464.107021] i915 device info: cursor_needs_physical: no
<7> [464.107023] i915 device info: has_csr: yes
<7> [464.107025] i915 device info: has_ddi: yes
<7> [464.107027] i915 device info: has_dp_mst: yes
<7> [464.107029] i915 device info: has_fbc: yes
<7> [464.107031] i915 device info: has_gmch: no
<7> [464.107033] i915 device info: has_hotplug: yes
<7> [464.107035] i915 device info: has_ipc: yes
<7> [464.107037] i915 device info: has_overlay: no
<7> [464.107039] i915 device info: has_psr: yes
<7> [464.107041] i915 device info: overlay_needs_physical: no
<7> [464.107043] i915 device info: supports_tv: no
<7> [464.107046] i915 device info: slice total: 1, mask=0001
<7> [464.107048] i915 device info: subslice total: 4
<7> [464.107050] i915 device info: slice0: 4 subslices, mask=00cc
<7> [464.107053] i915 device info: EU total: 32
<7> [464.107055] i915 device info: EU per subslice: 8
<7> [464.107057] i915 device info: has slice power gating: yes
<7> [464.107059] i915 device info: has subslice power gating: yes
<7> [464.107060] i915 device info: has EU power gating: yes
<7> [464.107063] i915 device info: CS timestamp frequency: 19200 kHz
<6> [464.107064] [drm] DRM_I915_DEBUG enabled
<6> [464.107066] [drm] DRM_I915_DEBUG_GEM enabled
<6> [464.107068] [drm] DRM_I915_DEBUG_RUNTIME_PM enabled
<6> [464.107071] i915: Performing live selftests with st_random_seed=0x24c2e10f st_timeout=1000
<6> [464.107074] i915: Running hugepages
<6> [464.107277] i915: Running i915_gem_huge_page_live_selftests/igt_shrink_thp
<6> [464.109433] failed to allocate THP, finishing test early
<6> [464.109451] i915: Running i915_gem_huge_page_live_selftests/igt_ppgtt_pin_update
<7> [464.110179] [drm:intel_power_well_enable [i915]] enabling always-on
<6> [464.111014] i915: Running i915_gem_huge_page_live_selftests/igt_tmpfs_fallback
<6> [464.111063] i915: Running i915_gem_huge_page_live_selftests/igt_ppgtt_exhaust_huge
<7> [465.111367] igt_write_huge timed out on engine=1, offset_low=68600000 offset_high=ffff977d0000, max_page_size=10000
<7> [466.112266] igt_write_huge timed out on engine=0, offset_low=6fa00000 offset_high=ffff90210000, max_page_size=10000
<7> [467.113303] igt_write_huge timed out on engine=1, offset_low=21e00000 offset_high=ffffddf70000, max_page_size=10000
<7> [468.115074] igt_write_huge timed out on engine=0, offset_low=21e00000 offset_high=ffffddeb0000, max_page_size=10000
<7> [469.116458] igt_write_huge timed out on engine=6, offset_low=422600000 offset_high=fffbdd800000, max_page_size=200000
<7> [470.117801] igt_write_huge timed out on engine=2, offset_low=22e00000 offset_high=ffffdcdb0000, max_page_size=10000
<7> [471.119455] igt_write_huge timed out on engine=1, offset_low=468a00000 offset_high=fffb97200000, max_page_size=200000
<7> [472.120658] igt_write_huge timed out on engine=2, offset_low=22200000 offset_high=ffffdd8f0000, max_page_size=10000
<7> [473.121536] igt_write_huge timed out on engine=1, offset_low=21e00000 offset_high=ffffddc50000, max_page_size=10000
<7> [474.122414] igt_write_huge timed out on engine=1, offset_low=44ea00000 offset_high=fffbb1200000, max_page_size=200000
<7> [475.123929] igt_write_huge timed out on engine=2, offset_low=45ca00000 offset_high=fffba3200000, max_page_size=200000
<7> [476.125418] igt_write_huge timed out on engine=6, offset_low=22200000 offset_high=ffffdd870000, max_page_size=10000
<7> [477.126755] igt_write_huge timed out on engine=6, offset_low=450e00000 offset_high=fffbaee00000, max_page_size=200000
<7> [478.127850] igt_write_huge timed out on engine=2, offset_low=465200000 offset_high=fffb9aa00000, max_page_size=200000
<6> [478.128115] i915: Running i915_gem_huge_page_live_selftests/igt_ppgtt_gemfs_huge
<6> [478.129573] finishing test early, gemfs unable to allocate huge-page(s) with size=2097152
<6> [478.129583] i915: Running i915_gem_huge_page_live_selftests/igt_ppgtt_internal_huge
<7> [479.130394] igt_write_huge timed out on engine=6, offset_low=78800000 offset_high=ffff874b0000, max_page_size=10000
<7> [480.131396] igt_write_huge timed out on engine=1, offset_low=70000000 offset_high=ffff8fd10000, max_page_size=10000
<7> [481.132306] igt_write_huge timed out on engine=6, offset_low=62c00000 offset_high=ffff9d0b0000, max_page_size=10000
<7> [482.133592] igt_write_huge timed out on engine=6, offset_low=51000000 offset_high=ffffaed30000, max_page_size=10000
<7> [483.134322] igt_write_huge timed out on engine=0, offset_low=3ce00000 offset_high=ffffc2ef0000, max_page_size=10000
<7> [484.135782] igt_write_huge timed out on engine=0, offset_low=4f3a00000 offset_high=fffb0c400000, max_page_size=200000
<7> [484.136776] [drm:intel_power_well_enable [i915]] enabling DC off
<7> [484.137632] [drm:gen9_set_dc_state [i915]] Setting DC state from 02 to 00
<7> [484.137750] [drm:icl_combo_phys_init [i915]] Port A combo PHY already enabled, won't reprogram it.
<7> [484.137815] [drm:icl_combo_phys_init [i915]] Port B combo PHY already enabled, won't reprogram it.
<7> [484.137849] [drm:intel_power_well_enable [i915]] enabling power well 2
<7> [484.137883] [drm:intel_power_well_enable [i915]] enabling power well 3
<7> [484.137932] [drm:intel_power_well_enable [i915]] enabling power well 4
<4> [485.483401] ------------[ cut here ]------------
<4> [485.483403] WARN_ON(!list_empty(&dev_priv->contexts.list))
<4> [485.483460] WARNING: CPU: 4 PID: 5176 at drivers/gpu/drm/i915/i915_gem.c:5044 i915_gem_fini+0x13b/0x150 [i915]
<4> [485.483461] Modules linked in: i915(+) amdgpu gpu_sched ttm vgem snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic mei_hdcp btusb btrtl btbcm btintel x86_pkg_temp_thermal coretemp bluetooth crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_codec e1000e snd_hwdep snd_hda_core cdc_ether usbnet mii snd_pcm ptp pps_core mei_me i2c_i801 ecdh_generic mei prime_numbers [last unloaded: i915]
<4> [485.483475] CPU: 4 PID: 5176 Comm: i915_selftest Tainted: G     U            5.1.0-rc3-CI-CI_DRM_5869+ #1
<4> [485.483476] Hardware name: Intel Corporation Ice Lake Client Platform/IceLake U DDR4 SODIMM PD RVP TLC, BIOS ICLSFWR1.R00.3087.A00.1902250334 02/25/2019
<4> [485.483511] RIP: 0010:i915_gem_fini+0x13b/0x150 [i915]
<4> [485.483512] Code: 08 aa 00 00 48 81 c5 08 aa 00 00 48 39 c5 75 07 5b 5d 41 5c 41 5d c3 48 c7 c6 f0 fd 41 a0 48 c7 c7 d3 c6 44 a0 e8 25 2c dc e0 <0f> 0b 5b 5d 41 5c 41 5d c3 66 90 66 2e 0f 1f 84 00 00 00 00 00 e9
<4> [485.483514] RSP: 0018:ffffc90000233b00 EFLAGS: 00010286
<4> [485.483515] RAX: 0000000000000000 RBX: ffff888419589020 RCX: 0000000000000004
<4> [485.483516] RDX: 0000000000000006 RSI: ffff88848d6408b8 RDI: ffffffff8211dc3d
<4> [485.483517] RBP: ffff88841958aa08 R08: 0000000004dfabf4 R09: 0000000000000000
<4> [485.483518] R10: 0000000000000000 R11: 0000000000000000 R12: ffff888419587630
<4> [485.483519] R13: ffff888419580068 R14: ffffffffa04cc1e0 R15: ffffc90000233e98
<4> [485.483521] FS:  00007f30e50d5980(0000) GS:ffff88849ff00000(0000) knlGS:0000000000000000
<4> [485.483522] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [485.483523] CR2: 000055e40fbf1880 CR3: 00000004074d0003 CR4: 0000000000760ee0
<4> [485.483524] PKRU: 55555554
<4> [485.483525] Call Trace:
<4> [485.483557]  i915_driver_unload+0xd1/0x120 [i915]
<4> [485.483585]  i915_pci_remove+0x19/0x30 [i915]
<4> [485.483609]  i915_pci_probe+0x60/0xa0 [i915]
<4> [485.483613]  pci_device_probe+0xa1/0x120
<4> [485.483617]  really_probe+0xf3/0x3e0
<4> [485.483620]  driver_probe_device+0x10a/0x120
<4> [485.483623]  device_driver_attach+0x4b/0x50
<4> [485.483625]  __driver_attach+0x97/0x130
<4> [485.483627]  ? device_driver_attach+0x50/0x50
<4> [485.483629]  bus_for_each_dev+0x74/0xc0
<4> [485.483632]  bus_add_driver+0x13f/0x210
<4> [485.483634]  ? 0xffffffffa0bc3000
<4> [485.483636]  driver_register+0x56/0xe0
<4> [485.483637]  ? 0xffffffffa0bc3000
<4> [485.483640]  do_one_initcall+0x58/0x2e0
<4> [485.483642]  ? do_init_module+0x1d/0x1ea
<4> [485.483644]  ? rcu_read_lock_sched_held+0x6f/0x80
<4> [485.483646]  ? kmem_cache_alloc_trace+0x261/0x290
<4> [485.483649]  do_init_module+0x56/0x1ea
<4> [485.483652]  load_module+0x2701/0x29e0
<4> [485.483661]  ? __se_sys_finit_module+0xd3/0xf0
<4> [485.483662]  __se_sys_finit_module+0xd3/0xf0
<4> [485.483668]  do_syscall_64+0x55/0x190
<4> [485.483671]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [485.483672] RIP: 0033:0x7f30e4993839
<4> [485.483674] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [485.483675] RSP: 002b:00007fffd485a178 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [485.483677] RAX: ffffffffffffffda RBX: 000055688b4868f0 RCX: 00007f30e4993839
<4> [485.483678] RDX: 0000000000000000 RSI: 000055688b47ff30 RDI: 0000000000000006
<4> [485.483679] RBP: 000055688b47ff30 R08: 0000000000000004 R09: 0000000000000000
<4> [485.483680] R10: 00007fffd485a2f0 R11: 0000000000000246 R12: 0000000000000000
<4> [485.483681] R13: 000055688b480a70 R14: 0000000000000020 R15: 0000000000000048
<4> [485.483687] irq event stamp: 9050758
<4> [485.483690] hardirqs last  enabled at (9050757): [<ffffffff81126564>] vprintk_emit+0x124/0x320
<4> [485.483691] hardirqs last disabled at (9050758): [<ffffffff810019b0>] trace_hardirqs_off_thunk+0x1a/0x1c
<4> [485.483693] softirqs last  enabled at (9048830): [<ffffffff8183484c>] peernet2id+0x4c/0x70
<4> [485.483695] softirqs last disabled at (9048828): [<ffffffff8183482d>] peernet2id+0x2d/0x70
<4> [485.483731] WARNING: CPU: 4 PID: 5176 at drivers/gpu/drm/i915/i915_gem.c:5044 i915_gem_fini+0x13b/0x150 [i915]
<4> [485.483732] ---[ end trace 41161884b12e92f7 ]---
<4> [485.530386] i915: probe of 0000:00:02.0 failed with error -25
<6> [485.531243] [IGT] i915_selftest: exiting, ret=0

Setting the priority of this bug as highest.
Based on the impact of this bug to the customer, priority can be changed.
Comment 1 CI Bug Log 2019-04-04 14:59:47 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* ICL: igt@i915_selftest@live_hugepages - dmesg-warn - failed to allocate THP, finishing test early
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_4927/fi-icl-u2/igt@i915_selftest@live_hugepages.html

* ICL: igt@runner@aborted - fail - Previous test: i915_selftest (live_hugepages)
  - https://intel-gfx-ci.01.org/tree/drm-tip/IGT_4927/fi-icl-u2/igt@runner@aborted.html
Comment 2 Lakshmi 2019-04-04 15:00:06 UTC
Setting the priority of this bug as highest.
Based on the impact of this bug to the customer, priority can be changed.
Comment 3 Chris Wilson 2019-04-04 18:31:26 UTC
"failed to allocate THP, finishing test early" is a debug message (technically an info).

The root cause here is likely to be the memory corruption we see around icl.
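
To illustrate the distinction between the info message and the actual warning, here is a minimal sketch that splits dmesg lines of the form shown in the description by their kernel log level. The helper names (`parse_line`, `warnings_only`) are hypothetical and are not part of IGT or CI Bug Log; the only facts assumed are the `<level> [timestamp] message` layout seen in the log and the standard kernel levels (4 = warning, 6 = info, 7 = debug).

```python
import re

# Matches "<LEVEL> [TIMESTAMP] MESSAGE", the layout used in the log above.
LINE_RE = re.compile(r"^<(\d)>\s*\[\s*(\d+\.\d+)\]\s?(.*)$")

# Standard kernel log levels: lower number means higher severity.
WARN_LEVEL = 4  # 4 = warning, 5 = notice, 6 = info, 7 = debug


def parse_line(line):
    """Return (level, timestamp, message), or None if the line does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    return int(m.group(1)), float(m.group(2)), m.group(3)


def warnings_only(lines):
    """Keep only the messages logged at warning level or more severe."""
    out = []
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None and parsed[0] <= WARN_LEVEL:
            out.append(parsed[2])
    return out


sample = [
    "<6> [464.109433] failed to allocate THP, finishing test early",
    "<4> [485.483403] WARN_ON(!list_empty(&dev_priv->contexts.list))",
    "<7> [484.137932] [drm:intel_power_well_enable [i915]] enabling power well 4",
]
print(warnings_only(sample))
```

With the three sample lines taken from the description, only the `WARN_ON` line is kept; the THP message is level 6 and is dropped.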
Comment 4 CI Bug Log 2019-04-05 05:20:02 UTC
A CI Bug Log filter associated to this bug has been updated:

{- ICL: igt@i915_selftest@live_hugepages - dmesg-warn - failed to allocate THP, finishing test early -}
{+ ICL: igt@i915_selftest@live_hugepages - dmesg-warn - WARN_ON(!list_empty(&dev_priv->contexts.list)) +}

 No new failures caught with the new filter
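
As a rough sketch of how the updated filter text could be matched against dmesg output, the snippet below escapes the warning string before compiling it. This assumes a filter behaves like a plain regular expression search; how CI Bug Log actually evaluates filters is not stated in this report, and `matches_filter` is a hypothetical helper.

```python
import re

# Warning text exactly as it appears in the dmesg log of this bug.
WARNING_TEXT = "WARN_ON(!list_empty(&dev_priv->contexts.list))"

# The text contains regex metacharacters (parentheses and dots), so it is
# escaped first. Unescaped, the parentheses would be read as capture groups
# and the pattern would no longer match the literal text.
FILTER_RE = re.compile(re.escape(WARNING_TEXT))


def matches_filter(dmesg_text):
    """Return True if the warning text occurs anywhere in dmesg_text."""
    return FILTER_RE.search(dmesg_text) is not None


print(matches_filter("<4> [485.483403] WARN_ON(!list_empty(&dev_priv->contexts.list))"))
print(matches_filter("<6> [464.109433] failed to allocate THP, finishing test early"))
```

The first call matches the warning line and the second does not, which is the behaviour the filter change above is meant to achieve.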
Comment 5 Francesco Balestrieri 2019-04-08 10:48:21 UTC
Another one that we'll keep observing, but no immediate action to be taken. I suggest moving to "high".
Comment 6 Jani Saarinen 2019-04-11 06:42:14 UTC
BIOS was updated last week on this U2 system.
Comment 7 Lakshmi 2019-04-11 06:49:11 UTC
(In reply to Francesco Balestrieri from comment #5)
> Another one that we'll keep observing, but no immediate action to be taken.
> I suggest moving to "high".

Yes, we will keep monitoring this issue. Since it's a BAT failure, we should see it more often if it appears again. Setting the priority to high.

The worst case would be that the user cannot use hugepages.
@Francesco, what would be the impact if hugepages don't work?
Comment 8 Chris Wilson 2019-04-11 06:53:06 UTC
The bug has nothing to do with hugepages; it's the general memory corruption bug.
Comment 9 Francesco Balestrieri 2019-04-11 07:44:57 UTC
Lakshmi, this appears to have the same root cause as Bug 110329
Comment 10 Jani Saarinen 2019-04-22 12:19:59 UTC
Should we dup this then?
Comment 11 Jani Saarinen 2019-04-22 12:22:42 UTC
Not seen after BIOS update.
Comment 12 Francesco Balestrieri 2019-06-03 05:25:25 UTC
Still not seen after two months. Closing.

