Bug 102777

Summary: [BAT][GLK] igt@drv_module_reload dmesg-warn igt@drv_module_reload@basic-no-display|basic-reload-inject|basic-reload
Product: DRI
Reporter: Marta Löfstedt <marta.lofstedt>
Component: DRM/Intel
Assignee: Imre Deak <imre.deak>
Status: CLOSED FIXED
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal
Priority: highest
CC: intel-gfx-bugs, ricardo.vega
Version: DRI git
Hardware: Other
OS: All
Whiteboard: ReadyForDev
i915 platform: GLK
i915 features: power/Other

Description Marta Löfstedt 2017-09-15 10:19:51 UTC
On CI_DRM_3092, fi-glk-2a-b was added to the lab and started timing out on register access, hitting the WARN_ON below during:
igt@drv_module_reload@basic-reload 
igt@drv_module_reload@basic-no-display
igt@drv_module_reload@basic-reload-inject

[  530.226475] [drm:intel_update_cdclk [i915]] Current CD clock rate: 79200 kHz, VCO: 633600 kHz, ref: 19200 kHz
[  530.226533] [drm:intel_power_well_enable [i915]] enabling always-on
[  530.226573] [drm:intel_power_well_enable [i915]] enabling DC off
[  530.226614] [drm:gen9_set_dc_state [i915]] Setting DC state from 00 to 00
[  530.226666] [drm:intel_power_well_enable [i915]] enabling power well 2
[  530.226730] [drm:intel_power_well_enable [i915]] enabling dpio-common-a
[  530.226798] [drm:bxt_ddi_phy_is_enabled [i915]] DDI PHY 1 powered, but common lane for port A powered down (PHY_CTL 00000600)
[  530.226915] [drm:intel_power_well_enable [i915]] enabling dpio-common-b
[  530.226981] [drm:bxt_ddi_phy_is_enabled [i915]] DDI PHY 1 powered, but common lane for port A powered down (PHY_CTL 00000600)
[  530.227045] [drm:bxt_ddi_phy_is_enabled [i915]] DDI PHY 1 powered, but common lane for port A powered down (PHY_CTL 00000600)
[  530.227308] [drm:_bxt_ddi_phy_init [i915]] DDI PHY 0 already enabled, won't reprogram it
[  530.227352] [drm:intel_power_well_enable [i915]] enabling dpio-common-c
[  530.228300] [drm:_bxt_ddi_phy_init [i915]] DDI PHY 2 already enabled, won't reprogram it
[  530.228346] [drm:intel_power_well_enable [i915]] enabling AUX A
[  530.232107] WARN_ON(intel_wait_for_register(dev_priv, ((const i915_reg_t){ .reg = ((((const u32 []){ 0x45404 })[((id) >> 4)])) }), (1 << (((id) & 0xf) * 2)), (1 << (((id) & 0xf) * 2)), 1))
[  530.232144] ------------[ cut here ]------------
[  530.232214] WARNING: CPU: 1 PID: 4494 at drivers/gpu/drm/i915/intel_runtime_pm.c:313 hsw_power_well_enable+0x123/0x130 [i915]
[  530.232217] Modules linked in: i915(+) vgem snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_codec snd_hwdep snd_hda_core snd_pcm r8169 mii prime_numbers i2c_hid pinctrl_geminilake pinctrl_intel [last unloaded: i915]
[  530.232280] CPU: 1 PID: 4494 Comm: drv_module_relo Tainted: G     U          4.13.0-CI-CI_DRM_3092+ #1
[  530.232284] Hardware name: Intel Corp. Geminilake/GLK RVP2 LP4SD (07), BIOS GELKRVPA.X64.0062.B30.1708222146 08/22/2017
[  530.232287] task: ffff880170072780 task.stack: ffffc9000078c000
[  530.232327] RIP: 0010:hsw_power_well_enable+0x123/0x130 [i915]
[  530.232331] RSP: 0018:ffffc9000078fa20 EFLAGS: 00010282
[  530.232336] RAX: 00000000000000b0 RBX: ffff880129810000 RCX: 0000000000000006
[  530.232339] RDX: 0000000000000006 RSI: ffffffff81cf7904 RDI: ffffffff81cae6ee
[  530.232343] RBP: ffffc9000078fa48 R08: ffff880170073098 R09: 0000000000000000
[  530.232346] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[  530.232349] R13: ffffffffa04176f0 R14: 0000000000000001 R15: aaaaaaaaaaaaaaab
[  530.232352] FS:  00007fe0f39e78c0(0000) GS:ffff88017fc80000(0000) knlGS:0000000000000000
[  530.232355] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  530.232358] CR2: 000055c087db3570 CR3: 00000001708d6000 CR4: 00000000003406e0
[  530.232361] Call Trace:
[  530.232403]  intel_power_well_enable+0x3a/0x50 [i915]
[  530.232442]  __intel_display_power_get_domain+0x81/0xa0 [i915]
[  530.232483]  intel_display_power_get+0x35/0x50 [i915]
[  530.232523]  intel_display_set_init_power+0x3c/0x40 [i915]
[  530.232562]  intel_power_domains_init_hw+0x63/0x780 [i915]
[  530.232602]  i915_driver_load+0x9b6/0x16c0 [i915]
[  530.232643]  i915_pci_probe+0x37/0x90 [i915]
[  530.232652]  pci_device_probe+0xa8/0x130
[  530.232659]  driver_probe_device+0x29c/0x450
[  530.232665]  __driver_attach+0xe3/0xf0
[  530.232670]  ? driver_probe_device+0x450/0x450
[  530.232674]  bus_for_each_dev+0x62/0xa0
[  530.232680]  driver_attach+0x1e/0x20
[  530.232683]  bus_add_driver+0x173/0x270
[  530.232689]  driver_register+0x60/0xe0
[  530.232694]  __pci_register_driver+0x60/0x70
[  530.232733]  i915_init+0x6f/0x78 [i915]
[  530.232736]  ? 0xffffffffa0466000
[  530.232741]  do_one_initcall+0x43/0x170
[  530.232746]  ? rcu_read_lock_sched_held+0x7a/0x90
[  530.232751]  ? kmem_cache_alloc_trace+0x261/0x2d0
[  530.232758]  do_init_module+0x5f/0x206
[  530.232763]  load_module+0x2561/0x2da0
[  530.232769]  ? show_coresize+0x30/0x30
[  530.232775]  ? kernel_read_file+0x105/0x190
[  530.232784]  SyS_finit_module+0xc1/0x100
[  530.232787]  ? SyS_finit_module+0xc1/0x100
[  530.232798]  entry_SYSCALL_64_fastpath+0x1c/0xb1
[  530.232801] RIP: 0033:0x7fe0f1efa9f9
[  530.232804] RSP: 002b:00007ffd4010dd28 EFLAGS: 00000206 ORIG_RAX: 0000000000000139
[  530.232810] RAX: ffffffffffffffda RBX: ffffffff8148a713 RCX: 00007fe0f1efa9f9
[  530.232812] RDX: 0000000000000000 RSI: 00007fe0f2e2de23 RDI: 0000000000000003
[  530.232815] RBP: ffffc9000078ff88 R08: 0000000000000000 R09: 0000000000000000
[  530.232818] R10: 0000000000000003 R11: 0000000000000206 R12: 0000000000000000
[  530.232821] R13: 0000563a98f68440 R14: 0000000000000000 R15: 0000563a98f68a10
[  530.232827]  ? __this_cpu_preempt_check+0x13/0x20
[  530.232834] Code: 00 00 4c 89 ef e8 8e 5b 30 e1 eb b3 31 f6 e8 a5 fe ff ff e9 28 ff ff ff 48 c7 c6 98 eb 3e a0 48 c7 c7 e6 e8 3d a0 e8 7e 17 e2 e0 <0f> ff e9 71 ff ff ff 66 0f 1f 44 00 00 0f 1f 44 00 00 55 48 89 
[  530.232996] ---[ end trace 0d0b86d0acdea3d9 ]---
[  530.233037] [drm:intel_power_well_enable [i915]] enabling AUX B
[  530.233082] [drm:intel_power_well_enable [i915]] enabling AUX C
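
To make the WARN_ON condition in the log easier to read, here is a minimal Python sketch that transliterates the expanded C expression printed at 530.232107. It only restates what the log shows: the register is picked from a single-entry array (0x45404) indexed by `id >> 4`, and the bit waited on is `1 << ((id & 0xf) * 2)`, with the expected value equal to the mask. The function name `power_well_wait_params` is hypothetical, and the id of the AUX A power well is not stated in this report, so the value 8 below is purely illustrative. The trailing `1` in the expression is presumably the wait timeout in milliseconds.

```python
# Transliteration of the expression printed in the WARN_ON line above.
# Names here are hypothetical; only the constants come from the log.

# The single-entry u32 array from the macro expansion: { 0x45404 }
PWR_WELL_CTL_REGS = [0x45404]


def power_well_wait_params(well_id):
    """Return (register offset, mask, expected value) for a power-well id.

    Mirrors: reg   = regs[id >> 4]
             mask  = 1 << ((id & 0xf) * 2)
             value = mask
    Ids of 16 and above would index past the single-entry array shown
    in the log, so Python raises IndexError for them.
    """
    reg = PWR_WELL_CTL_REGS[well_id >> 4]
    mask = 1 << ((well_id & 0xF) * 2)
    return reg, mask, mask


if __name__ == "__main__":
    # Illustrative id only; the real AUX A id is not given in this report.
    reg, mask, value = power_well_wait_params(8)
    print(f"wait for (reg 0x{reg:05x} & 0x{mask:08x}) == 0x{value:08x}")
```

Read this way, the warning means the status bit for the well being enabled did not read back as set within the wait window.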


Note that the A0 system fi-glk-2a passes these tests, so this may be a firmware issue specific to this B0 system?

Full data:
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3092/fi-glk-2a-b/igt@drv_module_reload@basic-no-display.html
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3092/fi-glk-2a-b/igt@drv_module_reload@basic-reload.html
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3092/fi-glk-2a-b/igt@drv_module_reload@basic-reload-inject.html
Comment 1 Ricardo 2017-09-28 14:17:35 UTC
Lowering priority to high according to the matrix; this is not a constant issue.
Comment 2 Jari Tahvanainen 2017-10-02 10:32:50 UTC
Something changed on https://intel-gfx-ci.01.org/CI/CI_DRM_3159, since dmesg-warn is now constant (100% repeatable) with all 3 subtests. Quite a few commits might have caused the change...
Comment 3 Jani Saarinen 2017-10-02 14:07:49 UTC
Reference patch for the fix: https://patchwork.freedesktop.org/series/31265/
Comment 4 Jani Saarinen 2017-10-02 14:31:18 UTC
Patch fixing the issue: https://intel-gfx-ci.01.org/tree/drm-tip/Patchwork_5869/
Comment 5 Jani Saarinen 2017-10-03 10:02:36 UTC
drm/i915: Fix DDI PHY init if it was already on

author	Imre Deak <imre.deak@intel.com>	2017-10-02 13:53:07 (GMT)
committer	Imre Deak <imre.deak@intel.com>	2017-10-03 09:09:52 (GMT)
commit	e19c1eb885ac4186e64c7e484424124f3145318e
