Bug 108622 - [CI][BAT] igt@drv_selftest@live_execlists - dmesg-warn - *ERROR* MMIO: GuC action 0x2 failed with error -5 0xf000f000
Summary: [CI][BAT] igt@drv_selftest@live_execlists - dmesg-warn - *ERROR* MMIO: GuC ac...
Status: NEW
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: Other All
: high normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
: 108732 (view as bug list)
Depends on:
Blocks:
 
Reported: 2018-11-01 11:09 UTC by Martin Peres
Modified: 2018-11-25 20:24 UTC (History)
1 user (show)

See Also:
i915 platform: BXT
i915 features: firmware/guc


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Martin Peres 2018-11-01 11:09:28 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/IGT_4704/fi-apl-guc/igt@drv_selftest@live_execlists.html

<3> [557.763361] [drm:intel_guc_send_mmio [i915]] *ERROR* MMIO: GuC action 0x2 failed with error -5 0xf000f000
<4> [557.763716] ------------[ cut here ]------------
<4> [557.763815] WARN_ON(intel_guc_send(guc, data, (sizeof(data) / sizeof((data)[0]) + (sizeof(struct { int:(-!!(__builtin_types_compatible_p(typeof((data)), typeof(&(data)[0])))); })))))
<4> [557.763945] WARNING: CPU: 0 PID: 256 at drivers/gpu/drm/i915/intel_guc_submission.c:620 inject_preempt_context+0x4f4/0x600 [i915]
<4> [557.763950] Modules linked in: i915(+) amdgpu chash gpu_sched ttm vgem snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic btusb btrtl btbcm btintel x86_pkg_temp_thermal coretemp bluetooth crct10dif_pclmul crc32_pclmul ghash_clmulni_intel ecdh_generic lpc_ich r8169 snd_hda_codec snd_hwdep snd_hda_core snd_pcm mei_me mei pinctrl_broxton pinctrl_intel prime_numbers [last unloaded: i915]
<4> [557.764064] CPU: 0 PID: 256 Comm: kworker/u9:0 Tainted: G     U            4.19.0-CI-CI_DRM_5064+ #1
<4> [557.764068] Hardware name: Intel corporation NUC6CAYS/NUC6CAYB, BIOS AYAPLCEL.86A.0056.2018.0926.1100 09/26/2018
<4> [557.764153] Workqueue: i915-guc_preempt inject_preempt_context [i915]
<4> [557.764240] RIP: 0010:inject_preempt_context+0x4f4/0x600 [i915]
<4> [557.764244] Code: 00 00 48 c7 c2 50 91 59 a0 48 c7 c7 6a 16 4e a0 e8 f1 19 c8 e0 0f 0b 48 c7 c6 c8 39 5d a0 48 c7 c7 a7 2f 5b a0 e8 5c 34 ba e0 <0f> 0b 49 0f ba b5 98 04 00 00 01 f0 49 0f ba ad d8 03 00 00 00 0f
<4> [557.764248] RSP: 0018:ffffc90000247e00 EFLAGS: 00010286
<4> [557.764255] RAX: 0000000000000000 RBX: 0000000000000080 RCX: 0000000000000000
<4> [557.764259] RDX: 0000000000000007 RSI: ffffffff820c31be RDI: 00000000ffffffff
<4> [557.764263] RBP: ffffc90000485112 R08: 00000000a47aa0a9 R09: 0000000000000000
<4> [557.764266] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8802685735a8
<4> [557.764270] R13: ffff880269822158 R14: ffff880236661310 R15: ffff88025e45ba78
<4> [557.764274] FS:  0000000000000000(0000) GS:ffff880277a00000(0000) knlGS:0000000000000000
<4> [557.764278] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [557.764281] CR2: 000055ad3b1b8980 CR3: 0000000005210000 CR4: 00000000003406f0
<4> [557.764285] Call Trace:
<4> [557.764304]  process_one_work+0x245/0x610
<4> [557.764318]  worker_thread+0x37/0x380
<4> [557.764327]  ? process_one_work+0x610/0x610
<4> [557.764332]  kthread+0x119/0x130
<4> [557.764337]  ? kthread_park+0x80/0x80
<4> [557.764346]  ret_from_fork+0x3a/0x50
<4> [557.764363] irq event stamp: 164702
<4> [557.764369] hardirqs last  enabled at (164701): [<ffffffff810f828a>] console_unlock+0x3fa/0x5f0
<4> [557.764375] hardirqs last disabled at (164702): [<ffffffff81001930>] trace_hardirqs_off_thunk+0x1a/0x1c
<4> [557.764380] softirqs last  enabled at (164684): [<ffffffff81c00319>] __do_softirq+0x319/0x48e
<4> [557.764386] softirqs last disabled at (164677): [<ffffffff8108c5e9>] irq_exit+0xa9/0xc0
<4> [557.764470] WARNING: CPU: 0 PID: 256 at drivers/gpu/drm/i915/intel_guc_submission.c:620 inject_preempt_context+0x4f4/0x600 [i915]
<4> [557.764474] ---[ end trace 0e556d9560b6a02b ]---
Comment 1 Martin Peres 2018-11-01 11:10:41 UTC
The priority is set to high because the guc is not used in production.
Comment 2 Chris Wilson 2018-11-01 11:23:56 UTC
A simple solution would be to accept that guc preemption is not yet functional.

--- a/drivers/gpu/drm/i915/intel_guc_submission.c
+++ b/drivers/gpu/drm/i915/intel_guc_submission.c
@@ -1285,6 +1285,7 @@ static void guc_set_default_submission(struct intel_engine_cs *engine)
        engine->reset.prepare = guc_reset_prepare;
 
        engine->flags &= ~I915_ENGINE_SUPPORTS_STATS;
+       engine->flags &= ~I915_ENGINE_HAS_PREEMPTION;
 }
 
 int intel_guc_submission_enable(struct intel_guc *guc)
Comment 3 Chris Wilson 2018-11-13 16:43:43 UTC
*** Bug 108732 has been marked as a duplicate of this bug. ***
Comment 4 Lakshmi 2018-11-25 20:24:09 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_145/fi-apl-guc/igt@runner@aborted.html

Aborting.
Previous test: i915_hangman (hangcheck-unterminated)
Next test: gem_exec_params (rel-constants-invalid)

Kernel tainted (0x240 -- 200)

Test result of igt@i915_hangman@hangcheck-unterminated is pass.

https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_145/fi-apl-guc/igt%40i915_hangman%40hangcheck-unterminated.html


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.