Bug 105411 - [CI] [SNB only] igt@* - incomplete - timout/system hang
Summary: [CI] [SNB only] igt@* - incomplete - timout/system hang
Status: RESOLVED MOVED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: Other All
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks: 105984
  Show dependency treegraph
 
Reported: 2018-03-09 06:33 UTC by Marta Löfstedt
Modified: 2019-11-29 17:41 UTC (History)
1 user (show)

See Also:
i915 platform: SNB
i915 features: GPU hang


Attachments

Description Marta Löfstedt 2018-03-09 06:33:18 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3901/shard-snb3/igt@kms_vblank@pipe-b-ts-continuation-suspend.html

run.log:
running: igt/kms_vblank/pipe-b-ts-continuation-suspend

[09/97] skip: 3, pass: 6 /                            
FATAL: command execution failed

Last dmesg:
<7>[   53.888393] [drm:verify_connector_state.isra.78 [i915]] [CONNECTOR:52:VGA-1]
<7>[   53.888474] [drm:intel_atomic_commit_tail [i915]] [CRTC:51:pipe B]
<7>[   53.888567] [drm:verify_single_dpll_state.isra.79 [i915]] PCH DPLL A
<6>[   54.001354] PM: suspend entry (deep)
Comment 1 Marta Löfstedt 2018-03-26 06:50:40 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3975/shard-snb7/igt@drv_suspend@fence-restore-tiled2untiled.html

run.log:
running: igt/drv_suspend/fence-restore-tiled2untiled

[77/98] skip: 52, pass: 24, fail: 1 /               
FATAL: command execution failed
...
Completed CI_IGT_test CI_DRM_3975/shard-snb7/27 : FAILURE
CI_IGT_test runtime 510 seconds
Rebooting shard-snb7

Last dmesg:
<6>[  386.036825] Console: switching to colour dummy device 80x25
<7>[  386.036876] [IGT] drv_suspend: executing
<7>[  386.054557] [IGT] drv_suspend: starting subtest fence-restore-tiled2untiled
Comment 3 Marta Löfstedt 2018-04-04 05:59:00 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4016/shard-snb1/igt@syncobj_wait@multi-wait-for-submit-unsubmitted-signaled.html

From run.log:
running: igt/gem_exec_parallel/blt-fds

[81/99] skip: 54, pass: 27 /          
FATAL: command execution failed
...
Completed CI_IGT_test CI_DRM_4016/shard-snb1/14 : FAILURE
CI_IGT_test runtime 104 seconds
Rebooting shard-snb1

However, last test in dmesg is:
 <7>[   82.018763] [IGT] i915_query: starting subtest query-topology-kernel-writes
<7>[   82.018867] [IGT] i915_query: exiting, ret=77
<6>[   82.048429] Console: switching to colour frame buffer device 128x48
<6>[   82.158117] Console: switching to colour dummy device 80x25
<7>[   82.158171] [IGT] perf_pmu: executing

which is 3 test earlier, when this happens we think something may have happened with writing out dmesg to disk. So, shard-snb1 should be watched with extra care for a while.
Comment 4 Marta Löfstedt 2018-04-09 07:40:20 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/IGT_4413/shard-snb3/igt@gem_exec_parallel@basic.html

run.log:
running: igt/gem_exec_parallel/basic

[50/75] skip: 27, pass: 23 -        
FATAL: command execution failed
...
Completed CI_IGT_test CI_DRM_4029/shard-snb3/33 : FAILURE
CI_IGT_test runtime 151 seconds
Rebooting shard-snb3

dmesg:
<7>[   85.180778] [drm:__intel_fbc_disable [i915]] Disabling FBC on pipe A
<6>[   85.310318] Console: switching to colour dummy device 80x25
<7>[   85.310373] [IGT] gem_exec_parallel: executing
Comment 22 Martin Peres 2018-11-05 13:00:46 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_137/fi-snb-2600/igt@gem_exec_reloc@basic-wc-cpu-noreloc.html

<6> [111.733817] [IGT] gem_exec_reloc: executing
<3> [111.788154] [drm:fw_domains_get [i915]] *ERROR* render: timed out waiting for forcewake ack request.
Comment 24 Francesco Balestrieri 2018-11-23 11:18:54 UTC
Setting to medium since it's a meta-bug of sorts.
Comment 31 CI Bug Log 2019-01-29 09:16:03 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: igt@gem_random tests&amp;no logs - incomplete  -}
{+ fi-snb-2520m: igt@gem_random tests&amp;no logs - incomplete  +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_196/fi-snb-2520m/igt@gem_exec_whisper@normal.html
* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_191/fi-snb-2520m/igt@gem_exec_schedule@preempt-contexts-bsd1.html
* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_199/fi-snb-2520m/igt@gem_create@create-invalid-nonaligned.html
Comment 33 CI Bug Log 2019-02-12 09:49:42 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: random tests - incomplete - no proper logs -}
{+ fi-snb-2520m: random tests - incomplete - no proper logs +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_210/fi-snb-2520m/igt@kms_atomic_transition@plane-toggle-modeset-transition.html
Comment 34 CI Bug Log 2019-02-21 14:33:58 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: random tests - incomplete - no proper logs -}
{+ fi-snb-2520m: random tests - incomplete - no proper logs +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_223/fi-snb-2520m/igt@gem_linear_blits@normal.html
Comment 35 CI Bug Log 2019-02-25 13:19:48 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: random tests - incomplete - no proper logs -}
{+ fi-snb-2520m: random tests - incomplete - no proper logs +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_227/fi-snb-2520m/igt@syncobj_wait@multi-wait-all-for-submit-submitted-signaled.html
Comment 36 CI Bug Log 2019-03-04 09:08:21 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: igt@gem_random tests&amp;no logs - incomplete  -}
{+ fi-snb-2520m: igt@gem_random tests&amp;no logs - incomplete  +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_234/fi-snb-2520m/igt@gem_exec_reloc@basic-write-cpu-noreloc.html
Comment 37 CI Bug Log 2019-04-29 07:16:19 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds)
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6003/fi-snb-2600/igt@runner@aborted.html
Comment 38 CI Bug Log 2019-04-30 07:01:58 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds) -}
{+ SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds) +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6009/shard-snb4/igt@runner@aborted.html
  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6011/shard-snb7/igt@runner@aborted.html
Comment 39 CI Bug Log 2019-05-26 19:17:51 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: random tests - incomplete - no proper logs -}
{+ fi-snb-2520m: random tests - incomplete - no proper logs +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_293/fi-snb-2520m/igt@gem_busy@extended-parallel-rcs0.html
  * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_293/fi-snb-2520m/igt@gem_busy@extended-parallel-bcs0.html
Comment 40 CI Bug Log 2019-05-29 05:59:24 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds) -}
{+ SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds|mock_contexts|live_contexts) +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6155/shard-snb6/igt@runner@aborted.html
Comment 41 Martin Peres 2019-05-29 07:19:59 UTC
<3>[ 1519.992252] __intel_engines_record_defaults:4373 GEM_BUG_ON(intel_context_is_pinned(ce))
<4>[ 1519.992311] ------------[ cut here ]------------
<2>[ 1519.992315] kernel BUG at drivers/gpu/drm/i915/i915_gem.c:4373!
<4>[ 1519.992347] invalid opcode: 0000 [#1] PREEMPT SMP PTI
<4>[ 1519.992357] CPU: 5 PID: 8478 Comm: i915_selftest Tainted: G     U            5.2.0-rc2-CI-CI_DRM_6155+ #1
<4>[ 1519.992380] Hardware name: Dell Inc. XPS 8300  /0Y2MRG, BIOS A06 10/17/2011
<4>[ 1519.992462] RIP: 0010:i915_gem_init+0xa80/0xa90 [i915]
<4>[ 1519.992472] Code: df 97 f2 e0 48 8b 35 27 88 1d 00 49 c7 c0 c9 4a 34 a0 b9 15 11 00 00 48 c7 c2 80 1a 2f a0 48 c7 c7 9b f3 1f a0 e8 40 5d f9 e0 <0f> 0b 0f 1f 40 00 66 2e 0f 1f 84 00 00 00 00 00 41 54 55 53 8b 87
<4>[ 1519.992496] RSP: 0018:ffffc90000487a30 EFLAGS: 00010286
<4>[ 1519.992505] RAX: 0000000000000010 RBX: ffff88813b3e0000 RCX: 0000000000000000
<4>[ 1519.992515] RDX: 0000000000000000 RSI: 0000000000000058 RDI: 0000000000000000
<4>[ 1519.992536] RBP: ffff88813b3e0068 R08: ffffffffa0344ac9 R09: 0000000000000000
<4>[ 1519.992546] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88813b3e0d38
<4>[ 1519.992556] R13: ffff888212066a48 R14: ffff88820c312158 R15: 0000000000000000
<4>[ 1519.992567] FS:  00007fc6433cbe40(0000) GS:ffff888227a80000(0000) knlGS:0000000000000000
<4>[ 1519.992578] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 1519.992587] CR2: 0000561b4c3acd00 CR3: 0000000215a74001 CR4: 00000000000606e0
<4>[ 1519.992597] Call Trace:
<4>[ 1519.992654]  i915_driver_load+0xdb8/0x18a0 [i915]
<4>[ 1519.992666]  ? lock_acquire+0xa6/0x1c0
<4>[ 1519.992675]  ? __pm_runtime_resume+0x4f/0x80
<4>[ 1519.992685]  ? _raw_spin_unlock_irqrestore+0x4c/0x60
<4>[ 1519.992694]  ? _raw_spin_unlock_irqrestore+0x4c/0x60
<4>[ 1519.992702]  ? lockdep_hardirqs_on+0xe3/0x1b0
<4>[ 1519.992781]  i915_pci_probe+0x29/0xa0 [i915]
<4>[ 1519.992791]  pci_device_probe+0x9e/0x120
<4>[ 1519.992800]  really_probe+0xea/0x3c0
<4>[ 1519.992808]  driver_probe_device+0x10b/0x120
<4>[ 1519.992816]  device_driver_attach+0x4a/0x50
<4>[ 1519.992825]  __driver_attach+0x97/0x130
<4>[ 1519.992832]  ? device_driver_attach+0x50/0x50
<4>[ 1519.992840]  bus_for_each_dev+0x74/0xc0
<4>[ 1519.992849]  bus_add_driver+0x13f/0x210
<4>[ 1519.992856]  ? 0xffffffffa0472000
<4>[ 1519.992863]  driver_register+0x56/0xe0
<4>[ 1519.992870]  ? 0xffffffffa0472000
<4>[ 1519.992877]  do_one_initcall+0x58/0x300
<4>[ 1519.992886]  ? do_init_module+0x1d/0x1f6
<4>[ 1519.992895]  ? rcu_read_lock_sched_held+0x6f/0x80
<4>[ 1519.992903]  ? kmem_cache_alloc_trace+0x261/0x290
<4>[ 1519.992923]  do_init_module+0x56/0x1f6
<4>[ 1519.992931]  load_module+0x24d1/0x2990
<4>[ 1519.992946]  ? __se_sys_finit_module+0xd3/0xf0
<4>[ 1519.992954]  __se_sys_finit_module+0xd3/0xf0
<4>[ 1519.992967]  do_syscall_64+0x55/0x1c0
<4>[ 1519.992975]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4>[ 1519.992984] RIP: 0033:0x7fc642c8b839
<4>[ 1519.992991] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4>[ 1519.993025] RSP: 002b:00007ffd7ab70848 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4>[ 1519.993037] RAX: ffffffffffffffda RBX: 00005562f2bef320 RCX: 00007fc642c8b839
<4>[ 1519.993047] RDX: 0000000000000000 RSI: 00005562f2be80e0 RDI: 0000000000000006
<4>[ 1519.993057] RBP: 00005562f2be80e0 R08: 0000000000000004 R09: 00005562f132bc1b
<4>[ 1519.993068] R10: 00007ffd7ab70a90 R11: 0000000000000246 R12: 0000000000000000
<4>[ 1519.993077] R13: 00005562f2be5bf0 R14: 0000000000000020 R15: 0000000000000047
Comment 42 CI Bug Log 2019-07-25 12:39:57 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: igt@gem_random tests&amp;no logs - incomplete  -}
{+ fi-snb-2520m: igt@gem_random tests&amp;no logs - incomplete  +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6549/fi-snb-2520m/igt@i915_module_load@reload-with-fault-injection.html
Comment 43 CI Bug Log 2019-10-11 07:48:24 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: random tests - incomplete - no proper logs -}
{+ fi-snb-2520m: random tests - incomplete - no proper logs +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_383/fi-snb-2520m/igt@gem_eio@kms.html
Comment 44 Martin Peres 2019-11-29 17:41:42 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/intel/issues/82.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.