Bug 105411 - [CI] [SNB only] igt@* - incomplete - timeout/system hang
Status: NEW
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: DRI git
Hardware: Other All
Importance: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks: 105984
Reported: 2018-03-09 06:33 UTC by Marta Löfstedt
Modified: 2019-05-29 07:19 UTC (History)

See Also:
i915 platform: SNB
i915 features: GPU hang


Description Marta Löfstedt 2018-03-09 06:33:18 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3901/shard-snb3/igt@kms_vblank@pipe-b-ts-continuation-suspend.html

run.log:
running: igt/kms_vblank/pipe-b-ts-continuation-suspend

[09/97] skip: 3, pass: 6 /                            
FATAL: command execution failed

Last dmesg:
<7>[   53.888393] [drm:verify_connector_state.isra.78 [i915]] [CONNECTOR:52:VGA-1]
<7>[   53.888474] [drm:intel_atomic_commit_tail [i915]] [CRTC:51:pipe B]
<7>[   53.888567] [drm:verify_single_dpll_state.isra.79 [i915]] PCH DPLL A
<6>[   54.001354] PM: suspend entry (deep)
Comment 1 Marta Löfstedt 2018-03-26 06:50:40 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_3975/shard-snb7/igt@drv_suspend@fence-restore-tiled2untiled.html

run.log:
running: igt/drv_suspend/fence-restore-tiled2untiled

[77/98] skip: 52, pass: 24, fail: 1 /               
FATAL: command execution failed
...
Completed CI_IGT_test CI_DRM_3975/shard-snb7/27 : FAILURE
CI_IGT_test runtime 510 seconds
Rebooting shard-snb7

Last dmesg:
<6>[  386.036825] Console: switching to colour dummy device 80x25
<7>[  386.036876] [IGT] drv_suspend: executing
<7>[  386.054557] [IGT] drv_suspend: starting subtest fence-restore-tiled2untiled
Comment 3 Marta Löfstedt 2018-04-04 05:59:00 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_4016/shard-snb1/igt@syncobj_wait@multi-wait-for-submit-unsubmitted-signaled.html

From run.log:
running: igt/gem_exec_parallel/blt-fds

[81/99] skip: 54, pass: 27 /          
FATAL: command execution failed
...
Completed CI_IGT_test CI_DRM_4016/shard-snb1/14 : FAILURE
CI_IGT_test runtime 104 seconds
Rebooting shard-snb1

However, last test in dmesg is:
<7>[   82.018763] [IGT] i915_query: starting subtest query-topology-kernel-writes
<7>[   82.018867] [IGT] i915_query: exiting, ret=77
<6>[   82.048429] Console: switching to colour frame buffer device 128x48
<6>[   82.158117] Console: switching to colour dummy device 80x25
<7>[   82.158171] [IGT] perf_pmu: executing

which is three tests earlier. When this happens, we suspect that something went wrong while writing dmesg out to disk, so the log is incomplete. shard-snb1 should therefore be watched with extra care for a while.
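
The comparison described above (last test in run.log versus last test seen in dmesg) can be sketched in Python. This is only an illustrative sketch, not part of the CI tooling: the function names are hypothetical, and it assumes the log formats match the excerpts quoted in this bug ("running: igt/<binary>/<subtest>" in run.log and "[IGT] <binary>: executing" in dmesg).

```python
import re

# Matches kernel log lines such as "<7>[   82.158171] [IGT] perf_pmu: executing".
IGT_EXECUTING = re.compile(r"\[IGT\]\s+(\S+): executing")
# Matches run.log lines such as "running: igt/gem_exec_parallel/blt-fds".
RUNNING = re.compile(r"^running: igt/([^/\s]+)")


def last_test_in_dmesg(dmesg_text):
    """Return the binary named by the last '[IGT] <name>: executing' line, or None."""
    matches = IGT_EXECUTING.findall(dmesg_text)
    return matches[-1] if matches else None


def last_test_in_runlog(runlog_text):
    """Return the binary named by the last 'running: igt/<name>/...' line, or None."""
    last = None
    for line in runlog_text.splitlines():
        m = RUNNING.match(line.strip())
        if m:
            last = m.group(1)
    return last


def dmesg_looks_truncated(runlog_text, dmesg_text):
    """True when run.log reports a test that dmesg never shows as its last one."""
    run = last_test_in_runlog(runlog_text)
    seen = last_test_in_dmesg(dmesg_text)
    return run is not None and run != seen


if __name__ == "__main__":
    runlog = "running: igt/gem_exec_parallel/blt-fds\n"
    dmesg = (
        "<7>[   82.018867] [IGT] i915_query: exiting, ret=77\n"
        "<7>[   82.158171] [IGT] perf_pmu: executing\n"
    )
    print(dmesg_looks_truncated(runlog, dmesg))
```

With the excerpts from this comment, run.log names gem_exec_parallel while dmesg ends at perf_pmu, so the check flags the log as possibly truncated. It only compares binary names, so it cannot tell how many tests are missing.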
Comment 4 Marta Löfstedt 2018-04-09 07:40:20 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/IGT_4413/shard-snb3/igt@gem_exec_parallel@basic.html

run.log:
running: igt/gem_exec_parallel/basic

[50/75] skip: 27, pass: 23 -        
FATAL: command execution failed
...
Completed CI_IGT_test CI_DRM_4029/shard-snb3/33 : FAILURE
CI_IGT_test runtime 151 seconds
Rebooting shard-snb3

dmesg:
<7>[   85.180778] [drm:__intel_fbc_disable [i915]] Disabling FBC on pipe A
<6>[   85.310318] Console: switching to colour dummy device 80x25
<7>[   85.310373] [IGT] gem_exec_parallel: executing
Comment 22 Martin Peres 2018-11-05 13:00:46 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_137/fi-snb-2600/igt@gem_exec_reloc@basic-wc-cpu-noreloc.html

<6> [111.733817] [IGT] gem_exec_reloc: executing
<3> [111.788154] [drm:fw_domains_get [i915]] *ERROR* render: timed out waiting for forcewake ack request.
Comment 24 Francesco Balestrieri 2018-11-23 11:18:54 UTC
Setting to medium since it's a meta-bug of sorts.
Comment 31 CI Bug Log 2019-01-29 09:16:03 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: igt@gem_random tests&no logs - incomplete  -}
{+ fi-snb-2520m: igt@gem_random tests&no logs - incomplete  +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_196/fi-snb-2520m/igt@gem_exec_whisper@normal.html
* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_191/fi-snb-2520m/igt@gem_exec_schedule@preempt-contexts-bsd1.html
* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_199/fi-snb-2520m/igt@gem_create@create-invalid-nonaligned.html
Comment 33 CI Bug Log 2019-02-12 09:49:42 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: random tests - incomplete - no proper logs -}
{+ fi-snb-2520m: random tests - incomplete - no proper logs +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_210/fi-snb-2520m/igt@kms_atomic_transition@plane-toggle-modeset-transition.html
Comment 34 CI Bug Log 2019-02-21 14:33:58 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: random tests - incomplete - no proper logs -}
{+ fi-snb-2520m: random tests - incomplete - no proper logs +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_223/fi-snb-2520m/igt@gem_linear_blits@normal.html
Comment 35 CI Bug Log 2019-02-25 13:19:48 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: random tests - incomplete - no proper logs -}
{+ fi-snb-2520m: random tests - incomplete - no proper logs +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_227/fi-snb-2520m/igt@syncobj_wait@multi-wait-all-for-submit-submitted-signaled.html
Comment 36 CI Bug Log 2019-03-04 09:08:21 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: igt@gem_random tests&no logs - incomplete  -}
{+ fi-snb-2520m: igt@gem_random tests&no logs - incomplete  +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_234/fi-snb-2520m/igt@gem_exec_reloc@basic-write-cpu-noreloc.html
Comment 37 CI Bug Log 2019-04-29 07:16:19 UTC
The CI Bug Log issue associated to this bug has been updated.

### New filters associated

* SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds)
  - https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6003/fi-snb-2600/igt@runner@aborted.html
Comment 38 CI Bug Log 2019-04-30 07:01:58 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds) -}
{+ SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds) +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6009/shard-snb4/igt@runner@aborted.html
  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6011/shard-snb7/igt@runner@aborted.html
Comment 39 CI Bug Log 2019-05-26 19:17:51 UTC
A CI Bug Log filter associated to this bug has been updated:

{- fi-snb-2520m: random tests - incomplete - no proper logs -}
{+ fi-snb-2520m: random tests - incomplete - no proper logs +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_293/fi-snb-2520m/igt@gem_busy@extended-parallel-rcs0.html
  * https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_293/fi-snb-2520m/igt@gem_busy@extended-parallel-bcs0.html
Comment 40 CI Bug Log 2019-05-29 05:59:24 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds) -}
{+ SNB: igt@runner@aborted - fail - Previous test: i915_selftest (live_workarounds|mock_contexts|live_contexts) +}

New failures caught by the filter:

  * https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_6155/shard-snb6/igt@runner@aborted.html
Comment 41 Martin Peres 2019-05-29 07:19:59 UTC
<3>[ 1519.992252] __intel_engines_record_defaults:4373 GEM_BUG_ON(intel_context_is_pinned(ce))
<4>[ 1519.992311] ------------[ cut here ]------------
<2>[ 1519.992315] kernel BUG at drivers/gpu/drm/i915/i915_gem.c:4373!
<4>[ 1519.992347] invalid opcode: 0000 [#1] PREEMPT SMP PTI
<4>[ 1519.992357] CPU: 5 PID: 8478 Comm: i915_selftest Tainted: G     U            5.2.0-rc2-CI-CI_DRM_6155+ #1
<4>[ 1519.992380] Hardware name: Dell Inc. XPS 8300  /0Y2MRG, BIOS A06 10/17/2011
<4>[ 1519.992462] RIP: 0010:i915_gem_init+0xa80/0xa90 [i915]
<4>[ 1519.992472] Code: df 97 f2 e0 48 8b 35 27 88 1d 00 49 c7 c0 c9 4a 34 a0 b9 15 11 00 00 48 c7 c2 80 1a 2f a0 48 c7 c7 9b f3 1f a0 e8 40 5d f9 e0 <0f> 0b 0f 1f 40 00 66 2e 0f 1f 84 00 00 00 00 00 41 54 55 53 8b 87
<4>[ 1519.992496] RSP: 0018:ffffc90000487a30 EFLAGS: 00010286
<4>[ 1519.992505] RAX: 0000000000000010 RBX: ffff88813b3e0000 RCX: 0000000000000000
<4>[ 1519.992515] RDX: 0000000000000000 RSI: 0000000000000058 RDI: 0000000000000000
<4>[ 1519.992536] RBP: ffff88813b3e0068 R08: ffffffffa0344ac9 R09: 0000000000000000
<4>[ 1519.992546] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88813b3e0d38
<4>[ 1519.992556] R13: ffff888212066a48 R14: ffff88820c312158 R15: 0000000000000000
<4>[ 1519.992567] FS:  00007fc6433cbe40(0000) GS:ffff888227a80000(0000) knlGS:0000000000000000
<4>[ 1519.992578] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 1519.992587] CR2: 0000561b4c3acd00 CR3: 0000000215a74001 CR4: 00000000000606e0
<4>[ 1519.992597] Call Trace:
<4>[ 1519.992654]  i915_driver_load+0xdb8/0x18a0 [i915]
<4>[ 1519.992666]  ? lock_acquire+0xa6/0x1c0
<4>[ 1519.992675]  ? __pm_runtime_resume+0x4f/0x80
<4>[ 1519.992685]  ? _raw_spin_unlock_irqrestore+0x4c/0x60
<4>[ 1519.992694]  ? _raw_spin_unlock_irqrestore+0x4c/0x60
<4>[ 1519.992702]  ? lockdep_hardirqs_on+0xe3/0x1b0
<4>[ 1519.992781]  i915_pci_probe+0x29/0xa0 [i915]
<4>[ 1519.992791]  pci_device_probe+0x9e/0x120
<4>[ 1519.992800]  really_probe+0xea/0x3c0
<4>[ 1519.992808]  driver_probe_device+0x10b/0x120
<4>[ 1519.992816]  device_driver_attach+0x4a/0x50
<4>[ 1519.992825]  __driver_attach+0x97/0x130
<4>[ 1519.992832]  ? device_driver_attach+0x50/0x50
<4>[ 1519.992840]  bus_for_each_dev+0x74/0xc0
<4>[ 1519.992849]  bus_add_driver+0x13f/0x210
<4>[ 1519.992856]  ? 0xffffffffa0472000
<4>[ 1519.992863]  driver_register+0x56/0xe0
<4>[ 1519.992870]  ? 0xffffffffa0472000
<4>[ 1519.992877]  do_one_initcall+0x58/0x300
<4>[ 1519.992886]  ? do_init_module+0x1d/0x1f6
<4>[ 1519.992895]  ? rcu_read_lock_sched_held+0x6f/0x80
<4>[ 1519.992903]  ? kmem_cache_alloc_trace+0x261/0x290
<4>[ 1519.992923]  do_init_module+0x56/0x1f6
<4>[ 1519.992931]  load_module+0x24d1/0x2990
<4>[ 1519.992946]  ? __se_sys_finit_module+0xd3/0xf0
<4>[ 1519.992954]  __se_sys_finit_module+0xd3/0xf0
<4>[ 1519.992967]  do_syscall_64+0x55/0x1c0
<4>[ 1519.992975]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4>[ 1519.992984] RIP: 0033:0x7fc642c8b839
<4>[ 1519.992991] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4>[ 1519.993025] RSP: 002b:00007ffd7ab70848 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4>[ 1519.993037] RAX: ffffffffffffffda RBX: 00005562f2bef320 RCX: 00007fc642c8b839
<4>[ 1519.993047] RDX: 0000000000000000 RSI: 00005562f2be80e0 RDI: 0000000000000006
<4>[ 1519.993057] RBP: 00005562f2be80e0 R08: 0000000000000004 R09: 00005562f132bc1b
<4>[ 1519.993068] R10: 00007ffd7ab70a90 R11: 0000000000000246 R12: 0000000000000000
<4>[ 1519.993077] R13: 00005562f2be5bf0 R14: 0000000000000020 R15: 0000000000000047

