Bug 107918

Summary: [CI][DRMTIP] igt@*suspend* - fail - Failed assertion: igt_sysfs_set(power_dir, "state", suspend_state_name[state]), Freezing of tasks failed
Product: DRI
Component: DRM/Intel
Status: CLOSED NOTOURBUG
Severity: normal
Priority: medium
Version: XOrg git
Hardware: Other
OS: All
Whiteboard: ReadyForDev
Reporter: Martin Peres <martin.peres>
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
CC: intel-gfx-bugs
i915 platform: BSW/CHT, BYT, CFL, IVB
i915 features:

Description Martin Peres 2018-09-13 11:03:06 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_109/fi-ivb-3770/igt@gem_eio@suspend.html

(gem_eio:1413) igt_aux-CRITICAL: Test assertion failure function suspend_via_sysfs, file ../lib/igt_aux.c:794:
(gem_eio:1413) igt_aux-CRITICAL: Failed assertion: igt_sysfs_set(power_dir, "state", suspend_state_name[state])
(gem_eio:1413) igt_aux-CRITICAL: Last errno: 16, Device or resource busy
Subtest suspend failed.

<7>[  113.051567] [IGT] gem_eio: starting subtest suspend
<6>[  113.051991] PM: suspend entry (deep)
<6>[  113.051998] PM: Syncing filesystems ... done.
<6>[  113.072428] Freezing user space processes ... 
<3>[  133.075613] Freezing of tasks failed after 20.003 seconds (1 tasks refusing to freeze, wq_busy=0):
<6>[  133.075634] systemd-udevd   D12520   267    225 0x80000104
<4>[  133.075650] Call Trace:
<4>[  133.075666]  ? __schedule+0x364/0xb40
<4>[  133.075680]  schedule+0x2d/0x90
<4>[  133.075691]  async_synchronize_cookie_domain+0x9f/0x150
<4>[  133.075704]  ? wait_woken+0xa0/0xa0
<4>[  133.075716]  do_init_module+0xb7/0x1ea
<4>[  133.075727]  load_module+0x26ba/0x29a0
<4>[  133.075741]  ? vfs_read+0x122/0x140
<4>[  133.075756]  ? __se_sys_finit_module+0xd3/0xf0
<4>[  133.075766]  __se_sys_finit_module+0xd3/0xf0
<4>[  133.075782]  do_syscall_64+0x55/0x190
<4>[  133.075793]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4>[  133.075805] RIP: 0033:0x7fddaf544839
<4>[  133.075817] Code: Bad RIP value.
<4>[  133.075826] RSP: 002b:00007ffd07f8ab98 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4>[  133.075842] RAX: ffffffffffffffda RBX: 000055e994c9c5c0 RCX: 00007fddaf544839
<4>[  133.075855] RDX: 0000000000000000 RSI: 00007fddaf2230e5 RDI: 000000000000000f
<4>[  133.075868] RBP: 00007fddaf2230e5 R08: 0000000000000000 R09: 00007ffd07f8acb0
<4>[  133.075880] R10: 000000000000000f R11: 0000000000000246 R12: 0000000000000000
<4>[  133.075893] R13: 000055e994c8e110 R14: 0000000000020000 R15: 000055e994c9c5c0
<6>[  133.075917] OOM killer enabled.
<6>[  133.075925] Restarting tasks ... done.
<6>[  133.078119] video LNXVIDEO:00: Restoring backlight state
<6>[  133.078133] PM: suspend exit
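
When triaging many logs of this kind, the refusing task can be pulled out of a dmesg capture automatically. The following is a minimal sketch, assuming the raw "<level>[timestamp] message" format shown above and that the task state line directly follows the failure line, as it does in this log. The function name find_freeze_failures is hypothetical and not part of IGT or the CI tooling.

```python
import re

# Matches the freezer failure message as it appears in the log above.
FREEZE_FAIL = re.compile(
    r"Freezing of tasks failed after (?P<secs>[\d.]+) seconds "
    r"\((?P<count>\d+) tasks refusing to freeze, wq_busy=(?P<wq>\d+)\)"
)

# Strips the "<level>[ timestamp] " prefix from a raw kernel log line.
PREFIX = re.compile(r"^<\d+>\[\s*[\d.]+\]\s?")


def find_freeze_failures(dmesg_text):
    """Return one dict per 'Freezing of tasks failed' event in dmesg_text."""
    lines = dmesg_text.splitlines()
    results = []
    for i, line in enumerate(lines):
        m = FREEZE_FAIL.search(line)
        if m is None:
            continue
        # Assumption: the next line is the task state dump and its first
        # whitespace-separated field is the task name. Task names that
        # contain spaces would be truncated by this simple split.
        task = None
        if i + 1 < len(lines):
            fields = PREFIX.sub("", lines[i + 1]).split()
            if fields:
                task = fields[0]
        results.append({
            "seconds": float(m.group("secs")),
            "refusing": int(m.group("count")),
            "wq_busy": int(m.group("wq")),
            "first_task": task,
        })
    return results


SAMPLE = (
    "<6>[  113.072428] Freezing user space processes ... \n"
    "<3>[  133.075613] Freezing of tasks failed after 20.003 seconds "
    "(1 tasks refusing to freeze, wq_busy=0):\n"
    "<6>[  133.075634] systemd-udevd   D12520   267    225 0x80000104\n"
)

if __name__ == "__main__":
    for event in find_freeze_failures(SAMPLE):
        print(event)
```

Only the first task after the failure line is reported; a log with several refusing tasks would need the call traces in between to be skipped.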

Comment 1 Chris Wilson 2018-09-13 11:11:30 UTC
systemd-udevd is loading a module that gets stuck. Could be a broken driver (though we have no trace suggesting a hang there) or a deadlock in the async domains used for the modprobe and suspend...

Comment 2 Martin Peres 2018-10-24 11:54:45 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_132/fi-snb-2600/igt@gem_exec_suspend@basic-s3-devices.html

Starting subtest: basic-S3-devices
(gem_exec_suspend:1218) igt_aux-CRITICAL: Test assertion failure function suspend_via_sysfs, file ../lib/igt_aux.c:794:
(gem_exec_suspend:1218) igt_aux-CRITICAL: Failed assertion: igt_sysfs_set(power_dir, "state", suspend_state_name[state])
(gem_exec_suspend:1218) igt_aux-CRITICAL: Last errno: 16, Device or resource busy
Subtest basic-S3-devices failed.
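
"Last errno: 16" in the assertion output is EBUSY, which is what the write to the power "state" file reports here after the freezer gives up. The following is a minimal sketch of the same kind of write with the errno surfaced, assuming a Linux host. The helper names write_power_state and describe are hypothetical; this is not how igt_sysfs_set is implemented.

```python
import errno
import os


def write_power_state(state, path="/sys/power/state"):
    """Write a sleep state name to path; return 0 on success or the errno."""
    try:
        fd = os.open(path, os.O_WRONLY)
    except OSError as exc:
        return exc.errno
    try:
        os.write(fd, state.encode())
        return 0
    except OSError as exc:
        # A failed suspend attempt surfaces here, e.g. errno 16 (EBUSY).
        return exc.errno
    finally:
        os.close(fd)


def describe(err):
    """Render an errno value as 'NAME: message'."""
    return f"{errno.errorcode.get(err, 'UNKNOWN')}: {os.strerror(err)}"


if __name__ == "__main__":
    # Only decodes the errno from the report; it does not touch sysfs.
    print(describe(16))
```

Calling write_power_state("mem") as root against the default path really does attempt to suspend the machine, so point it at a scratch file when experimenting.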

Comment 3 Martin Peres 2018-10-24 11:56:09 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/drmtip_132/fi-whl-u/igt@kms_draw_crc@draw-method-rgb565-blt-ytiled.html

Starting subtest: draw-method-rgb565-blt-ytiled
(kms_draw_crc:1370) igt_debugfs-CRITICAL: Test assertion failure function igt_assert_crc_equal, file ../lib/igt_debugfs.c:392:
(kms_draw_crc:1370) igt_debugfs-CRITICAL: Failed assertion: !mismatch
Subtest draw-method-rgb565-blt-ytiled failed.

Comment 4 Imre Deak 2018-10-25 11:02:07 UTC
(In reply to Chris Wilson from comment #1)
> systemd-udevd is loading a module that gets stuck. Could be a broken driver
> (though we have no trace suggesting a hang there) or a deadlock in the async
> domains used for the modprobe and suspend...

The boot log already shows a NULL pointer dereference in the usb-storage thread:

<5>[    2.919662] scsi 4:0:0:0: Direct-Access     Multiple Card  Reader     1.00 PQ: 0 ANSI: 0
<1>[    2.920615] BUG: unable to handle kernel NULL pointer dereference at 0000000000000000
<6>[    2.920631] PGD 0 P4D 0 
<4>[    2.920639] Oops: 0000 [#1] PREEMPT SMP PTI
<4>[    2.920647] CPU: 6 PID: 193 Comm: usb-storage Not tainted 4.19.0-rc3-g5fd5d0c19c4a-drmtip_109+ #1
<4>[    2.920659] Hardware name: Hewlett-Packard HP Pro 3500 Series/2ABF, BIOS 8.11 10/24/2012
<4>[    2.920673] RIP: 0010:sbitmap_any_bit_set+0xb/0x40
<4>[    2.920681] Code: 45 0f 46 c8 83 c2 01 45 89 ca 4c 89 54 01 08 48 8b 4f 10 2b 74 01 08 39 57 08 77 d8 f3 c3 90 8b 57 08 85 d2 74 2a 48 8b 47 10 <48> 83 38 00 75 23 83 ea 01 48 83 c0 40 48 c1 e2 06 48 01 c2 eb 0b
<4>[    2.920702] RSP: 0018:ffffbdb44038fdc0 EFLAGS: 00010202
<4>[    2.920711] RAX: 0000000000000000 RBX: ffff9df08baecfb8 RCX: 051bc07c00000000
<4>[    2.920720] RDX: 0000000000000001 RSI: ffffffff96080db0 RDI: ffff9df08baed108
<4>[    2.920729] RBP: 0000000000000001 R08: 0000000078233d04 R09: 0000000000000002
<4>[    2.920739] R10: ffffbdb44038fd30 R11: ffffffff96246be0 R12: 0000000000000001
<4>[    2.920748] R13: ffff9df0d7535d70 R14: 0000000000000000 R15: ffff9df08b8f5470
<4>[    2.920758] FS:  0000000000000000(0000) GS:ffff9df0db980000(0000) knlGS:0000000000000000
<4>[    2.920768] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[    2.920776] CR2: 0000000000000000 CR3: 000000010d210001 CR4: 00000000001606e0
<4>[    2.920785] Call Trace:
<4>[    2.920793]  blk_mq_run_hw_queue+0xa4/0xf0
<4>[    2.920801]  blk_mq_run_hw_queues+0x35/0x50
<4>[    2.920810]  scsi_end_request+0x1d2/0x1e0
<4>[    2.920820]  ? fill_inquiry_response+0x10/0x10
<4>[    2.920827]  scsi_io_completion+0x7c/0x600
<4>[    2.920836]  ? fill_inquiry_response+0x10/0x10
<4>[    2.920843]  blk_mq_complete_request+0xaa/0x130
<4>[    2.920852]  usb_stor_control_thread+0x27f/0x2f0
<4>[    2.920861]  ? _raw_spin_unlock_irqrestore+0x39/0x60
<4>[    2.920871]  ? __kthread_parkme+0x57/0x80
<4>[    2.920878]  kthread+0x119/0x130
<4>[    2.920885]  ? kthread_park+0x80/0x80
<4>[    2.920893]  ret_from_fork+0x3a/0x50
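
The oops above was logged at about 2.9 s, long before the subtest started at about 113 s. A check for kernel faults that predate a test can be sketched as follows, assuming the raw "<level>[timestamp] message" log format used in this report. The function name find_early_oops and the marker list are illustrative assumptions, not part of the CI tooling.

```python
import re

# One raw kernel log line: "<level>[ timestamp] message".
LINE = re.compile(r"^<(?P<level>\d+)>\[\s*(?P<ts>\d+\.\d+)\]\s?(?P<msg>.*)$")

# Message prefixes treated as a kernel fault (an assumed, incomplete list).
OOPS_MARKERS = ("BUG:", "Oops:")


def find_early_oops(dmesg_text, before_ts):
    """Return (timestamp, message) for fault lines logged before before_ts."""
    hits = []
    for raw in dmesg_text.splitlines():
        m = LINE.match(raw)
        if m is None:
            continue
        ts = float(m.group("ts"))
        msg = m.group("msg")
        if ts < before_ts and msg.startswith(OOPS_MARKERS):
            hits.append((ts, msg))
    return hits


SAMPLE = (
    "<1>[    2.920615] BUG: unable to handle kernel NULL pointer "
    "dereference at 0000000000000000\n"
    "<6>[    2.920631] PGD 0 P4D 0 \n"
    "<4>[    2.920639] Oops: 0000 [#1] PREEMPT SMP PTI\n"
    "<7>[  113.051567] [IGT] gem_eio: starting subtest suspend\n"
)

if __name__ == "__main__":
    # 113.051567 is the subtest start timestamp from the description.
    for ts, msg in find_early_oops(SAMPLE, before_ts=113.051567):
        print(f"{ts:.6f}: {msg}")
```

A hit only says that the kernel was already in a bad state when the test began; it does not by itself prove that the fault caused the later failure.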

Comment 5 CI Bug Log 2019-01-24 11:23:14 UTC
A CI Bug Log filter associated to this bug has been updated:

{- SNB CFL: igt@gem_exec_suspend@basic-s3-devices - fail - Failed assertion: igt_sysfs_set(power_dir, "state", suspend_state_name[state]), Last errno: 16 -}
{+ SNB BYT BSW CFL: igt@gem_exec_suspend@basic-s[34]-devices - fail - Failed assertion: igt_sysfs_set(power_dir, "state", suspend_state_name[state]), Last errno:16 +}

New failures caught by the filter:

* https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5449_144/fi-bsw-n3050/igt@gem_exec_suspend@basic-s4-devices.html
* https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5449_144/fi-byt-j1900/igt@gem_exec_suspend@basic-s4-devices.html
* https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5449_144/fi-byt-n2820/igt@gem_exec_suspend@basic-s4-devices.html
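
The effect of widening the test name from "basic-s3-devices" to "basic-s[34]-devices" can be illustrated with a short sketch. It assumes the test-name part of the filter behaves like a regular expression matched against the whole test name; how CI Bug Log actually evaluates its filters is not described in this report, and the machine tags and failure text of the filter are not modelled.

```python
import re

# Test-name parts of the old and new filter, taken from the diff above.
OLD_FILTER = re.compile(r"igt@gem_exec_suspend@basic-s3-devices")
NEW_FILTER = re.compile(r"igt@gem_exec_suspend@basic-s[34]-devices")


def matches(pattern, test_name):
    """True if the whole test name matches the filter pattern."""
    return pattern.fullmatch(test_name) is not None


TESTS = [
    "igt@gem_exec_suspend@basic-s3-devices",
    "igt@gem_exec_suspend@basic-s4-devices",
    "igt@gem_eio@suspend",
]

if __name__ == "__main__":
    for name in TESTS:
        print(name, matches(OLD_FILTER, name), matches(NEW_FILTER, name))
```

Under that assumption, the s4 results listed above are caught only by the updated filter.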
