Bug 99089

Summary: Random screen lockups
Product: DRI Reporter: Witold Baryluk <witold.baryluk+freedesktop>
Component: DRM/RadeonAssignee: Default DRI bug account <dri-devel>
Status: RESOLVED WORKSFORME QA Contact:
Severity: normal    
Priority: medium    
Version: unspecified   
Hardware: x86-64 (AMD64)   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:
Attachments:
Description Flags
Xorg.0.log none

Description Witold Baryluk 2016-12-15 05:28:10 UTC
Created attachment 128477 [details]
Xorg.0.log

Hi,

Not sure if this is a kernel bug, or Xorg or what, but filling here.

It started few weeks ago, and I am getting sporadic screen lookups. Machine is accessible remotely.

$ dmesg
[37571.611965] [drm:intel_pipe_update_end [i915]] *ERROR* Atomic update failure on pipe A (start=163606 end=163607) time 560 us, min 1192, max 1199, scanline start 1172, end 1213



[128477.262834] thinkpad_acpi: EC reports that Thermal Table has changed
[128477.267408] [drm] PCIE GART of 512M enabled (table at 0x0000000000142000).
[128477.267436] radeon 0000:01:00.0: WB enabled
[128477.267440] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000020000c00 and cpu addr 0xffffa0a6cf42dc00
[128477.267843] radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x00000000000521d0 and cpu addr 0xffffb6a2412121d0
[128477.299290] [drm] ring test on 0 succeeded in 1 usecs
[128477.473960] [drm] ring test on 5 succeeded in 1 usecs
[128477.473968] [drm] UVD initialized successfully.
[128477.473997] [drm] ib test on ring 0 succeeded in 0 usecs
[128478.140120] [drm] ib test on ring 5 succeeded
[128687.068243] INFO: task Xorg:1647 blocked for more than 120 seconds.
[128687.068256]       Tainted: P           OE   4.8.0-2-amd64 #1
[128687.068260] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[128687.068265] Xorg            D ffffa0a6dbc98180     0  1647   1638 0x00400002
[128687.068276]  ffffa0a6be9d6100 ffffa0a6d2256f00 ffffffffaeac3daa 00000000c0c7ddb8
[128687.068283]  ffffa0a6bcc4c000 ffffa0a6bcc4b9b8 ffffa0a6d0773a34 ffffa0a6be9d6100
[128687.068290]  00000000ffffffff ffffa0a6d0773a38 ffffffffaefeb6d1 ffffa0a6d0773a30
[128687.068297] Call Trace:
[128687.068312]  [<ffffffffaeac3daa>] ? mutex_optimistic_spin+0x17a/0x1c0
[128687.068320]  [<ffffffffaefeb6d1>] ? schedule+0x31/0x80
[128687.068325]  [<ffffffffaefeb96a>] ? schedule_preempt_disabled+0xa/0x10
[128687.068331]  [<ffffffffaefed3e4>] ? __mutex_lock_slowpath+0xb4/0x130
[128687.068336]  [<ffffffffaefed47b>] ? mutex_lock+0x1b/0x30
[128687.068368]  [<ffffffffc0a13189>] ? drm_kms_helper_poll_enable+0x19/0x30 [drm_kms_helper]
[128687.068435]  [<ffffffffc0c659c4>] ? radeon_resume_kms+0x1f4/0x3a0 [radeon]
[128687.068478]  [<ffffffffc0c63131>] ? radeon_pmops_runtime_resume+0x71/0xb0 [radeon]
[128687.068485]  [<ffffffffaed76678>] ? pci_pm_runtime_resume+0x78/0xa0
[128687.068492]  [<ffffffffaee6926e>] ? __rpm_callback+0x2e/0x70
[128687.068498]  [<ffffffffaee57340>] ? vga_switcheroo_set_dynamic_switch+0x80/0x80
[128687.068502]  [<ffffffffaee692cf>] ? rpm_callback+0x1f/0x70
[128687.068506]  [<ffffffffaee57340>] ? vga_switcheroo_set_dynamic_switch+0x80/0x80
[128687.068510]  [<ffffffffaee69a87>] ? rpm_resume+0x4a7/0x6b0
[128687.068528]  [<ffffffffc093bdc0>] ? ttm_bo_release_list+0xb0/0x160 [ttm]
[128687.068533]  [<ffffffffaee6a4e7>] ? __pm_runtime_resume+0x47/0x70
[128687.068587]  [<ffffffffc0c8c093>] ? radeon_crtc_set_config+0x33/0x110 [radeon]
[128687.068640]  [<ffffffffc08ef4d1>] ? drm_mode_set_config_internal+0x61/0x110 [drm]
[128687.068662]  [<ffffffffc0a1f747>] ? restore_fbdev_mode+0x1e7/0x260 [drm_kms_helper]
[128687.068707]  [<ffffffffc08ff693>] ? drm_modeset_lock_all_ctx+0x93/0xb0 [drm]
[128687.068729]  [<ffffffffc0a211ee>] ? drm_fb_helper_restore_fbdev_mode_unlocked+0x2e/0x70 [drm_kms_helper]
[128687.068784]  [<ffffffffc0c94636>] ? radeon_fbdev_restore_mode+0x16/0x40 [radeon]
[128687.068828]  [<ffffffffc0c6788e>] ? radeon_driver_lastclose_kms+0xe/0x20 [radeon]
[128687.068858]  [<ffffffffc08e5259>] ? drm_lastclose+0x29/0x120 [drm]
[128687.068889]  [<ffffffffc08e554a>] ? drm_release+0x1fa/0x310 [drm]
[128687.068896]  [<ffffffffaec043ad>] ? __fput+0xcd/0x1e0
[128687.068901]  [<ffffffffaea994f2>] ? task_work_run+0x72/0x90
[128687.068907]  [<ffffffffaea7fd57>] ? do_exit+0x2d7/0xb30
[128687.068912]  [<ffffffffaea8062a>] ? do_group_exit+0x3a/0xa0
[128687.068917]  [<ffffffffaea8b469>] ? get_signal+0x299/0x640
[128687.068923]  [<ffffffffaea2c456>] ? do_signal+0x36/0x700
[128687.068927]  [<ffffffffaea8a345>] ? do_send_sig_info+0x65/0x90
[128687.068934]  [<ffffffffaea891fd>] ? __set_task_blocked+0x3d/0x90
[128687.068941]  [<ffffffffaea03275>] ? exit_to_usermode_loop+0x85/0xc0
[128687.068945]  [<ffffffffaea03b05>] ? syscall_return_slowpath+0x45/0x50
[128687.068950]  [<ffffffffaefefafe>] ? system_call_fast_compare_end+0x94/0x96
[128687.069005] INFO: task bitcoin-msghand:27056 blocked for more than 120 seconds.
[128687.069010]       Tainted: P           OE   4.8.0-2-amd64 #1
[128687.069014] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[128687.069017] bitcoin-msghand D ffffa0a6a2f13000     0 27056      1 0x00000002
[128687.069024]  ffffa0a6a8815200 ffffa0a6a2dfa300 ffffa0a6958f9000 0000000000000246
[128687.069031]  ffffa0a5180cc000 ffffa0a5180cbc18 ffffffffc091e004 ffffa0a6a8815200
[128687.069037]  00000000ffffffff ffffffffc091e008 ffffffffaefeb6d1 ffffffffc091e000
[128687.069043] Call Trace:
[128687.069050]  [<ffffffffaefeb6d1>] ? schedule+0x31/0x80
[128687.069055]  [<ffffffffaefeb96a>] ? schedule_preempt_disabled+0xa/0x10
[128687.069060]  [<ffffffffaefed3e4>] ? __mutex_lock_slowpath+0xb4/0x130
[128687.069066]  [<ffffffffaefed47b>] ? mutex_lock+0x1b/0x30
[128687.069096]  [<ffffffffc08e5381>] ? drm_release+0x31/0x310 [drm]
[128687.069101]  [<ffffffffaec043ad>] ? __fput+0xcd/0x1e0
[128687.069106]  [<ffffffffaea994f2>] ? task_work_run+0x72/0x90
[128687.069110]  [<ffffffffaea7fd57>] ? do_exit+0x2d7/0xb30
[128687.069114]  [<ffffffffaea8062a>] ? do_group_exit+0x3a/0xa0
[128687.069118]  [<ffffffffaea8b469>] ? get_signal+0x299/0x640
[128687.069124]  [<ffffffffaea2c456>] ? do_signal+0x36/0x700
[128687.069130]  [<ffffffffaeecf8b0>] ? sock_sendmsg+0x30/0x40
[128687.069135]  [<ffffffffaeecfe6d>] ? SYSC_sendto+0x11d/0x150
[128687.069141]  [<ffffffffaea03275>] ? exit_to_usermode_loop+0x85/0xc0
[128687.069146]  [<ffffffffaea03b05>] ? syscall_return_slowpath+0x45/0x50
[128687.069151]  [<ffffffffaefefafe>] ? system_call_fast_compare_end+0x94/0x96
[128807.900232] INFO: task Xorg:1647 blocked for more than 120 seconds.
[128807.900243]       Tainted: P           OE   4.8.0-2-amd64 #1
[128807.900247] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[128807.900252] Xorg            D ffffa0a6dbc98180     0  1647   1638 0x00400002
[128807.900264]  ffffa0a6be9d6100 ffffa0a6d2256f00 ffffffffaeac3daa 00000000c0c7ddb8
[128807.900271]  ffffa0a6bcc4c000 ffffa0a6bcc4b9b8 ffffa0a6d0773a34 ffffa0a6be9d6100
[128807.900277]  00000000ffffffff ffffa0a6d0773a38 ffffffffaefeb6d1 ffffa0a6d0773a30
[128807.900283] Call Trace:
[128807.900298]  [<ffffffffaeac3daa>] ? mutex_optimistic_spin+0x17a/0x1c0
[128807.900306]  [<ffffffffaefeb6d1>] ? schedule+0x31/0x80
[128807.900311]  [<ffffffffaefeb96a>] ? schedule_preempt_disabled+0xa/0x10
[128807.900317]  [<ffffffffaefed3e4>] ? __mutex_lock_slowpath+0xb4/0x130
[128807.900322]  [<ffffffffaefed47b>] ? mutex_lock+0x1b/0x30
[128807.900351]  [<ffffffffc0a13189>] ? drm_kms_helper_poll_enable+0x19/0x30 [drm_kms_helper]
[128807.900415]  [<ffffffffc0c659c4>] ? radeon_resume_kms+0x1f4/0x3a0 [radeon]
[128807.900458]  [<ffffffffc0c63131>] ? radeon_pmops_runtime_resume+0x71/0xb0 [radeon]
[128807.900466]  [<ffffffffaed76678>] ? pci_pm_runtime_resume+0x78/0xa0
[128807.900472]  [<ffffffffaee6926e>] ? __rpm_callback+0x2e/0x70
[128807.900478]  [<ffffffffaee57340>] ? vga_switcheroo_set_dynamic_switch+0x80/0x80
[128807.900482]  [<ffffffffaee692cf>] ? rpm_callback+0x1f/0x70
[128807.900486]  [<ffffffffaee57340>] ? vga_switcheroo_set_dynamic_switch+0x80/0x80
[128807.900491]  [<ffffffffaee69a87>] ? rpm_resume+0x4a7/0x6b0
[128807.900508]  [<ffffffffc093bdc0>] ? ttm_bo_release_list+0xb0/0x160 [ttm]
[128807.900513]  [<ffffffffaee6a4e7>] ? __pm_runtime_resume+0x47/0x70
[128807.900567]  [<ffffffffc0c8c093>] ? radeon_crtc_set_config+0x33/0x110 [radeon]
[128807.900618]  [<ffffffffc08ef4d1>] ? drm_mode_set_config_internal+0x61/0x110 [drm]
[128807.900640]  [<ffffffffc0a1f747>] ? restore_fbdev_mode+0x1e7/0x260 [drm_kms_helper]
[128807.900684]  [<ffffffffc08ff693>] ? drm_modeset_lock_all_ctx+0x93/0xb0 [drm]
[128807.900707]  [<ffffffffc0a211ee>] ? drm_fb_helper_restore_fbdev_mode_unlocked+0x2e/0x70 [drm_kms_helper]
[128807.900762]  [<ffffffffc0c94636>] ? radeon_fbdev_restore_mode+0x16/0x40 [radeon]
[128807.900806]  [<ffffffffc0c6788e>] ? radeon_driver_lastclose_kms+0xe/0x20 [radeon]
[128807.900836]  [<ffffffffc08e5259>] ? drm_lastclose+0x29/0x120 [drm]
[128807.900867]  [<ffffffffc08e554a>] ? drm_release+0x1fa/0x310 [drm]
[128807.900874]  [<ffffffffaec043ad>] ? __fput+0xcd/0x1e0
[128807.900880]  [<ffffffffaea994f2>] ? task_work_run+0x72/0x90
[128807.900885]  [<ffffffffaea7fd57>] ? do_exit+0x2d7/0xb30
[128807.900890]  [<ffffffffaea8062a>] ? do_group_exit+0x3a/0xa0
[128807.900895]  [<ffffffffaea8b469>] ? get_signal+0x299/0x640
[128807.900902]  [<ffffffffaea2c456>] ? do_signal+0x36/0x700
[128807.900905]  [<ffffffffaea8a345>] ? do_send_sig_info+0x65/0x90
[128807.900912]  [<ffffffffaea891fd>] ? __set_task_blocked+0x3d/0x90
[128807.900919]  [<ffffffffaea03275>] ? exit_to_usermode_loop+0x85/0xc0
[128807.900923]  [<ffffffffaea03b05>] ? syscall_return_slowpath+0x45/0x50
[128807.900929]  [<ffffffffaefefafe>] ? system_call_fast_compare_end+0x94/0x96
[128807.900981] INFO: task bitcoin-msghand:27056 blocked for more than 120 seconds.
[128807.900986]       Tainted: P           OE   4.8.0-2-amd64 #1
[128807.900990] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[128807.900994] bitcoin-msghand D ffffa0a6a2f13000     0 27056      1 0x00000002
[128807.901000]  ffffa0a6a8815200 ffffa0a6a2dfa300 ffffa0a6958f9000 0000000000000246
[128807.901007]  ffffa0a5180cc000 ffffa0a5180cbc18 ffffffffc091e004 ffffa0a6a8815200
[128807.901014]  00000000ffffffff ffffffffc091e008 ffffffffaefeb6d1 ffffffffc091e000
[128807.901020] Call Trace:
[128807.901026]  [<ffffffffaefeb6d1>] ? schedule+0x31/0x80
[128807.901031]  [<ffffffffaefeb96a>] ? schedule_preempt_disabled+0xa/0x10
[128807.901037]  [<ffffffffaefed3e4>] ? __mutex_lock_slowpath+0xb4/0x130
[128807.901042]  [<ffffffffaefed47b>] ? mutex_lock+0x1b/0x30
[128807.901073]  [<ffffffffc08e5381>] ? drm_release+0x31/0x310 [drm]
[128807.901078]  [<ffffffffaec043ad>] ? __fput+0xcd/0x1e0
[128807.901082]  [<ffffffffaea994f2>] ? task_work_run+0x72/0x90
[128807.901087]  [<ffffffffaea7fd57>] ? do_exit+0x2d7/0xb30
[128807.901091]  [<ffffffffaea8062a>] ? do_group_exit+0x3a/0xa0
[128807.901095]  [<ffffffffaea8b469>] ? get_signal+0x299/0x640
[128807.901100]  [<ffffffffaea2c456>] ? do_signal+0x36/0x700
[128807.901107]  [<ffffffffaeecf8b0>] ? sock_sendmsg+0x30/0x40
[128807.901111]  [<ffffffffaeecfe6d>] ? SYSC_sendto+0x11d/0x150
[128807.901117]  [<ffffffffaea03275>] ? exit_to_usermode_loop+0x85/0xc0
[128807.901122]  [<ffffffffaea03b05>] ? syscall_return_slowpath+0x45/0x50
[128807.901126]  [<ffffffffaefefafe>] ? system_call_fast_compare_end+0x94/0x96

$ uname -a
Linux kickass 4.8.0-2-amd64 #1 SMP Debian 4.8.11-1 (2016-12-02) x86_64 GNU/Linux

ii  xorg                                  1:7.7+18                             amd64        X.Org X Window System
ii  xserver-xorg-core                     2:1.19.0-2                           amd64        Xorg X server - core server
ii  xserver-xorg-video-radeon             1:7.8.0-1+b1                         amd64        X.Org X server -- AMD/ATI Radeon display driver



Machine:

ThinkPad W500

Xorg.0.log attached
Comment 1 Michel Dänzer 2016-12-15 06:52:37 UTC
(In reply to Witold Baryluk from comment #0)
> 
> It started few weeks ago, and I am getting sporadic screen lookups.

Did the kernel get upgraded to a newer version around that time?


> $ dmesg
> [37571.611965] [drm:intel_pipe_update_end [i915]] *ERROR* Atomic update
> failure on pipe A (start=163606 end=163607) time 560 us, min 1192, max 1199,
> scanline start 1172, end 1213

Are there any other error messages in dmesg after this? Maybe attach the whole dmesg output.

It looks like maybe a DRM (driver) error path doesn't unlock dev->mode_config.mutex, so any code trying to lock that hangs.
Comment 2 Witold Baryluk 2016-12-23 18:10:28 UTC
The kernel update might be possibility.

I am now running 4.9.0 custom built, and the system / screen is not freezing, but instead sometimes Xorg crashes, usually when clicking something. I am able to reproduce it reliably on some webpages in chrome, when clicking specifying button (dmesg shows X11 segfault, but entire Xorg exits, and I need to relogin). I was able to also reproduce it in few other programs like VLC, but less reliably.

Probably a different issue.

It is also possibly that my Xorg server is now confused, and uses i915 instead of radeon (ThinkPad W500 has both), but dmesg and xorg.0.log claims I am using radeon, but glxinfo shows there is no hardware opengl acceleration (llvmpipe instead). Weird.

I will try to reproduce it again, maybe on 4.8 kernel again.
Comment 3 Witold Baryluk 2018-11-15 23:36:02 UTC
I am going to close this, as I did not have this lockups for some time now anymore. I most likely changed some settings in BIOS that make it stable. If I ever get more data on recent kernels and can reproduce it again, I will reopen.

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.