Bug 108857 - display becomes unresponsive and keyboard input fails
Summary: display becomes unresponsive and keyboard input fails
Status: NEW
Alias: None
Product: xorg
Classification: Unclassified
Component: Driver/nouveau
Version: unspecified
Hardware: x86-64 (AMD64) Linux (All)
Importance: medium major
Assignee: Nouveau Project
QA Contact: Xorg Project Team
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-11-25 12:20 UTC by tla2k20
Modified: 2018-12-01 07:04 UTC

See Also:
i915 platform:
i915 features:


Attachments
dmesg (71.79 KB, text/plain)
2018-11-25 12:20 UTC, tla2k20
lspci -vvv (38.30 KB, text/plain)
2018-11-25 12:21 UTC, tla2k20
dmesg from 4.19.4-300.fc29.x86_64 (75.24 KB, text/plain)
2018-11-28 16:37 UTC, tla2k20

Description tla2k20 2018-11-25 12:20:11 UTC
Created attachment 142607
dmesg

Fedora release 29 (Twenty Nine)
Linux s0.home 4.19.3-300.fc29.x86_64 #1 SMP Wed Nov 21 15:27:25 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

The system boots and works normally for a few minutes, then stops responding to mouse movement and keyboard input. Access over an SSH session still works.

The issue started about three kernel updates ago (4.19.*).
 
top shows the following when the problem is in progress:

top - 12:10:25 up  1:10,  2 users,  load average: 0.23, 0.25, 0.27
Tasks: 356 total,   2 running, 354 sleeping,   0 stopped,   0 zombie
%Cpu(s):  0.8 us,  6.6 sy,  0.0 ni, 91.6 id,  0.0 wa,  1.0 hi,  0.0 si,  0.0 st
MiB Mem :  15982.9 total,   9198.9 free,   4552.3 used,   2231.7 buff/cache
MiB Swap:   8008.0 total,   8008.0 free,      0.0 used.  11035.1 avail Mem

  PID USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND
 7144 root      20   0       0      0      0 R  53.8   0.0   0:02.26 kworker/u16:0+events_unbound
 2325 user1     20   0 3867708 195040 118048 S   2.0   1.2   0:28.43 gnome-shell

kworker/u16:2+events_unbound seems to eat CPU and dmesg reports:

nouveau 0000:01:00.0: DRM: base-0: timeout
Comment 1 tla2k20 2018-11-25 12:21:19 UTC
Created attachment 142608
lspci -vvv
Comment 2 Rhys Kidd 2018-11-26 21:36:37 UTC
Comparing dmesg, a similar timeout fault with the GP104 was experienced by this user (their dmesg is linked): https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1799180/comments/5
Comment 3 tla2k20 2018-11-28 16:35:58 UTC
Updated to the latest Fedora 29 kernel today and the problem is still present.

Linux s0.home 4.19.4-300.fc29.x86_64 #1 SMP Fri Nov 23 13:03:11 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux
Comment 4 tla2k20 2018-11-28 16:37:00 UTC
Created attachment 142650
dmesg from 4.19.4-300.fc29.x86_64
Comment 5 tla2k20 2018-11-29 16:43:20 UTC
Switched to the nvidia drivers today and have had no issues since, so it looks like the problem is probably nouveau related.

sudo dnf config-manager --add-repo=https://negativo17.org/repos/fedora-nvidia.repo
sudo dnf -y remove \*nvidia\*
sudo dnf -y install nvidia-driver nvidia-settings kernel-devel nvidia-driver-libs.i686
# wait for the kernel driver to finish building in the background (watch it in top), then...
sudo reboot
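
After the reboot it can be worth confirming which kernel driver is actually bound to the GPU. The following is a minimal sketch, assuming the card sits at PCI address 0000:01:00.0 as in the dmesg lines in this report and that the standard sysfs layout is mounted at /sys. `bound_driver` is a hypothetical helper name used only for illustration.

```python
import os


def bound_driver(pci_address, sysfs_root="/sys/bus/pci/devices"):
    """Return the name of the kernel driver bound to a PCI device, or None.

    sysfs exposes the bound driver as a symlink named "driver" inside the
    device directory; the last path component of its target is the driver name.
    """
    link = os.path.join(sysfs_root, pci_address, "driver")
    if not os.path.islink(link):
        # No driver is bound, or the device does not exist.
        return None
    return os.path.basename(os.readlink(link))
```

Calling `bound_driver("0000:01:00.0")` on the affected machine should report `nouveau` before the switch and `nvidia` after it, assuming the proprietary module loaded successfully.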
Comment 6 Victor Costan 2018-12-01 07:04:05 UTC
I have had the same problem since I upgraded to 4.19, and I've been using the 4.18 kernel to get my work done. I just tried the 4.19 and 4.20 vanilla kernels packaged for Fedora, and the problem is still there.

I had suspected the Spectre mitigations until I found this bug. I have now tried the proprietary nvidia driver, and it seems to have made the problem go away.

In case it helps, I have a GTX 1080 Founders Edition.

Relevant lines from dmesg:
[    1.745599] nouveau 0000:01:00.0: NVIDIA GP104 (134000a1)
[    1.852957] nouveau 0000:01:00.0: bios: version 86.04.17.00.01
[    1.853407] nouveau 0000:01:00.0: bios: M0203E type 08
[    1.853440] nouveau 0000:01:00.0: fb: 8192 MiB of unknown memory type
[    1.893737] [TTM] Zone  kernel: Available graphics memory: 16445892 kiB
[    1.893738] [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
[    1.893738] [TTM] Initializing pool allocator
[    1.893741] [TTM] Initializing DMA pool allocator
[    1.893750] nouveau 0000:01:00.0: DRM: VRAM: 8192 MiB
[    1.893751] nouveau 0000:01:00.0: DRM: GART: 536870912 MiB
[    1.893752] nouveau 0000:01:00.0: DRM: BIT table 'A' not found
[    1.893753] nouveau 0000:01:00.0: DRM: BIT table 'L' not found
[    1.893754] nouveau 0000:01:00.0: DRM: TMDS table version 2.0
[    1.893755] nouveau 0000:01:00.0: DRM: DCB version 4.1
[    1.893756] nouveau 0000:01:00.0: DRM: DCB outp 00: 01000f42 00020030
[    1.893757] nouveau 0000:01:00.0: DRM: DCB outp 01: 04811f96 04600020
[    1.893757] nouveau 0000:01:00.0: DRM: DCB outp 02: 04011f92 00020020
[    1.893758] nouveau 0000:01:00.0: DRM: DCB outp 03: 04822f86 04600010
[    1.893759] nouveau 0000:01:00.0: DRM: DCB outp 04: 04022f82 00020010
[    1.893760] nouveau 0000:01:00.0: DRM: DCB outp 06: 02033f62 00020010
[    1.893761] nouveau 0000:01:00.0: DRM: DCB outp 07: 02844f76 04600020
[    1.893762] nouveau 0000:01:00.0: DRM: DCB outp 08: 02044f72 00020020
[    1.893762] nouveau 0000:01:00.0: DRM: DCB conn 00: 00001031
[    1.893763] nouveau 0000:01:00.0: DRM: DCB conn 01: 02000146
[    1.893764] nouveau 0000:01:00.0: DRM: DCB conn 02: 01000246
[    1.893765] nouveau 0000:01:00.0: DRM: DCB conn 03: 00010361
[    1.893765] nouveau 0000:01:00.0: DRM: DCB conn 04: 00020446
[    1.998241] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
[    1.998243] [drm] Driver supports precise vblank timestamp query.
[    2.043058] nouveau 0000:01:00.0: DRM: MM: using COPY for buffer copies
[    2.374601] nouveau 0000:01:00.0: DRM: allocated 3840x2160 fb: 0x200000, bo 00000000ae0f5db6
[    2.374656] fbcon: nouveaufb (fb0) is primary device
[    2.374657] fbcon: Deferring console take-over
[    2.374658] nouveau 0000:01:00.0: fb0: nouveaufb frame buffer device
[    2.393355] [drm] Initialized nouveau 1.3.1 20120801 for 0000:01:00.0 on minor 0
[    2.470633] nouveau 0000:01:00.0: disp: 0x000064f7[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   40.934912] nouveau 0000:01:00.0: disp: chid 1 mthd 0000 data 00000000 00003000 00000000
[   40.934931] nouveau 0000:01:00.0: disp: chid 1 mthd 0004 data 08700f00 10003004 00000000
[   40.934948] nouveau 0000:01:00.0: disp: chid 1 mthd 0008 data 0000f004 10003008 00000000
[   40.934966] nouveau 0000:01:00.0: disp: chid 1 mthd 000c data 0000cf00 1000300c 00000000
[   40.934977] nouveau 0000:01:00.0: disp: chid 1 mthd 0010 data 20000000 10003010 00000000
[   40.934992] nouveau 0000:01:00.0: disp: chid 1 mthd 0014 data 00000000 10003014 00000000
[   40.935002] nouveau 0000:01:00.0: disp: chid 1 mthd 0018 data 00000000 10003018 00000000
[   40.935015] nouveau 0000:01:00.0: disp: chid 1 mthd 001c data 00000000 1000301c 00000000
[   40.935024] nouveau 0000:01:00.0: disp: chid 1 mthd 0020 data 00000000 10003020 00000000
[   40.935037] nouveau 0000:01:00.0: disp: chid 1 mthd 0000 data 00000400 10001000 00000002
[   42.935037] nouveau 0000:01:00.0: DRM: base-0: timeout
[   44.937059] nouveau 0000:01:00.0: DRM: base-0: timeout
[   46.940112] nouveau 0000:01:00.0: DRM: base-0: timeout
[   48.942221] nouveau 0000:01:00.0: DRM: base-0: timeout
[   73.444493] nouveau 0000:01:00.0: DRM: base-0: timeout
[   91.300747] nouveau 0000:01:00.0: DRM: base-0: timeout
[   93.306492] nouveau 0000:01:00.0: DRM: base-0: timeout
[   95.310609] nouveau 0000:01:00.0: DRM: base-0: timeout
[  111.646135] nouveau 0000:01:00.0: DRM: base-0: timeout
[  113.648187] nouveau 0000:01:00.0: DRM: base-0: timeout
[  115.649874] nouveau 0000:01:00.0: DRM: base-0: timeout
[  117.650629] nouveau 0000:01:00.0: DRM: base-0: timeout
[  120.902222] nouveau 0000:01:00.0: DRM: base-0: timeout
[  122.902900] nouveau 0000:01:00.0: DRM: base-0: timeout
[  124.903509] nouveau 0000:01:00.0: DRM: base-0: timeout
[  126.904112] nouveau 0000:01:00.0: DRM: base-0: timeout
[  154.470730] nouveau 0000:01:00.0: DRM: base-0: timeout
[  214.470998] nouveau 0000:01:00.0: DRM: base-0: timeout
[  274.471269] nouveau 0000:01:00.0: DRM: base-0: timeout
[  334.470469] nouveau 0000:01:00.0: DRM: base-0: timeout
[  394.470680] nouveau 0000:01:00.0: DRM: base-0: timeout
[  454.470846] nouveau 0000:01:00.0: DRM: base-0: timeout
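
In the log above, the early timeouts arrive roughly 2 seconds apart and the later ones roughly 60 seconds apart. To compare this pattern across the attached dmesg files, the timestamps can be extracted and the gaps computed. This is a minimal sketch, assuming the line format shown above; `timeout_timestamps` and `timeout_gaps` are hypothetical helper names, not part of any existing tool.

```python
import re

# Matches lines such as:
# [   42.935037] nouveau 0000:01:00.0: DRM: base-0: timeout
TIMEOUT_RE = re.compile(
    r"^\[\s*(\d+\.\d+)\]\s+nouveau\s+\S+\s+DRM: base-0: timeout\s*$"
)


def timeout_timestamps(dmesg_text):
    """Return the kernel timestamps (in seconds) of every base-0 timeout line."""
    stamps = []
    for line in dmesg_text.splitlines():
        match = TIMEOUT_RE.match(line.strip())
        if match:
            stamps.append(float(match.group(1)))
    return stamps


def timeout_gaps(stamps):
    """Return the time between consecutive timeouts, rounded to milliseconds."""
    return [round(later - earlier, 3) for earlier, later in zip(stamps, stamps[1:])]
```

Feeding the text of a saved dmesg file to `timeout_timestamps` and passing the result to `timeout_gaps` gives one gap per pair of consecutive timeouts, which makes it easier to see whether two reports show the same burst pattern.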

