Bug 99202 - system freeze - fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT] - MULTIPLE_WARP_ERRORS - DATA_ERROR 00000004 [INVALID_VALUE] - ILLEGAL_CLASS ch 11 - RT_STORAGE_TYPE_MISMATCH - CLASS_SUBCH_MISMATCH
Summary: system freeze - fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT] - MULTIPLE_WARP_ERRORS -...
Status: NEW
Alias: None
Product: xorg
Classification: Unclassified
Component: Driver/nouveau (show other bugs)
Version: unspecified
Hardware: Other All
: medium normal
Assignee: Nouveau Project
QA Contact: Xorg Project Team
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-12-26 16:53 UTC by as.newsletter
Modified: 2019-11-10 00:09 UTC (History)
1 user (show)

See Also:
i915 platform:
i915 features:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description as.newsletter 2016-12-26 16:53:38 UTC
This might be related to https://bugs.freedesktop.org/show_bug.cgi?id=93629 but a few things are different.

The screen freezes except for that the mouse pointer can be used (click has no effect). Apart from the magic syskeys (sync, unmount, boot) the keyboard doesn't react.
I can not use ALT-tab to switch to another program. I can not leave the game (fullscreen mode) by using the ingame console or using ALT-F4. The mouse pointer is trapped on the monitor where the game runs (I have two). The sound keeps playing and sound effects are played now and then too.
I tried to get to the terminals via CTRL-ALT-F[1-6] but nothing happens. Switching (KDE) desktops via CTRL-ALT-[arrow-left,arrow-right] is just the same.

So far the problem happed when I used app-emulation/wine to play a game (namely MassEffect 2, I haven't tried others) and once when I used firefoy (maybe when using youtube). It happens unexpectedly, at least I didn't find a pattern.
I downgraded to xf86-video-nouveau-1.0.12 - seems like it happens less often. Once when using firefox (probably watching a video).

Dec 15 15:12:38 [kernel] [ 6197.881567] nouveau 0000:03:00.0: gr: GPC3/PROP trap: 00000004 [] x = 1968, y = 992, format = 0, storage type = 0
[...]
Dec 18 03:20:57 [kernel] [97585.789880] nouveau 0000:03:00.0: fifo: read fault at 0000441000 engine 00 [GR] client 10 [PD] reason 02 [PTE] on channel 11 [017e507000 MassEffect2.exe[18715]]
Dec 18 03:20:57 [kernel] [97585.789883] nouveau 0000:03:00.0: fifo: gr engine fault on channel 11, recovering...
Dec 18 03:21:46 [kernel] [97635.048675] sysrq: SysRq : Emergency Sync
Dec 18 03:21:46 [kernel] [97635.049024] Emergency Sync complete
[...]
Dec 18 17:57:52 [kernel] [ 7553.852044] nouveau 0000:03:00.0: gr: TRAP ch 13 [017e107000 MassEffect2.exe[10678]]
Dec 18 17:57:52 [kernel] [ 7553.852056] nouveau 0000:03:00.0: gr: GPC0/TPC0/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3d000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852067] nouveau 0000:03:00.0: gr: GPC1/TPC0/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3c000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852078] nouveau 0000:03:00.0: gr: GPC2/TPC0/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3f000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852086] nouveau 0000:03:00.0: gr: GPC2/TPC2/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3e000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852097] nouveau 0000:03:00.0: gr: GPC3/TPC0/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3c000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852105] nouveau 0000:03:00.0: gr: GPC3/TPC1/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3c000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852114] nouveau 0000:03:00.0: gr: GPC3/TPC2/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3c000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852125] nouveau 0000:03:00.0: gr: GPC4/TPC0/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3c000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852133] nouveau 0000:03:00.0: gr: GPC4/TPC1/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3d000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852142] nouveau 0000:03:00.0: gr: GPC4/TPC2/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 3c000d [OOR_REG]
Dec 18 17:57:52 [kernel] [ 7553.852158] nouveau 0000:03:00.0: gr: TRAP ch 13 [017e107000 MassEffect2.exe[10678]]
Dec 18 17:57:52 [kernel] [ 7553.852168] nouveau 0000:03:00.0: gr: GPC0/TPC0/MP trap: global 00000004 [MULTIPLE_WARP_ERRORS] warp 1000d [OOR_REG]
Dec 18 17:57:57 [kernel] [ 7558.147983] nouveau 0000:03:00.0: fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]
Dec 18 17:57:57 [kernel] [ 7558.147987] nouveau 0000:03:00.0: fifo: gr engine fault on channel 2, recovering...
Dec 18 17:59:01 [CROND] (root) CMD (rm -f /var/spool/cron/lastrun/cron.hourly)
[...]
Dec 18 20:43:22 [kernel] [ 6788.582706] nouveau 0000:03:00.0: gr: TRAP ch 12 [017e348000 MassEffect2.exe[19526]]
Dec 18 20:43:22 [kernel] [ 6788.582719] nouveau 0000:03:00.0: gr: GPC3/PROP trap: 00000100 [RT_STORAGE_TYPE_MISMATCH] x = 48, y = 4, format = 2a, storage type = e6
Dec 18 20:43:26 [kernel] [ 6792.878268] nouveau 0000:03:00.0: fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]
Dec 18 20:43:26 [kernel] [ 6792.878271] nouveau 0000:03:00.0: fifo: gr engine fault on channel 2, recovering.. 
[...]
Dec 21 21:19:08 [kernel] [54341.955480] nouveau 0000:03:00.0: fifo: PBDMA0: 00040000 [PBENTRY] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0000 data 00000000
Dec 21 21:19:08 [kernel] [54341.955500] nouveau 0000:03:00.0: gr: DATA_ERROR 00000004 [INVALID_VALUE] ch 11 [017efba000 firefox[17446]] subc 0 class a197 mthd 2380 data 00000040
Dec 21 21:19:08 [kernel] [54342.174931] nouveau 0000:03:00.0: gr: TRAP ch 11 [017efba000 firefox[17446]]
Dec 21 21:19:08 [kernel] [54342.174938] nouveau 0000:03:00.0: gr: DISPATCH 80000002 [CLASS_SUBCH_MISMATCH]
Dec 21 21:19:08 [kernel] [54342.174953] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 000c data 00000000
Dec 21 21:19:08 [kernel] [54342.175025] nouveau 0000:03:00.0: fifo: PBDMA0: 02000000 [SEMAPHORE] ch 11 [017efba000 firefox[17446]] subc 0 mthd 001c data 00000000
Dec 21 21:19:08 [kernel] [54342.175084] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0030 data 43000000
Dec 21 21:19:08 [kernel] [54342.175102] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0034 data 00000000
Dec 21 21:19:08 [kernel] [54342.175121] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0038 data 3f800000
Dec 21 21:19:08 [kernel] [54342.175139] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 003c data 00000000
Dec 21 21:19:08 [kernel] [54342.175157] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0040 data 00000000
Dec 21 21:19:08 [kernel] [54342.175175] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0044 data 00000000
Dec 21 21:19:08 [kernel] [54342.175193] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0048 data 00000000
Dec 21 21:19:08 [kernel] [54342.175211] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 004c data 3f800000
Dec 21 21:19:08 [kernel] [54342.175296] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0054 data 00000000
Dec 21 21:19:08 [kernel] [54342.175323] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0058 data 00000000
Dec 21 21:19:08 [kernel] [54342.175350] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 005c data 00000000
Dec 21 21:19:08 [kernel] [54342.175376] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0060 data 3f800000
Dec 21 21:19:08 [kernel] [54342.175403] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0064 data 00000000
Dec 21 21:19:08 [kernel] [54342.175429] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0068 data 00000000
Dec 21 21:19:08 [kernel] [54342.175452] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 006c data 00000000
Dec 21 21:19:08 [kernel] [54342.175475] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0070 data 00000000
Dec 21 21:19:08 [kernel] [54342.175517] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0074 data 3f800000
Dec 21 21:19:08 [kernel] [54342.175541] nouveau 0000:03:00.0: fifo: PBDMA0: 00400000 [METHODCRC] ch 11 [017efba000 firefox[17446]] subc 0 mthd 007c data 00000000
Dec 21 21:19:08 [kernel] [54342.175563] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 0084 data 00000000
[...]
Dec 21 21:19:08 [kernel] [54342.176189] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 00f8 data 00000800
Dec 21 21:19:08 [kernel] [54342.176211] nouveau 0000:03:00.0: fifo: PBDMA0: 00200000 [METHOD] ch 11 [017efba000 firefox[17446]] subc 0 mthd 00fc data 00000000
Dec 21 21:19:08 [kernel] [54342.176229] nouveau 0000:03:00.0: gr: ILLEGAL_CLASS ch 11 [017efba000 firefox[17446]] subc 0 class 0000 mthd 0100 data 004e2000
Dec 21 21:19:08 [kernel] [54342.176244] nouveau 0000:03:00.0: gr: ILLEGAL_CLASS ch 11 [017efba000 firefox[17446]] subc 0 class 0000 mthd 0104 data 200208e3
Dec 21 21:19:08 [kernel] [54342.176258] nouveau 0000:03:00.0: gr: ILLEGAL_CLASS ch 11 [017efba000 firefox[17446]] subc 0 class 0000 mthd 0108 data 00000020
Dec 21 21:19:08 [kernel] [54342.176270] nouveau 0000:03:00.0: gr: ILLEGAL_CLASS ch 11 [017efba000 firefox[17446]] subc 0 class 0000 mthd 010c data 0000000b
Dec 21 21:19:08 [kernel] [54342.176285] nouveau 0000:03:00.0: gr: ILLEGAL_CLASS ch 11 [017efba000 firefox[17446]] subc 0 class 0000 mthd 0110 data 20030700
Dec 21 21:19:08 [kernel] [54342.176300] nouveau 0000:03:00.0: gr: ILLEGAL_CLASS ch 11 [017efba000 firefox[17446]] subc 0 class 0000 mthd 0114 data 00001010
[...]
Dec 21 21:20:09 [kernel] [54402.836602] nouveau 0000:03:00.0: gr: ILLEGAL_CLASS ch 11 [017efba000 firefox[17446]] subc 0 class 0000 mthd 2644 data ef584f57
Dec 21 21:20:09 [kernel] [54402.836618] nouveau 0000:03:00.0: gr: ILLEGAL_CLASS ch 11 [017efba000 firefox[17446]] subc 0 class 0000 mthd 2648 data 00000002
Dec 21 21:20:09 [kernel] [54402.836635] nouveau 0000:03:00.0: gr: ILLEGAL_CLASS ch 11 [017efba000 firefox[17446]] subc 0 class 0000 mthd 264c data 00000000
Dec 21 21:21:15 [kernel] [54468.854863] sysrq: SysRq : Emergency Sync


03:00.0 VGA compatible controller: NVIDIA Corporation GK110B [GeForce GTX TITAN Z] (rev a1) (prog-if 00 [VGA controller])
        Subsystem: eVga.com. Corp. GK110B [GeForce GTX TITAN Z]
        Flags: bus master, fast devsel, latency 0, IRQ 34
        Memory at f8000000 (32-bit, non-prefetchable) [size=16M]
        Memory at e0000000 (64-bit, prefetchable) [size=128M]
        Memory at ee000000 (64-bit, prefetchable) [size=32M]
        I/O ports at cf00 [size=128]
        [virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: [60] Power Management version 3
        Capabilities: [68] MSI: Enable+ Count=1/1 Maskable- 64bit+
        Capabilities: [78] Express Endpoint, MSI 00
        Capabilities: [100] Virtual Channel
        Capabilities: [128] Power Budgeting <?>
        Capabilities: [420] Advanced Error Reporting
        Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
        Capabilities: [900] #19
        Kernel driver in use: nouveau

x11-drivers/xf86-video-nouveau
     IUSE (alle Versionen):  glamor
     Installierte Versionen: Version:   1.0.12

$> emerge --info | head -n3
Portage 2.3.3 (python 2.7.12-final-0, default/linux/amd64/13.0/desktop/plasma, gcc-4.9.3, glibc-2.23-r3, 4.8.2-gentoo_M16 x86_64)
=================================================================
System uname: Linux-4.8.2-gentoo_M16-x86_64-Intel-R-_Core-TM-_i7_CPU_870_@_2.93GHz-with-gentoo-2.3

$> USE="opengl"; for PKG in $(equery -q hasuse $USE); do echo $PKG: $(equery -q uses $PKG |grep $USE); done | head -n 2
app-emulation/qemu-2.7.0-r7: +opengl
app-emulation/wine-1.9.20: +opengl

# opencl is builtin for mesa and wine
$> USE="opencl"; for PKG in $(equery -q hasuse $USE); do echo $PKG: $(equery -q uses $PKG |grep $USE); done | head -n 2
app-emulation/wine-1.9.20: +opencl
media-libs/mesa-13.0.2: +opencl

# downgrading wine didn't help
$> eix app-emulation/wine$ | grep -i install
     installed Versions: Version:   1.9.20^t

$> eix firefox$
     installed Versions: Version:   50.1.0
Comment 1 derrierdo 2017-02-18 19:42:00 UTC
Problem still appear with my
01:00.0 VGA compatible controller: NVIDIA Corporation GT218 [GeForce 210] (rev a2)

Linux wizz 4.9.0-aufs #7 SMP Mon Jan 16 19:33:55 EST 2017 x86_64 Intel(R) Core(TM) i3-3220 CPU @ 3.30GHz GenuineIntel GNU/Linux



2017-02-18T14:17:30-05:00 wizz kernel: nouveau 0000:01:00.0: fifo: CACHE_ERROR - ch 17 [dolphin[5746]] subc 3 mthd 01d0 data beef0201
2017-02-18T14:17:30-05:00 wizz kernel: nouveau 0000:01:00.0: fifo: CACHE_ERROR - ch 17 [dolphin[5746]] subc 3 mthd 01d4 data beef0201
2017-02-18T14:17:30-05:00 wizz kernel: nouveau 0000:01:00.0: fifo: CACHE_ERROR - ch 17 [dolphin[5746]] subc 3 mthd 01d8 data beef0201
2017-02-18T14:17:30-05:00 wizz kernel: nouveau 0000:01:00.0: fifo: CACHE_ERROR - ch 17 [dolphin[5746]] subc 3 mthd 01dc data beef0201
2017-02-18T14:17:30-05:00 wizz kernel: nouveau 0000:01:00.0: gr: TRAP_CCACHE 00000001 [FAULT]
2017-02-18T14:17:30-05:00 wizz kernel: nouveau 0000:01:00.0: gr: TRAP_CCACHE 000e0080 00000000 00000000 00000000 00000000 00000004 00000000
2017-02-18T14:17:30-05:00 wizz kernel: nouveau 0000:01:00.0: gr: 00200000 [] ch 17 [003ea0f000 dolphin[5746]] subc 3 class 8597 mthd 13bc data 00000054
2017-02-18T14:17:30-05:00 wizz kernel: nouveau 0000:01:00.0: fb: trapped read at 002027ff00 on channel 17 [3ea0f000 dolphin[5746]] engine 00 [PGRAPH] client 05 [CCACHE] subclient 00 [CB] reason 00000006 [NULL_DMAOBJ]
Comment 2 Jeff 2017-07-06 20:12:06 UTC
I am seeing the _exact_ same issue (quoted below), as described by in the original message. Same symptoms and error messages.

"The screen freezes except for that the mouse pointer can be used (click has no effect). Apart from the magic syskeys (sync, unmount, boot) the keyboard doesn't react. I can not use ALT-tab to switch to another program. I can not leave the game (fullscreen mode) by using the ingame console or using ALT-F4. The mouse pointer is trapped on the monitor where the game runs (I have two). The sound keeps playing and sound effects are played now and then too. I tried to get to the terminals via CTRL-ALT-F[1-6] but nothing happens. Switching (KDE) desktops via CTRL-ALT-[arrow-left,arrow-right] is just the same."

I am running Ubuntu 16.10 on a Dell Precision Tower 5810.

Jul  6 10:54:38 nachos gcr-prompter[22026]: GtkDialog mapped without a transient parent. This is discouraged.
Jul  6 10:58:31 nachos kernel: [82204.289470] nouveau 0000:03:00.0: fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]
Jul  6 10:58:31 nachos kernel: [82204.289480] nouveau 0000:03:00.0: fifo: gr engine fault on channel 5, recovering...

03:00.0 VGA compatible controller: NVIDIA Corporation GK107 [NVS 510] (rev a1) (prog-if 00 [VGA controller])
        Subsystem: NVIDIA Corporation GK107 [NVS 510]
        Physical Slot: 2
        Flags: bus master, fast devsel, latency 0, IRQ 30
        Memory at f6000000 (32-bit, non-prefetchable) [size=16M]
        Memory at e0000000 (64-bit, prefetchable) [size=256M]
        Memory at f0000000 (64-bit, prefetchable) [size=32M]
        I/O ports at e000 [size=128]
        Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: <access denied>
        Kernel driver in use: nouveau
        Kernel modules: nvidiafb, nouveau

Linux nachos 4.8.0-58-generic #63-Ubuntu SMP Mon Jun 26 17:08:21 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
Comment 3 as.newsletter 2017-07-09 18:37:16 UTC
On one occasion I was able to kill xorg (?). The effect was that the session was over but I could log in and start working - no reboot!
Can you try using ALT+print+k ? It took quite a while until the login screen was available, maybe five minutes. 
Anyhow that works *sometimes* and often it does not.
Comment 4 Artem S. Tashkinov 2018-05-30 12:47:56 UTC
Probably related bug 105884
Comment 5 kenorb 2019-01-06 00:09:03 UTC
Related bug: #107829
Related post: https://askubuntu.com/q/1046945/78223


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.