Bug 95330 - [NV84] Hangs with gr: DATA_ERROR [INVALID_BITFIELD], TRAP_PROP [RT_FAULT], fb: trapped write [PGRAPH] [PROP] [RT0] [PAGE_NOT_PRESENT], bsp: Watchdog interrupt, engine hung
Status: RESOLVED INVALID
Alias: None
Product: xorg
Classification: Unclassified
Component: Driver/nouveau
Version: unspecified
Hardware: Other All
Importance: medium normal
Assignee: Nouveau Project
QA Contact: Xorg Project Team
Reported: 2016-05-10 02:36 UTC by Matt Whitlock
Modified: 2016-06-03 23:20 UTC (History)
CC List: 0 users

Attachments
kernel.log (70.07 KB, text/plain)
2016-05-10 02:36 UTC, Matt Whitlock
xorg.log (31.14 KB, text/plain)
2016-05-10 02:37 UTC, Matt Whitlock

Description Matt Whitlock 2016-05-10 02:36:01 UTC
Created attachment 123586
kernel.log

Nouveau has been wildly unstable for me ever since I upgraded from about Linux 4.1 to 4.4+. The latest manifestation of this bumpy ride was a full X.org server hang. I was still able to SSH in to capture some logs.

01:00.0 VGA compatible controller: NVIDIA Corporation G84 [GeForce 8600 GT] (rev a1) (prog-if 00 [VGA controller])
	Subsystem: Gigabyte Technology Co., Ltd G84 [GeForce 8600 GT]
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0, Cache Line Size: 64 bytes
	Interrupt: pin A routed to IRQ 27
	Region 0: Memory at e4000000 (32-bit, non-prefetchable) [size=16M]
	Region 1: Memory at d0000000 (64-bit, prefetchable) [size=256M]
	Region 3: Memory at e2000000 (64-bit, non-prefetchable) [size=32M]
	Region 5: I/O ports at 3000 [size=128]
	Expansion ROM at <ignored> [disabled]
	Capabilities: [60] Power Management version 2
		Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
		Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
	Capabilities: [68] MSI: Enable+ Count=1/1 Maskable- 64bit+
		Address: 00000000fee0100c  Data: 41e1
	Capabilities: [78] Express (v1) Endpoint, MSI 00
		DevCap:	MaxPayload 128 bytes, PhantFunc 0, Latency L0s <512ns, L1 <4us
			ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset- SlotPowerLimit 75.000W
		DevCtl:	Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
			RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
			MaxPayload 128 bytes, MaxReadReq 512 bytes
		DevSta:	CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
		LnkCap:	Port #0, Speed 2.5GT/s, Width x16, ASPM L0s L1, Exit Latency L0s <512ns, L1 <4us
			ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp-
		LnkCtl:	ASPM Disabled; RCB 128 bytes Disabled- CommClk+
			ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
		LnkSta:	Speed 2.5GT/s, Width x16, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
	Capabilities: [100 v1] Virtual Channel
		Caps:	LPEVC=0 RefClk=100ns PATEntryBits=1
		Arb:	Fixed- WRR32- WRR64- WRR128-
		Ctrl:	ArbSelect=Fixed
		Status:	InProgress-
		VC0:	Caps:	PATOffset=00 MaxTimeSlots=1 RejSnoopTrans-
			Arb:	Fixed- WRR32- WRR64- WRR128- TWRR128- WRR256-
			Ctrl:	Enable+ ID=0 ArbSelect=Fixed TC/VC=01
			Status:	NegoPending- InProgress-
	Capabilities: [128 v1] Power Budgeting <?>
	Capabilities: [600 v1] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
	Kernel driver in use: nouveau


Software versions in use:

  * Linux kernel 4.5.3-gentoo
  * X.org X11 server 1.18.3
  * xf86-video-nouveau 1.0.12
  * libdrm 2.4.68
  * Mesa 11.2.1


The hang occurred shortly after quitting VLC. The kernel log shows multiple errors from nouveau at this time, followed by an eventual segfault of vlc in libxcb.so.1.1.0 and a segfault of kactivitymanage in libQt5Sql.so.5.6.0.

I logged in via SSH and issued a reboot command, which took a while to process. Kernel log shows multiple "failed to idle channel" errors from nouveau, apparently one per X client. Also "fifo: channel … unload timeout" and "bsp: Watchdog interrupt, engine hung." See attached kernel log for specifics.
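
For anyone triaging a similar log, the messages quoted above can be tallied with a short script. This is only a rough sketch: the function name and labels are made up for illustration, the patterns are just the fragments quoted in this report, and the wildcard stands in for the elided part of the fifo message rather than guessing it.

```python
import re
from collections import Counter

# Message fragments quoted in this report. The ".*" stands in for the
# elided portion of the fifo message; nothing specific is assumed there.
PATTERNS = {
    "failed to idle channel": re.compile(r"failed to idle channel"),
    "fifo unload timeout": re.compile(r"fifo: channel .* unload timeout"),
    "bsp watchdog": re.compile(r"bsp: Watchdog interrupt, engine hung"),
}

def tally_nouveau_errors(lines):
    """Return a Counter of how many lines match each quoted message."""
    counts = Counter()
    for line in lines:
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
    return counts
```

To use it on a saved log, open the file and pass the file object in, e.g. `tally_nouveau_errors(open("kernel.log", errors="replace"))`, then print the resulting counts.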

Comment 1 Matt Whitlock 2016-05-10 02:37:08 UTC
Created attachment 123587
xorg.log

The X.org server log shows "[mi] EQ overflowing" errors with backtraces.

Comment 2 Matt Whitlock 2016-06-03 23:20:25 UTC
I think the kernel upgrade was coincidental. I discovered several failed capacitors on my motherboard. Replacing them made my problems go away. Closing this bug report as invalid.

