Bug 90686 - [gen4] GPU hang
Summary: [gen4] GPU hang
Status: RESOLVED FIXED
Alias: None
Product: Mesa
Classification: Unclassified
Component: Drivers/DRI/i965
Version: 10.1
Hardware: x86 (IA32) Linux (All)
Importance: medium blocker
Assignee: Ian Romanick
QA Contact: Intel 3D Bugs Mailing List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-05-27 20:44 UTC by mathias.aldrin
Modified: 2015-05-28 07:21 UTC

See Also:
i915 platform:
i915 features:


Attachments
card0 (855.06 KB, text/plain), 2015-05-27 20:44 UTC, mathias.aldrin
dmesg (52.57 KB, text/plain), 2015-05-27 20:59 UTC, mathias.aldrin
lspci (1.90 KB, text/plain), 2015-05-27 21:00 UTC, mathias.aldrin
mesa (146 bytes, text/plain), 2015-05-27 21:01 UTC, mathias.aldrin
xorg version (661 bytes, text/plain), 2015-05-27 21:01 UTC, mathias.aldrin

Description mathias.aldrin 2015-05-27 20:44:56 UTC
Created attachment 116094 [details]
card0

I just installed Linux Mint 17.1 and updated all packages.
After that I tried to do some work, but the mouse pointer froze.
Looking in syslog, I found the following:

May 27 20:28:36 fujitsu pulseaudio[2321]: [pulseaudio] pid.c: Daemon already running.
May 27 20:30:39 fujitsu kernel: [  169.808065] [drm] stuck on render ring
May 27 20:30:39 fujitsu kernel: [  169.808077] [drm] GPU crash dump saved to /sys/class/drm/card0/error
May 27 20:30:39 fujitsu kernel: [  169.808079] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
May 27 20:30:39 fujitsu kernel: [  169.808089] [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
May 27 20:30:39 fujitsu kernel: [  169.808093] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
May 27 20:30:39 fujitsu kernel: [  169.808095] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
May 27 20:30:39 fujitsu kernel: [  169.808873] [drm:i915_set_reset_status] *ERROR* render ring hung inside bo (0x5f95000 ctx 0) at 0x5f951cc
May 27 20:30:39 fujitsu kernel: [  169.864012] [drm] GMBUS [i915 gmbus vga] timed out, falling back to bit banging on pin 2
May 27 20:30:40 fujitsu kernel: [  170.320016] [drm:i915_reset] *ERROR* Failed to reset chip.


Time: 1432751439 s 812273 us
Kernel: 3.13.0-37-generic
PCI ID: 0x2992
EIR: 0x00000000
IER: 0x00028053
PGTBL_ER: 0x00000000
FORCEWAKE: 0x00000000
DERRMR: 0x00000000
CCID: 0x00000000
Missed interrupts: 0x00000000
  fence[0] = a5f0000056009d
  fence[1] = 605c0000605500d
  fence[2] = 8b0d0000630e07d
  fence[3] = 14bb0000146409d
  fence[4] = 8ded00008b6e06d
  fence[5] = 56c5000056b200d
  fence[6] = 4f7300004ce009d
  fence[7] = 45db000045d000d
  fence[8] = 9467000092e802d
  fence[9] = f7700000f6400d
  fence[10] = 57ad000056dc02d
  fence[11] = 599e000058ef01d
  fence[12] = 5a0a0000599f01d
  fence[13] = 605300005fb404d
  fence[14] = 607b0000606804d
  fence[15] = 630d0000608e06d
  INSTDONE_0: 0x3fe5fafd
  INSTDONE_1: 0x000ffffd
  INSTDONE_2: 0x00000000
  INSTDONE_3: 0x00000000
render command stream:
  HEAD: 0x00013200
  TAIL: 0x00013420
  CTL: 0x0001f001
  ACTHD: 0x05f951cc
  IPEIR: 0x00000000
  IPEHR: 0x79010003
  INSTDONE: 0x3fe5fafd
  BBADDR: 0x05f951cb
  BB_STATE: 0x000000a0
  INSTPS: 0x8001e022
  INSTPM: 0x00000000
  FADDR: 0x05f95280
  seqno: 0xfffff7a4
  waiting: yes
  ring->head: 0x00000000
  ring->tail: 0x00013420
  hangcheck: hung [40]
Active [181]:
  05f95000    32768 7e 00 fffff7a5 0 dirty purgeable render uncached
  06054000     4096 76 00 fffff7a5 0 dirty purgeable render uncached
  08b22000     4096 76 00 fffff7a5 0 dirty purgeable render uncached
  08b23000     4096 76 00 fffff7a5 0 dirty purgeable render uncached
  00544000     4096 7e 00 fffff7a6 0 p dirty render uncached
  08e11000  4423680 02 02 fffff7a6 fffff7a6 X dirty render uncached
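
As a reading aid for the dump above, here is a minimal Python sketch that cross-checks the "render ring hung inside bo" kernel message against the first entry of the Active list. It assumes that the first two columns of an Active entry are the buffer's start address (hex) and its size in bytes (decimal), and that GPU commands are 4-byte dwords. The helper name `hang_offset` is hypothetical and not part of any i915 tooling.

```python
import re

# Kernel message copied from the syslog excerpt above.
LOG_LINE = ("[drm:i915_set_reset_status] *ERROR* render ring hung inside bo "
            "(0x5f95000 ctx 0) at 0x5f951cc")

# First entry of the "Active" list in the dump.
# Assumption: column 1 is the start address (hex), column 2 the size in bytes.
ACTIVE_ENTRY = "05f95000    32768 7e 00 fffff7a5 0 dirty purgeable render uncached"


def hang_offset(log_line, active_entry):
    """Return the byte offset of the hang address inside the hung buffer."""
    match = re.search(
        r"hung inside bo \((0x[0-9a-f]+) ctx \d+\) at (0x[0-9a-f]+)", log_line
    )
    if match is None:
        raise ValueError("not a 'hung inside bo' message")
    bo_start = int(match.group(1), 16)
    hang_addr = int(match.group(2), 16)

    fields = active_entry.split()
    entry_start = int(fields[0], 16)
    entry_size = int(fields[1])
    if entry_start != bo_start:
        raise ValueError("active entry does not describe the hung buffer")

    offset = hang_addr - bo_start
    if not 0 <= offset < entry_size:
        raise ValueError("hang address lies outside the buffer")
    return offset


offset = hang_offset(LOG_LINE, ACTIVE_ENTRY)
# Byte offset in hex, then the dword index assuming 4-byte commands.
print(hex(offset), offset // 4)
```

For the values in this report, the hang address 0x5f951cc falls 0x1cc bytes into the 32768-byte buffer at 0x5f95000, which matches the ACTHD value of 0x05f951cc in the render command stream section.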
Comment 1 mathias.aldrin 2015-05-27 20:59:59 UTC
Created attachment 116095 [details]
dmesg
Comment 2 mathias.aldrin 2015-05-27 21:00:38 UTC
Created attachment 116096 [details]
lspci
Comment 3 mathias.aldrin 2015-05-27 21:01:09 UTC
Created attachment 116097 [details]
mesa
Comment 4 mathias.aldrin 2015-05-27 21:01:34 UTC
Created attachment 116098 [details]
xorg version
Comment 5 mathias.aldrin 2015-05-27 21:03:14 UTC
The symptom looks similar to bug 77207, which is resolved and closed.
Comment 6 Matt Turner 2015-05-27 21:47:44 UTC
Mesa 10.1 is ancient at this point. Please try mesa-10.5.x.
Comment 7 mathias.aldrin 2015-05-28 06:14:18 UTC
Hi,

I updated the SW and have not seen the crash since.

sudo apt-add-repository ppa:oibaf/graphics-drivers
sudo apt-get update
sudo apt-get dist-upgrade
sudo reboot now



inxi -Fxz
System: Host: fujitsu Kernel: 3.13.0-37-generic i686 (32 bit, gcc: 4.8.2) Desktop: N/A Distro: Linux Mint 17.1 Rebecca
Machine: System: FUJITSU SIEMENS product: ESPRIMO E
Mobo: FUJITSU SIEMENS model: D2344-A2 version: S26361-D2344-A2
Bios: FUJITSU SIEMENS // Phoenix version: 6.00 R1.04.2344.A2 date: 11/17/2006
CPU: Dual core Intel Core2 CPU 6300 (-MCP-) cache: 2048 KB flags: (lm nx sse sse2 sse3 ssse3 vmx) bmips: 7447.98 
Clock Speeds: 1: 1600.00 MHz 2: 1867.00 MHz
Graphics: Card: Intel 82Q963/Q965 Integrated Graphics Controller bus-ID: 00:02.0 
X.Org: 1.15.1 drivers: intel (unloaded: fbdev,vesa) Resolution: 1920x1080@60.0hz 
GLX Renderer: Gallium 0.4 on llvmpipe (LLVM 3.6, 128 bits) GLX Version: 3.0 Mesa 10.7.0-devel (git-4066536 2015-05-27 trusty-oibaf-ppa) Direct Rendering: Yes
Audio: Card: Intel 82801H (ICH8 Family) HD Audio Controller driver: snd_hda_intel bus-ID: 00:1b.0
Sound: Advanced Linux Sound Architecture ver: k3.13.0-37-generic
Network: Card-1: Broadcom NetXtreme BCM5755 Gigabit Ethernet PCI Express driver: tg3 ver: 3.134 bus-ID: 09:00.0
IF: eth0 state: down mac: <filter>
Card-2: Ralink RT2790 Wireless 802.11n 1T/2R PCIe driver: rt2800pci ver: 2.3.0 bus-ID: 06:00.0
IF: wlan0 state: up mac: <filter>
Drives: HDD Total Size: 80.0GB (6.2% used) 1: id: /dev/sda model: ST3808110AS size: 80.0GB 
Partition: ID: / size: 70G used: 4.7G (8%) fs: ext4 ID: swap-1 size: 4.28GB used: 0.00GB (0%) fs: swap 
RAID: No RAID devices detected - /proc/mdstat and md_mod kernel raid module present
Sensors: System Temperatures: cpu: 66.0C mobo: N/A 
Fan Speeds (in rpm): cpu: N/A 
Info: Processes: 107 Uptime: 3 min Memory: 129.8/4024.3MB Runlevel: 2 Gcc sys: 4.8.2 Client: Shell inxi: 1.8.4



Br Mathias
Comment 8 Matt Turner 2015-05-28 06:16:47 UTC
Excellent. Thanks for testing. Please reopen (or file a new bug) if you encounter more hangs.

