Bug 23354 - M52 GPU hang and reset failed
Summary: M52 GPU hang and reset failed
Status: RESOLVED MOVED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Radeon (show other bugs)
Version: XOrg git
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-08-16 15:01 UTC by Pauli
Modified: 2019-11-19 08:07 UTC (History)
0 users

See Also:
i915 platform:
i915 features:


Attachments
dmesg captured from terminal using ssh. (45.39 KB, text/plain)
2009-08-16 15:01 UTC, Pauli
no flags Details
lspci for hardware details. (23.18 KB, text/plain)
2009-08-16 15:24 UTC, Pauli
no flags Details
dmesg from 2nd hang (122.90 KB, text/plain)
2009-08-16 15:57 UTC, Pauli
no flags Details
all IB buffers dumped to single file (181.35 KB, text/plain)
2009-08-16 15:58 UTC, Pauli
no flags Details

Description Pauli 2009-08-16 15:01:06 UTC
Created attachment 28679 [details]
dmesg captured from terminal using ssh.

I was running modified version of mesa when I got GPU hang. (My changes may have broken state)



But effects of gpu hang were fatal:
-dmesg has message that radeon was trying to reset GPU
-ring test failed so reset failed
-dmesg is spammed with messages that scheduling IB failed
-whole computer is unresponsive localy
-open ssh connection works for only few commands (luckily dmesg did work)
-It seems like no disk access can happen after hang
-No new ssh connection are possible
-Old ssh connection did freeze soon so not much useful info there

After reboot logs didn't include any info from hang or after it.

I have seen before similar hang with M9+ card but no ssh connected then so no debug info.
Comment 1 Pauli 2009-08-16 15:24:27 UTC
Created attachment 28680 [details]
lspci for hardware details.

software details:
kernel is vanila 2.6.31-rc6
mesa master from yesterday with my patches applied+minor hacking
libdrm master 1d465178fbab77a9c
xf86-video-ati master cd99d9f0

PS. Afre ssh freeze I did try Sysrq+S,K,R and Then just B. Nothing did happend to first 3 but reboot worked.
Comment 2 Pauli 2009-08-16 15:57:45 UTC
Created attachment 28681 [details]
dmesg from 2nd hang

ok. I can reproduce this hang with my broken mesa.

This time I got IB info over ssh before I lost control.
Comment 3 Pauli 2009-08-16 15:58:39 UTC
Created attachment 28682 [details]
all IB buffers dumped to single file
Comment 4 Martin Peres 2019-11-19 08:07:50 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/65.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.