Bug 67511

Summary: [ILK Regression]igt/gem_suspend/fence-restore-untiled causes system hang
Product: DRI Reporter: lu hua <huax.lu>
Component: DRM/IntelAssignee: Rafael J. Wysocki <rjw>
Status: CLOSED NOTOURBUG QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: major    
Priority: high CC: xunx.fang, yangweix.shui
Version: unspecified   
Hardware: All   
OS: Linux (All)   
Whiteboard:
i915 platform: i915 features:

Description lu hua 2013-07-30 02:50:23 UTC
System Environment:
--------------------------
Platform:    Ironlake
Kernel:     (drm-intel-next-queued)fae5cbff3e3e4097de2e76178d462b90626c2bdb

Bug detailed description:
-----------------------------
It causes system hang on ironlake with -queued, -fixes, -nightly kernel.

The latest known good commit:d861e3387650296f1fca2a4dd0dcd380c8fdddad.
The latest known bad commit: 3b27af3560f3cfe4e09171024515fa304ebae93b

output:
rtcwake: wakeup from "mem" using /dev/rtc0 at Tue Jul 30 02:46:39 2013

dmesg:
[  146.062106] console [netcon0] enabled
[  146.105912] netconsole: network logging started
[  146.710959] console [netcon0] enabled
[  146.754781] netconsole: network logging started
[  163.758369] [drm:i915_driver_open],
[  163.801444] [drm:intel_crtc_cursor_set], cursor off
[  163.860075] [drm:intel_crtc_set_config], [CRTC:3] [FB:35] #connectors=1 (x y) (0 0)
[  163.951996] [drm:intel_modeset_stage_output_state], [CONNECTOR:7:VGA-1] to [CRTC:3]
[  164.043930] [drm:intel_crtc_cursor_set], cursor off
[  164.102627] [drm:intel_crtc_set_config], [CRTC:5] [NOFB]
[  164.166476] [drm:intel_modeset_stage_output_state], [CONNECTOR:7:VGA-1] to [CRTC:3]
[  164.258379] [drm:i915_driver_open],
[  164.448192] PM: Syncing filesystems ... done.
[  164.500798] PM: Preparing system for mem sleep
[  164.554220] Freezing user space processes ... (elapsed 0.001 seconds) done.
[  164.639288] Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
[  164.729662] PM: Entering mem sleep
[  164.770492] Suspending console(s) (use no_console_suspend to debug)
[  164.845552] hpet1: lost 2 rtc interrupts
[  164.892579] hpet1: lost 2 rtc interrupts
[  164.939584] hpet1: lost 2 rtc interrupts
[  164.986591] hpet1: lost 2 rtc interrupts
[  165.033598] hpet1: lost 2 rtc interrupts
[  165.080622] hpet1: lost 2 rtc interrupts
[  165.127635] hpet1: lost 2 rtc interrupts
[  165.174621] hpet1: lost 2 rtc interrupts
[  165.221628] hpet1: lost 2 rtc interrupts
[  165.268635] hpet1: lost 2 rtc interrupts


Reproduce steps:
----------------------------
1. ./gem_suspend --run-subtest fence-restore-untiled
Comment 1 Chris Wilson 2013-07-30 08:21:32 UTC
Hmm, can you please complete the bisect - whatever the cause it is likely accidental.
Comment 2 lu hua 2013-07-31 09:03:58 UTC
It only happens on 3.5 sata HD. It works well on SSD HD. 
If connect this 3.5 sata HD to haswell, This issue also exists. 
lspci
00:00.0 Host bridge: Intel Corporation Core Processor DRAM Controller (rev 12)
00:02.0 VGA compatible controller: Intel Corporation Core Processor Integrated Graphics Controller (rev 12)
00:16.0 Communication controller: Intel Corporation 5 Series/3400 Series Chipset HECI Controller (rev 06)
00:1a.0 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB Universal Host Controller (rev 06)
00:1a.1 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB Universal Host Controller (rev 06)
00:1a.2 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB Universal Host Controller (rev 06)
00:1a.7 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB2 Enhanced Host Controller (rev 06)
00:1b.0 Audio device: Intel Corporation 5 Series/3400 Series Chipset High Definition Audio (rev 06)
00:1c.0 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 1 (rev 06)
00:1c.4 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 5 (rev 06)
00:1c.5 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 6 (rev 06)
00:1d.0 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB Universal Host Controller (rev 06)
00:1d.1 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB Universal Host Controller (rev 06)
00:1d.2 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB Universal Host Controller (rev 06)
00:1d.7 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB2 Enhanced Host Controller (rev 06)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev a6)
00:1f.0 ISA bridge: Intel Corporation 5 Series Chipset LPC Interface Controller (rev 06)
00:1f.2 IDE interface: Intel Corporation 5 Series/3400 Series Chipset 4 port SATA IDE Controller (rev 06)
00:1f.3 SMBus: Intel Corporation 5 Series/3400 Series Chipset SMBus Controller (rev 06)
00:1f.5 IDE interface: Intel Corporation 5 Series/3400 Series Chipset 2 port SATA IDE Controller (rev 06)
02:00.0 IDE interface: JMicron Technology Corp. JMB368 IDE controller
03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 03)
04:07.0 FireWire (IEEE 1394): Texas Instruments TSB43AB23 IEEE-1394a-2000 Controller (PHY/Link)
3f:00.0 Host bridge: Intel Corporation Core Processor QuickPath Architecture Generic Non-core Registers (rev 02)
3f:00.1 Host bridge: Intel Corporation Core Processor QuickPath Architecture System Address Decoder (rev 02)
3f:02.0 Host bridge: Intel Corporation Core Processor QPI Link 0 (rev 02)
3f:02.1 Host bridge: Intel Corporation Core Processor QPI Physical 0 (rev 02)
3f:02.2 Host bridge: Intel Corporation Core Processor Reserved (rev 02)
3f:02.3 Host bridge: Intel Corporation Core Processor Reserved (rev 02)
Comment 3 Chris Wilson 2013-07-31 09:12:46 UTC
Sounds like NOTOURBUG, but it may be bisectable so worth investigating.
Comment 4 lu hua 2013-08-02 03:22:27 UTC
There are only 'skip'ped commits left to test.
The first bad commit could be any of:
b2c064b25ad07169b2892a733918e6b941bf3366
2e199192df85eb936a7829dc28b57b85c59c86fc
2e4f1db49d97222110b6add9a2c6cf5251a41e35
1001b4d4a8ee6b2e7a6078a02ccdf68f91b192bd
be547436c22c3b7d934d9afd841cfd7a6807f7ab
4960e05e22604ee270a023f968e0e4f9bd0c6fef
ea50be59345a2b714fd3ed43e1bba89906c177c3
303bfdb1a14d0460feb859cd008ff81da36b517c
e2ff39400d81233374e780b133496a2296643d7d
242831eb15a06fa4414eaa705fdc6dd432ab98d1
ac212b6980d8d5eda705864fc5a8ecddc6d6eacc
173a5a4c909789fcd57d00355d2237618a3824a4
683058e315f00a216fd6c79df4f63bc9945ca434
0902a9044fa5b7a0456ea4daacec2c2b3189ba8c
aba6efc47133af4941cda16e690f71b7ad894da2
4f3549d72d1b5c90ecc7e673402f38f4486d22c2
c655affbd524d0105978ecd696c3bb8a281b418b
08f502c1c343031f0d126bd00e87dede38269d12
dd3652ed1a5328a91352409fe81af3e7d629e44b
a204dbc61b7f4cb1a7e2cb3ad057b135164782da
Comment 5 Chris Wilson 2013-08-02 07:16:14 UTC
Looks like ACPI. Please file a bug in bugzilla.kernel.org and cc Rafael J. Wysocki <rafael.j.wysocki@intel.com>

Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.