Bug 88651

Summary: [HSW]igt/gem_cs_prefetch & gem_dummy_reloc_loop & gem_cpu_reloc sporadically causes *ERROR* Hangcheck timer elapsed... blitter ring idle
Product: DRI
Component: DRM/Intel
Status: CLOSED DUPLICATE
Severity: normal
Priority: medium
Version: unspecified
Hardware: All
OS: Linux (All)
Whiteboard:
i915 platform: HSW
i915 features: GEM/Other
Reporter: lu hua <huax.lu>
Assignee: Mika Kuoppala <mika.kuoppala>
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
CC: christophe.prigent, humberto.i.perez.rodriguez, intel-gfx-bugs
Attachments:
Description: dmesg; Flags: none
Description: drm/i915: Use ring head/tail to check ring idle in hangcheck; Flags: none
Description: gem_dummy_reloc_loop.zip; Flags: none
Description: gem_cpu_reloc.zip; Flags: none

Description lu hua 2015-01-21 05:58:26 UTC
Created attachment 112583
dmesg

==System Environment==
--------------------------
Regression: not sure

Non-working platforms: HSW

==kernel==
--------------------------
drm-intel-nightly/d6bc7a6a0a7573350e8be8ec54002c20d1dbe1e0
commit d6bc7a6a0a7573350e8be8ec54002c20d1dbe1e0
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Tue Jan 20 15:10:59 2015 +0100

    drm-intel-nightly: 2015y-01m-20d-14h-10m-40s UTC integration manifest

==Bug detailed description==
-----------------------------
igt/gem_cs_prefetch sporadically causes the following error (fail rate: 1/3):
<3>[   77.789102] [drm:i915_hangcheck_elapsed [i915]] *ERROR* Hangcheck timer elapsed... blitter ring idle
igt/gem_cpu_reloc also sporadically causes this error.

output:
IGT-Version: 1.9-ga29f28e (x86_64) (Linux: 3.19.0-rc4_drm-intel-nightly_d6bc7a_20150121+ x86_64)
gem_cs_prefetch: 100%
Test suceeded, cleanup up - this might take a while.

==Reproduce steps==
---------------------------- 
1. ./gem_cs_prefetch
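
Because the error shows up only sporadically, a loop that reruns the test and checks the kernel log after each run can help pin down the fail rate. The following is a minimal sketch and not part of the original report: the helper names (`has_hangcheck_error`, `run_once`, `fail_rate`) are hypothetical, and it assumes root privileges so that `dmesg -C` can clear the kernel ring buffer between runs.

```python
import subprocess

# Substring taken from the error line quoted in this report.
HANGCHECK_MARKER = "Hangcheck timer elapsed"


def has_hangcheck_error(dmesg_text):
    """Return True if any kernel log line contains the hangcheck error."""
    return any(HANGCHECK_MARKER in line for line in dmesg_text.splitlines())


def run_once(test_cmd):
    """Hypothetical helper: clear the kernel log, run the test, return the new log."""
    subprocess.run(["dmesg", "-C"], check=True)  # clear ring buffer (needs root)
    subprocess.run(test_cmd, check=False)        # the test itself may pass or fail
    result = subprocess.run(["dmesg"], capture_output=True, text=True, check=True)
    return result.stdout


def fail_rate(test_cmd, iterations, runner=run_once):
    """Run the test `iterations` times and return (failures, iterations)."""
    failures = sum(has_hangcheck_error(runner(test_cmd)) for _ in range(iterations))
    return failures, iterations
```

For example, `fail_rate(["./gem_cs_prefetch"], 10)` would run the test ten times and report in how many runs the hangcheck message appeared. The `runner` parameter exists only so that the log source can be swapped out, for instance with saved dmesg captures.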
Comment 1 lu hua 2015-01-21 06:05:52 UTC
gem_dummy_reloc_loop also has this error. Fail rate: 1/5

[root@x-hsw27 tests]# ./gem_dummy_reloc_loop --run-subtest blt
IGT-Version: 1.9-ga29f28e (x86_64) (Linux: 3.19.0-rc4_drm-intel-nightly_d6bc7a_20150121+ x86_64)
running dummy loop on blt
dummy loop run on blt completed
Subtest blt: SUCCESS (6.048s)
[root@x-hsw27 tests]# dmesg -r|egrep "<[1-4]>"|grep drm
<3>[   91.814292] [drm:i915_hangcheck_elapsed [i915]] *ERROR* Hangcheck timer elapsed... blitter ring idle
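
The shell pipeline above keeps raw kernel log lines that carry a priority prefix of `<1>` through `<4>` and mention `drm`. For scripted checks, the same filter could be sketched in Python as below. The function name `drm_errors` is hypothetical, and the input is assumed to be the text produced by `dmesg -r`.

```python
import re

# Matches the priority prefixes <1> .. <4> anywhere in the line,
# mirroring egrep "<[1-4]>" in the pipeline above.
LEVEL_RE = re.compile(r"<[1-4]>")


def drm_errors(raw_dmesg):
    """Return the lines of `dmesg -r` output at level 1-4 that mention drm."""
    return [
        line
        for line in raw_dmesg.splitlines()
        if LEVEL_RE.search(line) and "drm" in line
    ]
```

An empty result corresponds to the pipeline printing nothing, meaning no drm message at warning level or above was logged.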
Comment 2 Mika Kuoppala 2015-02-03 12:33:04 UTC
Created attachment 113106
drm/i915: Use ring head/tail to check ring idle in hangcheck
Comment 3 Daniel Vetter 2015-02-24 10:35:52 UTC
You need to set to NEEDINFO, otherwise QA doesn't test - it won't show up in their queries.
Comment 4 Mika Kuoppala 2015-02-24 11:19:07 UTC
That patch was garbage so nothing to test.
Comment 5 Humberto Israel Perez Rodriguez 2015-08-12 05:05:57 UTC
Hi, the program "gem_cs_prefetch", which contains only one subtest, was run on HSW-ULT and passed with the latest configuration, as indicated below:

-- Hardware --
Platform: Intel NUC D54250WYK
Processor: Intel(R) Core(TM) i5-4250U CPU @ 1.30GHz
-- Software --
Linux distribution: Ubuntu 14.04.02 LTS 64-bit
BIOS: WYLPT10H.86A.0021.2013.1017.1606



Test Environment:
------------------------------------
Kernel: tag drm-intel-testing-2015-07-31 (4.2-rc4) from git://anongit.freedesktop.org/drm-intel
Mesa: mesa-10.6.3 from http://cgit.freedesktop.org/mesa/mesa/
Xf86_video_intel: 2.99.917 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
Libdrm: libdrm-2.4.62 from http://cgit.freedesktop.org/mesa/drm/
Cairo: 1.14.2 from http://cgit.freedesktop.org/cairo
libva: libva-1.6.0 from http://cgit.freedesktop.org/libva/
intel-driver: 1.6.0. from http://cgit.freedesktop.org/vaapi/intel-driver
xorg: 1.17.99 installed with script git_xorg.sh
Xserver: xorg-server-1.17.2 from http://cgit.freedesktop.org/xorg/xserver
Intel-gpu-tools: 1.11 from http://cgit.freedesktop.org/xorg/app/intel-gpu-tools/


If you need more information or have any questions, do not hesitate to contact me.
Comment 6 Humberto Israel Perez Rodriguez 2015-08-12 15:07:40 UTC
Created attachment 117650
gem_dummy_reloc_loop.zip


Hi, for the program "gem_dummy_reloc_loop", the following test passes on HSW-ULT:

# ./gem_dummy_reloc_loop --run-subtest blt   


but these 2 tests were skipped:

# ./gem_dummy_reloc_loop --run-subtest bsd-ring1
# ./gem_dummy_reloc_loop --run-subtest bsd-ring2

So, one question here is whether I need to raise a new bug?

** The gem_dummy_reloc_loop.zip contains the console output for these subtests and the dmesg log **
gem_dummy_reloc_loop --debug --run-subtest bsd-ring1.txt
gem_dummy_reloc_loop --debug --run-subtest bsd-ring2.txt
gem_dummy_reloc_loop --debug --run-subtest blt.txt
HW-ULT_dmesg.txt



For the program "gem_cpu_reloc", the following subtests passed:

# ./gem_cpu_reloc --run-subtest basic
# ./gem_cpu_reloc --run-subtest full

Please see gem_cpu_reloc.zip, which contains the evidence for these tests.


This was the configuration under which the subtests were run:

-- Hardware --
Platform: Intel NUC D54250WYK
Processor: Intel(R) Core(TM) i5-4250U CPU @ 1.30GHz
-- Software --
Linux distribution: Ubuntu 14.04.02 LTS 64-bit
BIOS: WYLPT10H.86A.0021.2013.1017.1606



Test Environment:
------------------------------------
Kernel: tag drm-intel-testing-2015-07-31 (4.2-rc4) from git://anongit.freedesktop.org/drm-intel
Mesa: mesa-10.6.3 from http://cgit.freedesktop.org/mesa/mesa/
Xf86_video_intel: 2.99.917 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
Libdrm: libdrm-2.4.62 from http://cgit.freedesktop.org/mesa/drm/
Cairo: 1.14.2 from http://cgit.freedesktop.org/cairo
libva: libva-1.6.0 from http://cgit.freedesktop.org/libva/
intel-driver: 1.6.0. from http://cgit.freedesktop.org/vaapi/intel-driver
xorg: 1.17.99 installed with script git_xorg.sh
Xserver: xorg-server-1.17.2 from http://cgit.freedesktop.org/xorg/xserver
Intel-gpu-tools: 1.11 from http://cgit.freedesktop.org/xorg/app/intel-gpu-tools/


If you need more information or have any questions, do not hesitate to contact me.
Comment 7 Humberto Israel Perez Rodriguez 2015-08-12 15:08:29 UTC
Created attachment 117651
gem_cpu_reloc.zip
Comment 8 Chris Wilson 2016-03-04 19:29:01 UTC

*** This bug has been marked as a duplicate of bug 94217 ***
