Bug 88981 - [BSW/SKL/BYT/BXT]igt/gem_exec_lut_handle causes *ERROR* Hangcheck timer elapsed... render ring idle
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: unspecified
Hardware: All Linux (All)
Importance: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-02-05 05:20 UTC by lu hua
Modified: 2017-07-24 22:48 UTC

See Also:
i915 platform: BSW/CHT, SKL
i915 features: GEM/Other


Attachments
dmesg (124.89 KB, text/plain)
2015-04-07 02:40 UTC, lu hua

Description lu hua 2015-02-05 05:20:29 UTC
==System Environment==
--------------------------
Regression: not sure

==kernel==
--------------------------
drm-intel-nightly/0eafcbf6bc5917efd97c9916329c6f859cfb41ed
commit 0eafcbf6bc5917efd97c9916329c6f859cfb41ed
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Tue Feb 3 19:19:19 2015 +0100

    drm-intel-nightly: 2015y-02m-03d-18h-18m-53s UTC integration manifest

==Bug detailed description==
-----------------------------
During automated testing, the test triggers the following dmesg error:
<3>[ 4329.946633] [drm:i915_hangcheck_elapsed [i915]] *ERROR* Hangcheck timer elapsed... render ring idle
I am unable to reproduce it manually.

test log:
@test: Intel_gpu_tools/igt_gem_exec_lut_handle
returncode: 0
info: @@@Returncode: 0

test case start at: Tue Feb  3 11:48:20 2015
test case end at:   Tue Feb  3 11:56:49 2015

Errors:


Dmesg:
<3>[ 4329.946633] [drm:i915_hangcheck_elapsed [i915]] *ERROR* Hangcheck timer elapsed... render ring idle


Output:
             command   pid dev master a   uid      magic
Test Environment check: Succeeded.
[1/1] dmesg-warn: 1 |
[1/1] dmesg-warn: 1 /


Thank you for running Piglit!
Results have been written to /GFX/Test/Piglit/piglit/t
{
    "results_version": 2,
    "uname": "Linux x-bsw08 3.19.0-rc7_drm-intel-nightly_0eafcb_20150204+ #38 SMP Wed Feb 4 11:26:26 CST 2015 x86_64 x86_64 x86_64 GNU/Linux\n",
    "time_elapsed": 502.3301010131836,
    "tests": {
        "igt/gem_exec_lut_handle": {
            "dmesg": "[ 4329.946633] [drm:i915_hangcheck_elapsed [i915]] *ERROR* Hangcheck timer elapsed... render ring idle",
            "returncode": 0,
            "err": "",
            "environment": "PIGLIT_SOURCE_DIR=\"/GFX/Test/Piglit/piglit\" PIGLIT_PLATFORM=\"mixed_glx_egl\"",
            "command": "/GFX/Test/Intel_gpu_tools/intel-gpu-tools/tests/gem_exec_lut_handle",
            "result": "dmesg-warn",
            "time": 502.1684060096741,
            "out": "IGT-Version: 1.9-g87edb51 (x86_64) (Linux: 3.19.0-rc7_drm-intel-nightly_0eafcb_20150204+ x86_64)\nrelocation: buffers=   1: old= 408053 + 17.4*reloc, lut=  35950 + 143.5*reloc (ns)\nrelocation: buffers=   2: old=  38386 + 153.0*reloc, lut=  39810 + 142.3*reloc (ns)\nrelocation: buffers=   4: old=  39303 + 156.7*reloc, lut=  39962 + 142.8*reloc (ns)\nrelocation: buffers=   8: old=  41255 + 153.4*reloc, lut=  41122 + 143.0*reloc (ns)\nrelocation: buffers=  16: old=  45337 + 153.3*reloc, lut=  44985 + 143.1*reloc (ns)\nrelocation: buffers=  32: old=  37409 + 159.9*reloc, lut=  39477 + 150.2*reloc (ns)\nrelocation: buffers=  64: old=  70905 + 159.4*reloc, lut=  70185 + 148.5*reloc (ns)\nrelocation: buffers= 128: old=  96056 + 173.7*reloc, lut=  93624 + 161.6*reloc (ns)\nrelocation: buffers= 256: old= 187988 + 182.5*reloc, lut= 183626 + 168.1*reloc (ns)\nrelocation: buffers= 512: old= 376599 + 218.9*reloc, lut= 366951 + 191.3*reloc (ns)\nrelocation: buffers=1024: old=1190077 + 333.2*reloc, lut=1174841 + 264.4*reloc (ns)\nrelocation: buffers=2048: old=3588263 + 496.3*reloc, lut=3563453 + 333.0*reloc (ns)\nskip-relocs: buffers=   1: old=   6681 + 88.3*reloc, lut=   6392 + 81.1*reloc (ns)\nskip-relocs: buffers=   2: old=   7372 + 88.4*reloc, lut=   7124 + 81.1*reloc (ns)\nskip-relocs: buffers=   4: old=   8286 + 88.5*reloc, lut=   7936 + 81.3*reloc (ns)\nskip-relocs: buffers=   8: old=  10099 + 88.6*reloc, lut=   9669 + 81.4*reloc (ns)\nskip-relocs: buffers=  16: old=  12409 + 95.8*reloc, lut=  13120 + 81.6*reloc (ns)\nskip-relocs: buffers=  32: old=  22337 + 89.3*reloc, lut=  21921 + 81.8*reloc (ns)\nskip-relocs: buffers=  64: old=  42431 + 92.3*reloc, lut=  40941 + 83.5*reloc (ns)\nskip-relocs: buffers= 128: old=  95018 + 95.6*reloc, lut=  93127 + 85.7*reloc (ns)\nskip-relocs: buffers= 256: old= 186631 + 104.0*reloc, lut= 182497 + 93.1*reloc (ns)\nskip-relocs: buffers= 512: old= 376347 + 140.4*reloc, lut= 367393 + 113.9*reloc (ns)\nskip-relocs: 
buffers=1024: old=1189452 + 231.7*reloc, lut=1172909 + 162.6*reloc (ns)\nskip-relocs: buffers=2048: old=3580134 + 353.2*reloc, lut=3557845 + 229.6*reloc (ns)\nno-relocs: buffers=   1: old=   6012 + 30.2*reloc, lut=   5861 + 23.8*reloc (ns)\nno-relocs: buffers=   2: old=   6690 + 30.3*reloc, lut=   6561 + 23.9*reloc (ns)\nno-relocs: buffers=   4: old=   7413 + 30.5*reloc, lut=   7335 + 24.0*reloc (ns)\nno-relocs: buffers=   8: old=   9098 + 30.7*reloc, lut=   8881 + 24.1*reloc (ns)\nno-relocs: buffers=  16: old=  12311 + 30.7*reloc, lut=  12038 + 24.2*reloc (ns)\nno-relocs: buffers=  32: old=  20864 + 30.8*reloc, lut=  20221 + 24.1*reloc (ns)\nno-relocs: buffers=  64: old=  40160 + 32.2*reloc, lut=  38479 + 25.6*reloc (ns)\nno-relocs: buffers= 128: old=  90954 + 30.0*reloc, lut=  88742 + 23.0*reloc (ns)\nno-relocs: buffers= 256: old= 179589 + 29.9*reloc, lut= 175230 + 22.9*reloc (ns)\nno-relocs: buffers= 512: old= 362405 + 42.8*reloc, lut= 352873 + 33.3*reloc (ns)\nno-relocs: buffers=1024: old=1151193 + 41.7*reloc, lut=1132095 + 29.7*reloc (ns)\nno-relocs: buffers=2048: old=3488270 + 45.6*reloc, lut=3459551 + 29.1*reloc (ns)\n"

==Reproduce steps==
---------------------------- 
1. Run all IGT test cases.
Comment 1 lu hua 2015-04-07 02:40:55 UTC
Created attachment 114896
dmesg

Tested on the latest -nightly kernel; I can reproduce this error.
IGT-Version: 1.10-g43a1f64 (x86_64) (Linux: 4.0.0-rc6_drm-intel-nightly_333cf6_20150403+ x86_64)
relocation: buffers=   1: old=   9883 + 70.7*reloc, lut=  10415 + 61.6*reloc (ns)
relocation: buffers=   2: old=  43176 + 487.3*reloc, lut=  43624 + 481.3*reloc (ns)
relocation: buffers=   4: old= 421216 + 566.6*reloc, lut=  42036 + 704.6*reloc (ns)
relocation: buffers=   8: old=  46960 + 786.2*reloc, lut=  45953 + 778.8*reloc (ns)
relocation: buffers=  16: old=  42697 + 845.8*reloc, lut=  42976 + 840.9*reloc (ns)
relocation: buffers=  32: old=  51074 + 868.5*reloc, lut=  50122 + 863.6*reloc (ns)
relocation: buffers=  64: old=  74159 + 883.3*reloc, lut=  75096 + 875.2*reloc (ns)
relocation: buffers= 128: old= 112083 + 902.7*reloc, lut= 110199 + 886.1*reloc (ns)
relocation: buffers= 256: old= 203096 + 909.1*reloc, lut= 197321 + 889.1*reloc (ns)
relocation: buffers= 512: old= 371372 + 951.3*reloc, lut= 363720 + 912.0*reloc (ns)
relocation: buffers=1024: old= 986476 + 1015.9*reloc, lut= 979852 + 967.3*reloc (ns)
relocation: buffers=2048: old=3289816 + 1272.0*reloc, lut=3250566 + 1141.5*reloc (ns)
skip-relocs: buffers=   1: old=   9734 + 82.5*reloc, lut=   8585 + 75.3*reloc (ns)
skip-relocs: buffers=   2: old=  11425 + 82.1*reloc, lut=  10737 + 75.1*reloc (ns)
skip-relocs: buffers=   4: old=  12983 + 81.9*reloc, lut=  12029 + 74.9*reloc (ns)
skip-relocs: buffers=   8: old=  11609 + 83.4*reloc, lut=  10706 + 76.2*reloc (ns)
skip-relocs: buffers=  16: old=  14929 + 83.2*reloc, lut=  14512 + 76.2*reloc (ns)
skip-relocs: buffers=  32: old=  29272 + 87.0*reloc, lut=  32612 + 73.2*reloc (ns)
skip-relocs: buffers=  64: old=  60615 + 80.9*reloc, lut=  59457 + 72.5*reloc (ns)
skip-relocs: buffers= 128: old= 105637 + 86.5*reloc, lut= 104245 + 75.9*reloc (ns)
skip-relocs: buffers= 256: old= 194529 + 92.1*reloc, lut= 191189 + 79.4*reloc (ns)
skip-relocs: buffers= 512: old= 363259 + 122.9*reloc, lut= 357833 + 93.6*reloc (ns)
skip-relocs: buffers=1024: old= 984330 + 191.6*reloc, lut= 958018 + 136.1*reloc (ns)
skip-relocs: buffers=2048: old=3273974 + 288.3*reloc, lut=3236932 + 176.8*reloc (ns)
no-relocs: buffers=   1: old=   8085 + 31.5*reloc, lut=   6420 + 25.2*reloc (ns)
no-relocs: buffers=   2: old=   9206 + 31.2*reloc, lut=   8036 + 25.4*reloc (ns)
no-relocs: buffers=   4: old=   7564 + 31.5*reloc, lut=   6405 + 24.6*reloc (ns)
no-relocs: buffers=   8: old=   8911 + 33.8*reloc, lut=   7710 + 25.5*reloc (ns)
no-relocs: buffers=  16: old=  11290 + 31.4*reloc, lut=  11413 + 25.7*reloc (ns)
no-relocs: buffers=  32: old=  31092 + 31.2*reloc, lut=  31972 + 23.1*reloc (ns)
no-relocs: buffers=  64: old=  59256 + 27.6*reloc, lut=  57818 + 22.5*reloc (ns)
no-relocs: buffers= 128: old= 101909 + 27.3*reloc, lut= 100185 + 20.9*reloc (ns)
no-relocs: buffers= 256: old= 186526 + 27.2*reloc, lut= 183392 + 20.6*reloc (ns)
no-relocs: buffers= 512: old= 349405 + 31.3*reloc, lut= 343982 + 23.8*reloc (ns)
no-relocs: buffers=1024: old= 934785 + 49.2*reloc, lut= 917700 + 40.6*reloc (ns)
no-relocs: buffers=2048: old=3183432 + 34.0*reloc, lut=3135389 + 23.5*reloc (ns)
SUCCESS (588.883s)

real    9m49.119s
user    0m18.216s
sys     9m17.545s
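
For comparing the "old" and "lut" timings across runs, the result lines above can be parsed mechanically. The following is a minimal Python sketch, not part of IGT; the function name parse_line and the dictionary layout are hypothetical, and the regular expression is inferred only from the output format shown in this report.

```python
import re

# Pattern inferred from the gem_exec_lut_handle output quoted in this report.
# Each line reports a base cost (ns) plus a per-relocation slope for both
# the "old" and the "lut" execbuffer paths. Slopes may be negative.
LINE_RE = re.compile(
    r"^(?P<mode>[\w-]+): buffers=\s*(?P<buffers>\d+): "
    r"old=\s*(?P<old_base>-?\d+) \+ (?P<old_slope>-?[\d.]+)\*reloc, "
    r"lut=\s*(?P<lut_base>-?\d+) \+ (?P<lut_slope>-?[\d.]+)\*reloc \(ns\)$"
)


def parse_line(line):
    """Return the parsed fields of one result line, or None if it does not match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    return {
        "mode": m["mode"],
        "buffers": int(m["buffers"]),
        "old": (int(m["old_base"]), float(m["old_slope"])),
        "lut": (int(m["lut_base"]), float(m["lut_slope"])),
    }
```

Lines that are not result lines, such as the IGT-Version header or the final SUCCESS line, yield None and can simply be skipped by the caller.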
Comment 2 lu hua 2015-04-16 06:04:26 UTC
It also happens on SKL.
root@x-skly05:/GFX/Test/Intel_gpu_tools/intel-gpu-tools/tests# time ./gem_exec_lut_handle
IGT-Version: 1.10-g2e526ae (x86_64) (Linux: 4.0.0_drm-intel-nightly_5ea91d_20150415+ x86_64)
relocation: buffers=   1: old=  40288 + 143.3*reloc, lut=  43431 + 126.6*reloc (ns)
relocation: buffers=   2: old= 301245 + 52.1*reloc, lut=  46231 + 118.8*reloc (ns)
relocation: buffers=   4: old=  51351 + 108.1*reloc, lut=  46177 + 116.1*reloc (ns)
relocation: buffers=   8: old=  41177 + 138.5*reloc, lut=  42458 + 136.8*reloc (ns)
relocation: buffers=  16: old=  50569 + 118.6*reloc, lut=  43618 + 142.0*reloc (ns)
relocation: buffers=  32: old=  54306 + 128.9*reloc, lut=  56114 + 129.7*reloc (ns)
relocation: buffers=  64: old=  84740 + 169.9*reloc, lut=  81765 + 171.2*reloc (ns)
relocation: buffers= 128: old= 166713 + 155.2*reloc, lut= 158750 + 159.6*reloc (ns)
relocation: buffers= 256: old= 205643 + 187.3*reloc, lut= 200472 + 186.5*reloc (ns)
relocation: buffers= 512: old= 410002 + 230.9*reloc, lut= 393246 + 202.8*reloc (ns)
relocation: buffers=1024: old= 873728 + 200.5*reloc, lut= 843966 + 182.6*reloc (ns)
relocation: buffers=2048: old=1482885 + 182.4*reloc, lut=1510086 + 65.4*reloc (ns)
skip-relocs: buffers=   1: old=   5401 + 46.7*reloc, lut=   5513 + 41.8*reloc (ns)
skip-relocs: buffers=   2: old=   6671 + 48.5*reloc, lut=   6808 + 44.6*reloc (ns)
skip-relocs: buffers=   4: old=   8647 + 39.4*reloc, lut=   8138 + 37.7*reloc (ns)
skip-relocs: buffers=   8: old=   8467 + 33.8*reloc, lut=   7969 + 35.3*reloc (ns)
skip-relocs: buffers=  16: old=  12027 + 39.1*reloc, lut=  12159 + 36.3*reloc (ns)
skip-relocs: buffers=  32: old=  18738 + 45.0*reloc, lut=  20124 + 36.2*reloc (ns)
skip-relocs: buffers=  64: old=  39213 + 38.2*reloc, lut=  36908 + 37.8*reloc (ns)
skip-relocs: buffers= 128: old=  85202 + 47.5*reloc, lut=  83915 + 42.7*reloc (ns)
skip-relocs: buffers= 256: old= 182382 + 75.9*reloc, lut= 179066 + 75.7*reloc (ns)
skip-relocs: buffers= 512: old= 413904 + 78.4*reloc, lut= 408063 + 46.7*reloc (ns)
skip-relocs: buffers=1024: old= 840316 + 150.5*reloc, lut= 830475 + 53.5*reloc (ns)
skip-relocs: buffers=2048: old=1498004 + 55.2*reloc, lut=1509686 + 9.4*reloc (ns)
no-relocs: buffers=   1: old=   3037 + 0.6*reloc, lut=   2936 + 0.6*reloc (ns)
no-relocs: buffers=   2: old=   4230 + 0.6*reloc, lut=   4104 + 0.6*reloc (ns)
no-relocs: buffers=   4: old=   5999 + 0.8*reloc, lut=   5823 + 0.8*reloc (ns)
no-relocs: buffers=   8: old=   9209 + 0.7*reloc, lut=   8752 + 0.7*reloc (ns)
no-relocs: buffers=  16: old=  15045 + 0.8*reloc, lut=  14532 + 0.7*reloc (ns)
no-relocs: buffers=  32: old=  19149 + 0.5*reloc, lut=  18748 + 0.4*reloc (ns)
no-relocs: buffers=  64: old=  47024 + -3.6*reloc, lut=  44442 + -2.8*reloc (ns)
no-relocs: buffers= 128: old=  90780 + -0.7*reloc, lut=  87444 + 0.0*reloc (ns)
no-relocs: buffers= 256: old= 186003 + 1.7*reloc, lut= 180705 + 2.2*reloc (ns)
no-relocs: buffers= 512: old= 401300 + -0.5*reloc, lut= 377329 + 3.7*reloc (ns)
no-relocs: buffers=1024: old= 798101 + 1.5*reloc, lut= 794731 + 1.1*reloc (ns)
no-relocs: buffers=2048: old=1522957 + 1.3*reloc, lut=1490423 + 0.6*reloc (ns)
SUCCESS (294.099s)

real    4m54.146s
user    0m0.900s
sys     4m50.172s
root@x-skly05:/GFX/Test/Intel_gpu_tools/intel-gpu-tools/tests# dmesg -r|egrep "<[1-4]>"|grep drm
<3>[  215.847173] [drm:i915_hangcheck_elapsed [i915]] *ERROR* Hangcheck timer elapsed... render ring idle
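
The dmesg filter used above (priority 1 to 4, mentioning drm) can also be expressed in a script, which is convenient when checking saved logs from automated runs. This is a minimal Python sketch under stated assumptions: the function name filter_drm_errors is hypothetical, and unlike the egrep pattern it only accepts the "<N>" priority marker at the start of the line, as printed by dmesg -r.

```python
import re

# Raw dmesg lines (as printed by `dmesg -r`) begin with a "<N>" priority prefix.
PRIORITY_RE = re.compile(r"^<(\d+)>")


def filter_drm_errors(lines, lo=1, hi=4):
    """Keep lines whose priority is within [lo, hi] and that mention 'drm'."""
    kept = []
    for line in lines:
        m = PRIORITY_RE.match(line)
        if m and lo <= int(m.group(1)) <= hi and "drm" in line:
            kept.append(line)
    return kept
```

Applied to the lines of a saved raw dmesg log, it returns only messages such as the hangcheck error shown above.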
Comment 3 Elio 2015-10-15 22:49:13 UTC
This issue is also present on BYT with the following configuration:

Kernel: http://vanaheimr.fr.intel.com/shared/out/kernels/drm-intel/WW42.1_4.3.0-rc4_c38f2c2/
xorg-server-1.17.2
libdrm-2.4.65
xf86-video-intel-2.99.917
mesa-11.0.2
libva-1.6.1
intel-driver-1.6.1
cairo-1.14.2
intel-gpu-tools-1.12
Comment 4 Elio 2016-01-15 22:56:52 UTC
The same test fails on BXT with the following configuration:
BXT APL	Intel Core (TM) CPU @ 1.2 GHZ	Intel HD Graphics	GPAIDV5440021	GDC363547 
Linux distribution: Ubuntu 15.10 64-bit
Kernel: drm-intel-nightly 4.4.0-rc8 8834392 
libdrm-2.4.66 from http://cgit.freedesktop.org/mesa/drm/
xf86-video-intel 2.99.917 from http://cgit.freedesktop.org/xorg/driver/xf86-video-intel/
mesa-11.0.8 from http://cgit.freedesktop.org/mesa/mesa/
libva-1.6.2 from http://cgit.freedesktop.org/libva/
intel-driver 1.6.2 from http://cgit.freedesktop.org/vaapi/intel-driver
cairo 1.15.2 from http://cgit.freedesktop.org/cairo
IGT master 1.13 cd658a7c6bca74eb897a8ae87751029793f0c4f4 from http://cgit.freedesktop.org/xorg/app/intel-gpu-tools/
DMC and GUC firmware from https://01.org/linuxgraphics/intel-linux-graphics-firmwares
Comment 5 Chris Wilson 2016-01-28 10:15:03 UTC
commit 7c17d377374ddbcfb7873366559fc4ed8b296e11
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Wed Jan 20 15:43:35 2016 +0200

    drm/i915: Use ordered seqno write interrupt generation on gen8+ execlists

