Bug 108561 - [CI][BAT] igt@gem_busy@close-race - dmesg-fail - general protection fault: 0000 [#1] PREEMPT SMP PTI
Summary: [CI][BAT] igt@gem_busy@close-race - dmesg-fail - general protection fault: 00...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel
Version: XOrg git
Hardware: Other All
Importance: highest normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords: regression
Depends on:
Blocks:
 
Reported: 2018-10-26 10:37 UTC by Martin Peres
Modified: 2018-10-30 16:40 UTC

See Also:
i915 platform: BXT, BYT, CFL, GLK, HSW, ICL, ILK, KBL, SKL, SNB
i915 features: GEM/Other


Description Martin Peres 2018-10-26 10:37:29 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5035/shard-kbl6/igt@gem_busy@close-race.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5035/shard-apl2/igt@gem_busy@close-race.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5034/shard-snb1/igt@gem_busy@close-race.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5034/shard-kbl3/igt@gem_busy@close-race.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5034/shard-hsw6/igt@gem_busy@close-race.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5037/shard-skl6/igt@gem_busy@close-race.html

Test requirement not met in function __real_main459, file ../tests/i915/gem_busy.c:526:
Test requirement: has_semaphores(fd)
Starting subtest: close-race
child 0 died with signal 11, Segmentation fault
Subtest close-race: FAIL (4.515s)
Test requirement not met in function gem_require_ring, file ../lib/ioctl_wrappers.c:1486:
Test requirement: gem_has_ring(fd, ring)

<4> [329.188442] general protection fault: 0000 [#1] PREEMPT SMP PTI
<4> [329.188531] CPU: 0 PID: 1246 Comm: gem_busy Tainted: G     U            4.19.0-CI-CI_DRM_5037+ #1
<4> [329.188619] Hardware name: Google Caroline/Caroline, BIOS MrChromebox 08/27/2018
<4> [329.188996] RIP: 0010:i915_gem_busy_ioctl+0x146/0x380 [i915]
<4> [329.189062] Code: 0b 43 04 49 83 c6 08 4d 39 e6 89 43 04 74 6d 4d 8b 3e e8 6d 24 07 e1 85 c0 74 0d 80 3d 08 71 1d 00 00 0f 84 bb 00 00 00 31 c0 <49> 81 7f 08 20 6a 19 a0 75 cc 41 8b 97 50 02 00 00 49 8b 8f a8 00
<4> [329.189219] RSP: 0018:ffffc90000663cf8 EFLAGS: 00010246
<4> [329.189284] RAX: 0000000000000000 RBX: ffffc90000663dc0 RCX: 0000000000000001
<4> [329.189357] RDX: 0000000080000000 RSI: 00000000ffffffff RDI: ffffffff822474a0
<4> [329.189428] RBP: ffffc90000663d28 R08: ffff880177354500 R09: 0000000000000001
<4> [329.189498] R10: 0000000000000040 R11: 0000000000000000 R12: ffff8801414be348
<4> [329.189566] R13: 0000000000000003 R14: ffff8801414be340 R15: 6b6b6b6b6b6b6b6b
<4> [329.189640] FS:  00007f3143c62980(0000) GS:ffff88017aa00000(0000) knlGS:0000000000000000
<4> [329.189718] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [329.189782] CR2: 00007f3143c96000 CR3: 0000000164c62003 CR4: 00000000003606f0
<4> [329.189852] Call Trace:
<4> [329.190182]  ? gtt_set+0x140/0x140 [i915]
<4> [329.190252]  drm_ioctl_kernel+0x81/0xf0
<4> [329.190321]  drm_ioctl+0x2e6/0x3a0
<4> [329.190646]  ? gtt_set+0x140/0x140 [i915]
<4> [329.190725]  ? trace_hardirqs_on_thunk+0x1a/0x1c
<4> [329.190790]  ? lockdep_hardirqs_on+0xe0/0x1b0
<4> [329.190850]  ? trace_hardirqs_on_thunk+0x1a/0x1c
<4> [329.190920]  do_vfs_ioctl+0xa0/0x6d0
<4> [329.190977]  ? retint_kernel+0x2d/0x2d
<4> [329.191044]  ksys_ioctl+0x35/0x60
<4> [329.191101]  __x64_sys_ioctl+0x11/0x20
<4> [329.191154]  do_syscall_64+0x55/0x190
<4> [329.191213]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [329.191275] RIP: 0033:0x7f31430f55d7
<4> [329.191326] Code: b3 66 90 48 8b 05 b1 48 2d 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 81 48 2d 00 f7 d8 64 89 01 48
<4> [329.191487] RSP: 002b:00007ffeb2cf9988 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
<4> [329.191577] RAX: ffffffffffffffda RBX: 00007ffeb2cf9ab0 RCX: 00007f31430f55d7
<4> [329.191650] RDX: 00007ffeb2cf9b70 RSI: 00000000c0086457 RDI: 0000000000000005
<4> [329.191722] RBP: 00007ffeb2cf9b70 R08: 00007f3143c62980 R09: 00007f31435ea2b0
<4> [329.191795] R10: 0000000000000054 R11: 0000000000000246 R12: 00000000c0086457
<4> [329.191867] R13: 0000000000000005 R14: 0000000000000000 R15: 0000000000000000
<4> [329.191964] Modules linked in: vgem snd_hda_codec_hdmi snd_hda_intel snd_hda_codec snd_hwdep snd_hda_core x86_pkg_temp_thermal btusb coretemp crct10dif_pclmul btrtl cdc_ether crc32_pclmul btbcm ghash_clmulni_intel usbnet btintel snd_pcm r8152 bluetooth mii i915 ecdh_generic i2c_hid pinctrl_sunrisepoint pinctrl_intel prime_numbers
<4> [329.393844] ---[ end trace 57742830a7a609d7 ]---
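
Reading the oops above: R15 holds 6b6b6b6b6b6b6b6b, and the marked bytes in the Code: line (49 81 7f 08 ...) appear to decode to a compare against memory at 8(%r15), with R15 loaded from (%r14) a few instructions earlier. 0x6b is the byte the kernel's slab poisoning writes into freed memory (POISON_FREE), and a pointer made of that byte is not a canonical x86-64 address, which would explain a general protection fault rather than an ordinary page fault. The sketch below checks both properties for values copied from the log. It assumes 48-bit (4-level) virtual addressing, and the helper names are hypothetical, not kernel API.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Byte written over freed slab objects when slab poisoning is enabled
 * (POISON_FREE in the Linux kernel's include/linux/poison.h). */
#define POISON_FREE_BYTE 0x6bu

/* Hypothetical helper: true if every byte of value is the free-poison byte. */
static bool is_free_poison(uint64_t value)
{
    for (int i = 0; i < 8; i++) {
        if (((value >> (8 * i)) & 0xffu) != POISON_FREE_BYTE)
            return false;
    }
    return true;
}

/* Hypothetical helper: true if addr is canonical under 48-bit virtual
 * addressing, i.e. bits 63..47 are all zero or all one (assumption:
 * 4-level paging on the affected machines). */
static bool is_canonical_48(uint64_t addr)
{
    uint64_t top = addr >> 47; /* the 17 most significant bits */
    return top == 0 || top == 0x1ffffu;
}

/* Applies the helpers to register values copied from the oops. */
static void check_values_from_log(void)
{
    const uint64_t r15 = 0x6b6b6b6b6b6b6b6bULL; /* faulting base pointer */
    const uint64_t r14 = 0xffff8801414be340ULL; /* slot the pointer was read from */

    assert(is_free_poison(r15));
    assert(!is_canonical_48(r15)); /* dereference raises #GP */
    assert(is_canonical_48(r14));  /* ordinary kernel address */
}
```

Under these assumptions the faulting pointer looks like the contents of a poisoned (freed or never initialised) slot rather than a live fence pointer.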
Comment 1 Chris Wilson 2018-10-26 10:44:41 UTC
https://patchwork.freedesktop.org/series/51579/
Comment 2 Chris Wilson 2018-10-26 14:44:55 UTC
commit a590d0fdbaa56f482ff515e1040b6d9b1b200d63 (HEAD -> drm-misc-next, drm-misc/drm-misc-next)
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Fri Oct 26 09:03:02 2018 +0100

    dma-buf: Update reservation shared_count after adding the new fence
    
    We need to serialise the addition of a new fence into the shared list
    such that the fence is visible before we claim it is there. Otherwise a
    concurrent reader of the shared fence list will see an uninitialised
    fence slot before it is set.
    
      <4> [109.613162] general protection fault: 0000 [#1] PREEMPT SMP PTI
      <4> [109.613177] CPU: 1 PID: 1357 Comm: gem_busy Tainted: G     U            4.19.0-rc8-CI-CI_DRM_5035+ #1
      <4> [109.613189] Hardware name: Dell Inc. XPS 8300  /0Y2MRG, BIOS A06 10/17/2011
      <4> [109.613252] RIP: 0010:i915_gem_busy_ioctl+0x146/0x380 [i915]
      <4> [109.613261] Code: 0b 43 04 49 83 c6 08 4d 39 e6 89 43 04 74 6d 4d 8b 3e e8 5d 54 f4 e0 85 c0 74 0d 80 3d 08 71 1d 00 00
      0f 84 bb 00 00 00 31 c0 <49> 81 7f 08 20 3a 2c a0 75 cc 41 8b 97 50 02 00 00 49 8b 8f a8 00
      <4> [109.613283] RSP: 0018:ffffc9000044bcf8 EFLAGS: 00010246
      <4> [109.613292] RAX: 0000000000000000 RBX: ffffc9000044bdc0 RCX: 0000000000000001
      <4> [109.613302] RDX: 0000000000000000 RSI: 00000000ffffffff RDI: ffffffff822474a0
      <4> [109.613311] RBP: ffffc9000044bd28 R08: ffff88021e158680 R09: 0000000000000001
      <4> [109.613321] R10: 0000000000000040 R11: 0000000000000000 R12: ffff88021e1641b8
      <4> [109.613331] R13: 0000000000000003 R14: ffff88021e1641b0 R15: 6b6b6b6b6b6b6b6b
      <4> [109.613341] FS:  00007f9c9fc84980(0000) GS:ffff880227a40000(0000) knlGS:0000000000000000
      <4> [109.613352] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      <4> [109.613360] CR2: 00007f9c9fcb8000 CR3: 00000002247d4005 CR4: 00000000000606e0
    
    Fixes: 27836b641c1b ("dma-buf: remove shared fence staging in reservation object")
    Testcase: igt/gem_busy/close-race
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Cc: Christian König <christian.koenig@amd.com>
    Cc: Junwei Zhang <Jerry.Zhang@amd.com>
    Cc: Huang Rui <ray.huang@amd.com>
    Cc: Sumit Semwal <sumit.semwal@linaro.org>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20181026080302.11507-1-chris@chris-wilson.co.uk
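
The ordering described in the commit message (make the fence visible before claiming it is there) can be illustrated with a small userspace sketch. This is not the kernel patch: it uses C11 atomics instead of the kernel's RCU and barrier primitives, and the type and function names (shared_list, add_shared_fence, sum_seqnos) are hypothetical. The writer fills the slot first and only then publishes the larger count with release semantics. The reader loads the count with acquire semantics and touches only slots below it.

```c
#include <assert.h>
#include <stdatomic.h>

#define MAX_SHARED 4

/* Hypothetical stand-in for a fence object. */
struct fence {
    int seqno;
};

/* Hypothetical stand-in for a shared fence list. */
struct shared_list {
    atomic_uint shared_count;         /* number of valid slots */
    struct fence *slots[MAX_SHARED];  /* slots >= shared_count are undefined */
};

/* Writer: store the fence into its slot, then publish the new count.
 * The release store orders the slot write before the count update. */
static void add_shared_fence(struct shared_list *list, struct fence *f)
{
    unsigned int count =
        atomic_load_explicit(&list->shared_count, memory_order_relaxed);

    assert(count < MAX_SHARED);
    list->slots[count] = f;
    atomic_store_explicit(&list->shared_count, count + 1,
                          memory_order_release);
}

/* Reader: the acquire load pairs with the writer's release store, so
 * every slot below the observed count has already been written. */
static int sum_seqnos(struct shared_list *list)
{
    unsigned int count =
        atomic_load_explicit(&list->shared_count, memory_order_acquire);
    int sum = 0;

    for (unsigned int i = 0; i < count; i++)
        sum += list->slots[i]->seqno;
    return sum;
}
```

Swapping the two stores in add_shared_fence() would reproduce the shape of the bug: a concurrent reader could observe the larger count and dereference a slot that has not been set yet.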
Comment 3 Martin Peres 2018-10-30 16:40:43 UTC
(In reply to Chris Wilson from comment #2)

It did the trick, thanks!

