Bug 108965 - [CI][BAT] igt@amdgpu_amd_basic@userptr - dmesg-warn - general protection fault: 0000 [#1] PREEMPT SMP PTI
Summary: [CI][BAT] igt@amdgpu_amd_basic@userptr - dmesg-warn - general protection faul...
Status: RESOLVED MOVED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/AMDgpu (show other bugs)
Version: XOrg git
Hardware: Other All
: high normal
Assignee: Default DRI bug account
QA Contact:
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-12-07 11:40 UTC by Martin Peres
Modified: 2019-11-19 09:07 UTC (History)
1 user (show)

See Also:
i915 platform:
i915 features:


Attachments

Description Martin Peres 2018-12-07 11:40:18 UTC
https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5277/fi-kbl-8809g/igt@amdgpu_amd_basic@userptr.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_5279/fi-kbl-8809g/igt@amdgpu_amd_basic@userptr.html

<4> [176.496217] general protection fault: 0000 [#1] PREEMPT SMP PTI
<4> [176.496220] CPU: 5 PID: 3916 Comm: amd_basic Tainted: G     U            4.20.0-rc5-CI-CI_DRM_5279+ #1
<4> [176.496222] Hardware name: Intel Corporation NUC8i7HVK/NUC8i7HVB, BIOS HNKBLi70.86A.0047.2018.0718.1706 07/18/2018
<4> [176.496225] RIP: 0010:__mmu_notifier_release+0x4e/0x100
<4> [176.496226] Code: 31 c0 31 d2 31 f6 48 c7 c7 a0 20 26 82 b9 02 00 00 00 41 89 c4 e8 92 34 ef ff 48 8b bb 50 05 00 00 58 48 8b 1f 48 85 db 74 26 <48> 8b 43 10 48 8b 00 48 85 c0 74 0b 4c 89 f6 48 89 df e8 0b ad a0
<4> [176.496228] RSP: 0018:ffffc90000307db8 EFLAGS: 00010202
<4> [176.496230] RAX: 0000000000000001 RBX: 6b6b6b6b6b6b6b6b RCX: 0000000000000000
<4> [176.496231] RDX: 0000000000000007 RSI: ffffffff821298aa RDI: ffffffff820d7ca7
<4> [176.496232] RBP: ffffc90000307dd0 R08: 00000000d182b757 R09: 0000000000000000
<4> [176.496233] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
<4> [176.496235] R13: ffff8881e7e2d7c8 R14: ffff888258ae8780 R15: ffff888258ae8828
<4> [176.496236] FS:  00007f392bc09980(0000) GS:ffff888276b40000(0000) knlGS:0000000000000000
<4> [176.496238] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [176.496239] CR2: 00007f3929bd02f8 CR3: 000000026f014005 CR4: 00000000003606e0
<4> [176.496240] Call Trace:
<4> [176.496243]  exit_mmap+0x13c/0x180
<4> [176.496246]  ? do_exit+0x586/0xd10
<4> [176.496248]  ? do_exit+0x5ae/0xd10
<4> [176.496251]  mmput+0x5c/0x120
<4> [176.496253]  do_exit+0x5be/0xd10
<4> [176.496256]  do_group_exit+0x34/0xb0
<4> [176.496258]  __x64_sys_exit_group+0xf/0x10
<4> [176.496261]  do_syscall_64+0x55/0x190
<4> [176.496264]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [176.496265] RIP: 0033:0x7f392b071e06
<4> [176.496267] Code: 00 00 00 be 3c 00 00 00 eb 19 66 2e 0f 1f 84 00 00 00 00 00 89 d7 89 f0 0f 05 48 3d 00 f0 ff ff 77 22 f4 89 d7 44 89 c0 0f 05 <48> 3d 00 f0 ff ff 76 e2 f7 d8 64 41 89 01 eb da 66 2e 0f 1f 84 00
<4> [176.496268] RSP: 002b:00007ffda7c7f588 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
<4> [176.496270] RAX: ffffffffffffffda RBX: 00007f392b374740 RCX: 00007f392b071e06
<4> [176.496287] RDX: 0000000000000000 RSI: 000000000000003c RDI: 0000000000000000
<4> [176.496289] RBP: 0000000000000000 R08: 00000000000000e7 R09: ffffffffffffff80
<4> [176.496290] R10: 00007f391f6ff0c0 R11: 0000000000000246 R12: 00007f392b374740
<4> [176.496291] R13: 0000000000000003 R14: 00007f392b37d628 R15: 0000000000000000
<4> [176.496294] Modules linked in: vgem snd_hda_codec_realtek snd_hda_codec_generic amdgpu x86_pkg_temp_thermal coretemp crct10dif_pclmul crc32_pclmul btusb btrtl btbcm btintel snd_hda_codec_hdmi ghash_clmulni_intel bluetooth snd_hda_codec snd_hwdep snd_hda_core chash gpu_sched snd_pcm e1000e ecdh_generic ttm igb i2c_i801 prime_numbers pinctrl_sunrisepoint pinctrl_intel [last unloaded: i915]
Comment 1 CI Bug Log 2019-02-21 12:35:58 UTC
A CI Bug Log filter associated to this bug has been updated:

{- VEGA M: igt@amdgpu_amd_basic@userptr - dmesg-warn - general protection fault: 0000 [#1] PREEMPT SMP PTI -}
{+ VEGA M: igt@amdgpu_amd_basic@(semaphore|userptr) - dmesg-warn - general protection fault: 0000 [#1] PREEMPT SMP PTI +}

 No new failures caught with the new filter
Comment 3 harish.chegondi 2019-11-08 19:55:35 UTC
There are two issues associated with this bug:

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7252/fi-kbl-8809g/igt@amdgpu_amd_basic@semaphore.html

https://intel-gfx-ci.01.org/tree/drm-tip/CI_DRM_7252/fi-kbl-8809g/igt@amdgpu_amd_basic@userptr.html

While the CI buglog indicates that the last failure happened almost 5 months ago and there have been 1517 covered runs since the last failure, the userptr test (link mentioned above) has been failing consistently but with a different failure signature (probably the reason why the issue filter is not catching it?)

The userptr test has been failing recently with "No such device" error with the signature below.

(amd_basic:3914) CRITICAL: Test assertion failure function amdgpu_userptr_test, file ../tests/amdgpu/amd_basic.c:1335:
(amd_basic:3914) CRITICAL: Failed assertion: r == 0
(amd_basic:3914) CRITICAL: Last errno: 19, No such device
(amd_basic:3914) CRITICAL: error: -19 != 0
(amd_basic:3914) igt_core-INFO: Stack trace:
(amd_basic:3914) igt_core-INFO:   #0 ../lib/igt_core.c:1716 __igt_fail_assert()
(amd_basic:3914) igt_core-INFO:   #1 ../tests/amdgpu/amd_basic.c:1324 __real_main1383()
(amd_basic:3914) igt_core-INFO:   #2 [main+0x30]

On the other hand the semaphore test has been consistently passing in the recent runs.
Comment 4 Martin Peres 2019-11-19 09:07:09 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/631.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.