Bug 105176 - [IGT] gem_shrink/pwrite-userptr races on obj->mm.pages_pin_count
Summary: [IGT] gem_shrink/pwrite-userptr races on obj->mm.pages_pin_count
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-02-20 18:43 UTC by Octavio
Modified: 2018-04-09 21:04 UTC (History)
1 user (show)

See Also:
i915 platform: CFL, CNL
i915 features: GEM/Other


Attachments
dmesg log (227.25 KB, text/plain)
2018-02-20 18:43 UTC, Octavio
no flags Details
kernel log (318.09 KB, text/plain)
2018-02-20 18:44 UTC, Octavio
no flags Details
kernel log (CNL QA) (467.10 KB, text/plain)
2018-03-08 17:20 UTC, Hector Velazquez
no flags Details
dmesg -w -H (CNL QA) (37.58 KB, text/plain)
2018-03-08 17:27 UTC, Hector Velazquez
no flags Details

Description Octavio 2018-02-20 18:43:10 UTC
The below test case has a dmesg-warn on CFL

 igt@gem_shrink@pwrite-userptr

Configuration 

======================================
        Graphic stack
======================================

======================================
             Software
======================================
kernel version              : 4.16.0-rc1-drm-intel-qa-ww8-commit-79e91ad+
hostname                    : gfx-desktop
architecture                : x86_64
os version                  : Ubuntu 17.10
os codename                 : artful
kernel driver               : i915
bios revision               : 118.7
bios release date           : 01/04/2018
ksc                         : 1.13
hardware acceleration       : disabled
swap partition              : enabled on (/dev/sda2)

======================================
        Graphic drivers
======================================
grep: /opt/X11R7/var/log/Xorg.0.log: No such file or directory
libdrm                      : 2.4.90
vaapi (intel-driver)        : Intel i965 driver for Intel(R) Coffee Lake - 2.1.1.pre1 (2.0.0-138-gbb92421)
cairo                       : 1.15.11
intel-gpu-tools (tag)       : intel-gpu-tools-1.21-106-g6fc91dbb
intel-gpu-tools (commit)    : 6fc91dbb

======================================
             Hardware
======================================
motherboard model          : CoffeeLakeClientPlatform
motherboard id             : CoffeeLakeHDDR4RVP
form factor                : Laptop
manufacturer               : IntelCorporation
cpu family                 : Other
cpu family id              : 6
cpu information            : Genuine Intel(R) CPU 0000 @ 2.80GHz
gpu card                   : Intel Corporation Device 3e9b (prog-if 00 [VGA controller])
memory ram                 : 31.29 GB
max memory ram             : 32 GB
cpu thread                 : 12
cpu core                   : 6
cpu model                  : 158
cpu stepping               : 10
socket                     : Other
hard drive                 : 74GiB (80GB)
current cd clock frequency : 540000 kHz
maximum cd clock frequency : 675000 kHz
displays connected         : DP-1 DP-2

======================================
             Firmware
======================================
dmc fw loaded             : yes
dmc version               : 1.4
guc fw loaded             : fetch SUCCESS, load SUCCESS
guc version wanted        : wanted 9.39, found 9.39
guc version found         : wanted 9.39, found 9.39

======================================
             kernel parameters
======================================
splash drm.debug=0x1e drm.debug=0x1e intel_iommu=igfx_off auto panic=1 i915.alpha_support=1 i915.enable_guc=-1 nmi_watchdog=panic fsck.repair=yes i915.error_capture=yes


=================================================
Dmesg-warn
=================================================

[  124.593000] [IGT] gem_shrink: executing
[  124.700322] gem_shrink (1589): drop_caches: 4
[  124.758233] [IGT] gem_shrink: starting subtest pwrite-userptr
[  205.856275] WARNING: CPU: 3 PID: 1762 at mm/page_alloc.c:4133 __alloc_pages_slowpath+0xd8c/0xdb0
[  205.856276] Modules linked in: snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic nls_iso8859_1 iwlmvm mac80211 8250_dw intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec snd_hda_core kvm snd_hwdep snd_pcm irqbypass crct10dif_pclmul crc32_pclmul snd_seq_midi ghash_clmulni_intel snd_seq_midi_event pcbc snd_rawmidi snd_seq snd_seq_device snd_timer aesni_intel aes_x86_64 iwlwifi crypto_simd glue_helper cryptd intel_cstate intel_rapl_perf input_leds snd wmi_bmof serio_raw idma64 cfg80211 soundcore virt_dma intel_lpss_pci intel_pch_thermal intel_lpss tpm_crb acpi_pad mac_hid parport_pc ppdev lp parport ip_tables x_tables autofs4 hid_generic usbhid hid uas usb_storage i915 e1000e ahci libahci wmi video
[  205.856308] CPU: 3 PID: 1762 Comm: kworker/u24:8 Tainted: G     U           4.16.0-rc1-drm-intel-qa-ww8-commit-79e91ad+ #1
[  205.856309] Hardware name: Intel Corporation CoffeeLake Client Platform/CoffeeLake H DDR4 RVP, BIOS CNLSFWR1.R00.X118.B07.1801040709 01/04/2018
[  205.856312] Workqueue: events_freezable_power_ disk_events_workfn
[  205.856314] RIP: 0010:__alloc_pages_slowpath+0xd8c/0xdb0
[  205.856315] RSP: 0018:ffffb5ce85cff828 EFLAGS: 00010046
[  205.856316] RAX: 0000000000000000 RBX: 0000000000010000 RCX: 0000000000000000
[  205.856316] RDX: 0000000000000000 RSI: 0000000000000002 RDI: ffff8f2cabffd0b0
[  205.856317] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000f83
[  205.856318] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000015a00
[  205.856318] R13: 0000000000000000 R14: ffffb5ce85cff938 R15: 0000000000000000
[  205.856319] FS:  0000000000000000(0000) GS:ffff8f2c8b2c0000(0000) knlGS:0000000000000000
[  205.856320] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  205.856321] CR2: 00007fa8b3517c20 CR3: 00000002be80a002 CR4: 00000000003606e0
[  205.856321] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  205.856322] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  205.856322] Call Trace:
[  205.856326]  __alloc_pages_nodemask+0x243/0x280
[  205.856328]  new_slab+0x308/0x6f0
[  205.856329]  ___slab_alloc+0x267/0x4b0
[  205.856332]  ? create_object+0x43/0x300
[  205.856334]  ? unwind_next_frame+0x5e/0x4d0
[  205.856336]  ? create_object+0x43/0x300
[  205.856336]  ? __slab_alloc+0x1c/0x30
[  205.856337]  __slab_alloc+0x1c/0x30
[  205.856338]  kmem_cache_alloc+0x194/0x1d0
[  205.856340]  create_object+0x43/0x300
[  205.856342]  kmem_cache_alloc_node+0x138/0x1e0
[  205.856344]  scsi_old_init_rq+0x7e/0x100
[  205.856347]  alloc_request_size+0x4a/0x70
[  205.856349]  mempool_alloc+0x65/0x180
[  205.856351]  ? ret_from_fork+0x35/0x40
[  205.856352]  get_request+0x3e6/0x840
[  205.856355]  ? wait_woken+0x80/0x80
[  205.856356]  blk_get_request_flags+0x103/0x190
[  205.856357]  scsi_execute+0x38/0x250
[  205.856359]  scsi_test_unit_ready+0x75/0xf0
[  205.856360]  sd_check_events+0x125/0x1a0
[  205.856362]  disk_check_events+0x62/0x140
[  205.856364]  process_one_work+0x154/0x3d0
[  205.856366]  worker_thread+0x4a/0x440
[  205.856367]  kthread+0xf5/0x130
[  205.856368]  ? rescuer_thread+0x370/0x370
[  205.856370]  ? kthread_associate_blkcg+0x90/0x90
[  205.856371]  ret_from_fork+0x35/0x40
[  205.856372] Code: e9 cf f2 ff ff f7 44 24 20 00 08 00 00 bb 01 00 00 00 0f 84 de fd ff ff e9 73 ff ff ff 0f ff e9 c1 fc ff ff 0f ff e9 b0 fc ff ff <0f> ff 49 8b 76 08 8b 7c 24 20 44 89 f9 48 c7 c2 28 37 cd b8 e8
[  205.856393] ---[ end trace 881adad28e604a8e ]---
[  205.856394] kmemleak: Cannot allocate a kmemleak_object structure
[  205.856396] kmemleak: Kernel memory leak detector disabled
[  205.856410] kmemleak: Automatic memory scanning thread ended
[  548.798129] [IGT] gem_shrink: exiting, ret=0
Comment 1 Octavio 2018-02-20 18:43:56 UTC
Created attachment 137474 [details]
dmesg log
Comment 2 Octavio 2018-02-20 18:44:19 UTC
Created attachment 137475 [details]
kernel log
Comment 3 Hector Velazquez 2018-02-20 18:51:41 UTC
This tests has the same dmesg-warn on GLK QA 

Tests List:

igt@gem_shrink@execbufx-sanitycheck
igt@gem_shrink@execbufn-sanitycheck
igt@gem_shrink@hang-sanitycheck

IGT-Version: 1.21-gdd61508 (x86_64) (Linux: 4.16.0-rc1-drm-tip-ww8-commit-337c900+ x86_64)

Output:
. . . 	
[  436.014951] WARNING: CPU: 3 PID: 55 at mm/page_alloc.c:4133 __alloc_pages_slowpath+0xd8c/0xdb0
[  436.014954] Modules linked in: spi_pxa2xx_platform 8250_dw intel_rapl intel_telemetry_pltdrv intel_pmc_ipc intel_punit_ipc intel_telemetry_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_cstate intel_rapl_perf input_leds serio_raw wmi_bmof lpc_ich snd_hda_codec_hdmi snd_soc_skl snd_soc_skl_ipc snd_hda_codec_realtek snd_soc_sst_ipc nls_iso8859_1 snd_soc_sst_dsp snd_hda_codec_generic snd_hda_ext_core snd_soc_acpi snd_soc_core snd_compress snd_pcm_dmaengine ac97_bus snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device snd_timer snd idma64 virt_dma mei_me shpchp intel_lpss_pci intel_lpss
[  436.014992]  mei soundcore rfkill_gpio intel_vbtn soc_button_array int3406_thermal dptf_power int3403_thermal int3400_thermal int340x_thermal_zone acpi_thermal_rel intel_hid sparse_keymap mac_hid parport_pc ppdev lp parport ip_tables x_tables autofs4 uas i915 r8169 usb_storage ahci mii libahci i2c_hid wmi hid video
[  436.015013] CPU: 3 PID: 55 Comm: kswapd0 Not tainted 4.16.0-rc1-drm-tip-ww8-commit-337c900+ #1
[  436.015014] Hardware name: Intel Corp. Geminilake/GLK RVP1 DDR4 (05), BIOS GELKRVPA.X64.0077.B50.1712072148 12/07/2017
[  436.015016] RIP: 0010:__alloc_pages_slowpath+0xd8c/0xdb0
[  436.015017] RSP: 0018:ffffb37300e4b588 EFLAGS: 00010046
[  436.015018] RAX: 0000000000000000 RBX: 0000000000010000 RCX: 0000000000000000
[  436.015019] RDX: 0000000000000000 RSI: 0000000000000002 RDI: ffff8991ffffa0b0
[  436.015020] RBP: 0000000001000000 R08: 0000000000000000 R09: 0000000000000f84
[  436.015020] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000001015a00
[  436.015021] R13: 0000000000000000 R14: ffffb37300e4b698 R15: 0000000000000000
[  436.015022] FS:  0000000000000000(0000) GS:ffff8991ffd80000(0000) knlGS:0000000000000000
[  436.015023] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  436.015024] CR2: 00007f5303a20248 CR3: 00000002717f8000 CR4: 0000000000340ee0
[  436.015025] Call Trace:
[  436.015029]  __alloc_pages_nodemask+0x243/0x280
[  436.015032]  new_slab+0x308/0x6f0
[  436.015034]  ? get_partial_node.isra.69+0xed/0x220
[  436.015035]  ___slab_alloc+0x267/0x4b0
[  436.015038]  ? create_object+0x43/0x300
[  436.015041]  ? mempool_alloc+0x65/0x180
[  436.015043]  ? __save_stack_trace+0x82/0x100
[  436.015045]  ? create_object+0x43/0x300
[  436.015046]  ? __slab_alloc+0x1c/0x30
[  436.015047]  __slab_alloc+0x1c/0x30
[  436.015048]  kmem_cache_alloc+0x194/0x1d0
[  436.015050]  ? mempool_alloc+0x65/0x180
[  436.015051]  create_object+0x43/0x300
[  436.015053]  kmem_cache_alloc+0xec/0x1d0
[  436.015055]  mempool_alloc+0x65/0x180
[  436.015058]  bio_alloc_bioset+0xaf/0x220
[  436.015060]  ? SyS_madvise+0x9d0/0x9d0
[  436.015061]  get_swap_bio+0x50/0xf0
[  436.015063]  ? SyS_madvise+0x9d0/0x9d0
[  436.015064]  __swap_writepage+0x2b4/0x390
[  436.015066]  ? __frontswap_store+0x6d/0xf0
[  436.015068]  shmem_writepage+0x1cc/0x2e0
[  436.015070]  pageout.isra.51+0x1db/0x320
[  436.015071]  shrink_page_list+0x9ae/0xbf0
[  436.015073]  shrink_inactive_list+0x2ad/0x620
[  436.015075]  shrink_node_memcg+0x367/0x7e0
[  436.015076]  ? shrink_slab.part.48+0x247/0x3e0
[  436.015078]  ? shrink_node+0xdc/0x330
[  436.015079]  shrink_node+0xdc/0x330
[  436.015081]  kswapd+0x32a/0x730
[  436.015084]  kthread+0xf5/0x130
[  436.015085]  ? mem_cgroup_shrink_node+0x180/0x180
[  436.015086]  ? kthread_associate_blkcg+0x90/0x90
[  436.015090]  ret_from_fork+0x35/0x40
[  436.015091] Code: e9 cf f2 ff ff f7 44 24 20 00 08 00 00 bb 01 00 00 00 0f 84 de fd ff ff e9 73 ff ff ff 0f ff e9 c1 fc ff ff 0f ff e9 b0 fc ff ff <0f> ff 49 8b 76 08 8b 7c 24 20 44 89 f9 48 c7 c2 80 31 ad b0 e8 
[  436.015116] ---[ end trace 89005bd483288459 ]---
[  436.015117] kmemleak: Cannot allocate a kmemleak_object structure
[  445.828518] i915 0000:00:02.0: Resetting rcs0 after gpu hang
. . .
Comment 4 Chris Wilson 2018-02-20 20:02:36 UTC
Ho hum, we lack the serialisation between claiming a pin on the pages and waiting for that to be released. Getting that without introducing a deadlock will be tricky.
Comment 5 Chris Wilson 2018-02-20 20:03:06 UTC
(In reply to Hector Velazquez from comment #3)
> This tests has the same dmesg-warn on GLK QA 

That is nothing to do with this bug.
Comment 6 Chris Wilson 2018-02-20 20:18:50 UTC
Ah, I see. Someone reported the wrong dmesg-warn. The issue here is:

[   80.552326] [IGT] gem_shrink: starting subtest pwrite-userptr
[  341.073195] ------------[ cut here ]------------
[  341.073198] Failed to release pages: bind_count=0, pages_pin_count=1, pin_global=0
[  341.073288] WARNING: CPU: 11 PID: 2038 at drivers/gpu/drm/i915/i915_gem_userptr.c:89 cancel_userptr+0xe5/0xf0 [i915]
[  341.073289] Modules linked in: snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic nls_iso8859_1 iwlmvm mac80211 8250_dw snd_hda_intel intel_rapl x86_pkg_temp_thermal snd_hda_codec intel_powerclamp snd_hda_core coretemp snd_hwdep kvm_intel snd_pcm kvm irqbypass snd_seq_midi snd_seq_midi_event crct10dif_pclmul crc32_pclmul snd_rawmidi ghash_clmulni_intel pcbc snd_seq aesni_intel aes_x86_64 crypto_simd glue_helper snd_seq_device cryptd snd_timer iwlwifi intel_cstate intel_rapl_perf snd input_leds idma64 serio_raw wmi_bmof virt_dma cfg80211 intel_lpss_pci soundcore intel_pch_thermal intel_lpss mac_hid tpm_crb acpi_pad parport_pc ppdev lp parport ip_tables x_tables autofs4 hid_generic usbhid uas hid usb_storage i915 e1000e ahci libahci wmi video
[  341.073319] CPU: 11 PID: 2038 Comm: kworker/u24:30 Tainted: G     U           4.16.0-rc1-drm-intel-qa-ww8-commit-79e91ad+ #1
[  341.073320] Hardware name: Intel Corporation CoffeeLake Client Platform/CoffeeLake H DDR4 RVP, BIOS CNLSFWR1.R00.X118.B07.1801040709 01/04/2018
[  341.073349] Workqueue: i915-userptr-release cancel_userptr [i915]
[  341.073370] RIP: 0010:cancel_userptr+0xe5/0xf0 [i915]
[  341.073371] RSP: 0018:ffff9bc607cc7e60 EFLAGS: 00010282
[  341.073372] RAX: 0000000000000000 RBX: ffff8fb538bd6400 RCX: 0000000000000006
[  341.073373] RDX: 0000000000000007 RSI: 0000000000000082 RDI: ffff8fb70b4d6590
[  341.073373] RBP: ffff8fb538bd65b0 R08: 0000000000000001 R09: 00000000000009e0
[  341.073374] R10: ffff9bc607cc7e60 R11: 00000000000009e0 R12: 0000000000000000
[  341.073375] R13: 0000000000000000 R14: 0ffff8fb6f8a78e6 R15: ffff8fb6f8e7ec00
[  341.073376] FS:  0000000000000000(0000) GS:ffff8fb70b4c0000(0000) knlGS:0000000000000000
[  341.073377] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  341.073378] CR2: 000056295dcd5af5 CR3: 000000049100a005 CR4: 00000000003606e0
[  341.073378] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  341.073379] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  341.073380] Call Trace:
[  341.073388]  process_one_work+0x154/0x3d0
[  341.073390]  worker_thread+0x4a/0x440
[  341.073391]  kthread+0xf5/0x130
[  341.073393]  ? rescuer_thread+0x370/0x370
[  341.073394]  ? kthread_associate_blkcg+0x90/0x90
[  341.073398]  ret_from_fork+0x35/0x40
[  341.073400] Code: 02 d3 fe ff eb c9 8b 93 d0 01 00 00 8b 8b ac 01 00 00 48 c7 c7 08 45 27 c0 8b b3 a4 01 00 00 c6 05 fc 7b 17 00 01 e8 2b 20 0f d2 <0f> ff eb bc 0f 1f 80 00 00 00 00 0f 1f 44 00 00 41 57 41 56 41 
[  341.073421] ---[ end trace 8fa48fec1312957b ]---
Comment 7 Hector Velazquez 2018-03-08 17:19:39 UTC
This tests has a dmesg-warn on CNL QA 
Tests List:
igt@gem_shrink@get-pages-userptr

output:
. . .
Subtest get-pages-userptr: SUCCESS (135.941s)
gem_shrink:822) igt-core-DEBUG: Exiting with status code 0
. . .

kernel log sample:
. . .
[   34.076630] ------------[ cut here ]------------
[   34.076633] Failed to release pages: bind_count=0, pages_pin_count=1, pin_global=0
[   34.076696] WARNING: CPU: 0 PID: 54 at drivers/gpu/drm/i915/i915_gem_userptr.c:89 cancel_userptr+0xe5/0xf0 [i915]
[   34.076697] Modules linked in: snd_hda_codec_hdmi cmac bnep 8250_dw snd_soc_skl snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_hda_ext_core snd_soc_acpi snd_soc_core arc4 nls_iso8859_1 snd_compress snd_hda_codec_realtek snd_hda_codec_generic snd_pcm_dmaengine ac97_bus iwlmvm mac80211 x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel kvm irqbypass crct10dif_pclmul snd_hda_codec crc32_pclmul ghash_clmulni_intel pcbc snd_hda_core snd_hwdep snd_pcm aesni_intel snd_seq_midi aes_x86_64 snd_seq_midi_event crypto_simd glue_helper cryptd snd_rawmidi snd_seq snd_seq_device snd_timer input_leds serio_raw asix usbnet mii wmi_bmof snd iwlwifi btusb btrtl btbcm btintel soundcore bluetooth idma64 shpchp ecdh_generic virt_dma mei_me cfg80211 mei intel_lpss_pci intel_lpss intel_pch_thermal
[   34.076734]  mac_hid acpi_pad parport_pc ppdev lp parport ip_tables x_tables autofs4 uas usb_storage i915 e1000e prime_numbers wmi video
[   34.076746] CPU: 0 PID: 54 Comm: kworker/u8:1 Tainted: G     U  W        4.16.0-rc4-drm-intel-qa-ww10-commit-6c6e100+ #1
[   34.076747] Hardware name: Intel Corporation CannonLake Client Platform/CannonLake Y LPDDR4 RVP, BIOS CNLSFWR1.R00.X124.B02.1802051422 02/05/2018
[   34.076778] Workqueue: i915-userptr-release cancel_userptr [i915]
[   34.076803] RIP: 0010:cancel_userptr+0xe5/0xf0 [i915]
[   34.076804] RSP: 0018:ffffa9f880ecbe60 EFLAGS: 00010282
[   34.076806] RAX: 0000000000000000 RBX: ffff92fbcb3d1e00 RCX: 0000000000000006
[   34.076807] RDX: 0000000000000007 RSI: 0000000000000082 RDI: ffff92fdef8165d0
[   34.076808] RBP: ffff92fbcb3d1fb0 R08: 0000000000000001 R09: 00000000000007c4
[   34.076809] R10: ffffa9f880ecbe60 R11: 00000000000007c4 R12: 0000000000000000
[   34.076810] R13: 0000000000000000 R14: 0ffff92fdda6b9cd R15: ffff92fde67eccc0
[   34.076812] FS:  0000000000000000(0000) GS:ffff92fdef800000(0000) knlGS:0000000000000000
[   34.076813] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   34.076814] CR2: 00007f03910d30a0 CR3: 00000001d4a0a003 CR4: 0000000000760ef0
[   34.076815] PKRU: 55555554
[   34.076816] Call Trace:
[   34.076824]  process_one_work+0x147/0x3c0
[   34.076827]  worker_thread+0x4a/0x440
[   34.076829]  kthread+0xf8/0x130
[   34.076832]  ? rescuer_thread+0x360/0x360
[   34.076833]  ? kthread_associate_blkcg+0x90/0x90
[   34.076836]  ret_from_fork+0x35/0x40
[   34.076838] Code: 82 08 ff ff eb c9 8b 93 d0 01 00 00 8b 8b ac 01 00 00 48 c7 c7 c8 e7 40 c0 8b b3 a4 01 00 00 c6 05 ef 1f 18 00 01 e8 4b 15 56 de <0f> 0b eb bc 0f 1f 80 00 00 00 00 0f 1f 44 00 00 41 57 41 56 41 
[   34.076894] WARNING: CPU: 0 PID: 54 at drivers/gpu/drm/i915/i915_gem_userptr.c:89 cancel_userptr+0xe5/0xf0 [i915]
[   34.076895] ---[ end trace 4575c9034fdf7795 ]---
. . .

dmesg-warn sample
. . .
[  +8.520822] WARNING: CPU: 2 PID: 989 at mm/page_alloc.c:4137 __alloc_pages_slowpath+0xd90/0xdb0
[  +0.000001] Modules linked in: snd_hda_codec_hdmi cmac bnep 8250_dw snd_soc_skl snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_hda_ext_core snd_soc_acpi snd_soc_core arc4 nls_iso8859_1 snd_compress snd_hda_codec_realtek snd_hda_codec_generic snd_pcm_dmaengine ac97_bus iwlmvm mac80211 x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel kvm irqbypass crct10dif_pclmul snd_hda_codec crc32_pclmul ghash_clmulni_intel pcbc snd_hda_core snd_hwdep snd_pcm aesni_intel snd_seq_midi aes_x86_64 snd_seq_midi_event crypto_simd glue_helper cryptd snd_rawmidi snd_seq snd_seq_device snd_timer input_leds serio_raw asix usbnet mii wmi_bmof snd iwlwifi btusb btrtl btbcm btintel soundcore bluetooth idma64 shpchp ecdh_generic virt_dma mei_me cfg80211 mei intel_lpss_pci intel_lpss intel_pch_thermal
[  +0.000044]  mac_hid acpi_pad parport_pc ppdev lp parport ip_tables x_tables autofs4 uas usb_storage i915 e1000e prime_numbers wmi video
[  +0.000011] CPU: 2 PID: 989 Comm: kworker/u9:12 Tainted: G     U           4.16.0-rc4-drm-intel-qa-ww10-commit-6c6e100+ #1
[  +0.000001] Hardware name: Intel Corporation CannonLake Client Platform/CannonLake Y LPDDR4 RVP, BIOS CNLSFWR1.R00.X124.B02.1802051422 02/05/2018
[  +0.000034] Workqueue: i915-userptr-acquire __i915_gem_userptr_get_pages_worker [i915]
[  +0.000005] RIP: 0010:__alloc_pages_slowpath+0xd90/0xdb0
[  +0.000001] RSP: 0018:ffffa9f881d130f8 EFLAGS: 00010046
[  +0.000002] RAX: 0000000000000000 RBX: 0000000000010000 RCX: 0000000000000000
[  +0.000001] RDX: 0000000000000000 RSI: 0000000000000002 RDI: ffff92fdefbfd0b0
[  +0.000001] RBP: 0000000001000000 R08: 0000000000000000 R09: 0000000000000f83
[  +0.000001] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000001015a00
[  +0.000001] R13: 0000000000000000 R14: ffffa9f881d13208 R15: 0000000000000000
[  +0.000002] FS:  0000000000000000(0000) GS:ffff92fdef900000(0000) knlGS:0000000000000000
[  +0.000001] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  +0.000001] CR2: 00007f6644deefa0 CR3: 00000001d4a0a003 CR4: 0000000000760ee0
[  +0.000001] PKRU: 55555554
[  +0.000001] Call Trace:
[  +0.000005]  __alloc_pages_nodemask+0x246/0x280
[  +0.000003]  new_slab+0x309/0x710
[  +0.000002]  ___slab_alloc+0x267/0x4b0
[  +0.000003]  ? create_object+0x43/0x300
[  +0.000003]  ? unwind_next_frame+0x340/0x4e0
[  +0.000003]  ? handle_mm_fault+0xfa/0x200
[  +0.000003]  ? __kernel_text_address+0xe/0x30
[  +0.000002]  ? create_object+0x43/0x300
[  +0.000002]  ? __slab_alloc+0x1c/0x30
[  +0.000002]  __slab_alloc+0x1c/0x30
[  +0.000001]  kmem_cache_alloc+0x197/0x1d0
[  +0.000003]  ? mempool_alloc+0x6a/0x180
[  +0.000001]  create_object+0x43/0x300
[  +0.000002]  ? create_object+0x24e/0x300
[  +0.000002]  kmem_cache_alloc+0xec/0x1d0
[  +0.000002]  mempool_alloc+0x6a/0x180
[  +0.000003]  ? radix_tree_node_alloc.constprop.18+0x8f/0xe0
[  +0.000001] kmemleak: Cannot allocate a kmemleak_object structure
[  +0.000003]  bio_alloc_bioset+0xaf/0x220
[  +0.000000] kmemleak: Kernel memory leak detector disabled
[  +0.000003]  ? end_swap_bio_read+0xb0/0xb0
[  +0.000002]  get_swap_bio+0x50/0xf0
[  +0.000003]  ? end_swap_bio_read+0xb0/0xb0
[  +0.000002]  __swap_writepage+0x2ba/0x3a0
[  +0.000003]  ? __frontswap_store+0x73/0xf0
[  +0.000002]  shmem_writepage+0x1cc/0x2e0
[  +0.000003]  pageout.isra.51+0x1e1/0x330
[  +0.000002]  shrink_page_list+0x9e4/0xc30
[  +0.000003]  shrink_inactive_list+0x2dd/0x670
[  +0.000003]  shrink_node_memcg+0x367/0x7e0
[  +0.000034]  ? shrinker_lock+0x4d/0x90 [i915]
[  +0.000004]  ? shrink_node+0xdc/0x350
[  +0.000002]  shrink_node+0xdc/0x350
[  +0.000002]  do_try_to_free_pages+0xeb/0x360
[  +0.000002]  try_to_free_pages+0xf7/0x1d0
[  +0.000003]  __alloc_pages_slowpath+0x3ad/0xdb0
[  +0.000004]  __alloc_pages_nodemask+0x246/0x280
[  +0.000003]  alloc_pages_vma+0x7c/0x1e0
[  +0.000002]  __handle_mm_fault+0xd07/0x1130
[  +0.000003]  handle_mm_fault+0xfa/0x200
[  +0.000002]  __get_user_pages+0x11a/0x630
[  +0.000002]  ? create_object+0x24e/0x300
[  +0.000003]  get_user_pages_remote+0x137/0x1f0
[  +0.000032]  __i915_gem_userptr_get_pages_worker+0x185/0x230 [i915]
[  +0.000004]  process_one_work+0x147/0x3c0
[  +0.000003]  worker_thread+0x4a/0x440
[  +0.000003]  kthread+0xf8/0x130
[  +0.000002]  ? rescuer_thread+0x360/0x360
[  +0.000002]  ? kthread_associate_blkcg+0x90/0x90
[  +0.000002]  ret_from_fork+0x35/0x40
[  +0.000002] Code: e9 cb f2 ff ff f7 44 24 20 00 08 00 00 bb 01 00 00 00 0f 84 de fd ff ff e9 73 ff ff ff 0f 0b e9 bf fc ff ff 0f 0b e9 ae fc ff ff <0f> 0b 49 8b 76 08 8b 7c 24 20 44 89 f9 48 c7 c2 70 33 8d 9f e8 
[  +0.000036] WARNING: CPU: 2 PID: 989 at mm/page_alloc.c:4137 __alloc_pages_slowpath+0xd90/0xdb0
[  +0.000001] ---[ end trace 4575c9034fdf7794 ]---
. . .
======================================
        Graphic stack
======================================
Component: drm
    tag: libdrm-2.4.89-90-g45eee3f
    commit: 45eee3fd44f4fa253cd4d5c03f48d4a6899f899f

Component: intel-gpu-tools
    tag: intel-gpu-tools-1.21-188-g289202e
    commit: 289202e876688338f439ded7613804ee67034e91

Component: piglit
    tag: piglit-v1
    commit: b8e7cc0e59cf21fe5f98a2d8280cf9a03d02f542
======================================
             Software
======================================
kernel version              : 4.16.0-rc4-drm-intel-qa-ww10-commit-6c6e100+
os version                  : Ubuntu 17.10 artful x86_64
kernel driver               : i915
hardware acceleration       : disabled
swap partition              : enabled on (/dev/nvme0n1p2)
======================================
        Graphic drivers
======================================
modesetting                 : enabled
modesetting compiled for    : 1.19.5 X.Org Video Driver
libdrm                      : 2.4.91
intel-gpu-tools (tag)       : intel-gpu-tools-1.21-188-g289202e8
intel-gpu-tools (commit)    : 289202e8
======================================
             Hardware
======================================
motherboard id             : CannonLakeYLPDDR4RVP
cpu information            : Genuine Intel(R) CPU 0000 @ 1.10GHz
gpu card                   : Intel Corporation Device 5a41 (rev 04) (prog-if 00 [VGA controller])
memory ram                 : 7.63 GB
current cd clock frequency : 168000 kHz
maximum cd clock frequency : 528000 kHz
displays connected         : eDP-1
======================================
             Firmware
======================================
dmc fw loaded             : yes
dmc version               : 1.7
guc fw loaded             : fetch NONE, load NONE
======================================
             kernel parameters
======================================
drm.debug=0x1e intel_iommu=igfx_off fsck.repair=yes i915.alpha_support=1 resume=/dev/nvme0n1p2
Comment 8 Hector Velazquez 2018-03-08 17:20:26 UTC
Created attachment 137902 [details]
kernel log (CNL QA)
Comment 9 Hector Velazquez 2018-03-08 17:27:06 UTC
Created attachment 137903 [details]
dmesg -w -H (CNL QA)
Comment 10 Jani Saarinen 2018-03-29 07:10:40 UTC
First of all. Sorry about spam.
This is mass update for our bugs. 

Sorry if you feel this annoying but with this trying to understand if bug still valid or not.
If bug investigation still in progress, please ignore this and I apologize!

If you think this is not anymore valid, please comment to the bug that can be closed.
If you haven't tested with our latest pre-upstream tree(drm-tip), can you do that also to see if issue is valid there still and if you cannot see issue there, please comment to the bug.
Comment 11 Elizabeth 2018-04-09 21:04:14 UTC
Last seen with 4.16.0-rc5-ww11-commit-e867298+. Closing.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.