Bug 90395

Summary: [BDW]igt/gem_pwrite/huge-cpu causes oom killer
Product: DRI
Reporter: lu hua <huax.lu>
Component: DRM/Intel
Assignee: Intel GFX Bugs mailing list <intel-gfx-bugs>
Status: CLOSED DUPLICATE
QA Contact: Intel GFX Bugs mailing list <intel-gfx-bugs>
Severity: normal
Priority: medium
CC: intel-gfx-bugs
Version: unspecified
Hardware: All
OS: Linux (All)
Whiteboard:
i915 platform:
i915 features:
Attachments: dmesg (flags: none)

Description lu hua 2015-05-11 06:52:51 UTC
Created attachment 115679
dmesg

==System Environment==
--------------------------
Regression: not sure

Non-working platforms: BDW

==kernel==
--------------------------
drm-intel-nightly/a1e469d124cad96cd0d0e149c84f7ebd43ca1893
commit a1e469d124cad96cd0d0e149c84f7ebd43ca1893
Author: Daniel Vetter <daniel.vetter@ffwll.ch>
Date:   Fri May 8 17:48:23 2015 +0200

    drm-intel-nightly: 2015y-05m-08d-15h-47m-50s UTC integration manifest

==Bug detailed description==
-----------------------------
Running this subtest triggers the OOM killer on BDW. This is a new case that appeared recently. Bug 90254 only mentions the call trace; the OOM killer is a separate bug.

output:
IGT-Version: 1.10-g9b0a32d (x86_64) (Linux: 4.1.0-rc2_drm-intel-nightly_a1e469_20150509+ x86_64)
Killed

Call trace:
[    2.681873] WARNING: CPU: 0 PID: 1216 at drivers/gpu/drm/drm_irq.c:1159 drm_wait_one_vblank+0x3b/0x16d [drm]()
[    2.681873] vblank not available on crtc 0, ret=-22
[    2.681876] Modules linked in: i915 button video drm_kms_helper drm
[    2.681878] CPU: 0 PID: 1216 Comm: kworker/u16:3 Not tainted 4.1.0-rc2_drm-intel-nightly_a1e469_20150509+ #398
[    2.681882] Workqueue: events_unbound async_run_entry_fn
[    2.681884]  0000000000000000 0000000000000009 ffffffff817a66cc ffff88014444b8b8
[    2.681885]  ffffffff8103ebde 0000000000000246 ffffffffa0005a47 0000000000000000
[    2.681886]  ffff88014949f000 0000000000000000 ffff880002f49000 0000000000000009
[    2.681886] Call Trace:
[    2.681890]  [<ffffffff817a66cc>] ? dump_stack+0x40/0x50
[    2.681892]  [<ffffffff8103ebde>] ? warn_slowpath_common+0x98/0xb0
[    2.681897]  [<ffffffffa0005a47>] ? drm_wait_one_vblank+0x3b/0x16d [drm]
[    2.681898]  [<ffffffff8103ec3b>] ? warn_slowpath_fmt+0x45/0x4a
[    2.681903]  [<ffffffffa0005a47>] ? drm_wait_one_vblank+0x3b/0x16d [drm]
[    2.681905]  [<ffffffff813fc894>] ? __pm_runtime_resume+0x5b/0x6a
[    2.681942]  [<ffffffffa00d37a3>] ? intel_finish_crtc_commit+0x47/0x10b [i915]
[    2.681945]  [<ffffffffa0056d24>] ? drm_atomic_helper_commit_planes+0x170/0x1a9 [drm_kms_helper]
[    2.681975]  [<ffffffffa00d48df>] ? __intel_set_mode+0x8cb/0x959 [i915]
[    2.682003]  [<ffffffffa00d9da6>] ? intel_crtc_set_config+0x3e6/0x531 [i915]
[    2.682008]  [<ffffffffa0018203>] ? drm_modeset_lock+0x4e/0xa3 [drm]
[    2.682015]  [<ffffffffa000c2be>] ? drm_mode_set_config_internal+0x4e/0xd2 [drm]
[    2.682017]  [<ffffffffa0058fc5>] ? restore_fbdev_mode+0xac/0xc3 [drm_kms_helper]
[    2.682019]  [<ffffffffa005a7c6>] ? drm_fb_helper_restore_fbdev_mode_unlocked+0x1e/0x54 [drm_kms_helper]
[    2.682022]  [<ffffffffa005a82a>] ? drm_fb_helper_set_par+0x2e/0x32 [drm_kms_helper]
[    2.682053]  [<ffffffffa00e65e7>] ? intel_fbdev_set_par+0x11/0x55 [i915]
[    2.682056]  [<ffffffff8137e8be>] ? fbcon_init+0x2fd/0x406
[    2.682058]  [<ffffffff813d432f>] ? visual_init+0xaf/0x102
[    2.682059]  [<ffffffff813d5881>] ? do_bind_con_driver+0x19e/0x2c2
[    2.682061]  [<ffffffff813d5c59>] ? do_take_over_console+0x12c/0x15c
[    2.682062]  [<ffffffff8137dfad>] ? do_fbcon_takeover+0x53/0x97
[    2.682065]  [<ffffffff810549dc>] ? notifier_call_chain+0x35/0x59
[    2.682067]  [<ffffffff81054c23>] ? __blocking_notifier_call_chain+0x43/0x5b
[    2.682069]  [<ffffffff8138603b>] ? lock_fb_info+0x12/0x2f
[    2.682071]  [<ffffffff81387856>] ? register_framebuffer+0x26c/0x2a2
[    2.682073]  [<ffffffffa005aadb>] ? drm_fb_helper_initial_config+0x2ad/0x34a [drm_kms_helper]
[    2.682075]  [<ffffffff81055b8f>] ? async_run_entry_fn+0x2d/0xbf
[    2.682076]  [<ffffffff8104f985>] ? process_one_work+0x1b2/0x31d
[    2.682078]  [<ffffffff8105026f>] ? worker_thread+0x265/0x351
[    2.682079]  [<ffffffff8105000a>] ? cancel_delayed_work_sync+0xa/0xa
[    2.682081]  [<ffffffff81053ee1>] ? kthread+0xce/0xd6
[    2.682082]  [<ffffffff81053e13>] ? kthread_create_on_node+0x162/0x162
[    2.682084]  [<ffffffff817ac5d2>] ? ret_from_fork+0x42/0x70
[    2.682086]  [<ffffffff81053e13>] ? kthread_create_on_node+0x162/0x162
[    2.682087] ---[ end trace 5a4c0d0700e2a099 ]---

dmesg:
[  352.761473] [ pid ]   uid  tgid total_vm      rss nr_ptes nr_pmds swapents oom_score_adj name
[  352.761478] [ 2817]     0  2817     4493       63      14       3        1             0 sh
[  352.761481] [ 2846]     0  2846     4480      606      15       3        1             0 initctl
[  352.761484] [ 2848]     0  2848     8255      120      19       3        0             0 mountall
[  352.761487] [ 2940]     0  2940     5035      198      14       3        0             0 upstart-udev-br
[  352.761490] [ 2945]     0  2945    12563      329      30       3        0         -1000 systemd-udevd
[  352.761493] [ 3876]     0  3876     5857       69      18       3        0             0 rpcbind
[  352.761496] [ 3966]   102  3966     9892      166      23       3        0             0 dbus-daemon
[  352.761498] [ 3969]     0  3969     7444       61      19       3        0             0 rpc.idmapd
[  352.761501] [ 3980]   117  3980     5388      114      16       3        0             0 rpc.statd
[  352.761504] [ 4034]     0  4034    82549      274      65       3        0             0 ModemManager
[  352.761507] [ 4055]     0  4055    10864       86      26       3        0             0 systemd-logind
[  352.761509] [ 4092]     0  4092    89381      637      72       3        0             0 NetworkManager
[  352.761512] [ 4111]     0  4111    73632      197      48       3        0             0 polkitd
[  352.761514] [ 4126]   101  4126    65535      181      30       4        0             0 rsyslogd
[  352.761517] [ 4174]     0  4174    19215      271      41       3        0             0 cupsd
[  352.761520] [ 4181]   111  4181     8089       75      22       3        0             0 avahi-daemon
[  352.761523] [ 4186]   111  4186     8058       62      21       3        0             0 avahi-daemon
[  352.761525] [ 4205]     0  4205     5006       41      12       3        0             0 getty
[  352.761528] [ 4209]     0  4209     5006       40      12       3        0             0 getty
[  352.761531] [ 4216]     0  4216     5006       41      13       3        0             0 getty
[  352.761533] [ 4217]     0  4217     5006       41      13       3        0             0 getty
[  352.761536] [ 4220]     0  4220     5006       40      13       3        0             0 getty
[  352.761539] [ 4247]     0  4247    15343      170      36       3        0         -1000 sshd
[  352.761541] [ 4250]     0  4250     4799       55      14       3        0             0 irqbalance
[  352.761544] [ 4252]     0  4252     5916       63      17       3        0             0 cron
[  352.761547] [ 4256]     0  4256     1094       45       8       3        0             0 acpid
[  352.761549] [ 4293]   109  4293   109280      352      78       3        0             0 whoopsie
[  352.761552] [ 4301]   106  4301     9288       82      22       3        0             0 kerneloops
[  352.761554] [ 4335]     0  4335    18840      222      41       3        0             0 cups-browsed
[  352.761557] [ 4440]     0  4440     3920      166      13       3        0             0 upstart-file-br
[  352.761560] [ 4453]     0  4453     4048      284      13       3        0             0 upstart-socket-
[  352.761563] [ 4472]     0  4472     5006       41      13       3        0             0 getty
[  352.761566] [ 4636]     0  4636     2560      567      10       3        7             0 dhclient
[  352.761569] [ 4641] 65534  4641     8808       64      22       3        0             0 dnsmasq
[  352.761570] [ 4841]     0  4841    27447      252      57       3        0             0 sshd
[  352.761572] [ 4917]     0  4917     6835      644      19       3        0             0 bash
[  352.761574] [ 4937]     0  4937    19611      130      41       5        0          1000 gem_pwrite
[  352.761576] Out of memory: Kill process 4937 (gem_pwrite) score 1000 or sacrifice child
[  352.761610] Killed process 4937 (gem_pwrite) total-vm:78444kB, anon-rss:516kB, file-rss:4kB
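
Editor's note: the following is a minimal sketch, not part of the original report, for reading the process table above. It assumes the column layout shown in this dmesg excerpt and a 4 KiB page size (typical for x86_64); the names ROW, parse_oom_table and SAMPLE are hypothetical. It picks out the entry with the highest oom_score_adj and converts its rss from pages to kB.

```python
import re

# One row of the OOM killer's process table as printed in this dmesg excerpt.
# The header row ("[ pid ]   uid ...") does not match because "pid" is not numeric.
ROW = re.compile(
    r"^\[\s*\d+\.\d+\]\s+\[\s*(?P<pid>\d+)\]\s+(?P<uid>\d+)\s+(?P<tgid>\d+)\s+"
    r"(?P<total_vm>\d+)\s+(?P<rss>\d+)\s+(?P<nr_ptes>\d+)\s+(?P<nr_pmds>\d+)\s+"
    r"(?P<swapents>\d+)\s+(?P<oom_score_adj>-?\d+)\s+(?P<name>.+?)\s*$"
)

PAGE_KB = 4  # assumption: 4 KiB pages


def parse_oom_table(dmesg_text):
    """Return one dict per process row; numeric columns are converted to int."""
    rows = []
    for line in dmesg_text.splitlines():
        m = ROW.match(line)
        if m:
            rows.append(
                {k: (v if k == "name" else int(v)) for k, v in m.groupdict().items()}
            )
    return rows


# Three rows copied from the dmesg output above, plus the header line.
SAMPLE = """\
[  352.761473] [ pid ]   uid  tgid total_vm      rss nr_ptes nr_pmds swapents oom_score_adj name
[  352.761539] [ 4247]     0  4247    15343      170      36       3        0         -1000 sshd
[  352.761572] [ 4917]     0  4917     6835      644      19       3        0             0 bash
[  352.761574] [ 4937]     0  4937    19611      130      41       5        0          1000 gem_pwrite
"""

rows = parse_oom_table(SAMPLE)
victim = max(rows, key=lambda r: r["oom_score_adj"])
victim_rss_kb = victim["rss"] * PAGE_KB
victim_vm_kb = victim["total_vm"] * PAGE_KB
print(victim["name"], victim_rss_kb, victim_vm_kb)
```

Under the 4 KiB assumption, the gem_pwrite row works out to 520 kB resident and 78444 kB virtual, which agrees with the "Killed process" line (anon-rss 516 kB plus file-rss 4 kB, total-vm 78444 kB). The process itself was small and carried oom_score_adj 1000, which suggests the memory pressure came from allocations not counted in per-process RSS.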

==Reproduce steps==
---------------------------- 
1.  ./gem_pwrite --run-subtest huge-cpu
Comment 1 Ander Conselvan de Oliveira 2015-05-12 09:02:31 UTC
I believe bug 90254, comment 1 applies here.
Comment 2 lu hua 2015-05-13 02:00:32 UTC
Bug 90254 mentions that the test runs for 25 minutes without exiting, and we don't see the OOM killer there.
I am not sure whether it is a real OOM kill or only a timeout, so I reported this bug to track the OOM killer.
I should try giving the test in bug 90254 enough time to run.
If you can confirm they are duplicates, please mark this one as a duplicate. Thanks.
Comment 3 lu hua 2015-05-13 02:03:49 UTC
(In reply to lu hua from comment #2)
> Bug 90254 mentions that the test runs for 25 minutes without exiting, and
> we don't see the OOM killer there.
> I am not sure whether it is a real OOM kill or only a timeout, so I reported
> this bug to track the OOM killer.
> I should try giving the test in bug 90254 enough time to run.
> If you can confirm they are duplicates, please mark this one as a duplicate. Thanks.

Sorry, I re-checked the dmesg in bug 90254; it shows the OOM killer issue as well.

*** This bug has been marked as a duplicate of bug 90254 ***
