Bug 108799 - [CI][BAT] igt@i915_selftest@live_sanitycheck - incomplete - BUG: unable to handle kernel (paging request|NULL pointer dereference) at
Summary: [CI][BAT] igt@i915_selftest@live_sanitycheck - incomplete - BUG: unable to ha...
Status: CLOSED FIXED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: XOrg git
Hardware: Other All
: medium normal
Assignee: Intel GFX Bugs mailing list
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-11-19 16:35 UTC by Martin Peres
Modified: 2018-12-28 08:41 UTC (History)
1 user (show)

See Also:
i915 platform: I915G
i915 features: GEM/Other


Attachments

Description Martin Peres 2018-11-19 16:35:46 UTC
<6> [300.469975] Console: switching to colour dummy device 80x25
<6> [300.470148] [IGT] drv_selftest: executing
<7> [300.553067] [drm:intel_disable_pipe [i915]] disabling pipe A
<7> [300.567102] [drm:i9xx_get_fifo_size [i915]] FIFO size - (0x00001d9c) A: 28
<7> [300.567169] [drm:i9xx_get_fifo_size [i915]] FIFO size - (0x00001d9c) B: 31
<7> [300.567231] [drm:i9xx_update_wm [i915]] FIFO watermarks - A: 26, B: 29
<7> [300.567294] [drm:i9xx_update_wm [i915]] Setting FIFO watermarks - A: 26, B: 29, C: 2, SR 1
<7> [300.567384] [drm:intel_atomic_commit_tail [i915]] [ENCODER:39:CRT]
<7> [300.567470] [drm:intel_atomic_commit_tail [i915]] [ENCODER:40:SDVO B]
<7> [300.567550] [drm:verify_connector_state.isra.70 [i915]] [CONNECTOR:38:VGA-1]
<7> [300.567691] [drm:intel_atomic_commit_tail [i915]] [CRTC:32:pipe A]
<6> [300.682598] [IGT] drv_selftest: starting subtest live_sanitycheck
<5> [301.613299] Setting dangerous option live_selftests - tainting kernel
<7> [301.715207] [drm:i915_driver_load [i915]] No PCH found.
<7> [301.715291] [drm:i915_driver_load [i915]] WOPCM size: 1024KiB
<7> [301.715399] [drm:intel_uc_init_early [i915]] enable_guc=0 (submission:no huc:no)
<7> [301.715500] [drm:intel_uc_init_early [i915]] guc_log_level=0 (enabled:no, verbose:no, verbosity:0)
<7> [301.715586] [drm:intel_power_domains_init [i915]] Allowed DC state mask 00
<6> [301.719629] [drm] Display disabled (module parameter)
<6> [301.719643] pci 0000:00:00.0: Intel 915G Chipset
<6> [301.719708] pci 0000:00:00.0: detected gtt size: 262144K total, 262144K mappable
<6> [301.723227] pci 0000:00:00.0: detected 8192K stolen memory
<7> [301.723415] [drm:i915_ggtt_probe_hw [i915]] GGTT size = 256M
<7> [301.723507] [drm:i915_ggtt_probe_hw [i915]] GMADR size = 256M
<7> [301.727546] [drm:i915_ggtt_probe_hw [i915]] DSM size = 8M
<6> [301.731464] [drm] Replacing VGA console driver
<7> [301.739093] [drm:i915_gem_init_stolen [i915]] Memory reserved for graphics device: 8192K, usable: 8192K
<7> [301.739862] [drm:intel_gvt_init [i915]] GVT-g is disabled by kernel params
<7> [301.741256] [drm:intel_opregion_setup [i915]] graphic opregion physical addr: 0x0
<7> [301.743600] [drm:intel_opregion_setup [i915]] ACPI OpRegion not supported!
<7> [301.746571] [drm:intel_bios_init [i915]] Skipping VBT init due to disabled display.
<7> [301.752015] [drm:intel_power_well_enable [i915]] enabling always-on
<7> [301.754489] [drm:i915_irq_handler [i915]] EIR stuck: 0x00000010, masked
<7> [301.765806] [drm:intel_fbc_init [i915]] Sanitized enable_fbc value: 0
<7> [301.767395] [drm:intel_modeset_init [i915]] 0 display pipe available.
<7> [301.769063] [drm:intel_dump_cdclk_state [i915]] Current CDCLK 333333 kHz, VCO 0 kHz, ref 0 kHz, bypass 0 kHz, voltage level 0
<7> [301.771329] [drm:intel_update_max_cdclk [i915]] Max CD clock rate: 333333 kHz
<7> [301.774508] [drm:intel_modeset_init [i915]] Max dotclock rate: 599999 kHz
<7> [301.794215] [drm:i915_gem_init_ggtt [i915]] clearing unused GTT space: [1000, 10000000]
<1> [301.797515] BUG: unable to handle kernel NULL pointer dereference at 0000000000000001
<6> [301.797540] PGD 0 P4D 0 
<4> [301.797553] Oops: 0002 [#1] PREEMPT SMP PTI
<4> [301.797566] CPU: 0 PID: 1765 Comm: drv_selftest Tainted: G U 4.20.0-rc1-g756a0fd616c3-repeat_14+ #1
<4> [301.797587] Hardware name: Dell Inc. OptiPlex GX280 /0G8310, BIOS A04 02/09/2005
<4> [301.797735] RIP: 0010:i915_gem_init_ggtt+0xef/0x160 [i915]
<4> [301.797750] Code: 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e <2e> 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01 30 6e 2e 01
<4> [301.797784] RSP: 0018:ffffa06900233a58 EFLAGS: 00010246
<4> [301.797797] RAX: 0000000000000001 RBX: ffff9715f38c0000 RCX: ffffffffc0940920
<4> [301.797813] RDX: ffffa0690197fffc RSI: 000000000000ffff RDI: 000000002e6e3001
<4> [301.797828] RBP: 0000000000001000 R08: 000000002b0bcc18 R09: 0000000000000000
<4> [301.797843] R10: 0000000000000000 R11: 0000000000000000 R12: ffff9715f38c8ae8
<4> [301.797858] R13: 000000000ffff000 R14: ffff9715f38c8f40 R15: 0000000000000000
<4> [301.797874] FS: 00007f809b561980(0000) GS:ffff9715fe000000(0000) knlGS:0000000000000000
<4> [301.797892] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [301.797905] CR2: 0000000000000001 CR3: 0000000030e0e000 CR4: 00000000000006f0
<4> [301.797921] Call Trace:
<4> [301.798027] i915_gem_init+0xe2/0x850 [i915]
<4> [301.798116] i915_driver_load+0xc81/0x1530 [i915]
<4> [301.798136] ? _raw_spin_unlock_irqrestore+0x4c/0x60
<4> [301.798152] ? lockdep_hardirqs_on+0xe0/0x1b0
<4> [301.798243] i915_pci_probe+0x29/0xa0 [i915]
<4> [301.798258] pci_device_probe+0xa1/0x130
<4> [301.798272] really_probe+0xf3/0x3e0
<4> [301.798286] driver_probe_device+0x10a/0x120
<4> [301.798299] __driver_attach+0xdb/0x100
<4> [301.798311] ? driver_probe_device+0x120/0x120
<4> [301.798324] ? driver_probe_device+0x120/0x120
<4> [301.798338] bus_for_each_dev+0x74/0xc0
<4> [301.798352] bus_add_driver+0x15f/0x250
<4> [301.798364] ? 0xffffffffc0108000
<4> [301.798376] driver_register+0x56/0xe0
<4> [301.798387] ? 0xffffffffc0108000
<4> [301.798399] do_one_initcall+0x58/0x2e0
<4> [301.798413] ? do_init_module+0x1d/0x1ea
<4> [301.798425] ? rcu_read_lock_sched_held+0x6f/0x80
<4> [301.798439] ? kmem_cache_alloc_trace+0x264/0x290
<4> [301.798454] do_init_module+0x56/0x1ea
<4> [301.798467] load_module+0x2714/0x29f0
<4> [301.798491] ? __se_sys_finit_module+0xd3/0xf0
<4> [301.798503] __se_sys_finit_module+0xd3/0xf0
<4> [301.798522] do_syscall_64+0x55/0x190
<4> [301.798535] entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [301.798548] RIP: 0033:0x7f809ae29839
<4> [301.798560] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [301.798596] RSP: 002b:00007fff4ab513b8 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [301.798614] RAX: ffffffffffffffda RBX: 0000561d99a2d0b0 RCX: 00007f809ae29839
<4> [301.798630] RDX: 0000000000000000 RSI: 0000561d99a22cd0 RDI: 0000000000000006
<4> [301.798646] RBP: 0000561d99a22cd0 R08: 0000000000000004 R09: 0000000000000000
<4> [301.798662] R10: 00007fff4ab51530 R11: 0000000000000246 R12: 0000000000000000
<4> [301.798678] R13: 0000561d99a20230 R14: 0000000000000020 R15: 000000000000003f
<4> [301.798699] Modules linked in: i915(+) amdgpu chash gpu_sched ttm snd_hda_codec snd_hwdep snd_hda_core snd_pcm vgem lpc_ich tg3 prime_numbers [last unloaded: i915]



<6> [306.738641] Console: switching to colour dummy device 80x25
<6> [306.738817] [IGT] drv_selftest: executing
<7> [306.822113] [drm:intel_disable_pipe [i915]] disabling pipe A
<7> [306.835642] [drm:i9xx_get_fifo_size [i915]] FIFO size - (0x00001d9c) A: 28
<7> [306.835709] [drm:i9xx_get_fifo_size [i915]] FIFO size - (0x00001d9c) B: 31
<7> [306.835771] [drm:i9xx_update_wm [i915]] FIFO watermarks - A: 26, B: 29
<7> [306.835834] [drm:i9xx_update_wm [i915]] Setting FIFO watermarks - A: 26, B: 29, C: 2, SR 1
<7> [306.835924] [drm:intel_atomic_commit_tail [i915]] [ENCODER:39:CRT]
<7> [306.836010] [drm:intel_atomic_commit_tail [i915]] [ENCODER:40:SDVO B]
<7> [306.836090] [drm:verify_connector_state.isra.70 [i915]] [CONNECTOR:38:VGA-1]
<7> [306.836230] [drm:intel_atomic_commit_tail [i915]] [CRTC:32:pipe A]
<6> [306.949083] [IGT] drv_selftest: starting subtest live_sanitycheck
<5> [307.879029] Setting dangerous option live_selftests - tainting kernel
<7> [307.981693] [drm:i915_driver_load [i915]] No PCH found.
<7> [307.981778] [drm:i915_driver_load [i915]] WOPCM size: 1024KiB
<7> [307.981890] [drm:intel_uc_init_early [i915]] enable_guc=0 (submission:no huc:no)
<7> [307.981995] [drm:intel_uc_init_early [i915]] guc_log_level=0 (enabled:no, verbose:no, verbosity:0)
<7> [307.982083] [drm:intel_power_domains_init [i915]] Allowed DC state mask 00
<6> [307.986092] [drm] Display disabled (module parameter)
<6> [307.986106] pci 0000:00:00.0: Intel 915G Chipset
<6> [307.986171] pci 0000:00:00.0: detected gtt size: 262144K total, 262144K mappable
<6> [307.987037] pci 0000:00:00.0: detected 8192K stolen memory
<7> [307.987202] [drm:i915_ggtt_probe_hw [i915]] GGTT size = 256M
<7> [307.987293] [drm:i915_ggtt_probe_hw [i915]] GMADR size = 256M
<7> [307.990056] [drm:i915_ggtt_probe_hw [i915]] DSM size = 8M
<6> [307.994463] [drm] Replacing VGA console driver
<7> [308.002170] [drm:i915_gem_init_stolen [i915]] Memory reserved for graphics device: 8192K, usable: 8192K
<7> [308.003928] [drm:intel_gvt_init [i915]] GVT-g is disabled by kernel params
<7> [308.006937] [drm:intel_opregion_setup [i915]] graphic opregion physical addr: 0x0
<7> [308.009187] [drm:intel_opregion_setup [i915]] ACPI OpRegion not supported!
<7> [308.012029] [drm:intel_bios_init [i915]] Skipping VBT init due to disabled display.
<7> [308.017861] [drm:intel_power_well_enable [i915]] enabling always-on
<7> [308.024759] [drm:i915_irq_handler [i915]] EIR stuck: 0x00000010, masked
<7> [308.031238] [drm:intel_fbc_init [i915]] Sanitized enable_fbc value: 0
<7> [308.031381] [drm:intel_modeset_init [i915]] 0 display pipe available.
<7> [308.031486] [drm:intel_dump_cdclk_state [i915]] Current CDCLK 333333 kHz, VCO 0 kHz, ref 0 kHz, bypass 0 kHz, voltage level 0
<7> [308.031587] [drm:intel_update_max_cdclk [i915]] Max CD clock rate: 333333 kHz
<7> [308.031688] [drm:intel_modeset_init [i915]] Max dotclock rate: 599999 kHz
<7> [308.047252] [drm:i915_gem_init_ggtt [i915]] clearing unused GTT space: [1000, 10000000]
<1> [308.059166] BUG: unable to handle kernel paging request at ffffffff90010fdd
<6> [308.059189] PGD 33212067 P4D 33212067 PUD 33213063 PMD 0 
<4> [308.059207] Oops: 0002 [#1] PREEMPT SMP PTI
<4> [308.059221] CPU: 0 PID: 1830 Comm: drv_selftest Tainted: G U 4.20.0-rc1-g756a0fd616c3-repeat_14+ #1
<4> [308.059242] Hardware name: Dell Inc. OptiPlex GX280 /0G8310, BIOS A04 02/09/2005
<4> [308.059404] RIP: 0010:intel_ctx_workarounds_init+0x0/0x560 [i915]
<4> [308.059419] Code: dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f <01> 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90 dd 0f 01 90
<4> [308.059455] RSP: 0000:ffffac4780253a78 EFLAGS: 00010246
<4> [308.059468] RAX: 0000000000000000 RBX: ffffa28630130000 RCX: ffffffffc0ce8920
<4> [308.059485] RDX: ffffac47819ffffc RSI: 000000000000ffff RDI: ffffa28630130000
<4> [308.059500] RBP: ffffa28630130068 R08: 00000000ce6bdc3b R09: 0000000000000000
<4> [308.059516] R10: 0000000000000000 R11: 0000000000000000 R12: ffffa2863d578008
<4> [308.059532] R13: ffffa2863013ad40 R14: ffffa28630130d58 R15: 0000000000000000
<4> [308.059548] FS: 00007f82e1523980(0000) GS:ffffa2863e000000(0000) knlGS:0000000000000000
<4> [308.059566] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [308.059579] CR2: ffffffff90010fdd CR3: 0000000022c60000 CR4: 00000000000006f0
<4> [308.059595] Call Trace:
<4> [308.059690] i915_gem_contexts_init+0x2a/0x220 [i915]
<4> [308.059794] i915_gem_init+0x1e5/0x850 [i915]
<4> [308.059884] i915_driver_load+0xc81/0x1530 [i915]
<4> [308.059904] ? _raw_spin_unlock_irqrestore+0x4c/0x60
<4> [308.059921] ? lockdep_hardirqs_on+0xe0/0x1b0
<4> [308.060012] i915_pci_probe+0x29/0xa0 [i915]
<4> [308.060028] pci_device_probe+0xa1/0x130
<4> [308.060043] really_probe+0xf3/0x3e0
<4> [308.060057] driver_probe_device+0x10a/0x120
<4> [308.060071] __driver_attach+0xdb/0x100
<4> [308.060083] ? driver_probe_device+0x120/0x120
<4> [308.060096] ? driver_probe_device+0x120/0x120
<4> [308.060111] bus_for_each_dev+0x74/0xc0
<4> [308.060125] bus_add_driver+0x15f/0x250
<4> [308.060140] ? 0xffffffffc04b9000
<4> [308.060153] driver_register+0x56/0xe0
<4> [308.060165] ? 0xffffffffc04b9000
<4> [308.060176] do_one_initcall+0x58/0x2e0
<4> [308.060190] ? do_init_module+0x1d/0x1ea
<4> [308.060202] ? rcu_read_lock_sched_held+0x6f/0x80
<4> [308.060216] ? kmem_cache_alloc_trace+0x264/0x290
<4> [308.060231] do_init_module+0x56/0x1ea
<4> [308.060244] load_module+0x2714/0x29f0
<4> [308.060268] ? __se_sys_finit_module+0xd3/0xf0
<4> [308.060280] __se_sys_finit_module+0xd3/0xf0
<4> [308.060299] do_syscall_64+0x55/0x190
<4> [308.060311] entry_SYSCALL_64_after_hwframe+0x49/0xbe
<4> [308.060325] RIP: 0033:0x7f82e0deb839
<4> [308.060336] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
<4> [308.060371] RSP: 002b:00007ffd550bb478 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [308.060390] RAX: ffffffffffffffda RBX: 0000556fbc2340b0 RCX: 00007f82e0deb839
<4> [308.060405] RDX: 0000000000000000 RSI: 0000556fbc22ae10 RDI: 0000000000000006
<4> [308.060420] RBP: 0000556fbc22ae10 R08: 0000000000000004 R09: 0000000000000000
<4> [308.060435] R10: 00007ffd550bb5f0 R11: 0000000000000246 R12: 0000000000000000
<4> [308.060451] R13: 0000556fbc226c40 R14: 0000000000000020 R15: 000000000000003f
<4> [308.060472] Modules linked in: i915(+) amdgpu chash gpu_sched ttm snd_hda_codec snd_hwdep snd_hda_core snd_pcm vgem lpc_ich tg3 prime_numbers [last unloaded: i915]
Comment 1 Martin Peres 2018-11-19 16:36:26 UTC
^ Seen on GDG
Comment 2 Chris Wilson 2018-11-19 16:43:50 UTC
Wtf. More memcorruption smell.
Comment 3 Chris Wilson 2018-11-19 19:08:56 UTC
commit dafdf69736d66075836b7bc291584cd0889e7601 (HEAD -> topic/core-for-CI, drm-intel/topic/core-for-CI)
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Fri May 12 12:45:25 2017 +0100

    perf/core: Avoid removing shared pmu_context on unregister
    
    In commit 1fd7e4169954 ("perf/core: Remove perf_cpu_context::unique_pmu"),
    the search for another user of the pmu_cpu_context was removed, and so
    we unconditionally free it during perf_pmu_unregister. This leads to
    random corruption later and a BUG at mm/percpu.c:689.
    
    v2: Check for shared pmu_contexts under the mutex.
    
    Fixes: 1fd7e4169954 ("perf/core: Remove perf_cpu_context::unique_pmu")
    Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
    Cc: David Carrillo-Cisneros <davidcc@google.com>
    Cc: Peter Zijlstra (Intel) <peterz@infradead.org>
    Cc: Ingo Molnar <mingo@redhat.com>
    Cc: <stable@vger.kernel.org> # v4.11+
    Link: http://patchwork.freedesktop.org/patch/msgid/20170512114525.17575-1-chris@chris-wilson.co.uk
Comment 4 Francesco Balestrieri 2018-12-28 08:41:48 UTC
Not seen in more than a month, closing.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.