Summary: | Kernel Oops on boot ../drivers/gpu/drm/i915/intel_drv.h:1813 gen8_write32+0x1e7/0x240 | ||||||
---|---|---|---|---|---|---|---|
Product: | DRI | Reporter: | omkhar | ||||
Component: | DRM/Intel | Assignee: | Intel GFX Bugs mailing list <intel-gfx-bugs> | ||||
Status: | CLOSED WORKSFORME | QA Contact: | Intel GFX Bugs mailing list <intel-gfx-bugs> | ||||
Severity: | normal | ||||||
Priority: | medium | CC: | intel-gfx-bugs | ||||
Version: | XOrg git | ||||||
Hardware: | x86-64 (AMD64) | ||||||
OS: | Linux (All) | ||||||
Whiteboard: | Triaged | ||||||
i915 platform: | BDW | i915 features: | |||||
Attachments: |
|
Description
omkhar
2018-04-25 14:02:05 UTC
Then don't use the debugfs interface. Instead of overriding rps, tell us why and report bugs for underperformance. https://patchwork.freedesktop.org/patch/218775/ I didn't "Use the debug interface" - I booted a stock kernel on a Linux distro produced by Intel. This was in dmesg. I filed a defect with the distro team and they directed me this team instead. Is it possible for the two Intel teams to chat about who's doing what incorrectly? Corresponding Clear Linux defect: https://github.com/clearlinux/distribution/issues/48 Could you provide a dmesg log booting with drm.debug=0xe? Oops looks the same, I have added some additional details from dmesg | grep i915: omkhar@ajaxvpn ~ $ dmesg | grep i915 [ 0.320753] calling i915_init+0x0/0x55 @ 1 [ 0.321231] [drm:i915_driver_load] ppgtt mode: 3 [ 0.321255] [drm:i915_ggtt_probe_hw] GGTT size = 4096M [ 0.321257] [drm:i915_ggtt_probe_hw] GMADR size = 256M [ 0.321260] [drm:i915_ggtt_probe_hw] DSM size = 32M [ 0.321381] [drm:i915_gem_init_stolen] Memory reserved for graphics device: 32768K, usable: 31744K [ 0.325299] [drm:i915_driver_load] rawclk rate: 24000 kHz [ 0.325310] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem [ 0.326425] [drm:i915_gem_init_ggtt] clearing unused GTT space: [1000, 100000000] [ 0.326449] [drm:i915_gem_contexts_init] logical context support initialized [ 0.327348] [drm] Initialized i915 1.6.0 20171222 for 0000:00:02.0 on minor 0 [ 0.329433] i915 device info: pciid=0x1606 rev=0x09 platform=BROADWELL gen=8 [ 0.329434] i915 device info: is_mobile: no [ 0.329436] i915 device info: is_lp: no [ 0.329437] i915 device info: is_alpha_support: no [ 0.329438] i915 device info: has_64bit_reloc: yes [ 0.329440] i915 device info: has_aliasing_ppgtt: yes [ 0.329441] i915 device info: has_csr: no [ 0.329442] i915 device info: has_ddi: yes [ 0.329443] i915 device info: has_dp_mst: yes [ 0.329444] i915 device info: has_reset_engine: yes [ 0.329446] i915 device info: has_fbc: yes [ 0.329447] i915 device info: has_fpga_dbg: yes [ 0.329448] i915 device info: has_full_ppgtt: yes [ 0.329455] i915 device info: has_full_48bit_ppgtt: yes [ 0.329461] i915 device info: has_gmch_display: no [ 0.329466] i915 device info: has_guc: no [ 0.329468] i915 device info: has_guc_ct: no [ 0.329469] i915 device info: has_hotplug: yes [ 0.329470] i915 device info: has_l3_dpf: no [ 0.329471] i915 device info: has_llc: yes [ 0.329472] i915 device info: has_logical_ring_contexts: yes [ 0.329474] i915 device info: has_logical_ring_preemption: no [ 0.329475] i915 device info: has_overlay: no [ 0.329476] i915 device info: has_pooled_eu: no [ 0.329477] i915 device info: has_psr: yes [ 0.329479] i915 device info: has_rc6: yes [ 0.329480] i915 device info: has_rc6p: no [ 0.329481] i915 device info: has_resource_streamer: yes [ 0.329482] i915 device info: has_runtime_pm: yes [ 0.329483] i915 device info: has_snoop: no [ 0.329485] i915 device info: unfenced_needs_alignment: no [ 0.329486] i915 device info: cursor_needs_physical: no [ 0.329487] i915 device info: hws_needs_physical: no [ 0.329488] i915 device info: overlay_needs_physical: no [ 0.329490] i915 device info: supports_tv: no [ 0.329491] i915 device info: has_ipc: no [ 0.329492] i915 device info: slice mask: 0001 [ 0.329493] i915 device info: slice total: 1 [ 0.329494] i915 device info: subslice total: 2 [ 0.329496] i915 device info: subslice mask 0003 [ 0.329497] i915 device info: subslice per slice: 2 [ 0.329498] i915 device info: EU total: 12 [ 0.329499] i915 device info: EU per subslice: 6 [ 0.329500] i915 device info: has slice power gating: no [ 0.329502] i915 device info: has subslice power gating: no [ 0.329503] i915 device info: has EU power gating: no [ 0.329504] i915 device info: CS timestamp frequency: 12500 kHz [ 0.329540] initcall i915_init+0x0/0x55 returned 0 after 8576 usecs [ 0.330236] [drm:gmbus_xfer] GMBUS [i915 gmbus dpc] NAK for addr: 0050 w(1) [ 0.330239] [drm:gmbus_xfer] GMBUS [i915 gmbus dpc] NAK on first message, retry [ 0.331234] [drm:gmbus_xfer] GMBUS [i915 gmbus dpc] NAK for addr: 0050 w(1) [ 0.331238] [drm:drm_do_probe_ddc_edid] drm: skipping non-existent adapter i915 gmbus dpc [ 0.331244] [drm:intel_gmbus_force_bit] enabling bit-banging on i915 gmbus dpc. force bit now 1 [ 0.331998] [drm:drm_do_probe_ddc_edid] drm: skipping non-existent adapter i915 gmbus dpc [ 0.332000] [drm:intel_gmbus_force_bit] disabling bit-banging on i915 gmbus dpc. force bit now 0 [ 0.332217] [drm:gmbus_xfer] GMBUS [i915 gmbus dpc] NAK for addr: 0040 w(1) [ 0.332219] [drm:gmbus_xfer] GMBUS [i915 gmbus dpc] NAK on first message, retry [ 0.333238] [drm:gmbus_xfer] GMBUS [i915 gmbus dpc] NAK for addr: 0040 w(1) [ 1.139865] snd_hda_intel 0000:00:03.0: bound 0000:00:02.0 (ops i915_audio_component_bind_ops) [ 1.139867] clr: call_modprobe: i915 2 [ 1.154234] [drm:i915_audio_component_get_eld] Not valid for port B [ 1.154237] [drm:i915_audio_component_get_eld] Not valid for port B [ 1.154239] [drm:i915_audio_component_get_eld] Not valid for port B [ 1.154241] [drm:i915_audio_component_get_eld] Not valid for port C [ 1.154243] [drm:i915_audio_component_get_eld] Not valid for port C [ 1.154245] [drm:i915_audio_component_get_eld] Not valid for port C [ 1.154247] [drm:i915_audio_component_get_eld] Not valid for port D [ 1.154249] [drm:i915_audio_component_get_eld] Not valid for port D [ 1.154251] [drm:i915_audio_component_get_eld] Not valid for port D [ 1.418946] [drm:i915_min_freq_set] Manually setting min freq to 700 [ 1.418964] WARNING: CPU: 0 PID: 201 at ../drivers/gpu/drm/i915/intel_drv.h:1813 gen8_write32+0x1e7/0x240 [ 1.419021] i915_min_freq_set+0x9d/0xf0 Looks like the oops occurs *right* after the i915_min_freq_set call Please send whole dmesg from the boot to failure, do not grep anything. What system this is? Created attachment 139144 [details]
Dmesg
This system is a whitebox that I'm using as a headless router. Here are the details on Amazon https://www.amazon.com/gp/product/B01N6GSS7Y/ref=oh_aui_detailpage_o04_s00?ie=UTF8&psc=1 Demsg still not from the beginning of the boot, can you get that from the beginning as now it starts from [ 0.190165]. In that web page: Intel Celeron 3215U Processor, dual core ,1.7 GHz Afraid that is the early message out of dmesg right after boot, I've asked the Clear Linux team as to the best method of getting earlier information cpuinfo if it's helpful omkhar@ajaxvpn ~ $ cat /proc/cpuinfo processor : 0 vendor_id : GenuineIntel cpu family : 6 model : 61 model name : Intel(R) Celeron(R) CPU 3215U @ 1.70GHz stepping : 4 microcode : 0x2a cpu MHz : 1621.583 cache size : 2048 KB physical id : 0 siblings : 2 core id : 0 cpu cores : 2 apicid : 0 initial apicid : 0 fpu : yes fpu_exception : yes cpuid level : 20 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt xsave rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single pti tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust erms invpcid rdseed intel_pt xsaveopt dtherm ida arat pln pts bugs : cpu_meltdown spectre_v1 spectre_v2 bogomips : 3392.40 clflush size : 64 cache_alignment : 64 address sizes : 39 bits physical, 48 bits virtual power management: processor : 1 vendor_id : GenuineIntel cpu family : 6 model : 61 model name : Intel(R) Celeron(R) CPU 3215U @ 1.70GHz stepping : 4 microcode : 0x2a cpu MHz : 1696.393 cache size : 2048 KB physical id : 0 siblings : 2 core id : 1 cpu cores : 2 apicid : 2 initial apicid : 2 fpu : yes fpu_exception : yes cpuid level : 20 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt xsave rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single pti tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust erms invpcid rdseed intel_pt xsaveopt dtherm ida arat pln pts bugs : cpu_meltdown spectre_v1 spectre_v2 bogomips : 3392.40 clflush size : 64 cache_alignment : 64 address sizes : 39 bits physical, 48 bits virtual power management: Any updates here getting whole boot log? I've asked the Clear Linux team once again, I will advise when I hear back. I've attempted multiple reboots and for some reason the dmesg always begins at 0.190165. ping. requested the distro team for further information regarding the missing dmesg info... nothing yet. Just bumped again. OK, thanks. ping. Any luck still? Chris, any idea if still valid? Panic seems to have disappeared with the latest Clear Linux Kernel (4.16.9-571.native) Thank you for reporting back, closing. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.