Bug 106820

Summary: 4.17.1 BUG: unable to handle kernel paging request at 00000000bfbfbf08 sysfs_kf_seq_show
Product: DRI
Reporter: Sverd Johnsen <sverd.johnsen>
Component: DRM/AMDgpu
Assignee: Default DRI bug account <dri-devel>
Status: RESOLVED MOVED
QA Contact:
Severity: normal
Priority: medium
CC: iavaelooeyt, sarnex
Version: XOrg git
Hardware: x86-64 (AMD64)
OS: Linux (All)
Whiteboard:
i915 platform:
i915 features:

Description Sverd Johnsen 2018-06-05 03:33:14 UTC
After unloading and loading the amdgpu module several times over some time (checking the status of bug 104274 with the new 4.17 kernel), I noticed that the sensors program gets killed while reading /sys/class/i2c-adapter/i2c-10/name.

This seems to come from the amdgpu driver (the adapter belongs to 0000:01:00.0 according to the device symlink).

[ 5643.197797] BUG: unable to handle kernel paging request at 00000000bfbfbf08
[ 5643.197801] PGD 0 P4D 0 
[ 5643.197802] Oops: 0000 [#1] PREEMPT SMP
[ 5643.197803] Modules linked in: amdgpu chash gpu_sched ttm af_packet macvtap macvlan bonding nft_nat nft_chain_nat_ipv4 nf_nat_ipv4 nf_nat nf_log_ipv6 nf_log_ipv4 nf_log_common xfrm_user nft_reject_inet nf_reject_ipv4 xfrm_algo nf_reject_ipv6 nft_reject nft_log nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 nft_ct nf_conntrack cls_u32 nft_counter nft_meta nft_set_bitmap sch_htb nft_set_hash nft_set_rbtree raid0 x86_pkg_temp_thermal intel_powerclamp kvm_intel vhost_net tun vhost tap kvm deflate efi_pstore intel_cstate intel_uncore md_mod snd_hda_codec_realtek bcache snd_hda_codec_generic intel_rapl_perf efivars snd_hda_codec_hdmi snd_hda_intel snd_hda_codec plusb input_leds usbnet led_class snd_hwdep snd_hda_core mii mei_me snd_pcm mei tpm_crb crypto_user efivarfs algif_skcipher
[ 5643.197824]  af_alg mousedev joydev psmouse atkbd crct10dif_pclmul crc32_pclmul libps2 ghash_clmulni_intel pcspkr tpm_tis tpm_tis_core shpchp fan thermal tpm i8042 rng_core acpi_pad vfio_pci irqbypass vfio_virqfd vfio_iommu_type1 vfio [last unloaded: amdgpu]
[ 5643.197836] CPU: 0 PID: 5587 Comm: sensors Not tainted 4.17.0-1-vfio #2
[ 5643.197837] Hardware name: Gigabyte Technology Co., Ltd. Z170X-UD3/Z170X-UD3-CF, BIOS F23d 12/01/2017
[ 5643.197840] RIP: 0010:sysfs_kf_seq_show+0x1e/0x100
[ 5643.197841] RSP: 0018:ffff952b854efd98 EFLAGS: 00010206
[ 5643.197841] RAX: 00000000bfbfbf00 RBX: 0000000000000000 RCX: 0000000000000001
[ 5643.197842] RDX: ffff8ef36cb509c0 RSI: 0000000000000001 RDI: ffff8ef27e8f2e80
[ 5643.197843] RBP: 00000000ffffffff R08: ffff8ef39379f928 R09: ffff8ef36cb509c0
[ 5643.197843] R10: 0000000000000000 R11: 0000000000000001 R12: ffff952b854efec0
[ 5643.197844] R13: ffff8ef3773d9200 R14: 0000000000000001 R15: ffff8ef27e8f2e80
[ 5643.197844] FS:  00007fb4230b9740(0000) GS:ffff8ef3dec00000(0000) knlGS:0000000000000000
[ 5643.197845] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 5643.197846] CR2: 00000000bfbfbf08 CR3: 00000003bbde7004 CR4: 00000000003606f0
[ 5643.197846] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 5643.197847] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 5643.197847] Call Trace:
[ 5643.197850]  seq_read+0x13b/0x3c0
[ 5643.197852]  __vfs_read+0x1e/0x130
[ 5643.197854]  ? common_file_perm+0xcf/0x130
[ 5643.197855]  vfs_read+0x85/0x110
[ 5643.197856]  ksys_read+0x3a/0x90
[ 5643.197858]  do_syscall_64+0x53/0x300
[ 5643.197860]  ? page_fault+0x8/0x30
[ 5643.197861]  ? do_page_fault+0x1b/0x5e
[ 5643.197862]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 5643.197864] RIP: 0033:0x7fb4229b8901
[ 5643.197864] RSP: 002b:00007fff9fe0e068 EFLAGS: 00000246 ORIG_RAX: 0000000000000000
[ 5643.197865] RAX: ffffffffffffffda RBX: 00005648e71aa330 RCX: 00007fb4229b8901
[ 5643.197865] RDX: 0000000000001000 RSI: 00005648e71aa7f0 RDI: 0000000000000004
[ 5643.197866] RBP: 0000000000000d68 R08: 0000000000000003 R09: 0000000000000000
[ 5643.197866] R10: 00005648e71a1010 R11: 0000000000000246 R12: 00007fb422c815c0
[ 5643.197867] R13: 00007fb422c82100 R14: 000000000000007f R15: 00005648e71aa330
[ 5643.197868] Code: de ff ff eb b8 5b 5d 41 5c c3 0f 1f 00 55 53 4c 8b 4f 78 49 8b 01 48 8b 40 08 4c 8b 40 60 49 8b 40 28 48 85 c0 0f 84 b0 00 00 00 <48> 8b 68 08 48 8b 4f 18 48 8b 47 08 48 39 c1 0f 87 ae 00 00 00 
[ 5643.197881] RIP: sysfs_kf_seq_show+0x1e/0x100 RSP: ffff952b854efd98
[ 5643.197881] CR2: 00000000bfbfbf08
[ 5643.197882] ---[ end trace 1347051bc2a6e340 ]---

I can add more info if needed, using debug kernel options, patches, or whatever else helps.
Comment 1 Sverd Johnsen 2018-06-15 18:09:42 UTC
4.17.1 is still affected. A load followed by an unload should be sufficient to trigger it.
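
The reproduction described in this report (reload amdgpu, then read the i2c adapter `name` attributes the way `sensors` does) could be sketched roughly as follows. This is only an illustration under assumptions: the helper names `reload_module` and `read_adapter_names` are made up for this sketch, the module is assumed to be unloadable (nothing is using the GPU), and `reload_module` needs root.

```python
import subprocess
from pathlib import Path


def reload_module(module="amdgpu"):
    """Unload and reload a kernel module (needs root; the GPU must be idle)."""
    subprocess.run(["modprobe", "-r", module], check=True)
    subprocess.run(["modprobe", module], check=True)


def read_adapter_names(root="/sys/class/i2c-adapter"):
    """Read the `name` attribute of every i2c adapter below `root`.

    On an affected kernel, reading the attribute of a stale adapter is the
    step that triggers the oops, so the calling process may be killed here.
    """
    names = {}
    for entry in sorted(Path(root).glob("i2c-*")):
        name_file = entry / "name"
        if name_file.is_file():
            names[entry.name] = name_file.read_text().strip()
    return names
```

Calling `reload_module()` one or more times and then `read_adapter_names()` mirrors the steps above. Whether a given kernel actually oopses depends on the setup, so treat this as a starting point rather than a guaranteed reproducer.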


[  854.947999] BUG: unable to handle kernel NULL pointer dereference at 0000000000000007
[  854.948013] PGD 0 P4D 0 
[  854.948022] Oops: 0000 [#1] PREEMPT SMP PTI
[  854.948026] Modules linked in: amdgpu twofish_generic twofish_avx_x86_64 twofish_x86_64_3way twofish_x86_64 twofish_common serpent_avx2 serpent_avx_x86_64 serpent_generic chash gpu_sched ttm msr af_packet macvtap macvlan bonding nf_log_ipv6 nf_log_ipv4 nf_log_common nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_log nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 nft_limit nft_ct nf_conntrack xfrm_user xfrm_algo nft_counter cls_u32 nft_meta nft_set_bitmap nft_set_hash sch_htb nft_set_rbtree raid0 intel_pmc_core x86_pkg_temp_thermal intel_powerclamp kvm_intel vhost_net tun vhost tap kvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi md_mod snd_hda_intel bcache snd_hda_codec efi_pstore intel_cstate intel_uncore intel_rapl_perf snd_hwdep cdc_ether efivars
[  854.948111]  snd_hda_core usbnet mei_me r8152 mei mii snd_pcm input_leds led_class tpm_crb binfmt_misc crypto_user efivarfs algif_skcipher af_alg mousedev joydev psmouse crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcspkr shpchp tpm_tis tpm_tis_core tpm fan thermal i8042 rng_core acpi_pad vfio_pci irqbypass vfio_virqfd vfio_iommu_type1 vfio atkbd libps2 [last unloaded: amdgpu]
[  854.948164] CPU: 2 PID: 6097 Comm: TaskSchedulerFo Not tainted 4.17.1-2-ph #2
[  854.948168] Hardware name: Gigabyte Technology Co., Ltd. Z170X-UD3/Z170X-UD3-CF, BIOS F23d 12/01/2017
[  854.948179] RIP: 0010:sysfs_kf_seq_show+0x1e/0x100
[  854.948184] RSP: 0018:ffffadb482a07dd0 EFLAGS: 00010286
[  854.948190] RAX: ffffffffffffffff RBX: 0000000000000000 RCX: 0000000000000001
[  854.948194] RDX: ffff97d857e5ec00 RSI: 0000000000000001 RDI: ffff97d883e41c80
[  854.948198] RBP: 00000000ffffffff R08: ffff97d85472e928 R09: ffff97d857e5ec00
[  854.948202] R10: 0000000000000000 R11: 0000000000000001 R12: ffffadb482a07f08
[  854.948206] R13: ffff97d83f71fa00 R14: 0000000000000001 R15: ffff97d883e41c80
[  854.948211] FS:  00007fbce080f700(0000) GS:ffff97d89ed00000(0000) knlGS:0000000000000000
[  854.948216] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  854.948220] CR2: 0000000000000007 CR3: 0000000440274004 CR4: 00000000003606e0
[  854.948224] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  854.948228] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  854.948232] Call Trace:
[  854.948244]  seq_read+0x166/0x460
[  854.948252]  __vfs_read+0x31/0x150
[  854.948260]  vfs_read+0x85/0x130
[  854.948267]  ksys_read+0x4a/0xb0
[  854.948274]  do_syscall_64+0x43/0xf0
[  854.948282]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  854.948287] RIP: 0033:0x7fbcf30ff934
[  854.948292] RSP: 002b:00007fbce080ded0 EFLAGS: 00000246 ORIG_RAX: 0000000000000000
[  854.948298] RAX: ffffffffffffffda RBX: 0000000000000093 RCX: 00007fbcf30ff934
[  854.948302] RDX: 0000000000001000 RSI: 000025eba2e3b400 RDI: 0000000000000093
[  854.948306] RBP: 000025eba2e3b400 R08: 0000000000000000 R09: 000025eba3991b40
[  854.948310] R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000001000
[  854.948314] R13: 000025eba2e3b400 R14: 0000000000000d68 R15: 00007fbcf33c85c0
[  854.948318] Code: dd ff ff eb b8 5b 5d 41 5c c3 0f 1f 00 55 53 4c 8b 4f 78 49 8b 01 48 8b 40 08 4c 8b 40 60 49 8b 40 28 48 85 c0 0f 84 b3 00 00 00 <48> 8b 68 08 48 8b 4f 18 48 8b 47 08 48 39 c1 0f 87 b1 00 00 00 
[  854.948407] RIP: sysfs_kf_seq_show+0x1e/0x100 RSP: ffffadb482a07dd0
[  854.948410] CR2: 0000000000000007
[  854.948416] ---[ end trace f982eed156f986a9 ]---

etc.

[ 1571.019192] general protection fault: 0000 [#7] PREEMPT SMP PTI
[ 1571.019195] Modules linked in: cpufreq_ondemand twofish_generic twofish_avx_x86_64 twofish_x86_64_3way twofish_x86_64 twofish_common serpent_avx2 serpent_avx_x86_64 serpent_generic chash gpu_sched ttm msr af_packet macvtap macvlan bonding nf_log_ipv6 nf_log_ipv4 nf_log_common nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_log nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 nft_limit nft_ct nf_conntrack xfrm_user xfrm_algo nft_counter cls_u32 nft_meta nft_set_bitmap nft_set_hash sch_htb nft_set_rbtree raid0 intel_pmc_core x86_pkg_temp_thermal intel_powerclamp kvm_intel vhost_net tun vhost tap kvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi md_mod snd_hda_intel bcache snd_hda_codec efi_pstore intel_cstate intel_uncore intel_rapl_perf snd_hwdep cdc_ether
[ 1571.019218]  efivars snd_hda_core usbnet mei_me r8152 mei mii snd_pcm input_leds led_class tpm_crb binfmt_misc crypto_user efivarfs algif_skcipher af_alg mousedev joydev psmouse crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcspkr shpchp tpm_tis tpm_tis_core tpm fan thermal i8042 rng_core acpi_pad vfio_pci irqbypass vfio_virqfd vfio_iommu_type1 vfio atkbd libps2 [last unloaded: amdgpu]
[ 1571.019234] CPU: 3 PID: 8349 Comm: sensors Tainted: G      D           4.17.1-2-ph #2
[ 1571.019235] Hardware name: Gigabyte Technology Co., Ltd. Z170X-UD3/Z170X-UD3-CF, BIOS F23d 12/01/2017
[ 1571.019239] RIP: 0010:sysfs_kf_seq_show+0x1e/0x100
[ 1571.019240] RSP: 0018:ffffadb48258fdd0 EFLAGS: 00010206
[ 1571.019241] RAX: 20205b0a30303030 RBX: 0000000000000000 RCX: 0000000000000001
[ 1571.019242] RDX: ffff97d856debc00 RSI: 0000000000000001 RDI: ffff97d883e40080
[ 1571.019243] RBP: 00000000ffffffff R08: ffff97d83f183928 R09: ffff97d856debc00
[ 1571.019244] R10: 0000000000000000 R11: 0000000000000001 R12: ffffadb48258ff08
[ 1571.019245] R13: ffff97d83ff67800 R14: 0000000000000001 R15: ffff97d883e40080
[ 1571.019246] FS:  00007f7b8c789740(0000) GS:ffff97d89ed80000(0000) knlGS:0000000000000000
[ 1571.019247] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1571.019248] CR2: 000055b25f006568 CR3: 000000040c9e8006 CR4: 00000000003606e0
[ 1571.019249] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 1571.019250] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 1571.019251] Call Trace:
[ 1571.019255]  seq_read+0x166/0x460
[ 1571.019258]  ? ptrace_do_notify+0x92/0xc0
[ 1571.019260]  __vfs_read+0x31/0x150
[ 1571.019262]  vfs_read+0x85/0x130
[ 1571.019263]  ksys_read+0x4a/0xb0
[ 1571.019265]  do_syscall_64+0x43/0xf0
[ 1571.019268]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 1571.019269] RIP: 0033:0x7f7b8c088901
[ 1571.019270] RSP: 002b:00007ffd4b10a6c8 EFLAGS: 00000246 ORIG_RAX: 0000000000000000
[ 1571.019272] RAX: ffffffffffffffda RBX: 000055b25f005330 RCX: 00007f7b8c088901
[ 1571.019273] RDX: 0000000000001000 RSI: 000055b25f005790 RDI: 0000000000000004
[ 1571.019274] RBP: 0000000000000d68 R08: 0000000000000003 R09: 0000000000000000
[ 1571.019275] R10: 000055b25effc010 R11: 0000000000000246 R12: 00007f7b8c3515c0
[ 1571.019275] R13: 00007f7b8c352100 R14: 000000000000007f R15: 000055b25f005330
[ 1571.019277] Code: dd ff ff eb b8 5b 5d 41 5c c3 0f 1f 00 55 53 4c 8b 4f 78 49 8b 01 48 8b 40 08 4c 8b 40 60 49 8b 40 28 48 85 c0 0f 84 b3 00 00 00 <48> 8b 68 08 48 8b 4f 18 48 8b 47 08 48 39 c1 0f 87 b1 00 00 00 
[ 1571.019297] RIP: sysfs_kf_seq_show+0x1e/0x100 RSP: ffffadb48258fdd0
[ 1571.019299] ---[ end trace f982eed156f986af ]---
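
All three reports above stop at the same instruction, sysfs_kf_seq_show+0x1e. The bytes marked `<48> 8b 68 08` in the Code: lines decode as `mov 0x8(%rax),%rbp`, so the faulting address should be RAX + 8 in each case. The sketch below checks that against the logged register values and also prints the third RAX value as text. The constant and function names are made up for this sketch, and the canonical-address check assumes 48-bit virtual addresses (4-level paging).

```python
MASK64 = (1 << 64) - 1

# (RAX, reported fault address CR2) copied from the first two oopses above.
PAGE_FAULTS = [
    (0x00000000BFBFBF00, 0x00000000BFBFBF08),
    (0xFFFFFFFFFFFFFFFF, 0x0000000000000007),
]
# RAX from the third report (general protection fault, so CR2 is not relevant).
GPF_RAX = 0x20205B0A30303030


def fault_address(rax, disp=8):
    """Address touched by `mov 0x8(%rax),%rbp`, with 64-bit wraparound."""
    return (rax + disp) & MASK64


def is_canonical(addr):
    """True if bits 63..47 are all equal (assumes 48-bit virtual addresses)."""
    top = addr >> 47
    return top == 0 or top == 0x1FFFF


def as_ascii(value):
    """Interpret a 64-bit register value as 8 little-endian bytes of text."""
    return value.to_bytes(8, "little").decode("ascii", errors="replace")
```

With these values, `fault_address` matches CR2 for both page faults, the third address is non-canonical (which fits a general protection fault rather than a page fault), and `as_ascii(GPF_RAX)` gives `"0000\n[  "`. That RAX holds a different garbage value each time, once even plain text, looks consistent with the pointer being loaded from memory that was already freed and reused, but nothing in this report confirms the root cause.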
Comment 2 Nick Sarnie 2018-06-17 15:46:04 UTC
Same exact issue for me.

Thanks,
Sarnex
Comment 3 Martin Peres 2019-11-19 08:40:36 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further in the new issue on our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/413.
