lpfc: Crash on shutdown with 5.9.0-rc1

Hannes Reinecke hare at suse.de
Thu Sep 10 04:12:54 EDT 2020


Hi James,

I just hit this crash on shutdown with nvme-5.9:

[ 3405.218638] general protection fault, probably for non-canonical address 0x6b6b6b6b6b6b84ef: 0000 [#1] SMP NOPTI
[ 3405.239110] CPU: 6 PID: 680 Comm: lpfc_worker_0 Tainted: G E     5.9.0-rc1-default+ #632
[ 3405.257300] Hardware name: HPE ProLiant DL325 Gen10/ProLiant DL325 Gen10, BIOS A41 03/09/2020
[ 3405.274451] RIP: 0010:_raw_spin_lock_irqsave+0x1e/0x40
[ 3405.284782] Code: 00 00 00 00 00 0f 1f 80 00 00 00 00 0f 1f 44 00 00 53 9c 58 0f 1f 44 00 00 48 89 c3 fa 66 0f 1f 44 00 00 31 c0 ba 01 00 00 00 <f0> 0f b1 17 75 05 48 89 d8 5b c3 89 c6 e8 f0 21 7c ff 66 90 48 89
[ 3405.322544] RSP: 0018:ffffa6c804397c78 EFLAGS: 00010046
[ 3405.333046] RAX: 0000000000000000 RBX: 0000000000000246 RCX: 000000006b6b6b6b
[ 3405.347388] RDX: 0000000000000001 RSI: 0000000000000100 RDI: 6b6b6b6b6b6b84ef
[ 3405.361731] RBP: 6b6b6b6b6b6b84ef R08: 000000006b6b6b6b R09: 000000006b6b6b6b
[ 3405.376079] R10: 0000000000000000 R11: ffff920ff51063d9 R12: ffff920ff5100000
[ 3405.390423] R13: ffff920ff5100000 R14: 0000000000000000 R15: ffff920fc752ee00
[ 3405.404768] FS:  0000000000000000(0000) GS:ffff92107ec00000(0000) knlGS:0000000000000000
[ 3405.421035] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3405.432582] CR2: 00007f3a0a1be960 CR3: 00000008520d0000 CR4: 00000000003506e0
[ 3405.446929] Call Trace:
[ 3405.451864]  lpfc_nlp_put.part.22+0x47/0x260 [lpfc]
[ 3405.461685]  lpfc_ct_free_iocb+0x20/0xc0 [lpfc]
[ 3405.470806]  lpfc_cmpl_ct_disc_fdmi+0xb3/0x3c0 [lpfc]
[ 3405.480969]  lpfc_sli_cancel_iocbs+0x78/0xc0 [lpfc]
[ 3405.490790]  lpfc_hba_clean_txcmplq+0xe4/0x1e0 [lpfc]
[ 3405.500959]  lpfc_hba_down_post_s4+0x6e/0x3e0 [lpfc]
[ 3405.510944]  ? msleep+0x29/0x40
[ 3405.517266]  lpfc_sli_brdrestart_s4+0x62/0x180 [lpfc]
[ 3405.527436]  lpfc_sli4_port_sta_fn_reset.constprop.49+0x5b/0x180 [lpfc]
[ 3405.540753]  lpfc_handle_eratt_s4+0x1ac/0x5c0 [lpfc]
[ 3405.550748]  lpfc_do_work+0xb79/0x1800 [lpfc]
[ 3405.559507]  ? wait_woken+0x80/0x80
[ 3405.566531]  ? lpfc_unregister_unused_fcf+0x80/0x80 [lpfc]
[ 3405.577561]  kthread+0x118/0x140
[ 3405.584043]  ? kthread_associate_blkcg+0xa0/0xa0
[ 3405.593323]  ret_from_fork+0x22/0x30
[ 3405.600505] Modules linked in: xfs(E) loop(E) nvme_loop(E) nfsv3(E) nfs_acl(E) rpcsec_gss_krb5(E) auth_rpcgss(E) nfsv4(E) dns_resolver(E) nfs(E) lockd(E) grace(E) sunrpc(E) fscache(E) af_packet(E) iscsi_ibft(E) iscsi_boot_sysfs(E) ip_set(E) nfnetlink(E) rfkill(E) x_tables(E) bpfilter(E) edac_mce_amd(E) kvm_amd(E) mgag200(E) kvm(E) i2c_algo_bit(E) drm_kms_helper(E) nls_iso8859_1(E) irqbypass(E) nls_cp437(E) crct10dif_pclmul(E) crc32_pclmul(E) vfat(E) ipmi_ssif(E) cec(E) ghash_clmulni_intel(E) fat(E) aesni_intel(E) drm(E) crypto_simd(E) tg3(E) cryptd(E) syscopyarea(E) ses(E) acpi_ipmi(E) sp5100_tco(E) sysfillrect(E) glue_helper(E) enclosure(E) sysimgblt(E) pcspkr(E) ipmi_si(E) libphy(E) fb_sys_fops(E) hpwdt(E) i2c_piix4(E) ccp(E) k10temp(E) hpilo(E) ipmi_devintf(E) tiny_power_button(E) acpi_tad(E) ipmi_msghandler(E) acpi_cpufreq(E) button(E) btrfs(E) blake2b_generic(E) libcrc32c(E) xor(E) raid6_pq(E) sd_mod(E) lpfc(E) nvmet_fc(E) nvmet(E) nvme_fc(E) ehci_pci(E) xhci_pci(E) nvme_fabrics(E)
[ 3405.600569]  ehci_hcd(E) xhci_hcd(E) nvme_core(E) smartpqi(E) t10_pi(E) crc32c_intel(E) scsi_transport_sas(E) scsi_transport_fc(E) usbcore(E) wmi(E) sg(E) dm_multipath(E) dm_mod(E) scsi_dh_rdac(E) scsi_dh_emc(E) scsi_dh_alua(E) scsi_mod(E) efivarfs(E) autofs4(E) [last unloaded: ip_tables]
[ 3405.827793] ---[ end trace a70b79e0bbbacf2d ]---
[ 3405.848246] RIP: 0010:_raw_spin_lock_irqsave+0x1e/0x40
[ 3405.858669] Code: 00 00 00 00 00 0f 1f 80 00 00 00 00 0f 1f 44 00 00 53 9c 58 0f 1f 44 00 00 48 89 c3 fa 66 0f 1f 44 00 00 31 c0 ba 01 00 00 00 <f0> 0f b1 17 75 05 48 89 d8 5b c3 89 c6 e8 f0 21 7c ff 66 90 48 89
[ 3405.896426] RSP: 0018:ffffa6c804397c78 EFLAGS: 00010046
[ 3405.906930] RAX: 0000000000000000 RBX: 0000000000000246 RCX: 000000006b6b6b6b
[ 3405.921277] RDX: 0000000000000001 RSI: 0000000000000100 RDI: 6b6b6b6b6b6b84ef
[ 3405.935620] RBP: 6b6b6b6b6b6b84ef R08: 000000006b6b6b6b R09: 000000006b6b6b6b
[ 3405.949964] R10: 0000000000000000 R11: ffff920ff51063d9 R12: ffff920ff5100000
[ 3405.964307] R13: ffff920ff5100000 R14: 0000000000000000 R15: ffff920fc752ee00
[ 3405.978650] FS:  0000000000000000(0000) GS:ffff92107ec00000(0000) knlGS:0000000000000000
[ 3405.994915] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3406.006463] CR2: 00007f3a0a1be960 CR3: 00000008520d0000 CR4: 00000000003506e0
[ 3406.020811] Kernel panic - not syncing: Fatal exception
[ 3406.058787] Kernel Offset: 0x29000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)

Is this a known issue?
Or is it perhaps already fixed by the updates going into the scsi branch?

Thanks.

Cheers,

Hannes
-- 
Dr. Hannes Reinecke		           Kernel Storage Architect
hare at suse.de			                  +49 911 74053 688
SUSE Software Solutions Germany GmbH, Maxfeldstr. 5, 90409 Nürnberg
HRB 36809 (AG Nürnberg), GF: Felix Imendörffer


