lpfc: Crash on shutdown with 5.9.0-rc1

James Smart james.smart at broadcom.com
Thu Sep 10 11:02:09 EDT 2020


On 9/10/2020 1:12 AM, Hannes Reinecke wrote:
> Hi James,
>
> I just hit this crash on shutdown with nvme-5.9:
>
> [ 3405.218638] general protection fault, probably for non-canonical
> address 0x6b6b6b6b6b6b84ef: 0000 [#1] SMP NOPTI
> [ 3405.239110] CPU: 6 PID: 680 Comm: lpfc_worker_0 Tainted: G
>   E     5.9.0-rc1-default+ #632
> [ 3405.257300] Hardware name: HPE ProLiant DL325 Gen10/ProLiant DL325
> Gen10, BIOS A41 03/09/2020
> [ 3405.274451] RIP: 0010:_raw_spin_lock_irqsave+0x1e/0x40
> [ 3405.284782] Code: 00 00 00 00 00 0f 1f 80 00 00 00 00 0f 1f 44 00 00
> 53 9c 58 0f 1f 44 00 00 48 89 c3 fa 66 0f 1f 44 00 00 31 c0 ba 01 00 00
> 00 <f0> 0f b1 17 75 05 48 89 d8 5b c3 89 c6 e8 f0 21 7c ff 66 90 48 89
> [ 3405.322544] RSP: 0018:ffffa6c804397c78 EFLAGS: 00010046
> [ 3405.333046] RAX: 0000000000000000 RBX: 0000000000000246 RCX:
> 000000006b6b6b6b
> [ 3405.347388] RDX: 0000000000000001 RSI: 0000000000000100 RDI:
> 6b6b6b6b6b6b84ef
> [ 3405.361731] RBP: 6b6b6b6b6b6b84ef R08: 000000006b6b6b6b R09:
> 000000006b6b6b6b
> [ 3405.376079] R10: 0000000000000000 R11: ffff920ff51063d9 R12:
> ffff920ff5100000
> [ 3405.390423] R13: ffff920ff5100000 R14: 0000000000000000 R15:
> ffff920fc752ee00
> [ 3405.404768] FS:  0000000000000000(0000) GS:ffff92107ec00000(0000)
> knlGS:0000000000000000
> [ 3405.421035] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 3405.432582] CR2: 00007f3a0a1be960 CR3: 00000008520d0000 CR4:
> 00000000003506e0
> [ 3405.446929] Call Trace:
> [ 3405.451864]  lpfc_nlp_put.part.22+0x47/0x260 [lpfc]
> [ 3405.461685]  lpfc_ct_free_iocb+0x20/0xc0 [lpfc]
> [ 3405.470806]  lpfc_cmpl_ct_disc_fdmi+0xb3/0x3c0 [lpfc]
> [ 3405.480969]  lpfc_sli_cancel_iocbs+0x78/0xc0 [lpfc]
> [ 3405.490790]  lpfc_hba_clean_txcmplq+0xe4/0x1e0 [lpfc]
> [ 3405.500959]  lpfc_hba_down_post_s4+0x6e/0x3e0 [lpfc]
> [ 3405.510944]  ? msleep+0x29/0x40
> [ 3405.517266]  lpfc_sli_brdrestart_s4+0x62/0x180 [lpfc]
> [ 3405.527436]  lpfc_sli4_port_sta_fn_reset.constprop.49+0x5b/0x180 [lpfc]
> [ 3405.540753]  lpfc_handle_eratt_s4+0x1ac/0x5c0 [lpfc]
> [ 3405.550748]  lpfc_do_work+0xb79/0x1800 [lpfc]
> [ 3405.559507]  ? wait_woken+0x80/0x80
> [ 3405.566531]  ? lpfc_unregister_unused_fcf+0x80/0x80 [lpfc]
> [ 3405.577561]  kthread+0x118/0x140
>
>
> Is this a known issue?
> Maybe fixed with updates getting in the scsi branch?
>
> Thanks.
>
> Cheers,
>
> Hannes

I'm guessing none of the deltas in the scsi tree will help you. This is 
a rare path - kicked off by detecting a hw failure 
(lpfc_handle_eratt_s4). We'll look into it.

-- james




More information about the Linux-nvme mailing list