Intel P3600 writes after hot add (RHEL7.1)
j cagle
jcagle at gmail.com
Tue Sep 29 14:23:50 PDT 2015
I'm having a problem with 2.5" Intel P3600 & P3700 Series NVMe drives
with RHEL7.1.
I can quiesce the drive, then hot-remove it, and there's no problem.
I can then hot-add it and it is discovered properly - no problem.
Then I can read from the drive, again - no problem.
The problem is that the server crashes (PCIe unrecoverable error) as
soon as I attempt to write anything to the drive:
[ 9838.102041] {1}[Hardware Error]: event severity: fatal
[ 9838.102042] {1}[Hardware Error]: Error 0, type: fatal
[ 9838.102042] {1}[Hardware Error]: section_type: PCIe error
[ 9838.102043] {1}[Hardware Error]: port_type: 0, PCIe end point
[ 9838.102043] {1}[Hardware Error]: version: 1.16
[ 9838.102044] {1}[Hardware Error]: command: 0x0406, status: 0x0010
[ 9838.102045] {1}[Hardware Error]: device_id: 0000:84:00.0
[ 9838.102045] {1}[Hardware Error]: slot: 0
[ 9838.102045] {1}[Hardware Error]: secondary_bus: 0x00
[ 9838.102046] {1}[Hardware Error]: vendor_id: 0x8086, device_id: 0x0953
[ 9838.102046] {1}[Hardware Error]: class_code: 020801
[ 9838.102062] Kernel panic - not syncing: Fatal hardware error!
Has anyone seen this error before?
Thanks,
John
More information about the Linux-nvme
mailing list