- To: linux-scsi@xxxxxxxxxxxxxxx
- Subject: Multipath failing path when mpt2sas report an aborted command and sd report hostbyte=DID_SOFT_ERROR
- From: Francesco Santagata <francesco.santagata@xxxxxxxxxx>
- Date: Tue, 22 May 2012 19:57:33 +0200
- In-reply-to: <CAJ9D7cBXzJuzCQtQn3YzbdnjVhCEXRbJXPLHEUVtqgrEAydYuw@mail.gmail.com>
Hello linux-scsi,
I'm reporting a problem with multipath in conjunction with SAS2 HBA,
mpt2sas, SAS2Expander and SATA disks without multiplexer.
OS: RedHat 6.2
Kernel: 2.6.32-220.17.1.el6.x86_64
HBA: LSI 9205-8e
mpt2sas 13.00.00.00
FW Ver 13.00.57.00
BIOS 07.25.00.00
DISK Hitachi HDS723020BLA642
JBOD NEWISYS NDS-4600-JD Firmware:0501
N-Disk 60
MultiPath device-mapper-multipath.x86_64 0.4.9
I'm using sata drives directly attached to the JBOD without any sataMux.
I'm using multipath even if I can see drives attached to only only
scsi address; this is because of the architecture of the software i'm
building on top of the hardware.
An example:
mpathak (35000cca369e6df1b) dm-50 ATA,Hitachi HDS72302
size=1.8T features='0' hwhandler='0' wp=rw
`-+- policy='round-robin 0' prio=1 status=active
`- 9:0:51:0 sday 67:32 active ready running
mpathr (35000cca369e7a662) dm-21 ATA,Hitachi HDS72302
size=1.8T features='0' hwhandler='0' wp=rw
`-+- policy='round-robin 0' prio=1 status=active
`- 9:0:22:0 sdv 65:80 active ready running
When I'm doing heavy reads (dd with bs=65536) on all 60 disks i'm
experiencing this problem:
May 14 15:48:42 localhost multipathd: 67:0: mark as failed
May 14 15:48:42 localhost multipathd: mpathbg: remaining active paths: 0
May 14 15:48:47 localhost multipathd: mpathbg: sdaw - serial checker
reports path is up
May 14 15:48:47 localhost multipathd: 67:0: reinstated
May 14 15:48:47 localhost multipathd: mpathbg: remaining active paths: 1
and if I look into kernel messages this is what I see:
ay 14 15:48:42 localhost kernel: mpt2sas1:
enclosure_logical_id(0x500093d000421000), slot(21)
May 14 15:48:42 localhost kernel: mpt2sas1: handle(0x003d),
ioc_status(scsi ioc terminated)(0x004b), smid(315)
May 14 15:48:42 localhost kernel: mpt2sas1: request_len(16384),
underflow(16384), resid(16384)
May 14 15:48:42 localhost kernel: mpt2sas1: tag(4),
transfer_count(0), sc->result(0x000b0000)
May 14 15:48:42 localhost kernel: mpt2sas1:
scsi_status(good)(0x00), scsi_state( )(0x00)
May 14 15:48:42 localhost kernel: mpt2sas1: log_info(0x31080000):
originator(PL), code(0x08), sub_code(0x0000)
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Done: SUCCESS
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 25 18 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Unhandled error code
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 25 18 00 00 20 00
May 14 15:48:42 localhost kernel: end_request: I/O error, dev sdaw,
sector 672867608
May 14 15:48:42 localhost kernel: sd 11:0:45:0: [sdas] CDB: Read(10):
28 00 21 56 33 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:10:0: [sdj] CDB: Read(10):
28 00 28 d9 7e d8 00 00 f0 00
May 14 15:48:42 localhost kernel: sd 11:0:11:0: [sdk] CDB: Read(10):
28 00 21 56 33 98 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:57:0: [sdbe] CDB: Read(10):
28 00 26 aa 3e a8 00 00 68 00
May 14 15:48:42 localhost kernel: sd 11:0:10:0: [sdj] CDB: Read(10):
28 00 28 d9 92 68 00 00 08 00
May 14 15:48:42 localhost kernel: sd 11:0:12:0: [sdl] CDB: Read(10):
28 00 21 56 33 98 00 00 a0 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 23 c8 00 00 60 00
May 14 15:48:42 localhost kernel: mpt2sas1:
sas_address(0x500093d000421090), phy(16)
May 14 15:48:42 localhost kernel: mpt2sas1:
enclosure_logical_id(0x500093d000421000), slot(21)
May 14 15:48:42 localhost kernel: mpt2sas1: handle(0x003d),
ioc_status(success)(0x0000), smid(525)
May 14 15:48:42 localhost kernel: mpt2sas1: request_len(49152),
underflow(49152), resid(49152)
May 14 15:48:42 localhost kernel: mpt2sas1: tag(0),
transfer_count(0), sc->result(0x00000002)
May 14 15:48:42 localhost kernel: mpt2sas1: scsi_status(check
condition)(0x02), scsi_state(autosense valid )(0x01)
May 14 15:48:42 localhost kernel: mpt2sas1: [sense_key,asc,ascq]:
[0x03,0x11,0x00], count(18)
May 14 15:48:42 localhost kernel: sd 11:0:44:0: [sdar] CDB: Read(10):
28 00 28 d9 7e d8 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:3:0: [sdd] CDB: Read(10): 28
00 27 e9 30 10 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:7:0: [sdg] CDB: Read(10): 28
00 21 56 34 18 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:45:0: [sdas] CDB: Read(10):
28 00 21 56 33 f8 00 00 40 00
May 14 15:48:42 localhost kernel: sd 11:0:48:0: [sdav] CDB: Read(10):
28 00 27 c7 f1 08 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:37:0: [sdak] CDB: Read(10):
28 00 28 d9 7e f8 00 00 c8 00
May 14 15:48:42 localhost kernel: sd 11:0:19:0: [sds] CDB: Read(10):
28 00 28 d9 7e d8 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:38:0: [sdal] CDB: Read(10):
28 00 28 d9 7e d8 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 24 d0 00 00 20 00
May 14 15:48:42 localhost kernel: mpt2sas1:
sas_address(0x500093d000421090), phy(16)
May 14 15:48:42 localhost kernel: mpt2sas1:
enclosure_logical_id(0x500093d000421000), slot(21)
May 14 15:48:42 localhost kernel: mpt2sas1: handle(0x003d),
ioc_status(scsi ioc terminated)(0x004b), smid(495)
May 14 15:48:42 localhost kernel: mpt2sas1: request_len(16384),
underflow(16384), resid(16384)
May 14 15:48:42 localhost kernel: mpt2sas1: tag(2),
transfer_count(0), sc->result(0x000b0000)
May 14 15:48:42 localhost kernel: mpt2sas1:
scsi_status(good)(0x00), scsi_state( )(0x00)
May 14 15:48:42 localhost kernel: mpt2sas1: log_info(0x31080000):
originator(PL), code(0x08), sub_code(0x0000)
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 26 88 00 00 20 00
May 14 15:48:42 localhost kernel: mpt2sas1:
sas_address(0x500093d000421090), phy(16)
May 14 15:48:42 localhost kernel: mpt2sas1:
enclosure_logical_id(0x500093d000421000), slot(21)
May 14 15:48:42 localhost kernel: mpt2sas1: handle(0x003d),
ioc_status(scsi ioc terminated)(0x004b), smid(27)
May 14 15:48:42 localhost kernel: mpt2sas1: request_len(16384),
underflow(16384), resid(16384)
May 14 15:48:42 localhost kernel: mpt2sas1: tag(3),
transfer_count(0), sc->result(0x000b0000)
May 14 15:48:42 localhost kernel: mpt2sas1:
scsi_status(good)(0x00), scsi_state( )(0x00)
May 14 15:48:42 localhost kernel: mpt2sas1: log_info(0x31080000):
originator(PL), code(0x08), sub_code(0x0000)
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 25 38 00 00 80 00
May 14 15:48:42 localhost kernel: mpt2sas1:
sas_address(0x500093d000421090), phy(16)
May 14 15:48:42 localhost kernel: mpt2sas1:
enclosure_logical_id(0x500093d000421000), slot(21)
May 14 15:48:42 localhost kernel: mpt2sas1: handle(0x003d),
ioc_status(scsi ioc terminated)(0x004b), smid(446)
May 14 15:48:42 localhost kernel: mpt2sas1: request_len(65536),
underflow(65536), resid(65536)
May 14 15:48:42 localhost kernel: mpt2sas1: tag(5),
transfer_count(0), sc->result(0x000b0000)
May 14 15:48:42 localhost kernel: mpt2sas1:
scsi_status(good)(0x00), scsi_state( )(0x00)
May 14 15:48:42 localhost kernel: sd 11:0:19:0: [sds] CDB:
May 14 15:48:42 localhost kernel: mpt2sas1: log_info(0x31080000):
originator(PL), code(0x08), sub_code(0x0000)
May 14 15:48:42 localhost kernel: Read(10): 28 00 48 09 3b 50 00 00 08 00
May 14 15:48:42 localhost kernel: sd 11:0:34:0: [sdah] CDB: Read(10):
28 00 21 56 33 98 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:39:0: [sdam] CDB: Read(10):
28 00 28 d9 7e d8 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:27:0: [sdaa] CDB: Read(10):
28 00 26 aa 3e a8 00 00 60 00
May 14 15:48:42 localhost kernel: sd 11:0:50:0: [sdax] CDB: Read(10):
28 00 21 56 33 98 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:37:0: [sdak] CDB: Read(10):
28 00 28 d9 92 68 00 00 08 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Done: SUCCESS
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Result:
hostbyte=invalid driverbyte=DRIVER_OK
May 14 15:48:42 localhost kernel: sd 11:0:18:0: [sdr] CDB:
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB:
Read(10)Read(10):: 28 28 00 00 27 28 c7 1b f0 23 80 c8 00 00 00 e8 00
00
May 14 15:48:42 localhost kernel: 60 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Sense Key :
Medium Error [current]
May 14 15:48:42 localhost kernel: sd 11:0:13:0: [sdm] CDB: Info fld=0x281b23f2
May 14 15:48:42 localhost kernel: Read(10): 28
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] 00 26 aa
3eAdd. Sense: Unrecovered read error a8
May 14 15:48:42 localhost kernel: 00 00 68 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Unhandled sense code
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Result:
hostbyte=invalid driverbyte=DRIVER_SENSE
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Sense Key :
Medium Error [current]
May 14 15:48:42 localhost kernel: Info fld=0x281b23f2
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Add. Sense:
Unrecovered read error
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 23 c8 00 00 60 00
May 14 15:48:42 localhost kernel: end_request: critical target error,
dev sdaw, sector 672867314
May 14 15:48:42 localhost kernel: end_request: I/O error, dev dm-48,
sector 672868344
May 14 15:48:42 localhost kernel: sd 11:0:6:0: [sdf] CDB: Read(10): 28
00 21 56 33 90 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:23:0: [sdw] CDB: Read(10):
28 00 21 56 33 90 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:13:0: [sdm] CDB: Read(10):
28 00 26 aa 3f 98 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:0:0: [sda] CDB: Read(10): 28
00 28 d9 7e d8 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:44:0: [sdar] CDB: Read(10):
28 00 28 d9 7f e8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:60:0: [sdbh] CDB: Read(10):
28 00 26 aa 3e b0 00 00 60 00
May 14 15:48:42 localhost kernel: sd 11:0:39:0: [sdam] CDB: Read(10):
28 00 48 09 3b 50 00 00 08 00
May 14 15:48:42 localhost kernel: sd 11:0:26:0: [sdz] CDB: Read(10):
28 00 21 56 34 20 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:32:0: [sdaf] CDB: Read(10):
28 00 26 aa
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Done: 3fSUCCESS
May 14 15:48:42 localhost kernel: 10
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] 00Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 14 15:48:42 localhost kernel: 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: 20Read(10) 00:
May 14 15:48:42 localhost kernel: 28 00 28 1b 26 88 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Unhandled error code
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 26 88 00 00
May 14 15:48:42 localhost kernel: sd 11:0:32:0: [sdaf] CDB: 20Read(10) 00:
May 14 15:48:42 localhost kernel: 28
May 14 15:48:42 localhost kernel: end_request: I/O error, dev sdaw,
sector 672867976
May 14 15:48:42 localhost kernel: 00 26 aa 3e 08 00 01 08 00
May 14 15:48:42 localhost kernel: sd 11:0:27:0: [sdaa] CDB: Read(10):
28 00 26 aa 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:36:0: [sdaj] CDB: Read(10):
28 00 26 aa 3f 10 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:9:0: [sdi] CDB: Read(10): 28
00 27 c7 f0 e0 00 00 c8 00
May 14 15:48:42 localhost kernel: sd 11:0:38:0: [sdal] CDB: Read(10):
28 00 28 d9 7f e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:37:0: [sdak] CDB: Read(10):
28 00 28 d9 7f e8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:9:0: [sdi] CDB: Read(10): 28
00 27 c7 f1 d0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:59:0: [sdbg] CDB: Read(10):
28 00 28 d9 7e d8 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:28:0: [sdab] CDB: Read(10):
28 00 26 aa 3f 10 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:24:0: [sdx] CDB: Read(10):
28 00 26 aa 3f 10 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:33:0: [sdag] CDB: Read(10):
28 00 28 d9 7e f8 00 00 c8 00
May 14 15:48:42 localhost kernel: sd 11:0:33:0: [sdag] CDB: Read(10):
28 00 28 d9 7f e8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:33:0: [sdag] CDB: Read(10):
28 00 28 d9 92 68 00 00 08 00
May 14 15:48:42 localhost kernel: sd 11:0:10:0: [sdj] CDB: Read(10):
28 00 28 d9 7f e8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:27:0: [sdaa] CDB: Read(10):
28 00 26 aa 3f 10 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:55:0: [sdbc] CDB: Read(10):
28 00 27 c7 f1 70 00 00 80 00
May 14 15:48:42 localhost kernel: sd 11:0:55:0: [sdbc] CDB: Read(10):
28 00 27 c7 f0 80 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Done: SUCCESS
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 25 38 00 00 80 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Unhandled error code
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB: Read(10):
28 00 28 1b 25 38 00 00 80 00
May 14 15:48:42 localhost kernel: end_request: I/O error, dev sdaw,
sector 672867640
May 14 15:48:42 localhost kernel: sd 11:0:60:0: [sdbh] CDB: Read(10):
28 00 26 aa 3f 10 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:19:0: [sds] CDB: Read(10):
28 00 28 d9 7f e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:25:0: [sdy] CDB: Read(10):
28 00 27 c7 f1 68 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:18:0: [sdr] CDB: Read(10):
28 00 27 c7 f1 70 00 00 80 00
May 14 15:48:42 localhost kernel: sd 11:0:25:0: [sdy] CDB: Read(10):
28 00 27 c7 f0 80 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:17:0: [sdq] CDB: Read(10):
28 00 21 56 34 20 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:22:0: [sdv] CDB: Read(10):
28 00 28 d9 7e d0 00 00 f0 00
May 14 15:48:42 localhost kernel: end_request: I/O error, dev dm-48,
sector 672867976
May 14 15:48:42 localhost kernel: sd 11:0:59:0: [sdbg] CDB: Read(10):
28 00 28 d9 7f e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:43:0: [sdaq] CDB: Read(10):
28 00 27 c7 f0 78 00 01 70 00
May 14 15:48:42 localhost kernel: sd 11:0:8:0: [sdh] CDB: Read(10): 28
00 27 c7 f1 68 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:8:0: [sdh] CDB: Read(10): 28
00 27 c7 f0 80 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:39:0: [sdam] CDB: Read(10):
28 00 28 d9 7f e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:51:0: [sday] CDB: Read(10):
28 00 27 e9 30 e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:40:0: [sdan] CDB: Read(10):
28 00 27 e9 30 a0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:58:0: [sdbf] CDB: Read(10):
28 00 27 e9 30 e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:16:0: [sdp] CDB: Read(10):
28 00 27 e9 30 a0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:42:0: [sdap] CDB: Read(10):
28 00 27 e9 30 e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:46:0: [sdat] CDB: Read(10):
28 00 27 e9 31 00 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:53:0: [sdba] CDB: Read(10):
28 00 27 e9 30 98 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:29:0: [sdac] CDB: Read(10):
28 00 27 e9 30 e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:56:0: [sdbd] CDB: Read(10):
28 00 27 e9 30 98 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:58:0: [sdbf] CDB: Read(10):
28 00 27 e9 31 00 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:47:0: [sdau] CDB: Read(10):
28 00 27 c7 f0 80 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:29:0: [sdac] CDB: Read(10):
28 00 27 e9 30 38 00 00 40 00
May 14 15:48:42 localhost kernel: sd 11:0:46:0: [sdat] CDB: Read(10):
28 00 27 e9 30 98 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:42:0: [sdap] CDB: Read(10):
28 00 27 e9 31 00 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:47:0: [sdau] CDB: Read(10):
28 00 27 c7 f1 68 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:16:0: [sdp] CDB: Read(10):
28 00 27 e9 30 e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:42:0: [sdap] CDB: Read(10):
28 00 27 e9 30 98 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:16:0: [sdp] CDB: Read(10):
28 00 27 e9 31 00 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:56:0: [sdbd] CDB: Read(10):
28 00 27 e9 30 d8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:59:0: [sdbg] CDB: Read(10):
28 00 48 09 3b 50 00 00 08 00
May 14 15:48:42 localhost kernel: sd 11:0:16:0: [sdp] CDB: Read(10):
28 00 27 e9 31 48 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:56:0: [sdbd] CDB: Read(10):
28 00 27 e9 31 40 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:35:0: [sdai] CDB: Read(10):
28 00 26 aa 3e a8 00 00 60 00
May 14 15:48:42 localhost kernel: sd 11:0:56:0: [sdbd] CDB: Read(10):
28 00 27 e9 30 f8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:46:0: [sdat] CDB: Read(10):
28 00 27 e9 30 d8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:41:0: [sdao] CDB: Read(10):
28 00 26 aa 3e b0 00 00 60 00
May 14 15:48:42 localhost kernel: sd 11:0:41:0: [sdao] CDB: Read(10):
28 00 26 aa 3f 10 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:36:0: [sdaj] CDB: Read(10):
28 00 26 aa 3f 38 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:6:0: [sdf] CDB: Read(10): 28
00 21 56 34 60 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:45:0: [sdas] CDB: Read(10):
28 00 21 56 34 60 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:32:0: [sdaf] CDB: Read(10):
28 00 26 aa 3f 30 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:13:0: [sdm] CDB: Read(10):
28 00 26 aa 3f 10 00 00 40 00
May 14 15:48:42 localhost kernel: sd 11:0:7:0: [sdg] CDB: Read(10): 28
00 21 56 34 60 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:7:0: [sdg] CDB: Read(10): 28
00 21 56 34 40 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:11:0: [sdk] CDB: Read(10):
28 00 21 56 34 60 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:34:0: [sdah] CDB: Read(10):
28 00 21 56 34 40 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:23:0: [sdw] CDB: Read(10):
28 00 21 56 34 58 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:6:0: [sdf] CDB: Read(10): 28
00 21 56 34 38 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:38:0: [sdal] CDB: Read(10):
28 00 28 d9 7f c0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:17:0: [sdq] CDB: Read(10):
28 00 21 56 34 40 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:5:0: [sde] CDB: Read(10): 28
00 27 c7 f0 80 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:5:0: [sde] CDB: Read(10): 28
00 27 c7 f1 68 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:53:0: [sdba] CDB: Read(10):
28 00 27 e9 30 d8 00 00 40 00
May 14 15:48:42 localhost kernel: sd 11:0:3:0: [sdd] CDB: Read(10): 28
00 27 e9 30 98 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:35:0: [sdai] CDB: Read(10):
28 00 26 aa 3f 10 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:53:0: [sdba] CDB: Read(10):
28 00 27 e9 31 20 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:45:0: [sdas] CDB: Read(10):
28 00 21 56 34 38 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:40:0: [sdan] CDB: Read(10):
28 00 27 e9 30 e0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:40:0: [sdan] CDB:
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Done: Read(10)SUCCESS
May 14 15:48:42 localhost kernel: :
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] 28Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 14 15:48:42 localhost kernel: 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB:
27Read(10) e9: 31 28 48 00 00 28 00 1b 20 24 00 d0
May 14 15:48:42 localhost kernel: 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Unhandled error code
May 14 15:48:42 localhost kernel: sd 11:0:60:0: [sdbh] CDB:
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] Read(10)Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 14 15:48:42 localhost kernel: :
May 14 15:48:42 localhost kernel: sd 11:0:49:0: [sdaw] CDB:
28Read(10) 00: 26 28 aa 00 3f 28 30 1b 00 24 00 d0 20 00 00 00
May 14 15:48:42 localhost kernel: 20 00
May 14 15:48:42 localhost kernel: end_request: I/O error, dev sdaw,
sector 672867536
May 14 15:48:42 localhost kernel: sd 11:0:11:0: [sdk] CDB: Read(10):
28 00 21 56 34 40 00 00 80 00
May 14 15:48:42 localhost kernel: sd 11:0:40:0: [sdan] CDB: Read(10):
28 00 27 e9 31 00 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:0:0: [sda] CDB: Read(10): 28
00 28 d9 7f e8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:29:0: [sdac] CDB: Read(10):
28 00 27 e9 31 00 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:42:0: [sdap] CDB: Read(10):
28 00 27 e9 30 c0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:51:0: [sday] CDB: Read(10):
28 00 27 e9 31 00 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:51:0: [sday] CDB: Read(10):
28 00 27 e9 31 60 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:51:0: [sday] CDB: Read(10):
28 00 27 e9 30 98 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:59:0: [sdbg] CDB: Read(10):
28 00 28 d9 7f c0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:58:0: [sdbf] CDB: Read(10):
28 00 27 e9 30 98 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:58:0: [sdbf] CDB: Read(10):
28 00 27 e9 31 60 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:0:0: [sda] CDB: Read(10): 28
00 28 d9 7f c0 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:11:0: [sdk] CDB: Read(10):
28 00 21 56 34 c8 00 00 40 00
May 14 15:48:42 localhost kernel: sd 11:0:17:0: [sdq] CDB: Read(10):
28 00 21 56 34 c8 00 00 40 00
May 14 15:48:42 localhost kernel: sd 11:0:44:0: [sdar] CDB: Read(10):
28 00 28 d9 80 08 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:38:0: [sdal] CDB: Read(10):
28 00 28 d9 80 00 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:45:0: [sdas] CDB: Read(10):
28 00 21 56 34 c0 00 00 40 00
May 14 15:48:42 localhost kernel: end_request: I/O error, dev dm-48,
sector 672867640
May 14 15:48:42 localhost kernel: sd 11:0:10:0: [sdj] CDB: Read(10):
28 00 28 d9 7f c8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:46:0: [sdat] CDB: Read(10):
28 00 27 e9 31 20 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:41:0: [sdao] CDB: Read(10):
28 00 26 aa 3f 38 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:12:0: [sdl] CDB: Read(10):
28 00 21 56 34 40 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:35:0: [sdai] CDB: Read(10):
28 00 26 aa 3f 30 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:11:0: [sdk] CDB: Read(10):
28 00 21 56 35 08 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:27:0: [sdaa] CDB: Read(10):
28 00 26 aa 3f 30 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:19:0: [sds] CDB: Read(10):
28 00 28 d9 7f c0 00 00 68 00
May 14 15:48:42 localhost kernel: sd 11:0:3:0: [sdd] CDB: Read(10): 28
00 27 e9 31 80 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:39:0: [sdam] CDB: Read(10):
28 00 28 d9 7f c0 00 00 68 00
May 14 15:48:42 localhost kernel: sd 11:0:3:0: [sdd] CDB: Read(10): 28
00 27 e9 30 b8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:59:0: [sdbg] CDB: Read(10):
28 00 28 d9 80 00 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:51:0: [sday] CDB: Read(10):
28 00 27 e9 30 b8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:40:0: [sdan] CDB: Read(10):
28 00 27 e9 30 c0 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:58:0: [sdbf] CDB: Read(10):
28 00 27 e9 30 b8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:53:0: [sdba] CDB: Read(10):
28 00 27 e9 30 b8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:20:0: [sdt] CDB: Read(10):
28 00 27 c7 f0 78 00 00 f0 00
May 14 15:48:42 localhost kernel: sd 11:0:20:0: [sdt] CDB: Read(10):
28 00 27 c7 f1 68 00 00 80 00
May 14 15:48:42 localhost kernel: sd 11:0:16:0: [sdp] CDB: Read(10):
28 00 27 e9 30 c0 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:57:0: [sdbe] CDB: Read(10):
28 00 26 aa 3f 10 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:42:0: [sdap] CDB: Read(10):
28 00 27 e9 31 20 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:56:0: [sdbd] CDB: Read(10):
28 00 27 e9 30 b8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:46:0: [sdat] CDB: Read(10):
28 00 27 e9 30 b8 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:6:0: [sdf] CDB: Read(10): 28
00 21 56 34 80 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:45:0: [sdas] CDB: Read(10):
28 00 21 56 35 08 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:46:0: [sdat] CDB: Read(10):
28 00 27 e9 31 c8 00 00 c8 00
May 14 15:48:42 localhost kernel: sd 11:0:24:0: [sdx] CDB: Read(10):
28 00 26 aa 3f 30 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:46:0: [sdat] CDB: Read(10):
28 00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:34:0: [sdah] CDB: Read(10):
28 00 21 56 34 80 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:17:0: [sdq] CDB: Read(10):
28 00 21 56 35 08 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:50:0: [sdax] CDB: Read(10):
28 00 21 56 34 40 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:33:0: [sdag] CDB: Read(10):
28 00 28 d9 7f c8 00 00 60 00
May 14 15:48:42 localhost kernel: sd 11:0:7:0: [sdg] CDB: Read(10): 28
00 21 56 34 80 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:22:0: [sdv] CDB: Read(10):
28 00 28 d9 7f c0 00 00 40 00
May 14 15:48:42 localhost kernel: sd 11:0:51:0: [sday] CDB: Read(10):
28 00 27 e9 31 20 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:23:0: [sdw] CDB: Read(10):
28 00 21 56 34 38 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:58:0: [sdbf] CDB: Read(10):
28 00 27 e9 31 20 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:40:0: [sdan] CDB: Read(10):
28 00 27 e9 31 a8 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:58:0: [sdbf] CDB: Read(10):
28 00 27 e9 32 08 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:40:0: [sdan] CDB: Read(10):
28 00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:58:0: [sdbf] CDB: Read(10):
28 00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:16:0: [sdp] CDB: Read(10):
28 00 27 e9 31 a8 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:16:0: [sdp] CDB: Read(10):
28 00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:42:0: [sdap] CDB: Read(10):
28 00 27 e9 32 10 00 00 80 00
May 14 15:48:42 localhost kernel: sd 11:0:42:0: [sdap] CDB: Read(10):
28 00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:38:0: [sdal] CDB: Read(10):
28 00 28 d9 80 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:53:0: [sdba] CDB: Read(10):
28 00 27 e9 31 40 00 01 50 00
May 14 15:48:42 localhost kernel: sd 11:0:51:0: [sday] CDB: Read(10):
28 00 27 e9 32 08 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:37:0: [sdak] CDB: Read(10):
28 00 28 d9 7f c8 00 00 60 00
May 14 15:48:42 localhost kernel: sd 11:0:53:0: [sdba] CDB: Read(10):
28 00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:51:0: [sday] CDB: Read(10):
28 00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:59:0: [sdbg] CDB: Read(10):
28 00 28 d9 80 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:0:0: [sda] CDB: Read(10): 28
00 28 d9 80 08 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:29:0: [sdac] CDB: Read(10):
28 00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:0:0: [sda] CDB: Read(10): 28
00 28 d9 80 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:3:0: [sdd] CDB: Read(10): 28
00 27 e9 31 e8 00 00 a8 00
May 14 15:48:42 localhost kernel: sd 11:0:3:0: [sdd] CDB: Read(10): 28
00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:29:0: [sdac] CDB: Read(10):
28 00 27 e9 32 70 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:29:0: [sdac] CDB: Read(10):
28 00 27 e9 30 98 00 01 b8 00
May 14 15:48:42 localhost kernel: sd 11:0:6:0: [sdf] CDB: Read(10): 28
00 21 56 35 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:56:0: [sdbd] CDB: Read(10):
28 00 27 e9 31 20 00 00 e8 00
May 14 15:48:42 localhost kernel: sd 11:0:45:0: [sdas] CDB: Read(10):
28 00 21 56 35 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:56:0: [sdbd] CDB: Read(10):
28 00 27 e9 32 08 00 00 88 00
May 14 15:48:42 localhost kernel: sd 11:0:34:0: [sdah] CDB: Read(10):
28 00 21 56 35 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:56:0: [sdbd] CDB: Read(10):
28 00 27 e9 3f 90 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:17:0: [sdq] CDB: Read(10):
28 00 21 56 35 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:10:0: [sdj] CDB: Read(10):
28 00 28 d9 80 08 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:44:0: [sdar] CDB: Read(10):
28 00 28 d9 80 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:7:0: [sdg] CDB: Read(10): 28
00 21 56 35 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:12:0: [sdl] CDB: Read(10):
28 00 21 56 35 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:33:0: [sdag] CDB: Read(10):
28 00 28 d9 80 28 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:57:0: [sdbe] CDB: Read(10):
28 00 26 aa 3f 30 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:17:0: [sdq] CDB: Read(10):
28 00 21 56 35 50 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:6:0: [sdf] CDB: Read(10): 28
00 21 56 35 48 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:46:0: [sdat] CDB: Read(10):
28 00 27 e9 32 50 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:45:0: [sdas] CDB: Read(10):
28 00 21 56 35 48 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:58:0: [sdbf] CDB: Read(10):
28 00 27 e9 32 50 00 00 20 00
May 14 15:48:42 localhost kernel: sd 11:0:51:0: [sday] CDB: Read(10):
28 00 27 e9 32 50 00 00 20 00
May 14 15:48:42 localhost kernel: end_request: critical target error,
dev dm-48, sector 672867272
then some other end_request: I/O error, dev dm-48.
This one I'm reporting is one of the worst case I found because Sense
Key is reporting a Medium Error.
I managed to replicate the same scenario even with other Sense Key
like [sense_key,asc,ascq]: [0x0b,0x47,0x03] which seems to be a iuCRC
error.
This happens to a lot of disks, not always to the same one.
And, oh, about the disk sdaw -> dm-48, it's fully working, I've
managed to read with dd all the sectors that caused the read error.
The error anyway where logged by smart:
SMART Error Log Version: 1
ATA Error Count: 1
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 1 occurred at disk power-on lifetime: 2260 hours (94 days + 4 hours)
When the command that caused the error occurred, the device was
active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 36 f2 23 1b 08
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
60 80 28 38 25 1b 40 00 5d+00:18:26.240 READ FPDMA QUEUED
60 20 20 18 25 1b 40 00 5d+00:18:26.237 READ FPDMA QUEUED
60 20 18 88 26 1b 40 00 5d+00:18:26.235 READ FPDMA QUEUED
60 20 10 d0 24 1b 40 00 5d+00:18:26.232 READ FPDMA QUEUED
60 e8 08 28 24 1b 40 00 5d+00:18:26.230 READ FPDMA QUEUED
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
please notice kernel: "28 1b 25 38" and smart: "28 38 25 1b", looks to
be the same sector...
What you can see in the logs is that "Result: hostbyte=DID_SOFT_ERROR
driverbyte=DRIVER_OK" make the multipath fail the only path, without
any retry.
I simulated the same scenario with a patch in scsi_debug.ko and I
verified that presenting to multipath the same scenario :
[sense_key,asc,ascq]: [0x0b,0x47,0x03
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
the behaviour is different:
May 18 16:46:10 localhost kernel: scsi_debug: cmd 28 00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: FOO: simulating error:
sense_key,asc,ascq: 0x0b,0x47,0x03
May 18 16:46:10 localhost kernel: scsi_debug: [sense_key,asc,ascq]:
[0xb,0x47,0x3]
May 18 16:46:10 localhost kernel: scsi_debug: <25 0 0 0> non-zero
result=0xb0000
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Done: RETRY
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] CDB: Read(10): 28
00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: scsi_debug: cmd 28 00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: FOO: simulating error:
sense_key,asc,ascq: 0x0b,0x47,0x03
May 18 16:46:10 localhost kernel: scsi_debug: [sense_key,asc,ascq]:
[0xb,0x47,0x3]
May 18 16:46:10 localhost kernel: scsi_debug: <25 0 0 0> non-zero
result=0xb0000
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Done: RETRY
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] CDB: Read(10): 28
00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: scsi_debug: cmd 28 00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: FOO: simulating error:
sense_key,asc,ascq: 0x0b,0x47,0x03
May 18 16:46:10 localhost kernel: scsi_debug: [sense_key,asc,ascq]:
[0xb,0x47,0x3]
May 18 16:46:10 localhost kernel: scsi_debug: <25 0 0 0> non-zero
result=0xb0000
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Done: RETRY
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] CDB: Read(10): 28
00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: scsi_debug: cmd 28 00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: FOO: simulating error:
sense_key,asc,ascq: 0x0b,0x47,0x03
May 18 16:46:10 localhost kernel: scsi_debug: [sense_key,asc,ascq]:
[0xb,0x47,0x3]
May 18 16:46:10 localhost kernel: scsi_debug: <25 0 0 0> non-zero
result=0xb0000
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Done: RETRY
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] CDB: Read(10): 28
00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: scsi_debug: cmd 28 00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: FOO: simulating error:
sense_key,asc,ascq: 0x0b,0x47,0x03
May 18 16:46:10 localhost kernel: scsi_debug: [sense_key,asc,ascq]:
[0xb,0x47,0x3]
May 18 16:46:10 localhost kernel: scsi_debug: <25 0 0 0> non-zero
result=0xb0000
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Done: RETRY
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] CDB: Read(10): 28
00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: scsi_debug: cmd 28 00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: FOO: simulating error:
sense_key,asc,ascq: 0x0b,0x47,0x03
May 18 16:46:10 localhost kernel: scsi_debug: [sense_key,asc,ascq]:
[0xb,0x47,0x3]
May 18 16:46:10 localhost kernel: scsi_debug: <25 0 0 0> non-zero
result=0xb0000
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Done: SUCCESS
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] CDB: Read(10): 28
00 00 00 12 34 00 00 01 00
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Unhandled error code
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] Result:
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
May 18 16:46:10 localhost kernel: sd 25:0:0:0: [sdb] CDB: Read(10): 28
00 00 00 12 34 00 00 01 00
with all the errors I've logged from mpt2sas in junction with
multipath I never managed to log a single "Done: RETRY",
it really looks like multipath kernel moduel is failing a path using
mpt2sas at a glance of an error, without even retrying once.
Is it the correct behaviour or am I missing something ?
Are the retries handled in a different manner between mpt2sas and scsi_debug ?
Please keep in mind that scsi_debug was coded to make fail forever
sector 0x1234 and after the last "Done: RETRY" multipath failed to
path to the device.
This is the behaviour that I would like to see when using mpt2sas.
Thank you
Francesco Santagata
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
[SCSI Target Devel]
[Linux SCSI Target Infrastructure]
[Kernel Newbies]
[Share Photos]
[IDE]
[Security]
[Git]
[Netfilter]
[Bugtraq]
[Photos]
[Yosemite]
[Yosemite News]
[MIPS Linux]
[ARM Linux]
[Linux Security]
[Linux RAID]
[Linux ATA RAID]
[Linux IIO]
[Samba]
[Video 4 Linux]
[Device Mapper]
[Linux Resources]