Ok here are the news.
After the started rebuild went very well and without any problems, some hours later drive 1 again dropped out of the raid...
At that precious time there was heavy I/O, because I was copying about 1 TB of data from one single drive in the device to the raid 5.
About 800 GB have been copied, when the error occoured.
Code: Select all
[/] # dmesg
, 0, 0].
Retried request is finished...MV_Request: Cdb[28, 0, 0,53, 14,84, 0, 0, 80, 0, 0, 0].
appRequest.cgi[27480]: segfault at 089ef1c8 eip 0804cc59 esp bfd01ddc error 4
md0: bitmap file is out of date (0 < 31451) -- forcing full recovery
md0: bitmap file is out of date, doing full recovery
md0: bitmap initialized from disk: read 11/11 pages, set 357317 bits
created bitmap (175 pages) for device md0
Interrupt Error: 0x40000080 orgIntStatus: 0x40000081 completeSlot=0x2.
Toggle CMD register start stop bit at port 0x0.
Abort error requests....
MV_Request: Cdb[28, 0,25,6a, cc,84, 0, 0, 78, 0, 0, 0].
Abort error requests....
MV_Request: Cdb[2a, 0,25,6a, d2,fc, 0, 1, 0, 0, 0, 0].
Abort error requests....
MV_Request: Cdb[2a, 0,25,6a, d4,fc, 0, 1, 0, 0, 0, 0].
Abort error requests....
MV_Request: Cdb[2a, 0,25,6a, cc,fc, 0, 1, 0, 0, 0, 0].
Abort error requests....
MV_Request: Cdb[2a, 0,25,6a, cf,fc, 0, 1, 0, 0, 0, 0].
Abort error requests....
MV_Request: Cdb[2a, 0,25,6a, ce,fc, 0, 1, 0, 0, 0, 0].
Abort error requests....
MV_Request: Cdb[2a, 0,25,6a, d3,fc, 0, 1, 0, 0, 0, 0].
Abort error requests....
MV_Request: Cdb[2a, 0,25,6a, d0,fc, 0, 1, 0, 0, 0, 0].
Abort error requests....
MV_Request: Cdb[2a, 0,25,6a, d1,fc, 0, 1, 0, 0, 0, 0].
Abort error requests....
MV_Request: Cdb[2a, 0,25,6a, cd,fc, 0, 1, 0, 0, 0, 0].
Device_IssueReadLogExt on device 0x0.
Read Log Ext is finished on device 0x0.
Retry request...MV_Request: Cdb[2a, 0,25,6a, cd,fc, 0, 1, 0, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[2a, 0,25,6a, cd,fc, 0, 1, 0, 0, 0, 0].
Retry request...MV_Request: Cdb[2a, 0,25,6a, d1,fc, 0, 1, 0, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[2a, 0,25,6a, d1,fc, 0, 1, 0, 0, 0, 0].
Retry request...MV_Request: Cdb[2a, 0,25,6a, d0,fc, 0, 1, 0, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[2a, 0,25,6a, d0,fc, 0, 1, 0, 0, 0, 0].
Retry request...MV_Request: Cdb[2a, 0,25,6a, d3,fc, 0, 1, 0, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[2a, 0,25,6a, d3,fc, 0, 1, 0, 0, 0, 0].
Retry request...MV_Request: Cdb[2a, 0,25,6a, ce,fc, 0, 1, 0, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[2a, 0,25,6a, ce,fc, 0, 1, 0, 0, 0, 0].
Retry request...MV_Request: Cdb[2a, 0,25,6a, cf,fc, 0, 1, 0, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[2a, 0,25,6a, cf,fc, 0, 1, 0, 0, 0, 0].
Retry request...MV_Request: Cdb[2a, 0,25,6a, cc,fc, 0, 1, 0, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[2a, 0,25,6a, cc,fc, 0, 1, 0, 0, 0, 0].
Retry request...MV_Request: Cdb[2a, 0,25,6a, d4,fc, 0, 1, 0, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[2a, 0,25,6a, d4,fc, 0, 1, 0, 0, 0, 0].
Retry request...MV_Request: Cdb[2a, 0,25,6a, d2,fc, 0, 1, 0, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[2a, 0,25,6a, d2,fc, 0, 1, 0, 0, 0, 0].
Retry request...MV_Request: Cdb[28, 0,25,6a, cc,84, 0, 0, 78, 0, 0, 0].
Retried request is finished...MV_Request: Cdb[28, 0,25,6a, cc,84, 0, 0, 78, 0, 0, 0].
Port_Monitor: Running_Slot=0x3d50850b.
MV_Request: Cdb[2a, 0,2e,2f, 54,84, 0, 1, 0, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 59,84, 0, 1, 0, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 5a,84, 0, 0, 40, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 50,84, 0, 1, 0, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 51,84, 0, 1, 0, 0, 0, 0].
MV_Request: Cdb[28, 0,2e,2f, 50,44, 0, 0, 40, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 57,84, 0, 1, 0, 0, 0, 0].
MV_Request: Cdb[2a, 0, 0, 0, 89,df, 0, 0, 8, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 52,84, 0, 1, 0, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 53,84, 0, 1, 0, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 56,84, 0, 1, 0, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 58,84, 0, 1, 0, 0, 0, 0].
MV_Request: Cdb[2a, 0,2e,2f, 55,84, 0, 1, 0, 0, 0, 0].
ENABLE_WRITE_CACHE (current: enabled).
__MV__ reset handler f7c6dc80.
sd 2:0:0:0: Device offlined - not ready after error recovery
sd 2:0:0:0: Device offlined - not ready after error recovery
sd 2:0:0:0: Device offlined - not ready after error recovery
sd 2:0:0:0: Device offlined - not ready after error recovery
sd 2:0:0:0: Device offlined - not ready after error recovery
sd 2:0:0:0: Device offlined - not ready after error recovery
sd 2:0:0:0: Device offlined - not ready after error recovery
sd 2:0:0:0: [sda] Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
end_request: I/O error, dev sda, sector 774853764
raid5: Disk failure on sda3, disabling device. Operation continuing on 3 devices
sd 2:0:0:0: [sda] Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
end_request: I/O error, dev sda, sector 774854020
sd 2:0:0:0: [sda] Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
end_request: I/O error, dev sda, sector 774854276
sd 2:0:0:0: [sda] Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
end_request: I/O error, dev sda, sector 774854532
sd 2:0:0:0: [sda] Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
end_request: I/O error, dev sda, sector 774854788
sd 2:0:0:0: [sda] Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
end_request: I/O error, dev sda, sector 774855044
sd 2:0:0:0: [sda] Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
end_request: I/O error, dev sda, sector 774855300
sd 2:0:0:0: rejecting I/O to offline device
sd 2:0:0:0: rejecting I/O to offline device
md: super_written gets error=-5, uptodate=0
sd 2:0:0:0: rejecting I/O to offline device
sd 2:0:0:0: rejecting I/O to offline device
md: super_written gets error=-5, uptodate=0
raid1: Disk failure on sda1, disabling device.
Operation continuing on 4 devices
sd 2:0:0:0: rejecting I/O to offline device
sd 2:0:0:0: rejecting I/O to offline device
sd 2:0:0:0: rejecting I/O to offline device
md: super_written gets error=-5, uptodate=0
sd 2:0:0:0: rejecting I/O to offline device
RAID1 conf printout:
--- wd:4 rd:5
disk 0, wo:1, o:0, dev:sda1
disk 1, wo:0, o:1, dev:sdb1
disk 2, wo:0, o:1, dev:sdc1
disk 3, wo:0, o:1, dev:sdd1
disk 4, wo:0, o:1, dev:sde1
RAID1 conf printout:
--- wd:4 rd:5
disk 1, wo:0, o:1, dev:sdb1
disk 2, wo:0, o:1, dev:sdc1
disk 3, wo:0, o:1, dev:sdd1
disk 4, wo:0, o:1, dev:sde1
RAID5 conf printout:
--- rd:4 wd:3
disk 0, o:0, dev:sda3
disk 1, o:1, dev:sdb3
disk 2, o:1, dev:sdc3
disk 3, o:1, dev:sdd3
RAID5 conf printout:
--- rd:4 wd:3
disk 1, o:1, dev:sdb3
disk 2, o:1, dev:sdc3
disk 3, o:1, dev:sdd3
sd 2:0:0:0: rejecting I/O to offline device
sd 2:0:0:0: rejecting I/O to offline device
active port 0 :139
active port 1 :445
active port 2 :20
sd 2:0:0:0: rejecting I/O to offline device
sd 2:0:0:0: rejecting I/O to offline device
raid1: Disk failure on sda2, disabling device.
Operation continuing on 1 devices
RAID1 conf printout:
--- wd:1 rd:2
disk 0, wo:1, o:0, dev:sda2
disk 1, wo:0, o:1, dev:sdb2
RAID1 conf printout:
--- wd:1 rd:2
disk 1, wo:0, o:1, dev:sdb2
RAID1 conf printout:
--- wd:1 rd:2
disk 0, wo:1, o:1, dev:sde2
disk 1, wo:0, o:1, dev:sdb2
RAID1 conf printout:
--- wd:1 rd:2
disk 0, wo:1, o:1, dev:sde2
disk 1, wo:0, o:1, dev:sdb2
md: recovery of RAID array md5
md: minimum _guaranteed_ speed: 2000000 KB/sec/disk.
md: using maximum available idle IO bandwidth (but not more than 2000000 KB/sec) for recovery.
md: using 128k window, over a total of 530048 blocks.
md: unbind<sda2>
md: export_rdev(sda2)
md: unbind<sda1>
md: export_rdev(sda1)
raid1: Disk failure on sda4, disabling device.
Operation continuing on 3 devices
md: unbind<sda4>
md: export_rdev(sda4)
md: unbind<sda3>
md: export_rdev(sda3)
active port 0 :139
active port 1 :445
active port 2 :20
md: md5: recovery done.
RAID1 conf printout:
--- wd:2 rd:2
disk 0, wo:0, o:1, dev:sde2
disk 1, wo:0, o:1, dev:sdb2
[/] # dmesg
Again, I removed the disk from the nas, formatted it on my pc and reinserted it - immediately the raid rebuild started again
The bad thing is, that I'm really feeling very bad about my nas now...I think I will never trust that device completely again
One of the reasons I bought myself a prebuild nas system (and did not assemble one by myself) was that I expected those systems to run stable and reliable out of the box...obviously I was wrong.
At the moment I'm quite disappointed.