<p dir="ltr">Overeager head parking?</p>
<div class="gmail_quote">On Feb 3, 2015 6:39 PM, "Mark Mitchell" <<a href="mailto:mark.russel.mitchell@gmail.com">mark.russel.mitchell@gmail.com</a>> wrote:<br type="attribution"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">I'm running my first RAID array in a machine I built just short of a<br>
year ago. I'm getting repeated messages in kern.log about ata resets<br>
on 2 ata channels.<br>
<br>
I took one of the affected drives out of the array, and ran a smart<br>
long test on them (smart.sdd.txt, attached). It shows a head flying<br>
time of 6912h+43m+51.802s (around 288 days).<br>
<br>
All of the drives on the system are showing pre-fail and OldAge in the<br>
smart reports. I'm finding this difficult to believe, all of them<br>
except sda are only about a year old.<br>
<br>
Do I really have to go out and buy a bunch of new 3TB drives?<br>
<br>
Here are some representative errors from kern.log;<br>
<br>
==> /var/log/kern.log <==<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092255] ata5.00:<br>
exception Emask 0x10 SAct 0x40000001 SErr 0x10200 action 0xe frozen<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092259] ata5.00: irq_stat<br>
0x00400000, PHY RDY changed<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092262] ata5: SError: {<br>
Persist PHYRdyChg }<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092265] ata5.00: failed<br>
command: READ FPDMA QUEUED<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092269] ata5.00: cmd<br>
60/a0:00:22:c0:0a/00:00:09:00:00/40 tag 0 ncq 81920 in<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092269] res<br>
40/00:00:22:c0:0a/00:00:09:00:00/40 Emask 0x10 (ATA bus error)<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092272] ata5.00: status: { DRDY }<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092274] ata5.00: failed<br>
command: READ FPDMA QUEUED<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092278] ata5.00: cmd<br>
60/08:f0:72:f9:66/02:00:08:00:00/40 tag 30 ncq 266240 in<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092278] res<br>
40/00:00:22:c0:0a/00:00:09:00:00/40 Emask 0x10 (ATA bus error)<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092281] ata5.00: status: { DRDY }<br>
Feb 3 18:31:46 home-desktop kernel: [611894.092285] ata5: hard resetting link<br>
Feb 3 18:31:51 home-desktop kernel: [611899.409269] ata5: SATA link<br>
up 1.5 Gbps (SStatus 113 SControl 310)<br>
Feb 3 18:31:51 home-desktop kernel: [611899.435209] ata5.00:<br>
configured for UDMA/33<br>
Feb 3 18:31:51 home-desktop kernel: [611899.449242] ata5: EH complete<br>
Feb 3 18:32:17 home-desktop kernel: [611925.496050] ata6: exception<br>
Emask 0x10 SAct 0x0 SErr 0x10002 action 0xe frozen<br>
Feb 3 18:32:17 home-desktop kernel: [611925.496054] ata6: irq_stat<br>
0x00400000, PHY RDY changed<br>
Feb 3 18:32:17 home-desktop kernel: [611925.496057] ata6: SError: {<br>
RecovComm PHYRdyChg }<br>
Feb 3 18:32:17 home-desktop kernel: [611925.496061] ata6: hard resetting link<br>
Feb 3 18:32:22 home-desktop kernel: [611930.406105] ata5: exception<br>
Emask 0x10 SAct 0x0 SErr 0x10200 action 0xe frozen<br>
Feb 3 18:32:22 home-desktop kernel: [611930.406109] ata5: irq_stat<br>
0x00400000, PHY RDY changed<br>
Feb 3 18:32:22 home-desktop kernel: [611930.406111] ata5: SError: {<br>
Persist PHYRdyChg }<br>
Feb 3 18:32:22 home-desktop kernel: [611930.406116] ata5: hard resetting link<br>
Feb 3 18:32:24 home-desktop kernel: [611932.038938] ata6: SATA link<br>
up 1.5 Gbps (SStatus 113 SControl 310)<br>
Feb 3 18:32:28 home-desktop kernel: [611935.720865] ata5: SATA link<br>
up 1.5 Gbps (SStatus 113 SControl 310)<br>
Feb 3 18:32:28 home-desktop kernel: [611935.739014] ata5.00:<br>
configured for UDMA/33<br>
Feb 3 18:32:28 home-desktop kernel: [611935.752837] ata5: EH complete<br>
Feb 3 18:32:29 home-desktop kernel: [611937.036124] ata6.00: qc<br>
timeout (cmd 0xec)<br>
Feb 3 18:32:29 home-desktop kernel: [611937.036135] ata6.00: failed<br>
to IDENTIFY (I/O error, err_mask=0x4)<br>
Feb 3 18:32:29 home-desktop kernel: [611937.036137] ata6.00:<br>
revalidation failed (errno=-5)<br>
Feb 3 18:32:29 home-desktop kernel: [611937.036141] ata6: hard resetting link<br>
Feb 3 18:32:30 home-desktop kernel: [611937.527854] ata6: SATA link<br>
up 1.5 Gbps (SStatus 113 SControl 310)<br>
Feb 3 18:32:30 home-desktop kernel: [611937.528629] ata6.00: supports<br>
DRM functions and may not be fully accessible<br>
Feb 3 18:32:30 home-desktop kernel: [611937.529644] ata6.00: supports<br>
DRM functions and may not be fully accessible<br>
Feb 3 18:32:30 home-desktop kernel: [611937.529824] ata6.00:<br>
configured for UDMA/33<br>
Feb 3 18:32:30 home-desktop kernel: [611937.529997] ata6: EH complete<br>
<br>
Here's my drive layout;<br>
mark@home-desktop:~$ sudo lsblk<br>
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT<br>
sda 8:0 0 931.5G 0 disk<br>
├─sda1 8:1 0 37M 0 part /boot/efi<br>
├─sda2 8:2 0 37.3G 0 part [SWAP]<br>
├─sda3 8:3 0 860.8G 0 part /home<br>
└─sda4 8:4 0 33.5G 0 part /<br>
sdb 8:16 0 2.7T 0 disk<br>
└─sdb1 8:17 0 2.7T 0 part<br>
└─md0 9:0 0 8.2T 0 raid5<br>
└─md0p1 259:0 0 8.2T 0 md /srv/media<br>
sdc 8:32 0 2.7T 0 disk<br>
└─sdc1 8:33 0 2.7T 0 part<br>
└─md0 9:0 0 8.2T 0 raid5<br>
└─md0p1 259:0 0 8.2T 0 md /srv/media<br>
sdd 8:48 0 2.7T 0 disk<br>
└─sdd1 8:49 0 2.7T 0 part<br>
└─md0 9:0 0 8.2T 0 raid5<br>
└─md0p1 259:0 0 8.2T 0 md /srv/media<br>
sde 8:64 0 2.7T 0 disk<br>
└─sde1 8:65 0 2.7T 0 part<br>
└─md0 9:0 0 8.2T 0 raid5<br>
└─md0p1 259:0 0 8.2T 0 md /srv/media<br>
sr0 11:0 1 4.3G 0 rom<br>
<br>_______________________________________________<br>
TCLUG Mailing List - Minneapolis/St. Paul, Minnesota<br>
<a href="mailto:tclug-list@mn-linux.org">tclug-list@mn-linux.org</a><br>
<a href="http://mailman.mn-linux.org/mailman/listinfo/tclug-list" target="_blank">http://mailman.mn-linux.org/mailman/listinfo/tclug-list</a><br>
<br></blockquote></div>