I'm getting these errors randomly, and I don't know if it's normal or not.
[39441.061856] ata3.00: failed to read SCR 1 (Emask=0x40)
[39441.061866] ata3.01: failed to read SCR 1 (Emask=0x40)
[39441.061892] ata3.15: exception Emask 0x10 SAct 0x0 SErr 0x280100 action 0x6 frozen
[39441.061897] ata3.15: irq_stat 0x08000000, interface fatal error
[39441.061904] ata3.15: SError: { UnrecovData 10B8B BadCRC }
[39441.061910] ata3.00: exception Emask 0x100 SAct 0x0 SErr 0x0 action 0x6 frozen
[39441.061917] ata3.01: exception Emask 0x100 SAct 0xe SErr 0x0 action 0x6 frozen
[39441.061923] ata3.01: failed command: READ FPDMA QUEUED
[39441.061933] ata3.01: cmd 60/a8:08:b0:48:62/00:00:00:00:00/40 tag 1 ncq 86016 in
[39441.061940] ata3.01: status: { DRDY }
[39441.061944] ata3.01: failed command: READ FPDMA QUEUED
[39441.061953] ata3.01: cmd 60/a8:10:b0:49:62/00:00:00:00:00/40 tag 2 ncq 86016 in
[39441.061959] ata3.01: status: { DRDY }
[39441.061963] ata3.01: failed command: READ FPDMA QUEUED
[39441.061972] ata3.01: cmd 60/58:18:58:4a:62/00:00:00:00:00/40 tag 3 ncq 45056 in
[39441.061978] ata3.01: status: { DRDY }
[39441.061987] ata3.15: hard resetting link
[39441.608302] ata3.15: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[39441.609090] ata3.00: hard resetting link
[39441.929246] ata3.00: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
[39441.929333] ata3.01: hard resetting link
[39442.249184] ata3.01: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
[39442.263242] ata3.00: configured for UDMA/133
[39442.277570] ata3.01: configured for UDMA/133
[39442.277725] ata3: EH complete
I'm also pasting the smartctl -a output for sda, sdb and sdc.
Thanks in advance for your help.
While I essentially agree with Geppettvs D'Constanzo's answer, I would suggest that some of the first things you might also try are:
Checking that your SATA cable is securely attached and plugged into the sockets on the motherboard and hard drive.
Replacing your SATA cable. SATA cables are (relatively) inexpensive and you do sometimes get a "bad" one. Often simply replacing the cable is the easiest way to diagnose and solve a problem like this.
(Although it is somewhat unexpected that two cables would both be bad at the same time. Still, it's an easy thing to check so in my opinion probably worth doing.)
I just saw your pastebins containing the SMART data for your drives. Notice the unexpectedly large number of CRC errors for drives sdb and sdc. I suggest you start by checking the cables and connections for those drives.
OK. So not a laptop then. ;-)
Of course, if this is happening on a laptop then none of the above applies and I'm not sure what advice to offer. Maybe remove and re-install the hard drive? Perhaps it just needs to be re-seated in its socket to improve the connection?
sdb and sdc are connected on the same external eSATA cable (Thermaltake Duo HDD Dock). I'll replace my eSATA cable.
It could be due to a faulty or low-quality cable. It could also be that the cable is somehow moved, bumped, or otherwise jostled while the drive is being used.
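One way to tell whether a cable swap helped is to compare the interface CRC counter in the SMART attributes before and after. The following is a minimal sketch, assuming smartmontools is installed and that the drive reports the counter as attribute 199 (commonly named UDMA_CRC_Error_Count; some drives use a different ID or name). The helper name crc_error_count is made up for illustration.

```shell
# Hypothetical helper: read `smartctl -A` output on stdin and print the raw
# value of attribute 199 (the interface CRC error counter on most drives).
# Assumes the usual 10-column attribute table, where the raw value is last.
crc_error_count() {
    awk '$1 == 199 { print $10 }'
}
```

For example, run `sudo smartctl -A /dev/sdb | crc_error_count` once before replacing the cable and again after some hours of use. The counter is cumulative on most drives, so what matters is whether it keeps increasing, not its absolute value.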
It looks like you have a poor-quality or damaged SATA power/data cable, which may be causing the bad CRCs. The CRC errors don't harm the drive itself, but if you keep living with them you are likely to lose data.
The SMART report of your hard disk drives looks sane, so I suspect a power supply issue, based on my experience fitting 5 hard disk drives in the same case on one power source. I ended up using an external power source (475W) for 2 drives and the case's 600W supply for everything else in the case, including GPU, optical and hard disk drives.
Anyway, I suggest you run a full backup before you do anything else. If possible, clone your hard disk drive, after which you should check your cables and power source voltages.
There seems to be a problem between some kernel versions and some SATA controllers.
I have recently started to suffer a very similar problem (not sure if it is exactly the same) on a web server running Scientific Linux.
The most accurate and complete information I have found about this problem is this Launchpad bug.
In short: Disabling NCQ seems to be the best workaround for users having this problem.
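As a sketch of what disabling NCQ can look like on Linux: setting a disk's queue depth to 1 through sysfs turns command queuing off for that disk until the next reboot, and the libata.force=noncq kernel boot parameter does the same for all ports at boot. The function names and the optional sysfs-root argument below are made up for illustration; check your kernel's documentation before relying on this.

```shell
# Hypothetical helpers. The second argument (sysfs root) exists only so the
# functions can be pointed at a test directory; it defaults to /sys.

# Print the current queue depth of a disk. A value above 1 means command
# queuing is in use; 1 means it is effectively off.
ncq_queue_depth() {
    cat "${2:-/sys}/block/$1/device/queue_depth"
}

# Set the queue depth to 1, which effectively disables NCQ for that disk.
# Needs root on a real system and does not survive a reboot.
ncq_disable() {
    echo 1 > "${2:-/sys}/block/$1/device/queue_depth"
}
```

For example, `ncq_queue_depth sdb` followed (as root) by `ncq_disable sdb`. If the errors stop, adding libata.force=noncq to the kernel command line is one way to make the change permanent.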
This error is unlikely to damage your hard drive but is highly likely to corrupt your filesystem(s). Begin by determining which drive is throwing the errors. This can usually be determined easily by a number of approaches such as:
1) Issuing the command
dmesg | grep ata3
and looking for the hard drive make and model (ata3 is the port throwing the error in your situation; adjust accordingly). This will provide output similar to this:
A quick glance indicates that the drive connected to ata3 is the SAMSUNG HD103SJ.
2) Issue the command below:
This will provide both the ports and the device names highlighted on the same line as seen below:
It's easy to see that the device connected to ata3 has been assigned the device name sdb
3) Install lsscsi with
sudo apt install lsscsi
and issue the command
lsscsi
Note that the first entry on each line above is the scsi_host, channel, target_number and LUN. It is placed in brackets and each element is colon separated. When there are multiple SCSI devices their entries are sorted in ascending order.
Simply adding 1 to the first number in each line of output gives you the ATA port. You can find more detail on lsscsi here and here.
Since in your case we are seeing errors thrown on both 3.00 and 3.01, you have more than one drive connected to the same ATA port. You are going to want to carefully check connectivity to both ata3.00 and ata3.01. This could be a multi-bay drive enclosure connected to the same cable. Since both drives are throwing errors, replacing the cable to the aforementioned multi-drive bay should eliminate the problem for both drives. These devices usually have an external power source, which could also be the culprit and need to be replaced, but the cable (being the weakest link) is by far the most likely root cause of the problem.
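The port-to-device mapping described in the steps above can be sketched in shell. This is an illustration under assumptions, not the commands from the original answer: it assumes a libata-based kernel whose sysfs paths for SATA disks contain an ataN component, and it applies the add-1 rule of thumb for lsscsi host numbers stated above, which may not hold on every system. Both function names are made up.

```shell
# Hypothetical helper: pull the "ataN" component out of a resolved sysfs path,
# e.g. the output of: readlink -f /sys/block/sdb
# Prints nothing if the path has no such component (USB disks, for instance).
ata_port_from_path() {
    printf '%s\n' "$1" | sed -n 's#.*/\(ata[0-9][0-9]*\)/.*#\1#p'
}

# Hypothetical helper: turn the leading "[host:channel:target:lun]" of an
# lsscsi line into an ATA port name by adding 1 to the host number.
ata_port_from_lsscsi() {
    host=$(printf '%s\n' "$1" | sed -n 's/^\[\([0-9][0-9]*\):.*/\1/p')
    if [ -n "$host" ]; then
        echo "ata$((host + 1))"
    fi
}
```

For example, `ata_port_from_path "$(readlink -f /sys/block/sdb)"` prints the port for sdb, and `lsscsi | while read -r line; do ata_port_from_lsscsi "$line"; done` prints one port per listed device. If the two methods disagree on your system, the sysfs path is probably the more trustworthy of the two.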
Sources:
Experience
https://linux.die.net/man/8/lsscsi
http://sg.danny.cz/scsi/lsscsi.html
https://serverfault.com/questions/244944/linux-ata-errors-translating-to-a-device-name/868943#868943
Had the same issue. In my case it was due to a 4-pin to SATA power adapter not being plugged in snugly.
This is almost always a bad drive. We use thousands of drives, and although these errors never cause the drive to fail, they have resulted in file system corruption. I think it really has to do with a problem with the controller board on the drive.
I have tried everything to solve this problem; the fix is to replace the drive, after which things work on the same cables and controllers.
Good luck
I know this thread is old, but I just ran into the same issue and came here from Google:
ata3.01: failed command: READ FPDMA QUEUED
on booting the Kubuntu 16.04 live CD.
Changing the SATA cable didn't do anything.
Replaced the PSU and the problem went away.
I know this thread is old, but I recently encountered the same problem on a newly bought machine with 6 SATA slots. I installed the CD-ROM drive and the hard drive on 2 SATA slots close to each other and then installed Ubuntu without errors. After the reboot, I saw the ata8: hard resetting link error. The machine halted at this point and never recovered. I tried rebooting a few times and it did not work. Then I moved the hard drive to one of the 4 remaining slots and it worked without any hassle.
This error is dangerous and it can damage your HD.
To solve it:
http://eliasoenal.com/2012/10/31/power-supply-failures-can-be-pretty-annoying-to-find/
I had the same problem. I had tried everything, but the only port where I did not get the errors was the JMicron port on my ASUS P5K.
But when I connected the drive to another power supply, it worked and the errors were gone. Then I put the drive back on its original power supply but with a new power connector, and that worked too.