Can not find that anywhere it seems.
What is the expected performance difference in a storage backend scenario that is heavily parallellized in access (like a SAN, Virtualization host storage etc.) between SAS and SATA, all other things being equal?
I Think it runs down to the impact of NCQ (32 command limit) to the MUCH higher oustanding command limit of SAS discs.
We are considering replacing some discs and have a chance to go for SAS or SATA - all the rest is in place - and I look for an evaluation from a performance point. Please ignore all the other issues (reliability etc.) - I purely wonder about the impact SAS will have on similar specced discs (RPM etc. being equal). The discs we have in mind can be ordered with both connectors and - there is an idea here to use SATA for ap ossibly repurpose later. THe price difference is not really high, but it made me wonder about the performance impact...
Yes, the extensive command set of the SCSI is a big bonus of using it over SATA. from SAS' Wiki:
The error recovery commands and block reclamation commands are pivotal in data integrity, S.M.A.R.T. is really for consumer grade equipment.
Also, SAS uses a higher signaling voltage, which enables longer cables compared to that of SATA. That's important when trying to cable up additional storage to an existing SAN.
You menitoned NCQ, but SCSI uses TCQ instead, which can be used in three different modes, however the bigger bonus imo with regard to parallelized setups is the ability to send up to 2^64 commands before filling the queue. Protocols like iSCSI and Fibre Channel limit this right now but the ability is there for future use.
I can only answer that portion, because I don't know if going with SAS for a couple of new disk will give you the same benefit of a purely SAS setup.
This is a late answer, but I would like to add my opinion.
From a pure speed standpoint, a nearline drive (as the ones OP considered) will performs practically the same both using the SATA interface or SAS interface. Despite the much lower-depth NCQ (31 entries rather than TCQ 64K), this limited hardware queue is sufficient, when augmented with the much deeper software-based IO queue, to extract almost the same IOPS which can be obtained using SAS-based TCQ.
Anyway, this does not means that SAS has no practical advantages:
However, when considering performance alone, the sad reality is that mechanical disk's random IOPS values are so low that the interface has almost no impact, excluding huge disk arrays where is can sometime limit your sequential IO transfer rate. Due to how they take rotational delay (which is hidden from the OS) into account, the killer performance-enhancing feature is NCQ/TCQ, and the SATA implementation is sufficiently good at that.
Some more significant differences appear when considering higher end SAS disks, which not only offer higher-RPM disks (10K and 15K), but have some interesting write-coalescing technologies (ie: HGST media cache tech) which, by the way, are slowly spilling into enterprise-SATA drives also.
1 https://ata.wiki.kernel.org/index.php/Libata_FAQ:
Old post, but I wanted to update it since I never see mention of the most important difference in terms of performance which is that SATA is only half duplex. Expect a huge performance difference under heavy loads with a mix of read/write. Definitely not suitable for SAN or virtualization IMO.
Of course as stated above there are other benefits with SAS. Nowadays NL-SAS disks can be had for not much more than SATA.