I have heard it recommended to stay away from AWS hosting for certain "big data" applications (e.g. Hadoop, Cassandra, Solr) because EC2 instances typically use network attached storage (though there are more recently some high i/o instances, but they are apparently pretty expensive).
It makes sense to me that NAS would entail a pretty decent performance hit, but how much? Since AWS exists, then presumably there are plenty of examples of applications that make sense in this type of environment, but what is a good rule of thumb for determining if a particular application is a good candidate for AWS and NAS? (Besides sticking it on AWS and trying it out.)