I have a high-IO application that is working, and scaling well to date. Over the past few months we've been trying to look down the road and predict where our next bottlenecks will occur. One of them is surely the file system.
We are currently monitoring
- Space available
- Read operations per second
- Write operations per second
This seems a bit too sparse to me. What else should I be watching? I am not even sure what the 'yellow line' would be for the operations/second.
Some suggestions
/sys/block/<dev>/queue/read_ahead_kb
to ensure you're reading data you need, not data the OS thinks you need.noatime
mount option unless you're hosting a mail spooldata=writeback
if you've got good backups