I'm running an OpsWorks stack with:
- 10 c3.2xlarge instances running across 4 availability zones.
- The CPU never surpasses 5% on any instance.
- Each instance is connecting (through a memcached layer) to a db.r3.8xlarge (32 vCPU, 244 GiB RAM) RDS instance.
- The DB's CPU is staying nicely between 15-20 percent.
- Each EC2 instance is set to 250 connections with the RDS (total 2,500 connections).
- Each instance's memory is gradually increasing but I think this is a separate issue.
- The `load_1`, `load_5`, and `load_10` metrics are all really high (0.9-1.0).
- The active concurrent users connecting (based on a 3 minute window) are getting stuck at between 2,500 and 3,000 connections.
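
As a quick sanity check on the numbers above, here is a rough sketch of the arithmetic. The variable and function names are mine, and I'm only assuming the per-instance connection setting and the user plateau might be related; I haven't confirmed that.

```python
# Figures taken from the list above; names are illustrative only.
INSTANCES = 10
DB_CONNECTIONS_PER_INSTANCE = 250
OBSERVED_PLATEAU = (2500, 3000)  # concurrent users, 3 minute window


def total_db_connections(instances: int, per_instance: int) -> int:
    """Upper bound on simultaneous DB connections across all instances."""
    return instances * per_instance


ceiling = total_db_connections(INSTANCES, DB_CONNECTIONS_PER_INSTANCE)
low, high = OBSERVED_PLATEAU

print(f"DB connection ceiling: {ceiling}")
print(f"Observed plateau: {low}-{high}")
# True only if the plateau begins exactly at the configured ceiling.
print(f"Plateau starts at the ceiling: {low == ceiling}")
```

The lower end of the plateau lines up with the total configured connection limit, which may or may not be a coincidence.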
We tried increasing the RDS instance size, and while that did stop our RDS from maxing out, we're still trying to figure out why the concurrent connections seem to be pegged at that level.
Is there some setting in the ELB that limits connections? The latency for calls is really high. I'm just trying to figure out what might be throttling this, since all the hardware seems to be running okay.