Was wondering if anyone has had success autoscaling Sidekiq worker instances based on queue depth? If so, I’d love to hear about your experiences.
Looking at the Convox docs, I’d assume that I’d need to use something like this gem to publish my queue metrics to CloudWatch and then set up the autoscaling targets in my convox.yml?
To answer my own question: yes, that’s how you do it!
Adding the gem to my app got all the metrics feeding directly into CloudWatch, and right now I’m doing simple scaling based on queue depth. It appears to be working well:
scale:
  count: 1-10
  targets:
    custom:
      Sidekiq/EnqueuedJobs:
        aggregate: max
        value: 500
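For anyone curious what the gem is doing under the hood, here’s a minimal sketch of publishing the enqueued-job count to CloudWatch. This isn’t the gem’s actual code; it just builds the `put_metric_data` payload (the `Sidekiq` namespace and `EnqueuedJobs` metric name match the target above). The actual publish call needs the aws-sdk-cloudwatch gem, AWS credentials, and the real queue depth from `Sidekiq::Stats.new.enqueued`.

```ruby
# Build the CloudWatch put_metric_data payload for the total Sidekiq
# queue depth. Pure Ruby; the SDK call itself is shown in a comment.
def sidekiq_metric_payload(enqueued, timestamp: Time.now)
  {
    namespace: "Sidekiq",
    metric_data: [
      {
        metric_name: "EnqueuedJobs", # matches Sidekiq/EnqueuedJobs in convox.yml
        value: enqueued,
        unit: "Count",
        timestamp: timestamp,
      },
    ],
  }
end

# In a real publisher you'd run something like this on a timer,
# with `enqueued` coming from Sidekiq::Stats.new.enqueued:
#   Aws::CloudWatch::Client.new.put_metric_data(**sidekiq_metric_payload(enqueued))
payload = sidekiq_metric_payload(500)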
Hi Chris, thanks for sharing this, that’s awesome! I was just scaling my Sidekiq workers based on CPU usage, but this is a much better approach, so I’ll set this up as well. Thanks!
It was able to start all 10 workers (the maximum I set) and reach a steady state; then after 20 minutes it started to stop them one by one, roughly one every 2 minutes. So I think it’s working pretty well! I’m tempted to change some of the intervals, but I’ll just keep it like this for now since it seems like a reasonable default.
Just a heads up, this appears to have stopped working with a recent rack update.
Now when I go into ECS and look at the autoscaling policy, it has changed to be based on CPU utilization, and it also appears to be ignoring the cooldown params that were set.
As a workaround, you’ll need to go into CloudWatch and create two alarms on your EnqueuedJobs metric, one for scaling up and the other for scaling down, then manually (or via Terraform) add two autoscaling policies based on them to scale your worker count up and down.
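If you’d rather script the workaround than click through the console, the two alarms could be defined along these lines. This is a sketch, not a tested setup: the alarm names, period/evaluation settings, and the policy ARN placeholders are all my own assumptions; each hash is shaped for the AWS SDK’s `put_metric_alarm` call, with `alarm_actions` pointing at the corresponding scaling policy.

```ruby
# Build a CloudWatch alarm definition on the Sidekiq/EnqueuedJobs metric.
# policy_arn is the ARN of the autoscaling policy the alarm should trigger.
def enqueued_jobs_alarm(name, comparison_operator, threshold, policy_arn)
  {
    alarm_name: name,
    namespace: "Sidekiq",
    metric_name: "EnqueuedJobs",
    statistic: "Maximum",      # mirrors aggregate: max from convox.yml
    period: 60,                # seconds per datapoint
    evaluation_periods: 2,     # breach must hold for 2 minutes
    threshold: threshold,
    comparison_operator: comparison_operator,
    alarm_actions: [policy_arn],
  }
end

# Hypothetical policy ARNs -- substitute your own.
scale_up = enqueued_jobs_alarm(
  "sidekiq-scale-up", "GreaterThanThreshold", 500, "arn:aws:...:policy/scale-up"
)
scale_down = enqueued_jobs_alarm(
  "sidekiq-scale-down", "LessThanThreshold", 500, "arn:aws:...:policy/scale-down"
)
# With the aws-sdk-cloudwatch gem loaded, you'd then call:
#   Aws::CloudWatch::Client.new.put_metric_alarm(**scale_up)
```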
Is there a way I could scale on a per-queue basis? I have a queue named watermark, so if the number of enqueued jobs in it goes over a threshold, let’s say 100, can I autoscale on that?
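One possible approach, sketched below: publish the depth of just that queue (`Sidekiq::Queue.new("watermark").size`) as its own custom metric, then target it from the scale block the same way as EnqueuedJobs. The metric name `watermark/EnqueuedJobs` is my own invention, not something the gem provides, and I haven’t verified whether Convox custom targets can filter by CloudWatch dimensions, which is why this sketch encodes the queue name into the metric name instead.

```ruby
# Build a put_metric_data payload for a single named Sidekiq queue.
# The depth would come from Sidekiq::Queue.new(queue_name).size.
def queue_depth_payload(queue_name, depth, timestamp: Time.now)
  {
    namespace: "Sidekiq",
    metric_data: [
      {
        # Hypothetical per-queue metric name, e.g. "watermark/EnqueuedJobs",
        # so it can be targeted directly from convox.yml.
        metric_name: "#{queue_name}/EnqueuedJobs",
        value: depth,
        unit: "Count",
        timestamp: timestamp,
      },
    ],
  }
end

payload = queue_depth_payload("watermark", 120)
# Publish with: Aws::CloudWatch::Client.new.put_metric_data(**payload)
# Then target it from convox.yml with something like:
#   targets:
#     custom:
#       Sidekiq/watermark/EnqueuedJobs:
#         aggregate: max
#         value: 100
```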