Can celery and Django sit on the same machine?

Radek · October 9, 2023, 8:13am

Hi
I have a working web application on Django. It is using:

* an app for the DB
* an app that runs 3 machines:
** Django
** Celery
** beat for Celery

The application was created / coded by someone else for me. And I always want to understand how IT works behind…

My question is whether I could use only 2 machines, or even 1, instead of 3. And why.

Thank you
Radek

Radek · October 17, 2023, 8:34pm

Would anyone know … ?

roadmr · October 17, 2023, 11:07pm

Hi!

You can definitely run all services on a single machine. You’d have to “hack” things a bit by creating a small shell script that starts your services in daemon mode or in the background, like so:

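#!/bin/bash
# Start each service detached so the script can move on to the next one.
gunicorn the_django_app -D  [other parameters]
# Celery 5+ wants global options such as -A (app module, assumed here) before the subcommand.
celery -A the_django_app worker -D [other parameters]
celery -A the_django_app beat --detach [other parameters]
# Keep the script, and so the container, alive after everything has started.
sleep infinity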

And then use this script in your Dockerfile’s CMD instead of calling e.g. gunicorn directly.
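One weakness of the detach-and-sleep approach is that the script keeps sleeping even if one of the services dies. A rough sketch of an alternative, not a tested recipe: keep the services in the foreground as child processes and exit as soon as any of them exits, so that whatever restarts the container can notice. The helper name run_all is made up for illustration, and this assumes bash 4.3 or newer for wait -n.

```shell
#!/bin/bash
# Hypothetical helper: run every command given as an argument in the
# background, then stop all of them as soon as any one exits.
run_all() {
    local pids=()
    local cmd
    for cmd in "$@"; do
        bash -c "$cmd" &
        pids+=("$!")
    done
    # wait -n returns when the first background job finishes (bash 4.3+)
    # and reports that job's exit status.
    wait -n
    local status=$?
    # Stop whatever is still running, then collect the remaining children.
    kill "${pids[@]}" 2>/dev/null
    wait
    return "$status"
}
```

With this variant the services would be started without -D / --detach, since a daemonized process returns immediately and would look like an exit to wait -n. The script would end with something like run_all "gunicorn ..." "celery ... worker" "celery ... beat", with the real parameters filled in.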

That said, it’s definitely recommended and best practice to run them on separate machines. There are several reasons:

* Dedicated resources. If your workers are overworked (heh) they will not interfere with your web service, and vice versa.
* Asymmetrical vertical scaling (wow that sounded fancy but it’s really not). If the web server needs a lot of memory but the workers do not, you can scale each one independently; with the single-machine approach you have less flexibility in this respect. The Celery Beat machine can typically get by with few resources since it’s just queueing up jobs for the workers at intervals.
* Horizontal scaling per workload. If your service mainly processes background tasks but doesn’t see a lot of web requests, you can scale the worker process group independently as much as you’d like while keeping only a few web servers. This works the other way too: if you’re mostly serving web requests and have the occasional long-running process, you can scale to many web servers but keep only a few worker nodes.
* Single Celery beat instance. This is actually important because if you have several beat instances you might end up with duplicated tasks from them. A single beat instance schedules the jobs, and multiple workers take them from the queue for processing.

I hope this rant is useful

Daniel

Radek · October 18, 2023, 4:21am

hi Daniel

great, great. Thank you so much for your explanation. I know now how to do it and why not to do it too.

Any chance you can help me with these two questions of mine? Especially the first one is quite important to me.

Thank you in advance
Radek
