Hey everyone! I need to make an API so I started searching for providers and found Fly, which seems great for me! (up until now, I’ve only used Firebase’s services, so I’m used to never caring about scaling)
The API should just be one Node.js endpoint that performs two network requests: the first to the googleapis Sheets API and the second to the OpenAI completions API.
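For context, the endpoint is roughly shaped like this (a simplified sketch of what I mean, assuming Express and the official googleapis and openai npm packages; the route, sheet range, and model name are just placeholders):

```typescript
// Sketch only: one endpoint, two outbound requests (Sheets, then OpenAI).
// Assumes OPENAI_API_KEY and Google application-default credentials are set.
import express from "express";
import { google } from "googleapis";
import OpenAI from "openai";

const app = express();
const openai = new OpenAI(); // picks up OPENAI_API_KEY from the environment

app.get("/answer", async (_req, res) => {
  try {
    // 1) Read some rows from a Google Sheet.
    const auth = new google.auth.GoogleAuth({
      scopes: ["https://www.googleapis.com/auth/spreadsheets.readonly"],
    });
    const sheets = google.sheets({ version: "v4", auth });
    const sheet = await sheets.spreadsheets.values.get({
      spreadsheetId: process.env.SPREADSHEET_ID, // placeholder
      range: "Sheet1!A1:B10",                    // placeholder
    });

    // 2) Send them to the OpenAI (chat) completions API.
    const completion = await openai.chat.completions.create({
      model: "gpt-4o-mini", // placeholder model name
      messages: [
        {
          role: "user",
          content: `Answer using this data: ${JSON.stringify(sheet.data.values)}`,
        },
      ],
    });

    res.json({ answer: completion.choices[0].message.content });
  } catch (err) {
    res.status(500).json({ error: String(err) });
  }
});

app.listen(Number(process.env.PORT) || 8080);
```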
I’ve already built part of the API with the default configuration, but I’m thinking I should change it to have one VM always active, so there are no cold starts, and scale horizontally from there.
The thing is, I’m not sure how well it can scale, nor how much it can cost. Should I scale the VMs vertically? How many requests can one machine handle, given what the API does?
I would say that since you’ll want some redundancy from the outset, start with horizontal scaling. While your app is still in development, you can run one machine in one region; as you get some initial users, move to two machines in one region, or a small number of machines across two regions, and so on.
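If it helps, scaling out on Fly is just a command or two with flyctl, along these lines (illustrative only; check `fly scale count --help` for the exact flags on your flyctl version, and the region code here is a placeholder):

```sh
# Two machines in the primary region for redundancy.
fly scale count 2

# Or spread a small number of machines across a second region.
fly scale count 2 --region ams
```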
In terms of individual machine spec, start with 256MB (the smallest machine) and bump up from there. I run a PHP/Laravel/Apache app at that size, and the only thing I’ve had to do is reduce the number of Apache workers to keep the machine from crashing.
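For reference, on recent flyctl versions you can pin that size in fly.toml with a `[[vm]]` section, something like this (a sketch; exact key names can vary between versions):

```toml
# Smallest preset: one shared CPU with 256MB of memory.
[[vm]]
  cpu_kind = "shared"
  cpus = 1
  memory = "256mb"
```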
Your app doesn’t sound demanding at all, though of course it depends on what req/sec throughput you expect. Both Google Sheets and OpenAI enforce rate limits, and you’ll run into those well before you need to grow beyond 256MB.
Yes, I generally would. Small machines are so cheap that having them always running is a good default. You certainly can build elastic setups that wake a machine on incoming traffic, but if you’re still in the development phase, I’d say it’s not worth the distraction for the sake of saving around 5 USD per month.
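Concretely, the "one machine always running" setup is just a few lines in fly.toml, something along these lines (a sketch; the accepted values for the auto_stop/auto_start keys differ a bit between flyctl versions):

```toml
[http_service]
  internal_port = 8080          # whatever port your Node app listens on
  auto_stop_machines = true     # stop extra machines when traffic drops
  auto_start_machines = true    # wake them again on incoming requests
  min_machines_running = 1      # keep one machine up, so no cold starts
```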