How is it determined what is invoiced (scale to zero)

daansk44 · March 13, 2024, 6:35pm

I use fly.io with its scale-to-zero feature, which lets applications stay stopped when they are not in use. Recently I came across an article about hosting an AI model with scale to zero. Since GPU pricing is listed per hour, even short bursts of use could become costly. Hence, I’m curious how the invoiced amount is calculated.

greg · March 14, 2024, 1:24am

Hi,

It’s a good question.

For normal CPUs, it’s per second, and only when running. From the docs:

Started Machines are billed per second that they’re running (the time they spend in the started state), based on the price of a named CPU/RAM combination, plus the price of any additional RAM you specify.

But as you mention, their GPU pricing is listed per hour, not per second.

If you don’t get a reply here, maybe email billing@fly.io.

daansk44 · March 14, 2024, 11:35am

The reason I asked is a case like running Ollama, where you need the computing power for only a few minutes at a time, for example to rewrite code. You do this just a few times a day, so the machine does not need to be on all the time. I am currently testing this locally, but it takes quite a lot of computing power, which is why you would really want to run it on a GPU. However, if usage is billed per hour, the costs may outweigh the benefits.
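To make the concern concrete, here is a rough sketch comparing the two possible readings of an hourly GPU price: billed per second of runtime, or billed per started hour. The thread does not confirm which one applies to GPU Machines. The rate and session lengths below are placeholders, not real Fly.io prices, and both function names are made up for illustration.

```python
import math

# Hypothetical GPU rate in USD per hour; a placeholder, not a real Fly.io price.
HYPOTHETICAL_GPU_RATE = 2.50


def cost_per_second(runtime_seconds: float, hourly_rate: float) -> float:
    """Cost if only the seconds spent in the started state are billed."""
    return runtime_seconds * hourly_rate / 3600


def cost_per_started_hour(runtime_seconds: float, hourly_rate: float) -> float:
    """Cost if every started hour is billed in full (assumed worst case)."""
    return math.ceil(runtime_seconds / 3600) * hourly_rate


# Three separate 5-minute sessions in a day, each one a fresh start from zero.
sessions = [300, 300, 300]

per_second_total = sum(cost_per_second(s, HYPOTHETICAL_GPU_RATE) for s in sessions)
per_hour_total = sum(cost_per_started_hour(s, HYPOTHETICAL_GPU_RATE) for s in sessions)

print(f"billed per second:       ${per_second_total:.3f}")
print(f"billed per started hour: ${per_hour_total:.2f}")
```

With these placeholder numbers, the per-started-hour reading costs twelve times as much as the per-second reading for the same 15 minutes of use, which is why the answer matters for short, infrequent workloads.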

Article that I found:

system · March 21, 2024, 11:36am

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.
