Built-in security for GPU-based FlyApps to avoid unauthorized requests waking them up?

empz · August 19, 2024, 12:50pm

I have a gpu-enabled Fly app that I use to run some Python-based inferencing. I’m running a custom Docker image with a FastAPI/uvicorn server. After 60 seconds of idle, the server will shutdown and so the fly machine. As soon as a request is received, the machine will start and process it. This all works great, but anybody with the URL can make requests and wake it up (even though I have authorization logic in place).
Is there some kind of built-in authentication to avoid this or do I have to put a cheap API gateway that forwards requests to the GPU-enable fly app only if the request is authorized?

khuezy · August 19, 2024, 1:04pm

I believe you have to have some kind of proxy since the GPU is tied to a machine.
I would make your gpu app private, then proxy the request via flycast after it’s auth/authorized.

system · August 26, 2024, 1:05pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Get help with Fly GPUs! gpu	26	1500	February 15, 2025
Machine did not have a restart policy, defaulting to restart Questions / Help docs	16	2753	November 18, 2022
PUT Request on `/logger` waking up suspended machines every few minutes Questions / Help machines , autoscaling	9	62	March 23, 2025
Improved tired-proxy for use with Fly Machines	3	1244	August 22, 2023
How to keep a fly machine awake with a long running task Questions / Help autoscaling	3	269	July 1, 2024

Built-in security for GPU-based FlyApps to avoid unauthorized requests waking them up?

Related topics