I need to run scripts that make multiple ML/AI calls, most of them to third-party services. Is it possible to use Fly to choose the execution location so as to minimize the latency between the script and the ML/AI services, rather than between the user and the machine?
Yes, of course you can place your Fly app in a region close to the servers you're talking to.
Most of the time, though, it's better to place your app close to your users.
Fly Machines run in datacenters with very good internet connections, and these AI services live in well-connected datacenters too.
The AI use case means these requests take a long time by networking standards, on the order of seconds, so placing your app closer to your users will improve usability more than shaving 10-100 ms off API calls that already travel between well-connected servers.
What is the typical latency of your ML/AI request? Does it make sense to optimize regional placement to save 100 ms when the expected latency of an LLM API call is measured in seconds?
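If you want to check this for your own workload, here's a minimal sketch for timing a call from inside a Fly Machine. It assumes the `requests` package is installed and uses a hypothetical provider URL; `FLY_REGION` is an environment variable Fly sets on Machines, so you can compare numbers across regions.

```python
import os
import time

import requests  # assumes the requests package is installed

# Hypothetical endpoint; substitute the ML/AI API you actually call.
PROVIDER_URL = "https://api.example-llm-provider.com/v1/health"


def time_requests(url: str, n: int = 5) -> None:
    """Time a few requests so network latency can be compared to total latency."""
    region = os.environ.get("FLY_REGION", "unknown")  # set automatically on Fly Machines
    for i in range(n):
        start = time.perf_counter()
        resp = requests.get(url, timeout=30)
        elapsed_ms = (time.perf_counter() - start) * 1000
        print(f"[{region}] request {i + 1}: {resp.status_code} in {elapsed_ms:.0f} ms")


if __name__ == "__main__":
    time_requests(PROVIDER_URL)
```

If the calls themselves run for seconds, a 10-100 ms regional difference disappears into the noise; if they're short and frequent, co-locating with the provider starts to pay off.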
This may be of use: Dynamic Request Routing with fly-replay · Fly Docs
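As a rough sketch of the fly-replay idea (Flask and the `/inference` route are my own choices here, and the target region is an assumption), an app can ask Fly's proxy to replay just the ML-bound requests in a region near the provider while everything else is served close to the user:

```python
import os

from flask import Flask, Response

app = Flask(__name__)

# Assumed region that is close to the third-party ML/AI provider.
ML_REGION = "iad"


@app.route("/inference", methods=["POST"])
def inference():
    # If this Machine is not in the ML-adjacent region, return the
    # fly-replay header so Fly's proxy replays the request on a Machine there.
    if os.environ.get("FLY_REGION") != ML_REGION:
        return Response(status=204, headers={"fly-replay": f"region={ML_REGION}"})
    # Otherwise handle the request here, close to the provider.
    return {"status": f"handled in {ML_REGION}"}
```

See the fly-replay docs linked above for the exact header semantics and limits; this is just one way to split user-facing routes from provider-facing ones.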