OpenAI Stream Slow in Production

brianrhea · January 12, 2024, 6:05pm

I have a chat integration with OpenAI on my app that is super snappy in my local environment. But in production, the streamed responses are very, very slow.

I don’t have many users on the app, so I can’t imagine that this is a resource allocation issue. I’m experiencing the lag even when I can view the logs and see that I’m the only active user.

What are some steps I could take to diagnose where the bottleneck is and why the app performs so much worse on Fly than on my MacBook Air?

Thanks!

ben-ang · January 15, 2024, 8:07pm

You may have seen this already, but here’s a link to some docs on how to get performance metrics for your apps that might help you get going with this: Metrics on Fly.io · Fly Docs
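Alongside platform metrics, one way to narrow down where the delay comes from is to time the stream itself: how long until the first chunk arrives, and how far apart the later chunks are. The sketch below is only an illustration, not something from the Fly docs or the OpenAI client. The names `measure_stream` and `looks_buffered` and the `ratio` threshold are hypothetical, and it assumes your streamed response can be consumed as a plain Python iterable of chunks.

```python
import time
from typing import Callable, Iterable, List, Optional, Tuple


def measure_stream(
    chunks: Iterable[object],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], List[float]]:
    """Consume a stream and return (time to first chunk, gaps between chunks).

    All values are in seconds. Returns (None, []) for an empty stream.
    """
    start = clock()
    arrivals: List[float] = []
    for _ in chunks:
        # Record the moment each chunk becomes available to the consumer.
        arrivals.append(clock())
    if not arrivals:
        return None, []
    first = arrivals[0] - start
    gaps = [later - earlier for earlier, later in zip(arrivals, arrivals[1:])]
    return first, gaps


def looks_buffered(
    first: Optional[float], gaps: List[float], ratio: float = 10.0
) -> bool:
    """Rough heuristic (an assumption, tune as needed): a long wait for the
    first chunk followed by near-instant chunks suggests something is holding
    the whole response and releasing it at once instead of streaming it."""
    if first is None or not gaps:
        return False
    return first > ratio * max(gaps)
```

Running the same measurement in two places, once around the upstream OpenAI stream inside the app and once from the client side, in both local and production environments, may show whether the chunks are slow to arrive from upstream or are being held up somewhere between the app and the browser.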

