I deployed an app with this github project: Issues · ahmetoner/whisper-asr-webservice · GitHub
It’s just a python app that runs the whisper audio transcription service. My project is literally just a fly.toml file
# fly.toml app configuration file generated for whisper-asr on 2023-07-15T13:26:24-05:00
#
# See https://fly.io/docs/reference/configuration/ for information about how to use this file.
#
primary_region = "ord"
[build]
image = "onerahmet/openai-whisper-asr-webservice:latest"
[http_service]
internal_port = 9000
force_https = true
auto_stop_machines = true
auto_start_machines = true
min_machines_running = 0
[env]
ASR_MODEL = "base"
ASR_ENGINE = "openai_whisper"
I scaled it to 1 CPU and 2048 MB in memory. However, when I try to upload and transcribe a 7 minute file, it times out and responds with a 502 error code.
What could the issue be? Is it because i need a more beefier machine?
Update: fly scale vm performance-2x
also didnt work