I’ve set up Betterstack logging for my Fly.io apps, and my production DB is emitting the following every 15 seconds or so: FATAL: password authentication failed for user "postgres". This isn’t happening on my test database, which has no replicas, and the error only comes from the primary node. The average load metric is also higher, with a lot of transactions showing in PGAdmin. This is despite my production environment not having any traffic yet, as I’m just getting it ready for release, so load should be far lower than in my test environment.
I’m a bit lost as to what is causing these logs. Is it a replica? It happens even if I spin up an entirely new postgres app with a couple of replicas.
To follow up on this, I decided to start another Postgres instance from scratch with a single machine. With a single machine, no errors. Scaling up to three also produces no errors. So I can only deduce that this is an issue when you use flyctl to create a High Availability Postgres app with more than one machine; cloning and scaling manually works fine.
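For anyone wanting to reproduce the comparison, the two paths were roughly the following. The app names are placeholders and the flags assume current flyctl syntax, so double-check against `fly postgres create --help`:

```shell
# Path 1: HA cluster created up front -- the setup that produced the logs.
fly postgres create --name ha-db-test --initial-cluster-size 3

# Path 2: start from a single machine, then scale by cloning -- no errors.
fly postgres create --name single-db-test --initial-cluster-size 1
fly machine list -a single-db-test              # note the machine ID
fly machine clone <machine-id> -a single-db-test
fly machine clone <machine-id> -a single-db-test
```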
That is the default interval for its health checks, so that gives us at least one hypothesis. The full configuration (intervals, timeouts, paths) is in the checks subtree of…
fly config show -a db-app-name
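For reference, the checks subtree in that output typically looks something like this. This is illustrative only: the port, path, and 15-second interval match what’s discussed in this thread, but the exact fields and values may differ by flyctl version, so trust the output of the command above:

```json
"checks": {
  "pg": {
    "type": "http",
    "port": 5500,
    "path": "/flycheck/pg",
    "interval": "15s",
    "timeout": "10s"
  },
  "role": {
    "type": "http",
    "port": 5500,
    "path": "/flycheck/role",
    "interval": "15s",
    "timeout": "10s"
  }
}
```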
These are just GET requests to a tiny HTTP server that the machine runs, so you can trigger them yourself, ahead of schedule, via…
fly m list -a db-app-name # consult the IP ADDRESS column of the primary
fly proxy 5500 fdaa:<hex-digits-from-above>:2
And then in a separate terminal…
curl http://127.0.0.1:5500/flycheck/pg
…as many times as necessary to observe a frequency change in the logs (or not!).
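To send a burst of requests rather than repeating the curl by hand, a tiny loop helps. A sketch: `poll_check` is just an illustrative helper name, and the URL assumes the `fly proxy 5500 …` session from the previous step is still running in the other terminal:

```shell
# Hit the proxied health check endpoint `count` times, printing one HTTP
# status code per request, so you can correlate the extra requests with
# the frequency of the auth-failure logs. (If the proxy isn't running,
# curl prints 000 for each failed connection.)
poll_check() {
  local url="$1" count="$2"
  for _ in $(seq 1 "$count"); do
    curl -s -o /dev/null -w '%{http_code}\n' "$url"
  done
}

poll_check "http://127.0.0.1:5500/flycheck/pg" 5
```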
Thanks, @mayailurus, your reply has been helpful, although it hasn’t solved my issue. I also started to see other logs mentioning “pg” and “psql” roles, which I found odd. Closing PGAdmin seemed to remove those extra logs, but some were still occurring at 4am using the “postgres” role.
I’m going to continue monitoring this overnight and see what happens. With a bit of luck, I won’t have these logs being sent to my logging service anymore!