Node.js App Out-of-Memory

Hello everyone,

I’m encountering frequent Out-of-Memory (OOM) issues with my Node.js application, specifically a Remix application with an Express server. Despite various attempts to diagnose and resolve the problem, I’m still unclear about its root cause, and I would greatly appreciate your insights and suggestions.

Background:

Recently, I began experiencing erratic errors in my app: hangs or crashes in unexpected places that aren’t typically performance-intensive. Subsequently, I received notifications from fly.io indicating that the app had encountered OOM conditions. On further investigation, I observed that the OOM occurs consistently on a specific page the first time it is visited after the machine starts, but not on subsequent visits until the next restart.

Details and Approaches Tried:

OOM Kill Logs:

[   16.369753] Out of memory: Killed process 342 (node) total-vm:21741816kB, anon-rss:80896kB, file-rss:0kB, shmem-rss:0kB, UID:0 pgtables:2316kB oom_score_adj:0
[   15.666088] Out of memory: Killed process 342 (node) total-vm:21746452kB, anon-rss:83100kB, file-rss:0kB, shmem-rss:0kB, UID:0 pgtables:1948kB oom_score_adj:0
[   18.665771] Out of memory: Killed process 342 (node) total-vm:21746900kB, anon-rss:83100kB, file-rss:0kB, shmem-rss:0kB, UID:0 pgtables:1976kB oom_score_adj:0

These didn’t happen in sequence; I just put them together for readability.

Grafana Metrics:

Local Checking Results:
I ran some checks locally; the OOM never happened with the same interactions.

  • Local checks with node --inspect and chrome://inspect: Snapshot 1 was taken after the app started, before any connection; I then recorded an allocation timeline while going through the interactions and took a few more snapshots afterwards, with Snapshot 6 taken about an hour later (a scripted way to capture comparable snapshots is sketched after this list).

  • With top, the MEM column is usually around 100 MB, at most around 160 MB.

  • With /usr/bin/time -l:

       35.76 real         2.08 user         0.28 sys
           170246144  maximum resident set size
                   0  average shared memory size
                   0  average unshared data size
                   0  average unshared stack size
               14004  page reclaims
                   0  page faults
                   0  swaps
                   0  block input operations
                   0  block output operations
                 497  messages sent
                 258  messages received
                   1  signals received
                  15  voluntary context switches
               16769  involuntary context switches
         11397507944  instructions retired
          5356333425  cycles elapsed
           126870592  peak memory footprint
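For reference, here is a scripted way to capture comparable heap snapshots without keeping Chrome DevTools attached. This is only a minimal sketch: the SIGUSR2 trigger is an illustrative choice, not something my app actually uses. The resulting .heapsnapshot files can be loaded into the DevTools Memory tab and diffed the same way as the numbered snapshots above.

    const v8 = require('node:v8');

    // Write a heap snapshot whenever the process receives SIGUSR2,
    // e.g. `kill -USR2 <pid>` between interactions with the app.
    // The files land in the working directory.
    process.on('SIGUSR2', () => {
      const file = v8.writeHeapSnapshot(); // returns the generated filename
      console.log(`heap snapshot written to ${file}`);
    });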

Seeking Advice:

I’m relatively new to this field and hope the above details provide sufficient insights into my situation.

  • Do you suspect a memory leak in my application, or does it simply need more memory?
  • What steps would you recommend for further diagnosing and pinpointing the root cause of these intermittent OOM errors?

Thank you in advance for your help and expertise!

If it happens on a specific page right after the machine starts, I would not suspect a memory leak as the cause. Either you need to experiment a bit to get Node’s garbage collection to kick in before the crash, or you need more memory.

You can find some suggestions here: Scaling · Fly Docs
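To make the first suggestion concrete, here is a minimal sketch; the 192 MB value and the 256 MB machine size are assumptions for illustration, not figures from this thread. Node doesn’t size its heap to a small Fly machine on its own, so it’s worth printing the limit V8 is actually running with and, if it’s larger than the machine’s RAM, capping it so garbage collection kicks in before the kernel’s OOM killer does.

    const v8 = require('node:v8');

    // heap_size_limit is the most V8 will let the JS heap grow to.
    // If it is far above the machine's RAM, the kernel OOM killer can
    // fire before V8 ever feels enough pressure to collect aggressively.
    const limitMB = v8.getHeapStatistics().heap_size_limit / 1024 / 1024;
    console.log(`V8 heap limit: ${limitMB.toFixed(0)} MB`);

    // Starting node with e.g. `--max-old-space-size=192` on a 256 MB
    // machine keeps the heap below the machine's RAM; the exact value
    // is something you would have to tune.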


Sounds like there’s a memory leak either in your app or in a 3rd-party library you’re using. Have you tried running your app locally in production mode (the same build as your deployed app)?

I had this happen to me a few months ago with the Turso DB client. In dev mode there were no visible leaks, because memory was constantly being cleaned up when reloading/refreshing pages. Once I ran it as a production build, I could see the leak locally. Once the Turso team was notified, they fixed it pretty quickly.
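One way to see it locally (a rough sketch; the 10-second interval is arbitrary) is to run the production build and log memory while clicking through the app. A leak shows up as RSS climbing steadily across interactions instead of levelling off.

    // Log memory every 10s while exercising the production build locally.
    setInterval(() => {
      const { rss, heapUsed, heapTotal, external } = process.memoryUsage();
      const mb = (n) => (n / 1024 / 1024).toFixed(1);
      console.log(
        `rss=${mb(rss)}MB heap=${mb(heapUsed)}/${mb(heapTotal)}MB external=${mb(external)}MB`
      );
    }, 10_000).unref(); // don't keep the process alive just for this timer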

I was using production mode throughout my checks.
I’m also using the Turso DB client; it sometimes errors, but I guess that could be expected with insufficient memory?

Hey, thanks!!
I’ll look into scaling.

I have a follow-up question:
How can one tell whether the memory isn’t enough?

The thing is, I don’t see any hint suggesting that I have insufficient memory, because from the metrics it looks to me like the app isn’t using all of its memory and there is still some headroom.
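For what it’s worth, here is a quick check I could run inside the Fly machine to compare the process against the machine’s RAM. It is just a sketch; os.totalmem() reports the whole VM’s memory, which the kernel and any other processes also use, so the real headroom is somewhat less than the difference printed here.

    const os = require('node:os');

    // Compare what the machine has against what this process is holding.
    const mb = (n) => (n / 1024 / 1024).toFixed(0);
    const { rss } = process.memoryUsage();
    console.log(`machine: ${mb(os.totalmem())} MB total, ${mb(os.freemem())} MB free`);
    console.log(`this node process rss: ${mb(rss)} MB`);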

If there’s a leak, it won’t matter how much memory you give it; it’ll eventually run out and crash. What are your Turso DB and libsql versions? The memory leak was fixed some time in Nov/Dec 2023.

"@libsql/client": "0.4.3"

I’m asking because I want to know how to read and diagnose the metrics correctly.
The OOM happened right after the machine started rather than after it had been running for a while, so it doesn’t feel like a memory leak.
Another thing is that I’m unclear about why I’m having memory issues at all, because in my eyes the metrics show that I have enough memory.

Oh yeah… that’s really old and has the memory leak, IIRC. Also update your Turso DB.

It’s a version from around January this year, so I think the memory leak you mentioned is fixed in it? And I checked the DB; it seems to update automatically.

Anyway, I will update my DB. Thanks for the suggestion!

Yeah, you’re right, the leak was fixed in the 0.4.0 RC, so it should be fixed in 0.4.3.
Something else is the culprit.

