NGinx does not restart after log file rotate

NigelAves1 · September 17, 2023, 3:57am

@stefan1959 @joe I tried doing the test that Stefan asked for. And Joe, that was all!

Here is what I added to the nginx.conf file:

error_log /var/log/nginx/error.log debug;

worker_rlimit_core 500M;
working_directory /var/log/nginx/;
debug_points abort;

After running it never created the core file. (and I did try other directories, being a production version of NGinx there’s a chance it was not compiled for debug)

I created a text file with all the debug messages from the error.log, (2.5 Meg) but I cannot see a way of attaching the file, not that the contents looked particularly useful. I did a search for connection / error / socket … not mentioned once in the entire log file.

Thank you for your help trying to find out what’s wrong. This is painful for me as I’m in an area of the system I know nothing about.

Nigel.

Joe · September 17, 2023, 4:36am

We definitely don’t want 2.5MB of logs. I doubt you need debug logs for this problem.

When you restart nginx from the command line (systemctl restart nginx) does it work?

Stegan · September 17, 2023, 8:56am

The problem is not “why is it not starting?” but more about “what is preventing it running?” isn’t it.? or more precisely what is stopping it prior to a restart.

If it starts using Virtualmin GUI then it is going to reveal the same information as a
systemctl restart nginx the nginx log will only show the successful process not what killed it.
Nginx is a pretty robust webserver it doesn’t die due to a mundane process like log rotate. it needs something more critical to stop it (a reboot?) Shouldn’t we be focussing on what stopped it in the first place?

NigelAves1 · September 17, 2023, 2:41pm

Joe, just tried it. Yes it did start correctly. Nigel

ID10T · September 17, 2023, 3:36pm

I kinda hinted at that in my first post but we didn’t know for sure that logrotate was the culprit, though evidence is now pointing in that direction.

@NigelAves1 Can you post your /etc/logrotate.d/nginx config?

NigelAves1 · September 17, 2023, 3:41pm

@Joe @Stegan I don’t know if this will help or not, but I tried a small experiment. I created a new log file rotate and added each log file one by one.

It worked correct until I hit the 6th log file to rotate, then it died. FYI on my server there are 20 log files to rotate in the /var/log/VirtualMin directory.

Just in case here is what systemctl status nginx.service had to say

× nginx.service - A high performance web server and a reverse proxy server
Loaded: loaded (/lib/systemd/system/nginx.service; enabled; preset: enabled)
Active: failed (Result: start-limit-hit) since Sun 2023-09-17 09:44:33 MDT; 15s ago
Duration: 6ms
Docs: man:nginx(8)
Process: 1411183 ExecStartPre=/usr/sbin/nginx -t -q -g daemon on; master_process on; (code=exited, status=0/SUCCESS)
Process: 1411184 ExecStart=/usr/sbin/nginx -g daemon on; master_process on; (code=exited, status=0/SUCCESS)
Process: 1411213 ExecStop=/sbin/start-stop-daemon --quiet --stop --retry QUIT/5 --pidfile /run/nginx.pid (code=exited, status=0/SUCCESS)
Main PID: 1411185 (code=exited, status=0/SUCCESS)
CPU: 130ms

Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: Failed to start nginx.service - A high performance web server and a reverse proxy server.
Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: nginx.service: Start request repeated too quickly.
Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: nginx.service: Failed with result ‘start-limit-hit’.
Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: Failed to start nginx.service - A high performance web server and a reverse proxy server.
Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: nginx.service: Start request repeated too quickly.
Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: nginx.service: Failed with result ‘start-limit-hit’.
Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: Failed to start nginx.service - A high performance web server and a reverse proxy server.
Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: nginx.service: Start request repeated too quickly.
Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: nginx.service: Failed with result ‘start-limit-hit’.
Sep 17 09:44:33 apache-web-server.twin-peaks-video.com systemd[1]: Failed to start nginx.service - A high performance web server and a reverse proxy server.

ID10T · September 17, 2023, 4:01pm

Also, if logrotate is killing Nginx then you might want to file a bug with Debian.

bugs.debian.org/cgi-bin/pkgreport.cgi?pkg=logrotate;dist=unstable

Joe · September 17, 2023, 6:24pm

It’s too early to go filing bugs upstream. We don’t know what’s happening yet.

Stegan · September 17, 2023, 8:49pm

given that this happens in the wee dark hours is there a reboot being triggered by something?

To eliminate the log rotate process (I still can’t believe that trivial process is killing nginx) and presumably we have established that the event is reproducible. Can’t we just stop all log rotation and start them one by one at a more convenient time that can be monitored. Assuming that only the default log rotates are active why are we (nginx users) seeing this?

ID10T · September 17, 2023, 11:21pm

I’ve located these references. Seems logrotate reloads apache2, and presumably nginx, for some reason. Maybe to reconstruct an empty logfile? Maybe a restart would work? Kind of a kludge and just a guess.

/etc/logrotate.d/virtualmin.conf
systemctl reload apache2 ; sleep 5

/etc/logrotate.d/apache2
postrotate if pgrep -f ^/usr/sbin/apache2 > /dev/null; then invoke-rc.d apache2 reload 2>&1 | logger -t apache2.logrotate fi

NigelAves1 · September 18, 2023, 2:39am

@Stegan @Joe … I read Id1ot’s post with the sleep on the reload.

I first added : sleep 5 - ran the file rotation on just Virtualmin logs and it went through and Nginx was still running at the end. I tried sleep 1, NGinx dead at end / sleep 2, NGinx alive at end.

Playing safe I’ve set it to 4. - That said, if you need me to do any more tests for you not a problem.

Once again, thank you all for thinking about this issue, sorry I was not more help in debugging. Nigel.

Stegan · September 18, 2023, 8:16am

Wow! that is a surprise. I am also amazed that this has not shown itself on other systems. The fact it is restarting nginx is good but why is it stopping it in the first place? I now wonder if the timing issue may be more related to available cores/memory the log rotation taking longer to complete than normal

stefan1959 · September 18, 2023, 9:28am

Not sure why a reload is used. Maybe reload stops logging for 5 seconds.

Stegan · September 18, 2023, 9:40am

Are we saying that the logging process actually stops nginx (therefore has to try to restart it)
or is the logging process (including the log rotation) takes so long that it needs to suspend nginx and therefore restart it.Are the logs that big?

5 seconds is a long time for a busy site (just think of all the frustrated users and lost orders)

stefan1959 · September 18, 2023, 9:59am

I don’t think reload stops the server.

Stegan · September 18, 2023, 11:02am

but why do we have to restart it?

NigelAves1 · September 18, 2023, 2:27pm

related to available cores/memory

FYI - The server I built is overkill Ryzen 9 / 128Gigs of memory.

ID10T · September 18, 2023, 3:13pm

reload just reloads the configuration file.
Restart will stop and then start it. Two different options.

ID10T · September 18, 2023, 3:16pm

Just to be clear. You are saying there was no “watch” in the Debian 12 configuration? It was completely missing and you added it?

Joe · September 18, 2023, 3:56pm

reload is not restart. And, when log files change, something needs to signal the service to change.