Webmin reboot fails and needs a hardware reset

SYSTEM INFORMATION
OS type and version: Ubuntu 22.04.4 LTS (Jammy Jellyfish)
Webmin version: 2.111
Virtualmin version: 7.20.2 Pro

Hi Everyone,

I’ve been using Webmin and Virtualmin for a few years, but I still consider myself a “Newbie”.
A month ago I was forced to migrate from So You Start to a new bare metal server hosted by OVH.
The OS installation (OVH Image) went smoothly and without any issues.
But since then, almost every time an automatic kernel update takes place (and a reboot is therefore required), rebooting from the Webmin interface fails: the server shuts down but does not come back up, and a hardware reset has to be performed, which requires a time-consuming intervention.

I don’t see anything wrong in syslog; the log simply goes blank once the shutdown has completed.
Nothing in kern.log either.
Extensive offline hardware tests have been performed without any error being detected.
CPU temperatures are always well within limits.

Has anybody experienced a similar issue?
Any suggestions?

See below some system information:

System:
Kernel: 5.15.0-117-generic x86_64 bits: 64 compiler: gcc v: 11.4.0 Console: pty pts/1
Distro: Ubuntu 22.04.4 LTS (Jammy Jellyfish)
Machine:
Type: Server Mobo: ASRockRack model: E3C252D4U-2T/OVH serial:
UEFI: American Megatrends LLC. v: 4.03.OV01 date: 05/28/2024
CPU:
Info: 6-core model: Intel Xeon E-2386G bits: 64 type: MT MCP arch: Rocket Lake rev: 1 cache:
L1: 480 KiB L2: 3 MiB L3: 12 MiB
Speed (MHz): avg: 799 high: 800 min/max: 800/5100 cores: 1: 800 2: 800 3: 800 4: 799 5: 800
6: 800 7: 800 8: 800 9: 800 10: 800 11: 800 12: 800 bogomips: 84096
Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Graphics:
Device-1: ASPEED Graphics Family vendor: ASRock driver: N/A bus-ID: 05:00.0
Display: server: No display server data found. Headless machine? tty: 141x48
Message: GL data unavailable in console. Try -G --display
Audio:
Message: No device data found.
Network:
Device-1: Intel Ethernet X710 for 10GBASE-T vendor: ASRock driver: i40e v: kernel port: N/A
bus-ID: 03:00.0
IF: eno1 state: up speed: 1000 Mbps duplex: full mac:
Device-2: Intel Ethernet X710 for 10GBASE-T vendor: ASRock driver: i40e v: kernel port: N/A
bus-ID: 03:00.1
IF: enp3s0f1 state: down mac:
Device-3: American Megatrends Virtual Ethernet type: USB driver: cdc_ether bus-ID: 1-8.3:3
IF: enxb62de1c40313 state: down mac:
RAID:
Device-1: md2 type: mdraid level: mirror status: active size: 1022 MiB
Info: report: 2/2 UU blocks: 1046528 chunk-size: N/A
Components: Online: 0: nvme0n1p2 1: nvme1n1p2
Device-2: md3 type: mdraid level: mirror status: active size: 474.81 GiB
Info: report: 2/2 UU blocks: 497875968 chunk-size: N/A
Components: Online: 0: nvme0n1p3 1: nvme1n1p3
Drives:
Local Storage: total: raw: 953.88 GiB usable: 477.81 GiB used: 29.16 GiB (6.1%)
ID-1: /dev/nvme0n1 vendor: Samsung model: MZVL2512HCJQ-00B07 size: 476.94 GiB temp: 33.9 C
ID-2: /dev/nvme1n1 vendor: Samsung model: MZVL2512HCJQ-00B07 size: 476.94 GiB temp: 32.9 C
Partition:
ID-1: / size: 466.29 GiB used: 28.91 GiB (6.2%) fs: ext4 dev: /dev/md3
ID-2: /boot size: 987.4 MiB used: 256.9 MiB (26.0%) fs: ext4 dev: /dev/md2
ID-3: /boot/efi size: 510.7 MiB used: 5 MiB (1.0%) fs: vfat dev: /dev/nvme1n1p1
Swap:
ID-1: swap-1 type: partition size: 512 MiB used: 2.5 MiB (0.5%) dev: /dev/nvme1n1p4
ID-2: swap-2 type: partition size: 512 MiB used: 0 KiB (0.0%) dev: /dev/nvme0n1p4
Sensors:
System Temperatures: cpu: 27.8 C mobo: N/A
Fan Speeds (RPM): N/A
Info:
Processes: 289 Uptime: 12d 21m Memory: 31.23 GiB used: 4.77 GiB (15.3%) Init: systemd
runlevel: 5 Compilers: gcc: 11.4.0 Packages: 1175 Shell: Bash v: 5.1.16 inxi: 3.3.13

How does a reboot work from the CLI?

Hi Stefan,

I just did one.
In fact it was needed to update from kernel 5.15.0-117-generic to 5.15.0-118-generic.
It went OK, no hardware reset needed :thinking:
Any thoughts?

Thanks in advance, all the best,

Eric

Did the kernel update fix your issue?

Change hosting provider, especially considering the bad reputation they have.

If I’m allowed (feel free to delete my post if it’s forbidden), I would recommend Kernelhost (my provider). In my opinion they have the most professional support I have ever seen, and I have never had technical problems on their side.

Hi Everyone,

I just tried again using the Webmin reboot and the server hung again.
This time it looks like I will have to move to “rescue mode” to find out what went wrong :unamused:

I will post the outcome when I (hopefully) know more…

In the end the “rescue mode” reboot was not needed, but a physical hardware reset had to be performed.
Since then I tried a reboot from the Webmin interface and another from the CLI and they both went fine :roll_eyes: :exploding_head:
As part of the intervention, I’ve been told the BIOS was also updated, but I’m not sure if this is really the solution :face_with_raised_eyebrow:
Anyway, I guess this is not Webmin/Virtualmin related.

That is what we like to hear (A Solution)

Well… this is a conclusion, I hope it was a solution :worried:

A Webmin reboot is just sending a shutdown -r now, which is the same thing that happens when you do a reboot in the CLI (whether you call it with the shorthand reboot or the shutdown command, they’re all just links to systemctl now).

There is no reason it would behave differently when done from Webmin. I think you must be seeing some other problem that only happens intermittently, and it just happened to occur when you rebooted from Webmin.
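
To confirm this on a given machine, here is a minimal Python sketch that shows where the reboot-related commands actually point. The helper name `resolve_command` and the list of paths are illustrative assumptions, not anything Webmin ships. On a systemd-based Ubuntu install they would be expected to resolve to systemctl, but the paths may differ on other systems.

```python
import os

# Hypothetical list of commands to inspect; these paths are an assumption
# and may differ between distributions.
COMMANDS = ["/sbin/reboot", "/sbin/shutdown", "/sbin/poweroff"]


def resolve_command(path):
    """Return the basename of the file that `path` ultimately points to,
    or None if the path does not exist at all."""
    if not os.path.lexists(path):
        return None
    # realpath follows the whole chain of symlinks to the final target.
    return os.path.basename(os.path.realpath(path))


if __name__ == "__main__":
    for cmd in COMMANDS:
        print(f"{cmd} -> {resolve_command(cmd)}")
```

If every entry resolves to the same binary, the Webmin reboot and the CLI reboot go through the same code path, which points towards firmware or hardware rather than Webmin.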

Hi Joe,

I agree with you, but I was wondering if there was anything specific to Webmin that could cause this behaviour?

It seems to occur when a new kernel has been automatically installed and I get the notification that a reboot is required for this new kernel to be applied…
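
For reference, on Ubuntu this kind of notification is commonly backed by a marker file, /var/run/reboot-required, with the requesting packages listed in /var/run/reboot-required.pkgs. Below is a minimal Python sketch to check that state before rebooting. The function name `reboot_required` is illustrative, and whether Webmin reads exactly these files is an assumption.

```python
import os


def reboot_required(run_dir="/var/run"):
    """Return (required, packages): whether the reboot marker file exists
    in run_dir and which packages asked for the reboot (empty if unknown).

    The default directory is an assumption about a Debian/Ubuntu system.
    """
    marker = os.path.join(run_dir, "reboot-required")
    pkgs_file = os.path.join(run_dir, "reboot-required.pkgs")
    if not os.path.exists(marker):
        return False, []
    packages = []
    if os.path.exists(pkgs_file):
        with open(pkgs_file) as fh:
            # De-duplicate and sort so the result is stable.
            packages = sorted({line.strip() for line in fh if line.strip()})
    return True, packages


if __name__ == "__main__":
    required, packages = reboot_required()
    print("reboot required:", required)
    for name in packages:
        print("  requested by:", name)
```

Noting which packages requested the reboot each time the server hangs would help show whether the failures really line up with kernel updates.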

I also have a ticket open with OVH, but their response time is rather slow.

Thanks for the feedback, regards,

Eric

@Tactikast Thank you for the hint about Kernelhost, they seem to have very positive feedback, I will keep it in mind…

It may also be that the pre-installed OS image has some problems.

As Joe said, it should not be related to Virtualmin, but rather to the configuration of the installed Ubuntu OS.

In the worst-case scenario (if it’s not too much work to reinstall everything), and if the problem comes back, maybe you can install your own OS (your own image downloaded directly from the official source; here, Ubuntu).

Hosting providers have their own custom images, and sometimes these lead to unexpected behaviour. Still, if you install your own, I cannot guarantee the server will work, because it may depend on how their infrastructure is set up (maybe I’m mistaken; I’m really not sure about this last part). But if it’s a dedicated server, it should be fine.

Hi Tactikast,

I’m not envisaging migrating again or reinstalling everything, I would rather prefer to identify what is wrong.
I will try to find out more about the customization performed by the hosting provider.

All the best,

Eric

I cannot edit my post, so here is a correction to what I said.

The last part, “it may depend on how their infrastructure is set up”, is wrong.

If the hosting provider offers custom image installation, it should work. And if they do not support it, they will simply not allow it… My bad.

I would rather prefer to identify what is wrong

So your previous solution didn’t work?

Well, I will have to wait until the next kernel update to find out :frowning:

I will keep you posted…

Regards,

Eric
