Cpu disabled by guest operating system, Unwanted reboot issue on rocky linux 9.2

m.balkrkishna · March 6, 2024, 2:51pm

I have a VM on VmWare ESXI , with below version.
Rocky linux VM rocky-release-9.2-1.4.el9 on kernel-5.14.0-284.11.1.el9_2.x86_64

Have multiple VMs deployed, which does not have issues.
one of the VM goes for a quick/fast reboot with systemd getting killed and all services restarting.

on esxi side below event can be seen. during these unwanted reboot
The CPU has been disabled by the guest operating system. Power off or reset the virtual machine.

any suggestion on the compatible kernel which can be used with rocky-release-9.2-1.4.el9 ?
any other solution or workaround for the intermittent issue ?

nazunalika · March 6, 2024, 3:14pm

Rocky Linux 9.2 is not supported. Rocky Linux Release Version Guide - Rocky Linux Wiki

Update your system(s) to the latest release.

With that said, you need to determine where the cause of this issue is. It may not he as cut and dry as you expect.

https://kb.vmware.com/s/article/2000542

m.balkrkishna · March 6, 2024, 3:40pm

thanks Louis for update,
I have gone through the documentation form VMware for this.
In our case I have checked the top results just before the reboot, there is no CPU spike I could see.
there are some CPU freeze issues reported on the kernel-5.14.0-284.11.1.el9_2.x86_64, which also looks similar to my case.

Can we use the kernel version 5.14.0-362.8.1 with Rocky 9.2 ? or it can not used ?

nazunalika · March 6, 2024, 4:16pm

Kernels of older releases are also not supported. As I said, update your systems to the latest available versions. Doing so will isolate if some or all of your issues have been resolved in the latest available packages.

LinuxGuy1997 · March 6, 2024, 4:30pm

You did not specify the version of ESXi being used, but it sounds a lot like this vmxnet3 bug.

You could try the included workaround and see if it helps…

Good luck!

m.balkrkishna · March 7, 2024, 6:18pm

@LinuxGuy1997 thanks , I will check on this and see if can apply the patch on the esxi.
below is the esxi details :
Client version:1.34.4
Client build number:
16668064
ESXi version:
7.0.1
ESXi build number:
16850804

m.balkrkishna · March 11, 2024, 2:29pm

Update

There was one suspect, a cron job to clean the page cache which was running below command.
"sync ; echo 3 > /proc/sys/vm/drop_caches "
Coincidently it was getting conflicting with the other cron job which eventually will be running at same time.
For fixing the time of this drop cache cron has be changed to a non conflicting time.

Even after this change, we are seeing an unwanted reboot happening.
below logs can be seen in /var/log/syslog during the reboot, which indicates some SIGTERM, any way we can get to know the root case of it ?

Mar 09 12:00:53  systemd-journald[329]: Received SIGTERM from PID 1 (systemd).
Mar 09 12:00:59  systemd-journald[704]: Received client request to flush runtime journal.
Mar 09 12:01:09  rsyslogd[942]: [origin software="rsyslogd" swVersion="8.2102.0-113.el9_2" x-pid="942" x-info="https://www.rsyslog.com"] start
Mar 09 12:01:09  rsyslogd[942]: imjournal: journal files changed, reloading...  [v8.2102.0-113.el9_2 try https://www.rsyslog.com/e/0 ]

jlehtone · March 12, 2024, 8:52am

Just to check: have you applied dnf up or do you still have old content?

There is no “Rocky 9.2” and “Rocky 9.3”. There is only “Rocky 9” that is currently based on RHEL 9.3 content. Rocky 9 will shift to RHEL 9.4 content soon™ after RHEL 9.4 is released.

m.balkrkishna · March 13, 2024, 12:32pm

Still having the old content. As for now, we will have to stick to below once.
Rocky linux VM rocky-release-9.2-1.4.el9 on kernel-5.14.0-284.11.1.el9_2.x86_64

On recent occurrence of issue, I found that there is one more cron which runs every 2 hrs and run below command.

sync ; echo 3 > /proc/sys/vm/drop_caches

this looks to be one of the suspect to me. This cron has been removed now.
As the issue is not consistent (1 out of 10 VMs), not sure if this is actual fix.
Any thoughts on this ?

system · May 12, 2024, 12:33pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Blocking BUG with the new RockyLinux 9.2 kernel Rocky Linux Help & Support	23	8315	December 27, 2023
Kernel panik in install rocky 9 Rocky Linux Help & Support rocky-linux-9	28	4988	November 13, 2023
Kernel panic ryzen 9 7950x on reboot or shutdown Rocky Linux Help & Support	2	679	August 25, 2023
System becomes unresponsive after upgrading to Rocky linux 9.4 Rocky Linux Help & Support rocky-linux-9	3	806	July 22, 2024
Rocky Linux 9 - VirtualBox - Kernel Panic Rocky Linux Help & Support	21	13135	August 2, 2023

Cpu disabled by guest operating system, Unwanted reboot issue on rocky linux 9.2

Related topics