Recently we migrated from Cenos 7 to Rocky 8. All our instances are running on AWS instance type “r6i.4xlarge”.
From last 2-3 months we have been facing an issue where the Instance status checks is showing “Instance reachability check failed”. The servers are failing randomly and on restart it works fine for some random time.
We have investigated the System logs but did not find anything suspicious.
Here is the complete details of Linux for your reference.
This is an urgent issue as it is impacting all our production environments and causing disruptions to end customers. Would request you to please look into it on Priority.
Please let me know if any further info is required.
We do not see any log entry during disconnect time and when we try to take the screenshot we are getting the below. Nothing in the System logs as well.
@linuxlover We also checked with AWS Support and they mentioned the issue lies at OS level.
Moreover, had discussions with MarkLogic Support but mentioned that there is no issue at the MarkLogic level.
I really do love such comments like this. Makes me laugh. If the system is so important to you why aren’t you purchasing a subscription/support contract when things go wrong? And instead, use a free operating system and expect priority support for free when things go wrong lol
Don’t get me wrong, I understand you have a problem, but when something is free, be it the system or support, there’s no place for making demands. You chose to use a system without a support contract/subscription, there are no guarantees or SLA’s for support.
That said, I would at first though try reinstalling the kernel and or reinstalling grub as if it’s just booting to the grub command prompt that looks like it hasn’t read the grub configuration for some reason. But from the screen showing an AWS logo, looks like it’s booted some other image or recovery mode or something, rather than booting Rocky.