Hi,
I’m running an NFS server using nfs-ganesha. I used to run this on CentOS 7, and recently moved to run it on Rocky (8.8) and I’m experiencing a significant performance drop- Number of operations per seconds drop by about 10-15%.
This workload has a very high CPU and network usage. These are mpstat results while running the workload:
Centos:
04:02:01 PM CPU %usr %nice %sys %iowait %irq %soft %steal %guest %gnice %idleAverage:
all 54.08 0.06 26.98 0.04 0.00 13.65 0.00 0.00 0.00 5.19
Rocky:
03:42:10 PM CPU %usr %nice %sys %iowait %irq %soft %steal %guest %gnice %idle
Average: all 47.82 0.09 29.43 0.05 2.17 12.77 0.00 0.00 0.00 7.67
We can see that for Rocky, we spend about 7% less on user space. Sys and idle are slightly higher and that irq usage went from exactly 0 to ~2%.
To me it looks like something changed with IRQ handling that probably also increases the kernel CPU usage. Anyone has any idea what this could be?
Note that I’m running this workload on cloud VMs. The VMs themselves are completely identical in both cases.