I currently have 7 RL9 VMs running.
On one machine, and only one machine, dnf
will OOM the machine. And only when EPEL is enabled.
(For testing I have disabled swap 'cos otherwise the machine goes into swap-of-death and loadav went to over 50!)
eg
dnf makecache
...
[652504.185326] systemd invoked oom-killer: gfp_mask=0x140cca(GFP_HIGHUSER_MOVABLE|__GFP_COMP), order=0, oom_score_adj=0
...
[652504.185569] [ pid ] uid tgid total_vm rss pgtables_bytes swapents oom_score_adj name
...
[652504.185735] [3661998] 0 3661998 212112 178248 1593344 0 0 dnf
...
[652504.185824] Out of memory: Killed process 3661998 (dnf) total-vm:848448kB, a
non-rss:712980kB, file-rss:12kB, shmem-rss:0kB, UID:0 pgtables:1556kB oom_score_
adj:0
But if I disable epel
then it works fine
% dnf --disablerepo=epel makecache
Docker CE Stable - x86_64 25 kB/s | 3.5 kB 00:00
Extra Packages for Enterprise Linux 9 openh264 6.6 kB/s | 993 B 00:00
MariaDB 3.9 kB/s | 3.4 kB 00:00
Rocky Linux 9 - BaseOS 8.1 kB/s | 4.1 kB 00:00
Rocky Linux 9 - AppStream 15 kB/s | 4.5 kB 00:00
Rocky Linux 9 - Extras 26 kB/s | 2.9 kB 00:00
Metadata cache created.
Indeed a --disablerepo=* --enablerepo=epel
also explodes.
An rpm --verify epel-repease
shows the repo file is unchanged.
Even manually cleaning out /var/cache/dnf
didn’t help.
On another machine on the same physical host
/bin/time -v dnf makecache
Extra Packages for Enterprise Linux 9 - x86_64 24 kB/s | 33 kB 00:01
Extra Packages for Enterprise Linux 9 openh264 8.8 kB/s | 993 B 00:00
Rocky Linux 9 - BaseOS 29 kB/s | 4.1 kB 00:00
Rocky Linux 9 - AppStream 14 kB/s | 4.5 kB 00:00
Rocky Linux 9 - Extras 19 kB/s | 2.9 kB 00:00
Metadata cache created.
...
Maximum resident set size (kbytes): 104388
So I have no idea why this one machine is blowing up!