NIC card in recovery model after reboot

Hi i just got a new server and installed some new packages. After i reboot the system one of my NICs(Intel X710) didn’t work(it works before, i boot and checked it. it’s Rocky linux 8.10).

➜  ~ lspci | grep -i ethernet
81:00.0 Ethernet controller: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ (rev 01)
81:00.1 Ethernet controller: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ (rev 01)
e1:00.0 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 Gigabit Ethernet PCIe
e1:00.1 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 Gigabit Ethernet PCIe
➜  ~ lspci -k | grep -A 3 -i ethernet
81:00.0 Ethernet controller: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ (rev 01)
	Subsystem: Intel Corporation Ethernet Converged Network Adapter X710
	Kernel modules: i40e
81:00.1 Ethernet controller: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ (rev 01)
	Subsystem: Intel Corporation Ethernet Converged Network Adapter X710
	Kernel modules: i40e
82:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Function
	Subsystem: Dell Device 08ff
--
e1:00.0 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 Gigabit Ethernet PCIe
	DeviceName: Embedded NIC 1
	Subsystem: Dell PowerEdge Rx5xx LOM Board
	Kernel driver in use: tg3
--
e1:00.1 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 Gigabit Ethernet PCIe
	DeviceName: Embedded NIC 2
	Subsystem: Dell PowerEdge Rx5xx LOM Board
	Kernel driver in use: tg3
➜  ~ dmesg | grep -i i40e
[    3.292347] i40e: Intel(R) Ethernet Connection XL710 Network Driver
[    3.292352] i40e: Copyright (c) 2013 - 2019 Intel Corporation.
[    7.300053] i40e 0000:81:00.0: Firmware recovery mode detected. Limiting functionality.
[    7.300064] i40e 0000:81:00.0: Refer to the Intel(R) Ethernet Adapters and Devices User Guide for details on firmware recovery mode.
[    7.301821] i40e 0000:81:00.0: fw 7.83.59945 api 1.9 nvm 7.10 0x800075df 19.5.12 [8086:1572] [8086:0006]
[    7.402757] i40e 0000:81:00.0: capability discovery failed, err 0000000000000000 aq_err I40E_AQ_RC_EMODE
[   11.404562] i40e 0000:81:00.1: Firmware recovery mode detected. Limiting functionality.
[   11.404573] i40e 0000:81:00.1: Refer to the Intel(R) Ethernet Adapters and Devices User Guide for details on firmware recovery mode.
[   11.406285] i40e 0000:81:00.1: fw 7.83.59945 api 1.9 nvm 7.10 0x800075df 19.5.12 [8086:1572] [8086:0006]
[   11.506953] i40e 0000:81:00.1: capability discovery failed, err 0000000000000000 aq_err I40E_AQ_RC_EMODE

what should i do now?

You should consider seeing if you can update the firmware.

Downloads page

Potential Firmware Download

1 Like

hi thanks for your help!
i tried to update it, but i got this:

Intel(R) Ethernet NVM Update Tool
NVMUpdate version 1.42.24.2
Copyright(C) 2013 - 2024 Intel Corporation.


WARNING: To avoid damage to your device, do not stop the update or reboot or power off the system during this update.
Inventory in progress. Please wait [*.........]


Num Description                          Ver.(hex)  DevId S:B    Status
=== ================================== ============ ===== ====== ==============
01) Intel(R) Ethernet Converged           N/A(N/A)   1572 00:129 Access error
    Network Adapter X710


Tool execution completed with the following status: An error occurred accessing the device.
Press any key to exit.

Intel forum hints at the problems: nvmupdate64e access error for XL710 after failed NVM upgrade - Intel Community as card not being accessible.

Another post hints at failed update that then caused the access error: Intel XL710-QDA2 - nvmupdate failed now getting access error - Intel Community so maybe an update has been done previously that didn’t finish correctly and broke the card? Your dmesg showed it in firmware recovery mode which doesn’t sound good.

Both those posts may suggest on how you might re-animate the card. Or you may find you need to buy a new card altogether.

There is also this reddit post: Reddit - Dive into anything might need to have firmware updated in a specific order rather than attempting the latest. Or requires updating using the appropriate driver version. Although sounds to me there might not even be a driver to communicate with the card at all.

Or you can try the fwupdmgr tool included in most distros in case that might update it:

fwupdmgr update

if it doesn’t find any firmware to update it won’t do anything. But sounds like your card is broken, or reinstall with a different system that worked before and see if it’s due to changes in newer Linux versions.

1 Like

Hi, thanks a lot for your help. Unfortunately everything i tried still didn’t fix it. I decided to get a new card instead…

try to set these parameters for both cards and see how it will behave
(in this form it works until reboot):
ethtool -K eth0 lro off
ethtool -K eth0 tso off
ethtool -K eth1 lro off
ethtool -K eth1 tso off