KDUMP_COMMANDLINE: remove irqpoll parameter on aws aarch64 platform

Currently, kdump may experience failure on some aws aarch64 platform.
The final scenario is:

    [   79.145089] printk: console [ttyS0] disabled
Then the system has no response any more. And after reboot, there is no
vmcore generated under /var/crash/. More detail [1].

In a short word, it is caused by the irqpoll policy and some unknown
acpi issue. The serial device is hot-removed as a pci device.

More detailed, the irqpoll policy demands to iterate over all interrupt
handler, if the interrupt line is shared, then the handler is
dispatched. And acpi handler acpi_irq() is on a shared interrupt line,
so it is called.  But for some unknown reason, the acpi hardware regs
hold wrong state, and the acpi driver decides that a hot-removed event
happens on a pci slot, which finally removes the pci serial device.

To tackle this issue by removing the irqpoll parameter on aws aarch64
platform, until the real root cause in acpi is found and resolved.

[1]: https://bugzilla.redhat.com/show_bug.cgi?id=2080468#c0

Signed-off-by: Pingfan Liu <piliu@redhat.com>

Acked-by: Coiby Xu <coxu@redhat.com>
This commit is contained in:
Pingfan Liu 2022-07-21 19:00:19 +08:00 committed by Coiby Xu
parent c735539b35
commit d593bfa6fc

View File

@ -641,6 +641,18 @@ function remove_kdump_kernel_key()
keyctl unlink "$KDUMP_KEY_ID" %:.ima keyctl unlink "$KDUMP_KEY_ID" %:.ima
} }
function is_aws_aarch64()
{
local _bios_model
_bios_model=$(lscpu | grep "BIOS Model name")
if [[ "${_bios_model}" =~ "AWS Graviton" ]]; then
return 0
fi
return 1
}
# Load the kdump kernel specified in /etc/sysconfig/kdump # Load the kdump kernel specified in /etc/sysconfig/kdump
# If none is specified, try to load a kdump kernel with the same version # If none is specified, try to load a kdump kernel with the same version
# as the currently running kernel. # as the currently running kernel.
@ -650,6 +662,10 @@ load_kdump()
KEXEC_ARGS=$(prepare_kexec_args "${KEXEC_ARGS}") KEXEC_ARGS=$(prepare_kexec_args "${KEXEC_ARGS}")
KDUMP_COMMANDLINE=$(prepare_cmdline "${KDUMP_COMMANDLINE}" "${KDUMP_COMMANDLINE_REMOVE}" "${KDUMP_COMMANDLINE_APPEND}") KDUMP_COMMANDLINE=$(prepare_cmdline "${KDUMP_COMMANDLINE}" "${KDUMP_COMMANDLINE_REMOVE}" "${KDUMP_COMMANDLINE_APPEND}")
# This is a workaround on AWS platform, since irqpoll may cause the hot-remove of some pci hotplug device
if is_aws_aarch64; then
KDUMP_COMMANDLINE=$(remove_cmdline_param "${KDUMP_COMMANDLINE}" irqpoll)
fi
# For secureboot enabled machines, use new kexec file based syscall. # For secureboot enabled machines, use new kexec file based syscall.
# Old syscall will always fail as it does not have capability to # Old syscall will always fail as it does not have capability to