The 2.6.32-220 kernel hangs on bootup on HP DL385G6 with HP Smart Array P410 controller and hpsa driver

Issue:

After upgrading to RedHat 6.2, the new kernel (2.6.32-220) doesn't boot. The same machine boots fine on 6.1.z kernel 2.6.32-131.21.1

While loading the hpsa driver the following messages appear

        HP HPSA Driver (v 2.0.2-3)
        hpsa 0000:48:00.0: PCI INT A -> GSI 33 (level, low) -> IRQ 33
        Uhhuh. NMI received for unknown reason a1 on CPU 0.
        You have some hardware problem, likely on the PCI bus.
        Dazed and confused, but trying to continue
        hpsa 0000:48:00.0: MSIX
        hpsa 0000:48:00.0: hpsa0: <0x323a at IRQ 48 using DAC
        hpsa 0000:48:00.0: Direct-Access    device c4b0t0I0 added.
        hpsa 0000:48:00.0: RAID          device c4b3t0I0 added.
        scsi 4:0:0:0: Direct-Access    HP     LOGICAL VOLUME   1.66 PQ: 0 ANSI: 5
        scsi 4:3:0:0: RAID             HP     P410         1.66 PQ: 0 ANSI: 0
        

Environment:

  • Red Hat Enterprise Linux 7.2 (2.6.32-220)
  • Red Hat Enterprise Linux 6.1 does not exhibit this problem
  • HP DL385G6
  • HP Smart Array P410 controller and hpsa driver

Resolution:

Workaround: adding pcie_aspm=off to kernel command line allows the system to boot.

A permanent kernel fix is tracked by Red Hat Bugzilla #769626. Contact your support representative for further details.

Root Cause:

PCI Express (PCIE) Active State Power Management (ASPM) was incorrectly being enabled, and interferes with the correct operation of hpsa driver.

Diagnostic Steps:

Disable NMI watchdog by adding nmi_watchdog=0 to the kernel cmd line and check if the system boots up.
FAILURE

Set pci_aspm=off to the kernel cmd line and check if the system boots up.
SUCCESS


Back to top...

+ Recent posts