[Bug 47471] New: Radeon - NMI: PCI system error (SERR) for reason a1 on CPU 0.

bugzilla-daemon at bugzilla.kernel.org
Thu Sep 13 08:00:48 PDT 2012


https://bugzilla.kernel.org/show_bug.cgi?id=47471

           Summary: Radeon - NMI: PCI system error (SERR) for reason a1 on
                    CPU 0.
           Product: Drivers
           Version: 2.5
    Kernel Version: 3.4.9, 3.5.3
          Platform: All
        OS/Version: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: Video(DRI - non Intel)
        AssignedTo: drivers_video-dri at kernel-bugs.osdl.org
        ReportedBy: binarycode.11 at gmail.com
        Regression: No


Created an attachment (id=80091)
 --> (https://bugzilla.kernel.org/attachment.cgi?id=80091)
dmesg - kernel 3.2.28

Distribution: slackware -current  
kernel version: 3.2.28-smp
Hardware Environment: Dell Inspiron 6400 - video card: Ati X1400

Hi! 
On my Slackware -current system, whenever I try to change the power profile of
my ATI X1400 (by echoing a value into the
"/sys/class/drm/card0/device/power_profile" file), I always get
the following NMI alert:
NMI: PCI system error (SERR) for reason a1 on CPU 0.
Dazed and confused, but trying to continue

(The reason is sometimes reported as "b1" instead of "a1".)
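
For reference, the two reason codes can be compared bit by bit. The sketch below assumes the reason value is the status byte that the x86 kernel reads from I/O port 0x61 when an NMI arrives; the function name and the bit labels are illustrative, not taken from any tool.

```shell
#!/bin/sh
# Decode the hex "reason" value printed in the NMI message.
# Assumption: the value is the x86 NMI status byte from I/O port 0x61.
# Only the upper four (status) bits are decoded; the lower nibble holds
# control bits and is ignored here.
decode_nmi_reason() {
    val=$((0x$1))
    out=""
    [ $((val & 0x80)) -ne 0 ] && out="$out SERR"            # PCI system error
    [ $((val & 0x40)) -ne 0 ] && out="$out IOCHK"           # I/O channel check
    [ $((val & 0x20)) -ne 0 ] && out="$out timer2-output"   # PIT counter 2 output
    [ $((val & 0x10)) -ne 0 ] && out="$out refresh-toggle"  # toggles continuously
    echo "${out# }"
}
```

Under that assumption, `decode_nmi_reason a1` prints `SERR timer2-output` and `decode_nmi_reason b1` prints `SERR timer2-output refresh-toggle`, so the two codes would differ only in a bit that flips on its own and both would point to the same SERR condition.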

No NMI error message is reported during system boot.
At startup the system sets the video card power profile to its "default"
state, so the state after boot is the following:
---
root at darkstar:~# cat /sys/class/drm/card0/device/power_method 
profile
root at darkstar:~# cat /sys/class/drm/card0/device/power_profile 
default

root at darkstar:~# cat /sys/kernel/debug/dri/0/radeon_pm_info 
default engine clock: 432000 kHz
current engine clock: 432000 kHz
default memory clock: 396000 kHz
current memory clock: 396000 kHz
PCIE lanes: 0
---

Every time I change the video card clock frequency by switching to another
power profile, I get the NMI alert:
---
root at darkstar:~# echo low > /sys/class/drm/card0/device/power_profile
NMI: PCI system error (SERR) for reason a1 on CPU 0.
Dazed and confused, but trying to continue
...
root at darkstar:~# cat /sys/kernel/debug/dri/0/radeon_pm_info 
default engine clock: 432000 kHz
current engine clock: 324000 kHz
default memory clock: 396000 kHz
current memory clock: 135000 kHz
PCIE lanes: 1
---
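
The steps above can be wrapped in a small script so the test is repeatable. This is only a sketch: the sysfs path is the one used above, while `DRM_DEV`, `count_nmi` and `switch_profile` are names made up for illustration, and it assumes `dmesg` is readable by the user running it.

```shell
#!/bin/sh
# Sketch: switch the radeon power profile and report how many new
# NMI SERR lines showed up in the kernel log afterwards.
# DRM_DEV and both function names are hypothetical helpers.
DRM_DEV=${DRM_DEV:-/sys/class/drm/card0/device}

# Count NMI SERR messages in kernel log text read from stdin.
# grep -c exits non-zero when the count is 0, hence the "|| true".
count_nmi() {
    grep -cF 'NMI: PCI system error (SERR)' || true
}

# Write the requested profile and compare the NMI count before and after.
switch_profile() {
    profile=$1
    before=$(dmesg | count_nmi)
    echo "$profile" > "$DRM_DEV/power_profile"
    sleep 1   # give the kernel a moment to log the NMI
    after=$(dmesg | count_nmi)
    echo "profile=$(cat "$DRM_DEV/power_profile") new_nmi=$((after - before))"
}
```

After sourcing the file as root, `switch_profile low` followed by `switch_profile default` should show whether each switch adds a new NMI line. The count can be misleading if the kernel ring buffer wraps between the two reads.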

The NMI alert still occurs with KMS disabled (booting with the "nomodeset"
kernel option).
With KMS disabled, I tried to reduce the power by adding the following three
lines to the "Device Section" of a new "xorg.conf":
  Option "DynamicPM"         "on"
  Option "ClockGating"       "on"
  Option "ForceLowPowerMode" "on"
But as soon as I run startx, I receive the NMI alert!

I have seen this issue on several 3.2.x kernel versions up to 3.2.28
(specifically: 3.2.23, 3.2.26, 3.2.27, 3.2.28).

I tried upgrading to the Slackware vanilla kernel 3.4.9-smp (now in
testing). I noticed slightly different behaviour compared to the 3.2.x
kernels:
Changing the power profile no longer produces the NMI error message,
but the NMI error message always occurs during system boot!
(I've attached the dmesg-kernel_3.4.9, too.)
The same happens when running Gentoo live 12.1 (kernel 3.3.0) and Knoppix live
7.0.4 (kernel 3.4.9).

Despite the NMI alerts the system works well, but I'm worried that the NMI
may be warning about (or could lead to) a video card failure.


