[Bug 107475] [iGVT-g][SKL] GPU Hang and iGVT-g guest crash under certain loads

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Mon Oct 22 23:59:57 UTC 2018


https://bugs.freedesktop.org/show_bug.cgi?id=107475

--- Comment #5 from leozinho29_eu at hotmail.com ---
The GPU hangs are no longer present, but the error in QEMU 3.0.0:

qemu-system-x86_64:
vfio_region_write(123f09b0-4c00-11e8-a6ca-f3c21e47e012:region0+0x24ec,
0x83a8,4) failed: Endereço inválido

Means it's not possible to ensure the guest and host system will be stable. At
the same time QEMU 3.0.0 prints the error, dmesg prints messages related:

[  600.375287] gvt: vgpu(1) Invalid FORCE_NONPRIV write 83a8
[  600.375292] gvt: vgpu 1: fail to emulate MMIO write 000024ec len 4

Under heavy guest load, the guest and/or host can crash. QEMU from KVMGT
release
(https://lists.freedesktop.org/archives/intel-gfx/2018-October/179043.html) is
silent, not printing anything about the error.

The guest can crash with a blue screen and the host is OK. The guest may crash
and the host can crash together.

When the host crashes, dmesg has a message in the style of a kernel WARNING or
kernel BUG but the computer freezes and the message is not saved for reboot.
The following messages were the last ones saved before the host crash:

Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188019] gvt: guest page
write error, gpa 28f4c000
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188032] gvt: guest page
write error, gpa 28f4c010
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188043] gvt: guest page
write error, gpa 28f4c020
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188052] gvt: guest page
write error, gpa 28f4c030
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188062] gvt: guest page
write error, gpa 28f4c040
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188072] gvt: guest page
write error, gpa 28f4c050
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188081] gvt: guest page
write error, gpa 28f4c060
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188091] gvt: guest page
write error, gpa 28f4c070
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188101] gvt: guest page
write error, gpa 28f4c080
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188110] gvt: guest page
write error, gpa 28f4c090
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188120] gvt: guest page
write error, gpa 28f4c0a0
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188131] gvt: guest page
write error, gpa 28f4c0b0
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188143] gvt: guest page
write error, gpa 28f4c0c0
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188156] gvt: guest page
write error, gpa 28f4c0d0
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188297] gvt: guest page
write error, gpa 28f4c0e0
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188311] gvt: guest page
write error, gpa 28f4c0f0
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188323] gvt: guest page
write error, gpa 28f4c100
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188333] gvt: guest page
write error, gpa 28f4c110
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188343] gvt: guest page
write error, gpa 28f4c120
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188352] gvt: guest page
write error, gpa 28f4c130
Oct 22 20:04:32 Lenovo-ideapad-310-14ISK kernel: [ 4102.188362] gvt: guest page
write error, gpa 28f4c140

I tested with QEMU from KVMGT release and QEMU 3.0.0, kernel 4.17.19, from the
KVMGT release and drm-tip. Any combination of them causes the problems when the
guest is under heavy load.

The only option for now is to use the Intel Windows driver version
24.20.100.6136, as it has no error related to:

[  600.375292] gvt: vgpu 1: fail to emulate MMIO write 000024ec len 4

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.
You are the QA Contact for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20181022/5c54a524/attachment.html>


More information about the intel-gfx-bugs mailing list