[Bug 102433] GPU hang resulting in Freeze(?) then unclean logout (possibly connected to LibreOffice)

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Wed Nov 8 16:23:50 UTC 2017


https://bugs.freedesktop.org/show_bug.cgi?id=102433

--- Comment #8 from Elizabeth <elizabethx.de.la.torre.mena at intel.com> ---
(In reply to wettererscheinung from comment #7)
>...
> * Nonetheless, how do I apply this on grub?
Hello Maria, to apply this execute:
$ sudo nano /etc/default/grub
  Add intel_iommu=igfx_off inside the "" after the grub command line, i.e.:
  GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=igfx_off"
Save and close. Then apply:
$sudo update-grub
And then reboot.

> * I read that virtualization won't work anymore - is that true?
> (This would be a problem as I do use virtualbox regularly)
You can find more information over the internet:
https://en.wikipedia.org/wiki/Input%E2%80%93output_memory_management_unit#Virtualization
, Virtualization should keep working.

> * When I understood the info about this feature right you mean that
> possibly the GPU doesn't work correctly with the DMA Re-Mapping?
> Is that an hardware/guarantee issue?
You could do a memtest86 to be sure your memory is working correctly:
On debian,  do 'apt install memtest86'. You should see it in the grub options
as a boot target that you can choose.
There is no log.  If memtest reports an error, you have to replace your memory. 
If it was a DMAR error, that should be follow on bug 89360.

> Bytheway I experience two kinds of occurances: 1 - freeze, then logout;
> 2 - total freeze, no change, only poweroff helps therefore no error
> report possible
Those could be different issues, though you would need to identify a patron to
determine if they should be worked separately.

> Yours
> Maria

>From error state:

ERROR: 0x00000000
FAULT_TLB_DATA: 0x0000001b 0xaacb0b2b
    Address 0x0000baacb0b2b000 GGTT
DONE_REG: 0x07ffffff
render command stream:
  START: 0x00011000
  HEAD:  0xf9001d80 [0x00001d28]
    head = 0x00001d80, wraps = 1992
  TAIL:  0x00001da8 [0x00001d80, 0x00001da8]
  CTL:   0x00003001
    len=16384, enabled
  MODE:  0x00000000
  HWS:   0xfffe8000
  ACTHD: 0x00000000 f9001d80
    at ring: 0x00000000
  IPEIR: 0x00000000
  IPEHR: 0x7a000004
  INSTDONE: 0xffdfffff
    busy: CS
  SC_INSTDONE: 0xfffffbff
  SAMPLER_INSTDONE[0][0]: 0xffffffff
  SAMPLER_INSTDONE[0][1]: 0xffffffff
  SAMPLER_INSTDONE[0][2]: 0xffffffff
  ROW_INSTDONE[0][0]: 0xfffffffd
  ROW_INSTDONE[0][1]: 0xfffffffd
  ROW_INSTDONE[0][2]: 0xfffffffd
  batch: [0x00000000_044a6000, 0x00000000_044ae000]
  BBADDR: 0x00000000_044a631c
  BB_STATE: 0x00000020
  INSTPS: 0x00008980
  INSTPM: 0x00000000
  FADDR: 0x00000000 00012da8
  RC PSMI: 0x00000010
  FAULT_REG: 0x00000000
  SYNC_0: 0x00000000
  SYNC_1: 0x00000000
  SYNC_2: 0x00000000
  GFX_MODE: 0x00008000
  PDP0: 0x000000041915e000
  PDP1: 0x0000000000000000
  PDP2: 0x0000000000000000
  PDP3: 0x0000000000000000
  seqno: 0x002e45d4
  last_seqno: 0x002e45d6
  waiting: yes
  ring->head: 0x00001d00
  ring->tail: 0x00001da8
  hangcheck stall: yes
  hangcheck action: dead
  hangcheck action timestamp: 4331761496, 122744 ms ago
  ELSP[0]:  pid 1042, ban score 0, seqno        2:002e45d5, emitted 123896ms
ago, head 00001d28, tail 00001da8
  ELSP[1]:  pid 1904, ban score 0, seqno        a:002e45d6, emitted 123896ms
ago, head 00001c10, tail 00001c88
  Active context: Xorg[1042] user_handle 1 hw_id 2, ban score 0 guilty 0 active
0

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
You are the assignee for the bug.
You are on the CC list for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/intel-gfx-bugs/attachments/20171108/62c36fb6/attachment.html>


More information about the intel-gfx-bugs mailing list