✗ Xe.CI.Full: failure for tests/intel/xe_wedged: Manage verbosity of GT reset fault injection
Matt Roper
matthew.d.roper at intel.com
Wed Nov 27 23:16:22 UTC 2024
On Wed, Nov 27, 2024 at 12:30:49AM +0000, Patchwork wrote:
> == Series Details ==
>
> Series: tests/intel/xe_wedged: Manage verbosity of GT reset fault injection
> URL : https://patchwork.freedesktop.org/series/141810/
> State : failure
>
> == Summary ==
>
> CI Bug Log - changes from XEIGT_8126_full -> XEIGTPW_12203_full
> ====================================================
>
> Summary
> -------
>
> **FAILURE**
>
> Serious unknown changes coming with XEIGTPW_12203_full absolutely need to be
> verified manually.
>
> If you think the reported changes have nothing to do with the changes
> introduced in XEIGTPW_12203_full, please notify your bug team (I915-ci-infra at lists.freedesktop.org) to allow them
> to document this new failure mode, which will reduce false positives in CI.
>
>
>
> Participating hosts (4 -> 4)
> ------------------------------
>
> No changes in participating hosts
>
> Possible new issues
> -------------------
>
> Here are the unknown changes that may have been introduced in XEIGTPW_12203_full:
>
> ### IGT changes ###
>
> #### Possible regressions ####
>
> * igt at core_hotunplug@hotreplug:
> - shard-lnl: [PASS][1] -> [DMESG-WARN][2]
> [1]: https://intel-gfx-ci.01.org/tree/intel-xe/IGT_8126/shard-lnl-5/igt@core_hotunplug@hotreplug.html
> [2]: https://intel-gfx-ci.01.org/tree/intel-xe/IGTPW_12203/shard-lnl-6/igt@core_hotunplug@hotreplug.html
This change to the xe_wedged test would not impact the behavior of the
core_hotunplug test.
> Known issues
> ------------
<snip>
> * igt at xe_wedged@basic-wedged:
> - shard-dg2-set2: [SKIP][800] ([Intel XE#1130]) -> [DMESG-WARN][801] ([Intel XE#2919])
> [800]: https://intel-gfx-ci.01.org/tree/intel-xe/IGT_8126/shard-dg2-466/igt@xe_wedged@basic-wedged.html
> [801]: https://intel-gfx-ci.01.org/tree/intel-xe/IGTPW_12203/shard-dg2-435/igt@xe_wedged@basic-wedged.html
Although this patch completely fixed the issue for me on the LNL system
I was using to test locally, it looks like we do still need to tell
igt_runner to ignore one more non-stackdump message to completely fix
the bug on all platforms (which I think the other patch on the list from
Kamil should take care of).
Applied to master with Kamil's r-b from the gitlab comment. Since it
looks like this wasn't a full fix (despite being a move in the right
direction), I re-opened the gitlab issue that got auto-closed as well
until we have the rest of the fix.
Matt
--
Matt Roper
Graphics Software Engineer
Linux GPU Platform Enablement
Intel Corporation
More information about the igt-dev
mailing list