Re: ✗ Fi.CI.BAT: failure for Intel CCS + 10bpc/fp16 stuff

Knop, Ryszard ryszard.knop at intel.com
Wed Jul 3 13:37:39 UTC 2024


On Wed, 2024-07-03 at 16:17 +0300, Ville Syrjälä wrote:
> On Wed, Jul 03, 2024 at 01:10:44PM -0000, Patchwork wrote:
> > == Series Details ==
> > 
> > Series: Intel CCS + 10bpc/fp16 stuff
> > URL   : https://patchwork.freedesktop.org/series/135670/
> > State : failure
> > 
> > == Summary ==
> > 
> > CI Bug Log - changes from IGT_7911 -> IGTPW_11352
> > ====================================================
> > 
> > Summary
> > -------
> > 
> >   **FAILURE**
> > 
> >   Serious unknown changes coming with IGTPW_11352 absolutely need to be
> >   verified manually.
> >   
> >   If you think the reported changes have nothing to do with the changes
> >   introduced in IGTPW_11352, please notify your bug team (I915-ci-infra at lists.freedesktop.org) to allow them
> >   to document this new failure mode, which will reduce false positives in CI.
> > 
> >   External URL: https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_11352/index.html
> > 
> > Participating hosts (39 -> 41)
> > ------------------------------
> > 
> >   Additional (3): bat-kbl-2 fi-cfl-8109u bat-jsl-3 
> >   Missing    (1): fi-snb-2520m 
> > 
> > Possible new issues
> > -------------------
> > 
> >   Here are the unknown changes that may have been introduced in IGTPW_11352:
> > 
> > ### IGT changes ###
> > 
> > #### Possible regressions ####
> > 
> >   * igt at i915_selftest@live at gem_contexts:
> >     - bat-jsl-1:          [PASS][1] -> [INCOMPLETE][2]
> >    [1]: https://intel-gfx-ci.01.org/tree/drm-tip/IGT_7911/bat-jsl-1/igt@i915_selftest@live@gem_contexts.html
> >    [2]: https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_11352/bat-jsl-1/igt@i915_selftest@live@gem_contexts.html
> 
> <4>[  520.753491] nvme nvme0: I/O tag 717 (72cd) QID 3 timeout, completion polled
> <4>[  551.089459] nvme nvme0: I/O tag 718 (22ce) QID 3 timeout, completion polled
> 
> Is the disk dying?
> 

+Mateusz, +Grzegorz: There are some PCIe communication errors around this
drive, plus commands timing out. The SMART and error logs look healthy
though, with wear at 18%. Is it either the drive or the M.2 PCIe slot?

