[igt-dev] [PATCH i-g-t] lib/igt_device_scan: Rescan pci properties if PCI_SLOT_NAME wasn't found

Zbigniew Kempczyński zbigniew.kempczynski at intel.com
Thu Aug 11 07:38:36 UTC 2022


On Wed, Aug 10, 2022 at 03:38:40PM +0300, Petri Latvala wrote:
> On Wed, Aug 10, 2022 at 10:19:49AM +0200, Zbigniew Kempczyński wrote:
> > References: https://gitlab.freedesktop.org/drm/intel/-/issues/6543
> > 
> > Due to lack of reproduction path of the above issue more data about
> > missing PCI_SLOT_NAME property is required.
> > 
> > What is extremely weird when PCI_SLOT_NAME is missing udev returns some
> > properties, like:
> > 
> > [properties]
> > DEVPATH                         : /devices/pci0000:00/0000:00:02.0
> > DRIVER                          : i915
> > PCI_CLASS                       : 30000
> > PCI_ID                          : 8086:191E
> > SUBSYSTEM                       : pci
> > 
> > To narrow the problematic code when PCI_SLOT_NAME is missing lets dump
> > kernel uevent file and retry scanning properties from udev. Retry path
> > allows to detect if udev returns same list of properties.
> > 
> > Above doesn't fix the issue. It is providing additional information about
> > devices and their properties provided from udev, especially when missing
> > PCI_SLOT_NAME will strike again. It also contains warning which might
> > detect situation where two scanning of properties will provide different
> > results.
> > 
> > Signed-off-by: Zbigniew Kempczyński <zbigniew.kempczynski at intel.com>
> > Cc: Petri Latvala <petri.latvala at intel.com>
> > ---
> >  lib/igt_device_scan.c | 48 +++++++++++++++++++++++++++++++++++++------
> >  1 file changed, 42 insertions(+), 6 deletions(-)
> > 
> > diff --git a/lib/igt_device_scan.c b/lib/igt_device_scan.c
> > index d6fae0650c..c2ecfcd3d4 100644
> > --- a/lib/igt_device_scan.c
> > +++ b/lib/igt_device_scan.c
> > @@ -568,28 +568,54 @@ static void dump_props_and_attrs(const struct igt_device *dev)
> >  	printf("\n");
> >  }
> >  
> > +static void dump_uevent_file(struct igt_device *dev)
> > +{
> > +	char filename[FILENAME_MAX];
> > +	const char *devpath = get_prop(dev, "DEVPATH");
> > +	char *line = NULL;
> > +	FILE *in;
> > +	size_t n;
> > +
> > +	igt_assert_f(devpath, "DEVPATH property doesn't exist\n");
> > +	snprintf(filename, FILENAME_MAX, "/sys%s/uevent", devpath);
> > +
> > +	in = fopen(filename, "r");
> > +	igt_assert(in);
> > +
> > +	printf("[uevent: %s]\n", filename);
> > +	while (getline(&line, &n, in) >= 0)
> > +		printf("%s", line);
> 
> Bleh, I was going to ask you to use igt_info instead so we get the log
> buffer out in the correct order, but the file already has a lot of
> printfs for lsgpu's sake. Let's consider that a cleanup TODO for
> later...
> 
> Reviewed-by: Petri Latvala <petri.latvala at intel.com>
> 
> > +
> > +	free(line);
> > +	fclose(in);
> > +}
> > +
> >  /*
> >   * Get PCI_SLOT_NAME property, it should be in format of
> >   * xxxx:yy:zz.z
> >   */
> > -static void set_pci_slot_name(struct igt_device *dev)
> > +static bool set_pci_slot_name(struct igt_device *dev)
> >  {
> >  	const char *pci_slot_name = get_prop(dev, "PCI_SLOT_NAME");
> >  	int len;
> >  
> >  	if (!pci_slot_name) {
> >  		dump_props_and_attrs(dev);
> > -		igt_assert_f(pci_slot_name, "PCI_SLOT_NAME property == NULL\n");
> > +		igt_warn("PCI_SLOT_NAME property == NULL\n");
> > +		dump_uevent_file(dev);
> > +		return false;
> >  	}
> >  
> >  	len = strlen(pci_slot_name);
> >  	if (len != PCI_SLOT_NAME_SIZE) {
> >  		dump_props_and_attrs(dev);
> > -		igt_assert_f(len != PCI_SLOT_NAME_SIZE,
> > -			     "PCI_SLOT_NAME length != %d [%s]\n", len, pci_slot_name);
> > +		igt_warn("PCI_SLOT_NAME length != %d [%s]\n", len, pci_slot_name);
> > +		dump_uevent_file(dev);
> > +		return false;
> >  	}
> >  
> >  	dev->pci_slot_name = strdup(pci_slot_name);
> > +	return true;
> >  }
> >  
> >  /*
> > @@ -649,7 +675,16 @@ static struct igt_device *igt_device_new_from_udev(struct udev_device *dev)
> >  		uint16_t vendor, device;
> >  
> >  		set_vendor_device(idev);
> > -		set_pci_slot_name(idev);
> > +
> > +		/*
> > +		 * Very rare we observe there's no PCI_SLOT_NAME property.
> > +		 * We depend on it so retry acquiring properties from udev.
> > +		 */
> > +		if (!set_pci_slot_name(idev)) {
> > +			g_hash_table_remove_all(idev->props_ht);
> > +			get_props(dev, idev);
> > +			igt_assert(set_pci_slot_name(idev));
> > +		}
> >  		get_pci_vendor_device(idev, &vendor, &device);
> >  		idev->codename = __pci_codename(vendor, device);
> >  		idev->dev_type = __pci_devtype(vendor, device, idev->pci_slot_name);
> > @@ -1270,7 +1305,8 @@ igt_devs_print_detail(struct igt_list_head *view,
> >  			_print_key_value("codename", dev->codename);
> >  		}
> >  
> > -		dump_props_and_attrs(dev);
> > +		if (is_pci_subsystem(dev))
> > +			dump_props_and_attrs(dev);

This incidentally left in patch I've sent (I just narrowed debugging
to pci subsystem). Do you want to resend or I can just go back with 
previous code and merge?

--
Zbigniew

> >  	}
> >  }
> >  
> > -- 
> > 2.34.1
> > 


More information about the igt-dev mailing list