[Intel-gfx] [PATCH 00/12] i915 perf support for command stream based OA, GPU and workload metrics capture
Sagar Arun Kamble
sagar.a.kamble at intel.com
Mon Jul 31 07:59:33 UTC 2017
This series is prepared from below two series posted by Sourab in March.
1. https://patchwork.freedesktop.org/series/21351/ - Collect command stream
based OA reports using i915 perf
2. https://patchwork.freedesktop.org/series/21352/ - Collect command stream
based GPU metrics for all engines using i915 perf
This series addresses most of the review comments from above two. Major
change is moving the stream structure and information from dev_priv to
per-engine structures. Stating below the intent of this series from cover
letters of earlier series.
This series adds framework for
1. Collection of OA reports associated with the render command stream, which
are collected around batchbuffer boundaries.
2. Collect other metadata such as ctx_id, pid, tag etc. with the samples,
and thus we can establish the association of samples collected with the
corresponding process/workload.
3. Collection of GPU performance metrics associated with the command stream of
a particular engine. These metrics include timestamps of work submission and
completion on engines, mmio metrics, etc. These metrics are are collected
around batchbuffer boundaries.
Functionality to be added in future patches:
1. GPU/CPU cross-timestamp sync patches need to be reworked as requested by
kernel maintainers.
2. Some of the data types being collected through these patches can be done in
the userspace and that is yet to be finalized. Based on that some of the
functionality from this series can be pruned.
3. Add support in the perf IGT tests for verifying CS based perf functionality.
Cc: Lionel Landwerlin <lionel.g.landwerlin at intel.com>
Cc: Matthew Auld <matthew.auld at intel.com>
Cc: Chris Wilson <chris at chris-wilson.co.uk>
Sourab Gupta (12):
drm/i915: Add ctx getparam ioctl parameter to retrieve ctx unique id
drm/i915: Expose OA sample source to userspace
drm/i915: Framework for capturing command stream based OA reports and
ctx id info.
drm/i915: Flush periodic samples, in case of no pending CS sample
requests
drm/i915: Inform userspace about command stream OA buf overflow
drm/i915: Populate ctx ID for periodic OA reports
drm/i915: Add support for having pid output with OA report
drm/i915: Add support for emitting execbuffer tags through OA counter
reports
drm/i915: Add support for collecting timestamps on all gpu engines
drm/i915: Extract raw GPU timestamps from OA reports to forward in
perf samples
drm/i915: Async check for streams data availability with hrtimer
rescheduling
drm/i915: Support for capturing MMIO register values
drivers/gpu/drm/i915/i915_drv.h | 165 ++-
drivers/gpu/drm/i915/i915_gem.c | 1 +
drivers/gpu/drm/i915/i915_gem_context.c | 3 +
drivers/gpu/drm/i915/i915_gem_execbuffer.c | 11 +
drivers/gpu/drm/i915/i915_perf.c | 1790 ++++++++++++++++++++++++----
drivers/gpu/drm/i915/i915_reg.h | 6 +
drivers/gpu/drm/i915/intel_engine_cs.c | 4 +
drivers/gpu/drm/i915/intel_ringbuffer.c | 2 +
drivers/gpu/drm/i915/intel_ringbuffer.h | 8 +
include/uapi/drm/i915_drm.h | 69 ++
10 files changed, 1798 insertions(+), 261 deletions(-)
--
1.9.1
More information about the Intel-gfx
mailing list