[Intel-gfx] TIMESTAMP register

Lawrynowicz, Jacek jacek.lawrynowicz at intel.com
Wed Apr 18 08:32:04 CEST 2012

I think that ioctl to get GPU timestamp and its resolution would be the best 
This ioctl won't be called very often so there's no need to sacrifice 
simplicity for performance in this case.
Using clock_gettime() sounds like trouble to me.
How would you keep GPU and CPU time synchronized?
Is forcewakeup necessary in case of a register which is not affected by gfx 


-----Original Message-----
From: intel-gfx-bounces+jacek.lawrynowicz=intel.com at lists.freedesktop.org 
[mailto:intel-gfx-bounces+jacek.lawrynowicz=intel.com at lists.freedesktop.org] 
On Behalf Of Chris Wilson
Sent: Wednesday, April 18, 2012 1:52 AM
To: Ben Widawsky; Daniel Vetter
Cc: intel-gfx at lists.freedesktop.org
Subject: Re: [Intel-gfx] TIMESTAMP register

On Tue, 17 Apr 2012 16:27:45 -0700, Ben Widawsky <ben at bwidawsk.net> wrote:
> On Tue, 17 Apr 2012 23:04:18 +0200
> Daniel Vetter <daniel at ffwll.ch> wrote:
> > On Tue, Apr 17, 2012 at 08:34:11PM +0000, Lawrynowicz, Jacek wrote:
> > > ARB_timer_query allows client read TIMESTAMP both asynchronously and 
> > > synchronously.
> > > The former can be implemented as you said but the latter requires 
> > > support from the KMD.
> > > This must be a simple MMIO read as this is the only way to report 
> > > "current" GPU time.
> > > Implementing synchronous TIMESTAMP query using pipe control would render 
> > > the third example from ARB_timer_query spec useless.
> >
> > Ok, I've looked like a dofus again, but now I've read the spec and
> > we indeed seem to need a synchronous readout of the TIMESTAMP
> > register. I guess a new register will do, together with some
> > fixed-point integer that tells userspace how to convert it to nanoseconds.
> > -Daniel
> I've not read the spec, but synchronous and "current" doesn't mean the
> exact same thing to me. I assume the spec doesn't allow getting the
> value in a batch and then just waiting for rendering to complete?

The spec stipulates that the client is able to query the timestamp counter 
synchronously from within the render stream (ala PIPE_CONTROL) and query the 
current timestamp asynchronously. The spec also explicitly allows for those 
two clocks to be different (though close enough for the user to not care). 
Therefore you need only use the nanosecond monotonic clock for the 
asynchronous query and apply an offset to the GPU timestamp when converting 
that from ticks to nanoseconds. My bet is that
clock_gettime() is going to beat even ioctl(QUERY_COUNTER), not least because 
TIMESTAMP (being a per-ring register) is going to require the forcewake dance.

Chris Wilson, Intel Open Source Technology Centre 
Intel-gfx mailing list
Intel-gfx at lists.freedesktop.org
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 8658 bytes
Desc: not available
URL: <http://lists.freedesktop.org/archives/intel-gfx/attachments/20120418/f801122f/attachment.bin>
-------------- next part --------------
Intel Technology Poland sp. z o.o.
z siedziba w Gdansku
ul. Slowackiego 173
80-298 Gdansk

Sad Rejonowy Gdansk Polnoc w Gdansku, 
VII Wydzial Gospodarczy Krajowego Rejestru Sadowego, 
numer KRS 101882

NIP 957-07-52-316
Kapital zakladowy 200.000 zl

This e-mail and any attachments may contain confidential material for
the sole use of the intended recipient(s). Any review or distribution
by others is strictly prohibited. If you are not the intended
recipient, please contact the sender and delete all copies.

More information about the Intel-gfx mailing list