[Intel-gfx] [CI 16/20] drm/i915: Only query timestamp when measuring elapsed time
Tvrtko Ursulin
tvrtko.ursulin at linux.intel.com
Mon May 23 08:54:41 UTC 2016
On 20/05/16 13:20, Chris Wilson wrote:
> On Thu, May 19, 2016 at 04:44:03PM +0100, Tvrtko Ursulin wrote:
>>
>> On 19/05/16 12:32, Chris Wilson wrote:
>>> Avoid the two calls to ktime_get_raw_ns() (at best it reads the TSC) as
>>> we only need to compute the elapsed time for a timed wait.
>>>
>>> v2: Eliminate the unused local variable reducing the function size by 64
>>> bytes (using the storage space on the callers stack rather than adding
>>> to our stack frame)
>>>
>>> Signed-off-by: Chris Wilson <chris at chris-wilson.co.uk>
>>> ---
>>> drivers/gpu/drm/i915/i915_gem.c | 14 +++++---------
>>> 1 file changed, 5 insertions(+), 9 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/i915/i915_gem.c b/drivers/gpu/drm/i915/i915_gem.c
>>> index b48a3b46e86f..2c254cf49c15 100644
>>> --- a/drivers/gpu/drm/i915/i915_gem.c
>>> +++ b/drivers/gpu/drm/i915/i915_gem.c
>>> @@ -1215,7 +1215,6 @@ int __i915_wait_request(struct drm_i915_gem_request *req,
>>> int state = interruptible ? TASK_INTERRUPTIBLE : TASK_UNINTERRUPTIBLE;
>>> struct intel_wait wait;
>>> unsigned long timeout_remain;
>>> - s64 before = 0; /* Only to silence a compiler warning. */
>>> int ret = 0;
>>>
>>> might_sleep();
>>> @@ -1234,12 +1233,9 @@ int __i915_wait_request(struct drm_i915_gem_request *req,
>>> if (*timeout == 0)
>>> return -ETIME;
>>>
>>> + /* Record current time in case interrupted, or wedged */
>>> timeout_remain = nsecs_to_jiffies_timeout(*timeout);
>>> -
>>> - /*
>>> - * Record current time in case interrupted by signal, or wedged.
>>> - */
>>> - before = ktime_get_raw_ns();
>>> + *timeout += ktime_get_raw_ns();
>>> }
>>>
>>> trace_i915_gem_request_wait_begin(req);
>>> @@ -1296,9 +1292,9 @@ complete:
>>> trace_i915_gem_request_wait_end(req);
>>>
>>> if (timeout) {
>>> - s64 tres = *timeout - (ktime_get_raw_ns() - before);
>>> -
>>> - *timeout = tres < 0 ? 0 : tres;
>>> + *timeout -= ktime_get_raw_ns();
>>> + if (*timeout < 0)
>>> + *timeout = 0;
>>>
>>> /*
>>> * Apparently ktime isn't accurate enough and occasionally has a
>>>
>>
>> I think this is bad, better have a local than play games with
>> callers storage.
>
> It's smaller faster code :-)
You know I am normally all for that but this one I cannot approve.
Regards,
Tvrtko
More information about the Intel-gfx
mailing list