[Linaro-mm-sig] thoughts of looking at android fences
Maarten Lankhorst
maarten.lankhorst at canonical.com
Mon Nov 4 01:36:54 PST 2013
op 02-11-13 22:36, Colin Cross schreef:
> On Wed, Oct 30, 2013 at 5:17 AM, Maarten Lankhorst
> <maarten.lankhorst at canonical.com> wrote:
>> op 24-10-13 14:13, Maarten Lankhorst schreef:
>>> So I actually tried to implement it now. I killed all the deprecated members and assumed a linear timeline.
>>> This means that syncpoints can only be added at the end, not in between. In particular it means sw_sync
>>> might be slightly broken.
>>>
>>> I only tested it with a simple program I wrote called ufence.c, it's in drivers/staging/android/ufence.c in the following tree:
>>>
>>> http://cgit.freedesktop.org/~mlankhorst/linux
>>>
>>> the "rfc: convert android to fence api" has all the changes from my dma-fence proposal to what android would need,
>>> it also converts the userspace fence api to use the dma-fence api.
>>>
>>> sync_pt is implemented as fence too. This meant not having to convert all of android right away, though I did make some changes.
>>> I killed the deprecated members and made all the fence calls forward to the sync_timeline_ops. dup and compare are no longer used.
>>>
>>> I haven't given this a spin on a full android kernel, only with the components that are in mainline kernel under staging and my dumb test program.
>>>
>>> ~Maarten
>>>
>>> PS: The nomenclature is very confusing. I want to rename dma-fence to syncpoint, but I want some feedback from the android devs first. :)
>>>
>> Come on, any feedback? I want to move the discussion forward.
>>
>> ~Maarten
> I experimented with it a little on a device that uses sync and came
> across a few bugs:
> 1. sync_timeline_signal needs to call __fence_signal on all signaled
> points on the timeline, not just the first
> 2. fence_add_callback doesn't always initialize cb.node
> 3. sync_fence_wait should take ms
> 4. sync_print_pt status printing was incorrect
> 5. there is a deadlock:
> sync_print_obj takes obj->child_list_lock
> sync_print_pt
> fence_is_signaled
> fence_signal takes fence->lock == obj->child_list_lock
> 6. freeing a timeline before all the fences holding points on that
> timeline have timed out results in a crash
>
> With the attached patch to fix these issues, our libsync and sync_test
> give the same results as with our sync code. I haven't tested against
> the full Android framework yet.
>
> The compare op and timeline ordering is critical to the efficiency of
> sync points on Android. The compare op is used when merging fences to
> drop all but the latest point on the same timeline. This is necessary
> for example when the same buffer is submitted to the display on
> multiple frames, like when there is a live wallpaper in the background
> updating at 60 fps and a static screen of widgets on top of it. The
> static widget buffer is submitted on every frame, returning a new
> fence each time. The compositor merges the new fence with the fence
> for the previous buffer, and because they are on the same timeline it
> merges down to a single point. I experimented with disabling the
> merge optimization on a Nexus 10, and found that leaving the screen on
> running a live wallpaper eventually resulted in 100k outstanding sync
> points.
Hey,
fence_add_callback will now always initialize cb->node, even on failure.
I added __fence_is_signaled, to be used with the lock held.
sync_print_pt didn't work when the fence was signaled with an error, I fixed that.
So I reworked patch below, no merge optimization yet. It will be done as a separate patch. :)
---
diff --git a/drivers/base/fence.c b/drivers/base/fence.c
index 89c89ae19f58..9e7a63c4b07f 100644
--- a/drivers/base/fence.c
+++ b/drivers/base/fence.c
@@ -185,8 +185,10 @@ int fence_add_callback(struct fence *fence, struct fence_cb *cb,
if (WARN_ON(!fence || !func))
return -EINVAL;
- if (test_bit(FENCE_FLAG_SIGNALED_BIT, &fence->flags))
+ if (test_bit(FENCE_FLAG_SIGNALED_BIT, &fence->flags)) {
+ LIST_INIT_HEAD(&cb->node);
return -ENOENT;
+ }
spin_lock_irqsave(fence->lock, flags);
@@ -202,7 +204,8 @@ int fence_add_callback(struct fence *fence, struct fence_cb *cb,
if (!ret) {
cb->func = func;
list_add_tail(&cb->node, &fence->cb_list);
- }
+ } else
+ LIST_INIT_HEAD(&cb->node);
spin_unlock_irqrestore(fence->lock, flags);
return ret;
diff --git a/drivers/staging/android/sync.c b/drivers/staging/android/sync.c
index 110a9e99cb71..2c7fd3f2ab23 100644
--- a/drivers/staging/android/sync.c
+++ b/drivers/staging/android/sync.c
@@ -74,6 +74,16 @@ static void sync_timeline_free(struct kref *kref)
kfree(obj);
}
+static void sync_timeline_get(struct sync_timeline *obj)
+{
+ kref_get(&obj->kref);
+}
+
+static void sync_timeline_put(struct sync_timeline *obj)
+{
+ kref_put(&obj->kref, sync_timeline_free);
+}
+
void sync_timeline_destroy(struct sync_timeline *obj)
{
obj->destroyed = true;
@@ -83,8 +93,8 @@ void sync_timeline_destroy(struct sync_timeline *obj)
* that their parent is going away.
*/
- if (!kref_put(&obj->kref, sync_timeline_free))
- sync_timeline_signal(obj);
+ sync_timeline_signal(obj);
+ sync_timeline_put(obj);
}
EXPORT_SYMBOL(sync_timeline_destroy);
@@ -98,12 +108,8 @@ void sync_timeline_signal(struct sync_timeline *obj)
spin_lock_irqsave(&obj->child_list_lock, flags);
list_for_each_entry_safe(pt, next, &obj->active_list_head, active_list) {
- if (!pt->base.ops->signaled(&pt->base))
- break;
- else {
- __fence_signal(&pt->base);
+ if (__fence_is_signaled(&pt->base))
list_del(&pt->active_list);
- }
}
spin_unlock_irqrestore(&obj->child_list_lock, flags);
}
@@ -122,6 +128,7 @@ struct sync_pt *sync_pt_create(struct sync_timeline *obj, int size)
return NULL;
spin_lock_irqsave(&obj->child_list_lock, flags);
+ sync_timeline_get(obj);
__fence_init(&pt->base, &android_fence_ops, &obj->child_list_lock, obj->context, ++obj->value);
list_add_tail(&pt->child_list, &obj->child_list_head);
INIT_LIST_HEAD(&pt->active_list);
@@ -255,7 +262,7 @@ struct sync_fence *sync_fence_merge(const char *name,
fence_get(pt);
fence->cbs[a->num_fences + i].sync_pt = pt;
fence->cbs[a->num_fences + i].fence = fence;
- if (fence_add_callback(pt, &fence->cbs[i].cb, fence_check_cb_func))
+ if (fence_add_callback(pt, &fence->cbs[a->num_fences + i].cb, fence_check_cb_func))
atomic_dec(&fence->status);
}
@@ -325,6 +332,8 @@ int sync_fence_wait(struct sync_fence *fence, long timeout)
if (timeout < 0)
timeout = MAX_SCHEDULE_TIMEOUT;
+ else
+ timeout = msecs_to_jiffies(timeout);
trace_sync_wait(fence, 1);
for (i = 0; i < fence->num_fences; ++i)
@@ -383,6 +392,7 @@ static void android_fence_release(struct fence *fence)
if (parent->ops->free_pt)
parent->ops->free_pt(pt);
+ sync_timeline_put(parent);
kfree(pt);
}
diff --git a/drivers/staging/android/sync_debug.c b/drivers/staging/android/sync_debug.c
index 55ad34085f2f..2ef6496c7cd0 100644
--- a/drivers/staging/android/sync_debug.c
+++ b/drivers/staging/android/sync_debug.c
@@ -82,18 +82,18 @@ static const char *sync_status_str(int status)
static void sync_print_pt(struct seq_file *s, struct sync_pt *pt, bool fence)
{
- int status = 0;
+ int status = 1;
struct sync_timeline *parent = sync_pt_parent(pt);
- if (fence_is_signaled(&pt->base)) {
+
+ if (__fence_is_signaled(&pt->base))
status = pt->base.status;
- if (!status)
- status = 1;
- }
+
seq_printf(s, " %s%spt %s",
fence ? parent->name : "",
fence ? "_" : "",
sync_status_str(status));
- if (status) {
+
+ if (status <= 0) {
struct timeval tv = ktime_to_timeval(pt->base.timestamp);
seq_printf(s, "@%ld.%06ld", tv.tv_sec, tv.tv_usec);
}
diff --git a/include/linux/fence.h b/include/linux/fence.h
index 2beb3b0ff2a3..dd1639ff96c7 100644
--- a/include/linux/fence.h
+++ b/include/linux/fence.h
@@ -237,6 +237,20 @@ int fence_add_callback(struct fence *fence, struct fence_cb *cb,
bool fence_remove_callback(struct fence *fence, struct fence_cb *cb);
void fence_enable_sw_signaling(struct fence *fence);
+static inline bool
+__fence_is_signaled(struct fence *fence)
+{
+ if (test_bit(FENCE_FLAG_SIGNALED_BIT, &fence->flags))
+ return true;
+
+ if (fence->ops->signaled && fence->ops->signaled(fence)) {
+ __fence_signal(fence);
+ return true;
+ }
+
+ return false;
+}
+
/**
* fence_is_signaled - Return an indication if the fence is signaled yet.
* @fence: [in] the fence to check
More information about the dri-devel
mailing list