[Beignet] [PATCH v3 1/3] return 32 could gain 0.2% performance on opencv optical flow case.
xionghu.luo at intel.com
xionghu.luo at intel.com
Tue Sep 8 22:43:38 PDT 2015
From: Luo Xionghu <xionghu.luo at intel.com>
Signed-off-by: Luo Xionghu <xionghu.luo at intel.com>
---
src/cl_gt_device.h | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/src/cl_gt_device.h b/src/cl_gt_device.h
index bd87cc4..a51843d 100644
--- a/src/cl_gt_device.h
+++ b/src/cl_gt_device.h
@@ -39,7 +39,7 @@
.native_vector_width_float = 4,
.native_vector_width_double = 2,
.native_vector_width_half = 8,
-.preferred_wg_sz_mul = 16,
+.preferred_wg_sz_mul = 32,
.address_bits = 32,
.max_mem_alloc_size = 512 * 1024 * 1024,
.image_support = CL_TRUE,
--
1.9.1
More information about the Beignet
mailing list