[Beignet] [PATCH] GBE: Merge successive load/store together for better performance.

Moore, Anthony W anthony.w.moore at intel.com
Mon Aug 18 08:47:05 PDT 2014


Hi,

For this patch http://lists.freedesktop.org/archives/beignet/2014-May/002879.html, why are only DWORDs (and floats) enabled for merging? I tried adding 8-bit and 16-bit and saw some significant performance improvement with some of OpenCV's kernels.

+        // we only support DWORD data type merge
+        if(!ty->isFloatTy() && !ty->isIntegerTy(32)) continue;

Thanks!
Tony 


More information about the Beignet mailing list