[Beignet] [PATCH] GBE: Merge successive load/store together for better performance.
Moore, Anthony W
anthony.w.moore at intel.com
Mon Aug 18 08:47:05 PDT 2014
Hi,
Regarding this patch, http://lists.freedesktop.org/archives/beignet/2014-May/002879.html: why is merging enabled only for DWORDs (and floats)? I tried adding 8-bit and 16-bit types and saw significant performance improvements in some of OpenCV's kernels. The check in question:
+ // we only support DWORD data type merge
+ if(!ty->isFloatTy() && !ty->isIntegerTy(32)) continue;
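For illustration, here is a minimal sketch of what relaxing that check to cover 8-bit and 16-bit integers could look like. It is not the exact change that was tested. `MockType` is a hypothetical stand-in for `llvm::Type` so that the snippet compiles on its own, and the two helper names are made up for this example. Whether the rest of the merge pass handles narrower types correctly is not addressed here.

```cpp
#include <cassert>

// Hypothetical stand-in for llvm::Type. It models only the two queries
// used by the check quoted above; it is NOT the real LLVM class.
struct MockType {
  bool isFloat;      // true for a 32-bit float
  unsigned intBits;  // integer width in bits, 0 if not an integer

  bool isFloatTy() const { return isFloat; }
  bool isIntegerTy(unsigned bits) const { return !isFloat && intBits == bits; }
};

// Rule as quoted from the patch: only float and 32-bit integers are merged.
bool mergeableDwordOnly(const MockType *ty) {
  return ty->isFloatTy() || ty->isIntegerTy(32);
}

// Relaxed rule (assumption): additionally accept 8-bit and 16-bit integers.
bool mergeableWithNarrowInts(const MockType *ty) {
  return ty->isFloatTy() || ty->isIntegerTy(32) ||
         ty->isIntegerTy(16) || ty->isIntegerTy(8);
}

// Small self-check showing where the two rules differ.
void checkExamples() {
  MockType i8{false, 8}, i16{false, 16}, i32{false, 32};
  MockType i64{false, 64}, f32{true, 0};

  // Both rules accept DWORD-sized types.
  assert(mergeableDwordOnly(&i32) && mergeableWithNarrowInts(&i32));
  assert(mergeableDwordOnly(&f32) && mergeableWithNarrowInts(&f32));

  // Only the relaxed rule accepts 8-bit and 16-bit integers.
  assert(!mergeableDwordOnly(&i8) && mergeableWithNarrowInts(&i8));
  assert(!mergeableDwordOnly(&i16) && mergeableWithNarrowInts(&i16));

  // Neither rule accepts 64-bit integers.
  assert(!mergeableDwordOnly(&i64) && !mergeableWithNarrowInts(&i64));
}
```

In the pass itself, the equivalent change would presumably be to add `!ty->isIntegerTy(16)` and `!ty->isIntegerTy(8)` to the condition that guards the `continue`.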
Thanks!
Tony