[Beignet] [PATCH OpenCL 2.0] Backend: Update the workgroup instructions for llvm backend to gen.
He Junyan
Junyan.he at inbox.com
Sat May 2 18:55:30 PDT 2015
I have modified it in the new patch set.
Just ignore this one. Thanks.
On 2015年04月30日 13:49, Zhigang Gong wrote:
> Junyan,
>
> I haven't found any new response to address this comment from Rong and me.
> Do you miss this comment or do I miss your new patch?
>
> Thanks,
> Zhigang Gong.
>
> On Thu, Apr 02, 2015 at 12:53:30PM +0800, Zhigang Gong wrote:
>> Right, especially for those builtin function which don't care about the sign.
>> Junyan, could you refine your patch accordingly?
>> Thanks.
>>
>> On Tue, Mar 24, 2015 at 07:39:03AM +0000, Yang, Rong R wrote:
>>> Zhigang have add function OCLIntrinsicMap.find to handle override function name, only need one DECL_LLVM_GEN_FUNCTION for one group override functions, and in the GenWriter::emitCallInst to get the corresponding argument type. It reduce the DECL_LLVM_GEN_FUNCTION significant. Can you also use this method?
>>>
>>>> -----Original Message-----
>>>> From: Beignet [mailto:beignet-bounces at lists.freedesktop.org] On Behalf Of
>>>> junyan.he at inbox.com
>>>> Sent: Tuesday, March 24, 2015 14:40
>>>> To: beignet at lists.freedesktop.org
>>>> Cc: Junyan He
>>>> Subject: [Beignet] [PATCH OpenCL 2.0] Backend: Update the workgroup
>>>> instructions for llvm backend to gen.
>>>>
>>>> From: Junyan He <junyan.he at linux.intel.com>
>>>>
>>>> Signed-off-by: Junyan He <junyan.he at linux.intel.com>
>>>> ---
>>>> backend/src/llvm/llvm_gen_ocl_function.hxx | 87
>>>> ++++++++++++++++++++++++++++
>>>> 1 file changed, 87 insertions(+)
>>>>
>>>> diff --git a/backend/src/llvm/llvm_gen_ocl_function.hxx
>>>> b/backend/src/llvm/llvm_gen_ocl_function.hxx
>>>> index 9536a3c..947fadc 100644
>>>> --- a/backend/src/llvm/llvm_gen_ocl_function.hxx
>>>> +++ b/backend/src/llvm/llvm_gen_ocl_function.hxx
>>>> @@ -160,3 +160,90 @@ DECL_LLVM_GEN_FUNCTION(REGION,
>>>> __gen_ocl_region)
>>>>
>>>> // printf function
>>>> DECL_LLVM_GEN_FUNCTION(PRINTF, __gen_ocl_printf)
>>>> +
>>>> +// work group function
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_INT_1D,
>>>> +_Z30__gen_ocl_work_group_broadcastij)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_INT_2D,
>>>> +_Z30__gen_ocl_work_group_broadcastijj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_INT_3D,
>>>> +_Z30__gen_ocl_work_group_broadcastijjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_UINT_1D,
>>>> +_Z30__gen_ocl_work_group_broadcastjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_UINT_2D,
>>>> +_Z30__gen_ocl_work_group_broadcastjjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_UINT_3D,
>>>> +_Z30__gen_ocl_work_group_broadcastjjjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_LONG_1D,
>>>> +_Z30__gen_ocl_work_group_broadcastlj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_LONG_2D,
>>>> +_Z30__gen_ocl_work_group_broadcastljj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_lONG_3D,
>>>> +_Z30__gen_ocl_work_group_broadcastljjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_ULONG_1D,
>>>> +_Z30__gen_ocl_work_group_broadcastmj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_ULONG_2D,
>>>> +_Z30__gen_ocl_work_group_broadcastmjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_ULONG_3D,
>>>> +_Z30__gen_ocl_work_group_broadcastmjjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_FLOAT_1D,
>>>> +_Z30__gen_ocl_work_group_broadcastfj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_FLOAT_2D,
>>>> +_Z30__gen_ocl_work_group_broadcastfjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_FLOAT_3D,
>>>> +_Z30__gen_ocl_work_group_broadcastfjjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_DOUBLE_1D,
>>>> +_Z30__gen_ocl_work_group_broadcastdj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_DOUBLE_2D,
>>>> +_Z30__gen_ocl_work_group_broadcastdjj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_BROADCAST_DOUBLE_3D,
>>>> +_Z30__gen_ocl_work_group_broadcastdjjj)
>>>> +
>>>> +// work group reduce
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_ADD_DOUBLE,
>>>> +_Z31__gen_ocl_work_group_reduce_addd)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_ADD_FLOAT,
>>>> +_Z31__gen_ocl_work_group_reduce_addf)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_ADD_INT,
>>>> +_Z31__gen_ocl_work_group_reduce_addi)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_ADD_UINT,
>>>> +_Z31__gen_ocl_work_group_reduce_addj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_ADD_LONG,
>>>> +_Z31__gen_ocl_work_group_reduce_addl)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_ADD_ULONG,
>>>> +_Z31__gen_ocl_work_group_reduce_addm)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MAXD,
>>>> +_Z31__gen_ocl_work_group_reduce_maxd)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MAXF,
>>>> +_Z31__gen_ocl_work_group_reduce_maxf)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MAXI,
>>>> +_Z31__gen_ocl_work_group_reduce_maxi)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MAXJ,
>>>> +_Z31__gen_ocl_work_group_reduce_maxj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MAXL,
>>>> +_Z31__gen_ocl_work_group_reduce_maxl)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MAXM,
>>>> +_Z31__gen_ocl_work_group_reduce_maxm)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MIND,
>>>> +_Z31__gen_ocl_work_group_reduce_mind)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MINF,
>>>> +_Z31__gen_ocl_work_group_reduce_minf)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MINI,
>>>> +_Z31__gen_ocl_work_group_reduce_mini)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MINJ,
>>>> +_Z31__gen_ocl_work_group_reduce_minj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MINL,
>>>> +_Z31__gen_ocl_work_group_reduce_minl)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_REDUCE_MINM,
>>>> +_Z31__gen_ocl_work_group_reduce_minm)
>>>> +
>>>> +// work group scan_exclusive
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_ADD_DO
>>>> UBLE,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_addd)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_ADD_FLO
>>>> AT,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_addf)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_ADD_INT,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_addi)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_ADD_UIN
>>>> T,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_addj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_ADD_LO
>>>> NG,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_addl)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_ADD_UL
>>>> ONG,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_addm)
>>>> +
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MAX_DO
>>>> UBLE,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_maxd)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MAX_FL
>>>> OAT,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_maxf)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MAX_INT,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_maxi)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MAX_UI
>>>> NT,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_maxj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MAX_LO
>>>> NG,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_maxl)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MAXM_
>>>> ULONG,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_maxm)
>>>> +
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MIN_DO
>>>> UBLE,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_mind)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MIN_FLO
>>>> AT,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_minf)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MIN_INT,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_mini)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MIN_UIN
>>>> T,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_minj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MIN_LO
>>>> NG,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_minl)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_EXCLUSIVE_MIN_UL
>>>> ONG,
>>>> +_Z39__gen_ocl_work_group_scan_exclusive_minm)
>>>> +
>>>> +// work group scan_inclusive
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_ADD_DO
>>>> UBLE,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_addd)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_ADD_FLO
>>>> AT,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_addf)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_ADD_INT,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_addi)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_ADD_UIN
>>>> T,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_addj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_ADD_LO
>>>> NG,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_addl)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_ADD_UL
>>>> ONG,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_addm)
>>>> +
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MAX_DO
>>>> UBLE,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_maxd)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MAX_FL
>>>> OAT,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_maxf)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MAX_INT,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_maxi)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MAX_UI
>>>> NT,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_maxj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MAX_LO
>>>> NG,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_maxl)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MAX_UL
>>>> ONG,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_maxm)
>>>> +
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MIN_DO
>>>> UBLE,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_mind)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MIN_FLO
>>>> AT,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_minf)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MIN_INT,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_mini)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MIN_UIN
>>>> T,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_minj)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MIN_LO
>>>> NG,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_minl)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_SCAN_INCLUSIVE_MIN_UL
>>>> ONG,
>>>> +_Z39__gen_ocl_work_group_scan_inclusive_minm)
>>>> +
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_ALL,
>>>> __gen_ocl_work_group_all)
>>>> +DECL_LLVM_GEN_FUNCTION(WORK_GROUP_ANY,
>>>> __gen_ocl_work_group_any)
>>>> --
>>>> 1.7.9.5
>>>>
>>>> _______________________________________________
>>>> Beignet mailing list
>>>> Beignet at lists.freedesktop.org
>>>> http://lists.freedesktop.org/mailman/listinfo/beignet
>>> _______________________________________________
>>> Beignet mailing list
>>> Beignet at lists.freedesktop.org
>>> http://lists.freedesktop.org/mailman/listinfo/beignet
>> _______________________________________________
>> Beignet mailing list
>> Beignet at lists.freedesktop.org
>> http://lists.freedesktop.org/mailman/listinfo/beignet
More information about the Beignet
mailing list