[Beignet] [PATCH v5] Add test cases generator.
Sun, Yi
yi.sun at intel.com
Sun Dec 29 22:15:16 PST 2013
I got it. That is because "make clean" deletes all the files that utest_generator generated when 'cmake' was run.
So "make clean && cmake . && make -jx" works fine.
Shouldn't we delete those files in 'make clean'?
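For anyone checking a build tree by hand, here is a minimal sketch of how one might confirm that the generated sources are missing after 'make clean'. It only assumes the generated/<name>.cpp layout that the generator prints for cmake; the helper name missing_generated_sources and the example file names are hypothetical and not part of the patch.

```python
import os
import tempfile


def missing_generated_sources(source_dir, names):
    """Return the entries of names that are not regular files under source_dir."""
    return [n for n in names
            if not os.path.isfile(os.path.join(source_dir, n))]


def demo():
    # Hypothetical file names, following the generated/builtin_<func>_<type>.cpp pattern.
    expected = ["generated/builtin_acos_float.cpp",
                "generated/builtin_acos_float2.cpp"]
    with tempfile.TemporaryDirectory() as utests_dir:
        # State right after 'make clean': the generated/ folder is gone.
        before = missing_generated_sources(utests_dir, expected)
        # Simulate what re-running 'cmake .' does: recreate the folder and files.
        os.makedirs(os.path.join(utests_dir, "generated"))
        for name in expected:
            open(os.path.join(utests_dir, name), "w").close()
        after = missing_generated_sources(utests_dir, expected)
    return before, after


if __name__ == "__main__":
    before, after = demo()
    print("missing before cmake:", before)
    print("missing after cmake:", after)
```

If the first list is non-empty, re-running 'cmake .' before 'make -jx' should bring the files back.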
Thanks
--Sun, Yi
> -----Original Message-----
> From: Zou, Nanhai
> Sent: Monday, December 30, 2013 1:46 PM
> To: Sun, Yi; Zhigang Gong
> Cc: Song, Ruiling; beignet at lists.freedesktop.org; Shui, YangweiX
> Subject: RE: [Beignet] [PATCH v5] Add test cases generator.
>
>
> First make without -jx
> then make clean && make -jx
>
> You will see the failure.
>
> Thanks
> Zou Nanhai
>
> ________________________________________
> From: Sun, Yi
> Sent: Friday, December 27, 2013 2:51 PM
> To: Zou, Nanhai; Zhigang Gong
> Cc: Song, Ruiling; beignet at lists.freedesktop.org; Shui, YangweiX
> Subject: RE: [Beignet] [PATCH v5] Add test cases generator.
>
> Can anyone else reproduce this issue?
> I have tried it on 3 machines, and all of the following commands pass:
> make -j(2, 4, 8, 16, 32)
>
>
> Thanks
> --Sun, Yi
>
> > -----Original Message-----
> > From: Zou, Nanhai
> > Sent: Friday, December 27, 2013 2:38 PM
> > To: Zhigang Gong; Sun, Yi
> > Cc: Song, Ruiling; beignet at lists.freedesktop.org; Shui, YangweiX
> > Subject: RE: [Beignet] [PATCH v5] Add test cases generator.
> >
> > Hi Yi,
> > It seems that this patch breaks make -jx
> >
> > Could you take a look?
> >
> > Thanks
> > Zou Nanhai
> >
> > -----Original Message-----
> > From: beignet-bounces at lists.freedesktop.org
> > [mailto:beignet-bounces at lists.freedesktop.org] On Behalf Of Zhigang Gong
> > Sent: Wednesday, December 25, 2013 2:01 PM
> > To: Sun, Yi
> > Cc: Song, Ruiling; beignet at lists.freedesktop.org; Shui, YangweiX
> > Subject: Re: [Beignet] [PATCH v5] Add test cases generator.
> >
> > Pushed, thanks.
> >
> > On Tue, Dec 24, 2013 at 11:15:18AM +0800, Yi Sun wrote:
> > > v1:
> > > File utest_generator.py contains the base class and functions for generating test cases.
> > > File utest_math_gen.py can generate tests for most math functions for all the gentypes.
> > > utest_math_gen.py can be run during cmake.
> > >
> > > v2:
> > > 1. Put all the generated unit test cases in the folder utests/generated.
> > > 2. Delete the generated folder when make clean is invoked.
> > > 3. Add some comments at the top of the generated test cases.
> > > 4. Instead of defining FLT_ULP (0.000001) as the ulp unit, calculate the float ulp before using it.
> > > 5. Add test cases for several math functions.
> > >
> > > v3:
> > > 1. Refine the calculation for float, and calculate it for each float obtained from the cpu function.
> > >
> > > v4:
> > > Refine the calculation for float.
> > >
> > > The test cases for the following functions fail with input 0, 1 or 3.14:
> > > builtin_atan2_float
> > > builtin_atanh_float
> > > builtin_rootn_float
> > > builtin_cos_float
> > > builtin_cospi_float
> > > builtin_erf_float
> > > builtin_erfc_float
> > > builtin_mad_float
> > > builtin_nextafter_float
> > > builtin_pown_float
> > > builtin_powr_float
> > > builtin_rint_float
> > > builtin_sinpi_float
> > > builtin_tan_float
> > > builtin_tanpi_float
> > >
> > > v5:
> > > remove case builtin_mad_float
> > >
> > > todo:
> > > atan2pi
> > > fmax
> > > fmin
> > > sincos
> > >
> > > Signed-off-by: Yi Sun <yi.sun at intel.com>
> > > Signed-off-by: Yangwei Shui <yangweix.shui at intel.com>
> > > ---
> > > utests/CMakeLists.txt | 9 +
> > > utests/utest_generator.py | 374 +++++++++++++++++++++++++++++++
> > > utests/utest_helper.cpp | 30 +++
> > > utests/utest_helper.hpp | 6 +
> > > utests/utest_math_gen.py | 531 +++++++++++++++++++++++++++++++++++++++++++++
> > > 5 files changed, 950 insertions(+), 0 deletions(-)
> > > create mode 100644 utests/utest_generator.py
> > > create mode 100755 utests/utest_math_gen.py
> > >
> > > diff --git a/utests/CMakeLists.txt b/utests/CMakeLists.txt
> > > index 5e0bc19..836c80d 100644
> > > --- a/utests/CMakeLists.txt
> > > +++ b/utests/CMakeLists.txt
> > > @@ -1,10 +1,19 @@
> > > INCLUDE_DIRECTORIES(${CMAKE_CURRENT_SOURCE_DIR}
> > > ${CMAKE_CURRENT_SOURCE_DIR}/../include)
> > >
> > > +EXEC_PROGRAM(mkdir ${CMAKE_CURRENT_SOURCE_DIR} ARGS generated -p)
> > > +##### Math Function Part:
> > > +EXEC_PROGRAM(python ${CMAKE_CURRENT_SOURCE_DIR} ARGS utest_math_gen.py OUTPUT_VARIABLE GEN_MATH_STRING)
> > > +string(REGEX REPLACE " " ";" ADDMATHFUNC ${GEN_MATH_STRING})
> > > +string(REGEX REPLACE " " "\n" NAMEMATHLIST ${GEN_MATH_STRING})
> > > +MESSAGE(STATUS "Generated Builtin Math Functions:\n" ${NAMEMATHLIST})
> > > +set_directory_properties(PROPERTIES ADDITIONAL_MAKE_CLEAN_FILES generated/)
> > > +
> > > link_directories (${LLVM_LIBRARY_DIR})
> > > set (utests_sources
> > > cl_create_kernel.cpp
> > > utest_error.c
> > > + ${ADDMATHFUNC}
> > > compiler_basic_arithmetic.cpp
> > > compiler_displacement_map_element.cpp
> > > compiler_shader_toy.cpp
> > > diff --git a/utests/utest_generator.py b/utests/utest_generator.py
> > > new file mode 100644
> > > index 0000000..626ac96
> > > --- /dev/null
> > > +++ b/utests/utest_generator.py
> > > @@ -0,0 +1,374 @@
> > > +#!/usr/bin/python
> > > +import os,sys,re
> > > +
> > > +FLT_MAX_POSI='0x1.fffffep127f'
> > > +FLT_MIN_NEGA='-0x1.fffffep127f'
> > > +FLT_MIN_POSI='0x1.0p-126f'
> > > +FLT_MAX_NEGA='-0x1.0p-126f'
> > > +
> > > +paraTypeList={'float':'%.20f','int':'%d','double':'%lf','uint':'%d','string':'%s'}
> > > +
> > > +
> > > +def ulpUnit(ulpSize):
> > > + return re.findall(r"([a-zA-Z_]+)",ulpSize)[0]
> > > +
> > > +def ulpNum(ulpSize):
> > > + return re.findall(r"([0-9]+)",ulpSize)[0]
> > > +
> > > +def udebug(ulpSize,returnType):
> > > + #ulpUnit=re.findall(r"([a-zA-Z_]+)",ulpSize)[0]
> > > + #ulpNum=re.findall(r"([0-9]+)",ulpSize)[0]
> > > + text='''
> > > + static const char* INFORNAN;
> > > + static %s ULPSIZE;
> > > +
> > > + if (isinf(cpu_data[index])){
> > > + INFORNAN="INF";
> > > + }
> > > + else if (isnan(cpu_data[index])){
> > > + INFORNAN="NAN";
> > > + }
> > > + else{
> > > + ULPSIZE=cl_%s(cpu_data[index]) * %s;
> > > + }
> > > +
> > > +#if udebug
> > > + if (isinf(cpu_data[index])){
> > > + if (isinf(gpu_data[index]))
> > > + printf("%s expect:%s\\n", log, INFORNAN);
> > > + else
> > > + printf_c("%s expect:%s\\n", log, INFORNAN);
> > > + }
> > > + else if (isnan(cpu_data[index])){
> > > + if (isnan(gpu_data[index]))
> > > + printf("%s expect:%s\\n", log, INFORNAN);
> > > + else
> > > + printf_c("%s expect:%s\\n", log, INFORNAN);
> > > + }
> > > + else if (diff <= ULPSIZE){
> > > + printf("%s expect:%s\\n", log, ULPSIZE);
> > > + }
> > > + else
> > > + printf_c("%s expect:%s\\n", log, ULPSIZE);
> > > +#else
> > > + if (isinf(cpu_data[index])){
> > > + sprintf(log, "%s expect:%s\\n", log, INFORNAN);
> > > + OCL_ASSERTM(isinf(gpu_data[index]),log);
> > > + }
> > > + else if (isnan(cpu_data[index])){
> > > + sprintf(log, "%s expect:%s\\n", log, INFORNAN);
> > > + OCL_ASSERTM(isnan(gpu_data[index]),log);
> > > + }
> > > + else{
> > > + sprintf(log, "%s expect:%s\\n", log, ULPSIZE);
> > > + OCL_ASSERTM(fabs(gpu_data[index]-cpu_data[index]) <= ULPSIZE, log);
> > > + }
> > > +#endif
> > > + }
> > > +}\n'''%(returnType,\
> > > + ulpUnit(ulpSize),ulpNum(ulpSize),\
> > > + paraTypeList['string'],paraTypeList['string'],\
> > > + paraTypeList['string'],paraTypeList['string'],\
> > > + paraTypeList['string'],paraTypeList['string'],\
> > > + paraTypeList['string'],paraTypeList['string'],\
> > > + paraTypeList['string'],paraTypeList['%s'%(returnType)],\
> > > + paraTypeList['string'],paraTypeList['%s'%(returnType)],\
> > > + paraTypeList['string'],paraTypeList['string'],\
> > > + paraTypeList['string'],paraTypeList['string'],\
> > > + paraTypeList['string'],paraTypeList['%s'%(returnType)])
> > > +
> > > + return text
> > > +
> > > +def gene2ValuesLoop(values1,values2,inputValues):
> > > + values2=values2+inputValues*len(inputValues)
> > > +
> > > + for i in inputValues:
> > > + for j in range(0,len(inputValues)):
> > > + values1 += [i]
> > > +
> > > + return values1,values2
> > > +
> > > +def gene3ValuesLoop(values1,values2,values3,inputValues):
> > > + for i in inputValues:
> > > + for j in range(0,len(inputValues)):
> > > + for k in range(0,len(inputValues)):
> > > + values1 += [i]
> > > +
> > > + for i in inputValues:
> > > + for j in inputValues:
> > > + for k in range(0,len(inputValues)):
> > > + values2 += [j]
> > > +
> > > + values3=inputValues*(len(inputValues)**2)
> > > + return values1,values2,values3
> > > +
> > > +class func:
> > > + """ This class will define all needed instance attribute in fundation a c programing file. """
> > > +
> > > + def __init__(self,name,cpuFuncName,inputType,outputType,values,ulp, cpu_func=''):
> > > + self.funcName = name
> > > + self.cpuFuncName = cpuFuncName
> > > + self.fileName = 'builtin_'+name
> > > + self.inputtype = inputType
> > > + self.outputtype = outputType
> > > + self.values = values
> > > + self.ulp = ulp
> > > + self.cpufunc=cpu_func
> > > + self.cpplines = []
> > > +
> > > +#####cpp file required information:
> > > + self.Head='''/*
> > > +This file is generated by utest_generator.py.
> > > +Usually you need NOT modify this file manually.
> > > +But when any bug occured, you can change the value of udebug from 0 to 1, which can print more values and information to assist debuging the issue.
> > > +*/
> > > +
> > > +#include "utest_helper.hpp"
> > > +#include <stdio.h>
> > > +#include <math.h>
> > > +#include <algorithm>
> > > +
> > > +#define udebug 0
> > > +#define FLT_MAX 0x1.fffffep127f
> > > +#define FLT_MIN 0x1.0p-126f
> > > +#define INT_ULP 0
> > > +
> > > +#define printf_c(...) \\
> > > +{\\
> > > + printf("\\033[1m\\033[40;31m");\\
> > > + printf( __VA_ARGS__ );\\
> > > + printf("\\033[0m");\\
> > > +}
> > > +'''
> > > + #########Execute class itself
> > > + self.geneProcess()
> > > +
> > > +#####Computer vector && argument type:
> > > + def argtype(self,paraN,index):
> > > + return re.findall(r"[a-zA-Z_]+",self.inputtype[paraN][index])[0]
> > > +
> > > + def argvector(self,paraN,index):
> > > + vector=re.findall(r"[0-9]+",self.inputtype[paraN][index])
> > > + if vector:
> > > + vector=vector[0]
> > > + else:
> > > + vector=1
> > > + return vector
> > > +
> > > + def returnVector(self,index):
> > > + returnVector=re.findall(r"[0-9]+",self.outputtype[index])
> > > + if returnVector:
> > > + returnVector=returnVector[0]
> > > + else:
> > > + returnVector=1
> > > + return returnVector
> > > +
> > > + def retType(self,index):
> > > + return re.findall("[a-zA-Z_]+",self.outputtype[index])[0]
> > > +
> > > + def inputNumFormat(self,paraN,index):
> > > + return paraTypeList['%s'%(self.argtype(paraN,index))]
> > > +
> > > + def outputNumFormat(self,index):
> > > + return paraTypeList['%s'%(self.retType(index))]
> > > +
> > > +#####Cpu values analyse
> > > + def GenInputValues(self,index):
> > > + #namesuffix=self.inputtype[0][index]
> > > + for i in range(0,self.values.__len__()):
> > > + self.cpplines += [ "const %s input_data%d[] = {%s};" %(self.argtype(i,index),i+1,str(self.values[i]).strip('[]').replace('\'','')) ]
> > > + self.cpplines += [ "const int count_input = sizeof(input_data1) / sizeof(input_data1[0]);" ]
> > > + self.cpplines += [ "const int vector = %s;\n"%(self.argvector(self.inputtype.__len__()-1,index)) ]
> > > +
> > > +#####Cpu Function
> > > + def GenCpuCompilerMath(self,index):
> > > + #namesuffix=self.inputtype[0][index]
> > > + defline='static void cpu_compiler_math(%s *dst, '%(self.retType(index))
> > > + cpufunargs='('
> > > + funcline = ['{']
> > > + vectorargs=[]
> > > +
> > > + if (self.returnVector(index) == 1 and self.argvector(0,index) != 1):
> > > + for i in range(0,self.values.__len__()):
> > > + defline += 'const %s *src%d'%(self.argtype(i,index),i+1)
> > > + defline += ( i == self.values.__len__()-1 ) and ')' or ','
> > > + vectorargs.append('(')
> > > + for i in range(0,self.values.__len__()):
> > > + for j in range(0,self.vector):
> > > + vectorargs += "x%d%d"%(i+1,j+1)
> > > + vectorargs += ( j == self.vector-1 ) and ');' or ','
> > > + funcline += [" const %s x%d%d = *(src%d+%d);"%(self.argtype(i,index),i+1,j+1,i+1,j)]
> > > +
> > > + return 0
> > > +
> > > + for i in range(0,self.values.__len__()):
> > > + defline += 'const %s *src%d'%(self.argtype(i,index),i+1)
> > > + defline += ( i == self.values.__len__()-1 ) and ')' or ','
> > > + cpufunargs += "x%d"%(i+1)
> > > + cpufunargs += ( i == self.values.__len__()-1 ) and ');' or ','
> > > + funcline += [" const %s x%d = *src%d;"%(self.argtype(i,index),i+1,i+1)]
> > > +
> > > + funcline += [ " dst[0] = %s%s"%(self.cpuFuncName, cpufunargs) ]
> > > + funcline += [ '}']
> > > +
> > > + funcline = [defline] + funcline
> > > +
> > > + self.cpplines += funcline
> > > +# self.writeCPP( '\n'.join(funcline), 'a', namesuffix)
> > > +
> > > + def writeCPP(self,content,authority,namesuffix):
> > > + file_object = open("generated/%s_%s.cpp"%(self.fileName,namesuffix),authority)
> > > + file_object.writelines(content)
> > > + file_object.close()
> > > +
> > > + def writeCL(self,content,authority,namesuffix):
> > > + file_object = open(os.getcwd()+"/../kernels/%s_%s.cl"%(self.fileName,namesuffix),authority)
> > > + file_object.writelines(content)
> > > + file_object.close()
> > > +
> > > + def nameForCmake(self,content,namesuffix):
> > > + print("generated/%s_%s.cpp"%(self.fileName,namesuffix)),
> > > +
> > > + def utestFunc(self,index):
> > > + funcLines=[]
> > > + namesuffix=self.inputtype[0][index]
> > > + funcline=[]
> > > + funchead='''
> > > +static void %s_%s(void)
> > > +{
> > > + int index;
> > > + %s gpu_data[count_input] = {0}, cpu_data[count_input] = {0}, diff=0.0;
> > > + char log[1024] = {0};
> > > +
> > > + OCL_CREATE_KERNEL(\"%s_%s\");
> > > + OCL_CREATE_BUFFER(buf[0], CL_MEM_READ_WRITE, count_input * sizeof(%s), NULL);
> > > +
> > > + globals[0] = count_input;
> > > + locals[0] = 1;
> > > + '''%(self.fileName,namesuffix,\
> > > + self.retType(index),\
> > > + self.fileName, namesuffix,\
> > > + self.retType(index))
> > > +
> > > + funcline += [funchead]
> > > + for i in range(1,self.values.__len__()+1):
> > > + funcline += [" OCL_CREATE_BUFFER(buf[%d], CL_MEM_READ_WRITE, count_input * sizeof(%s), NULL);"%(i,self.argtype(i-1,index))]
> > > + funcline += [" clEnqueueWriteBuffer( queue, buf[%d], CL_TRUE, 0, count_input * sizeof(%s), input_data%d, 0, NULL, NULL);"%(i,self.argtype(i-1,index),i)]
> > > +
> > > + funcline += [" OCL_CREATE_BUFFER(buf[%d], CL_MEM_READ_WRITE, sizeof(int), NULL);"%(self.inputtype.__len__()+1)]
> > > + funcline += [" clEnqueueWriteBuffer( queue, buf[%d], CL_TRUE, 0, sizeof(int), &vector, 0, NULL, NULL);"%(self.inputtype.__len__()+1)]
> > > +
> > > + #0=output 1=input1 2=input2 ... len+2=output
> > > + for i in range(0,self.values.__len__()+2):
> > > + funcline += [" OCL_SET_ARG(%d, sizeof(cl_mem), &buf[%d]);"%(i,i)]
> > > +
> > > + funcrun='''
> > > + // Run the kernel:
> > > + OCL_NDRANGE( 1 );
> > > + clEnqueueReadBuffer( queue, buf[0], CL_TRUE, 0, sizeof(%s) * count_input, gpu_data, 0, NULL, NULL);
> > > +'''%(self.inputtype.__len__()+1)
> > > + funcline += [ funcrun ]
> > > +
> > > + funcsprintfa=' sprintf(log, \"'
> > > + funcsprintfb=''
> > > + if (self.returnVector(index) == 1 and self.argvector(0,index) != 1):
> > > + funccompare='''
> > > + for (index = 0; index < count_input/vector; index++) {
> > > + cpu_compiler_math( cpu_data + index, '''
> > > + else:
> > > + funccompare='''
> > > + for (index = 0; index < count_input; index++) {
> > > + cpu_compiler_math( cpu_data + index,'''
> > > +
> > > + for i in range(0,self.values.__len__()):
> > > + funccompare += " input_data%d + index"%(i+1)
> > > + funccompare += (self.values.__len__() - 1 == i) and ');' or ','
> > > +
> > > + funcsprintfa += "input_data%d:"%(i+1)
> > > + funcsprintfa += "%s "%(self.inputNumFormat(i,index))
> > > + funcsprintfb += " input_data%d[index],"%(i+1)
> > > +
> > > + funcline += [ funccompare ]
> > > +
> > > + funcsprintfa += " -> gpu:%s cpu:%s diff:%s\","%(self.outputNumFormat(index),self.outputNumFormat(index),self.outputNumFormat(index))#,self.outputNumFormat(index))
> > > + funcsprintfb += " gpu_data[index], cpu_data[index], diff);"#%(ulpUnit(self.ulp),ulpNum(self.ulp))
> > > +
> > > + #funcdiff = " diff = fabs((gpu_data[index]-cpu_data[index])"
> > > + #funcdiff += (self.retType(index) == "int") and ');' or '/(cpu_data[index]>1?cpu_data[index]:1));'
> > > + funcdiff = " diff = fabs((gpu_data[index]-cpu_data[index]));"
> > > + funcline += [ funcdiff ]
> > > + funcline += [ funcsprintfa + funcsprintfb ]
> > > +
> > > + self.cpplines += funcline
> > > +
> > > + self.cpplines += [ udebug(self.ulp,self.retType(index)) ]
> > > + self.cpplines += [ "MAKE_UTEST_FROM_FUNCTION(%s_%s)"%(self.fileName,namesuffix) ]
> > > +
> > > + def genCL(self,index):
> > > + namesuffix=self.inputtype[0][index]
> > > + clLine = []
> > > + clhead = '__kernel void %s_%s(__global %s *dst, '%(self.fileName,namesuffix,self.retType(index))
> > > + clvalueDef=''
> > > + clcomputer=''
> > > + tmp=''
> > > +
> > > + for i in range(0,self.values.__len__()):
> > > + clhead += ' __global %s *src%d,'%(self.argtype(i,index),i+1)
> > > + clvalueDef += ' %s x%d = (%s) ('%(self.inputtype[i][index],i+1,self.inputtype[i][index])
> > > + tmp = 'src%d[i * (*vector) + '%(i+1)
> > > + for j in range(0,int(self.argvector(i,index))):
> > > + clvalueDef += tmp + ((int(self.argvector(i-1,index)) == j+1 ) and '%d]);\n'%(j) or '%d],'%(j))
> > > + clcomputer += (self.values.__len__() == i+1) and 'x%d);'%(i+1) or 'x%d,'%(i+1)
> > > +
> > > + clhead += ' __global int *vector) {\n'
> > > + clhead += ' int i = get_global_id(0);'
> > > + clLine += [ clhead ]
> > > + clLine += [ clvalueDef ]
> > > + clLine += [ ' %s ret;'%(self.outputtype[index]) ]
> > > + clLine += [ ' ret = %s('%(self.funcName) + clcomputer ]
> > > +
> > > + if (int(self.returnVector(index)) == 1):
> > > + clLine += [ ' dst[i] = ret;' ]
> > > + else:
> > > + for i in range(0,int(self.returnVector(index))):
> > > + clLine += [ ' dst[i * (*vector) + %d] = ret[%d];'%(i,i) ]
> > > + clLine += [ '};' ]
> > > +
> > > + self.writeCL('\n'.join(clLine),'w',namesuffix)
> > > +
> > > + def geneProcess(self):
> > > + for i in range(0,self.inputtype[0].__len__()):
> > > +##########Write Cpp file
> > > + namesuffix=self.inputtype[0][i]
> > > + self.cpplines = []
> > > + #The head:
> > > + self.cpplines += [self.Head]
> > > +
> > > + #Parameters:
> > > + self.GenInputValues(i)
> > > +
> > > + #cpu function generator:
> > > + self.cpplines += [self.cpufunc]
> > > +
> > > + #Cpu function:
> > > + self.GenCpuCompilerMath(i)
> > > +
> > > + #utest function
> > > + self.utestFunc(i)
> > > +
> > > + #kernel cl
> > > + self.genCL(i)
> > > +
> > > + #CMakelists.txt
> > > + self.nameForCmake(self.fileName,namesuffix)
> > > +
> > > + self.writeCPP( '\n'.join(self.cpplines) ,'w',namesuffix)
> > > +#########End
> > > +
> > > +#def main():
> > > +#
> > > +#if __name__ == "__main__":
> > > +# main()
> > > diff --git a/utests/utest_helper.cpp b/utests/utest_helper.cpp
> > > index 65af727..8b9ae5f 100644
> > > --- a/utests/utest_helper.cpp
> > > +++ b/utests/utest_helper.cpp
> > > @@ -641,3 +641,33 @@ int cl_check_image(const int *img, int w, int h, const char *bmp)
> > > return (float(discrepancy) / float(n) > max_error_ratio) ? 0 : 1;
> > > }
> > >
> > > +typedef struct
> > > +{
> > > + unsigned int mantissa:23;
> > > + unsigned int exponent:8;
> > > + unsigned int sign:1;
> > > +} FLOAT;
> > > +
> > > +typedef union
> > > +{
> > > + float f;
> > > + unsigned int i;
> > > + FLOAT spliter;
> > > +} SF;
> > > +
> > > +const float cl_FLT_ULP(float float_number)
> > > +{
> > > + SF floatBin, ulpBin;
> > > + floatBin.f = float_number;
> > > +
> > > + ulpBin.spliter.sign = floatBin.spliter.sign;
> > > + ulpBin.spliter.exponent = floatBin.spliter.exponent;
> > > + ulpBin.spliter.mantissa = 0x1;
> > > +
> > > + return ulpBin.f;
> > > +}
> > > +
> > > +const int cl_INT_ULP(int int_number)
> > > +{
> > > + return 0;
> > > +}
> > > diff --git a/utests/utest_helper.hpp b/utests/utest_helper.hpp
> > > index 29a21d5..79e7417 100644
> > > --- a/utests/utest_helper.hpp
> > > +++ b/utests/utest_helper.hpp
> > > @@ -216,5 +216,11 @@ extern void cl_write_bmp(const int *data, int width, int height, const char *fil
> > > /* Check data from img against bmp file located at "bmp" */
> > > extern int cl_check_image(const int *img, int w, int h, const char *bmp);
> > >
> > > +/* Calculator ULP of each FLOAT value */
> > > +extern const float cl_FLT_ULP(float float_number);
> > > +
> > > +/* Calculator ULP of each INT value */
> > > +extern const int cl_INT_ULP(int int_number);
> > > +
> > > +
> > > #endif /* __UTEST_HELPER_HPP__ */
> > >
> > > diff --git a/utests/utest_math_gen.py b/utests/utest_math_gen.py
> > > new file mode 100755
> > > index 0000000..7a4b678
> > > --- /dev/null
> > > +++ b/utests/utest_math_gen.py
> > > @@ -0,0 +1,531 @@
> > > +#!/usr/bin/python
> > > +from utest_generator import *
> > > +import os,sys
> > > +
> > > +#base_input_values = [80, -80, 3.14, -3.14, -0.5, 0.5, 1, -1, 0.0,6,-6,1500.24,-1500.24]
> > > +#extend_input_values = [FLT_MAX_POSI,FLT_MIN_NEGA,FLT_MIN_POSI,FLT_MAX_NEGA,80, -80, 3.14, -3.14, -0.5, 0.5, 1, -1, 0.0,6,-6,1500.24,-1500.24]
> > > +
> > > +#func:
> > > +# gpufuncName
> > > +# cpuFuncName
> > > +# fileName: 'builtin_'+name
> > > +# inputtype: a 2-D list because there're more than one input data
> > > +# outputtype: a list
> > > +# values
> > > +# ulp
> > > +
> > > +base_input_values = [ 0, 1, 3.14]
> > > +def main():
> > > + ##### gentype acos(gentype)
> > > + acos_input_values = base_input_values
> > > + acos_input_type = ['float','float2','float4','float8','float16']
> > > + acos_output_type = ['float','float2','float4','float8','float16']
> > > + acosUtests = func('acos','acos',[acos_input_type],acos_output_type,[acos_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype acosh(gentype)
> > > + acosh_input_values = base_input_values
> > > + acosh_input_type = ['float','float2','float4','float8','float16']
> > > + acosh_output_type = ['float','float2','float4','float8','float16']
> > > + acoshUtests = func('acosh','acosh',[acosh_input_type],acosh_output_type,[acosh_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype acospi(gentype x)
> > > + acospi_input_values = base_input_values
> > > + acospi_input_type = ['float','float2','float4','float8','float16']
> > > + acospi_output_type = ['float','float2','float4','float8','float16']
> > > + acospi_cpu_func='''
> > > +static float acospi(float x){
> > > + return acos(x)/M_PI;
> > > +} '''
> > > + acospiUtests = func('acospi','acospi',[acospi_input_type],acospi_output_type,[acospi_input_values],'4 * FLT_ULP',acospi_cpu_func)
> > > +
> > > + ##### gentype asin(gentype)
> > > + asin_input_values = base_input_values
> > > + asin_input_type = ['float','float2','float4','float8','float16']
> > > + asin_output_type = ['float','float2','float4','float8','float16']
> > > + asinUtests = func('asin','asin',[asin_input_type],asin_output_type,[asin_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype asinh(gentype)
> > > + asinh_input_values = base_input_values
> > > + asinh_input_type = ['float','float2','float4','float8','float16']
> > > + asinh_output_type = ['float','float2','float4','float8','float16']
> > > + asinhUtests = func('asinh','asinh',[asinh_input_type],asinh_output_type,[asinh_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype asinpi(gentype x)
> > > + asinpi_input_values = base_input_values
> > > + asinpi_input_type = ['float','float2','float4','float8','float16']
> > > + asinpi_output_type = ['float','float2','float4','float8','float16']
> > > + asinpi_cpu_func='''
> > > +static float asinpi(float x){
> > > + return asin(x)/M_PI;
> > > +} '''
> > > + asinpiUtests = func('asinpi','asinpi',[asinpi_input_type],asinpi_output_type,[asinpi_input_values],'4 * FLT_ULP',asinpi_cpu_func)
> > > +
> > > + ##### gentype atan(gentype y_over_x)
> > > + atan_input_values = base_input_values
> > > + atan_input_type = ['float','float2','float4','float8','float16']
> > > + atan_output_type = ['float','float2','float4','float8','float16']
> > > + atanUtests = func('atan','atan',[atan_input_type],atan_output_type,[atan_input_values],'5 * FLT_ULP')
> > > +
> > > + ##### gentype atan2(gentype y, gentype x)
> > > + atan2_base_values = base_input_values
> > > + atan2_input_values1 = []
> > > + atan2_input_values2 = []
> > > +
> > > + atan2_input_values1,atan2_input_values2=gene2ValuesLoop(atan2_input_values1,atan2_input_values2,atan2_base_values)
> > > + atan2_input_type1 = ['float','float2','float4','float8','float16']
> > > + atan2_input_type2 = ['float','float2','float4','float8','float16']
> > > + atan2_output_type = ['float','float2','float4','float8','float16']
> > > + atan2Utests = func('atan2','atan2',[atan2_input_type1,atan2_input_type2],atan2_output_type,[atan2_input_values1,atan2_input_values2],'6 * FLT_ULP')
> > > +
> > > + ##### gentype atanh(gentype)
> > > + atanh_input_values = base_input_values
> > > + atanh_input_type = ['float','float2','float4','float8','float16']
> > > + atanh_output_type = ['float','float2','float4','float8','float16']
> > > + atanhUtests = func('atanh','atanh',[atanh_input_type],atanh_output_type,[atanh_input_values],'5 * FLT_ULP')
> > > +
> > > + ##### gentype atanpi(gentype x)
> > > + atanpi_input_values = base_input_values
> > > + atanpi_input_type = ['float','float2','float4','float8','float16']
> > > + atanpi_output_type = ['float','float2','float4','float8','float16']
> > > + atanpi_cpu_func='''
> > > +static float atanpi(float x){
> > > + return atan(x)/M_PI;
> > > +} '''
> > > + atanpiUtests = func('atanpi','atanpi',[atanpi_input_type],atanpi_output_type,[atanpi_input_values],'4 * FLT_ULP',atanpi_cpu_func)
> > > +
> > > +# ##### gentype atan2pi(gentype y, gentype x)
> > > +# atan2pi_base_values = base_input_values
> > > +# atan2pi_input_values1 = []
> > > +# atan2pi_input_values2 = []
> > > +# atan2pi_input_values1,atan2pi_input_values2=gene2ValuesLoop(atan2pi_input_values1,atan2pi_input_values2,atan2pi_base_values)
> > > +# atan2pi_input_type1 = ['float','float2','float4','float8','float16']
> > > +# atan2pi_input_type2 = ['float','float2','float4','float8','float16']
> > > +# atan2pi_output_type = ['float','float2','float4','float8','float16']
> > > +# atan2pi_cpu_func='''
> > > +#static float atan2pi(float y, float x){
> > > +# return atan2(y,x)/M_PI;
> > > +#} '''
> > > +# atan2piUtests = func('atan2pi','atan2pi',[atan2pi_input_type1,atan2pi_input_type2],atan2pi_output_type,[atan2pi_input_values1,atan2pi_input_values2],'6 * FLT_ULP',atan2pi_cpu_func)
> > > +
> > > + ##### gentype cbrt(gentype)
> > > + cbrt_input_values = base_input_values
> > > + cbrt_input_type = ['float','float2','float4','float8','float16']
> > > + cbrt_output_type = ['float','float2','float4','float8','float16']
> > > + cbrtUtests = func('cbrt','cbrt',[cbrt_input_type],cbrt_output_type,[cbrt_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype ceil(gentype)
> > > + ceil_input_values = base_input_values
> > > + ceil_input_type = ['float','float2','float4','float8','float16']
> > > + ceil_output_type = ['float','float2','float4','float8','float16']
> > > + ceilUtests = func('ceil','ceil',[ceil_input_type],ceil_output_type,[ceil_input_values],'0 * FLT_ULP')
> > > +
> > > + ##### gentype copysign(gentype x, gentype y)
> > > + copysign_base_values = base_input_values
> > > + copysign_input_values1 = []
> > > + copysign_input_values2 = []
> > > +
> > > + copysign_input_values1,copysign_input_values2=gene2ValuesLoop(copysign_input_values1,copysign_input_values2,copysign_base_values)
> > > + copysign_input_type1 = ['float','float2','float4','float8','float16']
> > > + copysign_input_type2 = ['float','float2','float4','float8','float16']
> > > + copysign_output_type = ['float','float2','float4','float8','float16']
> > > + copysignUtests = func('copysign','copysign',[copysign_input_type1,copysign_input_type2],copysign_output_type,[copysign_input_values1,copysign_input_values2],'0 * FLT_ULP')
> > > +
> > > + ##### gentype cos(gentype)
> > > + cos_input_values = base_input_values
> > > + cos_input_type = ['float','float2','float4','float8','float16']
> > > + cos_output_type = ['float','float2','float4','float8','float16']
> > > + cosUtests = func('cos','cos',[cos_input_type],cos_output_type,[cos_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype cosh(gentype)
> > > + cosh_input_values = base_input_values
> > > + cosh_input_type = ['float','float2','float4','float8','float16']
> > > + cosh_output_type = ['float','float2','float4','float8','float16']
> > > + coshUtests = func('cosh','cosh',[cosh_input_type],cosh_output_type,[cosh_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype cospi(gentype x)
> > > + cospi_input_values = base_input_values
> > > + cospi_input_type = ['float','float2','float4','float8','float16']
> > > + cospi_output_type = ['float','float2','float4','float8','float16']
> > > + cospi_cpu_func='''
> > > +static float cospi(float x){
> > > + return cos(M_PI * x);
> > > +} '''
> > > + cospiUtests = func('cospi','cospi',[cospi_input_type],cospi_output_type,[cospi_input_values],'2 * FLT_ULP',cospi_cpu_func)
> > > +
> > > + ##### gentype erf(gentype)
> > > + erf_input_values = base_input_values
> > > + erf_input_type = ['float','float2','float4','float8','float16']
> > > + erf_output_type = ['float','float2','float4','float8','float16']
> > > + erfUtests = func('erf','erf',[erf_input_type],erf_output_type,[erf_input_values],'16 * FLT_ULP')
> > > +
> > > + ##### gentype erfc(gentype)
> > > + erfc_input_values = base_input_values
> > > + erfc_input_type = ['float','float2','float4','float8','float16']
> > > + erfc_output_type = ['float','float2','float4','float8','float16']
> > > + erfcUtests = func('erfc','erfc',[erfc_input_type],erfc_output_type,[erfc_input_values],'16 * FLT_ULP')
> > > +
> > > + ##### gentype exp(gentype x)
> > > + exp_input_values = base_input_values
> > > + exp_input_type = ['float','float2','float4','float8','float16']
> > > + exp_output_type = ['float','float2','float4','float8','float16']
> > > + expUtests = func('exp','exp',[exp_input_type],exp_output_type,[exp_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype exp2(gentype)
> > > + exp2_input_values = base_input_values
> > > + exp2_input_type = ['float','float2','float4','float8','float16']
> > > + exp2_output_type = ['float','float2','float4','float8','float16']
> > > + exp2Utests = func('exp2','exp2',[exp2_input_type],exp2_output_type,[exp2_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype exp10(gentype)
> > > + exp10_input_values = base_input_values
> > > + exp10_input_type = ['float','float2','float4','float8','float16']
> > > + exp10_output_type = ['float','float2','float4','float8','float16']
> > > + exp10Utests = func('exp10','exp10',[exp10_input_type],exp10_output_type,[exp10_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype expm1(gentype x)
> > > + expm1_input_values = base_input_values
> > > + expm1_input_type = ['float','float2','float4','float8','float16']
> > > + expm1_output_type = ['float','float2','float4','float8','float16']
> > > + expm1Utests = func('expm1','expm1',[expm1_input_type],expm1_output_type,[expm1_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype fabs(gentype)
> > > + fabs_input_values = base_input_values
> > > + fabs_input_type = ['float','float2','float4','float8','float16']
> > > + fabs_output_type = ['float','float2','float4','float8','float16']
> > > + fabsUtests = func('fabs','fabs',[fabs_input_type],fabs_output_type,[fabs_input_values],'0 * FLT_ULP')
> > > +
> > > + ##### gentype fdim(gentype x, gentype y)
> > > + fdim_base_values = base_input_values
> > > + fdim_input_values1 = []
> > > + fdim_input_values2 = []
> > > + fdim_input_values1,fdim_input_values2=gene2ValuesLoop(fdim_input_values1,fdim_input_values2,fdim_base_values)
> > > + fdim_input_type1 = ['float','float2','float4','float8','float16']
> > > + fdim_input_type2 = ['float','float2','float4','float8','float16']
> > > + fdim_output_type = ['float','float2','float4','float8','float16']
> > > + fdimUtests = func('fdim','fdim',[fdim_input_type1,fdim_input_type2],fdim_output_type,[fdim_input_values1,fdim_input_values2],'0 * FLT_ULP')
> > > +
> > > + ##### gentype floor(gentype)
> > > + floor_input_values = base_input_values
> > > + floor_input_type = ['float','float2','float4','float8','float16']
> > > + floor_output_type = ['float','float2','float4','float8','float16']
> > > + floorUtests = func('floor','floor',[floor_input_type],floor_output_type,[floor_input_values],'0 * FLT_ULP')
> > > +
> > > + ##### gentype fma(gentype a, gentype b, gentype c)
> > > + fma_base_values = base_input_values
> > > + fma_input_values1 = []
> > > + fma_input_values2 = []
> > > + fma_input_values3 = []
> > > + fma_input_values1,fma_input_values2,fma_input_values3=gene3ValuesLoop(fma_input_values1,fma_input_values2,fma_input_values3,fma_base_values)
> > > + fma_input_type1 = ['float','float2','float4','float8','float16']
> > > + fma_input_type2 = ['float','float2','float4','float8','float16']
> > > + fma_input_type3 = ['float','float2','float4','float8','float16']
> > > + fma_output_type = ['float','float2','float4','float8','float16']
> > > + fmaUtests = func('fma','fma',[fma_input_type1,fma_input_type2,fma_input_type3],fma_output_type,[fma_input_values1,fma_input_values2,fma_input_values3],'0 * FLT_ULP')
> > > +
> > > + ##### gentype fmax(gentype x, gentype y)
> > > + fmax_base_values = base_input_values
> > > + fmax_input_values1 = []
> > > + fmax_input_values2 = []
> > > + fmax_input_values1,fmax_input_values2=gene2ValuesLoop(fmax_input_values1,fmax_input_values2,fmax_base_values)
> > > + fmax_input_type1 = ['float','float2','float4','float8','float16']
> > > + fmax_input_type2 = ['float','float2','float4','float8','float16']
> > > + fmax_output_type = ['float','float2','float4','float8','float16']
> > > + fmaxUtests = func('fmax','fmax',[fmax_input_type1,fmax_input_type2],fmax_output_type,[fmax_input_values1,fmax_input_values2],'0 * FLT_ULP')
> > > +
> > > + ##### gentypef fmax(gentypef x, float y)
> > > +# fmax_gentypef_base_values = base_input_values
> > > +# fmax_gentypef_input_values1 = []
> > > +# fmax_gentypef_input_values2 = []
> > > +# fmax_gentypef_input_values2,fmax_gentypef_input_values1=gene2ValuesLoop(fmax_gentypef_input_values1,fmax_gentypef_input_values2,fmax_gentypef_base_values)
> > > +# fmax_gentypef_input_type1 = ['float','float2','float4','float8','float16']
> > > +# fmax_gentypef_input_type2 = ['float','float','float','float','float']
> > > +# fmax_gentypef_output_type = ['float','float2','float4','float8','float16']
> > > +# ##### gentypef fmax(gentypef x, float y)
> > > +# fmax_gentypefUtests = func('gentypef_fmax','gentypef_fmax',[fmax_gentypef_input_type1,fmax_gentypef_input_type2],fmax_gentypef_output_type,[fmax_gentypef_input_values1,fmax_gentypef_input_values2],'0 * FLT_ULP')
> > > +
> > > + ##### gentype fmin(gentype x, gentype y)
> > > + fmin_base_values = base_input_values
> > > + fmin_input_values1 = []
> > > + fmin_input_values2 = []
> > > + fmin_input_values1,fmin_input_values2=gene2ValuesLoop(fmin_input_values1,fmin_input_values2,fmin_base_values)
> > > + fmin_input_type1 = ['float','float2','float4','float8','float16']
> > > + fmin_input_type2 = ['float','float2','float4','float8','float16']
> > > + fmin_output_type = ['float','float2','float4','float8','float16']
> > > + fminUtests = func('fmin','fmin',[fmin_input_type1,fmin_input_type2],fmin_output_type,[fmin_input_values1,fmin_input_values2],'0 * FLT_ULP')
> > > +
> > > +# ##### gentypef fmin(gentypef x, float y)
> > > +# fmin_gentypef_base_values = base_input_values
> > > +# fmin_gentypef_input_values1 = []
> > > +# fmin_gentypef_input_values2 = []
> > > +# fmin_gentypef_input_values2,fmin_gentypef_input_values1=gene2ValuesLoop(fmin_gentypef_input_values1,fmin_gentypef_input_values2,fmin_gentypef_base_values)
> > > +# fmin_gentypef_input_type1 = ['float','float2','float4','float8','float16']
> > > +# fmin_gentypef_input_type2 = ['float','float','float','float','float']
> > > +# fmin_gentypef_output_type = ['float','float2','float4','float8','float16']
> > > +# ##### gentypef fmin(gentypef x, float y)
> > > +# fmin_gentypefUtests = func('gentypef_fmin','gentypef_fmin',[fmin_gentypef_input_type1,fmin_gentypef_input_type2],fmin_gentypef_output_type,[fmin_gentypef_input_values1,fmin_gentypef_input_values2],'0 * FLT_ULP')
> > > +#
> > > + ##### gentype fmod(gentype x, gentype y)
> > > + fmod_base_values = base_input_values
> > > + fmod_input_values1 = []
> > > + fmod_input_values2 = []
> > > + fmod_input_values1,fmod_input_values2=gene2ValuesLoop(fmod_input_values1,fmod_input_values2,fmod_base_values)
> > > + fmod_input_type1 = ['float','float2','float4','float8','float16']
> > > + fmod_input_type2 = ['float','float2','float4','float8','float16']
> > > + fmod_output_type = ['float','float2','float4','float8','float16']
> > > + fmodUtests = func('fmod','fmod',[fmod_input_type1,fmod_input_type2],fmod_output_type,[fmod_input_values1,fmod_input_values2],'0 * FLT_ULP')
> > > +
> > > + ##### gentype hypot(gentype x, gentype y)
> > > + hypot_base_values = base_input_values
> > > + hypot_input_values1 = []
> > > + hypot_input_values2 = []
> > > + hypot_input_values1,hypot_input_values2=gene2ValuesLoop(hypot_input_values1,hypot_input_values2,hypot_base_values)
> > > + hypot_input_type1 = ['float','float2','float4','float8','float16']
> > > + hypot_input_type2 = ['float','float2','float4','float8','float16']
> > > + hypot_output_type = ['float','float2','float4','float8','float16']
> > > + hypotUtests = func('hypot','hypot',[hypot_input_type1,hypot_input_type2],hypot_output_type,[hypot_input_values1,hypot_input_values2],'4 * FLT_ULP')
> > > +
> > > + ##### intn ilogb(floatn x)
> > > + ilogb_input_values = base_input_values
> > > + ilogb_input_type = ['float','float2','float4','float8','float16']
> > > + ilogb_output_type = ['int','int2','int4','int8','int16']
> > > + ilogbUtests = func('ilogb','ilogb',[ilogb_input_type],ilogb_output_type,[ilogb_input_values],'0 * INT_ULP')
> > > +
> > > + ##### gentype lgamma(gentype x)
> > > + lgamma_input_values = base_input_values
> > > + lgamma_input_type = ['float','float2','float4','float8','float16']
> > > + lgamma_output_type = ['float','float2','float4','float8','float16']
> > > + lgammaUtests = func('lgamma','lgamma',[lgamma_input_type],lgamma_output_type,[lgamma_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype log(gentype)
> > > + log_input_values = base_input_values
> > > + log_input_type = ['float','float2','float4','float8','float16']
> > > + log_output_type = ['float','float2','float4','float8','float16']
> > > + logUtests = func('log','log',[log_input_type],log_output_type,[log_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype log2(gentype)
> > > + log2_input_values = base_input_values
> > > + log2_input_type = ['float','float2','float4','float8','float16']
> > > + log2_output_type = ['float','float2','float4','float8','float16']
> > > + log2Utests = func('log2','log2',[log2_input_type],log2_output_type,[log2_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype log10(gentype)
> > > + log10_input_values = base_input_values
> > > + log10_input_type = ['float','float2','float4','float8','float16']
> > > + log10_output_type = ['float','float2','float4','float8','float16']
> > > + log10Utests = func('log10','log10',[log10_input_type],log10_output_type,[log10_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype log1p(gentype x)
> > > + log1p_input_values = base_input_values
> > > + log1p_input_type = ['float','float2','float4','float8','float16']
> > > + log1p_output_type = ['float','float2','float4','float8','float16']
> > > + log1pUtests = func('log1p','log1p',[log1p_input_type],log1p_output_type,[log1p_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype logb(gentype x)
> > > + logb_input_values = base_input_values
> > > + logb_input_type = ['float','float2','float4','float8','float16']
> > > + logb_output_type = ['float','float2','float4','float8','float16']
> > > + logbUtests = func('logb','logb',[logb_input_type],logb_output_type,[logb_input_values],'0 * FLT_ULP')
> > > +
> > > + ##### gentype maxmag(gentype x, gentype y)
> > > + maxmag_base_values = base_input_values
> > > + maxmag_input_values1 = []
> > > + maxmag_input_values2 = []
> > > + maxmag_input_values1,maxmag_input_values2=gene2ValuesLoop(maxmag_input_values1,maxmag_input_values2,maxmag_base_values)
> > > + maxmag_input_type1 = ['float','float2','float4','float8','float16']
> > > + maxmag_input_type2 = ['float','float2','float4','float8','float16']
> > > + maxmag_output_type = ['float','float2','float4','float8','float16']
> > > + maxmag_cpu_func='''
> > > +static float maxmag(float x, float y){
> > > + if(fabs(x) > fabs(y))
> > > + return x;
> > > + else if (fabs(x) < fabs(y))
> > > + return y;
> > > + else
> > > + return fmax(x,y);
> > > +} '''
> > > + maxmagUtests = func('maxmag','maxmag',[maxmag_input_type1,maxmag_input_type2],maxmag_output_type,[maxmag_input_values1,maxmag_input_values2],'0 * FLT_ULP',maxmag_cpu_func)
> > > +
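Inline note on maxmag_cpu_func: the C reference above picks the argument with the larger magnitude and falls back to fmax() on a tie. A minimal Python mirror of that logic follows, useful for checking expected values by hand. The names fmax_ref and maxmag_ref are hypothetical and do not appear in the patch; fmax_ref assumes the usual C fmax() behaviour of returning the non-NaN argument.

```python
import math

def fmax_ref(x, y):
    """Mimic C fmax(): if one argument is NaN, return the other one."""
    if math.isnan(x):
        return y
    if math.isnan(y):
        return x
    return max(x, y)

def maxmag_ref(x, y):
    """Mirror of the quoted C maxmag(): larger magnitude wins, ties use fmax."""
    if abs(x) > abs(y):
        return x
    if abs(x) < abs(y):
        return y
    return fmax_ref(x, y)

print(maxmag_ref(-3.0, 2.0))   # larger magnitude is the negative argument
print(maxmag_ref(2.0, -2.0))   # equal magnitudes, so fmax decides
```

minmag follows the same shape with the comparisons swapped and fmin() on a tie.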
> > > + ##### gentype minmag(gentype x, gentype y)
> > > + minmag_base_values = base_input_values
> > > + minmag_input_values1 = []
> > > + minmag_input_values2 = []
> > > + minmag_input_values1,minmag_input_values2=gene2ValuesLoop(minmag_input_values1,minmag_input_values2,minmag_base_values)
> > > + minmag_input_type1 = ['float','float2','float4','float8','float16']
> > > + minmag_input_type2 = ['float','float2','float4','float8','float16']
> > > + minmag_output_type = ['float','float2','float4','float8','float16']
> > > + minmag_cpu_func='''
> > > +static float minmag(float x, float y){
> > > + if(fabs(x) < fabs(y))
> > > + return x;
> > > + else if (fabs(x) > fabs(y))
> > > + return y;
> > > + else
> > > + return fmin(x,y);
> > > +} '''
> > > + minmagUtests = func('minmag','minmag',[minmag_input_type1,minmag_input_type2],minmag_output_type,[minmag_input_values1,minmag_input_values2],'0 * FLT_ULP',minmag_cpu_func)
> > > +
> > > +# ##### floatn nan(uintn nancode)
> > > +# nan_input_values = base_input_values
> > > +# nan_input_type = ['uint','uint2','uint4','uint8','uint16']
> > > +# nan_output_type = ['float','float2','float4','float8','float16']
> > > +# nanUtests = func('nan','nan',[nan_input_type],nan_output_type,[nan_input_values],'0 * FLT_ULP')
> > > +
> > > + ##### gentype nextafter(gentype x, gentype y)
> > > + nextafter_base_values = base_input_values
> > > + nextafter_input_values1 = []
> > > + nextafter_input_values2 = []
> > > + nextafter_input_values1,nextafter_input_values2=gene2ValuesLoop(nextafter_input_values1,nextafter_input_values2,nextafter_base_values)
> > > + nextafter_input_type1 = ['float','float2','float4','float8','float16']
> > > + nextafter_input_type2 = ['float','float2','float4','float8','float16']
> > > + nextafter_output_type = ['float','float2','float4','float8','float16']
> > > + nextafterUtests = func('nextafter','nextafter',[nextafter_input_type1,nextafter_input_type2],nextafter_output_type,[nextafter_input_values1,nextafter_input_values2],'0 * FLT_ULP')
> > > +
> > > + ##### gentype pow(gentype x, gentype y)
> > > + pow_base_values = base_input_values
> > > + pow_input_values1 = []
> > > + pow_input_values2 = []
> > > + pow_input_values1,pow_input_values2=gene2ValuesLoop(pow_input_values1,pow_input_values2,pow_base_values)
> > > + pow_input_type1 = ['float','float2','float4','float8','float16']
> > > + pow_input_type2 = ['float','float2','float4','float8','float16']
> > > + pow_output_type = ['float','float2','float4','float8','float16']
> > > + powUtests = func('pow','pow',[pow_input_type1,pow_input_type2],pow_output_type,[pow_input_values1,pow_input_values2],'16 * FLT_ULP')
> > > +
> > > + ##### floatn pown(floatn x, intn y)
> > > + pown_input_values1 = [FLT_MAX_POSI,FLT_MIN_NEGA,FLT_MIN_POSI,FLT_MAX_NEGA,80, -80, 3.14, -3.14, -0.5, 0.5, 1, -1, 0.0,6,-6,1500.24,-1500.24]
> > > + pown_input_values2 = [-1,-2,-3,4,5,6,7,8,9,10,11,12,13,14,15,16,12]
> > > + pown_input_type1 = ['float','float2','float4','float8','float16']
> > > + pown_input_type2 = ['int','int2','int4','int8','int16']
> > > + pown_output_type = ['float','float2','float4','float8','float16']
> > > + pown_cpu_func='''
> > > +static float pown(float x, int y){
> > > + return pow(x,y);
> > > +} '''
> > > + pownUtests = func('pown','pown',[pown_input_type1,pown_input_type2],pown_output_type,[pown_input_values1,pown_input_values2],'16 * FLT_ULP', pown_cpu_func)
> > > +
> > > + ##### gentype powr(gentype x, gentype y)
> > > + powr_input_values1 = [FLT_MAX_POSI,FLT_MIN_NEGA,FLT_MIN_POSI,FLT_MAX_NEGA,80, -80, 3.14, -3.14, -0.5, 0.5, 1, -1, 0.0,6,-6,1500.24,-1500.24]
> > > + powr_input_values2 = [1,2,3.14,4,5,6,7,8,9.889,10,11,12,13,14.33,15,0,12]
> > > + powr_input_type1 = ['float','float2','float4','float8','float16']
> > > + powr_input_type2 = ['float','float2','float4','float8','float16']
> > > + powr_output_type = ['float','float2','float4','float8','float16']
> > > + powr_cpu_func='''
> > > +static float powr(float x, float y){
> > > + return pow(x,y);
> > > +} '''
> > > + powrUtests = func('powr','powr',[powr_input_type1,powr_input_type2],powr_output_type,[powr_input_values1,powr_input_values2],'16 * FLT_ULP', powr_cpu_func)
> > > +
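Inline note on the powr reference: the OpenCL specification defines powr only for x >= 0 and returns NaN for x < 0, while C pow() accepts a negative base with an integral exponent. powr_input_values1 contains negative entries, so a reference built directly on pow() may disagree with the device for those inputs, depending on how the generator pairs the two value lists. A minimal Python sketch of a reference that respects the domain, with the hypothetical name powr_ref and with the zero-base edge cases deliberately left out:

```python
import math

def powr_ref(x, y):
    """powr-style reference: NaN for a negative base, pow() otherwise.

    Sketch only: zero bases with y <= 0 are not handled here
    (math.pow raises ValueError for 0.0 ** negative).
    """
    if x < 0:
        return math.nan
    return math.pow(x, y)

print(math.pow(-80.0, 2.0))    # plain pow accepts the negative base
print(powr_ref(-80.0, 2.0))    # the powr-style reference yields NaN
```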
> > > + ##### gentype remainder(gentype x, gentype y)
> > > + remainder_base_values = base_input_values
> > > + remainder_input_values1 = []
> > > + remainder_input_values2 = []
> > > + remainder_input_values1,remainder_input_values2=gene2ValuesLoop(remainder_input_values1,remainder_input_values2,remainder_base_values)
> > > + remainder_input_type1 = ['float','float2','float4','float8','float16']
> > > + remainder_input_type2 = ['float','float2','float4','float8','float16']
> > > + remainder_output_type = ['float','float2','float4','float8','float16']
> > > + remainderUtests = func('remainder','remainder',[remainder_input_type1,remainder_input_type2],remainder_output_type,[remainder_input_values1,remainder_input_values2],'0 * FLT_ULP')
> > > +
> > > + ##### gentype rint(gentype x)
> > > + rint_input_values = base_input_values
> > > + rint_input_type = ['float','float2','float4','float8','float16']
> > > + rint_output_type = ['float','float2','float4','float8','float16']
> > > + rintUtests = func('rint','rint',[rint_input_type],rint_output_type,[rint_input_values],'0 * FLT_ULP')
> > > +
> > > + ##### floatn rootn(floatn x, intn y)
> > > + rootn_input_values1 = [FLT_MAX_POSI,FLT_MIN_NEGA,FLT_MIN_POSI,FLT_MAX_NEGA,80, -80, 3.14, -3.14, -0.5, 0.5, 1, -1, 0.0,6,-6,1500.24,-1500.24,2,3,4]
> > > + rootn_input_values2 = [-1,-2,-3,2,3,6,7,8,9,2,11,12,13,14,15,16,2,2,2,2]
> > > + rootn_input_type1 = ['float','float2','float4','float8','float16']
> > > + rootn_input_type2 = ['int','int2','int4','int8','int16']
> > > + rootn_output_type = ['float','float2','float4','float8','float16']
> > > + rootn_cpu_func='''
> > > +static float rootn(float x, int y){
> > > + return pow(x,1.0/y);
> > > +} '''
> > > + rootnUtests = func('rootn','rootn',[rootn_input_type1,rootn_input_type2],rootn_output_type,[rootn_input_values1,rootn_input_values2],'4 * FLT_ULP',rootn_cpu_func)
> > > +
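Inline note on rootn_cpu_func: C pow() returns NaN for a negative base with a non-integral exponent, so pow(x, 1.0/y) yields NaN for every negative x, including odd y where a real root exists (OpenCL rootn only specifies NaN for x < 0 with even n). Whether this matters depends on how the generator pairs rootn_input_values1 with rootn_input_values2. A minimal Python sketch of a sign-aware reference, with the hypothetical name rootn_ref and assuming x != 0 and n != 0:

```python
import math

def rootn_ref(x, n):
    """Real n-th root of x; NaN when x < 0 and n is even.

    Sketch only: assumes x != 0 and n != 0 (those edge cases are omitted).
    """
    if x < 0:
        if n % 2 == 0:
            return math.nan
        # Odd root of a negative number: take the root of |x|, restore the sign.
        return -math.pow(-x, 1.0 / n)
    return math.pow(x, 1.0 / n)

print(rootn_ref(-8.0, 3))   # real cube root of a negative value
print(rootn_ref(-8.0, 2))   # no real square root, so NaN
```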
> > > + ##### gentype round(gentype x)
> > > + round_input_values = base_input_values
> > > + round_input_type = ['float','float2','float4','float8','float16']
> > > + round_output_type = ['float','float2','float4','float8','float16']
> > > + roundUtests = func('round','round',[round_input_type],round_output_type,[round_input_values],'0 * FLT_ULP')
> > > +
> > > + ##### gentype rsqrt(gentype)
> > > + rsqrt_input_values = base_input_values
> > > + rsqrt_input_type = ['float','float2','float4','float8','float16']
> > > + rsqrt_output_type = ['float','float2','float4','float8','float16']
> > > + rsqrt_cpu_func='''
> > > +static float rsqrt(float x)
> > > +{ return 1/sqrt(x);} '''
> > > + rsqrtUtests = func('rsqrt','rsqrt',[rsqrt_input_type],rsqrt_output_type,[rsqrt_input_values],'4 * FLT_ULP', rsqrt_cpu_func)
> > > +
> > > +
> > > + ##### gentype sin(gentype)
> > > + sin_input_values = base_input_values
> > > + sin_input_type = ['float','float2','float4','float8','float16']
> > > + sin_output_type = ['float','float2','float4','float8','float16']
> > > + sinUtests = func('sin','sin',[sin_input_type],sin_output_type,[sin_input_values],'4 * FLT_ULP')
> > > +
> > > +# ##### gentype sincos(gentype)
> > > +# sincos_input_values1 = [FLT_MAX_POSI,FLT_MIN_NEGA,FLT_MIN_POSI,FLT_MAX_NEGA,80, -80, 3.14, -3.14, -0.5, 0.5, 1, -1, 0.0,6,-6,1500.24,-1500.24]
> > > +# sincos_input_values2 = []
> > > +# sincos_input_type1 = ['float','float2','float4','float8','float16']
> > > +# sincos_input_type2 = ['float','float2','float4','float8','float16']
> > > +# sincos_output_type = ['float','float2','float4','float8','float16']
> > > +# ###### gentype sincos(gentype)
> > > +# # sincosUtests = func('sincos','sincos',[sincos_input_type1,sincos_input_type2],sincos_output_type,[sincos_input_values1,sincos_input_values2],'4 * FLT_ULP')
> > > +
> > > + ##### gentype sinh(gentype)
> > > + sinh_input_values = base_input_values
> > > + sinh_input_type = ['float','float2','float4','float8','float16']
> > > + sinh_output_type = ['float','float2','float4','float8','float16']
> > > + sinhUtests = func('sinh','sinh',[sinh_input_type],sinh_output_type,[sinh_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype sinpi(gentype x)
> > > + sinpi_input_values = base_input_values
> > > + sinpi_input_type = ['float','float2','float4','float8','float16']
> > > + sinpi_output_type = ['float','float2','float4','float8','float16']
> > > + sinpi_cpu_func='''
> > > +static float sinpi(float x){
> > > + return sin(M_PI*x);
> > > +} '''
> > > + sinpiUtests = func('sinpi','sinpi',[sinpi_input_type],sinpi_output_type,[sinpi_input_values],'4 * FLT_ULP',sinpi_cpu_func)
> > > +
> > > + ##### gentype sqrt(gentype)
> > > + sqrt_input_values = base_input_values
> > > + sqrt_input_type = ['float','float2','float4','float8','float16']
> > > + sqrt_output_type = ['float','float2','float4','float8','float16']
> > > + sqrtUtests = func('sqrt','sqrt',[sqrt_input_type],sqrt_output_type,[sqrt_input_values],'4 * FLT_ULP')
> > > +
> > > + ##### gentype tan(gentype)
> > > + tan_input_values = base_input_values
> > > + tan_input_type = ['float','float2','float4','float8','float16']
> > > + tan_output_type = ['float','float2','float4','float8','float16']
> > > + tanUtests = func('tan','tan',[tan_input_type],tan_output_type,[tan_input_values],'5 * FLT_ULP')
> > > +
> > > + ##### gentype tanh(gentype)
> > > + tanh_input_values = base_input_values
> > > + tanh_input_type = ['float','float2','float4','float8','float16']
> > > + tanh_output_type = ['float','float2','float4','float8','float16']
> > > + tanhUtests = func('tanh','tanh',[tanh_input_type],tanh_output_type,[tanh_input_values],'5 * FLT_ULP')
> > > +
> > > + ##### gentype tanpi(gentype x)
> > > + tanpi_input_values = base_input_values
> > > + tanpi_input_type = ['float','float2','float4','float8','float16']
> > > + tanpi_output_type = ['float','float2','float4','float8','float16']
> > > + tanpi_cpu_func='''
> > > +static float tanpi(float x){
> > > + return tan(M_PI*x);
> > > +} '''
> > > + tanpiUtests = func('tanpi','tanpi',[tanpi_input_type],tanpi_output_type,[tanpi_input_values],'4 * FLT_ULP',tanpi_cpu_func)
> > > +
> > > + ##### gentype tgamma(gentype)
> > > + tgamma_input_values = base_input_values
> > > + tgamma_input_type = ['float','float2','float4','float8','float16']
> > > + tgamma_output_type = ['float','float2','float4','float8','float16']
> > > + tgammaUtests = func('tgamma','tgamma',[tgamma_input_type],tgamma_output_type,[tgamma_input_values],'16 * FLT_ULP')
> > > +
> > > + ##### gentype trunc(gentype)
> > > + trunc_input_values = base_input_values
> > > + trunc_input_type = ['float','float2','float4','float8','float16']
> > > + trunc_output_type = ['float','float2','float4','float8','float16']
> > > + truncUtests = func('trunc','trunc',[trunc_input_type],trunc_output_type,[trunc_input_values],'0 * FLT_ULP')
> > > +
> > > +if __name__ == "__main__":
> > > + main()
> > > --
> > > 1.7.6.4
> > >
> > > _______________________________________________
> > > Beignet mailing list
> > > Beignet at lists.freedesktop.org
> > > http://lists.freedesktop.org/mailman/listinfo/beignet