<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<meta name="Generator" content="Microsoft Word 14 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri","sans-serif";
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-family:"Calibri","sans-serif";}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-US" link="blue" vlink="purple">
<div class="WordSection1">
<p class="MsoNormal">I am experimenting with adding image support to Mesa and am encountering something I don’t understand with the mapping routines implemented in transfer.cpp, within the soft_copy_op function.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">We are working on 10.1 branch of Mesa which has some tweaks to let Mesa run OpenCL kernels build from the AMD OpenCL driver, so my issues may very well be an artifact of that environment.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">In any case, here is what I am seeing:<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">I have a simple program which generates an image of a given size, and sets each pixel to a value from a given starting point with a given increment value.<o:p></o:p></p>
<p class="MsoNormal">For example, the command “imageTest 32 32 0 1” would create a 32x32 image, in which pixel (0,0) would be set to 0, (0,1)=1 ... (0,31)=31, (1,0)=32 and so on...<o:p></o:p></p>
<p class="MsoNormal">The kernel takes the x,y coordinates as parameters and returns the value at that location.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">It appears that the default soft_copy_op destination mapping does not provide a compatible setup for images.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Given a 32x32 image of format CL_R8, INT32, the validate_object() routines in clEnqueueWriteImage() calculate a destination pitch of {4,128,4096}<o:p></o:p></p>
<p class="MsoNormal">This results in a size of 4096 being passed to the mapping get (dst_pitch[2]*region[2]) this generates a staging texture of 4096x1x1 pixels.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">With this, I can retrieve the values from the first row of the image, but none of the subsequent rows return valid values.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">After some experimentation (with a 64x4 image) I discovered that there appears to be a minimum row pitch of 256bytes, so the 128 being passed in for the 32x32 image didn’t work properly.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">I changed the clEnqueueWrite routine to adjust the destination pitches as follows:<o:p></o:p></p>
<p class="MsoNormal" style="text-indent:.5in">dst_pitch[1]= MAX2(256,dst_pitch[1]); // row pitch<o:p></o:p></p>
<p class="MsoNormal" style="text-indent:.5in">dst_pitch[2]=dst_pitch[1]*region[1]; // slice pitch<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">That seemed to work at first, but once I started moving to different image sizes, things did not work right at all...<o:p></o:p></p>
<p class="MsoNormal">With a 64x64 image, the subsequent rows no longer mapped properly.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">After more experimentation, I seem to have found a method that will work reliably, with testing of images up to 2048x2048. (I can’t seem to go higher than that, due to a memory leak I have not found yet...)<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">What I did, was add a different mapping get template for images:<o:p></o:p></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New"">template<><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New"">struct _map<clover::image*> {<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""> static mapping<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""> get(command_queue &q, clover::image *img, cl_map_flags flags, size_t offset, size_t size) {<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""> return { q, img->resource(q), flags, true, {{offset}}, {{size, size, 1}} };<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""> }<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New"">} <o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Then, in the soft_copy_op routine, instead of passing in the slice pitch size, I pass in MAX2(region[0],region[1]).<o:p></o:p></p>
<p class="MsoNormal">I’d rather pass in the entire vector for the region, but I didn’t want to rework the other mapping get routines in the template quite yet...<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">This works, as long as my images are square... but once I move to different dimensions (ie 32x64, 64x128) my second row of data is off again...<o:p></o:p></p>
<p class="MsoNormal">I’m assuming this is because of passing the largest dimension in for the mapping get routine, rather than the width and height....
<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">I feel like I’m misunderstanding how these mapping routines are supposed to be working.
<o:p></o:p></p>
<p class="MsoNormal">I’m also concerned that as the image sizes grow, that the use of the ‘memcpy’ will be very inefficient (as opposed to a DMA copy)<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">I’m hoping someone might be able to explain a bit about these mapping routines, and perhaps shed some light on the OpenGL image transfer routines and how they might be accessible from the OpenCL side.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Thanks for reading my rambling message! :)<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><b><span style="font-size:14.0pt">Al Dorrington<o:p></o:p></span></b></p>
<p class="MsoNormal"><i><span style="font-size:10.0pt">Software Engineer Sr<o:p></o:p></span></i></p>
<p class="MsoNormal"><i><span style="font-size:10.0pt">Lockheed Martin, Mission Systems and Training<o:p></o:p></span></i></p>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</body>
</html>