<div dir="ltr">I think I've gotten through everything at this point. If I've missed anything, please let me know.<br></div><br><div class="gmail_quote"><div dir="ltr">On Fri, Jan 18, 2019 at 5:37 AM Iago Toral <<a href="mailto:itoral@igalia.com">itoral@igalia.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div style="text-align:left;direction:ltr"><div>Thanks a lot of for all the review work! When you're done reviewing all the patches I'll prepare a v4 with all the changes.</div><div><br></div><div>On Thu, 2019-01-17 at 18:24 -0600, Jason Ekstrand wrote:</div><blockquote type="cite" style="margin:0px 0px 0px 0.8ex;border-left:2px solid rgb(114,159,207);padding-left:1ex"><div dir="ltr">I'm done for the day but I've read through most of the patches. I think I've got 4 or 5 tricky ones left. By and large, I think things are looking really good. I don't know that we'll make 19.0 but there's a possibility. If not, it'll likely land shortly after.<br></div><br><div class="gmail_quote"><div dir="ltr">On Tue, Jan 15, 2019 at 7:54 AM Iago Toral Quiroga <<a href="mailto:itoral@igalia.com" target="_blank">itoral@igalia.com</a>> wrote:<br></div><blockquote type="cite" style="margin:0px 0px 0px 0.8ex;border-left:2px solid rgb(114,159,207);padding-left:1ex">The changes in this version address review feedback to v2 and, most importantly,<br>
rebase on top of relevant changes in master, specifically Curro's regioning<br>
lowering pass. This new regioning pass simplifies some of the NIR translation<br>
code (specifically the code for translating regioning restrictions on<br>
conversions for atom platforms) making some of the previous work in this series<br>
unnecessary. The regioning restrictions for conversions between integer and<br>
half-float added with this series are are now implemented as part of this<br>
framework instead of doing it at NIR translation time. This version of the<br>
series also dropped the SPIR-V compiler patches that have already been merged.<br>
<br>
As always, a branch for with these patches is available for testing in the<br>
itoral/VK_KHR_shader_float16_int8 branch of the Igalia Mesa repository at<br>
<a href="https://github.com/Igalia/mesa" rel="noreferrer" target="_blank">https://github.com/Igalia/mesa</a>.<br>
<br>
Iago Toral Quiroga (42):<br>
intel/compiler: handle conversions between int and half-float on atom<br>
intel/compiler: add a NIR pass to lower conversions<br>
intel/compiler: split float to 64-bit opcodes from int to 64-bit<br>
intel/compiler: handle b2i/b2f with other integer conversion opcodes<br>
intel/compiler: assert restrictions on conversions to half-float<br>
intel/compiler: lower some 16-bit float operations to 32-bit<br>
intel/compiler: lower 16-bit extended math to 32-bit prior to gen9<br>
intel/compiler: implement 16-bit fsign<br>
intel/compiler: allow extended math functions with HF operands<br>
compiler/nir: add lowering option for 16-bit fmod<br>
intel/compiler: lower 16-bit fmod<br>
compiler/nir: add lowering for 16-bit flrp<br>
intel/compiler: lower 16-bit flrp<br>
compiler/nir: add lowering for 16-bit ldexp<br>
intel/compiler: Extended Math is limited to SIMD8 on half-float<br>
intel/compiler: add instruction setters for Src1Type and Src2Type.<br>
intel/compiler: add new half-float register type for 3-src<br>
instructions<br>
intel/compiler: add a helper function to query hardware type table<br>
intel/compiler: don't compact 3-src instructions with Src1Type or<br>
Src2Type bits<br>
intel/compiler: allow half-float on 3-source instructions since gen8<br>
intel/compiler: set correct precision fields for 3-source float<br>
instructions<br>
intel/compiler: don't propagate HF immediates to 3-src instructions<br>
intel/compiler: fix ddx and ddy for 16-bit float<br>
intel/compiler: fix ddy for half-float in gen8<br>
intel/compiler: workaround for SIMD8 half-float MAD in gen8<br>
intel/compiler: split is_partial_write() into two variants<br>
intel/compiler: activate 16-bit bit-size lowerings also for 8-bit<br>
intel/compiler: handle 64-bit float to 8-bit integer conversions<br>
intel/compiler: handle conversions between int and half-float on atom<br>
intel/compiler: implement isign for int8<br>
intel/compiler: ask for an integer type if requesting an 8-bit type<br>
intel/eu: force stride of 2 on NULL register for Byte instructions<br>
compiler/spirv: add support for Float16 and Int8 capabilities<br>
anv/pipeline: support Float16 and Int8 capabilities in gen8+<br>
anv/device: expose shaderFloat16 and shaderInt8 in gen8+<br>
intel/compiler: implement is_zero, is_one, is_negative_one for<br>
8-bit/16-bit<br>
intel/compiler: add a brw_reg_type_is_integer helper<br>
intel/compiler: fix cmod propagation for non 32-bit types<br>
intel/compiler: remove MAD/LRP algebraic optimizations from the<br>
backend<br>
intel/compiler: support half-float in the combine constants pass<br>
intel/compiler: fix combine constants for Align16 with half-float<br>
prior to gen9<br>
intel/compiler: allow propagating HF immediates to MAD/LRP<br>
<br>
src/compiler/nir/nir.h | 2 +<br>
src/compiler/nir/nir_opt_algebraic.py | 11 +-<br>
src/compiler/shader_info.h | 2 +<br>
src/compiler/spirv/spirv_to_nir.c | 8 +-<br>
src/intel/Makefile.sources | 1 +<br>
src/intel/compiler/brw_compiler.c | 2 +<br>
src/intel/compiler/brw_eu_compact.c | 5 +-<br>
src/intel/compiler/brw_eu_emit.c | 36 +++-<br>
src/intel/compiler/brw_fs.cpp | 143 ++++++++++------<br>
src/intel/compiler/brw_fs.h | 1 +<br>
.../compiler/brw_fs_cmod_propagation.cpp | 34 ++--<br>
.../compiler/brw_fs_combine_constants.cpp | 82 +++++++--<br>
.../compiler/brw_fs_copy_propagation.cpp | 14 +-<br>
src/intel/compiler/brw_fs_cse.cpp | 3 +-<br>
.../compiler/brw_fs_dead_code_eliminate.cpp | 2 +-<br>
src/intel/compiler/brw_fs_generator.cpp | 47 +++---<br>
src/intel/compiler/brw_fs_live_variables.cpp | 2 +-<br>
src/intel/compiler/brw_fs_nir.cpp | 85 ++++++++--<br>
src/intel/compiler/brw_fs_reg_allocate.cpp | 2 +-<br>
.../compiler/brw_fs_register_coalesce.cpp | 2 +-<br>
.../compiler/brw_fs_saturate_propagation.cpp | 7 +-<br>
src/intel/compiler/brw_fs_sel_peephole.cpp | 4 +-<br>
src/intel/compiler/brw_inst.h | 2 +<br>
src/intel/compiler/brw_ir_fs.h | 36 +++-<br>
src/intel/compiler/brw_nir.c | 21 ++-<br>
src/intel/compiler/brw_nir.h | 2 +<br>
.../compiler/brw_nir_lower_conversions.c | 158 ++++++++++++++++++<br>
src/intel/compiler/brw_reg_type.c | 70 +++++---<br>
src/intel/compiler/brw_reg_type.h | 18 ++<br>
src/intel/compiler/brw_shader.cpp | 26 +++<br>
src/intel/compiler/meson.build | 1 +<br>
src/intel/vulkan/anv_device.c | 9 +<br>
src/intel/vulkan/anv_extensions.py | 1 +<br>
src/intel/vulkan/anv_pipeline.c | 2 +<br>
34 files changed, 674 insertions(+), 167 deletions(-)<br>
create mode 100644 src/intel/compiler/brw_nir_lower_conversions.c<br>
<br>
</blockquote></div></blockquote></div>
</blockquote></div>