Added optimizations for several out_reverse, over_reverse and in operations:
 - out_reverse_8_0565
 - out_reverse_8_8888
 - over_reverse_n_8888
 - in_n_8_8

Benchmark results (lowlevel-blt-bench) on a Malta board (@ 1 GHz) are included in the log messages.

Any comments on these patches are welcome.