Added optimizations for several nearest neighbor scaling fast paths:
 - over_8888_8888
 - over_8888_0565
 - src_0565_8888

Benchmark results (lowlevel-blt-bench) on a Malta board (running at 1 GHz) are included in the log messages.

Any comments on this patch are welcome.
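
For readers less familiar with these fast paths, here is a rough plain C sketch of what a nearest neighbor scaled over_8888_8888 scanline computes. It is not the optimized code from this series. The function names, the 16.16 fixed-point stepping and the assumption that source pixels are premultiplied a8r8g8b8 are illustrative assumptions only.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: multiply every 8-bit channel of a packed
 * a8r8g8b8 pixel by a (0..255) and divide by 255 with rounding. */
static uint32_t mul_un8x4(uint32_t p, uint32_t a)
{
    /* Red and blue are processed together in one 32-bit word. */
    uint32_t rb = (p & 0x00ff00ffu) * a + 0x00800080u;
    rb = ((rb + ((rb >> 8) & 0x00ff00ffu)) >> 8) & 0x00ff00ffu;

    /* Alpha and green are shifted down first, then put back in place. */
    uint32_t ag = ((p >> 8) & 0x00ff00ffu) * a + 0x00800080u;
    ag = (ag + ((ag >> 8) & 0x00ff00ffu)) & 0xff00ff00u;

    return rb | ag;
}

/* Hypothetical reference scanline: for each destination pixel, fetch the
 * nearest source pixel at the 16.16 fixed-point position vx, then apply
 * OVER: dst = src + dst * (255 - src_alpha) / 255.
 * Assumes premultiplied source pixels, so the plain add cannot overflow. */
static void scaled_nearest_over_8888_8888(uint32_t *dst, const uint32_t *src,
                                          int w, int32_t vx, int32_t unit_x)
{
    assert(w >= 0);

    while (w-- > 0) {
        uint32_t s = src[vx >> 16];   /* integer part selects the pixel */
        uint32_t a = s >> 24;

        vx += unit_x;                 /* advance the source position */

        if (a == 0xff)
            *dst = s;                 /* opaque source replaces dest */
        else if (s != 0)
            *dst = s + mul_un8x4(*dst, 255 - a);
        /* fully transparent source leaves dest untouched */

        dst++;
    }
}
```

The over_8888_0565 and src_0565_8888 variants follow the same stepping loop and differ only in the pixel format conversion done around the fetch and store.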