Division, unfortunately, is a showstopper for parallel execution. It occupies a lot of resources, even if its result is not needed for the next instruction. It’s in my plans to try hiding division latency — maybe by using division result only on the next iteration. But I still think it will slow down the main loop.
Запись редактировалась последний раз: May 30, 2018, 8:23 pm