Just wanted to share my observations about performance on Apple M1 (although I suspect similar for A series).
I had done something similar and was surprised (yup, I'm a noob) to find that creating a whole new matrix doesn't impact performance at all...
Confirmed with official Xcode compiler statistics (with and without float3x3 conversion)...
Hurray scalar ALUs and optimizing compilers.
Topic:
Graphics & Games
SubTopic:
General
Tags: