Hardware Support for Low Precision Data Types?

Hi all,

I'm trying to find out if and when we can expect MXFP8/MXFP4 support on Apple Silicon. I've noticed that MLX now supports casting to these data types, but all computation is still done in bf16. Hardware support for these lower-precision types would be a great way to reduce power consumption, since edge inference is already typically done at lower precision!

Thanks in advance.
