Can I Perform Hybrid Execution on Neural Engine and CPU with 16-bit Precision?

Hello,

I have a question regarding hybrid execution for deep learning models on Apple's Neural Engine and CPU. I am aware that setting the precision of some layers to 32-bit allows hybrid execution across both the Neural Engine and the CPU. However, I would like to know if it is possible to achieve the same with 16-bit precision.

Is there any specific configuration or workaround to enable hybrid execution in this case? Any guidance or documentation references would be greatly appreciated.

Thank you!

Can I Perform Hybrid Execution on Neural Engine and CPU with 16-bit Precision?
 
 
Q