Hello everyone,
We are GPU developers who utilize PTX / GCN / RDNA ISA to develop our software
Is there any reference available for asm-level Neural Engine and GPU, so we can write our custom device code and get it built and running?
Not sure if this is a normal request, I understand there are high-level libraries such as FFT / BLAS / Accelerate etc available, but we need to go down in order to implement our own technology features that solve some specific problems we need to resolve first before rolling out the product for the Mac OS X platform