JUST ENDED
|

Core AI Q&A

Connect with Apple engineers in the Core AI Q&A on the Apple Developer Forums.

Post

Replies

Boosts

Views

Activity

On Siri & Apple Intelligence
Regarding the 'weight list' of Siri: can you all provide technical specifics on how a model qualifies for this list, and can a developer-supplied model/adapter ever handle requests that originate from the system-wide Siri interface?
1
0
95
1h
Compute unit specification for function runs
When we specialize with preferredComputeUnitKind: .neuralEngine, the resolved options' allowedComputeUnitKinds return all three units, and sometimes a function we intend for ANE ends up on the GPU. We can't find any API that reports where a function actually ran, but system resource utilization shows a GPU spike. Is there a supported way to confirm the actual compute unit at runtime? And how does your prioritization work if we prefer running on the ANE rather than the GPU? Or, can we disallow certain compute units? How does this compute unit selection map with someone coming from CoreML where the desired compute units were honored?
1
0
87
1h
Ahead-of-time (AOT) compilation for older devices
Will ahead of time compilation stay limited to A17 Pro and M1 class hardware? Docs indicate that AOT compilation is only supported for A17 Pro/M1 and newer. iOS 27 supports iPhone 11 (A13) and newer. AOT can't be used on A13-A16 which is a substantial fraction of active user devices in 2026 Is there a path to AOT on older Apple Silicon chips (A14 and newer) down the line? If not, I am afraid, the impact of AOT will be delayed by 3-4 years until it can be broadly leveraged.
3
1
119
1h
Mixing Core AI and Core ML in one pipeline
We built a setup where a model split into an encoder and a decoder can run each part on a different backend, using our own component protocols. Is mixing Core AI and Core ML within a single inference pass something you would recommend, and what is the realistic cost at the boundary where we convert between MLMultiArray / MLTensor and NDArray? Is there a way to keep the encoder output resident on the GPU or ANE so it does not need a host round trip into the other backend?
2
1
162
1h
Modern drawing classifier recommendations
What modern tools would you recommend to replace a drawing classification machine learning model I built years ago (with Turi Create) to recognize emojis trained on drawings users submit? I tried an image classifier with CreateML but training time is much longer and accuracy is worse. Does Apple have a solution for drawing classification? Is a ML model still the best solution, or any AI advancements good for this use case? Thanks!
2
0
99
2h
Specialized models across OS updates
The docs say we can delete the source .aimodel after a .persistent specialize and keep the bookmark to save space. But an OS update always invalidates the cache and bookmarks, so it looks like anyone who deleted the source has to re-download the whole model after every update. For large models, that's a lot of bandwidth and impacts the first-time experience after the update. Is that the intended trade-off, or does the cache hold enough to re-specialize itself? Does every minor OS bumps (27.1 -> 27.2) always invalidate cache when .persisted, or only major ones? Also, can the user delete a .persistent entry themselves through Settings or storage management, or only the app? We need to know whether our "model is ready" state can disappear without the app knowing.
1
0
123
2h
.md summaries of WWDC for Claude
Are there any official .md summaries for Claude? How should we tell our Agents where the source of truth for WWDC new code is?
Replies
1
Boosts
0
Views
25
Activity
49m
On Siri & Apple Intelligence
Regarding the 'weight list' of Siri: can you all provide technical specifics on how a model qualifies for this list, and can a developer-supplied model/adapter ever handle requests that originate from the system-wide Siri interface?
Replies
1
Boosts
0
Views
95
Activity
1h
For Model Control & Metadata
Is there a supported way to 'pin' an app to a specific version of an on-device system model to prevent regression in agentic workflows & what is the typical update cadence for these weights?
Replies
1
Boosts
1
Views
98
Activity
1h
Deployments and Entitemments ability
For apps using the 'MLX distributor' or local adapters, are there any specialized background processing entitlements needed to ensure inference isn't killed by the OS during long-running tasks
Replies
1
Boosts
2
Views
88
Activity
1h
Compute unit specification for function runs
When we specialize with preferredComputeUnitKind: .neuralEngine, the resolved options' allowedComputeUnitKinds return all three units, and sometimes a function we intend for ANE ends up on the GPU. We can't find any API that reports where a function actually ran, but system resource utilization shows a GPU spike. Is there a supported way to confirm the actual compute unit at runtime? And how does your prioritization work if we prefer running on the ANE rather than the GPU? Or, can we disallow certain compute units? How does this compute unit selection map with someone coming from CoreML where the desired compute units were honored?
Replies
1
Boosts
0
Views
87
Activity
1h
Is Core AI capable of submitting Neural Engine requests in the background?
In iOS 26, Core ML reliably submits inference requests targeting the Neural Engine in the background. In iOS 27 Seed 1, this capability is currently broken. Is Core AI expected to reliably submit inference requests targeting the Neural Engine in the background?
Replies
2
Boosts
0
Views
117
Activity
1h
Ahead-of-time (AOT) compilation for older devices
Will ahead of time compilation stay limited to A17 Pro and M1 class hardware? Docs indicate that AOT compilation is only supported for A17 Pro/M1 and newer. iOS 27 supports iPhone 11 (A13) and newer. AOT can't be used on A13-A16 which is a substantial fraction of active user devices in 2026 Is there a path to AOT on older Apple Silicon chips (A14 and newer) down the line? If not, I am afraid, the impact of AOT will be delayed by 3-4 years until it can be broadly leveraged.
Replies
3
Boosts
1
Views
119
Activity
1h
On Model Control & Metadata
While response.usage provides token counts and reasoning signals, are there plans to expose per-token logprobs or confidence scores to help developers build more robust 'evaluators' for non-deterministic outputs?
Replies
1
Boosts
0
Views
54
Activity
1h
Mixing Core AI and Core ML in one pipeline
We built a setup where a model split into an encoder and a decoder can run each part on a different backend, using our own component protocols. Is mixing Core AI and Core ML within a single inference pass something you would recommend, and what is the realistic cost at the boundary where we convert between MLMultiArray / MLTensor and NDArray? Is there a way to keep the encoder output resident on the GPU or ANE so it does not need a host round trip into the other backend?
Replies
2
Boosts
1
Views
162
Activity
1h
Modern drawing classifier recommendations
What modern tools would you recommend to replace a drawing classification machine learning model I built years ago (with Turi Create) to recognize emojis trained on drawings users submit? I tried an image classifier with CreateML but training time is much longer and accuracy is worse. Does Apple have a solution for drawing classification? Is a ML model still the best solution, or any AI advancements good for this use case? Thanks!
Replies
2
Boosts
0
Views
99
Activity
2h
Specialized models across OS updates
The docs say we can delete the source .aimodel after a .persistent specialize and keep the bookmark to save space. But an OS update always invalidates the cache and bookmarks, so it looks like anyone who deleted the source has to re-download the whole model after every update. For large models, that's a lot of bandwidth and impacts the first-time experience after the update. Is that the intended trade-off, or does the cache hold enough to re-specialize itself? Does every minor OS bumps (27.1 -> 27.2) always invalidate cache when .persisted, or only major ones? Also, can the user delete a .persistent entry themselves through Settings or storage management, or only the app? We need to know whether our "model is ready" state can disappear without the app knowing.
Replies
1
Boosts
0
Views
123
Activity
2h
Dynamically loading expert weights on the Neural Engine
AFM3 Core Advanced dynamically loads experts into DRAM* for Neural Engine execution. Is this capability accessible to third party via Core AI or Core ML APIs in iOS 27? Reference: https://machinelearning.apple.com/research/introducing-third-generation-of-apple-foundation-models
Replies
0
Boosts
1
Views
103
Activity
2h