JUST ENDED

Core AI Q&A

Connect with Apple engineers in the Core AI Q&A on the Apple Developer Forums.

Machine Learning & AI Core AI

Post

Replies

Boosts

Views

Activity

.md summaries of WWDC for Claude

Are there any official .md summaries for Claude? How should we tell our Agents where the source of truth for WWDC new code is?

Machine Learning & AI Core AI

386

Jun ’26

On Siri & Apple Intelligence

Regarding the 'weight list' of Siri: can you all provide technical specifics on how a model qualifies for this list, and can a developer-supplied model/adapter ever handle requests that originate from the system-wide Siri interface?

Machine Learning & AI Core AI

407

Jun ’26

For Model Control & Metadata

Is there a supported way to 'pin' an app to a specific version of an on-device system model to prevent regression in agentic workflows & what is the typical update cadence for these weights?

Machine Learning & AI Core AI

390

Jun ’26

Deployments and Entitemments ability

For apps using the 'MLX distributor' or local adapters, are there any specialized background processing entitlements needed to ensure inference isn't killed by the OS during long-running tasks

Machine Learning & AI Core AI

376

Jun ’26

Compute unit specification for function runs

When we specialize with preferredComputeUnitKind: .neuralEngine, the resolved options' allowedComputeUnitKinds return all three units, and sometimes a function we intend for ANE ends up on the GPU. We can't find any API that reports where a function actually ran, but system resource utilization shows a GPU spike. Is there a supported way to confirm the actual compute unit at runtime? And how does your prioritization work if we prefer running on the ANE rather than the GPU? Or, can we disallow certain compute units? How does this compute unit selection map with someone coming from CoreML where the desired compute units were honored?

Machine Learning & AI Core AI

379

Jun ’26

Is Core AI capable of submitting Neural Engine requests in the background?

In iOS 26, Core ML reliably submits inference requests targeting the Neural Engine in the background. In iOS 27 Seed 1, this capability is currently broken. Is Core AI expected to reliably submit inference requests targeting the Neural Engine in the background?

Machine Learning & AI Core AI

464

Jun ’26

Ahead-of-time (AOT) compilation for older devices

Will ahead of time compilation stay limited to A17 Pro and M1 class hardware? Docs indicate that AOT compilation is only supported for A17 Pro/M1 and newer. iOS 27 supports iPhone 11 (A13) and newer. AOT can't be used on A13-A16 which is a substantial fraction of active user devices in 2026 Is there a path to AOT on older Apple Silicon chips (A14 and newer) down the line? If not, I am afraid, the impact of AOT will be delayed by 3-4 years until it can be broadly leveraged.

Machine Learning & AI Core AI

456

Jun ’26

On Model Control & Metadata

While response.usage provides token counts and reasoning signals, are there plans to expose per-token logprobs or confidence scores to help developers build more robust 'evaluators' for non-deterministic outputs?

Machine Learning & AI Core AI

361

Jun ’26

Mixing Core AI and Core ML in one pipeline

We built a setup where a model split into an encoder and a decoder can run each part on a different backend, using our own component protocols. Is mixing Core AI and Core ML within a single inference pass something you would recommend, and what is the realistic cost at the boundary where we convert between MLMultiArray / MLTensor and NDArray? Is there a way to keep the encoder output resident on the GPU or ANE so it does not need a host round trip into the other backend?

Machine Learning & AI Core AI

476

Jun ’26

Modern drawing classifier recommendations

What modern tools would you recommend to replace a drawing classification machine learning model I built years ago (with Turi Create) to recognize emojis trained on drawings users submit? I tried an image classifier with CreateML but training time is much longer and accuracy is worse. Does Apple have a solution for drawing classification? Is a ML model still the best solution, or any AI advancements good for this use case? Thanks!

Machine Learning & AI Core AI

440

Jun ’26

Specialized models across OS updates

The docs say we can delete the source .aimodel after a .persistent specialize and keep the bookmark to save space. But an OS update always invalidates the cache and bookmarks, so it looks like anyone who deleted the source has to re-download the whole model after every update. For large models, that's a lot of bandwidth and impacts the first-time experience after the update. Is that the intended trade-off, or does the cache hold enough to re-specialize itself? Does every minor OS bumps (27.1 -> 27.2) always invalidate cache when .persisted, or only major ones? Also, can the user delete a .persistent entry themselves through Settings or storage management, or only the app? We need to know whether our "model is ready" state can disappear without the app knowing.

Machine Learning & AI Core AI

422

Jun ’26

Dynamically loading expert weights on the Neural Engine

AFM3 Core Advanced dynamically loads experts into DRAM* for Neural Engine execution. Is this capability accessible to third party via Core AI or Core ML APIs in iOS 27? Reference: https://machinelearning.apple.com/research/introducing-third-generation-of-apple-foundation-models

Machine Learning & AI Core AI

381

Jun ’26

.md summaries of WWDC for Claude

Are there any official .md summaries for Claude? How should we tell our Agents where the source of truth for WWDC new code is?

Machine Learning & AI Core AI

Replies: 1
Boosts: 0
Views: 386
Activity: Jun ’26

On Siri & Apple Intelligence

Machine Learning & AI Core AI

Replies: 1
Boosts: 0
Views: 407
Activity: Jun ’26

For Model Control & Metadata

Is there a supported way to 'pin' an app to a specific version of an on-device system model to prevent regression in agentic workflows & what is the typical update cadence for these weights?

Machine Learning & AI Core AI

Replies: 1
Boosts: 1
Views: 390
Activity: Jun ’26

Deployments and Entitemments ability

For apps using the 'MLX distributor' or local adapters, are there any specialized background processing entitlements needed to ensure inference isn't killed by the OS during long-running tasks

Machine Learning & AI Core AI