Image understanding to on-device model

Question

Created 1w

I can’t seem to find a way to include an image when prompting the new on-device model in Xcode, even though Apple explicitly states that the model was trained and tested with image data (https://machinelearning.apple.com/research/apple-foundation-models-2025-updates).

Has anyone managed to get this working, or are VLM-style capabilities simply not exposed yet?

Answer 1

DTS Engineer OP

Apple

1w

Hi @1729k, Foundation Models does not currently support images as input. Depending on your app's needs, though, you could consider pairing it with Vision, a computer vision framework that also runs on-device.

Best,

-J
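
For readers wondering what "pairing it with Vision" might look like, here is a minimal sketch, not an official recipe. It assumes the image content you care about is text: Vision's `VNRecognizeTextRequest` extracts the text, and the result is passed to a `LanguageModelSession` as an ordinary text prompt. The helper names `buildPrompt`, `recognizeText`, and `ask` are hypothetical, as is the prompt wording. Other Vision requests (classification, barcodes, and so on) could feed the prompt in the same way.

```swift
import CoreGraphics
import Foundation
import FoundationModels
import Vision

// Hypothetical helper: folds the text Vision found into a plain-text prompt.
// The prompt wording is an assumption, not something Apple prescribes.
func buildPrompt(recognizedLines: [String], question: String) -> String {
    let body = recognizedLines.isEmpty
        ? "(no text was recognized)"
        : recognizedLines.joined(separator: "\n")
    return """
    The following text was extracted from an image:
    \(body)

    Question: \(question)
    """
}

// Hypothetical helper: runs Vision text recognition on a CGImage and
// returns the top candidate string for each detected line of text.
func recognizeText(in image: CGImage) throws -> [String] {
    let request = VNRecognizeTextRequest()
    request.recognitionLevel = .accurate
    let handler = VNImageRequestHandler(cgImage: image, options: [:])
    // perform(_:) is synchronous; consider calling it off the main thread.
    try handler.perform([request])
    let observations = request.results ?? []
    return observations.compactMap { $0.topCandidates(1).first?.string }
}

// Hypothetical helper: the model never sees the image itself, only the
// text that Vision extracted from it.
@available(iOS 26.0, macOS 26.0, visionOS 26.0, *)
func ask(_ question: String, about image: CGImage) async throws -> String {
    let lines = try recognizeText(in: image)
    let session = LanguageModelSession()
    let prompt = buildPrompt(recognizedLines: lines, question: question)
    let response = try await session.respond(to: prompt)
    return response.content
}
```

Because the model only receives the extracted text, this approach cannot answer questions about visual details that Vision does not surface, such as layout, colors, or objects that no request was run for.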
