face and body detection is local model or a cloud model?

Is the face and body detection service in the Vision framework a local model or a cloud model? https://developer.apple.com/documentation/vision

face and body detection is local model or a cloud model?
 
 
Q