I would like to do speech recognition, but I want the transcription to be entirely in phonemes instead of English words. This is doable with PocketSphinx, so I would think it shouldn't take much effort for Apple to implement. My guess is that the Speech framework is essentially a wrapper around something like PocketSphinx. Would it be possible to create my own "Locale" with a dictionary of words that are really just phonemes? I haven't looked much into what a Locale consists of.
Speech Framework Phoneme Recognition
Is this doable using contextualStrings?
https://developer.apple.com/reference/speech/sfspeechrecognitionrequest/1649391-contextualstrings
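
For reference, here is a minimal sketch of what I have in mind with contextualStrings. The phoneme spellings in phonemeLikeTokens and the helper names makePhonemeBiasedRequest and transcribe are made up for illustration. As far as I understand, contextualStrings only hints at phrases that may be outside the system vocabulary, so I am assuming (not claiming) that it could nudge the recognizer toward these tokens. The result would presumably still be ordinary transcription text, not a true phoneme stream.

```swift
import Foundation
import Speech

// Hypothetical phoneme-like tokens. These spellings are placeholders,
// not a phoneme set defined by the Speech framework.
let phonemeLikeTokens = ["AH", "B", "K", "D", "EH"]

// Build a file-based request and attach the tokens as contextual hints.
func makePhonemeBiasedRequest(forAudioAt url: URL) -> SFSpeechURLRecognitionRequest {
    let request = SFSpeechURLRecognitionRequest(url: url)
    // Assumption: the hints bias recognition toward these strings.
    // Nothing here guarantees phoneme-level output.
    request.contextualStrings = phonemeLikeTokens
    return request
}

// Run recognition on an audio file and print the final transcription.
// Speech recognition authorization must already have been granted
// (see SFSpeechRecognizer.requestAuthorization).
func transcribe(fileAt url: URL) {
    guard let recognizer = SFSpeechRecognizer(locale: Locale(identifier: "en-US")),
          recognizer.isAvailable else {
        print("Speech recognizer unavailable")
        return
    }
    let request = makePhonemeBiasedRequest(forAudioAt: url)
    _ = recognizer.recognitionTask(with: request) { result, error in
        if let result = result, result.isFinal {
            print(result.bestTranscription.formattedString)
        } else if let error = error {
            print("Recognition failed: \(error)")
        }
    }
}
```

Calling transcribe(fileAt:) with the URL of a recorded audio file would show whether the hints change the output at all. If the transcription still comes back as English words, that would suggest contextualStrings is not enough on its own.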