According to the documentation, you can set the mediaFileURL of a custom message to an audio file, which will then play and display with a graphical waveform representation.
As far as I can tell (based on other posts, my own testing, and looking at the code), this is not the case at all.
What's really irritating is that if you use a video file, it works, but the video is perpetually muted. There appears to be no way for either the end user or the developer to enable sound.