Best strategy for a "Sing that Tune" game?

I am trying to build a "Sing that Tune" game. For example:

  1. The app will tell the user to sing "Row, Row, Row Your Boat."
  2. The user will sing "Row, Row, Row Your Boat" into the microphone.
  3. If the user's melody is close enough to the actual melody, the game is won.
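
For illustration, "close enough" in step 3 could be made concrete roughly as follows. This is only a sketch under stated assumptions: it assumes the microphone audio has already been converted to a list of MIDI note numbers by some pitch tracker (not shown), and the function names, the one-semitone tolerance, and the threshold of 2 are hypothetical placeholders, not part of ShazamKit or Create ML. It is written in Python only to keep it short. It compares the steps between notes rather than the notes themselves, so singing in a different key does not count against the user.

```python
def intervals(midi_notes):
    """Semitone steps between consecutive notes; this discards the absolute key."""
    return [b - a for a, b in zip(midi_notes, midi_notes[1:])]


def melody_distance(sung, reference, tolerance=1):
    """Edit distance between the two interval sequences.

    Two intervals count as a match when they differ by at most `tolerance`
    semitones; every mismatch, extra note, or missing note costs 1.
    """
    a, b = intervals(sung), intervals(reference)
    prev = list(range(len(b) + 1))  # cost of aligning an empty prefix of a
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            substitute = prev[j - 1] + (0 if abs(x - y) <= tolerance else 1)
            curr.append(min(substitute, prev[j] + 1, curr[j - 1] + 1))
        prev = curr
    return prev[-1]


def is_close_enough(sung, reference, max_cost=2):
    """Hypothetical win condition: allow at most `max_cost` melodic mistakes."""
    return melody_distance(sung, reference) <= max_cost


# Opening of the tune in C major as MIDI notes: C C C D E, E D E F G.
REFERENCE = [60, 60, 60, 62, 64, 64, 62, 64, 65, 67]

if __name__ == "__main__":
    transposed = [n + 5 for n in REFERENCE]  # same tune, sung higher
    print(is_close_enough(transposed, REFERENCE))
    print(is_close_enough([60, 72] * 5, REFERENCE))  # octave leaps, unrelated
```

The hard part this sketch skips is getting reliable note values out of live singing; the comparison itself is cheap once that exists.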

My question: Since I'm dealing with live audio that might be "correct" but not "exact," is the best strategy to use ShazamKit and an SHCustomCatalog, or is it better to use Create ML and sound classification? I know a Create ML model can learn the difference between a baby and a firetruck, but can it learn the difference between a good guess and a wrong guess of a sung melody?

Thank you,

Eli

Replies

ShazamKit won’t recognize whether a user is singing a melody. Shazam only recognizes specific recordings.

I’m sorry that I don’t know the solution to your problem, but ShazamKit is not it.