Introduction to Speech Synthesis Programming Guide

Speech synthesis, also called text-to-speech, is the generation of synthetic speech. An application or other process sends text to a speech synthesizer, which creates a spoken version that can be output through the audio hardware or saved to a file.

This document covers speech synthesis support in OS X. It provides an overview of speech components and the speech synthesis process, and it describes how to incorporate and manipulate synthesized speech in your application.

Read this document to learn about speech synthesis in OS X and how to customize your application’s spoken output. If you’re unfamiliar with the concepts of synthesized speech, start with Speech Synthesis in OS X, which gives an overview of how speech fits into the operating system and of the speech generation process. Carbon, Cocoa, and AppleScript each provide APIs for producing spoken output; read the API overviews in Designing and Implementing an Application That Speaks to find out which of them provides the features you need.

Organization of This Document

Speech Synthesis Programming Guide contains the following chapters and appendixes:

See Also

In addition to this document, the Speech Technologies section of the Reference Library (Reference Library > User Experience > Speech Technologies) contains several other resources to help you take advantage of synthesized speech in your application.