Introduction to Speech Synthesis Programming Guide

Speech synthesis, also called text-to-speech, is the generation of synthetic speech. An application or other process sends text to a speech synthesizer, which creates a spoken version that can be output through the audio hardware or saved to a file.

This document covers speech synthesis support in OS X. It provides an overview of speech components and the speech synthesis process, and it describes how to incorporate and manipulate synthesized speech in your application.

You should read this document to learn about speech synthesis in OS X and about how you can customize your application’s spoken output. If you’re unfamiliar with the concepts of synthesized speech, be sure to read Speech Synthesis in OS X for an overview of how speech fits into the operating system and of the speech generation process. Carbon, Cocoa, and AppleScript provide APIs to produce spoken output. Be sure to read the API overviews in Designing and Implementing an Application That Speaks to find out which programming language provides the features you need.

Organization of This Document

Speech Synthesis Programming Guide contains the following chapters and appendixes:

Speech Synthesis in OS X provides an overview of the speech generation process and its components, and describes ways you can use and customize spoken output in your application.
Designing and Implementing an Application That Speaks describes some design strategies and guidelines for providing spoken output in your application. It also outlines the speech synthesis APIs available in Cocoa, Carbon, and AppleScript, and provides some sample code.
Techniques for Customizing Synthesized Speech details several methods for customizing your application’s spoken output, and describes a handful of ways to improve synthesized speech.
Syntax of Embedded Speech Commands describes the formal syntax of embedded speech commands.
Phonemes lists the North American English phoneme symbols the MacinTalk synthesizer recognizes.
Glossary contains definitions of speech-related terms used in this document. Every term you see displayed in bold font, such as speech synthesizer, appears in the Glossary.
Revision History lists the changes made to this document.

Introduction to Speech Synthesis Programming Guide

Organization of This Document

See Also