Global Variable

NSLinguisticTagSchemeLanguage

Supplies the language for a token, if one can be determined.

Declaration

const NSLinguisticTagScheme NSLinguisticTagSchemeLanguage;

Discussion

Each value for this tag scheme is a BCP-47 language identifier. For example, the language identifier for English is "en" and the identifier for Chinese written using the Simplified Chinese script is "zh-Hans". The identifier "und" is used if a specific language cannot be determined.

The tagger generally attempts to determine the language of text at the level of an entire sentence, paragraph, or document, rather than word by word.

See Also

Schemes

NSLinguisticTagSchemeTokenType

Classifies tokens according to their broad type: word, punctuation, or whitespace.

NSLinguisticTagSchemeLexicalClass

Classifies tokens according to class: part of speech, type of punctuation, or whitespace.

NSLinguisticTagSchemeNameType

Classifies tokens according to whether they are part of a named entity.

NSLinguisticTagSchemeNameTypeOrLexicalClass

Classifies tokens corresponding to names according to NSLinguisticTagSchemeNameType, and classifies all other tokens according to NSLinguisticTagSchemeLexicalClass.

NSLinguisticTagSchemeLemma

Supplies a stem form of a word token, if known.

NSLinguisticTagSchemeScript

Supplies the script for a token, if one can be determined.