| Derived from | |
| Framework | CoreFoundation/CoreFoundation.h |
| Companion guide | |
| Declared in | CFCharacterSet.h |
A CFCharacterSet object represents a set of Unicode-compliant characters. CFString uses CFCharacterSet objects to group characters together for searching operations, so that a search can match any character from a particular set. The two opaque types, CFCharacterSet and CFMutableCharacterSet, define the interface for static and dynamic character sets, respectively. The objects you create using these opaque types are referred to as character set objects (and when no confusion will result, merely as character sets).
CFCharacterSet's principal function, CFCharacterSetIsCharacterMember, provides the basis for all other functions in its interface. You create a character set using one of the CFCharacterSetCreate... functions. You can also obtain one of the predefined character sets with the CFCharacterSetGetPredefined function.
CFCharacterSet is “toll-free bridged” with its Cocoa Foundation counterpart, NSCharacterSet. This means that the Core Foundation type is interchangeable in function or method calls with the bridged Foundation object. Therefore, in a method where you see an NSCharacterSet * parameter, you can pass in a CFCharacterSetRef, and in a function where you see a CFCharacterSetRef parameter, you can pass in an NSCharacterSet instance. This capability also applies to concrete subclasses of NSCharacterSet. See Interchangeable Data Types for more information on toll-free bridging.
CFCharacterSetCreateCopy
CFCharacterSetCreateInvertedSet
CFCharacterSetCreateWithCharactersInRange
CFCharacterSetCreateWithCharactersInString
CFCharacterSetCreateWithBitmapRepresentation
CFCharacterSetCreateBitmapRepresentation
CFCharacterSetHasMemberInPlane
CFCharacterSetIsCharacterMember
CFCharacterSetIsLongCharacterMember
CFCharacterSetIsSupersetOfSet
Creates a new immutable CFData object containing the bitmap representation of the given character set.
CFDataRef CFCharacterSetCreateBitmapRepresentation ( CFAllocatorRef alloc, CFCharacterSetRef theSet );
The allocator to use to allocate memory for the new object. Pass NULL or kCFAllocatorDefault to use the current default allocator.
The set from which to create a bitmap representation. Refer to the comments for CFCharacterSetCreateWithBitmapRepresentation for the detailed discussion of the bitmap representation format.
A new CFData object containing a bitmap representation of theSet. Ownership follows the Create Rule.
CFCharacterSet.h
Creates a new character set with the values from a given character set.
CFCharacterSetRef CFCharacterSetCreateCopy ( CFAllocatorRef alloc, CFCharacterSetRef theSet );
The allocator to use to allocate memory for the new object. Pass NULL or kCFAllocatorDefault to use the current default allocator.
The character set to copy.
A new character set that is a copy of theSet. Ownership follows the Create Rule.
This function tries to compact the backing store where applicable.
CFCharacterSet.h
Creates a new immutable character set that is the inverse of the specified character set.
CFCharacterSetRef CFCharacterSetCreateInvertedSet ( CFAllocatorRef alloc, CFCharacterSetRef theSet );
The allocator to use to allocate memory for the new object. Pass NULL or kCFAllocatorDefault to use the current default allocator.
The character set from which to create an inverted set.
A new character set that is the inverse of theSet. Ownership follows the Create Rule.
CFCharacterSet.h
Creates a new immutable character set with the bitmap representation specified by the given data.
CFCharacterSetRef CFCharacterSetCreateWithBitmapRepresentation ( CFAllocatorRef alloc, CFDataRef theData );
The allocator to use to allocate memory for the new object. Pass NULL or kCFAllocatorDefault to use the current default allocator.
A CFData object that specifies the bitmap representation of the Unicode code points for the new character set. The bitmap representation can cover the entire Unicode range, from the BMP (Plane 0) through Plane 16. The first 8 KiB (8192 bytes) of the data represent the BMP range. The 8 KiB BMP bitmap can be followed by zero to sixteen 8 KiB bitmaps, each preceded by a plane index byte. For example, a bitmap representing the BMP and Plane 2 has a size of 16385 bytes (8 KiB for the BMP, 1 index byte, and an 8 KiB bitmap for Plane 2). The plane index byte, in this case, contains the integer value two.
If the data contains a Plane index byte outside of the valid Plane range (1 to 16), the behavior is undefined.
A new character set containing the indicated characters from theData. Ownership follows the Create Rule.
CFCharacterSet.h
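The layout described for theData can be illustrated with a small portable sketch that does not call Core Foundation. The helper names bitmap_length and bitmap_contains are hypothetical. The bit order within each byte (bit n & 7 of byte n >> 3) is an assumption borrowed from the convention documented for NSCharacterSet bitmap representations; it is not stated in this reference.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

enum { kPlaneBitmapSize = 8192 };  /* 65536 bits, one per code point in a plane */

/* Byte length of a bitmap holding the BMP plus `extraPlanes` supplementary
   planes, each preceded by a 1-byte plane index. */
size_t bitmap_length(unsigned extraPlanes)
{
    assert(extraPlanes <= 16);
    return kPlaneBitmapSize + (size_t)extraPlanes * (1 + kPlaneBitmapSize);
}

/* Returns 1 if code point `cp` is set in `bitmap` (`length` bytes), else 0.
   ASSUMPTION: bit (n & 7) of byte (n >> 3) stands for offset n in a plane. */
int bitmap_contains(const uint8_t *bitmap, size_t length, uint32_t cp)
{
    uint32_t plane = cp >> 16;
    uint32_t offset = cp & 0xFFFF;
    const uint8_t *planeBits = NULL;

    if (cp > 0x10FFFF || length < kPlaneBitmapSize) return 0;
    if (plane == 0) {
        planeBits = bitmap;
    } else {
        /* Walk the (index byte, 8 KiB bitmap) records that follow the BMP. */
        size_t pos = kPlaneBitmapSize;
        while (pos + 1 + kPlaneBitmapSize <= length) {
            if (bitmap[pos] == plane) { planeBits = bitmap + pos + 1; break; }
            pos += 1 + kPlaneBitmapSize;
        }
        if (planeBits == NULL) return 0;  /* plane not present in the data */
    }
    return (planeBits[offset >> 3] >> (offset & 7)) & 1;
}
```

With these helpers, bitmap_length(1) evaluates to 16385, matching the BMP plus Plane 2 example above.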
Creates a new character set with the values from the given range of Unicode characters.
CFCharacterSetRef CFCharacterSetCreateWithCharactersInRange ( CFAllocatorRef alloc, CFRange theRange );
The allocator to use to allocate memory for the new object. Pass NULL or kCFAllocatorDefault to use the current default allocator.
The Unicode range of characters of the new character set. The range is expressed in 32-bit UTF-32 code point values. The valid code point range is from 0x00000 to 0x10FFFF.
A new character set that contains a contiguous range of Unicode characters. Ownership follows the Create Rule.
CFCharacterSet.h
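The bounds stated for theRange can be checked before calling the function. The following is a minimal sketch, assuming a hypothetical CodePointRange struct that mirrors the location and length fields of CFRange so the code compiles without Core Foundation. It only restates the documented limits and does not reflect any validation Core Foundation itself performs.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for CFRange (location and length, both signed). */
typedef struct { long location; long length; } CodePointRange;

/* True if every code point in the range lies within 0x0 to 0x10FFFF. */
bool range_is_valid(CodePointRange r)
{
    const long kMaxCodePoint = 0x10FFFF;
    if (r.location < 0 || r.length < 0) return false;
    if (r.location > kMaxCodePoint) return false;
    /* Equivalent to location + length - 1 <= max, written to avoid overflow. */
    return r.length <= kMaxCodePoint + 1 - r.location;
}

/* Last code point covered by a valid, non-empty range. */
long range_last(CodePointRange r)
{
    assert(range_is_valid(r) && r.length > 0);
    return r.location + r.length - 1;
}
```

For example, a range with location 0x41 and length 26 covers the uppercase ASCII letters and ends at 0x5A.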
Creates a new character set with the values in the given string.
CFCharacterSetRef CFCharacterSetCreateWithCharactersInString ( CFAllocatorRef alloc, CFStringRef theString );
The allocator to use to allocate memory for the new object. Pass NULL or kCFAllocatorDefault to use the current default allocator.
A string containing the characters for the new set.
A new character set containing the characters from theString. Ownership follows the Create Rule.
CFCharacterSet.h
Returns a predefined character set.
CFCharacterSetRef CFCharacterSetGetPredefined ( CFCharacterSetPredefinedSet theSetIdentifier );
The identifier of the predefined character set to return. See “Predefined CFCharacterSet Selector Values” for the list of available character sets.
The predefined character set identified by theSetIdentifier. This instance is owned by Core Foundation.
CFCharacterSet.h
Returns the type identifier of the CFCharacterSet opaque type.
CFTypeID CFCharacterSetGetTypeID ( void );
The type identifier of the CFCharacterSet opaque type.
CFMutableCharacterSet objects have the same type identifier as CFCharacterSet objects.
CFCharacterSet.h
Reports whether a character set contains at least one member character in the specified plane.
Boolean CFCharacterSetHasMemberInPlane ( CFCharacterSetRef theSet, CFIndex thePlane );
The character set to examine.
The plane number to check for membership. Valid values range from 0 to 16. If the value is outside the valid plane number range, the behavior is undefined.
true if at least one member character is in the specified plane, otherwise false.
CFCharacterSet.h
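A plane number in the range 0 to 16 corresponds to the bits of a code point above its low 16 bits. The sketch below shows that arithmetic in portable C; the function names plane_of, plane_first, and plane_last are hypothetical and are not part of Core Foundation.

```c
#include <assert.h>
#include <stdint.h>

/* Plane index (0 to 16) of a code point: the bits above the low 16. */
unsigned plane_of(uint32_t cp)
{
    assert(cp <= 0x10FFFF);
    return (unsigned)(cp >> 16);
}

/* First code point of a plane. */
uint32_t plane_first(unsigned plane)
{
    assert(plane <= 16);
    return (uint32_t)plane << 16;
}

/* Last code point of a plane. */
uint32_t plane_last(unsigned plane)
{
    return plane_first(plane) | 0xFFFFu;
}
```

A value computed this way for a valid code point always falls in the range accepted for thePlane.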
Reports whether or not a given Unicode character is in a character set.
Boolean CFCharacterSetIsCharacterMember ( CFCharacterSetRef theSet, UniChar theChar );
The character set to examine.
The Unicode character to test for membership in the character set. Note that this function takes a 16-bit Unicode character value; hence, it cannot test characters in the non-BMP planes.
true if theSet contains theChar, otherwise false.
CFCharacterSet.h
Reports whether a given UTF-32 character is in a character set.
Boolean CFCharacterSetIsLongCharacterMember ( CFCharacterSetRef theSet, UTF32Char theChar );
The character set to examine.
The UTF-32 character to test for membership in the character set.
true if theSet contains theChar, otherwise false.
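Because CFCharacterSetIsCharacterMember takes a 16-bit value, a character outside the BMP that appears in UTF-16 text as a surrogate pair has to be combined into a single UTF-32 value before it can be tested with CFCharacterSetIsLongCharacterMember. The following sketch shows the standard surrogate-pair arithmetic in portable C; the helper names are hypothetical, and uint16_t and uint32_t stand in for UniChar and UTF32Char.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* High (leading) surrogates occupy 0xD800 to 0xDBFF. */
bool is_high_surrogate(uint16_t u) { return u >= 0xD800 && u <= 0xDBFF; }

/* Low (trailing) surrogates occupy 0xDC00 to 0xDFFF. */
bool is_low_surrogate(uint16_t u) { return u >= 0xDC00 && u <= 0xDFFF; }

/* Combines a surrogate pair into the code point it encodes. */
uint32_t combine_surrogates(uint16_t high, uint16_t low)
{
    assert(is_high_surrogate(high) && is_low_surrogate(low));
    return 0x10000u
         + (((uint32_t)high - 0xD800u) << 10)
         + ((uint32_t)low - 0xDC00u);
}
```

For example, the pair 0xD83D, 0xDE00 combines to 0x1F600, a Plane 1 code point that a 16-bit parameter cannot express.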
CFCharacterSet.h
Reports whether a character set is a superset of another set.
Boolean CFCharacterSetIsSupersetOfSet ( CFCharacterSetRef theSet, CFCharacterSetRef theOtherset );
The character set to test as a superset of theOtherSet.
The character set to test as a subset of theSet.
true if theSet is a superset of theOtherSet, otherwise false.
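The superset relation can be modeled on raw bitmaps: one set is a superset of another when no bit set in the second is clear in the first. The sketch below applies that test to two equally sized bitmaps, such as the 8 KiB BMP portions of two bitmap representations. It illustrates the semantics only; the function name is hypothetical, and nothing here describes how Core Foundation implements the check.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Returns 1 if every bit set in `other` is also set in `set`, else 0.
   Both bitmaps must be `length` bytes long. */
int bitmap_is_superset(const uint8_t *set, const uint8_t *other, size_t length)
{
    assert(set != NULL && other != NULL);
    for (size_t i = 0; i < length; i++) {
        if (other[i] & ~set[i]) return 0;  /* a member of other is missing */
    }
    return 1;
}
```

Under this model every set is a superset of the empty set and of itself.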
CFCharacterSet.h
Defines a predefined character set.
typedef CFIndex CFCharacterSetPredefinedSet;
See “Predefined CFCharacterSet Selector Values” for values.
CFCharacterSet.h
A reference to an immutable character set object.
typedef const struct __CFCharacterSet *CFCharacterSetRef;
CFCharacterSet.h
Identifiers for the available predefined CFCharacterSet objects.
enum {
    kCFCharacterSetControl = 1,
    kCFCharacterSetWhitespace,
    kCFCharacterSetWhitespaceAndNewline,
    kCFCharacterSetDecimalDigit,
    kCFCharacterSetLetter,
    kCFCharacterSetLowercaseLetter,
    kCFCharacterSetUppercaseLetter,
    kCFCharacterSetNonBase,
    kCFCharacterSetDecomposable,
    kCFCharacterSetAlphaNumeric,
    kCFCharacterSetPunctuation,
    kCFCharacterSetCapitalizedLetter = 13,
    kCFCharacterSetSymbol = 14,
    kCFCharacterSetNewline = 15,
    kCFCharacterSetIllegal = 12
};
kCFCharacterSetControl
Control character set (Unicode General Category Cc and Cf).
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetWhitespace
Whitespace character set (Unicode General Category Zs and U+0009 CHARACTER TABULATION).
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetWhitespaceAndNewline
Whitespace and newline character set (Unicode General Category Z*, U+000A through U+000D, and U+0085).
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetDecimalDigit
Decimal digit character set.
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetLetter
Letter character set (Unicode General Category L* and M*).
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetLowercaseLetter
Lowercase character set (Unicode General Category Ll).
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetUppercaseLetter
Uppercase character set (Unicode General Category Lu and Lt).
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetNonBase
Non-base character set (Unicode General Category M*).
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetDecomposable
Canonically decomposable character set.
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetAlphaNumeric
Alphanumeric character set (Unicode General Category L*, M*, and N*).
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetPunctuation
Punctuation character set (Unicode General Category P*).
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
kCFCharacterSetCapitalizedLetter
Titlecase character set (Unicode General Category Lt).
Available in Mac OS X v10.2 and later.
Declared in CFCharacterSet.h
kCFCharacterSetSymbol
Symbol character set (Unicode General Category S*).
Available in Mac OS X v10.3 and later.
Declared in CFCharacterSet.h
kCFCharacterSetNewline
Newline character set (U+000A through U+000D, U+0085, U+2028, and U+2029).
Available in Mac OS X v10.5 and later.
Declared in CFCharacterSet.h
kCFCharacterSetIllegal
Illegal character set.
Available in Mac OS X v10.0 and later.
Declared in CFCharacterSet.h
Use these constants with the CFCharacterSetGetPredefined function to get one of the predefined character sets.
CFCharacterSet.h
Last updated: 2006-12-01