Important: The information in this document is obsolete and should not be used for new development.
This document has had the following releases:
October 4, 1999 | The following changes were made to reflect revisions for TEC 1.5: |
Removed script code kTextEncodingMacUkrainian from enumeration in Text Encoding Base . | |
Added the following ISO 8-bit and 7-bit script codes to enumeration under
Text Encoding Base
.: kTextEncodingISOLatin6 = 0x020A, /* ISO 8859-10 */ kTextEncodingISOLatin7 = 0x020D, /* ISO 8859-13, Baltic Rim */ kTextEncodingISOLatin8 = 0x020E, /* ISO 8859-14, Celtic */ kTextEncodingISOLatin9 = 0x020F, /* ISO 8859-15, 8859-1 + EURO etc */ |
|
Added the following script code to enumeration under
Text Encoding Base
: kTextEncodingUnknown = 0xFFFF /* Unknown or unspecified * |
|
Under Text Encoding Variant , added information indicating that minor variants can reflect minor changes to an encoding (such as the addition of the EURO SIGN character). Removed obsolete information describing bit masks to indicate variants. | |
Made the following changes to the enumeration under
Text Encoding Variant
: Removed current kTextEncodingMacRoman variants kMacRomanStandardVariant = 0, kMacRomanCurrencySignVariant = 1, and replaced them with kMacRomanDefaultVariant = 0, /* meta value, maps to 1 or 2 */ kMacRomanCurrencySignVariant = 1, /* for Mac OS version < 8.5 */ kMacRomanEuroSignVariant = 2, /* for Mac OS version >= 8.5 */ |
|
Removed current kTextEncodingMacIcelandic variants kMacIcelandicStandardVariant = 0, kMacIcelandicTrueTypeVariant = 1 and replaced them with: kMacIcelandicStdDefaultVariant = 0, /* meta value, maps to 2 or 4 */ kMacIcelandicTTDefaultVariant = 1, /* meta value, maps to 3 or 5 */ /* The following are for Mac OS version < 8.5 */ kMacIcelandicStdCurrSignVariant = 2, // 0xBB/0xBC are ord. indicators kMacIcelandicTTCurrSignVariant = 3, // 0xBB/0xBC are fi/fl ligatures /* The following are for Mac OS version >= 8.5 */ kMacIcelandicStdEuroSignVariant = 4, // 0xBB/0xBC are ord. indicators kMacIcelandicTTEuroSignVariant = 5, // 0xBB/0xBC are fi/fl ligatures |
|
Added new variants: /* Variants of kTextEncodingMacCyrillic */ kMacCyrillicDefaultVariant = 0, // meta value, maps to 1, 2, or 3 kMacCyrillicCurrSignStdVariant = 1, //for Russian & Bulg Mac OS < 9.0 kMacCyrillicCurrSignUkrVariant = 2, //for Ukraine Mac OS < 9.0 & CyrLK kMacCyrillicEuroSignVariant = 3, // for Mac OS >= 9.0 /* Variants of kTextEncodingMacCroatian */ kMacCroatianDefaultVariant = 0, /* meta value, maps to 1 or 2 */ kMacCroatianCurrencySignVariant = 1, /* Mac OS version < 8.5 */ kMacCroatianEuroSignVariant = 2, /* Mac OS version >= 8.5 */ /* Variants of kTextEncodingMacRomanian */ kMacRomanianDefaultVariant = 0, /* meta value, maps to 1 or 2 */ kMacRomanianCurrencySignVariant = 1, /* Mac OS version < 8.5 */ kMacRomanianEuroSignVariant = 2, /* Mac OS version >= 8.5 */ /* Variants of kTextEncodingMacVT100 */ kMacVT100DefaultVariant = 0, /* meta value, maps to 1 or 2 */ kMacVT100CurrencySignVariant = 1, /* Mac OS version < 8.5 */ kMacVT100EuroSignVariant = 2, /* Mac OS version >= 8.5 */ |
|
Added new section, Unicode Character Properties . | |
New fields added to the TECInfo data structure. The kTECInfoCurrentFormat is now set to 2, and the following other constants were added: kTECAddTextRunHeuristicsBit = 6, kTECAddFallbackInterruptBit = 7< kTECAddTextRunHeuristicsMask = 1L << kTECAddTextRunHeuristicsBit, kTECAddFallbackInterruptMask = 1L << kTECAddFallbackInterruptBit |
|
Function description for ResolveDefaultTextEncoding changed to read "Currently, this affects only the base encoding and variant values packed into the text encoding specification." | |
Added new section Finding Mac OS Encodings that Match Other Encodings and added new function NearestMacTextEncodings . | |
Added new section Obtaining Unicode Character Properties and added new function UCGetCharProperty . | |
Added new sectionSetting Conversion Options and added new function TECSetBasicOptions. | |
Under
Unicode Mapping Versions
:
kUnicodeUseHFSPlusMapping now correctly set to 4, not -4.
New constant and corresponding mask added: kUnicodeTextRunHeuristicsBit = 11 kUnicodeTextRunHeuristicsMask = 1L << kUnicodeTextRunHeuristicsBit Additional text changes throughout the chapter to reflect these additions. |
|
Added new constant, kUnicodeFallbackInterruptSafeMask, to Fallback-Handler Control Flags . Added text to application-defined function MyUnicodeToTextFallbackProc to indicate proper usage of this constant. | |
Clarified function description for function UnicodeMapping. | |
Wording changed throughout the section Converting From Unicode to reflect that you can now convert strings from one Unicode variant to another. | |
Some Character Encodings and Their Common Internet Names : Added the following character encoding Internet names to Table 1 : Under Western European Languages: ISO 8859-3 (Latin 3) and ISO 8859-15 (Latin 9) Under Central European Languages: ISO 8859-4 (Latin 4) and CP 1257 (Windows BalticRim) Under Vietnamese, CP 1258 (Windows Vietnamese) | |
October 23, 1998 | The following corrections were made to reflect revisions for TEC 1.4. |
Noted "Text Encoding Converter 1.4" where applicable. | |
Constants added to the enumeration in "Text Encoding Base" (page 31):
kTextEncodingMacCeltic = 39 kTextEncodingMacGaelic = 40 kTextEncodingMacInuit = 0xEC kTextEncodingUnicodeV2_1 = 0x103 kTextEncodingISOLatin3 = 0x203 kTextEncodingISOLatin4 = 0x204 kTextEncodingWindowsVietnamese = 0x508 |
|
Constants added to the enumeration in "Text Encoding Variant" (page 36):
kMacRomanStandardVariant = 0 kMacRomanCurrencySignVariant = 1 |
|
Constants deleted from the enumeration in "Text Encoding Variant" (page 36):
kJapaneseNoOneByteKanaOption = 0x20 kJapaneseUseAsciiBackslashOption = 0x40 |
|
Constants added to the enumerations associated with the TECInfo structure (page 47):
kTECTextRunBitClearFixBit = 2 kTECTextToUnicodeScanFixBit = 3 kTECAddForceASCIIChangesBit = 4 kTECPreferredEncodingFixBit = 5 kTECTextRunBitClearFixMask = 1L << kTECTextRunBitClearFixBit, kTECTextToUnicodeScanFixMask = 1L << kTECTextToUnicodeScanFixBit, kTECAddForceASCIIChangesMask = 1L << kTECAddForceASCIIChangesBit, kTECPreferredEncodingFixMask = 1L << kTECPreferredEncodingFixBit |
|
Constants added to the first enumeration in "Conversion Control Flags" (page 112):
kUnicodeForceASCIIRangeBit = 9 kUnicodeNoHalfwidthCharsBit = 10 |
|
Constants added to the second enumeration in "Conversion Control Flags" (page 112):
kUnicodeForceASCIIRangeMask = 1L << kUnicodeForceASCIIRangeBit kUnicodeNoHalfwidthCharsMask = 1L << kUnicodeNoHalfwidthCharsBit |
|
Changes to Table C-1 (page 247): New entry for CP 850 (DOS Latin-1). Other entries updated for correct version of Text Encoding Converter or Unicode Converter support. |