
Important: The information in this document is obsolete and should not be used for new development.

Programming With the Text Encoding Conversion Manager



Document Version History

This document has had the following releases:

Table A-1  Text Encoding Conversion Manager revision history

Version

Notes

October 4, 1999
The following changes were made to reflect revisions for TEC 1.5:
Removed script code kTextEncodingMacUkrainian from the enumeration in Text Encoding Base.
Added the following ISO 8-bit and 7-bit script codes to the enumeration under Text Encoding Base:
kTextEncodingISOLatin6 = 0x020A, /* ISO 8859-10 */
kTextEncodingISOLatin7 = 0x020D, /* ISO 8859-13, Baltic Rim */
kTextEncodingISOLatin8 = 0x020E, /* ISO 8859-14, Celtic */
kTextEncodingISOLatin9 = 0x020F, /* ISO 8859-15, 8859-1 + EURO etc */
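The comments on these four constants give the ISO 8859 part each one stands for. As a minimal sketch, the helper below turns that correspondence into a lookup. The function iso8859_part_for_base is hypothetical and is not part of the Text Encoding Conversion Manager; the enumeration simply repeats the values listed above so the fragment compiles on its own.

```c
/* Values repeated from the list above so the fragment is self-contained. */
enum {
    kTextEncodingISOLatin6 = 0x020A, /* ISO 8859-10 */
    kTextEncodingISOLatin7 = 0x020D, /* ISO 8859-13, Baltic Rim */
    kTextEncodingISOLatin8 = 0x020E, /* ISO 8859-14, Celtic */
    kTextEncodingISOLatin9 = 0x020F  /* ISO 8859-15, 8859-1 + EURO etc */
};

/* Hypothetical helper (an assumption, not a TEC function): returns the
   ISO 8859 part number for one of the four base codes added in TEC 1.5,
   or 0 for any value this sketch does not know about. */
static int iso8859_part_for_base(unsigned long base)
{
    switch (base) {
    case kTextEncodingISOLatin6: return 10;
    case kTextEncodingISOLatin7: return 13;
    case kTextEncodingISOLatin8: return 14;
    case kTextEncodingISOLatin9: return 15;
    default:                     return 0;
    }
}
```

A caller could use the nonzero result to build a display name such as "ISO 8859-15"; a result of 0 only means the value is outside the four codes covered here.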
Added the following script code to the enumeration under Text Encoding Base:
kTextEncodingUnknown = 0xFFFF /* Unknown or unspecified */
Under Text Encoding Variant, added information indicating that minor variants can reflect minor changes to an encoding (such as the addition of the EURO SIGN character). Removed obsolete information describing bit masks to indicate variants.
Made the following changes to the enumeration under Text Encoding Variant:
Removed the current kTextEncodingMacRoman variants:
kMacRomanStandardVariant = 0,
kMacRomanCurrencySignVariant = 1,

and replaced them with:
kMacRomanDefaultVariant = 0, /* meta value, maps to 1 or 2 */
kMacRomanCurrencySignVariant = 1, /* for Mac OS version < 8.5 */
kMacRomanEuroSignVariant = 2, /* for Mac OS version >= 8.5 */
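The comments above describe the default variant as a meta value that stands for one of the two concrete variants, depending on the Mac OS version. The following is a minimal sketch of that mapping only. The function resolve_mac_roman_variant and the version encoding (0x0850 meaning 8.5) are assumptions made for illustration; this document does not describe how the system itself performs the resolution.

```c
/* Values repeated from the list above so the fragment is self-contained. */
enum {
    kMacRomanDefaultVariant      = 0, /* meta value, maps to 1 or 2 */
    kMacRomanCurrencySignVariant = 1, /* for Mac OS version < 8.5 */
    kMacRomanEuroSignVariant     = 2  /* for Mac OS version >= 8.5 */
};

/* Hypothetical helper (an assumption, not a TEC function).
   os_version is assumed to be encoded as 0x0850 for Mac OS 8.5,
   0x0900 for Mac OS 9.0, and so on. */
static unsigned long resolve_mac_roman_variant(unsigned long variant,
                                               unsigned int os_version)
{
    /* Concrete variants are returned unchanged. */
    if (variant != kMacRomanDefaultVariant)
        return variant;

    /* The meta value maps to the euro-sign variant on 8.5 and later,
       and to the currency-sign variant on earlier systems. */
    return (os_version >= 0x0850) ? kMacRomanEuroSignVariant
                                  : kMacRomanCurrencySignVariant;
}
```

The same shape would apply to the other encodings whose default variant is documented as a meta value, with the version threshold taken from the comments on each enumeration.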
Removed the current kTextEncodingMacIcelandic variants:
kMacIcelandicStandardVariant = 0,
kMacIcelandicTrueTypeVariant = 1

and replaced them with:
kMacIcelandicStdDefaultVariant = 0, /* meta value, maps to 2 or 4 */
kMacIcelandicTTDefaultVariant = 1, /* meta value, maps to 3 or 5 */

/* The following are for Mac OS version < 8.5 */
kMacIcelandicStdCurrSignVariant = 2, // 0xBB/0xBC are ord. indicators
kMacIcelandicTTCurrSignVariant = 3, // 0xBB/0xBC are fi/fl ligatures

/* The following are for Mac OS version >= 8.5 */
kMacIcelandicStdEuroSignVariant = 4, // 0xBB/0xBC are ord. indicators
kMacIcelandicTTEuroSignVariant = 5, // 0xBB/0xBC are fi/fl ligatures
Added new variants:
/* Variants of kTextEncodingMacCyrillic */
kMacCyrillicDefaultVariant = 0, // meta value, maps to 1, 2, or 3
kMacCyrillicCurrSignStdVariant = 1, // for Russian & Bulg Mac OS < 9.0
kMacCyrillicCurrSignUkrVariant = 2, // for Ukraine Mac OS < 9.0 & CyrLK
kMacCyrillicEuroSignVariant = 3, // for Mac OS >= 9.0

/* Variants of kTextEncodingMacCroatian */
kMacCroatianDefaultVariant = 0, /* meta value, maps to 1 or 2 */
kMacCroatianCurrencySignVariant = 1, /* Mac OS version < 8.5 */
kMacCroatianEuroSignVariant = 2, /* Mac OS version >= 8.5 */

/* Variants of kTextEncodingMacRomanian */
kMacRomanianDefaultVariant = 0, /* meta value, maps to 1 or 2 */
kMacRomanianCurrencySignVariant = 1, /* Mac OS version < 8.5 */
kMacRomanianEuroSignVariant = 2, /* Mac OS version >= 8.5 */

/* Variants of kTextEncodingMacVT100 */
kMacVT100DefaultVariant = 0, /* meta value, maps to 1 or 2 */
kMacVT100CurrencySignVariant = 1, /* Mac OS version < 8.5 */
kMacVT100EuroSignVariant = 2, /* Mac OS version >= 8.5 */
Added new section, Unicode Character Properties.
Added new fields to the TECInfo data structure. The kTECInfoCurrentFormat constant is now set to 2, and the following other constants were added:
kTECAddTextRunHeuristicsBit = 6,
kTECAddFallbackInterruptBit = 7,
kTECAddTextRunHeuristicsMask = 1L << kTECAddTextRunHeuristicsBit,
kTECAddFallbackInterruptMask = 1L << kTECAddFallbackInterruptBit
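Each mask above is the value 1 shifted left by its bit number, so a feature can be tested with a bitwise AND. The sketch below shows that pattern. The helper has_feature and the plain unsigned long features word are assumptions for illustration; which TECInfo field carries these bits is not stated in this revision history.

```c
/* Values repeated from the list above so the fragment is self-contained. */
enum {
    kTECAddTextRunHeuristicsBit  = 6,
    kTECAddFallbackInterruptBit  = 7,
    kTECAddTextRunHeuristicsMask = 1L << kTECAddTextRunHeuristicsBit, /* 0x40 */
    kTECAddFallbackInterruptMask = 1L << kTECAddFallbackInterruptBit  /* 0x80 */
};

/* Hypothetical helper (an assumption, not a TEC function): returns 1 if
   every bit in mask is set in the features word, otherwise 0. */
static int has_feature(unsigned long features, unsigned long mask)
{
    return (features & mask) == mask;
}
```

Testing against the mask rather than the bit number keeps the call site independent of where the bit sits in the word.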
Function description for ResolveDefaultTextEncoding changed to read "Currently, this affects only the base encoding and variant values packed into the text encoding specification."
Added new section Finding Mac OS Encodings that Match Other Encodings and added new function NearestMacTextEncodings.
Added new section Obtaining Unicode Character Properties and added new function UCGetCharProperty.
Added new section Setting Conversion Options and added new function TECSetBasicOptions.
Under Unicode Mapping Versions: kUnicodeUseHFSPlusMapping is now correctly set to 4, not -4. New constant and corresponding mask added:
kUnicodeTextRunHeuristicsBit = 11
kUnicodeTextRunHeuristicsMask = 1L << kUnicodeTextRunHeuristicsBit
Additional text changes throughout the chapter to reflect these additions.
Added new constant, kUnicodeFallbackInterruptSafeMask, to Fallback-Handler Control Flags. Added text to the application-defined function MyUnicodeToTextFallbackProc to indicate proper usage of this constant.
Clarified the description of UnicodeMapping.
Wording changed throughout the section Converting From Unicode to reflect that you can now convert strings from one Unicode variant to another.
Some Character Encodings and Their Common Internet Names: Added the following character encoding Internet names to Table 1:
Under Western European Languages: ISO 8859-3 (Latin 3) and ISO 8859-15 (Latin 9)
Under Central European Languages: ISO 8859-4 (Latin 4) and CP 1257 (Windows BalticRim)
Under Vietnamese: CP 1258 (Windows Vietnamese)
October 23, 1998
The following corrections were made to reflect revisions for TEC 1.4:
Noted "Text Encoding Converter 1.4" where applicable.
Constants added to the enumeration in "Text Encoding Base" (page 31):
kTextEncodingMacCeltic = 39
kTextEncodingMacGaelic = 40
kTextEncodingMacInuit = 0xEC
kTextEncodingUnicodeV2_1 = 0x103
kTextEncodingISOLatin3 = 0x203
kTextEncodingISOLatin4 = 0x204
kTextEncodingWindowsVietnamese = 0x508
Constants added to the enumeration in "Text Encoding Variant" (page 36):
kMacRomanStandardVariant = 0
kMacRomanCurrencySignVariant = 1
Constants deleted from the enumeration in "Text Encoding Variant" (page 36):
kJapaneseNoOneByteKanaOption = 0x20
kJapaneseUseAsciiBackslashOption = 0x40
Constants added to the enumerations associated with the TECInfo structure (page 47):
kTECTextRunBitClearFixBit = 2
kTECTextToUnicodeScanFixBit = 3
kTECAddForceASCIIChangesBit = 4
kTECPreferredEncodingFixBit = 5
kTECTextRunBitClearFixMask = 1L << kTECTextRunBitClearFixBit,
kTECTextToUnicodeScanFixMask = 1L << kTECTextToUnicodeScanFixBit,
kTECAddForceASCIIChangesMask = 1L << kTECAddForceASCIIChangesBit,
kTECPreferredEncodingFixMask = 1L << kTECPreferredEncodingFixBit
Constants added to the first enumeration in "Conversion Control Flags" (page 112):
kUnicodeForceASCIIRangeBit = 9
kUnicodeNoHalfwidthCharsBit = 10
Constants added to the second enumeration in "Conversion Control Flags" (page 112):
kUnicodeForceASCIIRangeMask = 1L << kUnicodeForceASCIIRangeBit
kUnicodeNoHalfwidthCharsMask = 1L << kUnicodeNoHalfwidthCharsBit
Changes to Table C-1 (page 247): New entry for CP 850 (DOS Latin-1). Other entries updated for correct version of Text Encoding Converter or Unicode Converter support.



© 1999 Apple Computer, Inc. – (Last Updated 13 Dec 99)
