24.05.2014 Views

AIX Version 4.3 Differences Guide

AIX Version 4.3 Differences Guide

AIX Version 4.3 Differences Guide

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Note: The terms UCS-2 and Unicode will be used interchangeably throughout<br />

this documentation.<br />

The Unicode 2.0 standard (ISBN 0-201-48345-9) is the basis for the <strong>AIX</strong><br />

implementation and is used as the primary reference.<br />

Further information is also available at the Unicode URL: http://www.unicode.org<br />

10.3.1 UTF-8<br />

For file systems to be able to work with Unicode, a UCS Transformation Format<br />

was developed by the X/Open Internationalization (also known as I18N, because<br />

there are 18 letters between the I and the N) working group. This transformation,<br />

called UTF-8, provides a way of transforming Unicode into an ASCII<br />

representation.<br />

The UTF-8 transformation is important when considering that the majority of file<br />

systems are ACSII-based. When using the UTF-8 Transformation Format, you<br />

are assured of a file system-safe way of using Unicode to store files.<br />

The UTF-8 transformation format is a multi-byte code set capable of encoding the<br />

same set of characters of UCS-2 in 1 to 3 bytes or UCS-4 in 1 to 6 bytes. It has<br />

the following characteristics:<br />

• ASCII (7-bit code) is a proper subset.<br />

• It preserves the semantics of the portable character set.<br />

• It preserves the semantics of a null octet for the C programming language.<br />

Note: It is expected that UTF-8 will be a dominant transformation method where<br />

the UCS is not practical.<br />

10.3.2 ULS<br />

ULS, or Universal Language Support, refers to a toolkit of functions that enable<br />

applications to work with UCS. The ULS is designed to be operating<br />

system-independent, so the functions can be ported to any system.<br />

The Universal Language Support specification describes a set of functions and<br />

utilities for addressing a wide range of national language support (NLS)<br />

problems. In summary, it provides:<br />

• A universal coded character set capable of encoding most of today's<br />

languages and scripts.<br />

• A set of universal language support functions for the processing of input and<br />

output of supported characters.<br />

• A set of universal locale objects that describe processing of text on a global<br />

scope.<br />

• A set of universal layout objects that describe processing of text on a global<br />

scope.<br />

• A set of universal conversion objects for import/export of traditional data into<br />

the ULS environment.<br />

This specification identifies the level of support expected for the Universal Coded<br />

Character Sets in the <strong>AIX</strong> operating system and all future releases of it. Without<br />

National Language Support 243

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!