13.07.2015 Views

Proposal to encode characters for Extended Tamil §1 ... - TamilNet

Proposal to encode characters for Extended Tamil §1 ... - TamilNet

Proposal to encode characters for Extended Tamil §1 ... - TamilNet

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Currently (even as of 5.2) the Unicode standard (§9.6, p 289) prescribes the use ofsuperscript <strong>characters</strong> 00B2, 00B3 and 2074 <strong>to</strong> handle <strong>Extended</strong> <strong>Tamil</strong>. However, we shallshow that while the glyphs these <strong>characters</strong> supply may be indeed used as diacritics, thereexist problems which necessitate the encoding of separate <strong>characters</strong>.§2. Characters that are needed <strong>for</strong> <strong>Extended</strong> <strong>Tamil</strong>The <strong>Tamil</strong> script does not have <strong>characters</strong> <strong>for</strong> vocalic R, RR, L and LL (dependent andindependent). It has only one non-nasal consonant per class among the class consonantswhereas the other Sanskrit-supporting scripts like Devanagari and Grantha have four,leaving three consonants missing per class. The exception is CA-class where two <strong>characters</strong>– CA and JA – are already present in <strong>Tamil</strong> and only CHA and JHA are missing. <strong>Tamil</strong> alsodoes not have an anunasika sign, anusvara, visarga, ardhavisarga, danda-s and avagraha.However, since the danda-s from 0964 and 0965 and the ardhavisarga from 1CF2(and the newly-proposed rotated version at 1CF3) are being used <strong>for</strong> all Indic scripts, these<strong>characters</strong> do not need <strong>to</strong> be duplicated <strong>for</strong> <strong>Extended</strong> <strong>Tamil</strong>.Thus, <strong>to</strong> extend the <strong>Tamil</strong> script <strong>to</strong> represent Sanskrit, one needs <strong>to</strong> <strong>encode</strong><strong>characters</strong> <strong>for</strong> vocalic R etc, the missing class consonants and then the anunasika sign etc.Thus those <strong>characters</strong> that need <strong>to</strong> be <strong>encode</strong>d <strong>for</strong> <strong>Extended</strong> <strong>Tamil</strong> are:1) TAMIL EXTENDED LETTER VOCALIC R2) TAMIL LETTER VOCALIC RR3) TAMIL LETTER VOCALIC L4) TAMIL LETTER VOCALIC LL5) TAMIL VOWEL SIGN VOCALIC R6) TAMIL VOWEL SIGN VOCALIC RR7) TAMIL VOWEL SIGN VOCALIC L8) TAMIL VOWEL SIGN VOCALIC LL9) TAMIL LETTER KHA10) TAMIL LETTER GA11) TAMIL LETTER GHA12) TAMIL LETTER CHA13) TAMIL LETTER JHA14) TAMIL LETTER TTHA15) TAMIL LETTER DDA2

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!