(last updated 2007-05-14) Character set name: TSCII (TAMIL SCRIPT CODE FOR INFORMATION INTERCHANGE) Character set aliases: None Suitability for use in MIME text: YES usable as 8bit or with base64 or quoted-printable encoding Published Specifications: http://www.tscii.org/tsciispec.html ISO 10646 Equivalency Table: Available as a technical note at the Unicode Consortium website http://www.unicode.org/notes/tn15/ As a glyph-based encoding, TSCII codechart includes vowels, consonants and abugida (compound vowel-consonant) characters. Unicode, as a character encoding encodes only vowels and consonants. Hence not all codepoints of TSCII can be converted one-to-one with ISO 10646. Intended usage: COMMON Additional Information: Tamil is one of the main Indian languages (Dravidian in Origin) currently spoken by over 70 million people worldwide. TSCII (Tamil Script Code for Information Interchange) is a bilingual 8-bit glyph-based encoding scheme (Roman and Tamil). The TSCII scheme was collectively worked out through Net-based discussions in 1998. TSCII is modeled on "ISO-8859-XX family of charsets" with standard plain ASCII set filling the 7-bit part and a set of Tamil character glyphs filling the 8-bit part. Full technical details on the TSCII charset are available at the TSCII official website: http://www.tscii.org/tsciispec.html Person(s) & email address to contact for further information: TSCII USER GROUP represented by Kalyanasundaram, Kuppuswamy (Switzerland) kalyan.geo&yahoo.com Manivannan, Mani (USA) mmanivannan&gmail.com Nedumaran, Muthu (Malaysia) muthu&murasu.com TAMIL SCRIPT CODE FOR INFORMATION INTERCHANGE (TSCII) Glyph/Character Listing and ISO 10646 Mapping Table Column #1 is the TSCII code position (in hex), Column #2 is the TSCII character name ISO 10646 Mapping table can be obtained as a Technical note from the Unicode Consortium website: http://www.unicode.org/notes/tn15 A Unicode based PDF file that includes the actual glyph forms of all characters included in TSCII charset is available at the TSCII website (see PDF file in particular) http://www.tscii.org/tsciispec.html HEX Character Name 00 NULL 01 START OF HEADING 02 START OF TEXT 03 END OF TEXT 04 END OF TRANSMISSION 05 ENQUIRY 06 ACKNOWLEDGE 07 BELL 08 BACKSPACE 09 HORIZONTAL TABULATION 0A LINE FEED 0B VERTICAL TABULATION 0C FORM FEED 0D CARRIAGE RETURN 0E SHIFT OUT 0F SHIFT IN 10 DATA LINE ESCAPE 11 DEVICE CONTROL ONE 12 DEVICE CONTROL TWO 13 DEVICE CONTROL THREE 14 DEVICE CONTROL FOUR 15 NEGATIVE ACKNOWLEDGE 16 SYNCHRONOUS IDLE 17 END OF TRANSMISSION BLOCK 18 CANCEL 19 END OF MEDIUM 1A SUBSTITUTE 1B ESCAPE 1C FILE SEPARATOR 1D GROUP SEPARATOR 1E RECORD SEPARATOR 1F UNIT SEPARATOR 20 SPACE 21 EXCLAMATION MARK 22 QUOTATION MARK 23 NUMBER SIGN 24 DOLLAR SIGN 25 PERCENT SIGN 26 AMPERSAND 27 APOSTROPHE 28 LEFT PARENTHESIS 29 RIGHT PARENTHESIS 2A ASTERISK 2B PLUS SIGN 2C COMMA 2D HYPHEN MINUS 2E FULL STOP 2F SOLIDUS 30 DIGIT ZERO 31 DIGIT ONE 32 DIGIT TWO 33 DIGIT THREE 34 DIGIT FOUR 35 DIGIT FIVE 36 DIGIT SIX 37 DIGIT SEVEN 38 DIGIT EIGHT 39 DIGIT NINE 3A COLON 3B SEMICOLON 3C LESS-THAN SIGN 3D EQUALS SIGN 3E GREATER-THAN SIGN 3F QUESTION MARK 40 COMMERCIAL AT 41 LATIN CAPITAL LETTER A 42 LATIN CAPITAL LETTER B 43 LATIN CAPITAL LETTER C 44 LATIN CAPITAL LETTER D 45 LATIN CAPITAL LETTER E 46 LATIN CAPITAL LETTER F 47 LATIN CAPITAL LETTER G 48 LATIN CAPITAL LETTER H 49 LATIN CAPITAL LETTER I 4A LATIN CAPITAL LETTER J 4B LATIN CAPITAL LETTER K 4C LATIN CAPITAL LETTER L 4D LATIN CAPITAL LETTER M 4E LATIN CAPITAL LETTER N 4F LATIN CAPITAL LETTER O 50 LATIN CAPITAL LETTER P 51 LATIN CAPITAL LETTER Q 52 LATIN CAPITAL LETTER R 53 LATIN CAPITAL LETTER S 54 LATIN CAPITAL LETTER T 55 LATIN CAPITAL LETTER U 56 LATIN CAPITAL LETTER V 57 LATIN CAPITAL LETTER W 58 LATIN CAPITAL LETTER X 59 LATIN CAPITAL LETTER Y 5A LATIN CAPITAL LETTER Z 5B LEFT SQUARE BRACKET 5C REVERSE SOLIDUS 5D RIGHT SQUARE BRACKET 5E CIRCUMFLEX ACCENT 5F LOW LINE 60 GRAVE ACCENT 61 LATIN SMALL LETTER A 62 LATIN SMALL LETTER B 63 LATIN SMALL LETTER C 64 LATIN SMALL LETTER D 65 LATIN SMALL LETTER E 66 LATIN SMALL LETTER F 67 LATIN SMALL LETTER G 68 LATIN SMALL LETTER H 69 LATIN SMALL LETTER I 6A LATIN SMALL LETTER J 6B LATIN SMALL LETTER K 6C LATIN SMALL LETTER L 6D LATIN SMALL LETTER M 6E LATIN SMALL LETTER N 6F LATIN SMALL LETTER O 70 LATIN SMALL LETTER P 71 LATIN SMALL LETTER Q 72 LATIN SMALL LETTER R 73 LATIN SMALL LETTER S 74 LATIN SMALL LETTER T 75 LATIN SMALL LETTER U 76 LATIN SMALL LETTER V 77 LATIN SMALL LETTER W 78 LATIN SMALL LETTER X 79 LATIN SMALL LETTER Y 7A LATIN SMALL LETTER Z 7B LEFT CURLY BRACKET 7C VERTICAL LINE 7D RIGHT CURLY BRACKET 7E TILDE 7F DELETE 80 TAMIL DIGIT CUZHI = Tamil digit zero 81 TAMIL DIGIT ONRRU = Tamil digit one 82 TAMIL GRANTHA LETTER SRI = Tamil letter sri 83 TAMIL GRANTHA LETTER JA = Tamil letter ja 84 TAMIL GRANTHA LETTER SSA = Tamil letter ssa 85 TAMIL GRANTHA LETTER SA = Tamil letter sa 86 TAMIL GRANTHA LETTER HA = Tamil letter ha 87 TAMIL GRANTHA LETTER KSHA = Tamil letter ksha 88 TAMIL GRANTHA LETTER J = Tamil letter j 89 TAMIL GRANTHA LETTER SS = Tamil letter ss 8A TAMIL GRANTHA LETTER S = Tamil letter s 8B TAMIL GRANTHA LETTER H = Tamil letter h 8C TAMIL GRANTHA LETTER KSH = Tamil letter ksh 8D TAMIL DIGIT IRANNNTU = Tamil digit two 8E TAMIL DIGIT MUUNNRRU = Tamil digit three 8F TAMIL DIGIT NAANNKU = Tamil digit four 90 TAMIL DIGIT AINTHU = Tamil digit five 91 LEFT SINGLE QUOTATION MARK 92 RIGHT SINGLE QUOTATION MARK 93 LEFT DOUBLE QUOTATION MARK 94 RIGHT DOUBLE QUOTATION MARK 95 TAMIL DIGIT AARRU = Tamil digit six 96 TAMIL DIGIT EEZHU = Tamil digit seven 97 TAMIL DIGIT ETTU = Tamil digit eight 98 TAMIL DIGIT ONPATHU = Tamil digit nine 99 TAMIL LETTER NGAKARA UKARAM = Tamil letter ngu 9A TAMIL LETTER NJAKARA UKARAM = Tamil letter nju 9B TAMIL LETTER NGAKARA UUKAARAM = Tamil letter nguu 9C TAMIL LETTER NJAKARA UUKAARAM = Tamil letter njuu 9D TAMIL NUMBER PATHTHU = Tamil number ten 9E TAMIL NUMBER NUURRU = Tamil number one hundred 9F TAMIL NUMBER AAYIRAM = Tamil number one thousand A0 A1 TAMIL VOWEL SIGN KAAL = Tamil vowel sign aa A2 TAMIL VOWEL SIGN KOKKI = Tamil vowel sign i A3 TAMIL VOWEL SIGN CUZHI-K-KOKKI = Tamil vowel sign ii A4 TAMIL VOWEL SIGN KONNNTAI = Tamil vowel sign u A5 TAMIL VOWEL SIGN CUZHIK KONNNTAI = Tamil vowel sign uu A6 TAMIL VOWEL SIGN KOMPU = Tamil vowel sign e A7 TAMIL VOWEL SIGN IRATTAI-K-KOMPU = Tamil vowel sign ee A8 TAMIL VOWEL SIGN IRATTAI-C-CUZHI = Tamil vowel sign ai A9 COPYRIGHT SIGN AA TAMIL VOWEL SIGN CIRRAKU = Tamil au length mark AB TAMIL LETTER AKARAM = Tamil letter a AC TAMIL LETTER AAKAARAM = Tamil letter aa AD TAMIL VOWEL IKARAM (USAGE IN SLOT DEPRECATED) = Tamil letter i AE TAMIL LETTER IIKAARAM = Tamil letter ii AF TAMIL LETTER UKARAM = Tamil letter u B0 TAMIL LETTER UUKAARAM = Tamil letter uu B1 TAMIL LETTER EKARAM = Tamil letter e B2 TAMIL LETTER EEKAARAM = Tamil letter ee B3 TAMIL LETTER AIKAARAM = Tamil letter ai B4 TAMIL LETTER OKARAM = Tamil letter o B5 TAMIL LETTER OOKAARAM = Tamil letter oo B6 TAMIL LETTER AUKAARAM = Tamil letter au B7 TAMIL AAYTHAM LETTER AKHENAM or AKHAAN = Tamil letter aaytham B8 TAMIL LETTER KAKARA AKARAM = Tamil letter ka B9 TAMIL LETTER NGAKARA AKARAM = Tamil letter nga BA TAMIL LETTER CAKARA AKARAM = Tamil letter ca BB TAMIL LETTER NJAKARA AKARAM = Tamil letter nja BC TAMIL LETTER TAKARA AKARAM = Tamil letter tta BD TAMIL LETTER NNNAKARA AKARAM = Tamil letter nnna BE TAMIL LETTER THAKARA AKARAM = Tamil letter ta BF TAMIL LETTER NAKARA AKARAM = Tamil letter na C0 TAMIL LETTER PAKARA AKARAM = Tamil letter pa C1 TAMIL LETTER MAKARA AKARAM = Tamil letter ma C2 TAMIL LETTER YAKARA AKARAM = Tamil letter ya C3 TAMIL LETTER RAKARA AKARAM = Tamil letter ra C4 TAMIL LETTER LAKARA AKARAM = Tamil letter la C5 TAMIL LETTER VAKARA AKARAM = Tamil letter va C6 TAMIL LETTER ZHAKARA AKARAM = Tamil letter llla C7 TAMIL LETTER LLAKARA AKARAM = Tamil letter lla C8 TAMIL LETTER RRAKARA AKARAM = Tamil letter rra C9 TAMIL LETTER NNAKARA AKARAM = Tamil letter nna CA TAMIL LETTER TAKARA IKARAM = Tamil letter tti CB TAMIL LETTER TAKARA IIKAARAM = Tamil letter ttii CC TAMIL LETTER KAKARA UKARAM = Tamil letter ku CD TAMIL LETTER CAKARA UKARAM = Tamil letter cu CE TAMIL LETTER TAKARA UKARAM = Tamil letter ttu CF TAMIL LETTER NNNAKARA UKARAM = Tamil letter nnnu D0 TAMIL LETTER THAKARA UKARAM = Tamil letter tu D1 TAMIL LETTER NAKARA UKARAM = Tamil letter nu D2 TAMIL LETTER PAKARA UKARAM = Tamil letter pu D3 TAMIL LETTER MAKARA UKARAM = Tamil letter mu D4 TAMIL LETTER YAKARA UKARAM = Tamil letter yu D5 TAMIL LETTER RAKARA UKARAM = Tamil letter ru D6 TAMIL LETTER LAKARA UKARAM = Tamil letter lu D7 TAMIL LETTER VAKARA UKARAM = Tamil letter vu D8 TAMIL LETTER ZHAKARA UKARAM = Tamil letter lllu D9 TAMIL LETTER LLAKARA UKARAM = Tamil letter llu DA TAMIL LETTER RRAKARA UKARAM = Tamil letter rru DB TAMIL LETTER NNAKARA UKARAM = Tamil letter nnu DC TAMIL LETTER KAKARA UUKAARAM = Tamil letter kuu DD TAMIL LETTER CAKARA UUKAARAM = Tamil letter cuu DE TAMIL LETTER TAKARA UUKAARAM = Tamil letter ttuu DF TAMIL LETTER NNNAKARA UUKAARAM = Tamil letter nnnuu E0 TAMIL LETTER THAKARA UUKAARAM = Tamil letter tuu E1 TAMIL LETTER NAKARA UUKAARAM = Tamil letter nuu E2 TAMIL LETTER PAKARA UUKAARAM = Tamil letter puu E3 TAMIL LETTER MAKARA UUKAARAM = Tamil letter muu E4 TAMIL LETTER YAKARA UUKAARAM = Tamil letter yuu E5 TAMIL LETTER RAKARA UUKAARAM = Tamil letter ruu E6 TAMIL LETTER LAKARA UUKAARAM = Tamil letter luu E7 TAMIL LETTER VAKARA UUKAARAM = Tamil letter vuu E8 TAMIL LETTER ZHAKARA UUKAARAM = Tamil letter llluu E9 TAMIL LETTER LLAKARA UUKAARAM = Tamil letter lluu EA TAMIL LETTER RRAKARA UUKAARAM = Tamil letter rruu EB TAMIL LETTER NNAKARA UUKAARAM = Tamil letter nnuu EC TAMIL LETTER KAKARAM = Tamil letter k ED TAMIL LETTER NGAKARAM = Tamil letter ng EE TAMIL LETTER CAKARAM = Tamil letter c EF TAMIL LETTER NJAKARAM = Tamil letter nj F0 TAMIL LETTER TAKARAM = Tamil letter tt F1 TAMIL LETTER NNNAKARAM = Tamil letter nnn F2 TAMIL LETTER THAKARAM = Tamil letter t F3 TAMIL LETTER NAKARAM = Tamil letter n F4 TAMIL LETTER PAKARAM = Tamil letter p F5 TAMIL LETTER MAKARAM = Tamil letter m F6 TAMIL LETTER YAKARAM = Tamil letter y F7 TAMIL LETTER RAKARAM = Tamil letter r F8 TAMIL LETTER LAKARAM = Tamil letter l F9 TAMIL LETTER VAKARAM = Tamil letter v FA TAMIL LETTER ZHAKARAM = Tamil letter LLL FB TAMIL LETTER LLAKARAM = Tamil letter ll FC TAMIL LETTER RRAKARAM = Tamil letter rr FD TAMIL LETTER NNAKARAM = Tamil letter nn FE TAMIL LETTER IKARAM = Tamil letter i FF NOTES: i) Third vowel "i" is included in slots AD and EF but usage of "i" at slot AD is deprecated. Inclusion of the glyph at slot AD is for rendering legacy data and to enable conversion to other encodings. Text converters to other encodings should attempt to determine which slot is used in the text for ikaram before converting. ii) Though ukara- and uukara modifiers (at slots 4A, 4B) are indicated as "TAMIL VOWEL SIGN U" and "TAMIL VOWEL SIGN UU" respectively, their usage is permitted only for the grantha vowels. Entire ukara and uukara uyirmey series are encoded directly in TSCII and they alone are to be used to render these uyirmeys. iii) Tamil numerals 0-9 are indicated as "TAMIL DIGITS" while Tamil numerals 10,100 and 100 are indicated as "TAMIL NUMBERS". This is to recognize the fact that Tamil numerals are being in used in two different systems (decimalic as in Arabic using digits 0-9 and as an additive-positional system using numerals 10,100 and 1000 as well). iv) TAMIL LETTER AAYTHAM at slot 7B is a dependant letter though in Unicode 4.1 it is listed differently as TAMIL VISARGA SIGN. (in Tamil grammar this aaytham letter is called as "caarpu ezuttu"), Acknowledgment: TSCII user group would like to acknowledge the help of following persons in the preparation of this TSCII specifications document: Mr. S. Kaviarasan (USA), Mr. Ravindran Paul (Malaysia), Mr. Sivaraj Doddannan (India), Dr. RM. Krishnan (India), Dr. Kumar Mallikarjunan (USA) and Mr. Sinnathurai Srivas (UK). (file created 2007-05-14)