# TLD: coop # IDN Reference table for Thai language # # Language: Thai (th) # Version: 1.0 # Effective Date: January 20, 2022 # # Registry: Tucows Registry Services # Contact: trs-support@tucows.com # Website: https://tucowsregistry.com/ # Relevant Policy Document URL: https://unr.com/idn # Code points requiring context rules # # Code point Description of rule/Reference # # U+002D Label must neither start nor end with U+002D. Label # HYPHEN-MINUS must not have U+002D in both third and fourth # position. RFC 5891 (sec 4.2.3.1) # # 0E50..0E59 Must not be combined with (EUROPEAN) DIGITS. # THAI DIGITS # # 0030..0039 Must not be combined with THAI DIGITS. # (EUROPEAN) DIGITS # # Thai specific context rules # # precedes-consonant A leading-vowel must precede a consonant # follows-consonant A below-vowel, above-vowel, below-diacritic, or above-diacritic MAITAIKHU must follow a consonant # between-consonant-and-ct A vowel MAIHAN AKAT must be in between a consonant and either tone or consonant # follow-consonant-tone-sara-aa A vowel SARA A can follow a consonant, a tone or a vowel SARA AA # follows-consonant-tone A vowel SARA AA, or an above diacritic NIKHAHIT followed by a vowel SARA AA can follow a consonant or a tone # follows-consonant-av-bv A tone mark, THANTHAKHAT, NIKHAHIT can only follow a consonant, above-vowel or below-vowel # follows-any-precedes-end An abbreviation mark, PAIYANNOI cannot be at the beginning of the label and must end it # follows-any-precedes-rep-cons-lv-end A repetition mark, MAIYAMOK can precede a repetition mark, an independent vowel, or end of label; it cannot start a label # # Character classes (e.g., consonant, leading-vowel, below-vowel) defined in: # https://www.icann.org/sites/default/files/packages/lgr/lgr-second-level-thai-language-18may21-en.html # The character sequence U+0E4D U+0E32 (above diacritic NIKHAHIT followed by a vowel # SARA AA) is considered an independent code point; this in order to implement the # context rule "follows-consonant-tone". In this way, this character sequence is # consider an exception to the context rule "follows-consonant-av-bv". U+002D U+0E01 U+0E02 U+0E03 U+0E04 U+0E05 U+0E06 U+0E07 U+0E08 U+0E09 U+0E0A U+0E0B U+0E0C U+0E0D U+0E0E U+0E0F U+0E10 U+0E11 U+0E12 U+0E13 U+0E14 U+0E15 U+0E16 U+0E17 U+0E18 U+0E19 U+0E1A U+0E1B U+0E1C U+0E1D U+0E1E U+0E1F U+0E20 U+0E21 U+0E22 U+0E23 U+0E24 U+0E25 U+0E26 U+0E27 U+0E28 U+0E29 U+0E2A U+0E2B U+0E2C U+0E2D U+0E2E U+0E2F U+0E30 U+0E31 U+0E32 U+0E34 U+0E35 U+0E36 U+0E37 U+0E38 U+0E39 U+0E3A U+0E40 U+0E41 U+0E42 U+0E43 U+0E44 U+0E46 U+0E47 U+0E48 U+0E49 U+0E4A U+0E4B U+0E4C U+0E4D U+0030;;U+0E50 U+0031;;U+0E51 U+0032;;U+0E52 U+0033;;U+0E53 U+0034;;U+0E54 U+0035;;U+0E55 U+0036;;U+0E56 U+0037;;U+0E57 U+0038;;U+0E58 U+0039;;U+0E59 U+0E50;;U+0030 U+0E51;;U+0031 U+0E52;;U+0032 U+0E53;;U+0033 U+0E54;;U+0034 U+0E55;;U+0035 U+0E56;;U+0036 U+0E57;;U+0037 U+0E58;;U+0038 U+0E59;;U+0039