|#| X-SAMPA (ASCII) <-> IPA (UNICODE) |#| DICTIONARY PROPERTIES |@| dict-name: X-SAMPA |@| encoding-1-name: X-SAMPA |@| encoding-1-charset: ASCII-I |@| encoding-2-name: IPA |@| encoding-2-charset: Unicode |@| encoding-order: both |@| direction: ltr |@| delimiter: |@| author: Nathan Schneider |@| since: 17.oct.2004 |@| version: 0.8 |#| output-as-html option: would preserve <, >, and & characters in output rather than converting them to < > & |@| output-1-as-html: 0 |@| output-2-as-html: 0 |#| sample-input: |#| output-1-default: serif |#| output-2-default: sans |#| DISPLAY PROPERTIES |@| sans-serif-font: "Arial Unicode MS" |@| serif-font: "Doulos SIL" |@| entity-format: named |@| numbered-entity-format: dec |#| LINKS |!| http:// link description |#| NOTES ABOUT THIS DOCUMENT |#| * Character mappings are grouped roughly according to the overall chart of the International Phonetic Alphabet. |#| Within each group, they are ordered rougly according to their order on the IPA chart. However, because |#| of matters of precedence, several rules have been repositioned within their respective groups or moved |#| of them. In some cases, the rule has been left as a comment in the position conforming to IPA order. |#| Since the rules are processed in the order that they appear in the document, precedence requires that |#| sequences with multiple characters (more specific) generally precede similar sequences with fewer characters. |#| * There are no rules mapping characters to themselves. Only character conversions have associated rules. |#| * In order to facilitate reverse translation, no character sequence is entered as a regular expressions. |#| * Double quotes are used to enclose values that contain whitespace, start with the | character, or contain |#| one or more double quotes. CURRENTLY, the specified delimiter following the second double quote is used to |#| end the character sequence; first and second quotes are removed from the sequence, and all remaining |#| characters are considered part of the sequence. |#| * Unicode character values are entered in the following format: \0xFFFF |#| * The rules below are applied sequentially to the input string. Once a rule is applied to a sequence in the |#| input string, no other rule can modify the replaced value. |#| NOTE: arrange so all entity rules containing backslashes go first |#| SPECIAL RULES - PRECEDENCE SIGNIFICANT & \0x0276 front open rounded _X \0x0306 extra-short (FROM SUPRASEGMENTALS) |#| ===================== |#| CLICKS |#| ===================== "|\|\" \0x01C1 alveolar lateral click O\ \0x0298 bilabial click !\ \0x01C3 (post)alveolar click =\ \0x01C2 palatoalveolar click "|\" \0x01C0 dental click |#| ===================== |#| VOICED IMPLOSIVES |#| ===================== b_< \0x0253 vd bilabial implosive d_< \0x0257 vd dental/alveolar implosive J\_< \0x0284 vd palatal implosive g_< \0x0260 vd velar implosive G\_< \0x029B vd uvular implosive |#| ===================== |#| DIACRITICS |#| ===================== _> \0x02BC ejective _0 \0x0325 voiceless _v \0x032C voiced _h \0x02B0 aspirated _O \0x0339 more rounded _c \0x031C less rounded _+ \0x031F advanced _- \0x0320 retracted "_"" \0x0308 centralized _x \0x033D mid-centralized _= \0x0329 syllabic = \0x0329 syllabic _^ \0x032F non-syllabic _t \0x0324 breathy voiced _k \0x0330 creaky voiced _N \0x033C linguolabial _w \0x02B7 labialized ' \0x02B2 palatalized _j \0x02B2 palatalized _G \0x02E0 velarized _?\ \0x02E4 pharyngealized _e \0x0334 velarized or pharyngealized 5 \0x026B velarized l _r \0x031D raised _o \0x031E lowered _A \0x0318 advanced tongue root _q \0x0319 retracted tongue root _d \0x032A dental _a \0x033A apical _m \0x033B laminal _~ \0x0303 nasalized _n \0x207F nasal release _l \0x02E1 lateral release _} \0x031A no audible release |#| ===================== |#| PULMONIC CONSONANTS |#| ===================== |#| SPECIAL PRIORITY CHARACTERS 4 \0x027E vd alveolar tap |#| TRIPLE CHARACTERS |#| --------------------- r\` \0x027B vd retroflex approximant |#| DOUBLE CHARACTERS |#| --------------------- |#| PLOSIVE t` \0x0288 vl retroflex plosive d` \0x0256 vd retroflex plosive J\ \0x025F vd palatal plosive G\ \0x0262 vd uvular plosive |#| NASAL n` \0x0273 vd retroflex nasal N\ \0x0274 vd uvular nasal |#| TRILL B\ \0x0299 vd bilabial trill R\ \0x0280 vd uvular trill |#| TAP OR FLAP r` \0x027D vd retroflex flap |#| FRICATIVE p\ \0x0278 vl bilabial fricative s` \0x0282 vl retroflex fricative z` \0x0290 vd retroflex fricative j\ \0x029D vd palatal fricative X\ \0x0127 vl pharyngeal fricative ?\ \0x0295 vd pharyngeal fricative h\ \0x0266 vd glottal fricative |#| LATERAL FRICATIVE K\ \0x026E vd alveolar lateral fricative |#| APPROXIMANT v\ \0x028B vd labiodental approximant r\ \0x0279 vd alveolar approximant M\ \0x0270 velar approximant |#| LATERAL APPROXIMANT l` \0x026D vd retroflex lateral L\ \0x029F vd velar lateral |#| ===================== |#| TONES & WORD ACCENTS |#| ===================== _T \0x030B extra high tone _H \0x0301 high tone _M \0x0304 mid tone _L \0x0300 low tone _B \0x030F extra low tone |#| NOTE: UNICODE DOES NOT YET SUPPORT MOST CONTOUR SYMBOLS. THE COMMENTS BELOW INDICATE X-SAMPA REPRESENTATION |#| _R rising |#| _F falling |#| _H_T high rising |#| _B_L low rising |#| _R_F rising-falling \0x2197 global rise \0x2198 global fall |#| ===================== |#| DIACRITICS (contd) |#| ===================== ` \0x02DE rhoticity ~ \0x0303 nasalized |#| ===================== |#| PULMONIC CONSONANTS |#| ===================== |#| SINGLE CHARACTER |#| --------------------- |#| (multiple-character entries commented out) |#| PLOSIVE |#| t` \0x0288 vl retroflex plosive |#| d` \0x0256 vd retroflex plosive |#| J\ \0x025F vd palatal plosive |#| G\ \0x0262 vd uvular plosive ? \0x0294 vl glottal plosive |#| NASAL F \0x0271 vd labiodental nasal |#| n` \0x0273 vd retroflex nasal J \0x0272 vd palatal nasal |#| N\ \0x0274 vd uvular nasal N \0x014B vd velar nasal |#| TRILL |#| B\ \0x0299 vd bilabial trill |#| R\ \0x0280 vd uvular trill |#| TAP OR FLAP |#| 4 \0x027E vd alveolar tap |#| r` \0x027D vd retroflex flap |#| FRICATIVE |#| p\ \0x0278 vl bilabial fricative B \0x03B2 vd bilabial fricative T \0x03B8 vl dental fricative D \0x00F0 vd dental fricative S \0x0283 vl postalvelar fricative Z \0x0292 vd postalvelar fricative |#| s` \0x0282 vl retroflex fricative |#| z` \0x0290 vd retroflex fricative C \0x00E7 vl palatal fricative |#| j\ \0x029D vd palatal fricative G \0x0263 vd velar fricative X \0x03C7 vl uvular fricative R \0x0281 vd uvular fricative |#| X\ \0x0127 vl pharyngeal fricative |#| ?\ \0x0295 vd pharyngeal fricative |#| h\ \0x0266 vd glottal fricative |#| LATERAL FRICATIVE K \0x026C vl alveolar lateral fricative |#| K\ \0x026E vd alveolar lateral fricative |#| APPROXIMANT P \0x028B vd labiodental approximant |#| v\ \0x028B vd labiodental approximant |#| r\ \0x0279 vd alveolar approximant |#| r\` \0x027B vd retroflex approximant |#| M\ \0x0270 velar approximant |#| LATERAL APPROXIMANT |#| l` \0x026D vd retroflex lateral L \0x028E vd palatal lateral |#| L\ \0x029F vd velar lateral |#| ===================== |#| VOWELS |#| ===================== |#| FRONT I \0x026A lax close front unrounded Y \0x028F lax close front rounded 2 \0x00F8 front close-mid rounded E \0x025B open-mid front unrounded 9 \0x0153 front open-mid rounded { \0x00E6 raised front open unrounded |#| CENTRAL 1 \0x0268 close central unrounded } \0x0289 close central rounded @\ \0x0258 close-mid schwa 8 \0x0275 rounded schwa @ \0x0259 schwa 3\ \0x025E open-mid central rounded 3 \0x025C open-mid central 6 \0x0250 open-mid schwa |#| BACK M \0x026F close back unrounded U \0x028A lax close back unrounded 7 \0x0264 close-mid back unrounded V \0x028C open-mid back unrounded O \0x0254 open-mid back rounded A \0x0251 open back unrounded Q \0x0252 open back rounded |#| ===================== |#| OTHER SYMBOLS |#| ===================== W \0x028D vl labial-velar fricative H\ \0x029C vl epiglottal fricative H \0x0265 vd labial-palatal approximant <\ \0x02A2 vd epiglottal fricative >\ \0x02A1 epiglottal plosive s\ \0x0255 vl alveolo-palatal fricative z\ \0x0291 vd alveolo-palatal fricative l\ \0x027A alveolar lateral flap x\ \0x0267 simulatneous vl postalveolar fricative and vl velar fricative _ \0x0361 tie bar |#| ===================== |#| SUPRASEGMENTALS |#| ===================== """ \0x02C8 primary stress % \0x02CC secondary stress :\ \0x02D1 half-long : \0x02D0 long |#| . . syllable break |#| "|" | minor (foot) group "||" \0x2016 major (intonation) group -\ \0x203F linking (absence of a break) |#| ===================== |#| TONES & WORD ACCENTS (contd) |#| ===================== ! \0x2193 downstep ^ \0x2191 upstep