LMT scheme (Cyrillic — Lipilina-Muzychenko-Thapanosoth)¶
The lmt scheme implements the Cyrillic transliteration of Thai used in the MSU/ISAA 2018 learner textbook: И. Н. Липилина, Ю. Ф. Музыченко, П. Тхапаносотх, Учебник тайского языка: вводный курс (Москва: ИД ВКН, 2018, ISBN 978-5-907086-16-6). The scheme sits in the same Russian-academic tradition as morev — onset and coda inventories are identical — but uses a different surface presentation suited to a learner textbook: length marked with an ASCII colon, tones as superscript digits, and a space as the syllable separator.
from thaiphon import transcribe
transcribe("น้ำ", scheme="lmt")
# 'на:³'
transcribe("สวัสดี", scheme="lmt")
# 'са¹ ват¹ ди:⁰'
transcribe("กรุงเทพ", scheme="lmt")
# 'крунг⁰ тхе:п²'
Format conventions¶
| Convention | Detail |
|---|---|
| Syllable separator | single space |
| Long vowel | ASCII colon : immediately after the vowel letter (e.g. а:, и:) |
| Short vowel | no marker |
| Tone | Unicode superscript digit at the end of the syllable, after any coda |
| Mid tone | ⁰ |
| No slashes | output is bare — the /…/ marks seen in the printed book are a display convention, not part of the transcription |
Tone notation¶
Tone is appended as a single character after the full syllable, including any coda. Text mode uses Unicode superscript digits (U+2070, U+00B9, U+00B2, U+00B3, U+2074). HTML mode uses ordinary ASCII digits wrapped in <sup>…</sup> — using a Unicode superscript character inside a <sup> tag would produce a redundant superscript-within-superscript rendering, so the plain digit is preferred there.
| Tone | Text mode | HTML mode | Unicode |
|---|---|---|---|
| Mid | ⁰ |
<sup>0</sup> |
U+2070 |
| Low | ¹ |
<sup>1</sup> |
U+00B9 |
| Falling | ² |
<sup>2</sup> |
U+00B2 |
| High | ³ |
<sup>3</sup> |
U+00B3 |
| Rising | ⁴ |
<sup>4</sup> |
U+2074 |
Numbering differs from some dictionaries
LMT uses 0 for mid tone and runs 1–4 for the remaining four. Many Thai–English dictionaries use a 1–5 numbering where 1 = mid. Keep this in mind when reading LMT output alongside other systems.
transcribe("กา", scheme="lmt") # 'ка:⁰' — mid tone
transcribe("ก่า", scheme="lmt") # 'ка:¹' — low
transcribe("ก้า", scheme="lmt") # 'ка:²' — falling
transcribe("ก๊า", scheme="lmt") # 'ка:³' — high
transcribe("ก๋า", scheme="lmt") # 'ка:⁴' — rising
# HTML mode — digits in <sup> instead of Unicode superscripts.
transcribe("น้ำ", scheme="lmt", format="html")
# 'на:<sup>3</sup>'
Onset map (IPA → LMT Cyrillic)¶
The onset inventory is identical to the Morev scheme — both derive from the same Russian-academic transliteration tradition.
| IPA onset | LMT | Note |
|---|---|---|
k |
к |
unaspirated velar stop |
kʰ |
кх |
aspirated — digraph |
tɕ |
ть |
unaspirated palatal affricate |
tɕʰ |
ч |
aspirated palatal affricate — bare letter |
d |
д |
|
t |
т |
|
tʰ |
тх |
aspirated — digraph |
b |
б |
|
p |
п |
|
pʰ |
пх |
aspirated — digraph |
f |
ф |
onset /f/ only |
s |
с |
|
h |
х |
|
ʔ |
(none) | glottal stop onset is not written |
m |
м |
|
n |
н |
|
ŋ |
нг |
digraph |
j |
й |
|
r |
р |
|
l |
л |
|
w |
в |
as bare onset |
Onset clusters: In a true two-consonant onset cluster (/kw/, /kʰw/, etc.), the second-slot ว surfaces as the back vowel у rather than the consonant в. This matches the natural Russian phonetic reading of the glide.
Vowel map (IPA → LMT)¶
Long vowels are marked with a colon placed immediately after the vowel letter. The scheme uses three non-Cyrillic letters to cover phonemes that have no single-character Cyrillic equivalent: Latin ɔ (U+0254, LATIN SMALL LETTER OPEN O) for /ɔ/ and Latin ə (U+0259, LATIN SMALL LETTER SCHWA) for /ɤ/ and centring-diphthong off-glides. This mix of Cyrillic and Latin IPA characters mirrors the printed textbook's own typography.
| IPA quality | Short | Long |
|---|---|---|
| /a/ | а |
а: |
| /i/ | и |
и: |
| /u/ | у |
у: |
| /e/ | е |
е: |
| /ɛ/ | э |
э: |
| /o/ | о |
о: |
| /ɔ/ | ɔ |
ɔ: |
| /ɯ/ | ы |
ы: |
| /ɤ/ | ə |
ə: |
Cyrillic о vs Latin ɔ
/o/ uses Cyrillic о (U+043E); /ɔ/ uses Latin IPA ɔ (U+0254). The two glyphs look similar at small sizes but are distinct characters representing phonemically different vowels. For example, โมง (clock hour) → мо:нг⁰ with Cyrillic о, while สอง (two) → сɔ:нг⁴ with Latin ɔ.
Centring diphthongs¶
The length colon is placed on the nucleus only; the off-glide а stays bare at both lengths.
| Vowel | Short | Long |
|---|---|---|
| /iə/ | иа |
и:а |
| /ɯə/ | ыа |
ы:а |
| /uə/ | уа |
у:а |
transcribe("เรือ", scheme="lmt") # 'ры:а⁰' — long /ɯə/
transcribe("เมีย", scheme="lmt") # 'ми:а⁰' — long /iə/
Coda map (IPA → LMT)¶
Native Thai codas map directly to Cyrillic. Foreign-origin coda phonemes that Thai phonotactics does not natively support collapse unconditionally to the nearest native segment — the textbook makes no exception for modern loanwords. Callers that want preserved foreign /f/ in output should use rtl, paiboon, or paiboon_plus.
| IPA coda | LMT | Note |
|---|---|---|
m |
м |
|
n |
н |
|
ŋ |
нг |
|
p̚ |
п |
unreleased bilabial stop |
t̚ |
т |
unreleased alveolar stop |
k̚ |
к |
unreleased velar stop |
w (offglide) |
у |
|
j (offglide) |
й |
|
f (loanword) |
п |
always collapses |
s (loanword) |
т |
always collapses |
l (loanword) |
н |
always collapses |
transcribe("ลิฟต์", scheme="lmt") # 'лип³' — /f/ collapses to п
transcribe("ข้าว", scheme="lmt") # 'кха:у²' — /w/ offglide → у
transcribe("ภาษาไทย", scheme="lmt") # 'пха:⁰ са:⁴ тхай⁰'
Comparison with Morev¶
LMT and Morev share every onset and coda letter. Their surface presentations differ in three ways:
| Feature | Morev | LMT |
|---|---|---|
| Long vowel | combining macron ◌̄ on first vowel letter |
ASCII colon : after vowel letter |
| Tone notation | spacing modifier letter at syllable end (ˆ ` ˇ ´) |
Unicode superscript digit at syllable end (⁰ ¹ ² ³ ⁴) |
| Syllable separator | - (hyphen) |
(space) |
Use morev when your audience expects the dictionary-citation style. Use lmt when they are working from the 2018 MSU textbook, or when you prefer ASCII-colon length and numeric tone markers.
Examples¶
All outputs are verified against the engine.
| Thai | Gloss | LMT |
|---|---|---|
| กา | crow | ка:⁰ |
| ขา | leg | кха:⁴ |
| ตา | eye / grandfather | та:⁰ |
| ทา | to apply | тха:⁰ |
| ปา | to throw | па:⁰ |
| พา | to bring | пха:⁰ |
| จาน | plate | ть-а:н⁰ |
| ชา | tea | ча:⁰ |
| น้ำ | water | на:³ |
| ข้าว | rice | кха:у² |
| รัก | love | рак³ |
| ปลา | fish | пла:⁰ |
| สวัสดี | hello | са¹ ват¹ ди:⁰ |
| ภาษาไทย | Thai language | пха:⁰ са:⁴ тхай⁰ |
| กรุงเทพ | Bangkok | крунг⁰ тхе:п² |
| เขียว | green | кхи:ау⁴ |
| เสือ | tiger | сы:а³ |
| เรือ | boat | ры:а⁰ |
| ถุง | bag | тхунг⁴ |
| เด็ก | child | дек¹ |