LMT (Cyrillic)

LMT scheme (Cyrillic — Lipilina-Muzychenko-Thapanosoth)

The lmt scheme implements the Cyrillic transliteration of Thai used in the MSU/ISAA 2018 learner textbook: И. Н. Липилина, Ю. Ф. Музыченко, П. Тхапаносотх, Учебник тайского языка: вводный курс (Москва: ИД ВКН, 2018, ISBN 978-5-907086-16-6). The scheme sits in the same Russian-academic tradition as morev — onset and coda inventories are identical — but uses a different surface presentation suited to a learner textbook: length marked with an ASCII colon, tones as superscript digits, and a space as the syllable separator.

from thaiphon import transcribe

transcribe("น้ำ", scheme="lmt")
# 'на:³'

transcribe("สวัสดี", scheme="lmt")
# 'са¹ ват¹ ди:⁰'

transcribe("กรุงเทพ", scheme="lmt")
# 'крунг⁰ тхе:п²'

Format conventions

Convention Detail
Syllable separator single space
Long vowel ASCII colon : immediately after the vowel letter (e.g. а:, и:)
Short vowel no marker
Tone Unicode superscript digit at the end of the syllable, after any coda
Mid tone
No slashes output is bare — the /…/ marks seen in the printed book are a display convention, not part of the transcription

Tone notation

Tone is appended as a single character after the full syllable, including any coda. Text mode uses Unicode superscript digits (U+2070, U+00B9, U+00B2, U+00B3, U+2074). HTML mode uses ordinary ASCII digits wrapped in <sup>…</sup> — using a Unicode superscript character inside a <sup> tag would produce a redundant superscript-within-superscript rendering, so the plain digit is preferred there.

Tone Text mode HTML mode Unicode
Mid <sup>0</sup> U+2070
Low ¹ <sup>1</sup> U+00B9
Falling ² <sup>2</sup> U+00B2
High ³ <sup>3</sup> U+00B3
Rising <sup>4</sup> U+2074

Numbering differs from some dictionaries

LMT uses 0 for mid tone and runs 14 for the remaining four. Many Thai–English dictionaries use a 15 numbering where 1 = mid. Keep this in mind when reading LMT output alongside other systems.

transcribe("กา",  scheme="lmt")   # 'ка:⁰'  — mid tone
transcribe("ก่า", scheme="lmt")   # 'ка:¹'  — low
transcribe("ก้า", scheme="lmt")   # 'ка:²'  — falling
transcribe("ก๊า", scheme="lmt")   # 'ка:³'  — high
transcribe("ก๋า", scheme="lmt")   # 'ка:⁴'  — rising

# HTML mode — digits in <sup> instead of Unicode superscripts.
transcribe("น้ำ", scheme="lmt", format="html")
# 'на:<sup>3</sup>'

Onset map (IPA → LMT Cyrillic)

The onset inventory is identical to the Morev scheme — both derive from the same Russian-academic transliteration tradition.

IPA onset LMT Note
k к unaspirated velar stop
кх aspirated — digraph
ть unaspirated palatal affricate
tɕʰ ч aspirated palatal affricate — bare letter
d д
t т
тх aspirated — digraph
b б
p п
пх aspirated — digraph
f ф onset /f/ only
s с
h х
ʔ (none) glottal stop onset is not written
m м
n н
ŋ нг digraph
j й
r р
l л
w в as bare onset

Onset clusters: In a true two-consonant onset cluster (/kw/, /kʰw/, etc.), the second-slot surfaces as the back vowel у rather than the consonant в. This matches the natural Russian phonetic reading of the glide.

transcribe("กวาง", scheme="lmt")   # 'куа:нг⁰'
transcribe("ปลา",  scheme="lmt")   # 'пла:⁰'

Vowel map (IPA → LMT)

Long vowels are marked with a colon placed immediately after the vowel letter. The scheme uses three non-Cyrillic letters to cover phonemes that have no single-character Cyrillic equivalent: Latin ɔ (U+0254, LATIN SMALL LETTER OPEN O) for /ɔ/ and Latin ə (U+0259, LATIN SMALL LETTER SCHWA) for /ɤ/ and centring-diphthong off-glides. This mix of Cyrillic and Latin IPA characters mirrors the printed textbook's own typography.

IPA quality Short Long
/a/ а а:
/i/ и и:
/u/ у у:
/e/ е е:
/ɛ/ э э:
/o/ о о:
/ɔ/ ɔ ɔ:
/ɯ/ ы ы:
/ɤ/ ə ə:

Cyrillic о vs Latin ɔ

/o/ uses Cyrillic о (U+043E); /ɔ/ uses Latin IPA ɔ (U+0254). The two glyphs look similar at small sizes but are distinct characters representing phonemically different vowels. For example, โมง (clock hour) → мо:нг⁰ with Cyrillic о, while สอง (two) → сɔ:нг⁴ with Latin ɔ.


Centring diphthongs

The length colon is placed on the nucleus only; the off-glide а stays bare at both lengths.

Vowel Short Long
/iə/ иа и:а
/ɯə/ ыа ы:а
/uə/ уа у:а
transcribe("เรือ", scheme="lmt")   # 'ры:а⁰'  — long /ɯə/
transcribe("เมีย", scheme="lmt")   # 'ми:а⁰'  — long /iə/

Coda map (IPA → LMT)

Native Thai codas map directly to Cyrillic. Foreign-origin coda phonemes that Thai phonotactics does not natively support collapse unconditionally to the nearest native segment — the textbook makes no exception for modern loanwords. Callers that want preserved foreign /f/ in output should use rtl, paiboon, or paiboon_plus.

IPA coda LMT Note
m м
n н
ŋ нг
п unreleased bilabial stop
т unreleased alveolar stop
к unreleased velar stop
w (offglide) у
j (offglide) й
f (loanword) п always collapses
s (loanword) т always collapses
l (loanword) н always collapses
transcribe("ลิฟต์",  scheme="lmt")   # 'лип³'  — /f/ collapses to п
transcribe("ข้าว",   scheme="lmt")   # 'кха:у²' — /w/ offglide → у
transcribe("ภาษาไทย", scheme="lmt")  # 'пха:⁰ са:⁴ тхай⁰'

Comparison with Morev

LMT and Morev share every onset and coda letter. Their surface presentations differ in three ways:

Feature Morev LMT
Long vowel combining macron ◌̄ on first vowel letter ASCII colon : after vowel letter
Tone notation spacing modifier letter at syllable end (ˆ ` ˇ ´) Unicode superscript digit at syllable end (⁰ ¹ ² ³ ⁴)
Syllable separator - (hyphen) (space)

Use morev when your audience expects the dictionary-citation style. Use lmt when they are working from the 2018 MSU textbook, or when you prefer ASCII-colon length and numeric tone markers.


Examples

All outputs are verified against the engine.

Thai Gloss LMT
กา crow ка:⁰
ขา leg кха:⁴
ตา eye / grandfather та:⁰
ทา to apply тха:⁰
ปา to throw па:⁰
พา to bring пха:⁰
จาน plate ть-а:н⁰
ชา tea ча:⁰
น้ำ water на:³
ข้าว rice кха:у²
รัก love рак³
ปลา fish пла:⁰
สวัสดี hello са¹ ват¹ ди:⁰
ภาษาไทย Thai language пха:⁰ са:⁴ тхай⁰
กรุงเทพ Bangkok крунг⁰ тхе:п²
เขียว green кхи:ау⁴
เสือ tiger сы:а³
เรือ boat ры:а⁰
ถุง bag тхунг⁴
เด็ก child дек¹