LMT scheme (Cyrillic — Lipilina-Muzychenko-Thapanosoth)¶

The lmt scheme implements the Cyrillic transliteration of Thai used in the MSU/ISAA 2018 learner textbook: И. Н. Липилина, Ю. Ф. Музыченко, П. Тхапаносотх, Учебник тайского языка: вводный курс (Москва: ИД ВКН, 2018, ISBN 978-5-907086-16-6). The scheme sits in the same Russian-academic tradition as morev — onset and coda inventories are identical — but uses a different surface presentation suited to a learner textbook: length marked with an ASCII colon, tones as superscript digits, and a space as the syllable separator.

from thaiphon import transcribe

transcribe("น้ำ", scheme="lmt")
# 'на:³'

transcribe("สวัสดี", scheme="lmt")
# 'са¹ ват¹ ди:⁰'

transcribe("กรุงเทพ", scheme="lmt")
# 'крунг⁰ тхе:п²'

Format conventions¶

Convention	Detail
Syllable separator	single space
Long vowel	ASCII colon `:` immediately after the vowel letter (e.g. `а:`, `и:`)
Short vowel	no marker
Tone	Unicode superscript digit at the end of the syllable, after any coda
Mid tone	`⁰`
No slashes	output is bare — the `/…/` marks seen in the printed book are a display convention, not part of the transcription

Tone notation¶

Tone is appended as a single character after the full syllable, including any coda. Text mode uses Unicode superscript digits (U+2070, U+00B9, U+00B2, U+00B3, U+2074). HTML mode uses ordinary ASCII digits wrapped in <sup>…</sup> — using a Unicode superscript character inside a <sup> tag would produce a redundant superscript-within-superscript rendering, so the plain digit is preferred there.

Tone	Text mode	HTML mode	Unicode
Mid	`⁰`	`<sup>0</sup>`	U+2070
Low	`¹`	`<sup>1</sup>`	U+00B9
Falling	`²`	`<sup>2</sup>`	U+00B2
High	`³`	`<sup>3</sup>`	U+00B3
Rising	`⁴`	`<sup>4</sup>`	U+2074

Numbering differs from some dictionaries

LMT uses 0 for mid tone and runs 1–4 for the remaining four. Many Thai–English dictionaries use a 1–5 numbering where 1 = mid. Keep this in mind when reading LMT output alongside other systems.

transcribe("กา",  scheme="lmt")   # 'ка:⁰'  — mid tone
transcribe("ก่า", scheme="lmt")   # 'ка:¹'  — low
transcribe("ก้า", scheme="lmt")   # 'ка:²'  — falling
transcribe("ก๊า", scheme="lmt")   # 'ка:³'  — high
transcribe("ก๋า", scheme="lmt")   # 'ка:⁴'  — rising

# HTML mode — digits in <sup> instead of Unicode superscripts.
transcribe("น้ำ", scheme="lmt", format="html")
# 'на:<sup>3</sup>'

Onset map (IPA → LMT Cyrillic)¶

The onset inventory is identical to the Morev scheme — both derive from the same Russian-academic transliteration tradition.

IPA onset	LMT	Note
`k`	`к`	unaspirated velar stop
`kʰ`	`кх`	aspirated — digraph
`tɕ`	`ть`	unaspirated palatal affricate
`tɕʰ`	`ч`	aspirated palatal affricate — bare letter
`d`	`д`
`t`	`т`
`tʰ`	`тх`	aspirated — digraph
`b`	`б`
`p`	`п`
`pʰ`	`пх`	aspirated — digraph
`f`	`ф`	onset /f/ only
`s`	`с`
`h`	`х`
`ʔ`	(none)	glottal stop onset is not written
`m`	`м`
`n`	`н`
`ŋ`	`нг`	digraph
`j`	`й`
`r`	`р`
`l`	`л`
`w`	`в`	as bare onset

Onset clusters: In a true two-consonant onset cluster (/kw/, /kʰw/, etc.), the second-slot ว surfaces as the back vowel у rather than the consonant в. This matches the natural Russian phonetic reading of the glide.

transcribe("กวาง", scheme="lmt")   # 'куа:нг⁰'
transcribe("ปลา",  scheme="lmt")   # 'пла:⁰'

Vowel map (IPA → LMT)¶

Long vowels are marked with a colon placed immediately after the vowel letter. The scheme uses three non-Cyrillic letters to cover phonemes that have no single-character Cyrillic equivalent: Latin ɔ (U+0254, LATIN SMALL LETTER OPEN O) for /ɔ/ and Latin ə (U+0259, LATIN SMALL LETTER SCHWA) for /ɤ/ and centring-diphthong off-glides. This mix of Cyrillic and Latin IPA characters mirrors the printed textbook's own typography.

IPA quality	Short	Long
/a/	`а`	`а:`
/i/	`и`	`и:`
/u/	`у`	`у:`
/e/	`е`	`е:`
/ɛ/	`э`	`э:`
/o/	`о`	`о:`
/ɔ/	`ɔ`	`ɔ:`
/ɯ/	`ы`	`ы:`
/ɤ/	`ə`	`ə:`

Cyrillic о vs Latin ɔ

/o/ uses Cyrillic о (U+043E); /ɔ/ uses Latin IPA ɔ (U+0254). The two glyphs look similar at small sizes but are distinct characters representing phonemically different vowels. For example, โมง (clock hour) → мо:нг⁰ with Cyrillic о, while สอง (two) → сɔ:нг⁴ with Latin ɔ.

Centring diphthongs¶

The length colon is placed on the nucleus only; the off-glide а stays bare at both lengths.

Vowel	Short	Long
/iə/	`иа`	`и:а`
/ɯə/	`ыа`	`ы:а`
/uə/	`уа`	`у:а`

transcribe("เรือ", scheme="lmt")   # 'ры:а⁰'  — long /ɯə/
transcribe("เมีย", scheme="lmt")   # 'ми:а⁰'  — long /iə/

Coda map (IPA → LMT)¶

Native Thai codas map directly to Cyrillic. Foreign-origin coda phonemes that Thai phonotactics does not natively support collapse unconditionally to the nearest native segment — the textbook makes no exception for modern loanwords. Callers that want preserved foreign /f/ in output should use rtl, paiboon, or paiboon_plus.

IPA coda	LMT	Note
`m`	`м`
`n`	`н`
`ŋ`	`нг`
`p̚`	`п`	unreleased bilabial stop
`t̚`	`т`	unreleased alveolar stop
`k̚`	`к`	unreleased velar stop
`w` (offglide)	`у`
`j` (offglide)	`й`
`f` (loanword)	`п`	always collapses
`s` (loanword)	`т`	always collapses
`l` (loanword)	`н`	always collapses

transcribe("ลิฟต์",  scheme="lmt")   # 'лип³'  — /f/ collapses to п
transcribe("ข้าว",   scheme="lmt")   # 'кха:у²' — /w/ offglide → у
transcribe("ภาษาไทย", scheme="lmt")  # 'пха:⁰ са:⁴ тхай⁰'

Comparison with Morev¶

LMT and Morev share every onset and coda letter. Their surface presentations differ in three ways:

Feature	Morev	LMT
Long vowel	combining macron `◌̄` on first vowel letter	ASCII colon `:` after vowel letter
Tone notation	spacing modifier letter at syllable end (`ˆ` ` `ˇ` `´`)	Unicode superscript digit at syllable end (`⁰ ¹ ² ³ ⁴`)
Syllable separator	`-` (hyphen)	(space)

Use morev when your audience expects the dictionary-citation style. Use lmt when they are working from the 2018 MSU textbook, or when you prefer ASCII-colon length and numeric tone markers.

Examples¶

All outputs are verified against the engine.

Thai	Gloss	LMT
กา	crow	`ка:⁰`
ขา	leg	`кха:⁴`
ตา	eye / grandfather	`та:⁰`
ทา	to apply	`тха:⁰`
ปา	to throw	`па:⁰`
พา	to bring	`пха:⁰`
จาน	plate	`ть-а:н⁰`
ชา	tea	`ча:⁰`
น้ำ	water	`на:³`
ข้าว	rice	`кха:у²`
รัก	love	`рак³`
ปลา	fish	`пла:⁰`
สวัสดี	hello	`са¹ ват¹ ди:⁰`
ภาษาไทย	Thai language	`пха:⁰ са:⁴ тхай⁰`
กรุงเทพ	Bangkok	`крунг⁰ тхе:п²`
เขียว	green	`кхи:ау⁴`
เสือ	tiger	`сы:а³`
เรือ	boat	`ры:а⁰`
ถุง	bag	`тхунг⁴`
เด็ก	child	`дек¹`