Paiboon and Paiboon+

Paiboon and Paiboon+ schemes

Two scheme IDs are available: paiboon for the original system used in Paiboon Publishing's first-edition Thai learner series, and paiboon_plus for the revised system adopted in the 2009 Three-Way dictionary and later titles. The two schemes share every onset, coda, and monophthong. They diverge only in how centring diphthongs spell their length contrast.

from thaiphon import transcribe

transcribe("น้ำ", scheme="paiboon")
# 'náam'

transcribe("สวัสดี", scheme="paiboon")
# 'sà-wàt-dii'

transcribe("เสือ", scheme="paiboon")
# 'sʉ̌a'

transcribe("เสือ", scheme="paiboon_plus")
# 'sʉ̌ʉa'

Format conventions

Convention Detail
Syllable separator - (hyphen)
Tone combining diacritic on the first vowel letter; mid tone is unmarked
Long vowel doubled vowel letter (aa, ii, ɛɛ, etc.)
Vowel-initial syllable no symbol — the vowel starts the syllable directly
No slashes no /…/ wrapping

Tone notation

Mid tone is not marked. The four other tones use a combining diacritic on the first vowel letter of the syllable. After NFC normalisation, plain Latin vowels precompose with their diacritics; the IPA-letter vowels ʉ ɛ ɔ ə have no precomposed forms and keep the combining codepoint.

Tone Diacritic Unicode Example word Rendered
Mid (none) กา gaa
Low grave ◌̀ U+0300 ก่า gàa
Falling circumflex ◌̂ U+0302 ก้า gâa
High acute ◌́ U+0301 ก๊า gáa
Rising háček ◌̌ U+030C ก๋า gǎa

Onset map (IPA → Paiboon)

The convention is designed for English readers. Aspirated stops take the bare letter, since that matches the English intuition for p t k. Unaspirated voiceless stops — which English has no single-letter way to write distinctively — use digraphs that mimic the clusters in English words like "spa", "stop", "ski".

IPA onset Paiboon Note
k g unaspirated velar stop
k aspirated — bare letter
j unaspirated palatal affricate
tɕʰ ch aspirated palatal affricate
d d
t dt unaspirated alveolar stop
t aspirated — bare letter
b b
p bp unaspirated bilabial stop
p aspirated — bare letter
f f
s s
h h
ʔ (none) vowel-initial syllable begins with the vowel
m m
n n
ŋ ng
j y
r r
l l
w w

Aspirated vs. unaspirated: The key asymmetry is that aspirated stops are written with a single letter (p t k) while unaspirated voiceless stops take a digraph (bp dt g). New readers should keep this in mind: p in Paiboon is always the aspirated /pʰ/, and bp is always the unaspirated /p/.

transcribe("พา", scheme="paiboon")   # 'paa'   — aspirated /pʰ/
transcribe("ปา", scheme="paiboon")   # 'bpaa'  — unaspirated /p/
transcribe("ทา", scheme="paiboon")   # 'taa'   — aspirated /tʰ/
transcribe("ตา", scheme="paiboon")   # 'dtaa'  — unaspirated /t/
transcribe("คา", scheme="paiboon")   # 'kaa'   — aspirated /kʰ/
transcribe("กา", scheme="paiboon")   # 'gaa'   — unaspirated /k/

Vowel map (IPA → Paiboon)

Long monophthongs are doubled. Four IPA-letter vowels are used: ʉ (U+0289) for /ɯ/, ɛ (U+025B) for /ɛ/, ɔ (U+0254) for /ɔ/, and ə (U+0259) for /ɤ/.

IPA quality Short Long
/a/ a aa
/i/ i ii
/u/ u uu
/e/ e ee
/ɛ/ ɛ ɛɛ
/o/ o oo
/ɔ/ ɔ ɔɔ
/ɯ/ ʉ ʉʉ
/ɤ/ ə əə

Centring diphthongs — where paiboon and paiboon_plus differ

This is the only point of divergence between the two schemes.

Vowel Length paiboon paiboon_plus
/iə/ short ia ia
/iə/ long ia iia
/ɯə/ short ʉa ʉa
/ɯə/ long ʉa ʉʉa
/uə/ short ua ua
/uə/ long ua uua

paiboon uses the same spelling for both short and long centring diphthongs. paiboon_plus doubles the long form, which lets readers distinguish length in words where the distinction matters.

The /iː/+/w/ coda sequence (เขียว pattern) follows the same logic: paiboon renders it as iao, and paiboon_plus as iiao.

transcribe("เสือ",  scheme="paiboon")       # 'sʉ̌a'
transcribe("เสือ",  scheme="paiboon_plus")  # 'sʉ̌ʉa'

transcribe("เขียว", scheme="paiboon")       # 'kǐao'
transcribe("เขียว", scheme="paiboon_plus")  # 'kǐiao'

Coda map (IPA → Paiboon)

IPA coda Paiboon Note
m m
n n
ŋ ng
p
t
k
w (offglide) o default; see short /i/+w exception below
j (offglide) i
f (loanword) p collapses to native bilabial stop; see preservation note below
s (loanword) t collapses to native alveolar stop; see preservation note below
l (loanword) n collapses to native nasal; see preservation note below

Offglide /w/: The coda /w/ writes as o in most contexts, giving combinations like aao, ɛɛo, ɔɔo. Short /i/ + /w/ is the exception — it spells as iu rather than io, matching the English intuition for words like "Matthew".

Offglide /j/: The coda /j/ writes as i, giving combinations like aai, ɔɔi, əəi.

transcribe("ข้าว", scheme="paiboon")    # 'kâao'
transcribe("ลิฟต์", scheme="paiboon")   # 'líf'
transcribe("ฟุตบอล", scheme="paiboon")  # 'fút-bɔɔn'

Loanword coda preservation

Under the default everyday profile, the engine preserves foreign codas in lexicon-listed modern loanwords. For words like ลิฟต์ (elevator), เชฟ (chef), and ออฟฟิศ (office), the coda surfaces as f rather than collapsing to p. Some loans — such as กราฟ (graph) and กอล์ฟ (golf) — only preserve under the careful_educated profile.

transcribe("ลิฟต์", scheme="paiboon")                          # 'líf'
transcribe("ลิฟต์", scheme="paiboon", profile="etalon_compat") # 'líp'

transcribe("กราฟ", scheme="paiboon")                                    # 'graap'
transcribe("กราฟ", scheme="paiboon", profile="careful_educated")        # 'graaf'

Pass profile="etalon_compat" to opt back into strict native-coda collapse for every word, regardless of the lexicon.

See Reading profiles for a full description of each profile.


Examples

All outputs are verified against the engine.

Thai Gloss paiboon paiboon_plus
กา crow gaa gaa
ขา leg kǎa kǎa
ตา eye / grandfather dtaa dtaa
ทา to apply taa taa
ปา to throw bpaa bpaa
พา to bring paa paa
จาน plate jaan jaan
ชา tea chaa chaa
น้ำ water náam náam
ข้าว rice kâao kâao
รัก love rák rák
ปลา fish bplaa bplaa
สวัสดี hello sà-wàt-dii sà-wàt-dii
ภาษาไทย Thai language paa-sǎa-tai paa-sǎa-tai
กรุงเทพ Bangkok grung-têep grung-têep
เขียว green kǐao kǐiao
เสือ tiger sʉ̌a sʉ̌ʉa
เรือ boat rʉa rʉʉa
ถุง bag tǔng tǔng
เด็ก child dèk dèk