Paiboon and Paiboon+ schemes¶
Two scheme IDs are available: paiboon for the original system used in Paiboon Publishing's first-edition Thai learner series, and paiboon_plus for the revised system adopted in the 2009 Three-Way dictionary and later titles. The two schemes share every onset, coda, and monophthong. They diverge only in how centring diphthongs spell their length contrast.
from thaiphon import transcribe
transcribe("น้ำ", scheme="paiboon")
# 'náam'
transcribe("สวัสดี", scheme="paiboon")
# 'sà-wàt-dii'
transcribe("เสือ", scheme="paiboon")
# 'sʉ̌a'
transcribe("เสือ", scheme="paiboon_plus")
# 'sʉ̌ʉa'
Format conventions¶
| Convention | Detail |
|---|---|
| Syllable separator | - (hyphen) |
| Tone | combining diacritic on the first vowel letter; mid tone is unmarked |
| Long vowel | doubled vowel letter (aa, ii, ɛɛ, etc.) |
| Vowel-initial syllable | no symbol — the vowel starts the syllable directly |
| No slashes | no /…/ wrapping |
Tone notation¶
Mid tone is not marked. The four other tones use a combining diacritic on the first vowel letter of the syllable. After NFC normalisation, plain Latin vowels precompose with their diacritics; the IPA-letter vowels ʉ ɛ ɔ ə have no precomposed forms and keep the combining codepoint.
| Tone | Diacritic | Unicode | Example word | Rendered |
|---|---|---|---|---|
| Mid | (none) | — | กา | gaa |
| Low | grave ◌̀ |
U+0300 | ก่า | gàa |
| Falling | circumflex ◌̂ |
U+0302 | ก้า | gâa |
| High | acute ◌́ |
U+0301 | ก๊า | gáa |
| Rising | háček ◌̌ |
U+030C | ก๋า | gǎa |
Onset map (IPA → Paiboon)¶
The convention is designed for English readers. Aspirated stops take the bare letter, since that matches the English intuition for p t k. Unaspirated voiceless stops — which English has no single-letter way to write distinctively — use digraphs that mimic the clusters in English words like "spa", "stop", "ski".
| IPA onset | Paiboon | Note |
|---|---|---|
k |
g |
unaspirated velar stop |
kʰ |
k |
aspirated — bare letter |
tɕ |
j |
unaspirated palatal affricate |
tɕʰ |
ch |
aspirated palatal affricate |
d |
d |
|
t |
dt |
unaspirated alveolar stop |
tʰ |
t |
aspirated — bare letter |
b |
b |
|
p |
bp |
unaspirated bilabial stop |
pʰ |
p |
aspirated — bare letter |
f |
f |
|
s |
s |
|
h |
h |
|
ʔ |
(none) | vowel-initial syllable begins with the vowel |
m |
m |
|
n |
n |
|
ŋ |
ng |
|
j |
y |
|
r |
r |
|
l |
l |
|
w |
w |
Aspirated vs. unaspirated: The key asymmetry is that aspirated stops are written with a single letter (p t k) while unaspirated voiceless stops take a digraph (bp dt g). New readers should keep this in mind: p in Paiboon is always the aspirated /pʰ/, and bp is always the unaspirated /p/.
transcribe("พา", scheme="paiboon") # 'paa' — aspirated /pʰ/
transcribe("ปา", scheme="paiboon") # 'bpaa' — unaspirated /p/
transcribe("ทา", scheme="paiboon") # 'taa' — aspirated /tʰ/
transcribe("ตา", scheme="paiboon") # 'dtaa' — unaspirated /t/
transcribe("คา", scheme="paiboon") # 'kaa' — aspirated /kʰ/
transcribe("กา", scheme="paiboon") # 'gaa' — unaspirated /k/
Vowel map (IPA → Paiboon)¶
Long monophthongs are doubled. Four IPA-letter vowels are used: ʉ (U+0289) for /ɯ/, ɛ (U+025B) for /ɛ/, ɔ (U+0254) for /ɔ/, and ə (U+0259) for /ɤ/.
| IPA quality | Short | Long |
|---|---|---|
| /a/ | a |
aa |
| /i/ | i |
ii |
| /u/ | u |
uu |
| /e/ | e |
ee |
| /ɛ/ | ɛ |
ɛɛ |
| /o/ | o |
oo |
| /ɔ/ | ɔ |
ɔɔ |
| /ɯ/ | ʉ |
ʉʉ |
| /ɤ/ | ə |
əə |
Centring diphthongs — where paiboon and paiboon_plus differ¶
This is the only point of divergence between the two schemes.
| Vowel | Length | paiboon |
paiboon_plus |
|---|---|---|---|
| /iə/ | short | ia |
ia |
| /iə/ | long | ia |
iia |
| /ɯə/ | short | ʉa |
ʉa |
| /ɯə/ | long | ʉa |
ʉʉa |
| /uə/ | short | ua |
ua |
| /uə/ | long | ua |
uua |
paiboon uses the same spelling for both short and long centring diphthongs. paiboon_plus doubles the long form, which lets readers distinguish length in words where the distinction matters.
The /iː/+/w/ coda sequence (เขียว pattern) follows the same logic: paiboon renders it as iao, and paiboon_plus as iiao.
transcribe("เสือ", scheme="paiboon") # 'sʉ̌a'
transcribe("เสือ", scheme="paiboon_plus") # 'sʉ̌ʉa'
transcribe("เขียว", scheme="paiboon") # 'kǐao'
transcribe("เขียว", scheme="paiboon_plus") # 'kǐiao'
Coda map (IPA → Paiboon)¶
| IPA coda | Paiboon | Note |
|---|---|---|
m |
m |
|
n |
n |
|
ŋ |
ng |
|
p̚ |
p |
|
t̚ |
t |
|
k̚ |
k |
|
w (offglide) |
o |
default; see short /i/+w exception below |
j (offglide) |
i |
|
f (loanword) |
p |
collapses to native bilabial stop; see preservation note below |
s (loanword) |
t |
collapses to native alveolar stop; see preservation note below |
l (loanword) |
n |
collapses to native nasal; see preservation note below |
Offglide /w/: The coda /w/ writes as o in most contexts, giving combinations like aao, ɛɛo, ɔɔo. Short /i/ + /w/ is the exception — it spells as iu rather than io, matching the English intuition for words like "Matthew".
Offglide /j/: The coda /j/ writes as i, giving combinations like aai, ɔɔi, əəi.
transcribe("ข้าว", scheme="paiboon") # 'kâao'
transcribe("ลิฟต์", scheme="paiboon") # 'líf'
transcribe("ฟุตบอล", scheme="paiboon") # 'fút-bɔɔn'
Loanword coda preservation¶
Under the default everyday profile, the engine preserves foreign codas in lexicon-listed modern loanwords. For words like ลิฟต์ (elevator), เชฟ (chef), and ออฟฟิศ (office), the coda surfaces as f rather than collapsing to p. Some loans — such as กราฟ (graph) and กอล์ฟ (golf) — only preserve under the careful_educated profile.
transcribe("ลิฟต์", scheme="paiboon") # 'líf'
transcribe("ลิฟต์", scheme="paiboon", profile="etalon_compat") # 'líp'
transcribe("กราฟ", scheme="paiboon") # 'graap'
transcribe("กราฟ", scheme="paiboon", profile="careful_educated") # 'graaf'
Pass profile="etalon_compat" to opt back into strict native-coda collapse for every word, regardless of the lexicon.
See Reading profiles for a full description of each profile.
Examples¶
All outputs are verified against the engine.
| Thai | Gloss | paiboon |
paiboon_plus |
|---|---|---|---|
| กา | crow | gaa |
gaa |
| ขา | leg | kǎa |
kǎa |
| ตา | eye / grandfather | dtaa |
dtaa |
| ทา | to apply | taa |
taa |
| ปา | to throw | bpaa |
bpaa |
| พา | to bring | paa |
paa |
| จาน | plate | jaan |
jaan |
| ชา | tea | chaa |
chaa |
| น้ำ | water | náam |
náam |
| ข้าว | rice | kâao |
kâao |
| รัก | love | rák |
rák |
| ปลา | fish | bplaa |
bplaa |
| สวัสดี | hello | sà-wàt-dii |
sà-wàt-dii |
| ภาษาไทย | Thai language | paa-sǎa-tai |
paa-sǎa-tai |
| กรุงเทพ | Bangkok | grung-têep |
grung-têep |
| เขียว | green | kǐao |
kǐiao |
| เสือ | tiger | sʉ̌a |
sʉ̌ʉa |
| เรือ | boat | rʉa |
rʉʉa |
| ถุง | bag | tǔng |
tǔng |
| เด็ก | child | dèk |
dèk |