X-SAMPA
The Extended Speech Assessment Methods Phonetic Alphabet (X-SAMPA) is a variant of SAMPA developed in 1995 by John C. Wells, professor of phonetics at University College London.[1] It is designed to unify the individual language SAMPA alphabets, and extend SAMPA to cover the entire range of characters in the 1993 version of International Phonetic Alphabet (IPA). The result is a SAMPA-inspired remapping of the IPA into 7-bit ASCII.
SAMPA was devised as a hack to work around the inability of text encodings to represent IPA symbols. Later, as Unicode support for IPA symbols became more widespread, the necessity for a separate, computer-readable system for representing the IPA in ASCII decreased. However, X-SAMPA is still useful as the basis for an input method for true IPA.
Summary
Notes
- The IPA symbols that are ordinary lower case letters have the same value in X-SAMPA as they do in the IPA.
- X-SAMPA uses backslashes as modifying suffixes to create new symbols. For example,
O
is a distinct sound fromO\
, to which it bears no relation. Such use of the backslash character can be a problem, since many programs interpret it as an escape character for the character following it. For example, such X-SAMPA symbols do not work in EMU, so backslashes must be replaced with some other symbol (e.g., an asterisk: '*') when adding phonemic transcription to an EMU speech database. The backslash has no fixed meaning. - X-SAMPA diacritics follow the symbols they modify. Except for
~
for nasalization,=
for syllabicity, and`
for retroflexion and rhotacization, diacritics are joined to the character with the underscore character_
. - The underscore character is also used to encode the IPA tiebar:
k_p
codes for /k͡p/. - The numbers
_1
to_6
are reserved diacritics as shorthand for language-specific tone numbers. - The IETF language tags registry has assigned
fonxsamp
as the subtag for text transcribed in X-SAMPA.[2]
Lower-case symbols
X-SAMPA | IPA | IPA image | Description | Examples |
---|---|---|---|---|
a | a | open front unrounded vowel | French dame [dam] | |
b | b | voiced bilabial plosive | English bed [bEd] , French bon [bO~] | |
b_< | ɓ | voiced bilabial implosive | Sindhi ɓarʊ [b_<arU ] | |
c | c | voiceless palatal plosive | Hungarian latyak ["lQcQk] | |
d | d | voiced alveolar plosive | English dig [dIg] , French doigt [dwa] | |
d` | ɖ | voiced retroflex plosive | Swedish hord [hu:d`] | |
d_< | ɗ | voiced alveolar implosive | Sindhi ɗarʊ [d_<arU ] | |
e | e | close-mid front unrounded vowel | French blé [ble] | |
f | f | voiceless labiodental fricative | English five [faIv] , French femme [fam] | |
g | ɡ | voiced velar plosive | English game [geIm] , French longue [lO~g] | |
g_< | ɠ | voiced velar implosive | Sindhi ɠəro [g_<@ro ] | |
h | h | voiceless glottal fricative | English house [haUs] | |
h\ | ɦ | voiced glottal fricative | Czech hrad [h\rat] | |
i | i | close front unrounded vowel | English be [bi:] , French oui [wi] , Spanish si [si] | |
j | j | palatal approximant | English yes [jEs] , French yeux [j2] | |
j\ | ʝ | voiced palatal fricative | Greek γειά [j\a] | |
k | k | voiceless velar plosive | English skip [skIp] , Spanish carro ["karo] | |
l | l | alveolar lateral approximant | English lay [leI] , French mal [mal] | |
l` | ɭ | retroflex lateral approximant | Svealand Swedish sorl [so:l`] | |
l\ | ɺ | alveolar lateral flap | Wayuu püülükü [pM:l\MkM] | |
m | m | bilabial nasal | English mouse [maUs] , French homme [Om] | |
n | n | alveolar nasal | English nap [n{p] , French non [nO~] | |
n` | ɳ | retroflex nasal | Swedish hörn [h2:n`] | |
o | o | close-mid back rounded vowel | French veau [vo] | |
p | p | voiceless bilabial plosive | English speak [spik] , French pose [poz] , Spanish perro ["pero] | |
p\ | ɸ | voiceless bilabial fricative | Japanese fuku [p\M_0kM] | |
q | q | voiceless uvular plosive | Arabic qasbah ["qQs_Gba] | |
r | r | alveolar trill | Spanish perro ["pero] | |
r` | ɽ | retroflex flap | Bengali gari [gar`i:] | |
r\ | ɹ | alveolar approximant | English red [r\Ed] | |
r\` | ɻ | retroflex approximant | Malayalam വഴി ["v@r\`i] | |
s | s | voiceless alveolar fricative | English seem [si:m] , French session [sE"sjO~] | |
s` | ʂ | voiceless retroflex fricative | Swedish mars [mas`] | |
s\ | ɕ | voiceless alveolo-palatal fricative | Polish świerszcz [s\v'ers`ts`] | |
t | t | voiceless alveolar plosive | English stew [stju:] , French raté [Ra"te] | |
t` | ʈ | voiceless retroflex plosive | Swedish mört [m2t`] | |
u | u | close back rounded vowel | English boom [bu:m] , Spanish su [su] | |
v | v | voiced labiodental fricative | English vest [vEst] , French voix [vwa] | |
v\ (or P ) | ʋ | labiodental approximant | Dutch west [v\Est] /[PEst] | |
w | w | labial-velar approximant | English west [wEst] , French oui [wi] | |
x | x | voiceless velar fricative | Scots loch [lOx] or [5Ox] ; German Buch, Dach; Spanish caja, gestión | |
x\ | ɧ | voiceless palatal-velar fricative | Swedish sjal [x\A:l] | |
y | y | close front rounded vowel | French tu [ty] German über ["y:b6] | |
z | z | voiced alveolar fricative | English zoo [zu:] , French azote [a"zOt] | |
z` | ʐ | voiced retroflex fricative | Mandarin Chinese rang [z`aN] | |
z\ | ʑ | voiced alveolo-palatal fricative | Polish źrebak ["z\rEbak] |
Capital symbols
X-SAMPA | IPA | IPA image | Description | Example |
---|---|---|---|---|
A | ɑ | open back unrounded vowel | English father ["fA:D@ (r\ )] (RP and Gen.Am.) | |
B | β | voiced bilabial fricative | Spanish lavar [la"Ba4] | |
B\ | ʙ | bilabial trill | Reminiscent of shivering ("brrr") | |
C | ç | voiceless palatal fricative | German ich [IC] , English human ["Cjum@n] (broad transcription uses [hj -]) | |
D | ð | voiced dental fricative | English then [DEn] | |
E | ɛ | open-mid front unrounded vowel | French même [mE:m] , English met [mEt] (RP and Gen.Am.) | |
F | ɱ | labiodental nasal | English emphasis ["EFf@sIs] (spoken quickly, otherwise uses [Emf -]) | |
G | ɣ | voiced velar fricative | Greek γωνία [Go"nia] | |
G\ | ɢ | voiced uvular plosive | Inuktitut nirivvik [niG\ivvik] | |
G\_< | ʛ | voiced uvular implosive | Mam ʛa [G\_<a ] | |
H | ɥ | labial-palatal approximant | French huit [Hit] | |
H\ | ʜ | voiceless epiglottal fricative | Agul мехӀ [mEH\] | |
I | ɪ | near-close front unrounded vowel | English kit [kIt] | |
I\ | ᵻ | near-close central unrounded vowel (non-IPA) | Polish ryba [rI\bA] | |
J | ɲ | palatal nasal | Spanish año ["aJo] , English canyon ["k{J@n] (broad transcription uses [-nj -]) | |
J\ | ɟ | voiced palatal plosive | Hungarian egy [EJ\] | |
J\_< | ʄ | voiced palatal implosive | Sindhi ʄaro [J\_<aro ] | |
K | ɬ | voiceless alveolar lateral fricative | Welsh llaw [KaU] | |
K\ | ɮ | voiced alveolar lateral fricative | Mongolian долоо [tOK\O:] | |
L | ʎ | palatal lateral approximant | Italian famiglia [fa"miLLa] , Castilian: llamar [La"mar] | |
L\ | ʟ | velar lateral approximant | Korean 달구지 [t6L\gudz\i] | |
M | ɯ | close back unrounded vowel | Korean 음식 [M:ms\_hik_}] | |
M\ | ɰ | velar approximant | Spanish fuego ["fweM\o] | |
N | ŋ | velar nasal | English thing [TIN] | |
N\ | ɴ | uvular nasal | Japanese さん san [saN\] | |
O | ɔ | open-mid back rounded vowel | American English off [O:f] | |
O\ | ʘ | bilabial click | ||
P (or v\ ) | ʋ | labiodental approximant | Dutch west [PEst] /[v\Est] , allophone of English phoneme /r\/ | |
Q | ɒ | open back rounded vowel | RP lot [lQt] | |
R | ʁ | voiced uvular fricative | German rein [RaIn] | |
R\ | ʀ | uvular trill | French roi [R\wa] | |
S | ʃ | voiceless postalveolar fricative | English ship [SIp] | |
T | θ | voiceless dental fricative | English thin [TIn] | |
U | ʊ | near-close back rounded vowel | English foot [fUt] | |
U\ | ᵿ | near-close central rounded vowel (non-IPA) | English euphoria [jU\"fO@r\i@] | |
V | ʌ | open-mid back unrounded vowel | Scottish English strut [str\Vt] | |
W | ʍ | voiceless labial-velar fricative | Scots when [WEn] | |
X | χ | voiceless uvular fricative | Klallam sχaʔqʷaʔ [sXa?q_wa?] | |
X\ | ħ | voiceless pharyngeal fricative | Arabic ح ḥāʾ [X\A:] | |
Y | ʏ | near-close front rounded vowel | German hübsch [hYpS] | |
Z | ʒ | voiced postalveolar fricative | English vision ["vIZ@n] |
Other symbols
X-SAMPA | IPA | IPA image | Description | Example |
---|---|---|---|---|
. | . | syllable break | ||
" | ˈ | primary stress | ||
% | ˌ | secondary stress | American English pronunciation [pr\@%nVn.si."eI.S@n] | |
' (or _j ) | ʲ | palatalized | Russian Земля (Earth) [z'I"ml'a] or [z_jI"ml_ja] | |
: | ː | long | ||
:\ | ˑ | half long | Estonian differentiates three vowel lengths | |
- | separator | Polish trzy [t-S1] vs. czy [tS1] (affricate) | ||
@ | ə | schwa | English arena [@"r\i:n@] | |
@\ | ɘ | close-mid central unrounded vowel | Paicĩ kɘ̄ɾɘ [k@\_M4@\_M] | |
@` | ɚ | r-coloured schwa | American English color ["kVl@`] | |
{ | æ | near-open front unrounded vowel | English trap [tr\{p] | |
} | ʉ | close central rounded vowel | Swedish sju [x\}:] ; AuE/NZE boot [b}:t] | |
1 | ɨ | close central unrounded vowel | Welsh tu [t1] , American English rose's ["r\oUz1z] | |
2 | ø | close-mid front rounded vowel | Danish købe ["k2:b@] , French deux [d2] | |
3 | ɜ | open-mid central unrounded vowel | English nurse [n3:s] (RP) or [n3`s] (Gen.Am.) | |
3\ | ɞ | open-mid central rounded vowel | Irish tomhail [t3\:l'] | |
4 | ɾ | alveolar flap | Spanish pero ["pe4o] , American English better ["bE4@`] | |
5 | ɫ | velarized alveolar lateral approximant; also see _e | English milk [mI5k] , Portuguese livro ["5iv4u] | |
6 | ɐ | near-open central vowel | German besser ["bEs6] , Australian English mud [m6d] | |
7 | ɤ | close-mid back unrounded vowel | Estonian kõik [k7ik] , Vietnamese mơ [m7_M] | |
8 | ɵ | close-mid central rounded vowel | Swedish buss [b8s] | |
9 | œ | open-mid front rounded vowel | French neuf [n9f] , Danish drømme [dR9m@] | |
& | ɶ | open front rounded vowel | Swedish skörd [x\&d`] | |
? | ʔ | glottal stop | Cockney English bottle ["bQ?o] | |
?\ | ʕ | voiced pharyngeal fricative | Arabic ع ʿayn [?\Ajn] | |
* | undefined escape character, SAMPA's "conjunctor" | |||
/ | (a) French vowel archiphonemes or indeterminacies (b) delimiter of phonemic transcriptions | maison /mE/zO~/ | ||
< | begin nonsegmental notation, e.g., SAMPROSA[3] | |||
<\ | ʢ | voiced epiglottal fricative | Siwi arˤbˤəʢa (four) [ar_?\b_?\@<\a] | |
> | end nonsegmental notation | |||
>\ | ʡ | epiglottal plosive | Archi гӀарз (complaint) [>\arz] | |
^ | ꜛ | upstep | ||
! | ꜜ | downstep | ||
!\ | ǃ | postalveolar click | Zulu iqaqa (polecat) [i:!\a:!\a] | |
| | | | minor (foot) group | ||
|\ | ǀ | dental click | Zulu icici (earring) [i:|\i:|\i] | |
|| | ‖ | major (intonation) group | ||
|\|\ | ǁ | alveolar lateral click | Zulu xoxa (to converse) [|\|\O:|\|\a] | |
=\ | ǂ | palatal click | ||
-\ | ‿ | linking mark |
Diacritics
X-SAMPA | IPA | IPA image | Description |
---|---|---|---|
_" | ̈ | centralized | |
_+ | ̟ | advanced | |
_- | ̠ | retracted | |
_/ | ̌ | rising tone | |
_0 | ̥ | voiceless | |
_< | implosive (IPA uses separate symbols for implosives) | ||
= (or _= ) | ̩ | syllabic | |
_> | ʼ | ejective | |
_?\ | ˤ | pharyngealized | |
_\ | ̂ | falling tone | |
_^ | ̯ | non-syllabic | |
_} | ̚ | no audible release | |
` | ˞ | rhotacization in vowels, retroflexion in consonants (IPA uses separate symbols for consonants, see t` for an example) | |
~ (or _~ ) | ̃ | nasalization | |
_A | ̘ | advanced tongue root | |
_a | ̺ | apical | |
_B | ̏ | extra low tone | |
_B_L | ᷅ | low rising tone | |
_c | ̜ | less rounded | |
_d | ̪ | dental | |
_e | ̴ | velarized or pharyngealized; also see 5 | |
<F> | ↘ | global fall | |
_F | ̂ | falling tone | |
_G | ˠ | velarized | |
_H | ́ | high tone | |
_H_T | ᷄ | high rising tone | |
_h | ʰ | aspirated | |
_j (or ' ) | ʲ | palatalized | |
_k | ̰ | creaky voice | |
_L | ̀ | low tone | |
_l | ˡ | lateral release | |
_M | ̄ | mid tone | |
_m | ̻ | laminal | |
_N | ̼ | linguolabial | |
_n | ⁿ | nasal release | |
_O | ̹ | more rounded | |
_o | ̞ | lowered | |
_q | ̙ | retracted tongue root | |
<R> | ↗ | global rise | |
_R | ̌ | rising tone | |
_R_F | ᷈ | rising falling tone | |
_r | ̝ | raised | |
_T | ̋ | extra high tone | |
_t | ̤ | breathy voice | |
_v | ̬ | voiced | |
_w | ʷ | labialized | |
_X | ̆ | extra-short | |
_x | ̽ | mid-centralized |
Charts
Consonants
Consonants (pulmonic) | |||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Place of articulation → | Labial | Coronal | Dorsal | Laryngeal | |||||||||||||
Manner of articulation ↓ | Bilabial | Labio‐ dental |
Dental | Alveolar | Post‐ alveolar |
Retro‐ flex |
Palatal | Velar | Uvular | Pharyn‐ geal |
Epi‐ glottal |
Glottal | |||||
Nasal | m |
F |
n |
n` |
J |
N |
N\ |
||||||||||
Plosive | p b |
p_d b_d |
t d |
t` d` |
c J\ |
k g |
q G\ |
>\ |
? |
||||||||
Fricative | p\ B |
f v |
T D |
s z |
S Z |
s` z` |
C j\ |
x G |
X |
R |
X\ |
?\ |
H\ |
<\ |
h h\ | ||
Approximant | B_o |
v\ |
r\ |
r\` |
j |
M\ |
|||||||||||
Trill | B\ |
r |
* | R\ |
* | ||||||||||||
Tap or Flap | *† | *† | 4 |
r` |
* | ||||||||||||
Lateral Fricative | K K\ |
* | * | * | |||||||||||||
Lateral Approximant | l |
l` |
L |
L\ |
|||||||||||||
Lateral Flap | l\ |
* | * | * |
- Asterisks (*) mark sounds that do not have X-SAMPA symbols. Daggers (†) mark IPA symbols that have recently been added to Unicode. Since April 2008, the latter is the case of the labiodental flap, symbolized by a right-hook v in the IPA: . A dedicated symbol for the labiodental flap does not yet exist in X-SAMPA.
Coarticulated | |
---|---|
W |
Voiceless labialized velar approximant |
w |
Voiced labialized velar approximant |
H |
Voiced labialized palatal approximant |
s\ |
Voiceless palatalized postalveolar (alveolo-palatal) fricative |
z\ |
Voiced palatalized postalveolar (alveolo-palatal) fricative |
x\ |
Voiceless "palatal-velar" fricative |
Affricates and double articulation | |
---|---|
ts |
voiceless alveolar affricate |
dz |
voiced alveolar affricate |
tS |
voiceless postalveolar affricate |
dZ |
voiced postalveolar affricate |
ts\ |
voiceless alveolo-palatal affricate |
dz\ |
voiced alveolo-palatal affricate |
tK |
voiceless alveolar lateral affricate |
kp |
voiceless labial-velar plosive |
gb |
voiced labial-velar plosive |
Nm |
labial-velar nasal stop |
Consonants (non-pulmonic) | |||||
---|---|---|---|---|---|
Clicks | Implosives | Ejectives | |||
O\ |
Bilabial | b_< |
Bilabial | _> |
For example: |
|\ |
Laminal alveolar ("dental") | d_< |
Alveolar | p_> |
Bilabial |
!\ |
Apical (post-) alveolar ("retroflex") | J\_< |
Palatal | t_> |
Alveolar |
=\ |
Laminal postalveolar ("palatal") | g_< |
Velar | k_> |
Velar |
|\|\ |
Lateral coronal ("lateral") | G\_< |
Uvular | s_> |
Alveolar fricative |
See also
- Comparison of ASCII encodings of the International Phonetic Alphabet
- List of phonetics topics
- SAMPA, a language-specific predecessor of X-SAMPA
- SAMPA chart for English
References
- Wells, J.C. "Computer-coding the IPA: a proposed extension of SAMPA" (PDF). UCL Phonetics and Linguistics. University College London. Retrieved 16 March 2016.
- "Language Subtag Registry" (text). IETF. 2022-08-08. Retrieved 12 November 2022.
- For a summary of SAMPROSA, see Wells, J.C. (19 September 1995). "SAMPROSA (SAM Prosodic Transcription)". UCL Phonetics and Linguistics. University College London. Retrieved 23 October 2021.
External links
- Computer-coding the IPA: A proposed extension of SAMPA
- X-SAMPA to IPA to CXS converter
- Web-based translator for X-SAMPA documents. Produces Unicode text, XML text, PostScript, PDF, or LaTeX TIPA.
- Z-SAMPA, a backward-compatible extension of X-SAMPA sometimes used for conlangs