Japanese phonology

From MedBib.com - Medicine & Nature

This article contains IPA phonetic symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Unicode characters.

This article deals with the phonology (i.e. the sound system) of the Japanese language.

Contents

Consonants

Japanese phonemes, with notable allophones
Bilabial Alveolar Palatal Velar Glottal Variable
Plosive p
b
t ~ ts ~ t̠ɕ
d ~ d̠ʑ
k
ɡ ~ ŋ
ː
Fricative ɸ s ~ ɕ
z ~ d̠ʑ
h ~ ç
Nasal m n ɴ̞
Flap ɾ̠ ~ ɺ̠
Approximant j w͍ (ɰ͡β̞)

Note that this table does not cover the consonants /g/, /d/, /z/, /b/, or /p/. Please see below for other details.

Hiragana IPA Notes
,,,, [ka], [ki], [k], [k], [k] This is a voiceless stop. It is slightly aspirated: less aspirated than English stops, but more so than Spanish.
,,, [sa], [su͍], [se], [so] This is a laminal alveolar sound.
[ɕi] This is an alveolo-palatal sound.
,, [ta], [te], [to] This is a voiceless stop, it is slightly aspirated: less aspirated than English stops, but more so than Spanish. This sound is laminal denti-alveolar (that is, the blade of the tongue contacts the back of the upper teeth and the front part of the alveolar ridge)
[t̠͡ɕi]
[t͡su͍]
,,, [na], [nu͍], [ne], [no]
[ɲi]
,, [ha], [he], [ho]
[çi]
[ɸu͍]
,,,, [ma], [mi], [mu͍], [me], [mo]
,, [ja], [ju͍], [jo]
, [ɺu͍], [ɺo] The flap tends to be lateral before back vowels, especially [o]
, [ɾi], [ɾe] The flap tends to be central before front vowels, especially [i]
[ɺa] ~ [ɾa] The flap tends to be especially ambiguous as to centrality before [a]
[ɰ͡β̞a] Pronounced with compressed rather than rounded lips. The kana wi we wo are pronounced as vowels, [i], [e], [o]
[n] Before [d], [n], or [t]. This sound is laminal denti-alveolar (that is, the blade of the tongue contacts the back of the upper teeth and the front part of the alveolar ridge)
[m] before [b], [p] or [m].
[ŋ] Before [k], [ɡ], or [ŋ].
[ũ͍] Between [a] and [o] or before [s].
[ĩ] Between [i] and [o].
[ɴ] At the end of an utterance.

Vowels

Japanese has 5 vowels:

Hiragana IPA Notes
[a̠] This is a low central vowel, it is between the English a in "father" and the English a in "dad".
[i] This sounds like the English ee in "feet."
[ü͍] listen  This is a somewhat centralized close back compressed vowel, pronounced with the lips compressed toward each other but neither rounded like [u] nor spread to the sides like [ɯ]. Note, however, that there is no IPA symbol for lip compression, and the combination of round vowel with spread diacritic "[u͍]" is an ad hoc transcription.
[e̞] The e sounds to English speakers like a mix between short e in as in "bed," and long e as in "lay," though it is closer to the former than the latter.
[o̞] listen  This is a pure o, unlike the English one, which is a diphthong. The tongue is kept lowered while pronouncing the Japanese o, and the lips are mostly kept from moving.

The Japanese vowels are pronounced as monophthongs, unlike in English; except for /u/, they are similar to their Spanish or Italian counterparts.

Vowels have a phonemic length distinction (i.e., short vs. long). Cf. contrasting pairs of words like ojisan /ozisaɴ/ "uncle" vs. ojiisan /oziisaɴ/ "grandfather", or tsuki /tuki/ "moon" vs. tsūki /tuuki/ "airflow".

In most phonological analyses, all vowels are treated as occurring with the time frame of one mora. Phonetically long vowels, then, are treated as a sequence of two identical vowels, i.e. ojiisan is /oziisaɴ/ not /oziːsaɴ/.

Within words and phrases, Japanese allows long sequences of phonetic vowels without intervening consonants, although the pitch accent and slight rhythm breaks help track the timing when the vowels are identical.

[hoo.oo.o] hōō o (鳳凰を) 'phoenix (direct object)'
[too.oo.o.oou͍] tōō o ōu (東欧を覆う) 'to cover Eastern Europe'
(this artificial example is not something that would normally be said)

Phonological processes

Japanese contains a number of phonological processes which greatly alter the phonetic realization of consonants and vowels. A few are listed below.

Consonant processes

Weakening

Non-coronal voiced stops /b, ɡ/ between vowels may be weakened to fricatives, especially in fast and/or casual speech:

    /b/bilabial fricative [β]: /abaɾeɾu/[aβaɾeɾu͍] abareru 暴れる 'to behave violently'
    /ɡ/velar fricative [ɣ]: /haɡe/[haɣe] hage はげ 'baldness'

However, /ɡ/ is further complicated by its variant realization as a velar nasal [ŋ]. Standard Japanese speakers can be categorized into 3 groups (A, B, C), which will be explained below. If a speaker pronounces a given word consistently with the allophone [ŋ] (i.e. a B-speaker), that speaker will never have [ɣ] as an allophone in that same word. If a speaker varies between [ŋ] and [ɡ] (i.e. an A-speaker) or is generally consistent in using [ɡ], then the velar fricative [ɣ] is always another possible allophone in fast speech.

/ɡ/ may be weakened to nasal [ŋ] when it occurs within words — this includes not only between vowels but also between a vowel and a consonant. There is a fair amount of variation between speakers, however. Some, such as Vance (1987), have suggested that the variation follows social class; others, such as Akamatsu (1997), suggest that the variation follows age and geographic location. The generalized situation is as follows.

At the beginning of words:

In the middle of simple words (i.e. non-compounds):

In the middle of compound words morpheme-initially:

So, for some speakers the following two words are a minimal pair while for others they are homophonous:

To summarize using the example of hage はげ 'baldness':

Palatalization and affrication

The palatals /i/ and /j/ palatalize the consonants they follow:

    /m/palatalized [mʲ]: /umi/[u͍mʲi] umi 'sea'
    /ɡ/ → palatalized [ɡʲ]: /ɡjoːza/[ɡʲoːza] gyōza ぎょうざ 'fried dumpling'
    etc.    

The coronals /s, z, n, t/ and glottal /h/ are affected as follows:

    /s/ → alveolopalatal fricative [ɕ]: /sio/[ɕi.o] shio 'salt'
    /z/ → alveolopalatal [dʑ] or [ʑ]: /zisiɴ/[dʑiɕĩɴ] jishin 地震 'earthquake';
/ɡozjuu/[ɡodʑu͍u͍] ~ [ɡoʑu͍u͍] gojuu 50 'fifty'
    /n/ → alveolopalatal [ɲ]: /niwa/[n̠ʲiw͍a] niwa 'garden'
    /t/ → alveolopalatal affricate [tɕ]: /tiziɴ/[tɕidʑĩɴ] ~ [tɕiʑĩɴ] chijin 知人 'acquaintance'
    /h/ → palatal fricative [ç]: /hito/[çi̥to] hito 'person'

Of the allophones of /z/, the affricate [dʑ] is most common, especially at the beginning of utterances and after /ɴ/ (or /n/, depending on the analysis), while fricative [ʑ] may occur between vowels. Both sounds, however, are in free variation. The [n̠ʲ] is alveolopalatal, not a true palatal.

In the case of the /s/, /z/, and /t/, when followed by /j/, historically, the consonants were palatalized with /j/ merging into a single pronunciation. In modern Japanese, these are arguably separate phonemes, at least for the portion of the population that pronounces them distinctly in English borrowings.

    /sj/[ɕ] (Romanized as sh): /sjaboɴ//ɕaboɴ/[ɕabõɴ] shabon シャボン 'soap'
    /zj/[dʑ] or [ʑ] (Romanized as j): /zjaɡaimo//dʑaɡaimo/[dʑaŋaimo] じゃがいも 'potato'
    /tj/[tɕ] (Romanized as ch): /tja//tɕa/[tɕa] cha 'tea'

The vowel /u/ also affects consonants that it follows:

    /h/bilabial fricative [ɸ]: /huta/[ɸu͍̥ta] futa ふた 'lid'
    /t/ → dental affricate [ts]: /tuɡi/[tsu͍ŋi] tsugi 'next'

Moraic nasal

Some analyses of Japanese treat the moraic nasal as an archiphoneme /N/. However, other, less abstract approaches take its uvular citation pronunciation as basic, or treat it as a regular coronal /n/. In any case, it undergoes a variety of assimilatory processes. Within words, it is variously:

Some speakers produce /n/ before /z/, while others produce a nasalized vowel before /z/ (see Akamatsu 1997).

Moraic obstruent

In some analyses of Japanese, an archiphoneme /Q/ is posited. However, not all scholars agree that this is the best analysis. In those approaches that incorporate the moraic obstruent, it is said to completely assimilate to the following obstruent, resulting in a geminate (that is, double) consonant. The assimilated /Q/ remains unreleased and thus the geminates are phonetically long consonants. /Q/ does not occur before vowels or nasal consonants. This archiphoneme has a wide variety of phonetic realizations, for example:

    [p̚] before [p]: /niQpoN/[nʲipːõɴ] nippon 日本 'Japan'
    [pʲ̚] before [pʲ]: /haQpjaku/[hapʲːjaku͍] happyaku 八百 '800'
    [s̚] before [s]: /kaQseN/[kasːẽɴ] kassen 合戦 'battle'
    [tʲ̚] before [tɕ]: /saQti/[satːɕi] satchi 察知 'inference'
    etc.

Another analysis of Japanese dispenses with /Q/ and other archiphonemes entirely. In this approach, the words above are phonemicized as shown below:

    [p̚] before [p]: /nippon/[nʲipːõɴ] nippon 日本 'Japan'
    [pʲ̚] before [pʲ]: /happjaku/[hapʲːjaku͍] happyaku '800'
    [s̚] before [s]: /kassen/[kasːẽɴ] kassen 合戦 'battle'
    [tʲ̚] before [tɕ]: /satti/[satːɕi] satchi 察知 'inference'
    etc.

/d, z/ neutralization

Vowel processes

Devoicing

Japanese vowels, especially /i/ and /u/, tend to be devoiced when between unvoiced consonants except when they are in accented moras. Additionally, /i/ and /u/ are optionally devoiced following a voiceless consonant and at the end of an utterance.

    /kutu/[ku̥tsu͍] kutsu 'shoe'
    /ˈsuhada/[suhada] suhada すはだ 'bare skin'   (/su/ is not devoiced since it's accented)
    /hikaɴ/[çi̥kãɴ] hikan 悲観 'pessimism'
    /hikaku/[çi̥kaku͍] or [çi̥kaku̥] hikaku 比較 'comparison'

To a lesser extent /o/ (and even less commonly /a/) may devoice with the further requirement that there be two or more adjacent moras containing /o/.

    /kokoɾo/[ko̥koɺo] kokoro 'heart'

Devoicing is common in even normal slow speech and is not restricted to only fast speech.

The common sentence-ending copula desu is pronounced [desu̥].

Gender roles also play a part: it is regarded as effeminate to pronounce devoiced vowels as voiced, particularly the terminal "u" as in "arimasu". Basilectic varieties of Japanese can sometimes be recognized by their hyper-devoicing, while in some Western dialects and some registers of formal speech, every vowel is pronounced.

Nasalization

Japanese vowels are slightly nasalized when adjacent to nasals /m, n/. Before the moraic nasal /ɴ/, vowels are heavily nasalized:

    /seesaɴ/[seesãɴ] seisan 生産 'production'

Glottal stop insertion

At the beginning and end of utterances, Japanese vowels may be preceded and followed by a glottal stop [ʔ], respectively. This is demonstrated below with the following words (as pronounced in isolation):

    /eɴ/[ẽɴ] ~ [ʔẽɴ]: en 'yen'
    /kisi/[ki̥ɕiʔ]: kishi 'shore'
    /u/[u͍ʔ] ~ [ʔu͍ʔ]: u 'cormorant'

When an utterance-final word is uttered with emphasis, this glottal stop is plainly audible, and is often indicated in the writing system with a small letter tsu called a sokuon.

Moras and phonotactics

If considered as a system of moras instead of syllables (as the katakana and hiragana phonetic writing systems explicitly do), the sound structure is very simple: The language is made of moras, each with the same approximate time value and stress (stress, here, being correlated with pitch, not loudness). The Japanese mora may consist of either a vowel or one of the two moraic consonants, /N/ and /Q/. A vowel may be preceded by an optional (non-moraic) consonant, with or without a palatal glide /j/.

Mora Type Example Japanese moras per word
V /i/ i 'stomach' 1-mora word
CV /te/ te 'hand' 1-mora word
CjV /kja/ kya きゃ '(surprised or scared scream)' 1-mora word
N /N/   in /jo.N/ or /jo.n/ yon 'four' 2-mora word
Q /Q/   in /mi.Q.tu/ or /mi.t.tu/ mittsu 三つ 'three' 3-mora word

Consonantal moras are restricted from occurring word initially, though utterances starting with [n] are possible. Vowels may be long, and consonants may be geminate (doubled). Geminate consonants are limited to a sequence of /Q/ plus a voiceless obstruent, though some words are written with geminate voiced obstruents. In the analysis without archiphonemes, geminate clusters are simply two identical consonants, one after the other.

In the writing system, each kana corresponds to a mora. The moraic /Q/ (i.e., the first half of a geminate cluster) is indicated by a small "tsu" symbol called a sokuon (subscript in katakana, or in hiragana). Long vowels are usually indicated in katakana by a long dash following the first vowel, as in sābisu サービス 'service'. The direction of this dash follows the direction of writing.

In English, stressed syllables in a word are pronounced louder, longer, and with higher pitch, while unstressed syllables are relatively shorter in duration. In Japanese, all moras are pronounced with equal length and loudness. Japanese is therefore said to be a mora-timed language.

On the other hand, since all syllables have equal stress in Japanese, some unstressed syllables in European languages tend to be inaudible to the Japanese ear, leading to confusion.

(Compare to the syllable system of Finnish and Italian.)

Prosody

Main article: Japanese pitch accent

Standard Japanese has a distinctive pitch accent system: a word can have one of its moras bearing an accent or not. An accented mora is pronounced with a relatively high tone and is followed by a drop in pitch. The various Japanese dialects have different accent patterns, and some exhibit more complex prosodic systems.

Notes

  1. ^ Akamatsu (1997) speculates that only 10% of population are consistent [ɡ] users, though this is the norm in western Japan.
  2. ^ Japanese academics represent [ɡ] as and [ŋ] as こ゜.

Bibliography