1. Introduction
One of the phonetic and phonological phenomena observed in many languages is the preconsonantal vowel (syllable) shortening (e.g., English: House & Fairbanks, 1953; Peterson & Lehiste, 1960; French: Chen, 1970; Mack, 1982; Spanish: Delattre, 1962; Zimmerman & Sapon, 1958; Norwegian: Fintoft, 1961; Dutch: Slis & Cohen, 1969; Van den Berg, 1988; Japanese: Port, et al., 1987; Sato, 1993; Tamil: Balasubramanian, 1981; Hindi: Maddieson & Gandour, 1975; Russian: Chen, 1970; German: Kohler, 1979; Swedish: Carlson & Granström, 1986; Elert, 1964; Arabic: Alghamdi, 1990; Korean: Chen, 1970; Kim, 1965; Kim, 1987; Oh, 2002; Oh & Johnson, 1997; Yun, 2004, 2009, 2010). That is, a vowel or syllable in many languages is shorter before phonologically voiceless or tense obstruents within a syllable and/or across the syllable boundary while it is longer before their voiced or lax cognates.
On the other hand, it is surprising that few studies of Chinese have been reported with regard to the preconsonantal vowel (syllable) shortening, though Chinese is a language that is currently used by more than 1.4 billion people even in mainland China only. The shortening has been observed in languages whose stops are distinguished by the feature ±voice or ±tense. Interestingly, Chinese stops are not distinguished by voicing or tenseness. They are all phonologically voiceless as Korean stops are. However, unlike Korean stops with three-way distinctions by aspiration and tenseness (Kim, 1965), they are distinguished only by the feature aspiration (Lin, 2007). Now, it is worthwhile to examine whether the preceding vowel duration varies as a function of the feature aspiration of the following stop in Chinese.
Though the preconsonantal vowel (syllable) shortening is found in many languages, the degree or pattern of the durational variation differs between languages. First, it has been known that out of many languages, English reveals the greatest variation of vowel duration as a function of the following consonant, i.e., the vowel before a voiceless consonant is much shorter than that before its voiced counterpart (Chen, 1970; Zimmerman & Sapon, 1958). For instance, Chen (1970) calculated the average ratios of vowel duration before voiceless vs. voiced consonants in seven languages. The results showed that English had the lowest ratio: 0.61 in English, 0.87 in French, 0.82 in Russian, 0.78 in Korean, 0.90 in German, 0.86 in Spanish and 0.82 in Norwegian. However, especially the comparison between English and Korean was not fair because most of the test words in English were mono-syllabic whereas all the words were bi-syllabic in Korean. With regard to this, it should be noted that Korean stops become neutralized at coda position, i.e., no phonetic distinction between tense/lax stops (Martin, 1951). Unlike within a syllable, Korean is more remarkable than English in the variation of the preceding vowel duration across a syllable boundary (Kim, 1987; Yun, 2004, 2009, 2010).
Second, the following consonant closure duration also varies from language to language. For example, Chen (1970) reported that while the average durational ratio between English voiced consonant closures and their voiceless cognates was 1 : 1.59, it was 1 : 2.3 between Korean ones - Chen (1970) classified Korean consonants (stops and affricates) into two groups (voiced vs. voiceless) as in English. Kim (1987) observed that for English the ratio was 1 : 1.22, whereas for Korean the ratio between the three types (phonologically voiceless lax unaspirated stops /p, t, k/, voiceless tense unaspirated stops /p', t', k'/, and voiceless tense aspirated stops /ph, th, kh/) of stop closure durations was 1 : 3.36 : 2.52. Yun (2009) also reported that the average ratio between the three types of Korean stop closure durations was 1 (lax unaspirated) : 2.55 (unaspirated tense) : 2.14 (aspirated tense). The three studies suggest that the mean ratio between closure durations of tense vs. lax or voiceless vs. voiced consonants (stops and affricates) is much greater in Korean than in English.
Third, in English and Japanese, the preceding vowel (V) and the following stop closure (C) fully compensate for each other in duration (Port, 1981; Port, et al., 1987). That is, the micro-units (V, C) show inverse temporal variations, whereas the macro-unit (V+C) remains unchanged irrespective of the feature ±voice of the following consonant C. In Korean, however, the sum of vowel and consonant durations differs depending on the feature ±tense of the following consonant, i.e., V + tense C is longer than V + lax C. This means that the macro-unit V+C of Korean shows a pattern different from those of English and Japanese. The cross-linguistically different timing patterns between V and C are demonstrated in Yun (2010) where Korean, English, Japanese, and Arabic data are compared (see Figure 1). The compared data include Korean words (papa vs. pap'a vs. papha), English words (dibber vs. dipper and deeber vs. deeper) from Port (1981), Japanese words (kada vs. kata) from Port, et al. (1987), and Arabic words (badar vs. batar) from Alghamdi (1990). In Korean, solid lines with white circles indicate V and dotted lines VOT+V. Solid lines with white circles indicate V in Arabic, but VOT+V in English and Japanese. Figure 1 clearly shows that Korean V and C partly compensate for each other whereas the V and C reveal full compensations in the other three languages.
Not only V+C but also words with an intervocalic stop in Korean are significantly longer when the stop is tense than when it is lax. For example, Sato (1993) reported that /mamp'i/ and /mamphi/, which were similar in duration, were longer than /mampi/. Yun (2004) also showed that /ap'a/ and /apha/, which were similar in duration, were longer than /apa/. It is because the durational differentials in V+C remain at word level in Korean (Yun, 2004).
As mentioned so far, many languages show the preconsonantal vowel (syllable) shortening, but the degree or pattern of temporal variation differs between languages. Chinese also must have its own typical timing pattern between V and C. In order to discover the pattern, this study examined both the temporal micro-units (V, C, VOT) and macro-units (V+C, word) of Chinese.
2. Method
Five (one male and four female) native speakers of Mandarin took part in the recording. They were all undergraduate students of Hankuk University of Foreign Studies at their twenties.
As seen in Table 1, 15 pairs of bi-syllabic Chinese words were chosen as stimuli. All the words (five pairs with bilabial stops /p, ph/; five pairs with alveolar stops /t, th/; five pairs with velar stops /k, kh/) have one of Chinese unaspirated/aspirated stops (/p, ph, t, th, k, kh/) at the onset of the second syllable. The target words were embedded in a carrier sentence, 请把 _____ 再说一遍 [qiŋ pa _____ zài shuō yībiàn] ‘Please say _____ again.’ As a result, we obtained 30 sentences. Reading lists were prepared in which those sentences were written in ten different orders. The five informants were asked to read the lists at their normal rate to produce 1,500 tokens (5 informants × 30 words × 10 lists). Their speech was directly recorded into a computer through a microphone in the sound treated recording room of the Speech Laboratory at Hankuk University of Foreign Studies. The recording was digitised at a sampling rate of 16 kHz with 16 bit resolution and saved as files to be processed by the software package Praat.
First, we measured the 1st syllable duration - when the onset is a lateral /l/: /l/ + preceding vowel duration (V1); when the onset is a stop: closure duration (C1) + VOT1 + preceding vowel duration (V1); when the onset is an affricate (e.g., /qī/, /jī/): C1 + affrication (for convenience it is counted as VOT1) + V1; when the onset is a fricative (e.g, /shéŋ/, /shí/): frication + V1 (for convenience the coda /ŋ/ was counted as part of V1); when the onset is a nasal (e.g., /miǎn/): nasal duration + V1 (for convenience the final /n/ was counted as part of V1). Second, we measured the 2nd syllable duration - closure duration (C2) of the intervocalic stops + aspiration (VOT2) + vowel (V2) (for convenience the codas /ŋ, n/ were counted as part of V2). Finally, the whole word duration was measured.
We obtained averages (ms), SD and SE, with performing repeated measures ANOVAs.
3. Results and Discussion
As seen in Table 2, the preceding vowel (V1) is generally longer when the intervocalic consonant is unaspirated than when it is aspirated. But statistically significant differences were observed from only two out of the 15 pairs, and even the differences were not great (p = 0.032, 0.039). Furthermore, some vowels were longer before aspirated stops (e.g., /gǎipiàn/ 147 ms vs. /gàiphiàn/ 156 ms). Therefore, it can be said that overall, the effect of the feature aspiration on V1 was neither significant nor consistent though V1 is liable to be a little longer before an unaspirated stop. This means that there is no significant preconsonantal vowel shortening in Chinese.
Repeated measures ANOVAs on [(VOT1)+V1] were performed in accordance with Port, et al. (1987) which regards VOT as part of the following vowel. The results were similar to those of V1 (see Table 3). That is, VOT1 (or affrication)+V1 was generally longer before unaspirated stops than aspirated ones. However, except for three pairs, the differences were not significant. In addition, two pairs showed the opposite results, i.e., /gǎipiàn/ (163 ms) vs. /gàiphiàn/ (172 ms); /jǐkè/ (165 ms) vs. /jīkhě/ (174 ms), and one pair had the same length, i.e., /shěŋtào/ (156 ms) vs. /shéŋthào/ (156 ms). To summarize, a vowel tends to be a little longer before unaspirated stops, but the trend is weak and inconsistent. It again indicates that Chinese has no significant preconsonantal vowel shortening. This is very different from English and Korean in which the distinctive feature voicing or tenseness of the following consonant causes significant durational differences in the preceding vowel.
Closure duration (C2) of unaspirated intervocalic stops was significantly longer than that of aspirated ones in six out of the 15 pairs (see Table 4). However, the trend was not consistent. In the other nine pairs, C2 did not statistically differ depending on the feature aspiration of the intervocalic stops. C2 was even shorter in unaspirated stops than in aspirated ones in three pairs (/gēpó/ (41 ms) vs. /gēphò/ (45 ms); /tìtài/ (37 ms) vs. /tǐthài/ (41 ms); /tóukǎo/ (30 ms) vs. /tóukhào/) (32 ms)), or the same in one pair (/lǎopó/ (60 ms) vs. /lǎophó/ (60 ms)). All in all, however, we can say that the closure duration of Chinese unaspirated stops is similar to or longer than that of aspirated ones.
As seen in Table 5, when the intervocalic stop (C2) is unaspirated, (VOT1)+V1+C2 was longer in seven of the 15 pairs, and it was almost significantly longer in three other pairs (/gépì/ vs. /géphì/, p=0.061; /gǎipiàn/ vs. /gǎiphiàn/, p=0.061; /jǐkè/ vs. /jǐkhè/, p=0.055). In the other five pairs, (VOT1)+V1+C2 was a little longer when the stop is unaspirated, though the differences were not significant (p>0.1). In conclusion, the durational unit of (VOT1)+V1+C2 in Chinese tends to be longer when the stop is unaspirated than aspirated.
As said earlier, many languages have a compensatory temporal movement between the preceding vowel and the following consonant, whether the compensation is full or partial. However, such a compensation did hardly occur in Chinese, considering the durational variations of (VOT1)+V1 and C2 that were introduced in the previous sections. That is, regardless of statistical significance, each of the preceding vowel (VOT1)+V1 and the following consonant (C2) was generally longer when the stop is unaspirated than aspirated. Therefore, the sum of (VOT1)+V1 and C2 also was longer when the stop is unaspirated. A relatively longer preceding vowel and a relatively longer following unaspirated stop made a longer duration than the duration a relatively shorter vowel and a relatively shorter aspirated stop did. Compensation between V and C takes place with a long vowel and a short consonant or with a short vowel and a long consonant. However, it did not occur in Chinese.
One pair (/kǔké/ vs. /kùkhè/) out of the test words is demonstrated in Figure 2. No noticeable durational compensation is seen between V and C in Figure 2, while clear compensations are seen in the four languages in Figure 1.
As expected, all of the 15 word pairs were significantly distinguished by VOT2 (see Table 6). This proves that aspiration is the distinctive feature of Chinese stops. Velar consonants /k, kh/ had relatively longer VOTs than bilabial and alveolar stops as in other languages, e.g., Korean. In particular, unaspirated /k/ revealed notably longer VOTs than /p, t/.
Repeated measures ANOVAs showed that V2 in 13 word pairs was significantly longer after unaspirated stops (see Table 7). Another pair (/tìtài/ vs. /tǐthài/), which did not significantly differ in V2, also had the same trend: V2 (/ài/) was longer after /t/ (177 ms) than after /th/ (151 ms). Those results lead to a compensatory relationship between VOT and the following V, which could be the reason why VOT is often regarded as part of the following vowel.
Statistical analyses proved that VOT2+V2 in 12 pairs was significantly longer when the intervocalic stop is aspirated than unaspirated (see Table 8). In the other three pairs also, VOT2+V2, if not significant, was longer after an aspirated stop. This means that the opposite durational variations of VOT and V2 only partly compensate for each other.
Repeated measures ANOVAs yielded that words with an intervocalic aspirated stop were significantly longer than those with its unaspirated cognate in eight pairs (see Table 9). The other seven pairs also generated a little longer word durations when the intervocalic stop is aspirated. In Korean, the feature tenseness causes significant differentials between tense and lax stop closure durations, and the differentials contribute to significant durational differences between words with tense/lax intervocalic stops (Yun, 2010). In Chinese, VOT following intervocalic stop closure functions like Korean stop closure though its effect on word duration is not so strong or consistent as Korean stop closure.
As seen in the previous sections, the preceding vowel (VOT1)+V1 and the following intervocalic consonant C2 were generally longer when the stop is unaspirated, and so was their sum (VOT1)+V1+C2. In addition, V2 after VOT2 was generally longer when the intervocalic stop is unaspirated. Thus, the durational units before and after VOT2 varied to reduce the differences between VOT2s of unaspirated/aspirated stops. Nevertheless, words with an intervocalic aspirated stop were often significantly longer than those with its unaspirated cognate. It suggests that VOT2 is crucial to word duration in Chinese.
4. Summary and Conclusion
This study compared 15 pairs of bi-syllabic Chinese words with intervocalic unaspirated/aspirated stops to verify whether Chinese has the preconsonantal vowel shortening that is observed in many languages. The results were (1) preceding vowel shortening hardly occurred as a function of the feature aspiration of the following stop, (2) the following unaspirated stop closure (C) was similar to or longer than its aspirated cognate, (3) the preceding vowel duration (VOT1)+V1 and the following intervocalic stop closure duration C2 had no compensatory relationship unlike in other languages, e.g., Korean, English, Japanese, and Arabic where a long preceding vowel is followed by a short stop closure and vice versa. Rather, both the preceding vowel and the following stop closure tended to be longer when the intervocalic stop is unaspirated; so, the sum of the vowel duration and closure duration was also longer when the stop is unaspirated, (4) VOT2 was significantly longer when the stop is aspirated than when unaspirated, (5) V2 following VOT2 was significantly longer when the intervocalic stop is unaspirated, and it partly compensated for the differential between VOT2s before V2, (6) despite the partial compensation, VOT2+V2 was longer when the intervocalic stop is aspirated, (7) words with an intervocalic aspirated stop were longer than those with its unaspirated cognate though the differences were not always significant. This implies that the longer aspiration (VOT2) in words with an aspirated stop was not well absorbed by the other components of the words. Considering all, we can say that while VOT is the most important factor for deciding the timing structure of Chinese words with intervocalic stops, closure duration is crucial for Korean and many other languages. The language specific pattern of the preconsonantal vowel shortening should be incorporated into the phonology of Chinese.
On the other hand, Chinese stops seem to be all tense as well as voiceless. First, impr essionistically Chinese unaspirated stops sound like Korean tense unaspirated stops, while Chinese aspirated stops sound like Korean tense aspirated stops. Second, Chinese unaspirated/aspirated stops were often similar in duration though unaspirated stops were sometimes longer. Third, the effects of the following unaspirated/aspirated stops on the preceding vowel (VOT1)+V1 were generally similar, i.e., the duration of (VOT1)+V1 remained similar irrespective of the feature of the following stops. Therefore, if duration is a realization of utterance energy, Chinese unaspirated/aspirated stops are likely to have a similar degree of tenseness. Of course, whether Chinese stops are all tense or not, the feature tenseness does not distinguish Chinese stops as voicing does not.