History Of English Intonation English Language Essay

Published: Last Edited:

This essay has been submitted by a student. This is not an example of the work written by our professional essay writers.

Many people think that pronunciation is what makes up an accent. It may be that pronunciation is very important for an understandable accent. But it is intonation that gives the final touch that makes an accent native.

Intonation is the "music" of a language, and is perhaps the most important element of a good accent. Often we hear someone speaking with perfect grammar, and perfect formation of the sounds of English but with a little something that gives them away as not being nativespeaker.Therefore, it is necessary to realize that there is more than the correct pronunciation of the vowels and consonants of a language. This is very important and we do stress it in other articles. But it is only one of the three components to an accent, pronunciation, intonation, and linking. In other places we will examine the correct pronunciation of vowels and consonants, and linking, the way that syllables within a word, and the beginning and ending of words come together.

Two useful abstractions:

To understand how intonational transcription works, you must understand two

different kinds of abstractions which the system relies on.

The first is a phonetic abstraction, namely that there is something which we can call intonation, a well-defined set of linguistic phenomena all working together to determine the pitch pattern of an utterance. This abstraction is very useful because it is fairly easy to get a good measure of what listeners perceive as the pitch pattern. We can do this by extracting the fundamental frequency of the voiced parts of the utterance, a task which is computationally quite easy. We can then take the fundamental frequency pattern, and analyze it as the result of a set of linguistic categories with a number of specific purposes, and an algorithm which implements the categories as events in the pitch of the utterance. Two points to note here: 1) not all intonational categories have the same function; being an intonational category only means that the category has a specific and categorical effect on the pitch pattern. 2) these categories do not determine all aspects of the pitch pattern; various other non-linguistic differences, such as emotional state, degree of involvement in the speech, and individual differences such as ones due to sex, also affect aspects of the pitch pattern.

The second is a functional abstraction. These intonational categories can be classified with respect to the two major types of prosodic functions. Prosody can be described as consisting of 'head' mechanisms and 'edge' mechanisms.

Head mechanisms are those which act to pick out one piece of an utterance as different than its neighbors, while edge mechanisms indicate which items go with which by marking the edge of a larger grouping. Intonational categories in the English system similarly function either to pick out syllables which are more stressed than their neighbors, or to mark the final edge of a piece of an utterance which is to be interpreted as a group.

Edge marking tones - boundary tones and phrase tones.

The intonational categories which you will likely find most intuitive are the ones which are used to mark edges. One reason for this, I believe, is that the English orthography actually writes some of these differences. For example, consider the following pair of sentences.

1) This is a test sentence.

2) This is a test sentence?

If you convert these into speech (by reading them out loud), you will note a very salient difference in the pitch contour at the end. In 1) the pitch falls throughout the last word, often ending with a little bit of creaky voice, while in 2) the pitch rises throughout the last word, perhaps ending higher than anywhere else in the entire sentence. Such differences in pitch pattern reflect discourse-related differences such as is captured by the use of the question mark in 2).

At a full stop, our system indicates the possibility of four different contours, the two which appear in likely renditions of 1) and 2), and two more, one which you will likely produce in the non-final members of a slowly rendered list, and one which you might produce when calling someone in for dinner. In the transcription system, you will see these represented in the following way (more or less). The fall in 1) is low throughout, and so is indicated as LL% (two lows with the % indicating the final boundary). The rise in 2) is high throughout, with a very brief rise to a super-high at the end, and so is indicated as HH% (two highs). The so-called list boundary starts low and rises slightly at the end, and so is indicated as LH%. The last one which appears in calling chants is basically high throughout, and differs from the HH% (question marker) in that it does not rise to a super high. Thus, since it is high to start with, it starts with a H, and since it is not as high as the super high at the end, it is relatively low, and so is indicated with a L%. This makes for a neat 4-way distinction as below, given with stereotypical examples of places where you might find them. (Note these are not the only places you will find them!)

LL% Terminal fall - statements.

HH% High plateau with upped high at end - covert questions.

LH% Low plateau with little rise at end - internal to lists.

HL% High plateau with no rise to a super-high - end of calling chants

Head marking tones - pitch accents.

If you go back and reproduce the items in 1) and 2) again, and this time concentrate on the area aroundtest, you will very likely notice a large difference in pitch pattern in this region in addition to what is going on at the end. The wordtest is a critical portion of the utterance in most prosodic analyses of English, because it is the last item which bears some degree of stress, usually called tonic or sentence stress. I chose this sentence because the words test sentence form a compound, and one of the peculiarities of English compounds is that they are most stressed on the first half. Thus,test is the most stressed syllable in the last content word in the sentence. In stressed locations such as this, English speakers also implement tonal events. Such events are often called pitch accents,, pitch because they involve parts of the pitch pattern, and accents because they are involved in making a particular syllable more prominent. Stressing this syllable makes it stand out from its neighbors. Thus, the tonal events ontest are head-marking events.

Here, like the boundary tones just discussed, there are tonal differences associated with different discourse conditions. In 1) you very likely will produce the stressed item with a high pitch somewhere on it, while in 2) you very likely will produce the stressed item with a relatively low pitch. Thus, the difference between vanilla statements and covert questions is not only in the presence of LL% boundary tones in one and in HH% boundary tones in the other, but also in the presence of a H accent in one, but a L accent in the other. Since there is a categorical difference in how you use pitch to stress the tonic item, you need to have a categorical difference between H* and L* accents. (The star here indicates that the tone is associated with the stressed syllable.)

In addition to using relatively high and low pitch, there are more complicated rising and falling pitch accents which differ from the simple low and high accents in what they indicate. Our system captures these differences in the local use of pitch in the accent by combining H's and L's in various ways to get rises and falls. Thus, in addition to H* which indicates a generally high pitch around the stress and L* which indicates a generally low pitch around the stress, we can also have H+L's (falling accents), and L+H's (rising accents). To illustrate the difference between a simple H and a L+H, consider the following two conditions:

3)We will be having you read bunches of utterances for some obscure reason

related to why anyone would be interested in linguistics. The first is a test

sentence. It's just there for practice.

4)The first is not a real sentence, the first is a test sentence.

In producing test sentence in 3), it is likely there will not be an appreciable rise in pitch, while in 4), where it explicitly contrasts with the precedingreal, it is likely that there will be an appreciable rise in pitch from the is a tot est. In fact, it is a general property of contrasting items that they get rendered with a relatively low pitch on the material preceding the stressed item and a sudden rise to a peak on the stressed syllable. If you read over 4) several times, emphasizing the contrast more and more each time, this rising pitch event associated witht est will become more and more apparent.. In 4) the rising accent is seen in the relationship in pitch between the items immediately preceding the stressed syllable and the pitch on the stressed syllable itself. However, there are other examples of rising pitch accents in which the low pitch predominates in the stressed syllable, and the high does not become realized until very late in the syllable or in the following syllables. Pierrehumbert & Hirschberg (1991) discuss fairly clear examples of this accent such as the following:

5) A: Alan's such a klutz.

B: He's a good badminton player.

Here the intended meaning of the second response is that B is not sure that playing badminton qualifies one as not being a klutz. In the intended rendition there is a low pitch onbad and a rising pitch on the immediately following syllable, and then another fall to a general low ending in LH% phrase tones. Another example they discuss is the following:

6) A: Did you take out the garbage?

B: Sort of.

A: Sort of!?!

Here, the intended rendition of Sort of starts low inso rt and rises, and then falls and rises again at the end. The intended meaning is very much like that in 5), namely, B is not really sure what she did counts as taking out the garbage. A's rendition ofsort of in the last line has exactly the same pattern as B's, a rise throughsort followed by a fall and a rise at the end, though the rises and falls are more exaggerated. What's important in each of these cases,badminton in 5), and both sort of's in 6), is that the stressed syllable exhibits a distinctly low pitch and the rise which comes much later than the rise in 4).

In order to annotate this difference, Pierrehumbert used the * to indicate which part of the contour is to be associated with the stressed syllable. Thus, the contour in 4) is annotated as a L+H*, since the H part appears on the stressed syllable, and the L part simply comes some time before it. By contrast, the contour in 5) and 6) is annotated as a L*+H, since the L part happens on the stressed syllable, and the H part appears some time thereafter.

Pitch Range. :

One final aspect of intonational modeling must also be mentioned, that is the notion of pitch range. As I noted above, the tone category sequences do not all by themselves determine the pitch contour for an utterance, but other non-linguistic (non- conventionalized) factors also affect the final realization of pitch. One approach to handling these less conventionalized effects, such as what may be due to emotional involvement, is to allow for modulation of the overall range of the pitch movements. The general approach used in most models is to specify a 'pitch window', which indicates the range of pitch to be used at any given time. The top of the 'window' is where you find the H's and the bottom of the window is where you find the L's. This window can be affected by a number of different factors, which work in different ways. Some factors are global in that they typically affect a large portion of speech. Take, for instance, the effects of emotional involvement. When people get irate, there is a strong likelihood that the both H's and L's will be higher, and that the difference between the H's and L's will be bigger. This 'larger and higher window' will often affect entire sentences. You will also likely find such global shifts in window size if you examine how people do narratives which include parentheticals and quotations. Parentheticals often are rendered with a narrower window, while quotes often involve a larger window. Other factors which affect pitch range can be localized to one particular location in the utterance. The most commented upon is the effect of downstep (sometimes called catathesis). Downstep is a very regular lowering and narrowing of the pitch range which happens in the presence of the accents. In Pierrehumbert's analysis, any tone which is composed of two tones (the rising L+H and falling H+L accents) also trigger downstep.

You can easily imagine this effect in an emphatic rendition of the following sentence.

7)I don't want horses and dogs; I want sheep and cats.

If you are contrasting horses with sheep and dogs with cats, you will very likely produce this sentence with L+H accents on all four items (probably L*+H onhorses anddogs, and L+H* onsheep andcat s). If you do so, you will also notice that the second item in each list,dogs andcat s, will both be lower in pitch than the first,horse s, andsheep. This conventionalized lowering is taken to be due to the downstepping effect of the complex rising accents.

One can also see this conventionalized downstepping very clearly in phrases with multiple accents rendered in a finger-wagging lecturing style where the clear intent of the style is to indicate that 'you should know this by now'. For example,

8) You just don't seem to get it. <sigh> Insert tab A into slot B. Repeat it four


In this situation, the rendition of the last two sentences, which we can assume have been rendered several times before in the extended discourse, will likely not exhibit huge rising or falling accents. Nevertheless, I have heard this sort sentence produced with clear downsteps between each accent. Due to sentences like these, one must conclude that the occurrence of downstep does not necessarily demand the obvious existence of rising or falling accents. In Pierrehumbert's analysis, this is due to the H*+L tone category which is locally the same as a plain H*, except that it triggers the lecturing downstep effect. In other systems, such as the ToBI revision, this downstepping is marked with an explicit marker (an exclamation point placed before the affected accent.