Speech Therapy Information and Resources

Voice


ABSTRACT: Voice is the sound produced in the larynx by the vibration of the vocal folds (vocal cords). Certain speech sounds are voiced by this vibration. The human voice conveys information about the speaker through paralinguistic features such as pitch, loudness, resonance, quality and flexibility. Speakers vary these paralinguistic features to infuse their talk with emotion. Common voice difficulties include colds and laryngitis, and vocal misuse. Voice disorders persist over time and are characterised as organic, psychogenic or functional.


At a fundamental level, voice is simply the noise created by the vibration of the vocal folds (vocal cords) within the larynx. In English all speech sounds are produced on an outgoing (egressive) air stream from the lungs, i.e. air from the lungs is directed up through the trachea, through the space between the vocal folds in the larynx (i.e. the glottis) and eventually out through either the mouth or nose. This air stream may either be vocalised by the vibration of the vocal folds or non-vocalised. The vocalised air stream is used to produce voiced speech sounds, e.g. all vowels, and voiced consonants such as ‘b’, ‘d’ and ‘g’. A non-vocalised air stream is used to produce voiceless consonants, e.g. ‘p’, ‘t’ and ‘s’.

Paralinguistic Features

The production of the human voice occurs at the physiological level (see The Communication Chain in Communication Theory). More specifically, it occurs at the laryngeal level. Consequently, with the exception of contrasting voiced and voiceless speech sounds, voice is not a linguistic phenomenon. Rather, various features of the voice work together with verbal language to express meaning. Accordingly these features are referred to as paralinguistic features.

The paralinguistic features of the voice convey a lot of information about the speaker. For example, even when we cannot see the person who is speaking (over the telephone, for instance) we usually know whether it is a male or a female and, very often, roughly how old the person is: usually we can tell whether it is a child or an adult. We may even be able to tell which part of the country they come from by their accent. We will discuss the most salient paralinguistic features below.


Pitch

The pitch of the voice refers to how high or low the note produced by the vibrating vocal folds appears to be. The faster the vocal folds vibrate, the higher the pitch. Conversely, slowly vibrating folds will produce a lower pitch. The pitch of the note is measured by its frequency in hertz (Hz), i.e. the number of vibrations per second. The note A above middle C on a modern piano has a frequency of 440 Hz. The optimum or average pitch for the speaking voice varies from person to person but, typically, men (around 128 Hz) will speak with a lower pitch than women (around 225 Hz).
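The relationship between vibration rate and pitch can be made concrete with a short sketch. It is an illustration only: the function names are hypothetical, and it assumes the modern equal-tempered scale with A above middle C (written A4) at 440 Hz, in which each semitone multiplies the frequency by the twelfth root of two. It converts a count of vibrations into hertz and finds the musical note nearest to a given frequency.

```python
import math

A4_HZ = 440.0  # A above middle C, as given in the text

# Note names within one octave, starting from C (assumed naming convention).
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def frequency_hz(vibrations: float, seconds: float) -> float:
    """Frequency is the number of vibrations per second."""
    return vibrations / seconds


def semitones_from_a4(freq_hz: float) -> float:
    """Distance from A4 in equal-tempered semitones (negative means lower)."""
    return 12 * math.log2(freq_hz / A4_HZ)


def nearest_note(freq_hz: float) -> str:
    """Name of the closest equal-tempered note, e.g. 'A4'."""
    # MIDI-style numbering: A4 is note 69 and middle C (C4) is note 60.
    midi = round(69 + semitones_from_a4(freq_hz))
    octave = midi // 12 - 1
    return f"{NOTE_NAMES[midi % 12]}{octave}"


if __name__ == "__main__":
    for label, hz in [("typical male", 128.0), ("typical female", 225.0)]:
        print(label, hz, "Hz is nearest to", nearest_note(hz))
```

With the average speaking pitches quoted above, 128 Hz falls nearest to C3 (the C one octave below middle C) and 225 Hz nearest to A3 (the A just below middle C).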

Loudness (volume)

Whereas pitch is determined by the speed of the vibrations of the vocal folds, loudness is determined by the strength of their vibration. This is controlled mainly by the force with which the air from the lungs is allowed to pass through the larynx. It is important to understand that the pitch of the voice can remain constant whilst the loudness of that particular pitch can be varied. In other words, it is possible to keep the vibration frequency the same but to increase the strength of the vibration by forcing through more air.

Loudness is measured in decibels (dB). A whisper is typically around 30 dB. In comparison, someone shouting may be around 70-80 dB. A jet engine at close range can produce 120 dB or more, and sound at around this level usually creates a sensation of pain, i.e. it reaches the pain threshold for hearing.
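Because the decibel scale is logarithmic, a small change in decibels corresponds to a large change in the underlying sound pressure. The following sketch illustrates this under stated assumptions: it uses the conventional reference pressure for sound in air (20 micropascals) and the standard sound pressure level formula, and the function names are hypothetical.

```python
import math

# Conventional reference pressure for sound in air: 20 micropascals.
REFERENCE_PRESSURE_PA = 20e-6


def spl_db(pressure_pa: float) -> float:
    """Sound pressure level in dB for a given sound pressure in pascals."""
    return 20 * math.log10(pressure_pa / REFERENCE_PRESSURE_PA)


def pressure_ratio(db_a: float, db_b: float) -> float:
    """How many times greater the sound pressure at db_a is than at db_b."""
    return 10 ** ((db_a - db_b) / 20)


if __name__ == "__main__":
    # Compare the two ends of the shouting range quoted in the text.
    print(round(pressure_ratio(80, 70), 2))
```

On this scale every 20 dB step is a tenfold increase in sound pressure, so a shout at 80 dB involves roughly three times the sound pressure of one at 70 dB.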


Resonance

When the egressive air stream from the lungs is vocalised by the vibrating vocal folds it is amplified by resonating in the chest, throat, mouth and the sinuses of the face and forehead. This resonance gives the voice a characteristic musical quality, or timbre, which is determined by such things as one’s size, the shape of the chest cavity, the mass of the vocal folds, and so on.

The term ‘resonance’ may also refer to the relative balance of sound produced through the mouth or through the nose. Certain English speech sounds are produced by allowing the escaping air to pass through the nose. These are the nasal consonants: ‘m’ as in mum, ‘n’ as in nut and ‘ng’ as in wing (see Articulation for an explanation of nasals). When these are produced accurately there should be no escape of air through the mouth. If too little air passes through the nose, the speaker is said to be hyponasal. This will occur when a person has a cold and the nose becomes blocked: the air can then only escape through the mouth, which gives rise to a characteristic lack of nasal resonance.

Conversely, all other English speech sounds are produced by the air escaping through the mouth, i.e. no air should escape through the nose. If the speaker either allows too much air to escape through the nose, or cannot prevent air escaping in this manner (as a result of a cleft palate, for example), the voice will sound hypernasal, commonly called nasal speech.


Quality

The quality of the voice refers to the complexity of the note produced by the vibration of the vocal folds. The term is frequently used to describe how aesthetically pleasing the voice is. It is extremely difficult, however, to define a typical voice. Nevertheless, the human voice should be pleasant with an engaging musical character and the absence of any interfering noise. Three broad quality types are recognised:

Breathy voice

A person speaking with an excessively breathy voice produces an audible escape of air through the glottis. It is often produced as a result of insufficient closure of the vocal folds, which leaves a small chink through which air from the lungs spills. This could be due to an organic problem such as vocal nodules, or to a lack of breath control. Breathy voices typically sound weak, as they are often produced with reduced loudness.

Hoarse voice

In simple terms, hoarseness is a disruption of the usually stable note produced by the vibration of the vocal folds, as a result of airflow turbulence. This turbulence can be created because the weight or tension of one vocal fold relative to the other is altered. Consequently, they no longer vibrate in sync and this creates the noise we perceive as hoarseness. The weight or tension of the vocal folds can be affected by such things as the build-up of mucus (e.g. when one has a cold), growths (e.g. vocal nodules, polyps), and muscle tension.

Typically, hoarseness is associated with weak vocalisation and lowered pitch: the voice sounds rough. It may be accompanied by occasional bouts of breathiness, in which case it is sometimes referred to as husky voice.

Harsh voice

A harsh voice is associated with tension in the muscles of the larynx, those involved with breathing and, often, the vocal folds themselves. There is typically a hard glottal attack, i.e. the speaker brings the vocal folds together abruptly and with greater force than is necessary. This obtrusive glottal attack creates an unpleasant sound. In contrast to a hoarse voice, harsh voice is characteristically associated with a raised pitch.


Flexibility

There is a great deal of emotional overlay in the production of voice. For example, consider someone who is clinically depressed. Frequently, this condition is accompanied by changes in vocal characteristics: the voice has reduced loudness, is monotonous and lacks energy. In contrast, an enthusiastic person frequently has a faster rate of speech, increased loudness and possibly exaggerated pitch variations.

Often, the most effective communicators are those who can effortlessly vary paralinguistic features to create an interesting and colourful voice which is capable of expressing a range of intellectual and emotional meanings.

Development of the Voice

As with language development and speech development, voice development follows a predictable pattern.

There is a remarkable similarity in the quality of babies’ first cries. The pitch is about the note A above middle C on the piano. Young infants are able to scream at very high pitches but this ability disappears as the larynx grows. Nearly all the first noises, or vocalisations, that the baby makes are produced by bringing the vocal folds together very gently. However, by about two months of age the various cries become differentiated. Depending on its needs and moods, the baby will bring the vocal folds together either gently or very hard, as with an angry cry of discomfort.

At about six months of age, when the child is beginning to babble (see Language Development for an explanation of babbling), they will use a pitch range that is equivalent to the middle range of an adult mezzo-soprano. The larynx continues to grow and the child’s singing voice develops until, at about six years of age, the vocal range of the C major scale is achieved. During puberty the sex glands become active and there is a rapid increase in the size of the larynx forcing the voice into a lower pitch. This is particularly noticeable in boys and it is known as vocal mutation, i.e. when the voice ‘breaks’. The implication here, of course, is that vocal mutation also occurs in girls. This, in fact, is true but as the larynx does not grow as large in girls the changes are less apparent.

Common Voice Difficulties

A wide variety of voice behaviours are within normal limits. For example, a person’s voice may vary with the amount it is used and how it is used. Differences may also depend on fatigue and the person’s emotional state. The voice may sound different in the morning to how it sounds at night. Having said this, most people will have experienced a voice difficulty at some time in their lives. Two common voice difficulties are (1) colds and laryngitis, and (2) vocal misuse.

Colds and laryngitis

A heavy cold or laryngitis often causes the voice to become hoarse and rough and it may be difficult, even painful, for the sufferer to speak loudly. The voice seems to keep coming and going. This is because during colds and bouts of laryngitis there is a build-up of mucus on the vocal folds. Mucus is a colourless fluid secreted by special cells in the larynx. It is somewhat thicker than water and it lubricates the vocal folds. The excessive build-up of mucus makes the vocal folds heavy and they therefore move more sluggishly. This results in a deeper voice. Also, because the mucus doesn’t form an even film over both of the vocal folds, they do not come together evenly and extra air escapes as the folds try to shut together. This is what creates the breathy, grating sound known as hoarseness.

There is little that needs to be done to remediate this condition other than to ensure that the sufferer receives treatment from a doctor for any bacterial infection, with antibiotics if this is necessary, and to encourage the person not to overuse their voice.

Vocal misuse

Another common voice difficulty occurs when people abuse the voice. For example, a child in the playground or an adult at a football match may shout too loudly and for too long. The voice ‘hurts’ because during shouting the vocal folds come together with a great deal of force. Too much of this irritates the vocal folds and they can become inflamed.

Often the best remedy is to rest the voice and adopt some basic vocal hygiene strategies (see Voice Care for Adults and Voice Care for Children), e.g. do not shout at the top of your voice and do not keep shouting for long periods; take plenty of rest periods between long stretches of talking; have a drink handy to lubricate your vocal tract.

A characteristic of the above difficulties is that they do not usually endure over an extended period of time. However, alterations in paralinguistic features that do persist may be the first sign of a voice disorder.

Voice Disorders

Voice disorders may be divided into three groups:

  • organic: caused by impairments of anatomical structure, neurological impairments or physiological impairments (e.g. growths on the vocal folds; injuries which damage the vocal apparatus; muscular dystrophy; motor neurone disease; thyroid disease)
  • psychogenic: emotional stress and certain psychological conditions can lead to persistent voice changes
  • functional: this is the name given to voice difficulties which persist in the absence of any obvious anatomical, neurological, or other organic difficulties affecting the larynx

There are many different types of voice disorder, too many to discuss here. However, you can read more about voice disorders here.

