SELECTED ACOUSTIC CHARACTERISTICS OF EMERGING ESOPHAGEAL SPEECH: CASE STUDY GLENN BINDER, B.A. (SP. & Η. TH.) (WITWATERSRAND) Dept. Speech Pathology and Audiology, University of the Witwatersrand, Johannesburg. SUMMARY The development of esophageal speech was examined in a laryngectomee subject to observe the emergence of selected acoustic characteristics, and their relation to listener intelligibility ratings. Over a two-and-a-half month period, the data from five recording sessions was used for spectrographic and perceptual (listener) analysis. There was evidence to suggest a fairly reliable correlation between emerging acoustic characteristics and increasing perceptual ratings. Acoustic factors coincident with increased intelligibility ratings appeared related to two dimensions: firstly, the increasing pseudoglottic control over esophageal air release; secondly the presence of a mechanism of pharyngeal compression. Increased pseudoglottic control manifested in a reduction of tracheo- esophageal turbulence, and a more efficient burping mode of vibration with clearer formant structure. Spectrographic evidence of a fundamental frequency did not emerge. These dimensions appeared to have potential diagnostic and therapeutic value, rendering an analysis of the patient's developing vocal performance more explicit for both clinician and patient. OPSOMMING Die ontwikkeling van esofagale spraak is ondersoek in 'n gelaringektomeerde geval om die voorkoms van geselekteerde akoestiese eienskappe waar te neem, en hulle verhouding tot luisteraar-verstaanbaarheids-beoordelings. Gedurende 'n twee-en-'n-half-maand periode, is data van vyf opname sessies gebruik vir spektrografiese en luisteraar-analise. Daar is bewyse vir die daarstelling van 'n redelik betroubare korrelasie tussen die akoestiese eienskappe wat te voorskyn kom en die verhoogde perseptuele beoordelings. Akoestiese faktore wat saam val met verhoogde verstaanbaarheids beoordelings blyk verbind te wees ten opsigte van twee aspekte: eerstens, die verhoogde pseudoglottiese beheer oor esofagale lugvrylating; tweedens die teenwoordigheid van 'n meganisme van faringale samepersing. Verhoogde pseudoglottiese beheer het 'n afname van trageo-esofagale turbulensie tot gevolg, en 'n meer doeltreffende wind-opbreek metode van vibrasie met 'n duidelike formantstruktuur. Spektrografiese bewyse van 'n grondtoon het nie te voorskyn gekom nie. Dit blyk dat hierdie aspekte 'n potensiele diagnostiese en terapeutiese waarde het, met die moontlikheid van 'n duideliker analise van die pasient se ontwikkelende vokale prestasie vir die terapeut en pasient. Esophageal speech is one of two alternative methods of functional post- laryngectomy rehabilitation. Investigators over the past few decades have realised the importance of testing its efficacy both subjectively and objectively — one of the more recent objective methods being the use of spectrographic a n a l y s i s . 2 ' 5 ' 6 ' 7 ' 1 0 ' " ' 13>j17>28>29 The purpose of this article is a) to discuss how spectrographic analysis may be used as a diagnostic and therapeutic tool, and also b) to examine the relationship between subjective and objective data obtained. The South African Journal of Communication Disorders, Vol. 25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) Emerging Esophageal Speech 25 It would be interesting to follow the development of speech from a mechanism that at first is halting, and seemingly acquires a great deal of effort to produce faltering sounds, to the effortless smooth production of sustained speech,4 Despite Brooks Hunt and Va's4 valid comment, more than a decade has passed with a dearth of information available on the developmental aspects of esophageal speech acquisition. The importance of measuring a laryngectomized patient's progress in treatment has likewise been stressed by very few writers.3'3 1 However, in the general field of vocal pathology, more recent impetus has been provided by Rontal, Rontal and Rolnick20 who have advocated the use of serially-made spectrograms. These would provide the referring Ear Nose and Throat Specialist, clinician and patient with a visual display reflecting the nature of the vocal rehabilitation process. The value of assessing Rontal et al's20 finding specifically in the progression of esophageal speech development, was strongly indicated. The paucity of literature with regard to the acoustic and perceptual developmental aspects of esophageal speech acquisition is apparent. The establishment of a description of emerging acoustic characteristics correlated with subjective listener ratings, would be of diagnostic, prognostic, and therapeutic value. It would provide the speech therapist with objective, quantitative criteria against which each laryngectomee's esophageal speech development could be assessed. Furthermore, the understanding of the relationship between the emerging dimensions would facilitate a more explicit approach to therapy for both clinician and client. The term 'esophageal speech' refers to speech production resulting from an air supply from the vicarious air chamber located within the esophageal lumen. The cricopharyngeus has been localized as the pseudoglottic site. T h e outflowing air is interrupted by crude pseudoglottic vibrations which alternatively open and close the esophageal lumen. The Kay Sonograph has been used to provide spectrograms which permit an objective, reliable analysis of the phonetic — acoustic elements of esophageal speech. The Sonograph provides: a) narrow-band displays in which harmonic structure is highlighted, and b) broad-band displays which highlight formant structure (represented as dark, thick vertical bands). Formants are resonances of the supraglottal vocal tract, 'upon which the distinctive quality of vowels and resonant consonants depend.' 9 (See Figure 5, 5-9, and 11-14, a and c, along the horizontal and vertical axes respectively). Kerr and Lanham9 have used broad-band spectrograms to analyze esophageal speech, and have described the following phenomena which have been corroborated by other investigators:10' " ' l 3 ' 2 9 Parameter I: Vibratory cycles resulting from the manner of esophageal sphincter control over air release. A. Rapping: A term used by Kerr and Lanham9 to refer to the slow, irregular, rap and tap-like pulses received by the vocal tract functioning Die Suid-Afrikaanse Tydskrif vir Kommunikasieafwykings, Vol.25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) 26 Glenn Binder as a resonating system. (See Figure 6, 1,8,9,10,11,17,18,19 along the horizontal axis). B. Esophageal burping: The raps smooth out into a succession of air puffs more rapid and regular, and lower in amplitude . . . still lacking in harmonic structure.9 These more rapid cricopharyngeal vibrations facilitate the emergence of a clearer formant structure (See Figure 3, 6- 11, a and b, along the horizontal and vertical axes respectively). C. Friction Noise: This random, aperiodic ill-defined turbulence emanates from the more or less open esophageal sphincter. Friction noise has been found to play a major role in producing quality changes 9 ' 1 0 (See Figure 6, 20-22 along the horizontal axis). D. Possible absence of fundamental frequency: Fundamental frequency is the physical correlate of pitch perception,1 0 and is a measure of the fre- quency of components of periodic waves. Despite reference made to the low pitch of esophageal s p e e c h , 2 ' 5 ' 6 · 2 5 ' 2 < " 2 8 ' 2 9 confusions are prevalent in spectrographic findings related to fundamental frequency characteristics of esophageal speakers. Lanham and Kerr1 3 have suggested that these confusions have arisen from a failure to recognize that a measurable fundamental frequency is often lacking13 in esophageal speech. Hence, the need to examine approximate vibratory rates, as opposed to fundamental frequency is apparent. E. Other less obvious vibratory features: (i) vibration of the tone is more rapid at the onset of phonation, and slows down as the amount of air in the . esophagus decrease; (ii) the opposite phenomenon to (i) (occuring more rarely10), where subglottic vibrations increase; and (iii) pseudoglottic vibrations may cease for a moment, and then continue at a slower rate than before, thus splitting the word or vowel into two parts.1 0 Parameter II: Variables dependent upon degree of neuromuscular control, and affecting periodicity of vibrations: A. Sound caused by air intake into the esophagus (swallowing noise): Spectrographically the energy in swallowing is fairly evenly distributed over the entire frequency scale. Its presence represents what the writer referred to as an 'esophageal click' i.e. one considerably intense pulse (rap) followed by a gap of relative silence prior to the onset of phonation. (See Figure 1, 9 and l l i along the horizontal axis). B. Leakage of air from the mouth of the esophagus: This may occur prior to and in anticipation of the onset of voicing, and is indicative of an inadequate neuromuscular mechanism (See Figure 1, t-5 along the horizontal axis). C. Tracheostomal (Stomal) noise: This refers to the powerful expulsion of air accompanied by an undesirable murmur from the patient's tracheostoma. Strong amplification may occur of those frequencies falling within the bandwidths of the resonators, resulting in horizontal, formant-like bands (See Figure 1, A, along the horizontal axis). D. Gurgling: This occurs auditorily before, during, a n d / o r after The South African Journal of Communication Disorders, Vol. 25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) Emerging Esophageal Speech 27 phonation, and is commonly represented spectrographically by highly irregular wide-spaced raps (See Figure 2, 22-38 along the horizontal axis). Parameter III: Formant Structure. A case study was therefore undertaken with the following aims: I. To identify the emergence over time of selected acoustic character- istics which might contribute to a description of the emerging features of esopheal phonation. II. A further aim was to relate intelligibility ratings of developing esophageal speech to the emergence of these acoustic characteristics. S U B J E C T (S) The S was a 37 year old white Portuguese-speaking male. He underwent a total laryngectomy in February 1977 after laryngoscopy biopsy had revealed the presence of widely infiltrating, poorly differentiated squamous cell carcinoma on both his vocal folds. Total laryngectomy involved removal of the larynx from above, leaving the esophagus and cricopharyngeal muscles essentially intact. It was not necessary to perform a radical neck dissection. Prior to the operation he was a healthy man, with no reported speech or hearing problems. Criteria for selection: (a) The S was required to have undergone a laryngectomy operation and to have commenced speech therapy for the first time postoperatively. (b) He was to be capable of producing an esophageal sound involuntarily. This indicates the intactness of his mechanism for esophageal sound production, making an evaluation of his progress from this most basic level possible.3' (c) Potential anatomical and physiological variables that could interfere with the S's development of esophageal speech had to be considered. Thus, before testing commenced, the Ear Nose and Throat Surgeon who performed the laryngectomy was asked questions pertaining to medical information from the Questionnaire for Biographical, Medical, Social Communication and Speech Training Variables.22 (d) Hearing was bilaterally within normal limits. Therapy: Therapy was administered by a fourth year undergraduate Speech and Hearing Therapy student at the Speech and Hearing Clinic, University of the Witwatersrand. Over a seventy-five day testing period, the S had received approximately 54 sessions of speech therapy. Therapy was directed towards the development of esophageal speech as based primarily upon the work of Berlin,3 and Snidecor.27 METHODOLOGY I T A P E R E C O R D I N G P R O C E D U R E S . A. A Revox tape recorder was used to produce high quality recordings at random time intervals over a 2xk month testing period. During each recording session the S would read the following data: Die Suid-Afrikaanse Tydskrif vir Kommunikasieafwykings, Vol. 25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) 28 Glenn Binder (i) A list of exercises which were constructed by the E. These were based upon Snidecor's27 principles of a nine-stage plan of esophageal speech development. (ii) A further group of test words based upon the six groups of consonants found in the Portuguese language. (iii) The S used only one overt inflation in repeating plosive syllables (da) as many times as possible. This was based upon Berlin's3 measure of mean number of syllables per air charge. (iv) The S used only one inflation in repeating the following two sentences. a. The first sentence contained many liquids: («Luisoc iu LiuneJ^nccJeram ocm Bcxriocnum o j p i t a j ) b. The second sentence contained an equal number of voiceless plosives: (Paula i PeJxuJ.au muituf kocuj ikocj ojjuJ emkasoc) The construction of these two sentences (by the' E), was based upon Moolenaar-Bijl's16'17 observation that some form of air conservation, or esophageal reinflation is accomplished by plosive consonants. (v) On the final recording session, the S read twenty unseen minimal pair words. B. On one occasion, a normal (laryngeal) Portuguese-speaking S read and recorded the same set of stimuli (i) — (v), under the same testing conditions as the S. II SPECTROGRAPHIC/OBJECTIVE ASSESSMENT: A Kay Sonagraph Model 6061-B (Kay Elemetrics 6, Pine Brook, N.J.) was used to produce type B/65 broad-band spectrograms from the tape recordings. Selection of Speech Samples and Methods of Spectrographic Recording: Note: Several narrow band spectrograms were made of utterances produced by the S on the final session. However, these failed to show harmonic components of the complex speech signals, so that the data in the present study was derived from broad band spectrograms only. For practical reasons it was not possible to spectrographically analyze all the data recorded over the 75 day testing period. Thus: (i) Ninety-five broad band spectrograms were produced from a list of 19 words recorded by the S at 15 day intervals. These words were selected, as they constituted a representative sample of consonants in the Portuguese language (eg. ( p e j ) , (bo'd°c), (lpik«), (doiz), (soblimu)). (ii) 20 broad band spectrograms were also obtained from the S's production of minimal pair utterances on the final recording session. (iii) 39 broad band spectrograms were likewise made of the normal S's recordings of the 19 words, and 20 minimal pair word list. The South African Journal of Communication Disorders, Vol. 25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) Emerging Esophageal Speech 29 III SUBJECTIVE PERCEPTUAL ASSESSMENT: Perceptual ratings were made in the following manner: (i) A Portuguese-speaking listener who did not know the S, and who was unaware of the nature of the experiment was asked to subjectively rate the intelligibility of all the data recorded by the S at each of the five recording periods selected for analysis. Intelligibility of single utterances was determined by the number of correct responses made by the listener to the recorded speech stimuli. Intelligibility of sentence material was rated on a five-point equal appearing interval scale. (ii) The Ε was assisted by a Phonetician in perceptually calculating the following data on each of the five recording sessions: (a) The mean number of plosive syllables (da) per overt inflation in five trials,3 and (b) T h e number of words per breath charge in the sentences constructed."" 17 Although these measures were made, together with an evaluation of the relationship between the S's quantified performance over time on skills (a) and (b) above, the findings will not be discussed in the present article. IV PHONETIC ANALYSIS: A Phonetician aided the Ε in phonetically transcribing the 115 broad- band spectrograms. This permitted a direct comparison of unusual auditory qualities perceived by the listener with the spectrographic correlates. RESULTS AND DISCUSSION A. OBJECTIVE DATA. The analysis of spectrographic data was descriptive in nature. Based on the work of several a u t h o r i t i e s 9 ' 1 0 ' " ' 1 3 ' 2 9 the development of the acoustic parameters I — III and their related variables (described above), were examined in the data obtained in five recording sessions. However, for the purpose of this article, it will be sufficient to document the major changes that occurred in the utterance (bo'd°<) on sessions I and V, thereby highlighting the emergence of these parameters. Spectrographic representations of utterances produced by the S on the first recording session (Session Ϊ) revealed the inadequacy of his neuro- muscular mechanism in the following manner: Figure I: Spectrogram showing utterance (bo d <*): Session I. Die Suid-Afrikaanse Tydskrif vir Kommunikasieafwykings, Vol. 25 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) 30 Glenn Binder There was a prominence of random respiratory and tracheostomy turbulence over the whole of the visible frequency scale. Resonance amplification created wide horizontal formants with ill-defined edges (See Formant 2, Figure J, 21-31 along the horizontal axis). The overriding turbulence made clear definition of most utterances virtually impossible. This turbulence appeared to be due to the S's attempts to produce louder, more intelligible speech. This resulted in a burst of outrushing stomal noise frequently louder than the concomitant esophageal voice. Most utterances were characterized by several random pulses (raps) before, and immediately after, indicative of swallowing noise caused by air intake into the esophagus. Gurgling noises were spebtrographically and auditorily prominent in several utterances (See Figure 2 below, 22- 38, - (til)). Figure 2: Spectrogram showing utterance (til) — gmvling snitnd. In contrast to the inconsistency that was a noticeable feature of the spectrographic displays of the S's utterances on the first recording session, spectrograms of utterances produced on the final recording session (Session V) revealed the greatest consistency. This seemed to be due to the S's high morale, in conjunction with increased muscular control and co-ordination over his esophageal air release. Figure 3. below reveals the presence of the burping mode of vibration, with considerable energy located in aperiodic components outside the pulses. According to Kerr and Lanham,1' and Kytta,1" this turbulence is due to the probably incompletely interrupted airflow, characteristic of the burping vibratory mode. Figure 3: Spectrogram showing utterance (bo 'd°<): (Session V). The South African Journal of Communication Disorders, Vol. 25 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) Emerging Esophageal Speech 31 The S's neuromuscular control still appeared insufficiently adequate and this in relation to an inadequate esophageal airflow, manifested spectrographically as follows: (i) Widening of pulses were still evident at the end of utterances, disturbing the continuity of the generally clearly defined formant structure. (See Figure 3, 13-14, where the rate of vibration is ~£ 36,99 per second, as compared with around 86,31 per second at 8-9). (ii) Apparent air leakage from the esophageal mouth prior to (eg. Figure 3, 1-3), and during certain of S's utterances. Narrow band spectrograms (eg. See Figure 4, below) made of several utterances revealed an absence of harmonic structure and measureable fundamental frequency. This appeared a characteristic feature of the S's low pitched esophageal phonation, which Lanham and Kerr1 3 have referred to as 'pitchless'. ι yigf Figure 4: Spectrogram (narrow) showing utterance (pe"). General developmental trends viewed spectrographically Several interesting and significant developmental trends appeared to have emerged. The most marked feature throughout was one of inconsistency, postulated to be related partly to the S's neuromuscular, and psychological status at various points in time. A noticeable progression occurred from a rapping to a burping mode of esophageal sphincter vibration in which pulses became significantly more rapid and lower in amplitude. Vibrations, however, were conspicuously lacking in the rapidity and periodicity reportedly found in superior esophageal, and laryngeal s p e a k e r s . 1 0 ' 2 0 ' 2 3 ' 2 9 Compare the vibratory pattern in Figure 3, above, with that in Figure 5 below. Figure 5: Spectrogram showing utterance (bo'd ) of a laryngeal speaker. Die Suid-Afrikaanse Tydskrif vir Kommunikasieafwykings, Vol. 25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) 32 Glenn Binder Generally, spectrographic evidence refuted the concept of fundamental frequency in esophageal phonatitfn. It revealed a vibratory pattern lacking in periodicity, and devoid of a harmonic structure. These measureable rates of irregular vibration appeared to be comparable to what investigators have measured in assessing the fundamental frequency of esophageal speech, «ίο, 11,23,24,» if this is indeed the case, then Tato's finding29 that fundamental frequency increases with training, was supported. Continuous and more clearly defined formant structure appeared dependent upon a reduction of tracheo-esophageal turbulence, as well as an increasingly regular and rapid vibratory pattern. B. SUBJECTIVE DATA. Listener Ratings of Intelligibility. Table I below reveals an increase in the Portuguese-speaking listener's mean intelligibility ratings over the two- and-a-half month recording period. I II III IV V Carry-over task Mean intellig- ibility rating 1 =20 % 1,5=30% 1,5=30% 3 = 6 0 % 3,5=70% 3,75=75% TABLE I: Mean Intelligibility Ratings as a Function of Stage of Esophageal Speech Development These findings would appear to indicate the S's progression over the observed period of speech acquisition towards becoming a more proficiently-rated esophageal speaker. The listener experienced the greatest ease in understanding the carry-over task items, corroborating the spectrographically-observed evidence of generalized cricopharyngeal control. Comparison of Listener Intelligibility Ratings with Spectrographic Displays. This revealed the following: I. Evidence to suggest a fairly reliable correlation between emerging acoustic characteristics and increasing intelligibility ratings. The latter appeared related to the following acoustic variables: (i) a more rapid and regular burping mode of vibration; (ii) clearer, more continuous formant structure; and (iii) a reduction of tracheo-esophageal noise. The variable (iii) appeared to be the most important independent factor in speech intelligibility ratings, and supported findings reported in the literature. 1 0 ' 1 5 ' 2 3 II. The S's attempts to produce effective voice-voiceless plosive and fricative distinctions manifested in interesting inconsistent discrepancies between perceptual and spectrographic data. The presence of these The South African Journal of Communication Disorders Vol. 25 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) Emerging Esophageal Speech 33 discrepancies supported Nichol's18 contention that voicing is a vulnerable feature in esophageal speech intelligibility. Only two of the numerous discrepancies noted, shall be described in the present article, so as to illustrate the presence of certain phenomena: (i) Target: Voiceless unreleased plosive (p,t,k) (Note: In Portuguese, as opposed to English, voiceless plosives are unreleased). Word initial position: (N=48) Fifteen discrepancies occurred between spectrographic and subjective ratings. The listener heard the powerful voiced cognate predominantly, which was spectrographically represented in the following decreasing order of prominence as a: a. Voiceless released plosive. (N= 11) Example: (pe J ) — recording session II (See Figure 6, 7) Listener's response: (be Γ/ p e l ) Figure 6: Spectrogram showing utterance (pel): (Session 11) b. Voiced released plosive. (N=4) Example: (pipa) — recording session IV (See Figure 7, 1-4) Listener's response: (bipa/pipa). Figure 7: Spectrogram showing utterance (pipa): (Session IV) Die Suid-Afrikaanse Tydskrif vir Kommunikasieafwykings, Vol.25. 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) 34 Glenn Binder (ii) Target: Voiceless fricative (s, J) Word final position: (N= 10) The listener heard an affricated (J), which was likewise represented spectrographically as a voiceless released affricate. Example: (doi J) — recording session IV (See Figure 8, 14-17) Listener's response: (doit J/doi J) Figure 8: Spectrogram showing utterance (doiJ): (Session IV) GENERAL CONCLUSION From the results, the writer has postulated that the above-described discrepancies, revealing the S's ability to produce perceptually powerful, apparently ejected plosive and fricative consonants, may have been due to a mechanism of pharyngeal compression or 'squeezing'. It is apparent, that the intensity and power with which the S produced these consonants could not have been achieved by the small and variable amount of air reported to be present in the upper esophagus. 1 2 ' 2 7 ' 2 8 Rather, the presence of an eggressive glottalic-type of air mechanism, in which a substantial amount of pharyngeal air compression occurred, would appear to facilitate this powerful auditory impression. To produce fricatives, it seemed necessary for the S to block his vocal tract first with a tight labial closure, in order to build up sufficient pressure. Affrication resulted both auditorily and spectrographically. This appeared due to the S's inadequate control and consequent inability to produce this articulatory movement of closure lightly. Similarly, when the S produced a voiceless plosive, a powerful auditory impression was heard. This was caused by a strong intra-oral pressure bursting through a very tight closure made by the S in an attempt to produce a sufficiently audible voiceless stop. Spectrographic evidence likewise supported these above-described findings. For example, when producing medial voiced- released stops, the 'voice' bar would frequently disappear, and spectrographically a gap would appear prior to the release burst. This gap likewise, may have been indicative; of pressure build-up by the S, to produce an effective auditory impression.12 (See Figure 9, 20-22, for an illustrative example). The South African Journal of Communication Disorders Vol. 25 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) Emerging Esophageal Speech 35 ' ' h i : · Figure 9: Spectrogram illustrating mechanism of pharyngeal compression in word-medial position. The only reference to this squeezing-type mechanism appears to have been made by Kerr and Lanham,9 and Pellegrini and Raaglini, cited by Brooks Hunt and Va.4 The latter investigators concluded that a squeezing mechanism forces air through a narrow channel, thus producing effective sound. Esophageal speech has been described as being a compensation of high d e g r e e . 1 ' 1 0 ' 2 1 This pharyngeal eggressive air mechanism, postulated to be operating, would appear to constitute a compensatory mechanism, rendering support for Tikofsky's comment that intelligibility results from modifications 'other than those introduced by the use of a different sound-producing mechanism'.30 I M P L I C A T I O N S Observations made in this study suggested the use of a developmental framework to more effectively quantify and assess the patient's process of esophageal speech acquisition. By using serially-made spectrograms, diagnosis and therapy would indeed be interrelated parts of a continuous process of trying to understand an individual and to help him learn.14 The regular clinical use of these objective measures would: (i) Provide the Ear Nose and Throat Specialist, patient and clinician a meaningful mutual evaluation of the changes in the emergence of esophageal speech. The patient would then be reinforced for intermediary successes, with a concomitant reduction of frustration of having to work towards a remote, sometimes obscure goal of intelligible esophageal speech; and (ii) Offer a motivating device which recognizes the need of every patient to gauge his own rehabilitation. In addition, spectrographic data would provide interested family members and friends the opportunity to visualize the patient's progress, and support his effort more realistically. These objective acoustic measures should constantly be compared with subjective perceptual evaluations. Therapy goals and tasks would then remain more realistically orientated towards the goal of communicative effectiveness in the patient's linquistic environment, as opposed to an unobtainable level of intelligibility. For example, with regard to the 'vulnerable feature of voicing in esophageal speech', 1 8 constant comparison of objective and subjective data would enable the clinician to assess whether: (i) the patient attempted to produce the voiceless Die Suid-Afrikaanse Tydskrif vir Kommunikasieafwykings Vol.25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) 36 Glenn Binder consonant and failed to do so audibly, or (ii) whether he omitted it completely. Finally, spectrograms should never be used as a substitute for good clinical acumen. Judgements in the initial assessment and during the therapy process must be based upon an evaluation of all the parameters available to the clinician, and not on the basis of a single test. However, within these limitations, the use of an analytical, developmental framework would make intelligibility a somewhat more attainable goal for the laryngectomized patient. REFERENCES 1. Bentzen, N., Guld, Α., and Rasmussen, H. (1976): X-ray Video-Tape Studies of Laryngectomized Patients. J. Laryngol. Otol., 90(2), 655- 666. 2. Berg, J.W. van den, and Moolenaar-Bijl, A.J. (1959): Cricopharyngeal Sphincter, Pitch, Intensity, and Fluency in Esophageal Speech. Pract. ORL Basel, 21, 298-315. 3. Berlin, C.I. (1963): Clinical Measurement of Esophageal Speech: (l)Methodology and Curves of Skill Acquisition. J. Sp. Hear. Dis., 28(1), 42-51. 4. Brooks Hunt, R., and Va, R. (1964): Rehabilitation of the Laryngectomee. Laryngoscope, 74(1), 382-395. 5. Curry, E.T., (1962): Frequency Measurement and Pitch Perception in Esophageal Speech. Chapter 5 in Speech Rehabilitation of the Laryngectomized, Snidecor, J.C., (Ed.), Springfield, IL.: Charles C. Thomas. 6. Curry, E.T., and Snidecor, J.C. (1961): Physical Measurement and Pitch Perception in Esophageal Speech. Laryngoscope, 71,415-424. 7. Diedrich, W.M., and Youngstrom, K.A. (1960): A Cineradiographic Study of the Pseudoglottis in Laryngectomized Patients. Paper read at the ASH A Convention. 8. Hoops, H.R., and Noll, J.D. (1969): Relationship of Selected Acoustic Variables to Judgements of Esophageal Speech. J. Commun. Disord., 2, 1-13. 9. Kerr, W.A., and Lanham, L.W. (1973): Anatomical and Spectro- graphic Analysis of the Voice in Disease: A Report of Five Cases. J.S. African Speech Hear. Assoc., 20, 81-107. 10. Kytta, J. (1964): Finnish Esophageal Speech After Laryngectomy. Sound Spectrographic and Cineradiographic Studies'. Acta Oto- Laryngol., Stockholm, Supp. 195, 1-102. 11. Kytta, J. (1964): Spectrographic Studies of the Sound Quality of Esophageal Speech. Acta Oto-Laryngol., Supp. 188, 371-378. 12. Lanham, L.W. (1977): Professor, Department of Phonetics and Linguistics, University of the Witwatersrand, Johannesburg. Personal Communication. June. 13. Lanham, L.W., and Kerr, W.A. (1975): Pitch in Esophageal Speech. J.S. African Speech Hear. Assoc., 22, 31-41. The South African Journal of Communication Disorders, Vol. 25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) Emerging Esophageal Speech 37 14. Lerner, J.W. (1971): Children With Learning Disabilities. Boston. Houghton Mifflin Company. 15 Martin, H. (1963): Rehabilitation of the Laryngectomized. Cancer, ' 16(7), 823-841. 16. Moolenaar-Bijl, A.J. (1951): Some Data On Speech Without Larynx. Folia Phoniat., 3, 20-24. 17. Moolenaar-Bijl. A.J. (1953): The Importance of Certain Consonants in Esophageal Voice After Laryngectomy. Ann. ORL, 62(4), 979-988. 18. Nichols, A.C. (1962): Loudness and Quality in Esophageal Speech and The Artificial Larynx. Chapter 6 in Speech Rehabilitation of the Laryngectomized. Snidecor, J.C., (Ed.) Springfield, IL.: Charles C. Thomas. 19. Perkins, W.H. (1971): Vocal Function: Assessment and Therapy. Chapter 20 in Handbook of Speech Pathology and Audiology. Travis, L.E. (Ed.) N.Y.: Appleton-Century Crofts. 20. Rontal, E„ Rontal, M., and Rolnick, M. (1975): Objective Evaluation of Vocal Pathology Using Voice Spectrography. Ann. ORL, 84(5), Part 1, 662-671. 21. Safran, Α., and Szende, T. (1973): Die Osophagusprache Als Sprach- licher Kampensations Vorgang. Folia Phoniat., 25, 347-364. 22. Shames, G.H., Font, J., and Mathews, J. (1963): Factors Related to Speech Proficiency of the Laryngectomized.' J. Speech Hear. Disord., 28(3), 273-287. 23. Shipp, T. (1967): Frequency, Duration, and Perceptual Measures in Relation to Judgements of Alaryngeal Speech Acceptability. J. Speech Hear Res., 10, 417-427. 24. Sisty, N.L., and Weinberg, B. (1972): Formant Frequency Charac- teristics of Esophageal Speech. J. Speech Hear. Res., 15, 439-448. 25. Snidecor, J.C., and Curry, E.T. (1959): Temporal and Pitch Aspects of Superior Esophageal Speech. Ann. ORL, 68, 623-636. 26. Snidecor, J.C., and Curry, E.T. (1960): How Effectively Can the Laryngectomee Expect to Speak? Laryngoscope, 70, 62-67. 27. Snidecor, J.C. (1962): Speech Therapy For Those With Total Laryngectomy. Chapter 9 in Speech Rehabilitation of the Laryngec- tomized. Snidecor, J.C., (Ed.), Springfield, IL.: Charles C. Thomas. 28. Snidecor, J.C., and Isshiki, N. (1965): Vocal and Air-Use Charac- teristics of a Superior Male Esophageal Speaker. Folia Phoniat., 17(3), 217-232. 29. Tato, J.M. (1954): A Study of Sonospectrographic Characteristics of the Voice in Laryngectomized Patients. Acta Oto-Laryngol., 44, 431- 438. 30. Tikofsky, R.S. (1965): A Comparison of the Intelligibility of Esophageal and Normal Speakers. Folia Phoniat., 17, 19-32. 31. Wepman, J.M., MacGahan, J.Α., Rickard, J.C., and Shelton, N.W. (1953): The Objective Measurement of Progressive Esophageal Speech Development. J. Speech Hear. Dis., 18(3), 257-251. Die Suid-Afrikaanse Tydskrif vir Kommunikasieafwykings, Vol. 25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2) If you don't give them a chance to try the 775 PP what will you say when they find out the facts? HEARING AID CENTRE S.A. (PTY.) LTD. announces the new DANA VOX 775PP behind-the-ear Model. * 0 c D a t a i n a c c o r d a n c e w i t h U S A S S 3 , 8 - 1 9 6 7 a n d D I N 4 5 6 0 5 ( p r e v i o u s H A I C d a t a ) : G a i n : O u t p u t : O u t p u t : P 3 O u t p u t : P 2 O u t p u t : P 1 5 8 d B 1 2 9 d B r e 2 0 μ Ρ α 1 2 0 d B r e 2 0 μ Ρ β 1 1 1 d B r e 2 0 μ Ρ β F r e q u e n c y r a n g e : 2 5 0 - 5 5 0 0 H z A technical novelty Is the electric input terminal called the audio-input socket. This audio-input socket facilitates a direct connection to an educational system, FM receiver, television, infra-red receiver or tape recorder employed for teaching or listening purposes. This adaptor which is easily plugged to the lower part of the aid is an optional extra. / Available only from: HEARING AID CENTRE 212 Harley Chambers 187 Jeppe Street JOHANNESBURG 2001 Tel: 37-2643/4/5 HEARING AID CENTRE 633 Kerkade Centre 267 Church Street PRETORIA 0002 Tel: 2-6379 & 2-7777 HEARING AID CENTRE 104 Cartwrights Corner cor Adderley & Darling Sts CAPE TOWN 8001 Tel: 22-3665 Hearing A i d Centre: S u p p l i e r s of H e a r i n g A i d s & Allied E q u i p m e n t ; A u d i o m e t e r s ; Silent C a b i n s ; E a r P r o t e c t i o n . The South African Journal of Communication Disorders, Vol. 25, 1978 R ep ro du ce d by S ab in et G at ew ay u nd er li ce nc e gr an te d by th e P ub lis he r (d at ed 2 01 2)