Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Joseph S. Perkell is active.

Publication


Featured researches published by Joseph S. Perkell.


Journal of the Acoustical Society of America | 1992

Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements

Joseph S. Perkell; Marc H. Cohen; Mario A. Svirsky; Melanie L. Matthies; Iñaki Garabieta; Michel T. T. Jackson

This paper describes two electromagnetic midsagittal articulometer (EMMA) systems that were developed for transducing articulatory movements during speech production. Alternating magnetic fields are generated by transmitter coils that are mounted in an assembly that fits on the head of a speaker. The fields induce alternating voltages in a number of small transducer coils that are attached to articulators in the midline plane, inside and outside the vocal tract. The transducers are connected by fine lead wires to receiver electronics whose output voltages are processed to yield measures of transducer locations as a function of time. Measurement error can arise with this method, because as the articulators move and change shape, the transducers can undergo a varying amount of rotational misalignment with respect to the transmitter axes; both systems are designed to correct for transducer misalignment. For this purpose, one system uses two transmitters and biaxial transducers; the other uses three transmitters and single-axis transducers. The systems have been compared with one another in terms of their performance, human subjects compatibility, and ease of use. Both systems can produce useful midsagittal-plane data on articular movement, and each one has a specific set of advantages and limitations. (Two commercially available systems are also described briefly for comparison purposes). If appropriate experimental controls are used, the three-transmitter system is preferable for practical reasons.


Speech Communication | 1997

Speech motor control: acoustic goals, saturation effects, auditory feedback and internal models

Joseph S. Perkell; Melanie L. Matthies; Harlan Lane; Frank H. Guenther; Reiner Wilhelms‐Tricarico; Jane Wozniak; Peter Guiod

Abstract A theoretical overview and supporting data are presented about the control of the segmental component of speech production. Findings of “motor-equivalent” trading relations between the contributions of two constrictions to the same acoustic transfer function provide preliminary support for the idea that segmental control is based on acoustic or auditory-perceptual goals. The goals are determined partly by non-linear, quantal relations (called “saturation effects”) between motor commands and articulatory movements and between articulation and sound. Since processing times would be too long to allow the use of auditory feedback for closed-loop error correction in achieving acoustic goals, the control mechanism must use a robust “internal model” of the relation between articulation and the sound output that is learned during speech acquisition. Studies of the speech of cochlear implant and bilateral acoustic neuroma patients provide evidence supporting two roles for auditory feedback in adults: maintenance of the internal model, and monitoring the acoustic environment to help assure intelligibility by guiding relatively rapid adjustments in “postural” parameters underlying average sound level, speaking rate and the amount of prosodically-based inflection of F0 and SPL.


Journal of the Acoustical Society of America | 1999

Articulatory tradeoffs reduce acoustic variability during American English /r/ production

Frank H. Guenther; Carol Y. Espy-Wilson; Suzanne Boyce; Melanie L. Matthies; Majid Zandipour; Joseph S. Perkell

The American English phoneme /r/ has long been associated with large amounts of articulatory variability during production. This paper investigates the hypothesis that the articulatory variations used by a speaker to produce /r/ in different contexts exhibit systematic tradeoffs, or articulatory trading relations, that act to maintain a relatively stable acoustic signal despite the large variations in vocal tract shape. Acoustic and articulatory recordings were collected from seven speakers producing /r/ in five phonetic contexts. For every speaker, the different articulator configurations used to produce /r/ in the different phonetic contexts showed systematic tradeoffs, as evidenced by significant correlations between the positions of transducers mounted on the tongue. Analysis of acoustic and articulatory variabilities revealed that these tradeoffs act to reduce acoustic variability, thus allowing relatively large contextual variations in vocal tract shape for /r/ without seriously degrading the primary acoustic cue. Furthermore, some subjects appeared to use completely different articulatory gestures to produce /r/ in different phonetic contexts. When viewed in light of current models of speech movement control, these results appear to favor models that utilize an acoustic or auditory target for each phoneme over models that utilize a vocal tract shape target for each phoneme.


Journal of the Acoustical Society of America | 1993

Trading relations between tongue-body raising and lip rounding in production of the vowel/u/ : a pilot motor equivalence study

Joseph S. Perkell; Melanie L. Matthies; Mario A. Svirsky; Michael I. Jordan

Articulatory and acoustic data were used to explore the following hypothesis for the vowel /u/: The objective of articulatory movements is an acoustic goal; varying and reciprocal contributions of different articulators may help to constrain acoustic variation in achieving the goal. Previous articulatory studies of similar hypotheses, expressed entirely in articulatory terms, have been confounded by interdependencies of the variables being studied (e.g., lip and mandible displacements). One case in which this problem may be minimized is that of lip rounding and tongue-body raising (formation of a velo-palatal constriction) for the vowel /u/. Lip rounding and tongue-body raising should have similar acoustic effects for /u/, mainly to lower F2. In multiple repetitions, reciprocal contributions of lip rounding and tongue-body raising could help limit F2 variability for /u/; thus this experiment looked for complementary covariation (negative correlations) in measures of these two parameters. An electro-magnetic midsagittal articulometer (EMMA) was used to track movements of midsagittal points on the tongue body, upper and lower lips, and mandible for large numbers of repetitions of utterances containing /u/. (Interpretation of the data was aided by results from area-function-to-formant modeling.) Three of four subjects showed weak negative correlations, tentatively supporting the hypothesis; a fourth showed the opposite pattern: positive correlations of lip rounding and tongue raising. The results are discussed with respect to ideas about motor equivalence, the nature of speech motor programming, and potential improvements to the paradigm.


Journal of the Acoustical Society of America | 1985

Variability in production of the vowels /i/ and /a/

Joseph S. Perkell; Winston L. Nelson

A hypothesis on the nature of articulatory targets for the vowels /i/ and /a/ is proposed, based on acoustic considerations and vowel articulations. The conjecture is that positioning of points on the tongue surface in a repetition experiment should be most accurate in the direction perpendicular to the vocal-tract midline, at the acoustically critical point of maximal constriction for each vowel. The hypothesis was tested by: examining x-ray microbeam data for three speakers, conducting a partial acoustical analysis, and performing a modeling study. Distributions were plotted of the midsagittal locations of three tongue points at the time of maximal excursion toward the vowel target for numbers of examples of the vowels, embedded in a variety of phonetic contexts. More variation was found along a direction parallel to the vocal tract midline than perpendicular to the midline, supporting the hypothesis. Statistics on formant values for one subject have been calculated, and pairwise regressions of displacement and formant data have been run. An articulatory synthesizer [Rubin et al., J. Acoust. Soc. Am. 70, 321-328 (1981)] has been manipulated through displacements similar to the subjects articulatory variation. Although articulatory synthesis showed systematic relationships between articulatory relationships and formant frequencies, there were no significant correlations between the subjects measured articulatory displacements and his formant data. These additional results raise questions about the methodology and point to the need for additional work for an adequate test of the hypothesis.


Journal of Neurolinguistics | 2012

Movement goals and feedback and feedforward control mechanisms in speech production.

Joseph S. Perkell

Studies of speech motor control are described that support a theoretical framework in which fundamental control variables for phonemic movements are multi-dimensional regions in auditory and somatosensory spaces. Auditory feedback is used to acquire and maintain auditory goals and in the development and function of feedback and feedforward control mechanisms. Several lines of evidence support the idea that speakers with more acute sensory discrimination acquire more distinct goal regions and therefore produce speech sounds with greater contrast. Feedback modification findings indicate that fluently produced sound sequences are encoded as feedforward commands, and feedback control serves to correct mismatches between expected and produced sensory consequences.


Journal of the Acoustical Society of America | 2003

Influences of tongue biomechanics on speech movements during the production of velar stop consonants: A modeling study

Pascal Perrier; Yohan Payan; Majid Zandipour; Joseph S. Perkell

This study explores the following hypothesis: forward looping movements of the tongue that are observed in VCV sequences are due partly to the anatomical arrangement of the tongue muscles, how they are used to produce a velar closure, and how the tongue interacts with the palate during consonantal closure. The study uses an anatomically based two-dimensional biomechanical tongue model. Tissue elastic properties are accounted for in finite-element modeling, and movement is controlled by constant-rate control parameter shifts. Tongue raising and lowering movements are produced by the model mainly with the combined actions of the genioglossus, styloglossus, and hyoglossus. Simulations of V1CV2 movements were made, where C is a velar consonant and V is [a], [i], or [u]. Both vowels and consonants are specified in terms of targets, but for the consonant the target is virtual, and cannot be reached because it is beyond the surface of the palate. If V1 is the vowel [a] or [u], the resulting trajectory describes a movement that begins to loop forward before consonant closure and continues to slide along the palate during the closure. This pattern is very stable when moderate changes are made to the specification of the target consonant location and agrees with data published in the literature. If V1 is the vowel [i], looping patterns are also observed, but their orientation was quite sensitive to small changes in the location of the consonant target. These findings also agree with patterns of variability observed in measurements from human speakers, but they contradict data published by Houde [Ph.D. dissertation (1967)]. These observations support the idea that the biomechanical properties of the tongue could be the main factor responsible for the forward loops when V1 is a back vowel, regardless of whether V2 is a back vowel or a front vowel. In the [i] context it seems that additional factors have to be taken into consideration in order to explain the observations made on some speakers.


Journal of Voice | 1989

Glottal airflow and transglottal air pressure measurements for male and female speakers in low, normal, and high pitch

Eva B. Holmberg; Robert E. Hillman; Joseph S. Perkell

Summary Measurements on the inverse filtered airflow waveform and of estimated average transglottal pressure and glottal airflow were made from syllable sequences in low, normal, and high pitch for 25 male and 20 female speakers. Correlation analyses indicated that several of the airflow measurements were more directly related to voice intensity than to fundamental frequency (F 0 ). Results suggested that pressure may have different influences in low and high pitch in this speech task. It is suggested that unexpected results of increased pressure in low pitch were related to maintaining voice quality, that is, avoiding vocal fry. In high pitch, the increased pressure may serve to maintain vocal fold vibration. The findings suggested different underlying laryngeal mechanisms and vocal adjustments for increasing and decreasing F 0 from normal pitch.


Journal of Voice | 2003

Aerodynamic and acoustic voice measurements of patients with vocal nodules: variation in baseline and changes across voice therapy.

Eva B. Holmberg; Patricia Doyle; Joseph S. Perkell; Britta Hammarberg; Robert E. Hillman

An important clinical issue concerns the efficacy of current voice therapy approaches in treating voice disorders, such as vocal nodules. Much research focuses on finding reliable methods for documentation of treatment results. In this second treatment study of ten patients with vocal nodules, who participated in a behaviorally based voice therapy program, 11 aerodynamic (transglottal air pressure and glottal waveform) and acoustic (spl, f0, and spectrum slope) measures were used. Three pretherapy baseline assessments were carried out, followed by one assessment after each of five therapy phases. Measurements were made of two types of speech materials: Strings of repeated /pae/ syllables and sustained /ae/ phonations in two loudness conditions: comfortable loudness and loud voice. The data were normalized using z-scores, which were based on data from 22 normal subjects. The results showed that the aerodynamic measures reflected the presence of vocal pathology to a higher degree than did the acoustic spectral measures, and they should be useful in studies comparing nodule and normal voice production. Large individual session-to-session variation was found for all measures across pretherapy baseline recordings, which contributed to nonsignificant differences between baseline and therapy data.


Journal of the Acoustical Society of America | 1992

Speech of cochlear implant patients: A longitudinal study of vowel production

Joseph S. Perkell; Harlan Lane; Mario A. Svirsky; Jane Webster

Acoustic parameters were measured for vowels spoken in /hVd/ context by four postlingually deafened recipients of multichannel (Ineraid) cochlear implants. Three of the subjects became totally deaf in adulthood after varying periods of partial hearing loss; the fourth became totally deaf at age four. The subjects received different degrees of perceptual benefit from the prosthesis. Recordings were made before, and at intervals following speech processor activation. The measured parameters included F1, F2, F0, SPL, duration, and amplitude difference between the first two harmonic peaks in the log magnitude spectrum (H 1-H2). Numerous changes in parameter values were observed from pre- to post-implant, with differences among subjects. Many changes, but not all, were in the direction of normative data, and most changes were consistent with hypotheses about relations among the parameters. Some of the changes tended to enhance phonemic contrasts; others had the opposite effect. For three subjects, H 1-H2 changed in a direction consistent with measurements of their average air flow when reading; that relation was more complex for the fourth subject. The results are interpreted with respect to: characteristics of the individual subjects, including vowel identification scores; mechanical interactions among glottal and supraglottal articulations; and hypotheses about the role of auditory feedback in the control of speech production. Almost all the observed differences could be attributed to changes in the average settings of speaking rate, F0 and SPL, which presumably can be perceived without the need for spectral place information. Some observed F2 realignment may be attributable to the reception of spectral cues.

Collaboration


Dive into the Joseph S. Perkell's collaboration.

Top Co-Authors

Avatar

Melanie L. Matthies

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Majid Zandipour

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Mark Tiede

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Harlan Lane

Northeastern University

View shared research outputs
Top Co-Authors

Avatar

Jennell Vick

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Mario A. Svirsky

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Harlan Lane

Northeastern University

View shared research outputs
Top Co-Authors

Avatar

Satrajit S. Ghosh

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Eva B. Holmberg

Massachusetts Eye and Ear Infirmary

View shared research outputs
Researchain Logo
Decentralizing Knowledge