Lee, Shao-Hsuan; Fang, Tuan-Jen; Yu, Jen-Fang; Lee, Guo-She
2017-09-01
Auditory feedback can make reflexive responses on sustained vocalizations. Among them, the middle-frequency power of F0 (MFP) may provide a sensitive index to access the subtle changes in different auditory feedback conditions. Phonatory airflow temperature was obtained from 20 healthy adults at two vocal intensity ranges under four auditory feedback conditions: (1) natural auditory feedback (NO); (2) binaural speech noise masking (SN); (3) bone-conducted feedback of self-generated voice (BAF); and (4) SN and BAF simultaneously. The modulations of F0 in low-frequency (0.2 Hz-3 Hz), middle-frequency (3 Hz-8 Hz), and high-frequency (8 Hz-25 Hz) bands were acquired using power spectral analysis of F0. Acoustic and aerodynamic analyses were used to acquire vocal intensity, maximum phonation time (MPT), phonatory airflow, and MFP-based vocal efficiency (MBVE). SN and high vocal intensity decreased MFP and raised MBVE and MPT significantly. BAF showed no effect on MFP but significantly lowered MBVE. Moreover, BAF significantly increased the perception of voice feedback and the sensation of vocal effort. Altered auditory feedback significantly changed the middle-frequency modulations of F0. MFP and MBVE could well detect these subtle responses of audio-vocal feedback. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Riede, Tobias; Tokuda, Isao T.; Farmer, C. G.
2011-01-01
SUMMARY Vocalization is rare among non-avian reptiles, with the exception of the crocodilians, the sister taxon of birds. Crocodilians have a complex vocal repertoire. Their vocal and respiratory system is not well understood but appears to consist of a combination of features that are also found in the extremely vocal avian and mammalian taxa. Anatomical studies suggest that the alligator larynx is able to abduct and adduct the vocal folds, but not to elongate or shorten them, and is therefore lacking a key regulator of frequency, yet alligators can modulate fundamental frequency remarkably well. We investigated the morphological and physiological features of sound production in alligators. Vocal fold length scales isometrically across a wide range of alligator body sizes. The relationship between fundamental frequency and subglottal pressure is significant in some individuals at some isolated points, such as call onset and position of maximum fundamental frequency. The relationship is not consistent over large segments of the call. Fundamental frequency can change faster than expected by pressure changes alone, suggesting an active motor pattern controls frequency and is intrinsic to the larynx. We utilized a two-mass vocal fold model to test whether abduction and adduction could generate this motor pattern. The fine-tuned interplay between subglottal pressure and glottal adduction can achieve frequency modulations much larger than those resulting from subglottal pressure variations alone and of similar magnitude, as observed in alligator calls. We conclude that the alligator larynx represents a sound source with only two control parameters (subglottal pressure and vocal fold adduction) in contrast to the mammalian larynx in which three parameters can be altered to modulate frequency (subglottal pressure, vocal fold adduction and length/tension). PMID:21865521
Lee, Shao-Hsuan; Hsiao, Tzu-Yu; Lee, Guo-She
2015-06-01
Sustained vocalizations of vowels [a], [i], and syllable [mə] were collected in twenty normal-hearing individuals. On vocalizations, five conditions of different audio-vocal feedback were introduced separately to the speakers including no masking, wearing supra-aural headphones only, speech-noise masking, high-pass noise masking, and broad-band-noise masking. Power spectral analysis of vocal fundamental frequency (F0) was used to evaluate the modulations of F0 and linear-predictive-coding was used to acquire first two formants. The results showed that while the formant frequencies were not significantly shifted, low-frequency modulations (<3 Hz) of F0 significantly increased with reduced audio-vocal feedback across speech sounds and were significantly correlated with auditory awareness of speakers' own voices. For sustained speech production, the motor speech controls on F0 may depend on a feedback mechanism while articulation should rely more on a feedforward mechanism. Power spectral analysis of F0 might be applied to evaluate audio-vocal control for various hearing and neurological disorders in the future. Copyright © 2015 Elsevier B.V. All rights reserved.
A Mechanism for Frequency Modulation in Songbirds Shared with Humans
Margoliash, Daniel
2013-01-01
In most animals that vocalize, control of fundamental frequency is a key element for effective communication. In humans, subglottal pressure controls vocal intensity but also influences fundamental frequency during phonation. Given the underlying similarities in the biomechanical mechanisms of vocalization in humans and songbirds, songbirds offer an attractive opportunity to study frequency modulation by pressure. Here, we present a novel technique for dynamic control of subsyringeal pressure in zebra finches. By regulating the opening of a custom-built fast valve connected to the air sac system, we achieved partial or total silencing of specific syllables, and could modify syllabic acoustics through more complex manipulations of air sac pressure. We also observed that more nuanced pressure variations over a limited interval during production of a syllable concomitantly affected the frequency of that syllable segment. These results can be explained in terms of a mathematical model for phonation that incorporates a nonlinear description for the vocal source capable of generating the observed frequency modulations induced by pressure variations. We conclude that the observed interaction between pressure and frequency was a feature of the source, not a result of feedback control. Our results indicate that, beyond regulating phonation or its absence, regulation of pressure is important for control of fundamental frequencies of vocalizations. Thus, although there are separate brainstem pathways for syringeal and respiratory control of song production, both can affect airflow and frequency. We hypothesize that the control of pressure and frequency is combined holistically at higher levels of the vocalization pathways. PMID:23825417
A mechanism for frequency modulation in songbirds shared with humans.
Amador, Ana; Margoliash, Daniel
2013-07-03
In most animals that vocalize, control of fundamental frequency is a key element for effective communication. In humans, subglottal pressure controls vocal intensity but also influences fundamental frequency during phonation. Given the underlying similarities in the biomechanical mechanisms of vocalization in humans and songbirds, songbirds offer an attractive opportunity to study frequency modulation by pressure. Here, we present a novel technique for dynamic control of subsyringeal pressure in zebra finches. By regulating the opening of a custom-built fast valve connected to the air sac system, we achieved partial or total silencing of specific syllables, and could modify syllabic acoustics through more complex manipulations of air sac pressure. We also observed that more nuanced pressure variations over a limited interval during production of a syllable concomitantly affected the frequency of that syllable segment. These results can be explained in terms of a mathematical model for phonation that incorporates a nonlinear description for the vocal source capable of generating the observed frequency modulations induced by pressure variations. We conclude that the observed interaction between pressure and frequency was a feature of the source, not a result of feedback control. Our results indicate that, beyond regulating phonation or its absence, regulation of pressure is important for control of fundamental frequencies of vocalizations. Thus, although there are separate brainstem pathways for syringeal and respiratory control of song production, both can affect airflow and frequency. We hypothesize that the control of pressure and frequency is combined holistically at higher levels of the vocalization pathways.
Strain Modulations as a Mechanism to Reduce Stress Relaxation in Laryngeal Tissues
Hunter, Eric J.; Siegmund, Thomas; Chan, Roger W.
2014-01-01
Vocal fold tissues in animal and human species undergo deformation processes at several types of loading rates: a slow strain involved in vocal fold posturing (on the order of 1 Hz or so), cyclic and faster posturing often found in speech tasks or vocal embellishment (1–10 Hz), and shear strain associated with vocal fold vibration during phonation (100 Hz and higher). Relevant to these deformation patterns are the viscous properties of laryngeal tissues, which exhibit non-linear stress relaxation and recovery. In the current study, a large strain time-dependent constitutive model of human vocal fold tissue is used to investigate effects of phonatory posturing cyclic strain in the range of 1 Hz to 10 Hz. Tissue data for two subjects are considered and used to contrast the potential effects of age. Results suggest that modulation frequency and extent (amplitude), as well as the amount of vocal fold overall strain, all affect the change in stress relaxation with modulation added. Generally, the vocal fold cover reduces the rate of relaxation while the opposite is true for the vocal ligament. Further, higher modulation frequencies appear to reduce the rate of relaxation, primarily affecting the ligament. The potential benefits of cyclic strain, often found in vibrato (around 5 Hz modulation) and intonational inflection, are discussed in terms of vocal effort and vocal pitch maintenance. Additionally, elderly tissue appears to not exhibit these benefits to modulation. The exacerbating effect such modulations may have on certain voice disorders, such as muscle tension dysphonia, are explored. PMID:24614616
Strain modulations as a mechanism to reduce stress relaxation in laryngeal tissues.
Hunter, Eric J; Siegmund, Thomas; Chan, Roger W
2014-01-01
Vocal fold tissues in animal and human species undergo deformation processes at several types of loading rates: a slow strain involved in vocal fold posturing (on the order of 1 Hz or so), cyclic and faster posturing often found in speech tasks or vocal embellishment (1-10 Hz), and shear strain associated with vocal fold vibration during phonation (100 Hz and higher). Relevant to these deformation patterns are the viscous properties of laryngeal tissues, which exhibit non-linear stress relaxation and recovery. In the current study, a large strain time-dependent constitutive model of human vocal fold tissue is used to investigate effects of phonatory posturing cyclic strain in the range of 1 Hz to 10 Hz. Tissue data for two subjects are considered and used to contrast the potential effects of age. Results suggest that modulation frequency and extent (amplitude), as well as the amount of vocal fold overall strain, all affect the change in stress relaxation with modulation added. Generally, the vocal fold cover reduces the rate of relaxation while the opposite is true for the vocal ligament. Further, higher modulation frequencies appear to reduce the rate of relaxation, primarily affecting the ligament. The potential benefits of cyclic strain, often found in vibrato (around 5 Hz modulation) and intonational inflection, are discussed in terms of vocal effort and vocal pitch maintenance. Additionally, elderly tissue appears to not exhibit these benefits to modulation. The exacerbating effect such modulations may have on certain voice disorders, such as muscle tension dysphonia, are explored.
ERIC Educational Resources Information Center
Lien, Yu-An S.; Michener, Carolyn M.; Eadie, Tanya L.; Stepp, Cara E.
2015-01-01
Purpose: The acoustic measure relative fundamental frequency (RFF) was investigated as a potential objective measure to track variations in vocal effort within and across individuals. Method: Twelve speakers with healthy voices created purposeful modulations in their vocal effort during speech tasks. RFF and an aerodynamic measure of vocal effort,…
Amplitude Modulations of Acoustic Communication Signals
NASA Astrophysics Data System (ADS)
Turesson, Hjalmar K.
2011-12-01
In human speech, amplitude modulations at 3 -- 8 Hz are important for discrimination and detection. Two different neurophysiological theories have been proposed to explain this effect. The first theory proposes that, as a consequence of neocortical synaptic dynamics, signals that are amplitude modulated at 3 -- 8 Hz are propagated better than un-modulated signals, or signals modulated above 8 Hz. This suggests that neural activity elicited by vocalizations modulated at 3 -- 8 Hz is optimally transmitted, and the vocalizations better discriminated and detected. The second theory proposes that 3 -- 8 Hz amplitude modulations interact with spontaneous neocortical oscillations. Specifically, vocalizations modulated at 3 -- 8 Hz entrain local populations of neurons, which in turn, modulate the amplitude of high frequency gamma oscillations. This suggests that vocalizations modulated at 3 -- 8 Hz should induce stronger cross-frequency coupling. Similar to human speech, we found that macaque monkey vocalizations also are amplitude modulated between 3 and 8 Hz. Humans and macaque monkeys share similarities in vocal production, implying that the auditory systems subserving perception of acoustic communication signals also share similarities. Based on the similarities between human speech and macaque monkey vocalizations, we addressed how amplitude modulated vocalizations are processed in the auditory cortex of macaque monkeys, and what behavioral relevance modulations may have. Recording single neuron activity, as well as, the activity of local populations of neurons allowed us to test both of the neurophysiological theories presented above. We found that single neuron responses to vocalizations amplitude modulated at 3 -- 8 Hz resulted in better stimulus discrimination than vocalizations lacking 3 -- 8 Hz modulations, and that the effect most likely was mediated by synaptic dynamics. In contrast, we failed to find support for the oscillation-based model proposing a coupling between 3 -- 8 Hz oscillations and gamma band amplitude. In a behavioral experiment, we found that 3 -- 8 amplitude modulations improved auditory detection in noise. In conclusion, our results suggest that, as in human speech, 3 -- 8 Hz amplitude modulations have a behaviorally important effect, and that this effect probably is mediated by synaptic dynamics.
Characterization of ultrasonic vocalizations of Fragile X mice.
Belagodu, Amogh P; Johnson, Aaron M; Galvez, Roberto
2016-09-01
Fragile X Syndrome (FXS) is the leading form of inherited intellectual disability. It is caused by the transcriptional silencing of FMR1, the gene which codes for the Fragile X Mental Retardation Protein (FMRP). Patients who have FXS exhibit numerous behavioral and cognitive impairments, such as attention-deficit/hyperactivity disorder, obsessive compulsive disorder, and autistic-like behaviors. In addition to these behavioral abnormalities, FXS patients have also been shown to exhibit various deficits in communication such as abnormal sentence structures, increased utterances, repetition of sounds and words, and reduced articulation. These deficits can dramatically hinder communication for FXS patients, exacerbating learning and cognition impairments while decreasing their quality of life. To examine the biological underpinnings of these communication abnormalities, studies have used a mouse model of the Fragile X Syndrome; however, these vocalization studies have resulted in inconsistent findings that often do not correlate with abnormalities observed in FXS patients. Interestingly, a detailed examination of frequency modulated vocalizations that are believed to be a better assessment of rodent communication has never been conducted. The following study used courtship separation to conduct a detailed examination of frequency modulated ultrasonic vocalizations (USV) in FXS mice. Our analyses of frequency modulated USVs demonstrated that adult FXS mice exhibited longer phrases and more motifs. Phrases are vocalizations consisting of multiple frequency modulated ultrasonic vocalizations, while motifs are repeated frequency modulated USV patterns. Fragile X mice had a higher proportion of "u" syllables in all USVs and phrases while their wildtype counterparts preferred isolated "h" syllables. Although the specific importance of these syllables towards communication deficits still needs to be evaluated, these findings in production of USVs are consistent with the repetitive and perseverative speech patterns observed in FXS patients. This study demonstrates that FXS mice can be used to study the underlying biological mechanism(s) mediating FXS vocalization abnormalities. Copyright © 2016 Elsevier B.V. All rights reserved.
Discriminating Simulated Vocal Tremor Source Using Amplitude Modulation Spectra
Carbonell, Kathy M.; Lester, Rosemary A.; Story, Brad H.; Lotto, Andrew J.
2014-01-01
Objectives/Hypothesis Sources of vocal tremor are difficult to categorize perceptually and acoustically. This paper describes a preliminary attempt to discriminate vocal tremor sources through the use of spectral measures of the amplitude envelope. The hypothesis is that different vocal tremor sources are associated with distinct patterns of acoustic amplitude modulations. Study Design Statistical categorization methods (discriminant function analysis) were used to discriminate signals from simulated vocal tremor with different sources using only acoustic measures derived from the amplitude envelopes. Methods Simulations of vocal tremor were created by modulating parameters of a vocal fold model corresponding to oscillations of respiratory driving pressure (respiratory tremor), degree of vocal fold adduction (adductory tremor) and fundamental frequency of vocal fold vibration (F0 tremor). The acoustic measures were based on spectral analyses of the amplitude envelope computed across the entire signal and within select frequency bands. Results The signals could be categorized (with accuracy well above chance) in terms of the simulated tremor source using only measures of the amplitude envelope spectrum even when multiple sources of tremor were included. Conclusions These results supply initial support for an amplitude-envelope based approach to identify the source of vocal tremor and provide further evidence for the rich information about talker characteristics present in the temporal structure of the amplitude envelope. PMID:25532813
Lester, Rosemary A.; Story, Brad H.
2015-01-01
The purpose of this study was to determine if adjustments to the voice source [i.e., fundamental frequency (F0), degree of vocal fold adduction] or vocal tract filter (i.e., vocal tract shape for vowels) reduce the perception of simulated laryngeal vocal tremor and to determine if listener perception could be explained by characteristics of the acoustical modulations. This research was carried out using a computational model of speech production that allowed for precise control and manipulation of the glottal and vocal tract configurations. Forty-two healthy adults participated in a perceptual study involving pair-comparisons of the magnitude of “shakiness” with simulated samples of laryngeal vocal tremor. Results revealed that listeners perceived a higher magnitude of voice modulation when simulated samples had a higher mean F0, greater degree of vocal fold adduction, and vocal tract shape for /i/ vs /ɑ/. However, the effect of F0 was significant only when glottal noise was not present in the acoustic signal. Acoustical analyses were performed with the simulated samples to determine the features that affected listeners' judgments. Based on regression analyses, listeners' judgments were predicted to some extent by modulation information present in both low and high frequency bands. PMID:26328711
Volitional exaggeration of body size through fundamental and formant frequency modulation in humans
Pisanski, Katarzyna; Mora, Emanuel C.; Pisanski, Annette; Reby, David; Sorokowski, Piotr; Frackowiak, Tomasz; Feinberg, David R.
2016-01-01
Several mammalian species scale their voice fundamental frequency (F0) and formant frequencies in competitive and mating contexts, reducing vocal tract and laryngeal allometry thereby exaggerating apparent body size. Although humans’ rare capacity to volitionally modulate these same frequencies is thought to subserve articulated speech, the potential function of voice frequency modulation in human nonverbal communication remains largely unexplored. Here, the voices of 167 men and women from Canada, Cuba, and Poland were recorded in a baseline condition and while volitionally imitating a physically small and large body size. Modulation of F0, formant spacing (∆F), and apparent vocal tract length (VTL) were measured using Praat. Our results indicate that men and women spontaneously and systemically increased VTL and decreased F0 to imitate a large body size, and reduced VTL and increased F0 to imitate small size. These voice modulations did not differ substantially across cultures, indicating potentially universal sound-size correspondences or anatomical and biomechanical constraints on voice modulation. In each culture, men generally modulated their voices (particularly formants) more than did women. This latter finding could help to explain sexual dimorphism in F0 and formants that is currently unaccounted for by sexual dimorphism in human vocal anatomy and body size. PMID:27687571
The influence of pitch and loudness changes on the acoustics of vocal tremor.
Dromey, Christopher; Warrick, Paul; Irish, Jonathan
2002-10-01
The effect of tremor on phonation is to modulate an otherwise steady sound source in its amplitude, fundamental frequency, or both. The severity of untreated vocal tremor has been reported to change under certain conditions that may be related to muscle tension. In order to better understand the phenomenon of vocal tremor, its acoustic properties were examined as individuals volitionally altered their pitch and loudness. These voice conditions were anticipated to alter the tension of the intrinsic laryngeal muscles. The voices of 10 individuals with a diagnosis of vocal tremor were recorded before participating in a longitudinal treatment study. They produced vowels at low and high pitch and loudness levels as well as in a comfortable voice condition. Acoustic analyses quantified the amplitude and frequency modulations of the speakers' voices across the various conditions. Individual speakers varied in the way the pitch and loudness changes affected their tremor, but the following statistically significant effects for the speakers as a group were observed: Higher pitch phonation was associated with a more rapid rate for both amplitude and frequency modulations. Amplitude modulation become faster for louder phonation. Low-pitched phonotion led to decreases in the extent of amplitude tremor. Varying pitch led to dramatic changes in the phase relationship between amplitude and frequency modulation in some of the speakers, whereas this effect was not apparent in other speakers.
Modulation of voice related to tremor and vibrato
NASA Astrophysics Data System (ADS)
Lester, Rosemary Anne
Modulation of voice is a result of physiologic oscillation within one or more components of the vocal system including the breathing apparatus (i.e., pressure supply), the larynx (i.e. sound source), and the vocal tract (i.e., sound filter). These oscillations may be caused by pathological tremor associated with neurological disorders like essential tremor or by volitional production of vibrato in singers. Because the acoustical characteristics of voice modulation specific to each component of the vocal system and the effect of these characteristics on perception are not well-understood, it is difficult to assess individuals with vocal tremor and to determine the most effective interventions for reducing the perceptual severity of the disorder. The purpose of the present studies was to determine how the acoustical characteristics associated with laryngeal-based vocal tremor affect the perception of the magnitude of voice modulation, and to determine if adjustments could be made to the voice source and vocal tract filter to alter the acoustic output and reduce the perception of modulation. This research was carried out using both a computational model of speech production and trained singers producing vibrato to simulate laryngeal-based vocal tremor with different voice source characteristics (i.e., vocal fold length and degree of vocal fold adduction) and different vocal tract filter characteristics (i.e., vowel shapes). It was expected that, by making adjustments to the voice source and vocal tract filter that reduce the amplitude of the higher harmonics, the perception of magnitude of voice modulation would be reduced. The results of this study revealed that listeners' perception of the magnitude of modulation of voice was affected by the degree of vocal fold adduction and the vocal tract shape with the computational model, but only by the vocal quality (corresponding to the degree of vocal fold adduction) with the female singer. Based on regression analyses, listeners' judgments were predicted by modulation information in both low and high frequency bands. The findings from these studies indicate that production of a breathy vocal quality might be a useful compensatory strategy for reducing the perceptual severity of modulation of voice for individuals with tremor affecting the larynx.
2011-01-01
Vocal production requires complex planning and coordination of respiratory, laryngeal, and vocal tract movements, which are incompletely understood in most mammals. Rats produce a variety of whistles in the ultrasonic range that are of communicative relevance and of importance as a model system, but the sources of acoustic variability were mostly unknown. The goal was to identify sources of fundamental frequency variability. Subglottal pressure, tracheal airflow, and electromyographic (EMG) data from two intrinsic laryngeal muscles were measured during 22-kHz and 50-kHz call production in awake, spontaneously behaving adult male rats. During ultrasound vocalization, subglottal pressure ranged between 0.8 and 1.9 kPa. Pressure differences between call types were not significant. The relation between fundamental frequency and subglottal pressure within call types was inconsistent. Experimental manipulations of subglottal pressure had only small effects on fundamental frequency. Tracheal airflow patterns were also inconsistently associated with frequency. Pressure and flow seem to play a small role in regulation of fundamental frequency. Muscle activity, however, is precisely regulated and very sensitive to alterations, presumably because of effects on resonance properties in the vocal tract. EMG activity of cricothyroid and thyroarytenoid muscle was tonic in calls with slow or no fundamental frequency modulations, like 22-kHz and flat 50-kHz calls. Both muscles showed brief high-amplitude, alternating bursts at rates up to 150 Hz during production of frequency-modulated 50-kHz calls. A differentiated and fine regulation of intrinsic laryngeal muscles is critical for normal ultrasound vocalization. Many features of the laryngeal muscle activation pattern during ultrasound vocalization in rats are shared with other mammals. PMID:21832032
The acoustic structure of male giant panda bleats varies according to intersexual context.
Charlton, Benjamin D; Keating, Jennifer L; Rengui, Li; Huang, Yan; Swaisgood, Ronald R
2015-09-01
Although the acoustic structure of mammal vocal signals often varies according to the social context of emission, relatively few mammal studies have examined acoustic variation during intersexual advertisement. In the current study male giant panda bleats were recorded during the breeding season in three behavioural contexts: vocalising alone, during vocal interactions with females outside of peak oestrus, and during vocal interactions with peak-oestrous females. Male bleats produced during vocal interactions with peak-oestrous females were longer in duration and had higher mean fundamental frequency than those produced when males were either involved in a vocal interaction with a female outside of peak oestrus or vocalising alone. In addition, males produced bleats with higher rates of fundamental frequency modulation when they were vocalising alone than when they were interacting with females. These results show that acoustic features of male giant panda bleats have the potential to signal the caller's motivational state, and suggest that males increase the rate of fundamental frequency modulation in bleats when they are alone to maximally broadcast their quality and promote close-range contact with receptive females during the breeding season.
Vocalizations produced by humpback whale (Megaptera novaeangliae) calves recorded in Hawaii.
Zoidis, Ann M; Smultea, Mari A; Frankel, Adam S; Hopkins, Julia L; Day, Andy; McFarland, A Sasha; Whitt, Amy D; Fertl, Dagmar
2008-03-01
Although humpback whale (Megaptera novaeangliae) calves are reported to vocalize, this has not been measurably verified. During March 2006, an underwater video camera and two-element hydrophone array were used to record nonsong vocalizations from a mother-calf escort off Hawaii. Acoustic data were analyzed; measured time delays between hydrophones provided bearings to 21 distinct vocalizations produced by the male calf. Signals were pulsed (71%), frequency modulated (19%), or amplitude modulated (10%). They were of simple structure, low frequency (mean=220 Hz), brief duration (mean=170 ms), and relatively narrow bandwidth (mean=2 kHz). The calf produced three series of "grunts" when approaching the diver. During winters of the years 2001-2005 in Hawaii, nonsong vocalizations were recorded in 109 (65%) of 169 groups with a calf using an underwater video and single (omnidirectional) hydrophone. Nonsong vocalizations were most common (34 of 39) in lone mother-calf pairs. A subsample from this dataset of 60 signals assessed to be vocalizations provided strong evidence that 10 male and 18 female calves vocalized based on statistical similarity to the 21 verified calf signals, proximity to an isolated calf (27 of 28 calves), strong signal-to-noise ratio, and/or bubble emissions coincident to sound.
Nonlinear dynamic mechanism of vocal tremor from voice analysis and model simulations
NASA Astrophysics Data System (ADS)
Zhang, Yu; Jiang, Jack J.
2008-09-01
Nonlinear dynamic analysis and model simulations are used to study the nonlinear dynamic characteristics of vocal folds with vocal tremor, which can typically be characterized by low-frequency modulation and aperiodicity. Tremor voices from patients with disorders such as paresis, Parkinson's disease, hyperfunction, and adductor spasmodic dysphonia show low-dimensional characteristics, differing from random noise. Correlation dimension analysis statistically distinguishes tremor voices from normal voices. Furthermore, a nonlinear tremor model is proposed to study the vibrations of the vocal folds with vocal tremor. Fractal dimensions and positive Lyapunov exponents demonstrate the evidence of chaos in the tremor model, where amplitude and frequency play important roles in governing vocal fold dynamics. Nonlinear dynamic voice analysis and vocal fold modeling may provide a useful set of tools for understanding the dynamic mechanism of vocal tremor in patients with laryngeal diseases.
Function and Evolution of Vibrato-like Frequency Modulation in Mammals.
Charlton, Benjamin D; Taylor, Anna M; Reby, David
2017-09-11
Why do distantly related mammals like sheep, giant pandas, and fur seals produce bleats that are characterized by vibrato-like fundamental frequency (F0) modulation? To answer this question, we used psychoacoustic tests and comparative analyses to investigate whether this distinctive vocal feature has evolved to improve the perception of formants, key acoustic components of animal calls that encode important information about the caller's size and identity [1]. Psychoacoustic tests on humans confirmed that vibrato-like F0 modulation improves the ability of listeners to detect differences in the formant patterns of synthetic bleat-like stimuli. Subsequent phylogenetically controlled comparative analyses revealed that vibrato-like F0 modulation has evolved independently in six mammalian orders in vocal signals with relatively high F0 and, therefore, low spectral density (i.e., less harmonic overtones). We also found that mammals modulate the vibrato in these calls over greater frequency extents when the number of harmonic overtones per formant is low, suggesting that this is a mechanism to improve formant perception in calls with low spectral density. Our findings constitute the first evidence that formant perception in non-speech sounds is improved by fundamental frequency modulation and provide a mechanism for the convergent evolution of bleat-like calls in mammals. They also indicate that selection pressures for animals to transmit important information encoded by formant frequencies (on size and identity, for example) are likely to have been a key driver in the evolution of mammal vocal diversity. Copyright © 2017 Elsevier Ltd. All rights reserved.
Ultrasonic Vocalizations Emitted by Flying Squirrels
Murrant, Meghan N.; Bowman, Jeff; Garroway, Colin J.; Prinzen, Brian; Mayberry, Heather; Faure, Paul A.
2013-01-01
Anecdotal reports of ultrasound use by flying squirrels have existed for decades, yet there has been little detailed analysis of their vocalizations. Here we demonstrate that two species of flying squirrel emit ultrasonic vocalizations. We recorded vocalizations from northern (Glaucomys sabrinus) and southern (G. volans) flying squirrels calling in both the laboratory and at a field site in central Ontario, Canada. We demonstrate that flying squirrels produce ultrasonic emissions through recorded bursts of broadband noise and time-frequency structured frequency modulated (FM) vocalizations, some of which were purely ultrasonic. Squirrels emitted three types of ultrasonic calls in laboratory recordings and one type in the field. The variety of signals that were recorded suggest that flying squirrels may use ultrasonic vocalizations to transfer information. Thus, vocalizations may be an important, although still poorly understood, aspect of flying squirrel social biology. PMID:24009728
Sapienza, C M; Crandell, C C; Curtis, B
1999-09-01
Voice problems are a frequent difficulty that teachers experience. Common complaints by teachers include vocal fatigue and hoarseness. One possible explanation for these symptoms is prolonged elevations in vocal loudness within the classroom. This investigation examined the effectiveness of sound-field frequency modulation (FM) amplification on reducing the sound pressure level (SPL) of the teacher's voice during classroom instruction. Specifically, SPL was examined during speech produced in a classroom lecture by 10 teachers with and without the use of sound-field amplification. Results indicated a significant 2.42-dB decrease in SPL with the use of sound-field FM amplification. These data support the use of sound-field amplification in the vocal hygiene regimen recommended to teachers by speech-language pathologists.
Schneiderová, Irena; Zouhar, Jan
2014-01-01
Shrews have rich vocal repertoires that include vocalizations within the human audible frequency range and ultrasonic vocalizations. Here, we recorded and analyzed in detail the acoustic structure of a vocalization with unclear functional significance that was spontaneously produced by 15 adult, captive Asian house shrews (Suncus murinus) while they were lying motionless and resting in their nests. This vocalization was usually emitted repeatedly in a long series with regular intervals. It showed some structural variability; however, the shrews most frequently emitted a tonal, low-frequency vocalization with minimal frequency modulation and a low, non-vocal click that was clearly noticeable at its beginning. There was no effect of sex, but the acoustic structure of the analyzed vocalizations differed significantly between individual shrews. The encoded individuality was low, but it cannot be excluded that this individuality would allow discrimination of family members, i.e., a male and female with their young, collectively resting in a common nest. The question remains whether the Asian house shrews indeed perceive the presence of their mates, parents or young resting in a common nest via the resting-associated vocalization and whether they use it to discriminate among their family members. Additional studies are needed to explain the possible functional significance of resting-associated vocalizations emitted by captive Asian house shrews. Our study highlights that the acoustic communication of shrews is a relatively understudied topic, particularly considering that they are highly vocal mammals. PMID:25390304
Sex-dependent modulation of ultrasonic vocalizations in house mice (Mus musculus musculus)
Reitschmidt, Doris; Noll, Anton; Balazs, Peter; Penn, Dustin J.
2017-01-01
House mice (Mus musculus) emit ultrasonic vocalizations (USVs), which are surprisingly complex and have features of bird song, but their functions are not well understood. Previous studies have reported mixed evidence on whether there are sex differences in USV emission, though vocalization rate or other features may depend upon whether potential receivers are of the same or opposite sex. We recorded the USVs of wild-derived adult house mice (F1 of wild-caught Mus musculus musculus), and we compared the vocalizations of males and females in response to a stimulus mouse of the same- or opposite-sex. To detect and quantify vocalizations, we used an algorithm that automatically detects USVs (Automatic Mouse Ultrasound Detector or A-MUD). We found high individual variation in USV emission rates (4 to 2083 elements/10 min trial) and a skewed distribution, with most mice (60%) emitting few (≤50) elements. We found no differences in the rates of calling between the sexes overall, but mice of both sexes emitted vocalizations at a higher rate and higher frequencies during opposite- compared to same-sex interactions. We also observed a trend toward higher amplitudes by males when presented with a male compared to a female stimulus. Our results suggest that mice modulate the rate and frequency of vocalizations depending upon the sex of potential receivers. PMID:29236704
Lien, Yu-An S; Michener, Carolyn M; Eadie, Tanya L; Stepp, Cara E
2015-06-01
The acoustic measure relative fundamental frequency (RFF) was investigated as a potential objective measure to track variations in vocal effort within and across individuals. Twelve speakers with healthy voices created purposeful modulations in their vocal effort during speech tasks. RFF and an aerodynamic measure of vocal effort, the ratio of sound pressure level to subglottal pressure level, were estimated from the aerodynamic and acoustic signals. Twelve listeners also judged the speech samples for vocal effort using the visual sort and rate method. Relationships between RFF and both the aerodynamic and perceptual measures of vocal effort were weak across speakers (R2 = .06-.26). Within speakers, relationships were variable but much stronger on average (R2 = .45-.56). RFF showed stronger relationships between both the aerodynamic and perceptual measures of vocal effort when examined within individuals versus across individuals. Future work is necessary to establish these relationships in individuals with voice disorders across the therapeutic process.
Michener, Carolyn M.; Eadie, Tanya L.; Stepp, Cara E.
2015-01-01
Purpose The acoustic measure relative fundamental frequency (RFF) was investigated as a potential objective measure to track variations in vocal effort within and across individuals. Method Twelve speakers with healthy voices created purposeful modulations in their vocal effort during speech tasks. RFF and an aerodynamic measure of vocal effort, the ratio of sound pressure level to subglottal pressure level, were estimated from the aerodynamic and acoustic signals. Twelve listeners also judged the speech samples for vocal effort using the visual sort and rate method. Results Relationships between RFF and both the aerodynamic and perceptual measures of vocal effort were weak across speakers (R2 = .06–.26). Within speakers, relationships were variable but much stronger on average (R2 = .45–.56). Conclusions RFF showed stronger relationships between both the aerodynamic and perceptual measures of vocal effort when examined within individuals versus across individuals. Future work is necessary to establish these relationships in individuals with voice disorders across the therapeutic process. PMID:25675090
Multifunctional and Context-Dependent Control of Vocal Acoustics by Individual Muscles
Srivastava, Kyle H.; Elemans, Coen P.H.
2015-01-01
The relationship between muscle activity and behavioral output determines how the brain controls and modifies complex skills. In vocal control, ensembles of muscles are used to precisely tune single acoustic parameters such as fundamental frequency and sound amplitude. If individual vocal muscles were dedicated to the control of single parameters, then the brain could control each parameter independently by modulating the appropriate muscle or muscles. Alternatively, if each muscle influenced multiple parameters, a more complex control strategy would be required to selectively modulate a single parameter. Additionally, it is unknown whether the function of single muscles is fixed or varies across different vocal gestures. A fixed relationship would allow the brain to use the same changes in muscle activation to, for example, increase the fundamental frequency of different vocal gestures, whereas a context-dependent scheme would require the brain to calculate different motor modifications in each case. We tested the hypothesis that single muscles control multiple acoustic parameters and that the function of single muscles varies across gestures using three complementary approaches. First, we recorded electromyographic data from vocal muscles in singing Bengalese finches. Second, we electrically perturbed the activity of single muscles during song. Third, we developed an ex vivo technique to analyze the biomechanical and acoustic consequences of single-muscle perturbations. We found that single muscles drive changes in multiple parameters and that the function of single muscles differs across vocal gestures, suggesting that the brain uses a complex, gesture-dependent control scheme to regulate vocal output. PMID:26490859
Scheerer, N E; Jacobson, D S; Jones, J A
2016-02-09
Auditory feedback plays an important role in the acquisition of fluent speech; however, this role may change once speech is acquired and individuals no longer experience persistent developmental changes to the brain and vocal tract. For this reason, we investigated whether the role of auditory feedback in sensorimotor learning differs across children and adult speakers. Participants produced vocalizations while they heard their vocal pitch predictably or unpredictably shifted downward one semitone. The participants' vocal pitches were measured at the beginning of each vocalization, before auditory feedback was available, to assess the extent to which the deviant auditory feedback modified subsequent speech motor commands. Sensorimotor learning was observed in both children and adults, with participants' initial vocal pitch increasing following trials where they were exposed to predictable, but not unpredictable, frequency-altered feedback. Participants' vocal pitch was also measured across each vocalization, to index the extent to which the deviant auditory feedback was used to modify ongoing vocalizations. While both children and adults were found to increase their vocal pitch following predictable and unpredictable changes to their auditory feedback, adults produced larger compensatory responses. The results of the current study demonstrate that both children and adults rapidly integrate information derived from their auditory feedback to modify subsequent speech motor commands. However, these results also demonstrate that children and adults differ in their ability to use auditory feedback to generate compensatory vocal responses during ongoing vocalization. Since vocal variability also differed across the children and adult groups, these results also suggest that compensatory vocal responses to frequency-altered feedback manipulations initiated at vocalization onset may be modulated by vocal variability. Copyright © 2015 IBRO. Published by Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Chan, Roger W.; Rodriguez, Maritza
2005-09-01
During voice production, the vocal folds undergo airflow-induced self-sustained oscillation at a fundamental frequency of around 100-1000 Hz, with an amplitude of around 1-3 mm. The vocal-fold extracellular matrix (ECM), with appropriate tissue viscoelastic properties, is optimally tuned for such vibration. Vocal-fold fibroblasts regulate the gene expressions for key ECM proteins (e.g., collagen, fibronectin, fibromodulin, and hyaluronic acid), and these expressions are affected by the stress fields experi- enced by the fibroblasts. This study attempts to develop a bioreactor for cultivating cells under a micromechanical environment similar to that in vivo, based on the principle of vibro-acoustography. Vocal-fold fibroblasts from primary culture were grown in 3D, biodegradable scaffolds, and were excited dynamically by the radiation force generated by amplitude modulation of two confocal ultrasound beams of slightly different frequencies. Low-frequency acoustic radiation force was applied to the scaffold surface, and its vibratory response was imaged by videostroboscopy. A phantom tissue (standard viscoelastic material) with known elastic modulus was also excited and its vibratory frequency and amplitude were measured by videostroboscopy. Results showed that the bioreactor was capable of delivering mechanical stimuli to the tissue constructs in a physiological frequency range (100-1000 Hz), supporting its potential for vocal-fold tissue engineering applications. [Work supported by NIH Grant R01 DC006101.
Conversational Entrainment of Vocal Fry in Young Adult Female American English Speakers.
Borrie, Stephanie A; Delfino, Christine R
2017-07-01
Conversational entrainment, the natural tendency for people to modify their behaviors to more closely match their communication partner, is examined as one possible mechanism modulating the prevalence of vocal fry in the speech of young American women engaged in spoken dialogue. Twenty young adult female American English speakers engaged in two spoken dialogue tasks-one with a young adult female American English conversational partner who exhibited substantial vocal fry and one with a young adult female American English conversational partner who exhibited quantifiably less vocal fry. Dialogues were analyzed for proportion of vocal fry, by speaker, and two measures of communicative success (efficiency and enjoyment). Participants employed significantly more vocal fry when conversing with the partner who exhibited substantial vocal fry than when conversing with the partner who exhibited quantifiably less vocal fry. Further, greater similarity between communication partners in their use of vocal fry tracked with higher scores of communicative efficiency and communicative enjoyment. Conversational entrainment offers a mechanistic framework that may be used to explain, to some degree, the frequency with which vocal fry is employed by young American women engaged in spoken dialogue. Further, young American women who modulated their vocal patterns during dialogue to match those of their conversational partner gained more efficiency and enjoyment from their interactions, demonstrating the cognitive and social benefits of entrainment. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Speech adjustments for room acoustics and their effects on vocal effort
Bottalico, Pasquale
2016-01-01
Objectives The aims of the present study are: (1) to analyze the effects of the acoustical environment and the voice style on time dose (Dt_p,) and fundamental frequency (mean fo and standard deviation std_fo), while taking into account the effect of short term vocal fatigue; (2) to predict the self-reported vocal effort from the voice acoustical parameters. Methods Ten male and ten female subjects were recorded while reading a text in normal and loud styles, in three rooms - anechoic, semi-reverberant and reverberant –with and without acrylic glass panels 0.5 m from the mouth, which increased external auditory feedback. Subjects quantified how much effort was required to speak in each condition on a visual analogue scale after each task. Results (Aim1) In the loud style, Dt_p, fo and std_fo increased. The Dt_p was higher in the reverberant room compared to the other two rooms. Both genders tended to increase fo in less reverberant environments, while a more monotonous speech was produced in rooms with greater reverberation. All three voice parameters increased with short-term vocal fatigue. (Aim2) A model of the vocal effort to acoustic vocal parameters is proposed. The SPL (Sound Pressure Level) contributed to 66% of the variance explained by the model, followed by the fundamental frequency (30%) and the modulation in amplitude (4%). Conclusions The results provide insight into how voice acoustical parameters can predict vocal effort. In particular, it increased when SPL and fo increased and when the amplitude voice modulation (std_ΔSPL) decreased. PMID:28029555
Distributed acoustic cues for caller identity in macaque vocalization.
Fukushima, Makoto; Doyle, Alex M; Mullarkey, Matthew P; Mishkin, Mortimer; Averbeck, Bruno B
2015-12-01
Individual primates can be identified by the sound of their voice. Macaques have demonstrated an ability to discern conspecific identity from a harmonically structured 'coo' call. Voice recognition presumably requires the integrated perception of multiple acoustic features. However, it is unclear how this is achieved, given considerable variability across utterances. Specifically, the extent to which information about caller identity is distributed across multiple features remains elusive. We examined these issues by recording and analysing a large sample of calls from eight macaques. Single acoustic features, including fundamental frequency, duration and Weiner entropy, were informative but unreliable for the statistical classification of caller identity. A combination of multiple features, however, allowed for highly accurate caller identification. A regularized classifier that learned to identify callers from the modulation power spectrum of calls found that specific regions of spectral-temporal modulation were informative for caller identification. These ranges are related to acoustic features such as the call's fundamental frequency and FM sweep direction. We further found that the low-frequency spectrotemporal modulation component contained an indexical cue of the caller body size. Thus, cues for caller identity are distributed across identifiable spectrotemporal components corresponding to laryngeal and supralaryngeal components of vocalizations, and the integration of those cues can enable highly reliable caller identification. Our results demonstrate a clear acoustic basis by which individual macaque vocalizations can be recognized.
Distributed acoustic cues for caller identity in macaque vocalization
Doyle, Alex M.; Mullarkey, Matthew P.; Mishkin, Mortimer; Averbeck, Bruno B.
2015-01-01
Individual primates can be identified by the sound of their voice. Macaques have demonstrated an ability to discern conspecific identity from a harmonically structured ‘coo’ call. Voice recognition presumably requires the integrated perception of multiple acoustic features. However, it is unclear how this is achieved, given considerable variability across utterances. Specifically, the extent to which information about caller identity is distributed across multiple features remains elusive. We examined these issues by recording and analysing a large sample of calls from eight macaques. Single acoustic features, including fundamental frequency, duration and Weiner entropy, were informative but unreliable for the statistical classification of caller identity. A combination of multiple features, however, allowed for highly accurate caller identification. A regularized classifier that learned to identify callers from the modulation power spectrum of calls found that specific regions of spectral–temporal modulation were informative for caller identification. These ranges are related to acoustic features such as the call’s fundamental frequency and FM sweep direction. We further found that the low-frequency spectrotemporal modulation component contained an indexical cue of the caller body size. Thus, cues for caller identity are distributed across identifiable spectrotemporal components corresponding to laryngeal and supralaryngeal components of vocalizations, and the integration of those cues can enable highly reliable caller identification. Our results demonstrate a clear acoustic basis by which individual macaque vocalizations can be recognized. PMID:27019727
Mencio, Caitlin; Kuberan, Balagurunathan; Goller, Franz
2017-02-01
Neural control of complex vocal behaviors, such as birdsong and speech, requires integration of biomechanical nonlinearities through muscular output. Although control of airflow and tension of vibrating tissues are known functions of vocal muscles, it remains unclear how specific muscle characteristics contribute to specific acoustic parameters. To address this gap, we removed heparan sulfate chains using heparitinases to perturb neuromuscular transmission subtly in the syrinx of adult male zebra finches (Taeniopygia guttata). Infusion of heparitinases into ventral syringeal muscles altered their excitation threshold and reduced neuromuscular transmission changing their ability to modulate airflow. The changes in muscle activation dynamics caused a reduction in frequency modulation rates and elimination of many high-frequency syllables but did not alter the fundamental frequency of syllables. Sound amplitude was reduced and sound onset pressure was increased, suggesting a role of muscles in the induction of self-sustained oscillations under low-airflow conditions, thus enhancing vocal efficiency. These changes were reversed to preinfusion levels by 7 days after infusion. These results illustrate complex interactions between the control of airflow and tension and further define the importance of syringeal muscle in the control of a variety of acoustic song characteristics. In summary, the findings reported here show that altering neuromuscular transmission can lead to reversible changes to the acoustic structure of song. Understanding the full extent of muscle involvement in song production is critical in decoding the motor program for the production of complex vocal behavior, including our search for parallels between birdsong and human speech motor control. It is largely unknown how fine motor control of acoustic parameters is achieved in vocal organs. Subtle manipulation of syringeal muscle function was used to test how active motor control influences acoustic parameters. Slowed activation kinetics of muscles reduced frequency modulation and, unexpectedly, caused a distinct decrease in sound amplitude and increase in phonation onset pressure. These results show that active control enhances the efficiency of energy conversion in the syrinx. Copyright © 2017 the American Physiological Society.
Neuromuscular control of fundamental frequency and glottal posture at phonation onset
Chhetri, Dinesh K.; Neubauer, Juergen; Berry, David A.
2012-01-01
The laryngeal neuromuscular mechanisms for modulating glottal posture and fundamental frequency are of interest in understanding normal laryngeal physiology and treating vocal pathology. The intrinsic laryngeal muscles in an in vivo canine model were electrically activated in a graded fashion to investigate their effects on onset frequency, phonation onset pressure, vocal fold strain, and glottal distance at the vocal processes. Muscle activation plots for these laryngeal parameters were evaluated for the interaction of following pairs of muscle activation conditions: (1) cricothyroid (CT) versus all laryngeal adductors (TA/LCA/IA), (2) CT versus LCA/IA, (3) CT versus thyroarytenoid (TA) and, (4) TA versus LCA/IA (LCA: lateral cricoarytenoid muscle, IA: interarytenoid). Increases in onset frequency and strain were primarily affected by CT activation. Onset pressure correlated with activation of all adductors in activation condition 1, but primarily with CT activation in conditions 2 and 3. TA and CT were antagonistic for strain. LCA/IA activation primarily closed the cartilaginous glottis while TA activation closed the mid-membranous glottis. PMID:22352513
Thompson, P O; Findley, L T; Vidal, O
1992-12-01
Low-frequency vocalizations were recorded from fin whales, Balaenoptera physalus, in the Gulf of California, Mexico, during three cruises. In March 1985, recorded 20-Hz pulses were in sequences of regular 9-s interpulse intervals. In August 1987, nearly all were in sequences of doublets with alternating 5- and 18-s interpulse intervals. No 20-Hz pulse sequences of any kind were detected in February 1987. The typical pulse modulated from 42 to 20 Hz and its median duration was 0.7 s (1985 data). Most other fin whale sounds were also short tonal pulses averaging 82, 56, and 68 Hz, respectively, for the three cruises; 89% were modulated in frequency, mostly downward. Compared to Atlantic and Pacific Ocean regions, Gulf of California 20-Hz pulses were unique in terms of frequency modulation, interpulse sound levels, and temporal patterns. Fin whales in the Gulf may represent a regional stock revealed by their sound characteristics, a phenomenon previously shown for humpback whales, birds, and fish. Regional differences in fin whale sounds were found in comparisons of Atlantic and Pacific locations.
Koda, Hiroki; Tokuda, Isao T; Wakita, Masumi; Ito, Tsuyoshi; Nishimura, Takeshi
2015-06-01
Whistle-like high-pitched "phee" calls are often used as long-distance vocal advertisements by small-bodied marmosets and tamarins in the dense forests of South America. While the source-filter theory proposes that vibration of the vocal fold is modified independently from the resonance of the supralaryngeal vocal tract (SVT) in human speech, a source-filter coupling that constrains the vibration frequency to SVT resonance effectively produces loud tonal sounds in some musical instruments. Here, a combined approach of acoustic analyses and simulation with helium-modulated voices was used to show that phee calls are produced principally with the same mechanism as in human speech. The animal keeps the fundamental frequency (f0) close to the first formant (F1) of the SVT, to amplify f0. Although f0 and F1 are primarily independent, the degree of their tuning can be strengthened further by a flexible source-filter interaction, the variable strength of which depends upon the cross-sectional area of the laryngeal cavity. The results highlight the evolutionary antiquity and universality of the source-filter model in primates, but the study can also explore the diversification of vocal physiology, including source-filter interaction and its anatomical basis in non-human primates.
Acoustic characteristics of simulated respiratory-induced vocal tremor.
Lester, Rosemary A; Story, Brad H
2013-05-01
The purpose of this study was to investigate the relation of respiratory forced oscillation to the acoustic characteristics of vocal tremor. Acoustical analyses were performed to determine the characteristics of the intensity and fundamental frequency (F0) for speech samples obtained by Farinella, Hixon, Hoit, Story, and Jones (2006) using a respiratory forced oscillation paradigm with 5 healthy adult males to simulate vocal tremor involving respiratory pressure modulation. The analyzed conditions were sustained productions of /a/ with amplitudes of applied pressure of 0, 1, 2, and 4 cmH2O and a rate of 5 Hz. Forced oscillation of the respiratory system produced modulation of the intensity and F0 for all participants. Variability was observed between participants and conditions in the change in intensity and F0 per unit of pressure change, as well as in the mean intensity and F0. However, the extent of modulation of intensity and F0 generally increased as the applied pressure increased, as would be expected. These findings suggest that individuals develop idiosyncratic adaptations to pressure modulations, which are important to understanding aspects of variability in vocal tremor, and highlight the need to assess all components of the speech mechanism that may be directly or indirectly affected by tremor.
Callback response of dugongs to conspecific chirp playbacks.
Ichikawa, Kotaro; Akamatsu, Tomonari; Shinke, Tomio; Adulyanukosol, Kanjana; Arai, Nobuaki
2011-06-01
Dugongs (Dugong dugon) produce bird-like calls such as chirps and trills. The vocal responses of dugongs to playbacks of several acoustic stimuli were investigated. Animals were exposed to four different playback stimuli: a recorded chirp from a wild dugong, a synthesized down-sweep sound, a synthesized constant-frequency sound, and silence. Wild dugongs vocalized more frequently after playback of broadcast chirps than that after constant-frequency sounds or silence. The down-sweep sound also elicited more vocal responses than did silence. No significant difference was found between the broadcast chirps and the down-sweep sound. The ratio of wild dugong chirps to all calls and the dominant frequencies of the wild dugong calls were significantly higher during playbacks of broadcast chirps, down-sweep sounds, and constant-frequency sounds than during those of silence. The source level and duration of dugong chirps increased significantly as signaling distance increased. No significant correlation was found between signaling distance and the source level of trills. These results show that dugongs vocalize to playbacks of frequency-modulated signals and suggest that the source level of dugong chirps may be manipulated to compensate for transmission loss between the source and receiver. This study provides the first behavioral observations revealing the function of dugong chirps. © 2011 Acoustical Society of America
Characteristics of phonation onset in a two-layer vocal fold model.
Zhang, Zhaoyan
2009-02-01
Characteristics of phonation onset were investigated in a two-layer body-cover continuum model of the vocal folds as a function of the biomechanical and geometric properties of the vocal folds. The analysis showed that an increase in either the body or cover stiffness generally increased the phonation threshold pressure and phonation onset frequency, although the effectiveness of varying body or cover stiffness as a pitch control mechanism varied depending on the body-cover stiffness ratio. Increasing body-cover stiffness ratio reduced the vibration amplitude of the body layer, and the vocal fold motion was gradually restricted to the medial surface, resulting in more effective flow modulation and higher sound production efficiency. The fluid-structure interaction induced synchronization of more than one group of eigenmodes so that two or more eigenmodes may be simultaneously destabilized toward phonation onset. At certain conditions, a slight change in vocal fold stiffness or geometry may cause phonation onset to occur as eigenmode synchronization due to a different pair of eigenmodes, leading to sudden changes in phonation onset frequency, vocal fold vibration pattern, and sound production efficiency. Although observed in a linear stability analysis, a similar mechanism may also play a role in register changes at finite-amplitude oscillations.
Difference between the vocalizations of two sister species of pigeons explained in dynamical terms.
Alonso, R Gogui; Kopuchian, Cecilia; Amador, Ana; Suarez, Maria de Los Angeles; Tubaro, Pablo L; Mindlin, Gabriel B
2016-05-01
Vocal communication is an unique example, where the nonlinear nature of the periphery can give rise to complex sounds even when driven by simple neural instructions. In this work we studied the case of two close-related bird species, Patagioenas maculosa and Patagioenas picazuro, whose vocalizations differ only in the timbre. The temporal modulation of the fundamental frequency is similar in both cases, differing only in the existence of sidebands around the fundamental frequency in the P. maculosa. We tested the hypothesis that the qualitative difference between these vocalizations lies in the nonlinear nature of the syrinx. In particular, we propose that the roughness of maculosa's vocalizations is due to an asymmetry between the right and left vibratory membranes, whose nonlinear dynamics generate the sound. To test the hypothesis, we generated a biomechanical model for vocal production with an asymmetric parameter Q with which we can control the level of asymmetry between these membranes. Using this model we generated synthetic vocalizations with the principal acoustic features of both species. In addition, we confirmed the anatomical predictions by making post mortem inspection of the syrinxes, showing that the species with tonal song (picazuro) has a more symmetrical pair of membranes compared to maculosa.
Difference between the vocalizations of two sister species of pigeons explained in dynamical terms
Alonso, R. Gogui; Kopuchian, Cecilia; Amador, Ana; de los Angeles Suarez, Maria; Tubaro, Pablo L.; Mindlin, Gabriel B.
2016-01-01
Vocal communication is a unique example where the nonlinear nature of the periphery can give rise to complex sounds even when driven by simple neural instructions. In this work we studied the case of two close-related bird species, Patagioenas maculosa and Patagioenas picazuro, whose vocalizations differ only in the timbre. The temporal modulation of the fundamental frequency is similar in both cases, differing only in the existence of sidebands around the fundamental frequency in the Patagioenas maculosa. We tested the hypothesis that the qualitative difference between these vocalizations lies in the nonlinear nature of the syrinx. In particular, we propose that the roughness of maculosa's vocalizations is due to an asymmetry between the right and left vibratory membranes, whose nonlinear dynamics generate the sound. To test the hypothesis, we generated a biomechanical model for vocal production with an asymmetric parameter Q with which we can control the level of asymmetry between these membranes. Using this model we generated synthetic vocalizations with the principal acoustic features of both species. In addition, we confirmed the anatomical predictions by making post-mortem inspection of the syrinxes, showing that the species with tonal song (picazuro) has a more symmetrical pair of membranes compared to maculosa. PMID:27033354
Tong, Zhixiang; Duncan, Randall L.
2013-01-01
We are interested in the in vitro engineering of artificial vocal fold tissues via the strategic combination of multipotent mesenchymal stem cells (MSCs), physiologically relevant mechanical stimulations, and biomimetic artificial matrices. We have constructed a vocal fold bioreactor that is capable of imposing vibratory stimulations on the cultured cells at human phonation frequencies. Separately, fibrous poly (ɛ-caprolactone) (PCL) scaffolds emulating the ligamentous structure of the vocal fold were prepared by electrospinning, were incorporated in the vocal fold bioreactor, and were driven into a wave-like motion in an axisymmetrical fashion by the oscillating air. MSC-laden PCL scaffolds were subjected to vibrations at 200 Hz with a normal center displacement of ∼40 μm for a total of 7 days. A continuous (CT) or a 1 h-on-1 h-off (OF) regime with a total dynamic culture time of 12 h per day was applied. The dynamic loading did not cause any physiological trauma to the cells. Immunohistotochemical staining revealed the reinforcement of the actin filament and the enhancement of α5β1 integrin expression under selected dynamic culture conditions. Cellular expression of essential vocal fold extracellular matrix components, such as elastin, hyaluronic acid, and matrix metalloproteinase-1, was significantly elevated as compared with the static controls, and the OF regime is more conducive to matrix production than the CT vibration mode. Analyses of genes of typical fibroblast hallmarks (tenascin-C, collagen III, and procollagen I) as well as markers for MSC differentiation into nonfibroblastic lineages confirmed MSCs' adaptation of fibroblastic behaviors. Overall, the high-frequency vibratory stimulation, when combined with a synthetic fibrous scaffold, serves as a potent modulator of MSC functions. The novel bioreactor system presented here, as a versatile, yet well-controlled model, offers an in vitro platform for understanding vibration-induced mechanotransduction and for engineering of functional vocal fold tissues. PMID:23516973
Mileva, Viktoria R.; Little, Anthony C.; Roberts, S. Craig
2017-01-01
Non-verbal behaviours, including voice characteristics during speech, are an important way to communicate social status. Research suggests that individuals can obtain high social status through dominance (using force and intimidation) or through prestige (by being knowledgeable and skilful). However, little is known regarding differences in the vocal behaviour of men and women in response to dominant and prestigious individuals. Here, we tested within-subject differences in vocal parameters of interviewees during simulated job interviews with dominant, prestigious, and neutral employers (targets), while responding to questions which were classified as introductory, personal, and interpersonal. We found that vocal modulations were apparent between responses to the neutral and high-status targets, with participants, especially those who perceived themselves as low in dominance, increasing fundamental frequency (F0) in response to the dominant and prestigious targets relative to the neutral target. Self-perceived prestige, however, was less related to contextual vocal modulations than self-perceived dominance. Finally, we found that differences in the context of the interview questions participants were asked to respond to (introductory, personal, interpersonal), also affected their vocal parameters, being more prominent in responses to personal and interpersonal questions. Overall, our results suggest that people adjust their vocal parameters according to the perceived social status of the listener as well as their own self-perceived social status. PMID:28614413
Temporal processing of speech in a time-feature space
NASA Astrophysics Data System (ADS)
Avendano, Carlos
1997-09-01
The performance of speech communication systems often degrades under realistic environmental conditions. Adverse environmental factors include additive noise sources, room reverberation, and transmission channel distortions. This work studies the processing of speech in the temporal-feature or modulation spectrum domain, aiming for alleviation of the effects of such disturbances. Speech reflects the geometry of the vocal organs, and the linguistically dominant component is in the shape of the vocal tract. At any given point in time, the shape of the vocal tract is reflected in the short-time spectral envelope of the speech signal. The rate of change of the vocal tract shape appears to be important for the identification of linguistic components. This rate of change, or the rate of change of the short-time spectral envelope can be described by the modulation spectrum, i.e. the spectrum of the time trajectories described by the short-time spectral envelope. For a wide range of frequency bands, the modulation spectrum of speech exhibits a maximum at about 4 Hz, the average syllabic rate. Disturbances often have modulation frequency components outside the speech range, and could in principle be attenuated without significantly affecting the range with relevant linguistic information. Early efforts for exploiting the modulation spectrum domain (temporal processing), such as the dynamic cepstrum or the RASTA processing, used ad hoc designed processing and appear to be suboptimal. As a major contribution, in this dissertation we aim for a systematic data-driven design of temporal processing. First we analytically derive and discuss some properties and merits of temporal processing for speech signals. We attempt to formalize the concept and provide a theoretical background which has been lacking in the field. In the experimental part we apply temporal processing to a number of problems including adaptive noise reduction in cellular telephone environments, reduction of reverberation for speech enhancement, and improvements on automatic recognition of speech degraded by linear distortions and reverberation.
Pinheiro, Ana P; Barros, Carla; Dias, Marcelo; Kotz, Sonja A
2017-12-01
In social interactions, emotionally salient and sudden changes in vocal expressions attract attention. However, only a few studies examined how emotion and attention interact in voice processing. We investigated neutral, happy (laughs) and angry (growls) vocalizations in a modified oddball task. Participants silently counted the targets in each block and rated the valence and arousal of the vocalizations. A combined event-related potential and time-frequency analysis focused on the P3 and pre-stimulus alpha power to capture attention effects in response to unexpected events. Whereas an early differentiation between emotionally salient and neutral vocalizations was reflected in the P3a response, the P3b was selectively enhanced for happy voices. The P3b modulation was predicted by pre-stimulus frontal alpha desynchronization, and by the perceived pleasantness of the targets. These findings indicate that vocal emotions may be differently processed based on task relevance and valence. Increased anticipation and attention to positive vocal cues (laughter) may reflect their high social relevance. Copyright © 2017 Elsevier B.V. All rights reserved.
Characterizing the graded structure of false killer whale (Pseudorca crassidens) vocalizations.
Murray, S O; Mercado, E; Roitblat, H L
1998-09-01
The vocalizations from two, captive false killer whales (Pseudorca crassidens) were analyzed. The structure of the vocalizations was best modeled as lying along a continuum with trains of discrete, exponentially damped sinusoidal pulses at one end and continuous sinusoidal signals at the other end. Pulse trains were graded as a function of the interval between pulses where the minimum interval between pulses could be zero milliseconds. The transition from a pulse train with no inter-pulse interval to a whistle could be modeled by gradations in the degree of damping. There were many examples of vocalizations that were gradually modulated from pulse trains to whistles. There were also vocalizations that showed rapid shifts in signal type--for example, switching immediately from a whistle to a pulse train. These data have implications when considering both the possible function(s) of the vocalizations and the potential sound production mechanism(s). A short-time duty cycle measure was developed to characterize the graded structure of the vocalizations. A random sample of 500 vocalizations was characterized by combining the duty cycle measure with peak frequency measurements. The analysis method proved to be an effective metric for describing the graded structure of false killer whale vocalizations.
A blind climber: The first evidence of ultrasonic echolocation in arboreal mammals.
Panyutina, Aleksandra A; Kuznetsov, Alexander N; Volodin, Ilya A; Abramov, Alexei V; Soldatova, Irina B
2017-03-01
The means of orientation is studied in the Vietnamese pygmy dormouse Typhlomys chapensis, a poorly known enigmatic semi-fossorial semi-arboreal rodent. Data on eye structure are presented, which prove that Typhlomys (translated as "the blind mouse") is incapable of object vision: the retina is folded and retains no more than 2500 ganglion cells in the focal plane, and the optic nerve is subject to gliosis. Hence, Typhlomys has no other means for rapid long-range orientation among tree branches other than echolocation. Ultrasonic vocalization recordings at the frequency range of 50-100 kHz support this hypothesis. The vocalizations are represented by bouts of up to 7 more or less evenly-spaced and uniform frequency-modulated sweep-like pulses in rapid succession. Structurally, these sweeps are similar to frequency-modulated ultrasonic echolocation calls of some bat species, but they are too faint to be revealed with a common bat detector. When recording video simultaneously with the ultrasonic audio, a significantly greater pulse rate during locomotion compared to that of resting animals has been demonstrated. Our findings of locomotion-associated ultrasonic vocalization in a fast-climbing but weakly-sighted small mammal ecotype add support to the "echolocation-first theory" of pre-flight origin of echolocation in bats. © 2016 International Society of Zoological Sciences, Institute of Zoology/Chinese Academy of Sciences and John Wiley & Sons Australia, Ltd.
Vasconcelos, Raquel O.; Fonseca, Paulo J.; Amorim, M. Clara P.; Ladich, Friedrich
2011-01-01
Many fishes rely on their auditory skills to interpret crucial information about predators and prey, and to communicate intraspecifically. Few studies, however, have examined how complex natural sounds are perceived in fishes. We investigated the representation of conspecific mating and agonistic calls in the auditory system of the Lusitanian toadfish Halobatrachus didactylus, and analysed auditory responses to heterospecific signals from ecologically relevant species: a sympatric vocal fish (meagre Argyrosomus regius) and a potential predator (dolphin Tursiops truncatus). Using auditory evoked potential (AEP) recordings, we showed that both sexes can resolve fine features of conspecific calls. The toadfish auditory system was most sensitive to frequencies well represented in the conspecific vocalizations (namely the mating boatwhistle), and revealed a fine representation of duration and pulsed structure of agonistic and mating calls. Stimuli and corresponding AEP amplitudes were highly correlated, indicating an accurate encoding of amplitude modulation. Moreover, Lusitanian toadfish were able to detect T. truncatus foraging sounds and A. regius calls, although at higher amplitudes. We provide strong evidence that the auditory system of a vocal fish, lacking accessory hearing structures, is capable of resolving fine features of complex vocalizations that are probably important for intraspecific communication and other relevant stimuli from the auditory scene. PMID:20861044
Predicting Achievable Fundamental Frequency Ranges in Vocalization Across Species
Titze, Ingo; Riede, Tobias; Mau, Ted
2016-01-01
Vocal folds are used as sound sources in various species, but it is unknown how vocal fold morphologies are optimized for different acoustic objectives. Here we identify two main variables affecting range of vocal fold vibration frequency, namely vocal fold elongation and tissue fiber stress. A simple vibrating string model is used to predict fundamental frequency ranges across species of different vocal fold sizes. While average fundamental frequency is predominantly determined by vocal fold length (larynx size), range of fundamental frequency is facilitated by (1) laryngeal muscles that control elongation and by (2) nonlinearity in tissue fiber tension. One adaptation that would increase fundamental frequency range is greater freedom in joint rotation or gliding of two cartilages (thyroid and cricoid), so that vocal fold length change is maximized. Alternatively, tissue layers can develop to bear a disproportionate fiber tension (i.e., a ligament with high density collagen fibers), increasing the fundamental frequency range and thereby vocal versatility. The range of fundamental frequency across species is thus not simply one-dimensional, but can be conceptualized as the dependent variable in a multi-dimensional morphospace. In humans, this could allow for variations that could be clinically important for voice therapy and vocal fold repair. Alternative solutions could also have importance in vocal training for singing and other highly-skilled vocalizations. PMID:27309543
Visualizing sound emission of elephant vocalizations: evidence for two rumble production types.
Stoeger, Angela S; Heilmann, Gunnar; Zeppelzauer, Matthias; Ganswindt, André; Hensman, Sean; Charlton, Benjamin D
2012-01-01
Recent comparative data reveal that formant frequencies are cues to body size in animals, due to a close relationship between formant frequency spacing, vocal tract length and overall body size. Accordingly, intriguing morphological adaptations to elongate the vocal tract in order to lower formants occur in several species, with the size exaggeration hypothesis being proposed to justify most of these observations. While the elephant trunk is strongly implicated to account for the low formants of elephant rumbles, it is unknown whether elephants emit these vocalizations exclusively through the trunk, or whether the mouth is also involved in rumble production. In this study we used a sound visualization method (an acoustic camera) to record rumbles of five captive African elephants during spatial separation and subsequent bonding situations. Our results showed that the female elephants in our analysis produced two distinct types of rumble vocalizations based on vocal path differences: a nasally- and an orally-emitted rumble. Interestingly, nasal rumbles predominated during contact calling, whereas oral rumbles were mainly produced in bonding situations. In addition, nasal and oral rumbles varied considerably in their acoustic structure. In particular, the values of the first two formants reflected the estimated lengths of the vocal paths, corresponding to a vocal tract length of around 2 meters for nasal, and around 0.7 meters for oral rumbles. These results suggest that African elephants may be switching vocal paths to actively vary vocal tract length (with considerable variation in formants) according to context, and call for further research investigating the function of formant modulation in elephant vocalizations. Furthermore, by confirming the use of the elephant trunk in long distance rumble production, our findings provide an explanation for the extremely low formants in these calls, and may also indicate that formant lowering functions to increase call propagation distances in this species'.
Visualizing Sound Emission of Elephant Vocalizations: Evidence for Two Rumble Production Types
Stoeger, Angela S.; Heilmann, Gunnar; Zeppelzauer, Matthias; Ganswindt, André; Hensman, Sean; Charlton, Benjamin D.
2012-01-01
Recent comparative data reveal that formant frequencies are cues to body size in animals, due to a close relationship between formant frequency spacing, vocal tract length and overall body size. Accordingly, intriguing morphological adaptations to elongate the vocal tract in order to lower formants occur in several species, with the size exaggeration hypothesis being proposed to justify most of these observations. While the elephant trunk is strongly implicated to account for the low formants of elephant rumbles, it is unknown whether elephants emit these vocalizations exclusively through the trunk, or whether the mouth is also involved in rumble production. In this study we used a sound visualization method (an acoustic camera) to record rumbles of five captive African elephants during spatial separation and subsequent bonding situations. Our results showed that the female elephants in our analysis produced two distinct types of rumble vocalizations based on vocal path differences: a nasally- and an orally-emitted rumble. Interestingly, nasal rumbles predominated during contact calling, whereas oral rumbles were mainly produced in bonding situations. In addition, nasal and oral rumbles varied considerably in their acoustic structure. In particular, the values of the first two formants reflected the estimated lengths of the vocal paths, corresponding to a vocal tract length of around 2 meters for nasal, and around 0.7 meters for oral rumbles. These results suggest that African elephants may be switching vocal paths to actively vary vocal tract length (with considerable variation in formants) according to context, and call for further research investigating the function of formant modulation in elephant vocalizations. Furthermore, by confirming the use of the elephant trunk in long distance rumble production, our findings provide an explanation for the extremely low formants in these calls, and may also indicate that formant lowering functions to increase call propagation distances in this species'. PMID:23155427
Freddie Mercury-acoustic analysis of speaking fundamental frequency, vibrato, and subharmonics.
Herbst, Christian T; Hertegard, Stellan; Zangger-Borch, Daniel; Lindestad, Per-Åke
2017-04-01
Freddie Mercury was one of the twentieth century's best-known singers of commercial contemporary music. This study presents an acoustical analysis of his voice production and singing style, based on perceptual and quantitative analysis of publicly available sound recordings. Analysis of six interviews revealed a median speaking fundamental frequency of 117.3 Hz, which is typically found for a baritone voice. Analysis of voice tracks isolated from full band recordings suggested that the singing voice range was 37 semitones within the pitch range of F#2 (about 92.2 Hz) to G5 (about 784 Hz). Evidence for higher phonations up to a fundamental frequency of 1,347 Hz was not deemed reliable. Analysis of 240 sustained notes from 21 a-cappella recordings revealed a surprisingly high mean fundamental frequency modulation rate (vibrato) of 7.0 Hz, reaching the range of vocal tremor. Quantitative analysis utilizing a newly introduced parameter to assess the regularity of vocal vibrato corroborated its perceptually irregular nature, suggesting that vibrato (ir)regularity is a distinctive feature of the singing voice. Imitation of subharmonic phonation samples by a professional rock singer, documented by endoscopic high-speed video at 4,132 frames per second, revealed a 3:1 frequency locked vibratory pattern of vocal folds and ventricular folds.
Paradoxical vocal changes in a trained singer by focally cooling the right superior temporal gyrus
Katlowitz, Kalman A.; Oya, Hiroyuki; Howard, Matthew A.; Greenlee, Jeremy D.W.; Long, Michael A.
2017-01-01
The production and perception of music is preferentially mediated by cortical areas within the right hemisphere, but little is known about how these brain regions individually contribute to this process. In an experienced singer undergoing awake craniotomy, we demonstrated that direct electrical stimulation to a portion of the right posterior superior temporal gyrus (pSTG) selectively interrupted singing but not speaking. We then focally cooled this region to modulate its activity during vocalization. In contrast to similar manipulations in left hemisphere speech production regions, pSTG cooling did not elicit any changes in vocal timing or quality. However, this manipulation led to an increase in the pitch of speaking with no such change in singing. Further analysis revealed that all vocalizations exhibited a cooling-induced increase in the frequency of the first formant, raising the possibility that potential pitch offsets may have been actively avoided during singing. Our results suggest that the right pSTG plays a key role in vocal sensorimotor processing whose impact is dependent on the type of vocalization produced. PMID:28282570
Paradoxical vocal changes in a trained singer by focally cooling the right superior temporal gyrus.
Katlowitz, Kalman A; Oya, Hiroyuki; Howard, Matthew A; Greenlee, Jeremy D W; Long, Michael A
2017-04-01
The production and perception of music is preferentially mediated by cortical areas within the right hemisphere, but little is known about how these brain regions individually contribute to this process. In an experienced singer undergoing awake craniotomy, we demonstrated that direct electrical stimulation to a portion of the right posterior superior temporal gyrus (pSTG) selectively interrupted singing but not speaking. We then focally cooled this region to modulate its activity during vocalization. In contrast to similar manipulations in left hemisphere speech production regions, pSTG cooling did not elicit any changes in vocal timing or quality. However, this manipulation led to an increase in the pitch of speaking with no such change in singing. Further analysis revealed that all vocalizations exhibited a cooling-induced increase in the frequency of the first formant, raising the possibility that potential pitch offsets may have been actively avoided during singing. Our results suggest that the right pSTG plays a key role in vocal sensorimotor processing whose impact is dependent on the type of vocalization produced. Copyright © 2017 Elsevier Ltd. All rights reserved.
Vocal Tremor Analysis with the Vocal Demodulator.
ERIC Educational Resources Information Center
Winholtz, William S.; Ramig, Lorraine Olson
1992-01-01
This paper describes the Vocal Demodulator as a new device for analysis of vocal tremor. The Vocal Demodulator produces amplitude-demodulated and frequency-demodulated outputs and measures the frequency and level of low-frequency tremor components in sustained phonation. The paper describes quantification of the demodulation process, validation…
Unsteady flow motions in the supraglottal region during phonation
NASA Astrophysics Data System (ADS)
Luo, Haoxiang; Dai, Hu
2008-11-01
The highly unsteady flow motions in the larynx are not only responsible for producing the fundamental frequency tone in phonation, but also have a significant contribution to the broadband noise in the human voice. In this work, the laryngeal flow is modeled either as an incompressible pulsatile jet confined in a two-dimensional channel, or a pressure-driven flow modulated by a pair of viscoelastic vocal folds through the flow--structure interaction. The flow in the supraglottal region is found to be dominated by large-scale vortices whose unsteady motions significantly deflect the glottal jet. In the flow--structure interaction, a hybrid model based on the immersed-boundary method is developed to simulate the flow-induced vocal fold vibration, which involves a three-dimensional vocal fold prototype and a two-dimensional viscous flow. Both the flow behavior and the vibratory characteristics of the vocal folds will be presented.
Liu, Hanjun; Wang, Emily Q.; Chen, Zhaocong; Liu, Peng; Larson, Charles R.; Huang, Dongfeng
2010-01-01
The purpose of this cross-language study was to examine whether the online control of voice fundamental frequency (F0) during vowel phonation is influenced by language experience. Native speakers of Cantonese and Mandarin, both tonal languages spoken in China, participated in the experiments. Subjects were asked to vocalize a vowel sound ∕u∕ at their comfortable habitual F0, during which their voice pitch was unexpectedly shifted (±50, ±100, ±200, or ±500 cents, 200 ms duration) and fed back instantaneously to them over headphones. The results showed that Cantonese speakers produced significantly smaller responses than Mandarin speakers when the stimulus magnitude varied from 200 to 500 cents. Further, response magnitudes decreased along with the increase in stimulus magnitude in Cantonese speakers, which was not observed in Mandarin speakers. These findings suggest that online control of voice F0 during vocalization is sensitive to language experience. Further, systematic modulations of vocal responses across stimulus magnitude were observed in Cantonese speakers but not in Mandarin speakers, which indicates that this highly automatic feedback mechanism is sensitive to the specific tonal system of each language. PMID:21218905
Low-frequency vocalizations in the Florida manatee (Trichechus manatus latirostris)
NASA Astrophysics Data System (ADS)
Frisch, Katherine; Frisch, Stefan
2003-10-01
Vocalizations produced by Florida manatees (Trichechus manatus latirostris) have been characterized as being of relatively high frequency, with fundamental tones ranging from 2500-5000 Hz. These sounds have been variously described as squeaks, squeals, and chirps. Vocalizations below 500 Hz have not been previously reported. Two captive-born Florida manatees were recorded at Mote Marine Laboratory in Sarasota, Florida. The analysis of these vocalizations provides evidence of a new category of low-frequency sounds produced by manatees. These sounds are often heard in conjunction with higher-frequency vocalizations. The low-frequency vocalizations are relatively brief and of low amplitude. These vocalizations are perceived as a series of impulses rather than a low-frequency periodic tone. Knowledge of these low-frequency vocalizations could be useful to those developing future management strategies. Interest has recently increased in the development of acoustic detection and deterrence devices to reduce the number of manatee watercraft interactions. The design of appropriate devices must take into account the apparent ability of manatees to perceive and produce sounds of both high and low frequency. It is also important to consider the possibility that acoustic deterrence devices may disrupt the potentially communicative frequencies of manatee vocalizations.
Recurrence plot analysis of nonstationary data: the understanding of curved patterns.
Facchini, A; Kantz, H; Tiezzi, E
2005-08-01
Recurrence plots of the calls of the Nomascus concolor (Western black crested gibbon) and Hylobates lar (White-handed gibbon) show characteristic circular, curved, and hyperbolic patterns superimposed to the main temporal scale of the signal. It is shown that these patterns are related to particular nonstationarities in the signal. Some of them can be reproduced by artificial signals like frequency modulated sinusoids and sinusoids with time divergent frequency. These modulations are too faint to be resolved by conventional time-frequency analysis with similar precision. Therefore, recurrence plots act as a magnifying glass for the detection of multiple temporal scales in slightly modulated signals. The detected phenomena in these acoustic signals can be explained in the biomechanical context by taking in account the role of the muscles controlling the vocal folds.
Is laughter a better vocal change detector than a growl?
Pinheiro, Ana P; Barros, Carla; Vasconcelos, Margarida; Obermeier, Christian; Kotz, Sonja A
2017-07-01
The capacity to predict what should happen next and to minimize any discrepancy between an expected and an actual sensory input (prediction error) is a central aspect of perception. Particularly in vocal communication, the effective prediction of an auditory input that informs the listener about the emotionality of a speaker is critical. What is currently unknown is how the perceived valence of an emotional vocalization affects the capacity to predict and detect a change in the auditory input. This question was probed in a combined event-related potential (ERP) and time-frequency analysis approach. Specifically, we examined the brain response to standards (Repetition Positivity) and to deviants (Mismatch Negativity - MMN), as well as the anticipatory response to the vocal sounds (pre-stimulus beta oscillatory power). Short neutral, happy (laughter), and angry (growls) vocalizations were presented both as standard and deviant stimuli in a passive oddball listening task while participants watched a silent movie and were instructed to ignore the vocalizations. MMN amplitude was increased for happy compared to neutral and angry vocalizations. The Repetition Positivity was enhanced for happy standard vocalizations. Induced pre-stimulus upper beta power was increased for happy vocalizations, and predicted the modulation of the standard Repetition Positivity. These findings indicate enhanced sensory prediction for positive vocalizations such as laughter. Together, the results suggest that positive vocalizations are more effective predictors in social communication than angry and neutral ones, possibly due to their high social significance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Nocturnal "humming" vocalizations: adding a piece to the puzzle of giraffe vocal communication.
Baotic, Anton; Sicks, Florian; Stoeger, Angela S
2015-09-09
Recent research reveals that giraffes (Giraffa camelopardalis sp.) exhibit a socially structured, fission-fusion system. In other species possessing this kind of society, information exchange is important and vocal communication is usually well developed. But is this true for giraffes? Giraffes are known to produce sounds, but there is no evidence that they use vocalizations for communication. Reports on giraffe vocalizations are mainly anecdotal and the missing acoustic descriptions make it difficult to establish a call nomenclature. Despite inconclusive evidence to date, it is widely assumed that giraffes produce infrasonic vocalizations similar to elephants. In order to initiate a more detailed investigation of the vocal communication in giraffes, we collected data of captive individuals during day and night. We particularly focussed on detecting tonal, infrasonic or sustained vocalizations. We collected over 947 h of audio material in three European zoos and quantified the spectral and temporal components of acoustic signals to obtain an accurate set of acoustic parameters. Besides the known burst, snorts and grunts, we detected harmonic, sustained and frequency-modulated "humming" vocalizations during night recordings. None of the recorded vocalizations were within the infrasonic range. These results show that giraffes do produce vocalizations, which, based on their acoustic structure, might have the potential to function as communicative signals to convey information about the physical and motivational attributes of the caller. The data further reveal that the assumption of infrasonic communication in giraffes needs to be considered with caution and requires further investigations in future studies.
Speech Adjustments for Room Acoustics and Their Effects on Vocal Effort.
Bottalico, Pasquale
2017-05-01
The aims of the present study are (1) to analyze the effects of the acoustical environment and the voice style on time dose (D t_p ) and fundamental frequency (mean f 0 and standard deviation std_f 0 ) while taking into account the effect of short-term vocal fatigue and (2) to predict the self-reported vocal effort from the voice acoustical parameters. Ten male and ten female subjects were recorded while reading a text in normal and loud styles, in three rooms-anechoic, semi-reverberant, and reverberant-with and without acrylic glass panels 0.5 m from the mouth, which increased external auditory feedback. Subjects quantified how much effort was required to speak in each condition on a visual analogue scale after each task. (Aim1) In the loud style, D t_p , f 0 , and std_f 0 increased. The D t_p was higher in the reverberant room compared to the other two rooms. Both genders tended to increase f 0 in less reverberant environments, whereas a more monotonous speech was produced in rooms with greater reverberation. All three voice parameters increased with short-term vocal fatigue. (Aim2) A model of the vocal effort to acoustic vocal parameters is proposed. The sound pressure level contributed to 66% of the variance explained by the model, followed by the f 0 (30%) and the modulation in amplitude (4%). The results provide insight into how voice acoustical parameters can predict vocal effort. In particular, it increased when SPL and f 0 increased and when the amplitude voice modulation decreased. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Tervo, Outi M; Parks, Susan E; Miller, Lee A
2009-09-01
Singing behavior has been described from bowhead whales in the Bering Sea during their annual spring migration and from Davis Strait during their spring feeding season. It has been suggested that this spring singing behavior is a remnant of the singing during the winter breeding season, though no winter recordings are available. In this study, the authors describe recordings made during the winter and spring months of bowhead whales in Disko Bay, Western-Greenland. A total of 7091 bowhead whale sounds were analyzed to describe the vocal repertoire, the singing behavior, and the changes in vocal behavior from February to May. The vocal signals could be divided into simple (frequency-modulated) calls (n=483), complex (amplitude-modulated) calls (n=635), and song notes (n=5973). Recordings from the end of February to middle of March were characterized by higher call rates with a greater diversity of call types than recordings made later in the season. This study is the first description of bowhead song from the stock in Western-Greenland during both the winter and spring months, and provides support for the hypothesis that song during the winter months contains more song notes than song from the spring making the winter song more variable.
Phonetogram changes for trained singers over a nine-month period of vocal training.
LeBorgne, Wendy DeLeo; Weinrich, Barbara D
2002-03-01
Professional vocalists encounter demands requiring voluntary control of phonation, while utilizing a considerable range of frequency and intensity. These quantifiable acoustic events can be measured and represented in a phonetogram. Previous research has compared the phonetograms of trained and untrained voices and found significant differences between these groups. This study was designed to assess the effects of vocal training for singers over a period of nine months. Phonetogram contour changes were examined, with the primary focus on expansion of frequency range and/or intensity control. Twenty-one first-year, master's level, vocal music students, who were engaged in an intensive vocal performance curriculum, participated in this study. Following nine months of vocal training, significant differences were revealed in the subjects' mean frequency range and minimum vocal intensity across frequency levels. There was no significant difference for the mean maximum vocal intensity across frequency levels following vocal training.
The acoustic correlates of valence depend on emotion family.
Belyk, Michel; Brown, Steven
2014-07-01
The voice expresses a wide range of emotions through modulations of acoustic parameters such as frequency and amplitude. Although the acoustics of individual emotions are well understood, attempts to describe the acoustic correlates of broad emotional categories such as valence have yielded mixed results. In the present study, we analyzed the acoustics of emotional valence for different families of emotion. We divided emotional vocalizations into "motivational," "moral," and "aesthetic" families as defined by the OCC (Ortony, Clore, and Collins) model of emotion. Subjects viewed emotional scenarios and were cued to vocalize congruent exclamations in response to them, for example, "Yay!" and "Damn!". Positive valence was weakly associated with high-pitched and loud vocalizations. However, valence interacted with emotion family for both pitch and amplitude. A general acoustic code for valence does not hold across families of emotion, whereas family-specific codes provide a more accurate description of vocal emotions. These findings are consolidated into a set of "rules of expression" relating vocal dimensions to emotion dimensions. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Andriolo, Artur; Reis, Sarah S; Amorim, Thiago O S; Sucunza, Federico; de Castro, Franciele R; Maia, Ygor Geyer; Zerbini, Alexandre N; Bortolotto, Guilherme A; Dalla Rosa, Luciano
2015-09-01
Acoustic parameters of killer whale (Orcinus orca) whistles were described for the western South Atlantic Ocean and highlight the occurrence of high frequency whistles. Killer whale signals were recorded on December of 2012, when a pod of four individuals was observed harassing a group of sperm whales. The high frequency whistles were highly stereotyped and were modulated mostly at ultrasonic frequencies. Compared to other contour types, the high frequency whistles are characterized by higher bandwidths, shorter durations, fewer harmonics, and higher sweep rates. The results add to the knowledge of vocal behavior of this species.
Precise Motor Control Enables Rapid Flexibility in Vocal Behavior of Marmoset Monkeys.
Pomberger, Thomas; Risueno-Segovia, Cristina; Löschner, Julia; Hage, Steffen R
2018-03-05
Investigating the evolution of human speech is difficult and controversial because human speech surpasses nonhuman primate vocal communication in scope and flexibility [1-3]. Monkey vocalizations have been assumed to be largely innate, highly affective, and stereotyped for over 50 years [4, 5]. Recently, this perception has dramatically changed. Current studies have revealed distinct learning mechanisms during vocal development [6-8] and vocal flexibility, allowing monkeys to cognitively control when [9, 10], where [11], and what to vocalize [10, 12, 13]. However, specific call features (e.g., duration, frequency) remain surprisingly robust and stable in adult monkeys, resulting in rather stereotyped and discrete call patterns [14]. Additionally, monkeys seem to be unable to modulate their acoustic call structure under reinforced conditions beyond natural constraints [15, 16]. Behavioral experiments have shown that monkeys can stop sequences of calls immediately after acoustic perturbation but cannot interrupt ongoing vocalizations, suggesting that calls consist of single impartible pulses [17, 18]. Using acoustic perturbation triggered by the vocal behavior itself and quantitative measures of resulting vocal adjustments, we show that marmoset monkeys are capable of producing calls with durations beyond the natural boundaries of their repertoire by interrupting ongoing vocalizations rapidly after perturbation onset. Our results indicate that marmosets are capable of interrupting vocalizations only at periodic time points throughout calls, further supported by the occurrence of periodically segmented phees. These ideas overturn decades-old concepts on primate vocal pattern generation, indicating that vocalizations do not consist of one discrete call pattern but are built of many sequentially uttered units, like human speech. Copyright © 2018 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Volodin, Ilya A; Matrosova, Vera A; Frey, Roland; Kozhevnikova, Julia D; Isaeva, Inna L; Volodina, Elena V
2018-06-11
Non-hibernating pikas collect winter food reserves and store them in hay piles. Individualization of alarm calls might allow discrimination between colony members and conspecifics trying to steal food items from a colony pile. We investigated vocal posture, vocal tract length, and individual acoustic variation of alarm calls, emitted by wild-living Altai pikas Ochotona alpina toward a researcher. Recording started when a pika started calling and lasted as long as possible. The alarm call series of 442 individual callers from different colonies consisted of discrete short (0.073-0.157 s), high-frequency (7.31-15.46 kHz), and frequency-modulated calls separated by irregular intervals. Analysis of 442 discrete calls, the second of each series, revealed that 44.34% calls lacked nonlinear phenomena, in 7.02% nonlinear phenomena covered less than half of call duration, and in 48.64% nonlinear phenomena covered more than half of call duration. Peak frequencies varied among individuals but always fitted one of three maxima corresponding to the vocal tract resonance frequencies (formants) calculated for an estimated 45-mm oral vocal tract. Discriminant analysis using variables of 8 calls per series of 36 different callers, each from a different colony, correctly assigned over 90% of the calls to individuals. Consequently, Altai pika alarm calls are individualistic and nonlinear phenomena might further increase this acoustic individualization. Additionally, video analysis revealed a call-synchronous, very fast (0.13-0.23 s) folding, depression, and subsequent re-expansion of the pinna confirming an earlier report of this behavior that apparently contributes to protecting the hearing apparatus from damage by the self-generated high-intensity alarm calls.
Scheerer, Nichole E; Jones, Jeffery A
2014-12-01
Speech production requires the combined effort of a feedback control system driven by sensory feedback, and a feedforward control system driven by internal models. However, the factors that dictate the relative weighting of these feedback and feedforward control systems are unclear. In this event-related potential (ERP) study, participants produced vocalisations while being exposed to blocks of frequency-altered feedback (FAF) perturbations that were either predictable in magnitude (consistently either 50 or 100 cents) or unpredictable in magnitude (50- and 100-cent perturbations varying randomly within each vocalisation). Vocal and P1-N1-P2 ERP responses revealed decreases in the magnitude and trial-to-trial variability of vocal responses, smaller N1 amplitudes, and shorter vocal, P1 and N1 response latencies following predictable FAF perturbation magnitudes. In addition, vocal response magnitudes correlated with N1 amplitudes, vocal response latencies, and P2 latencies. This pattern of results suggests that after repeated exposure to predictable FAF perturbations, the contribution of the feedforward control system increases. Examination of the presentation order of the FAF perturbations revealed smaller compensatory responses, smaller P1 and P2 amplitudes, and shorter N1 latencies when the block of predictable 100-cent perturbations occurred prior to the block of predictable 50-cent perturbations. These results suggest that exposure to large perturbations modulates responses to subsequent perturbations of equal or smaller size. Similarly, exposure to a 100-cent perturbation prior to a 50-cent perturbation within a vocalisation decreased the magnitude of vocal and N1 responses, but increased P1 and P2 latencies. Thus, exposure to a single perturbation can affect responses to subsequent perturbations. © 2014 Federation of European Neuroscience Societies and John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Self-Sullivan, Caryn; Gilbertson, Tamra; Evans, William E.
2002-05-01
On January 13, 1999, manatee vocalizations were recorded during a mating herd event in the Orange River, Florida. Although copulation could not be observed, multiple males were observed with exposed penises. During one 25 min sample (1300-1325 h), over 400 manatee signals were recorded. In March 2000, each signal was captured and digitized from the analog tape using a Marantz PMD 501, Ashly equalizer (gain=0, filter=0), MAC 8100, and Canary 1.2.1. In general, signals were 100-200 ms in length, highly harmonic (up to 8 harmonics ranging from 1 to 16 kHz), with little or no frequency modulation. Intervals between signals ranged from less than 1 s to 14 s (mean = 3 s), indicating that manatees do indeed talk (a lot) during sex. Noise from two passing boats was also recorded during the sample period. One abnormally low-frequency signal (0.4 kHz) was recorded during one boat pass. This apparent manatee vocalization could be seen and heard below the boat noise frequency band.
Self-masking: Listening during vocalization. Normal hearing.
Borg, Erik; Bergkvist, Christina; Gustafsson, Dan
2009-06-01
What underlying mechanisms are involved in the ability to talk and listen simultaneously and what role does self-masking play under conditions of hearing impairment? The purpose of the present series of studies is to describe a technique for assessment of masked thresholds during vocalization, to describe normative data for males and females, and to focus on hearing impairment. The masking effect of vocalized [a:] on narrow-band noise pulses (250-8000 Hz) was studied using the maximum vocalization method. An amplitude-modulated series of sound pulses, which sounded like a steam engine, was masked until the criterion of halving the perceived pulse rate was reached. For masking of continuous reading, a just-follow-conversation criterion was applied. Intra-session test-retest reproducibility and inter-session variability were calculated. The results showed that female voices were more efficient in masking high frequency noise bursts than male voices and more efficient in masking both a male and a female test reading. The male had to vocalize 4 dBA louder than the female to produce the same masking effect on the test reading. It is concluded that the method is relatively simple to apply and has small intra-session and fair inter-session variability. Interesting gender differences were observed.
Borch, D Zangger; Sundberg, J; Lindestad, P A; Thalén, M
2004-01-01
The acoustic characteristics of so-called 'dist' tones, commonly used in singing rock music, are analyzed in a case study. In an initial experiment a professional rock singer produced examples of 'dist' tones. The tones were found to contain aperiodicity, SPL at 0.3 m varied between 90 and 96 dB, and subglottal pressure varied in the range of 20-43 cm H2O, a doubling yielding, on average, an SPL increase of 2.3 dB. In a second experiment, the associated vocal fold vibration patterns were recorded by digital high-speed imaging of the same singer. Inverse filtering of the simultaneously recorded audio signal showed that the aperiodicity was caused by a low frequency modulation of the flow glottogram pulse amplitude. This modulation was produced by an aperiodic or periodic vibration of the supraglottic mucosa. This vibration reduced the pulse amplitude by obstructing the airway for some of the pulses produced by the apparently periodically vibrating vocal folds. The supraglottic mucosa vibration can be assumed to be driven by the high airflow produced by the elevated subglottal pressure.
Vocal warm-up increases phonation threshold pressure in soprano singers at high pitch.
Motel, Tamara; Fisher, Kimberly V; Leydon, Ciara
2003-06-01
Vocal warm-up is thought to optimize singing performance. We compared effects of short-term, submaximal, vocal warm-up exercise with those of vocal rest on the soprano voice (n = 10, ages 19-21 years). Dependent variables were the minimum subglottic air pressure required for vocal fold oscillation to occur (phonation threshold pressure, Pth), and the maximum and minimum phonation fundamental frequency. Warm-up increased Pth for high pitch phonation (p = 0.033), but not for comfortable (p = 0.297) or low (p = 0.087) pitch phonation. No significant difference in the maximum phonation frequency (p = 0.193) or minimum frequency (p = 0.222) was observed. An elevated Pth at controlled high pitch, but an unchanging maximum and minimum frequency production suggests that short-term vocal exercise may increase the viscosity of the vocal fold and thus serve to stabilize the high voice.
Mechanisms underlying the social enhancement of vocal learning in songbirds.
Chen, Yining; Matheson, Laura E; Sakata, Jon T
2016-06-14
Social processes profoundly influence speech and language acquisition. Despite the importance of social influences, little is known about how social interactions modulate vocal learning. Like humans, songbirds learn their vocalizations during development, and they provide an excellent opportunity to reveal mechanisms of social influences on vocal learning. Using yoked experimental designs, we demonstrate that social interactions with adult tutors for as little as 1 d significantly enhanced vocal learning. Social influences on attention to song seemed central to the social enhancement of learning because socially tutored birds were more attentive to the tutor's songs than passively tutored birds, and because variation in attentiveness and in the social modulation of attention significantly predicted variation in vocal learning. Attention to song was influenced by both the nature and amount of tutor song: Pupils paid more attention to songs that tutors directed at them and to tutors that produced fewer songs. Tutors altered their song structure when directing songs at pupils in a manner that resembled how humans alter their vocalizations when speaking to infants, that was distinct from how tutors changed their songs when singing to females, and that could influence attention and learning. Furthermore, social interactions that rapidly enhanced learning increased the activity of noradrenergic and dopaminergic midbrain neurons. These data highlight striking parallels between humans and songbirds in the social modulation of vocal learning and suggest that social influences on attention and midbrain circuitry could represent shared mechanisms underlying the social modulation of vocal learning.
Mechanisms underlying the social enhancement of vocal learning in songbirds
Chen, Yining; Matheson, Laura E.; Sakata, Jon T.
2016-01-01
Social processes profoundly influence speech and language acquisition. Despite the importance of social influences, little is known about how social interactions modulate vocal learning. Like humans, songbirds learn their vocalizations during development, and they provide an excellent opportunity to reveal mechanisms of social influences on vocal learning. Using yoked experimental designs, we demonstrate that social interactions with adult tutors for as little as 1 d significantly enhanced vocal learning. Social influences on attention to song seemed central to the social enhancement of learning because socially tutored birds were more attentive to the tutor’s songs than passively tutored birds, and because variation in attentiveness and in the social modulation of attention significantly predicted variation in vocal learning. Attention to song was influenced by both the nature and amount of tutor song: Pupils paid more attention to songs that tutors directed at them and to tutors that produced fewer songs. Tutors altered their song structure when directing songs at pupils in a manner that resembled how humans alter their vocalizations when speaking to infants, that was distinct from how tutors changed their songs when singing to females, and that could influence attention and learning. Furthermore, social interactions that rapidly enhanced learning increased the activity of noradrenergic and dopaminergic midbrain neurons. These data highlight striking parallels between humans and songbirds in the social modulation of vocal learning and suggest that social influences on attention and midbrain circuitry could represent shared mechanisms underlying the social modulation of vocal learning. PMID:27247385
Klemuk, Sarah A; Lu, Xiaoying; Hoffman, Henry T; Titze, Ingo R
2010-05-01
Viscoelastic properties of numerous vocal fold injectables have been reported but not at speaking frequencies. For materials intended for Reinke's space, ramifications of property values are of great concern because of their impact on ease of voice onset. Our objectives were: 1) to measure viscoelastic properties of a new nonresorbing carbomer and well-known vocal fold injectables at vocalization frequencies using established and new instrumentation, and 2) to predict phonation threshold pressures using a computer model with intended placement in Reinke's space. Rheology and phonation threshold pressure calculations. Injectables were evaluated with a traditional rotational rheometer and a new piezo-rotary vibrator. Using these data at vocalization frequencies, phonation threshold pressures (PTP) were calculated for each biomaterial, assuming a low dimensional model with supraglottic coupling and adjusted vocal fold length and thickness at each frequency. Results were normalized to a nominal PTP value. Viscoelastic data were acquired at vocalization frequencies as high as 363 to 1,400 Hz for six new carbomer hydrogels, Hylan B, and Extracel intended for vocal fold Reinke's space injection and for Cymetra (lateral injection). Reliability was confirmed with good data overlap when measuring with either rheometer. PTP predictions ranged from 0.001 to 16 times the nominal PTP value of 0.283 kPa. Accurate viscoelastic measurements of vocal fold injectables are now possible at physiologic frequencies. Hylan B, Extracel, and the new carbomer hydrogels should generate easy vocal onset and sustainable vocalization based on their rheologic properties if injected into Reinke's space. Applications may vary depending on desired longevity of implant. Laryngoscope, 2010.
Rheometric properties of canine vocal fold tissues: Variation with anatomic location
Kimura, Miwako; Mau, Ted; Chan, Roger W.
2010-01-01
Objective To evaluate the in vitro rheometric properties of the canine vocal fold lamina propria and muscle at phonatory frequencies, and their changes with anatomic location. Methods Six canine larynges were harvested immediately postmortem. Viscoelastic shear properties of anterior, middle, and posterior portions of the vocal fold cover (lamina propria) as well as those of the medial thyroarytenoid (TA) muscle (vocalis muscle) were quantified by a linear, controlled-strain simple-shear rheometer. Measurements of elastic shear modulus (G’) and dynamic viscosity (η’) of the specimens were conducted with small-amplitude sinusoidal shear deformation over a frequency range of 1 Hz to 250 Hz. Results All specimens showed similar frequency dependence of the viscoelastic functions, with G’ gradually increasing with frequency and η’ decreasing with frequency monotonically. G’ and η’ of the canine vocalis muscle were significantly higher than those of the canine vocal fold cover, and η’ of the canine vocal fold cover was significantly higher than that of the human vocal fold cover. There were no significant differences in G’ and in η’ between different portions of the canine vocal fold cover. Conclusion These preliminary data based on the canine model suggested that the vocalis muscle, while in a relaxed state in vitro, is significantly stiffer and more viscous than the vocal fold cover during vibration at phonatory frequencies. For large-amplitude vocal fold vibration involving the medial portion of the TA muscle, such distinct differences in viscoelastic properties of different layers of the vocal fold should be taken into account in multi-layered biomechanical models of phonation. PMID:21035291
NASA Astrophysics Data System (ADS)
Soltis, Joseph M.; Savage, Anne; Leong, Kirsten M.
2004-05-01
The most commonly occurring elephant vocalization is the rumble, a frequency-modulated call with infrasonic components. Upwards of ten distinct rumble subtypes have been proposed, but little quantitative work on the acoustic properties of rumbles has been conducted. Rumble vocalizations (N=269) from six females housed at Disney's Animal Kingdom were analyzed. Vocalizations were recorded from microphones in collars around subject necks, and rumbles were digitized and measured using SIGNAL software. Sixteen acoustic variables were measured for each call, extracting both source and filter features. Multidimensional scaling analysis indicates that there are no acoustically distinct rumble subtypes, but that there is quantitative variation across rumbles. Discriminant function analysis showed that the acoustic characteristics of rumbles differ across females. A classification success rate of 65% was achieved when assigning unselected rumbles to one of the six females (test set =64 calls) according to the functions derived from the originally selected calls (training set =205 calls). The rumble is best viewed as a single call type with graded variation, but information regarding individual identity is encoded in female rumbles.
Central pattern generators for social vocalization: Androgen-dependent neurophysiological mechanisms
Bass, Andrew H.; Remage-Healey, Luke
2008-01-01
Historically, most studies of vertebrate central pattern generators (CPGs) have focused on mechanisms for locomotion and respiration. Here, we highlight new results for ectothermic vertebrates, namely teleost fish and amphibians, showing how androgenic steroids can influence the temporal patterning of CPGs for social vocalization. Investigations of vocalizing teleosts show how androgens can rapidly (within minutes) modulate the neurophysiological output of the vocal CPG (fictive vocalizations that mimic the temporal properties of natural vocalizations) inclusive of their divergent actions between species, as well as intraspecific differences between male reproductive morphs. Studies of anuran amphibians (frogs) demonstrate that long-term steroid treatments (wks) can masculinize the fictive vocalizations of females, inclusive of its sensitivity to rapid modulation by serotonin. Given the conserved organization of vocal control systems across vertebrate groups, the vocal CPGs of fish and amphibians provide tractable models for identifying androgen-dependent events that are fundamental to the mechanisms of vocal motor patterning. These basic mechanisms can also inform our understanding of the more complex CPGs for vocalization, and social behaviors in general, that have evolved among birds and mammals. PMID:18262186
Neural Correlates of Vocal Production and Motor Control in Human Heschl's Gyrus
Oya, Hiroyuki; Nourski, Kirill V.; Kawasaki, Hiroto; Larson, Charles R.; Brugge, John F.; Howard, Matthew A.; Greenlee, Jeremy D.W.
2016-01-01
The present study investigated how pitch frequency, a perceptually relevant aspect of periodicity in natural human vocalizations, is encoded in Heschl's gyrus (HG), and how this information may be used to influence vocal pitch motor control. We recorded local field potentials from multicontact depth electrodes implanted in HG of 14 neurosurgical epilepsy patients as they vocalized vowel sounds and received brief (200 ms) pitch perturbations at 100 Cents in their auditory feedback. Event-related band power responses to vocalizations showed sustained frequency following responses that tracked voice fundamental frequency (F0) and were significantly enhanced in posteromedial HG during speaking compared with when subjects listened to the playback of their own voice. In addition to frequency following responses, a transient response component within the high gamma frequency band (75–150 Hz) was identified. When this response followed the onset of vocalization, the magnitude of the response was the same for the speaking and playback conditions. In contrast, when this response followed a pitch shift, its magnitude was significantly enhanced during speaking compared with playback. We also observed that, in anterolateral HG, the power of high gamma responses to pitch shifts correlated with the magnitude of compensatory vocal responses. These findings demonstrate a functional parcellation of HG with neural activity that encodes pitch in natural human voice, distinguishes between self-generated and passively heard vocalizations, detects discrepancies between the intended and heard vocalization, and contains information about the resulting behavioral vocal compensations in response to auditory feedback pitch perturbations. SIGNIFICANCE STATEMENT The present study is a significant contribution to our understanding of sensor-motor mechanisms of vocal production and motor control. The findings demonstrate distinct functional parcellation of core and noncore areas within human auditory cortex on Heschl's gyrus that process natural human vocalizations and pitch perturbations in the auditory feedback. In addition, our data provide evidence for distinct roles of high gamma neural oscillations and frequency following responses for processing periodicity in human vocalizations during vocal production and motor control. PMID:26888939
Sexual selection on male vocal fundamental frequency in humans and other anthropoids.
Puts, David A; Hill, Alexander K; Bailey, Drew H; Walker, Robert S; Rendall, Drew; Wheatley, John R; Welling, Lisa L M; Dawood, Khytam; Cárdenas, Rodrigo; Burriss, Robert P; Jablonski, Nina G; Shriver, Mark D; Weiss, Daniel; Lameira, Adriano R; Apicella, Coren L; Owren, Michael J; Barelli, Claudia; Glenn, Mary E; Ramos-Fernandez, Gabriel
2016-04-27
In many primates, including humans, the vocalizations of males and females differ dramatically, with male vocalizations and vocal anatomy often seeming to exaggerate apparent body size. These traits may be favoured by sexual selection because low-frequency male vocalizations intimidate rivals and/or attract females, but this hypothesis has not been systematically tested across primates, nor is it clear why competitors and potential mates should attend to vocalization frequencies. Here we show across anthropoids that sexual dimorphism in fundamental frequency (F0) increased during evolutionary transitions towards polygyny, and decreased during transitions towards monogamy. Surprisingly, humans exhibit greater F0 sexual dimorphism than any other ape. We also show that low-F0 vocalizations predict perceptions of men's dominance and attractiveness, and predict hormone profiles (low cortisol and high testosterone) related to immune function. These results suggest that low male F0 signals condition to competitors and mates, and evolved in male anthropoids in response to the intensity of mating competition. © 2016 The Author(s).
Formant characteristics of human laughter.
Szameitat, Diana P; Darwin, Chris J; Szameitat, André J; Wildgruber, Dirk; Alter, Kai
2011-01-01
Although laughter is an important aspect of nonverbal vocalization, its acoustic properties are still not fully understood. Extreme articulation during laughter production, such as wide jaw opening, suggests that laughter can have very high first formant (F(1)) frequencies. We measured fundamental frequency and formant frequencies of the vowels produced in the vocalic segments of laughter. Vocalic segments showed higher average F(1) frequencies than those previously reported and individual values could be as high as 1100 Hz for male speakers and 1500 Hz for female speakers. To our knowledge, these are the highest F(1) frequencies reported to date for human vocalizations, exceeding even the F(1) frequencies reported for trained soprano singers. These exceptionally high F(1) values are likely to be based on the extreme positions adopted by the vocal tract during laughter in combination with physiological constraints accompanying the production of a "pressed" voice. Copyright © 2011 The Voice Foundation. All rights reserved.
Fischer, J; Hammerschmidt, K
2011-01-01
Comparative analyses used to reconstruct the evolution of traits associated with the human language faculty, including its socio-cognitive underpinnings, highlight the importance of evolutionary constraints limiting vocal learning in non-human primates. After a brief overview of this field of research and the neural basis of primate vocalizations, we review studies that have addressed the genetic basis of usage and structure of ultrasonic communication in mice, with a focus on the gene FOXP2 involved in specific language impairments and neuroligin genes (NL-3 and NL-4) involved in autism spectrum disorders. Knockout of FoxP2 leads to reduced vocal behavior and eventually premature death. Introducing the human variant of FoxP2 protein into mice, in contrast, results in shifts in frequency and modulation of pup ultrasonic vocalizations. Knockout of NL-3 and NL-4 in mice diminishes social behavior and vocalizations. Although such studies may provide insights into the molecular and neural basis of social and communicative behavior, the structure of mouse vocalizations is largely innate, limiting the suitability of the mouse model to study human speech, a learned mode of production. Although knockout or replacement of single genes has perceptible effects on behavior, these genes are part of larger networks whose functions remain poorly understood. In humans, for instance, deficiencies in NL-4 can lead to a broad spectrum of disorders, suggesting that further factors (experiential and/or genetic) contribute to the variation in clinical symptoms. The precise nature as well as the interaction of these factors is yet to be determined. PMID:20579107
ERIC Educational Resources Information Center
Roy, Nelson; Fetrow, Rebecca A.; Merrill, Ray M.; Dromey, Christopher
2016-01-01
Purpose: Vocal hyperfunction, related to abnormal laryngeal muscle activity, is considered the proximal cause of primary muscle tension dysphonia (pMTD). Relative fundamental frequency (RFF) has been proposed as an objective acoustic marker of vocal hyperfunction. This study examined (a) the ability of RFF to track changes in vocal hyperfunction…
Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques.
Fitch, W T
1997-08-01
Body weight, length, and vocal tract length were measured for 23 rhesus macaques (Macaca mulatta) of various sizes using radiographs and computer graphic techniques. linear predictive coding analysis of tape-recorded threat vocalizations were used to determine vocal tract resonance frequencies ("formants") for the same animals. A new acoustic variable is proposed, "formant dispersion," which should theoretically depend upon vocal tract length. Formant dispersion is the averaged difference between successive formant frequencies, and was found to be closely tied to both vocal tract length and body size. Despite the common claim that voice fundamental frequency (F0) provides an acoustic indication of body size, repeated investigations have failed to support such a relationship in many vertebrate species including humans. Formant dispersion, unlike voice pitch, is proposed to be a reliable predictor of body size in macaques, and probably many other species.
NASA Astrophysics Data System (ADS)
Schwalm, Afton Leigh
California sea lions (Zalophus californianus) are a highly popular and easily recognized marine mammal in zoos, aquariums, circuses, and often seen by ocean visitors. They are highly vocal and gregarious on land. Surprisingly, little research has been performed on the vocalization types, source levels, acoustic properties, and functions of airborne sounds used by California sea lions. This research on airborne vocalizations of California sea lions will advance the understanding of this aspect of California sea lions communication, as well as examine the relationship between health condition and acoustic behavior. Using a PhillipsRTM digital recorder with attached microphone and a calibrated RadioShackRTM sound pressure level meter, acoustical data were recorded opportunistically on California sea lions during rehabilitation at The Marine Mammal Center in Sausalito, CA. Vocalizations were analyzed using frequency, time, and amplitude variables with Raven Pro: Interactive Sound Analysis Software Version 1.4 (The Cornell Lab of Ornithology, Ithaca, NY). Five frequency, three time, and four amplitude variables were analyzed for each vocalization. Differences in frequency, time, and amplitude variables were not significant by sex. The older California sea lion group produced vocalizations that were significantly lower in four frequency variables, significantly longer in two time variables, significantly higher in calibrated maximum and minimum amplitude variables, and significantly lower in frequency at maximum and minimum amplitude compared with pups. Six call types were identified: bark, goat, growl/grumble, bark/grumble, bark/growl, and grumble/moan. The growl/grumble call was higher in dominant beginning, ending, and minimum frequency, as well as in the frequency at maximum amplitude compared with the bark, goat, bark/grumble calls in the first versus last vocalization sample. The goat call was significantly higher in first harmonic interval than any other call type in the all vocalizations sample. The "fate" of a sea lion was categorized as: released, placed at another facility, remained at TMMC, euthanized, or died. To determine if acoustic features could be used to assess the recovery of a pup, the acoustic features of a pup's first recorded vocalization were compared with the frequency, time, and amplitude of the last vocalization recorded (i.e., before it was released or placed at another facility). In addition, all first vocalizations were pooled and all last vocalizations were pooled for acoustic analysis, regardless of their fate. Released pups had shorter duration calls, a greater first harmonic interval, and a higher dominant maximum frequency than either pups that died or pups remaining at TMMC. Released pups had a higher frequency at maximum and minimum amplitude compared to dead and remaining pups. Pups that died had significantly lower dominant ending frequency and a lower dominant minimum frequency than released or remaining pups. These results were supported by other studies on different species of otariids, phocids, and cetaceans. The preliminary analyses presented in this thesis holds promise that with additional data acoustic features of California sea lion airborne vocalizations could indicate sex, age, and possibly health condition or the potential for release.
Audio-vocal interaction in single neurons of the monkey ventrolateral prefrontal cortex.
Hage, Steffen R; Nieder, Andreas
2015-05-06
Complex audio-vocal integration systems depend on a strong interconnection between the auditory and the vocal motor system. To gain cognitive control over audio-vocal interaction during vocal motor control, the PFC needs to be involved. Neurons in the ventrolateral PFC (VLPFC) have been shown to separately encode the sensory perceptions and motor production of vocalizations. It is unknown, however, whether single neurons in the PFC reflect audio-vocal interactions. We therefore recorded single-unit activity in the VLPFC of rhesus monkeys (Macaca mulatta) while they produced vocalizations on command or passively listened to monkey calls. We found that 12% of randomly selected neurons in VLPFC modulated their discharge rate in response to acoustic stimulation with species-specific calls. Almost three-fourths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of vocalization. Based on these audio-vocal interactions, the VLPFC might be well positioned to combine higher order auditory processing with cognitive control of the vocal motor output. Such audio-vocal integration processes in the VLPFC might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech. Copyright © 2015 the authors 0270-6474/15/357030-11$15.00/0.
Ordóñez-Gómez, José D; Santillán-Doherty, Ana M; Fischer, Julia; Hammerschmidt, Kurt
2018-04-01
Due to several factors such as ecological conditions, group size, and social organization, primates frequently spend time out of visual contact with individuals of their own group. Through the use of long-distance vocalizations, often termed "contact calls," primates are able to maintain contact with out-of-sight individuals. Contact calls have been shown to be individually distinct, and reverberation and attenuation provide information about caller distance. It is less clear, however, whether callers actively change the structure of contact calls depending on the distance to the presumed listeners. We studied this question in spider monkeys (Ateles geoffroyi), a species with complex spatial dynamics (fission-fusion society) that produces highly frequency modulated contact calls, denominated "whinnies." We determined the acoustic characteristics of 566 whinnies recorded from 35 free-ranging spider monkeys that belong to a community located in Mexico, and used cluster analyses, discriminant function analyses, and generalized linear mixed models to assess if they varied in relation to the presumed distance to the listener. Whinnies could be grouped into five subtypes. Since the lowest frequency subtype was mainly produced by spider monkeys that exchanged whinnies at longer distances, and lower frequency calls propagate across longer distances, our results suggest that whinnies vary in order to enhance vocal contact between individuals separated by different distances. Our results also revealed that whinnies convey potential information about caller immediate behaviors and corroborated that these calls are individually distinct. Overall, our results suggest that whinny acoustic variation facilitates the maintenance of vocal contact between individuals living in a society with complex spatial dynamics. © 2018 Wiley Periodicals, Inc.
Drew, R; Sapir, S
1995-06-01
Nineteen trained soprano singers aged 18-30 years vocalized tasks designed to assess average speaking fundamental frequency (SFF) during spontaneous speaking and reading. Vocal range and perceptual characteristics while singing with low intensity and high frequency were also assessed, and subjects completed a survey of vocal habits/symptoms. Recorded signals were digitized prior to being analyzed for SFF using the Kay Computerized Speech Lab program. Subjects were assigned to a normal voice or impaired voice group based on ratings of perceptual tasks and survey results. Data analysis showed group differences in mean SFF, no differences in vocal range, higher mean SFF values for reading than speaking, and 58% ability to perceive speaking in low pitch. The role of speaking in too low pitch as causal for vocal symptoms and need for voice classification differentiation in vocal performance studies are discussed.
Thomas, Ashish; Suyesh, Robin; Biju, S. D.; Bee, Mark A.
2014-01-01
Quantitative descriptions of animal vocalizations can inform an understanding of their evolutionary functions, the mechanisms for their production and perception, and their potential utility in taxonomy, population monitoring, and conservation. The goal of this study was to provide the first acoustical and statistical analysis of the advertisement calls of Nasikabatrachus sahyadrensis. Commonly known as the Indian purple frog, N. sahyadrensis is an endangered species endemic to the Western Ghats of India. As the only known species in its family (Nasikabatrachidae), it has ancient evolutionary ties to frogs restricted to the Seychelles archipelago (Sooglossidae). The role of vocalizations in the behavior of this unique species poses interesting questions, as the animal is fossorial and potentially earless and it breeds explosively above the soil for only about two weeks a year. In this study, we quantified 19 acoustic properties of 208 calls recorded from 10 males. Vocalizations were organized into distinct call groups typically composed of two to six short (59 ms), pulsatile calls, each consisting of about five to seven pulses produced at a rate of about 106 pulses/s. The frequency content of the call consisted of a single dominant peak between 1200–1300 Hz and there was no frequency modulation. The patterns of variation within and among individuals were typical of those seen in other frogs. Few of the properties we measured were related to temperature, body size, or condition, though there was little variation in temperature. Field observations and recordings of captive individuals indicated that males engaged in both antiphonal calling and call overlap with nearby calling neighbors. We discuss our findings in relation to previous work on vocal behavior in other fossorial frogs and in sooglossid frogs. PMID:24516517
Singing modulates parvalbumin interneurons throughout songbird forebrain vocal control circuitry
Zengin-Toktas, Yildiz
2017-01-01
Across species, the performance of vocal signals can be modulated by the social environment. Zebra finches, for example, adjust their song performance when singing to females (‘female-directed’ or FD song) compared to when singing in isolation (‘undirected’ or UD song). These changes are salient, as females prefer the FD song over the UD song. Despite the importance of these performance changes, the neural mechanisms underlying this social modulation remain poorly understood. Previous work in finches has established that expression of the immediate early gene EGR1 is increased during singing and modulated by social context within the vocal control circuitry. Here, we examined whether particular neural subpopulations within those vocal control regions exhibit similar modulations of EGR1 expression. We compared EGR1 expression in neurons expressing parvalbumin (PV), a calcium buffer that modulates network plasticity and homeostasis, among males that performed FD song, males that produced UD song, or males that did not sing. We found that, overall, singing but not social context significantly affected EGR1 expression in PV neurons throughout the vocal control nuclei. We observed differences in EGR1 expression between two classes of PV interneurons in the basal ganglia nucleus Area X. Additionally, we found that singing altered the amount of PV expression in neurons in HVC and Area X and that distinct PV interneuron types in Area X exhibited different patterns of modulation by singing. These data indicate that throughout the vocal control circuitry the singing-related regulation of EGR1 expression in PV neurons may be less influenced by social context than in other neuron types and raise the possibility of cell-type specific differences in plasticity and calcium buffering. PMID:28235074
Measurements of vocal fold tissue viscoelasticity: Approaching the male phonatory frequency range
NASA Astrophysics Data System (ADS)
Chan, Roger W.
2004-06-01
Viscoelastic shear properties of human vocal fold tissues have been reported previously. However, data have only been obtained at very low frequencies (<=15 Hz). This necessitates data extrapolation to the frequency range of phonation based on constitutive modeling and time-temperature superposition. This study attempted to obtain empirical measurements at higher frequencies with the use of a controlled strain torsional rheometer, with a design of directly controlling input strain that introduced significantly smaller system inertial errors compared to controlled stress rheometry. Linear viscoelastic shear properties of the vocal fold mucosa (cover) from 17 canine larynges were quantified at frequencies of up to 50 Hz. Consistent with previous data, results showed that the elastic shear modulus (G'), viscous shear modulus (G''), and damping ratio (ζ) of the vocal fold mucosa were relatively constant across 0.016-50 Hz, whereas the dynamic viscosity (ɛ') decreased monotonically with frequency. Constitutive characterization of the empirical data by a quasilinear viscoelastic model and a statistical network model demonstrated trends of viscoelastic behavior at higher frequencies generally following those observed at lower frequencies. These findings supported the use of controlled strain rheometry for future investigations of the viscoelasticity of vocal fold tissues and phonosurgical biomaterials at phonatory frequencies.
Time-Varying Vocal Folds Vibration Detection Using a 24 GHz Portable Auditory Radar
Hong, Hong; Zhao, Heng; Peng, Zhengyu; Li, Hui; Gu, Chen; Li, Changzhi; Zhu, Xiaohua
2016-01-01
Time-varying vocal folds vibration information is of crucial importance in speech processing, and the traditional devices to acquire speech signals are easily smeared by the high background noise and voice interference. In this paper, we present a non-acoustic way to capture the human vocal folds vibration using a 24-GHz portable auditory radar. Since the vocal folds vibration only reaches several millimeters, the high operating frequency and the 4 × 4 array antennas are applied to achieve the high sensitivity. The Variational Mode Decomposition (VMD) based algorithm is proposed to decompose the radar-detected auditory signal into a sequence of intrinsic modes firstly, and then, extract the time-varying vocal folds vibration frequency from the corresponding mode. Feasibility demonstration, evaluation, and comparison are conducted with tonal and non-tonal languages, and the low relative errors show a high consistency between the radar-detected auditory time-varying vocal folds vibration and acoustic fundamental frequency, except that the auditory radar significantly improves the frequency-resolving power. PMID:27483261
Time-Varying Vocal Folds Vibration Detection Using a 24 GHz Portable Auditory Radar.
Hong, Hong; Zhao, Heng; Peng, Zhengyu; Li, Hui; Gu, Chen; Li, Changzhi; Zhu, Xiaohua
2016-07-28
Time-varying vocal folds vibration information is of crucial importance in speech processing, and the traditional devices to acquire speech signals are easily smeared by the high background noise and voice interference. In this paper, we present a non-acoustic way to capture the human vocal folds vibration using a 24-GHz portable auditory radar. Since the vocal folds vibration only reaches several millimeters, the high operating frequency and the 4 × 4 array antennas are applied to achieve the high sensitivity. The Variational Mode Decomposition (VMD) based algorithm is proposed to decompose the radar-detected auditory signal into a sequence of intrinsic modes firstly, and then, extract the time-varying vocal folds vibration frequency from the corresponding mode. Feasibility demonstration, evaluation, and comparison are conducted with tonal and non-tonal languages, and the low relative errors show a high consistency between the radar-detected auditory time-varying vocal folds vibration and acoustic fundamental frequency, except that the auditory radar significantly improves the frequency-resolving power.
NASA Astrophysics Data System (ADS)
DeRosa, Angela
The present study analyzed the acoustic and perceptual differences in non-singer's singing voice before and after a vocal warm-up. Experiments were conducted with 12 females who had no singing experience and considered themselves to be non-singers. Participants were recorded performing 3 tasks: a musical scale stretching to their most comfortable high and low pitches, sustained productions of the vowels /a/ and /i/, and singing performance of the "Star Spangled Banner." Participants were recorded performing these three tasks before a vocal warm-up, after a vocal warm-up, and then again 2-3 weeks later after 2-3 weeks of practice. Acoustical analysis consisted of formant frequency analysis, singer's formant/singing power ratio analysis, maximum phonation frequency range analysis, and an analysis of jitter, noise to harmonic ratio (NHR), relative average perturbation (RAP), and voice turbulence index (VTI). A perceptual analysis was also conducted with 12 listeners rating comparison performances of before vs. after the vocal warm-up, before vs. after the second vocal warm-up, and after both vocal warm-ups. There were no significant findings for the formant frequency analysis of the vowel /a/, but there was significance for the 1st formant frequency analysis of the vowel /i/. Singer's formant analyzed via Singing Power Ratio analysis showed significance only for the vowel /i/. Maximum phonation frequency range analysis showed a significant increase after the vocal warm-ups. There were no significant findings for the acoustic measures of jitter, NHR, RAP, and VTI. Perceptual analysis showed a significant difference after a vocal warm-up. The results indicate that a singing vocal warm-up can have a significant positive influence on the singing voice of non-singers.
Construction and Characterization of a Novel Vocal Fold Bioreactor
Zerdoum, Aidan B.; Tong, Zhixiang; Bachman, Brendan; Jia, Xinqiao
2014-01-01
In vitro engineering of mechanically active tissues requires the presentation of physiologically relevant mechanical conditions to cultured cells. To emulate the dynamic environment of vocal folds, a novel vocal fold bioreactor capable of producing vibratory stimulations at fundamental phonation frequencies is constructed and characterized. The device is composed of a function generator, a power amplifier, a speaker selector and parallel vibration chambers. Individual vibration chambers are created by sandwiching a custom-made silicone membrane between a pair of acrylic blocks. The silicone membrane not only serves as the bottom of the chamber but also provides a mechanism for securing the cell-laden scaffold. Vibration signals, generated by a speaker mounted underneath the bottom acrylic block, are transmitted to the membrane aerodynamically by the oscillating air. Eight identical vibration modules, fixed on two stationary metal bars, are housed in an anti-humidity chamber for long-term operation in a cell culture incubator. The vibration characteristics of the vocal fold bioreactor are analyzed non-destructively using a Laser Doppler Vibrometer (LDV). The utility of the dynamic culture device is demonstrated by culturing cellular constructs in the presence of 200-Hz sinusoidal vibrations with a mid-membrane displacement of 40 µm. Mesenchymal stem cells cultured in the bioreactor respond to the vibratory signals by altering the synthesis and degradation of vocal fold-relevant, extracellular matrix components. The novel bioreactor system presented herein offers an excellent in vitro platform for studying vibration-induced mechanotransduction and for the engineering of functional vocal fold tissues. PMID:25145349
Construction and characterization of a novel vocal fold bioreactor.
Zerdoum, Aidan B; Tong, Zhixiang; Bachman, Brendan; Jia, Xinqiao
2014-08-01
In vitro engineering of mechanically active tissues requires the presentation of physiologically relevant mechanical conditions to cultured cells. To emulate the dynamic environment of vocal folds, a novel vocal fold bioreactor capable of producing vibratory stimulations at fundamental phonation frequencies is constructed and characterized. The device is composed of a function generator, a power amplifier, a speaker selector and parallel vibration chambers. Individual vibration chambers are created by sandwiching a custom-made silicone membrane between a pair of acrylic blocks. The silicone membrane not only serves as the bottom of the chamber but also provides a mechanism for securing the cell-laden scaffold. Vibration signals, generated by a speaker mounted underneath the bottom acrylic block, are transmitted to the membrane aerodynamically by the oscillating air. Eight identical vibration modules, fixed on two stationary metal bars, are housed in an anti-humidity chamber for long-term operation in a cell culture incubator. The vibration characteristics of the vocal fold bioreactor are analyzed non-destructively using a Laser Doppler Vibrometer (LDV). The utility of the dynamic culture device is demonstrated by culturing cellular constructs in the presence of 200-Hz sinusoidal vibrations with a mid-membrane displacement of 40 µm. Mesenchymal stem cells cultured in the bioreactor respond to the vibratory signals by altering the synthesis and degradation of vocal fold-relevant, extracellular matrix components. The novel bioreactor system presented herein offers an excellent in vitro platform for studying vibration-induced mechanotransduction and for the engineering of functional vocal fold tissues.
Effect of artificially lengthened vocal tract on vocal fold oscillation's fundamental frequency.
Hanamitsu, Masakazu; Kataoka, Hideyuki
2004-06-01
The fundamental frequency of vocal fold oscillation (F(0)) is controlled by laryngeal mechanics and aerodynamic properties. F(0) change per unit change of transglottal pressure (dF/dP) using a shutter valve has been studied and found to have nonlinear, V-shaped relationship with F(0). On the other hand, the vocal tract is also known to affect vocal fold oscillation. This study examined the effect of artificially lengthened vocal tract length on dF/dP. dF/dP was measured in six men using two mouthpieces of different lengths. The dF/dP graph for the longer vocal tract was shifted leftward relative to the shorter one. Using the one-mass model, the nadir of the "V" on the dF/dP graph was strongly influenced by the resonance around the first formant frequency. However, a more precise model is needed to account for the effects of viscosity and turbulence.
Functional subdivisions in low-frequency primary auditory cortex (AI).
Wallace, M N; Palmer, A R
2009-04-01
We wished to test the hypothesis that there are modules in low-frequency AI that can be identified by their responsiveness to communication calls or particular regions of space. Units were recorded in anaesthetised guinea pig AI and stimulated with conspecific vocalizations and a virtual motion stimulus (binaural beats) presented via a closed sound system. Recording tracks were mainly oriented orthogonally to the cortical surface. Some of these contained units that were all time-locked to the structure of the chutter call (14/22 tracks) and/or the purr call (12/22 tracks) and/or that had a preference for stimuli from a particular region of space (8/20 tracks with four contralateral, two ipsilateral and two midline), or where there was a strong asymmetry in the response to beats of different direction (two tracks). We conclude that about half of low-frequency AI is organized into modules that are consistent with separate "what" and "where" pathways.
From electromyographic activity to frequency modulation in zebra finch song.
Döppler, Juan F; Bush, Alan; Goller, Franz; Mindlin, Gabriel B
2018-02-01
Behavior emerges from the interaction between the nervous system and peripheral devices. In the case of birdsong production, a delicate and fast control of several muscles is required to control the configuration of the syrinx (the avian vocal organ) and the respiratory system. In particular, the syringealis ventralis muscle is involved in the control of the tension of the vibrating labia and thus affects the frequency modulation of the sound. Nevertheless, the translation of the instructions (which are electrical in nature) into acoustical features is complex and involves nonlinear, dynamical processes. In this work, we present a model of the dynamics of the syringealis ventralis muscle and the labia, which allows calculating the frequency of the generated sound, using as input the electrical activity recorded in the muscle. In addition, the model provides a framework to interpret inter-syllabic activity and hints at the importance of the biomechanical dynamics in determining behavior.
Correlation between vocal tract symptoms and modern singing handicap index in church gospel singers.
Pinheiro, Joel; Silverio, Kelly Cristina Alves; Siqueira, Larissa Thaís Donalonso; Ramos, Janine Santos; Brasolotto, Alcione Ghedini; Zambon, Fabiana; Behlau, Mara
2017-08-24
To verify the correlation between vocal tract discomfort symptoms and perceived voice handicaps in gospel singers, analyzing possible differences according to gender. 100 gospel singers volunteered, 50 male and 50 female. All participants answered two questionnaires: Vocal Tract Discomfort (VTD) scale and the Modern Singing Handicap Index (MSHI) that investigates the vocal handicap perceived by singers, linking the results of both instruments (p<0.05). Women presented more perceived handicaps and also more frequent and higher intensity vocal tract discomfort. Furthermore, the more frequent and intense the vocal tract symptoms, the higher the vocal handicap for singing. Female gospel singers present higher frequency and intensity of vocal tract discomfort symptoms, as well as higher voice handicap for singing than male gospel singers. The higher the frequency and intensity of the laryngeal symptoms, the higher the vocal handicap will be.
Exploring vocal recovery after cranial nerve injury in Bengalese finches.
Urbano, Catherine M; Peterson, Jennifer R; Cooper, Brenton G
2013-02-08
Songbirds and humans use auditory feedback to acquire and maintain their vocalizations. The Bengalese finch (Lonchura striata domestica) is a songbird species that rapidly modifies its vocal output to adhere to an internal song memory. In this species, the left side of the bipartite vocal organ is specialized for producing louder, higher frequencies (≥2.2kHz) and denervation of the left vocal muscles eliminates these notes. Thus, the return of higher frequency notes after cranial nerve injury can be used as a measure of vocal recovery. Either the left or right side of the syrinx was denervated by resection of the tracheosyringeal portion of the hypoglossal nerve. Histologic analyses of syringeal muscle tissue showed significant muscle atrophy in the denervated side. After left nerve resection, songs were mainly composed of lower frequency syllables, but three out of five birds recovered higher frequency syllables. Right nerve resection minimally affected phonology, but it did change song syntax; syllable sequence became abnormally stereotyped after right nerve resection. Therefore, damage to the neuromuscular control of sound production resulted in reduced motor variability, and Bengalese finches are a potential model for functional vocal recovery following cranial nerve injury. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Adapted to Roar: Functional Morphology of Tiger and Lion Vocal Folds
Klemuk, Sarah A.; Riede, Tobias; Walsh, Edward J.; Titze, Ingo R.
2011-01-01
Vocal production requires active control of the respiratory system, larynx and vocal tract. Vocal sounds in mammals are produced by flow-induced vocal fold oscillation, which requires vocal fold tissue that can sustain the mechanical stress during phonation. Our understanding of the relationship between morphology and vocal function of vocal folds is very limited. Here we tested the hypothesis that vocal fold morphology and viscoelastic properties allow a prediction of fundamental frequency range of sounds that can be produced, and minimal lung pressure necessary to initiate phonation. We tested the hypothesis in lions and tigers who are well-known for producing low frequency and very loud roaring sounds that expose vocal folds to large stresses. In histological sections, we found that the Panthera vocal fold lamina propria consists of a lateral region with adipocytes embedded in a network of collagen and elastin fibers and hyaluronan. There is also a medial region that contains only fibrous proteins and hyaluronan but no fat cells. Young's moduli range between 10 and 2000 kPa for strains up to 60%. Shear moduli ranged between 0.1 and 2 kPa and differed between layers. Biomechanical and morphological data were used to make predictions of fundamental frequency and subglottal pressure ranges. Such predictions agreed well with measurements from natural phonation and phonation of excised larynges, respectively. We assume that fat shapes Panthera vocal folds into an advantageous geometry for phonation and it protects vocal folds. Its primary function is probably not to increase vocal fold mass as suggested previously. The large square-shaped Panthera vocal fold eases phonation onset and thereby extends the dynamic range of the voice. PMID:22073246
Vocal effort modulates the motor planning of short speech structures
NASA Astrophysics Data System (ADS)
Taitz, Alan; Shalom, Diego E.; Trevisan, Marcos A.
2018-05-01
Speech requires programming the sequence of vocal gestures that produce the sounds of words. Here we explored the timing of this program by asking our participants to pronounce, as quickly as possible, a sequence of consonant-consonant-vowel (CCV) structures appearing on screen. We measured the delay between visual presentation and voice onset. In the case of plosive consonants, produced by sharp and well defined movements of the vocal tract, we found that delays are positively correlated with the duration of the transition between consonants. We then used a battery of statistical tests and mathematical vocal models to show that delays reflect the motor planning of CCVs and transitions are proxy indicators of the vocal effort needed to produce them. These results support that the effort required to produce the sequence of movements of a vocal gesture modulates the onset of the motor plan.
Maurer, D; Hess, M; Gross, M
1996-12-01
Theoretic investigations of the "source-filter" model have indicated a pronounced acoustic interaction of glottal source and vocal tract. Empirical investigations of formant pattern variations apart from changes in vowel identity have demonstrated a direct relationship between the fundamental frequency and the patterns. As a consequence of both findings, independence of phonation and articulation may be limited in the speech process. Within the present study, possible interdependence of phonation and phoneme was investigated: vocal fold vibrations and larynx position for vocalizations of different vowels in a healthy man and woman were examined by high-speed light-intensified digital imaging. We found 1) different movements of the vocal folds for vocalizations of different vowel identities within one speaker and at similar fundamental frequency, and 2) constant larynx position within vocalization of one vowel identity, but different positions for vocalizations of different vowel identities. A possible relationship between the vocal fold vibrations and the phoneme is discussed.
Auditory and audio-vocal responses of single neurons in the monkey ventral premotor cortex.
Hage, Steffen R
2018-03-20
Monkey vocalization is a complex behavioral pattern, which is flexibly used in audio-vocal communication. A recently proposed dual neural network model suggests that cognitive control might be involved in this behavior, originating from a frontal cortical network in the prefrontal cortex and mediated via projections from the rostral portion of the ventral premotor cortex (PMvr) and motor cortex to the primary vocal motor network in the brainstem. For the rapid adjustment of vocal output to external acoustic events, strong interconnections between vocal motor and auditory sites are needed, which are present at cortical and subcortical levels. However, the role of the PMvr in audio-vocal integration processes remains unclear. In the present study, single neurons in the PMvr were recorded in rhesus monkeys (Macaca mulatta) while volitionally producing vocalizations in a visual detection task or passively listening to monkey vocalizations. Ten percent of randomly selected neurons in the PMvr modulated their discharge rate in response to acoustic stimulation with species-specific calls. More than four-fifths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of the vocalization. Based on these audio-vocal interactions, the PMvr might be well positioned to mediate higher order auditory processing with cognitive control of the vocal motor output to the primary vocal motor network. Such audio-vocal integration processes in the premotor cortex might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech. Copyright © 2018 Elsevier B.V. All rights reserved.
Ma, Jie; Kanwal, Jagmeet S.
2014-01-01
The neural substrate for the perception of vocalizations is relatively well described, but how their timing and specificity are tightly coupled with accompanying physiological changes and context-appropriate behaviors remains unresolved. We hypothesized that temporally integrated vocal and emotive responses, especially the expression of fear, vigilance and aggression, originate within the amygdala. To test this hypothesis, we performed electrical microstimulation at 461 highly restricted loci within the basal and central amygdala in awake mustached bats. At a subset of these sites, high frequency stimulation with weak constant current pulses presented at near-threshold levels triggered vocalization of either echolocation pulses or social calls. At the vast majority of locations, microstimulation produced a constellation of changes in autonomic and somatomotor outputs. These changes included widespread co-activation of significant tachycardia and hyperventilation and/or rhythmic ear pinna movements (PMs). In a few locations, responses were constrained to vocalization and/or PMs despite increases in the intensity of stimulation. The probability of eliciting echolocation pulses vs. social calls decreased in a medial-posterior to anterolateral direction within the centrobasal amygdala. Microinjections of kainic acid (KA) at stimulation sites confirmed the contribution of cellular activity rather than fibers-of-passage in the control of multimodal outputs. The results suggest that localized clusters of neurons may simultaneously modulate the activity of multiple central pattern generators (CPGs) present within the brainstem. PMID:24624089
Ma, Jie; Kanwal, Jagmeet S
2014-01-01
The neural substrate for the perception of vocalizations is relatively well described, but how their timing and specificity are tightly coupled with accompanying physiological changes and context-appropriate behaviors remains unresolved. We hypothesized that temporally integrated vocal and emotive responses, especially the expression of fear, vigilance and aggression, originate within the amygdala. To test this hypothesis, we performed electrical microstimulation at 461 highly restricted loci within the basal and central amygdala in awake mustached bats. At a subset of these sites, high frequency stimulation with weak constant current pulses presented at near-threshold levels triggered vocalization of either echolocation pulses or social calls. At the vast majority of locations, microstimulation produced a constellation of changes in autonomic and somatomotor outputs. These changes included widespread co-activation of significant tachycardia and hyperventilation and/or rhythmic ear pinna movements (PMs). In a few locations, responses were constrained to vocalization and/or PMs despite increases in the intensity of stimulation. The probability of eliciting echolocation pulses vs. social calls decreased in a medial-posterior to anterolateral direction within the centrobasal amygdala. Microinjections of kainic acid (KA) at stimulation sites confirmed the contribution of cellular activity rather than fibers-of-passage in the control of multimodal outputs. The results suggest that localized clusters of neurons may simultaneously modulate the activity of multiple central pattern generators (CPGs) present within the brainstem.
Coos, booms, and hoots: The evolution of closed-mouth vocal behavior in birds.
Riede, Tobias; Eliason, Chad M; Miller, Edward H; Goller, Franz; Clarke, Julia A
2016-08-01
Most birds vocalize with an open beak, but vocalization with a closed beak into an inflating cavity occurs in territorial or courtship displays in disparate species throughout birds. Closed-mouth vocalizations generate resonance conditions that favor low-frequency sounds. By contrast, open-mouth vocalizations cover a wider frequency range. Here we describe closed-mouth vocalizations of birds from functional and morphological perspectives and assess the distribution of closed-mouth vocalizations in birds and related outgroups. Ancestral-state optimizations of body size and vocal behavior indicate that closed-mouth vocalizations are unlikely to be ancestral in birds and have evolved independently at least 16 times within Aves, predominantly in large-bodied lineages. Closed-mouth vocalizations are rare in the small-bodied passerines. In light of these results and body size trends in nonavian dinosaurs, we suggest that the capacity for closed-mouth vocalization was present in at least some extinct nonavian dinosaurs. As in birds, this behavior may have been limited to sexually selected vocal displays, and hence would have co-occurred with open-mouthed vocalizations. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.
Viscoelastic properties of rabbit vocal folds after augmentation.
Hertegård, Stellan; Dahlqvist, Ake; Laurent, Claude; Borzacchiello, Assunta; Ambrosio, Luigi
2003-03-01
Vocal fold function is closely related to tissue viscoelasticity. Augmentation substances may alter the viscoelastic properties of vocal fold tissues and hence their vibratory capacity. We sought to investigate the viscoelastic properties of rabbit vocal folds in vitro after injections of various augmentation substances. Polytetrafluoroethylene (Teflon), cross-linked collagen (Zyplast), and cross-linked hyaluronan, hylan b gel (Hylaform) were injected into the lamina propria and the thyroarytenoid muscle of rabbit vocal folds. Dynamic viscosity of the injected vocal fold as a function of frequency was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Vocal fold samples injected with hylan b gel showed the lowest dynamic viscosity, quite close to noninjected control samples. Vocal folds injected with polytetrafluoroethylene showed the highest dynamic viscosity followed by the collagen samples. The data indicated that hylan b gel in short-term renders the most natural viscoelastic properties to the vocal fold among the substances tested. This is of importance to restore/preserve the vibratory capacity of the vocal folds when glottal insufficiency is treated with injections.
The importance of hyaluronic acid in vocal fold biomechanics.
Chan, R W; Gray, S D; Titze, I R
2001-06-01
This study examined the influence of hyaluronic acid (HA) on the biomechanical properties of the human vocal fold cover (the superficial layer of the lamina propria). Vocal fold tissues were freshly excised from 5 adult male cadavers and were treated with bovine testicular hyaluronidase to selectively remove HA from the lamina propria extracellular matrix (ECM). Linear viscoelastic shear properties (elastic shear modulus and dynamic viscosity) of the tissue samples before and after enzymatic treatment were quantified as a function of frequency (0.01 to 15 Hz) by a parallel-plate rotational rheometer at 37 degrees C. On removing HA from the vocal fold ECM, the elastic shear modulus (G' ) or stiffness of the vocal fold cover decreased by an average of around 35%, while the dynamic viscosity (eta') increased by 70% at higher frequencies (>1 Hz). The results suggested that HA plays an important role in determining the biomechanical properties of the vocal fold cover. As a highly hydrated glycosaminoglycan in the vocal fold ECM, it likely contributes to the maintenance of an optimal tissue viscosity that may facilitate phonation, and an optimal tissue stiffness that may be important for vocal fundamental frequency control. HA has been proposed as a potential bioimplant for the surgical repair of vocal fold ECM defects (eg, vocal fold scarring and sulcus vocalis). Our results suggested that such clinical use may be potentially optimal for voice production from a biomechanical perspective.
Zhang, Zhaoyan
2016-01-01
The goal of this study is to better understand the cause-effect relation between vocal fold physiology and the resulting vibration pattern and voice acoustics. Using a three-dimensional continuum model of phonation, the effects of changes in vocal fold stiffness, medial surface thickness in the vertical direction, resting glottal opening, and subglottal pressure on vocal fold vibration and different acoustic measures are investigated. The results show that the medial surface thickness has dominant effects on the vertical phase difference between the upper and lower margins of the medial surface, closed quotient, H1-H2, and higher-order harmonics excitation. The main effects of vocal fold approximation or decreasing resting glottal opening are to lower the phonation threshold pressure, reduce noise production, and increase the fundamental frequency. Increasing subglottal pressure is primarily responsible for vocal intensity increase but also leads to significant increase in noise production and an increased fundamental frequency. Increasing AP stiffness significantly increases the fundamental frequency and slightly reduces noise production. The interaction among vocal fold thickness, stiffness, approximation, and subglottal pressure in the control of F0, vocal intensity, and voice quality is discussed. PMID:27106298
The effect of superior auditory skills on vocal accuracy
NASA Astrophysics Data System (ADS)
Amir, Ofer; Amir, Noam; Kishon-Rabin, Liat
2003-02-01
The relationship between auditory perception and vocal production has been typically investigated by evaluating the effect of either altered or degraded auditory feedback on speech production in either normal hearing or hearing-impaired individuals. Our goal in the present study was to examine this relationship in individuals with superior auditory abilities. Thirteen professional musicians and thirteen nonmusicians, with no vocal or singing training, participated in this study. For vocal production accuracy, subjects were presented with three tones. They were asked to reproduce the pitch using the vowel /a/. This procedure was repeated three times. The fundamental frequency of each production was measured using an autocorrelation pitch detection algorithm designed for this study. The musicians' superior auditory abilities (compared to the nonmusicians) were established in a frequency discrimination task reported elsewhere. Results indicate that (a) musicians had better vocal production accuracy than nonmusicians (production errors of 1/2 a semitone compared to 1.3 semitones, respectively); (b) frequency discrimination thresholds explain 43% of the variance of the production data, and (c) all subjects with superior frequency discrimination thresholds showed accurate vocal production; the reverse relationship, however, does not hold true. In this study we provide empirical evidence to the importance of auditory feedback on vocal production in listeners with superior auditory skills.
[Objective study of the voice quality following partial laryngectomy].
Remacle, M; Millet, B
1991-01-01
The high resolution frequency analyzer is used for the study of the vocal quality after partial laryngectomy. The post-operative plot after speech therapy is of good quality when respecting one vocal fold. On the contrary, the heard vocal sound does not correspond to the harmonics of the fundamental frequency but to intense noise from irregular vibrations of the residual laryngeal mucosa (ventricular folds, arytenoids). High resolution frequency analysis contributes to the follow-up of the partial laryngectomy.
Vocal communication in African elephants (Loxodonta africana).
Soltis, Joseph
2010-01-01
Research on vocal communication in African elephants has increased in recent years, both in the wild and in captivity, providing an opportunity to present a comprehensive review of research related to their vocal behavior. Current data indicate that the vocal repertoire consists of perhaps nine acoustically distinct call types, "rumbles" being the most common and acoustically variable. Large vocal production anatomy is responsible for the low-frequency nature of rumbles, with fundamental frequencies in the infrasonic range. Additionally, resonant frequencies of rumbles implicate the trunk in addition to the oral cavity in shaping the acoustic structure of rumbles. Long-distance communication is thought possible because low-frequency sounds propagate more faithfully than high-frequency sounds, and elephants respond to rumbles at distances of up to 2.5 km. Elephant ear anatomy appears designed for detecting low frequencies, and experiments demonstrate that elephants can detect infrasonic tones and discriminate small frequency differences. Two vocal communication functions in the African elephant now have reasonable empirical support. First, closely bonded but spatially separated females engage in rumble exchanges, or "contact calls," that function to coordinate movement or reunite animals. Second, both males and females produce "mate attraction" rumbles that may advertise reproductive states to the opposite sex. Additionally, there is evidence that the structural variation in rumbles reflects the individual identity, reproductive state, and emotional state of callers. Growth in knowledge about the communication system of the African elephant has occurred from a rich combination of research on wild elephants in national parks and captive elephants in zoological parks.
Kuo, Chung-Feng Jeffrey; Wang, Hsing-Won; Hsiao, Shang-Wun; Peng, Kai-Ching; Chou, Ying-Liang; Lai, Chun-Yu; Hsu, Chien-Tung Max
2014-01-01
Physicians clinically use laryngeal video stroboscope as an auxiliary instrument to test glottal diseases, and read vocal fold images and voice quality for diagnosis. As the position of vocal fold varies in each person, the proportion of the vocal fold size as presented in the vocal fold image is different, making it impossible to directly estimate relevant glottis physiological parameters, such as the length, area, perimeter, and opening angle of the glottis. Hence, this study designs an innovative laser projection marking module for the laryngeal video stroboscope to provide reference parameters for image scaling conversion. This innovative laser projection marking module to be installed on the laryngeal video stroboscope using laser beams to project onto the glottis plane, in order to provide reference parameters for scaling conversion of images of laryngeal video stroboscope. Copyright © 2013 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Henrich, Nathalie; D'Alessandro, Christophe; Doval, Boris; Castellengo, Michèle
2005-03-01
This article presents the results of glottal open-quotient measurements in the case of singing voice production. It explores the relationship between open quotient and laryngeal mechanisms, vocal intensity, and fundamental frequency. The audio and electroglottographic signals of 18 classically trained male and female singers were recorded and analyzed with regard to vocal intensity, fundamental frequency, and open quotient. Fundamental frequency and open quotient are derived from the differentiated electroglottographic signal, using the DECOM (DEgg Correlation-based Open quotient Measurement) method. As male and female phonation may differ in respect to vocal-fold vibratory properties, a distinction is made between two different glottal configurations, which are called laryngeal mechanisms: mechanism 1 (related to chest, modal, and male head register) and mechanism 2 (related to falsetto for male and head register for female). The results show that open quotient depends on the laryngeal mechanisms. It ranges from 0.3 to 0.8 in mechanism 1 and from 0.5 to 0.95 in mechanism 2. The open quotient is strongly related to vocal intensity in mechanism 1 and to fundamental frequency in mechanism 2. .
Van Stan, Jarrad H; Mehta, Daryush D; Petit, Robert J; Sternad, Dagmar; Muise, Jason; Burns, James A; Hillman, Robert E
2017-02-01
Ambulatory voice biofeedback (AVB) has the potential to significantly improve voice therapy effectiveness by targeting one of the most challenging aspects of rehabilitation: carryover of desired behaviors outside of the therapy session. Although initial evidence indicates that AVB can alter vocal behavior in daily life, retention of the new behavior after biofeedback has not been demonstrated. Motor learning studies repeatedly have shown retention-related benefits when reducing feedback frequency or providing summary statistics. Therefore, novel AVB settings that are based on these concepts are developed and implemented. The underlying theoretical framework and resultant implementation of innovative AVB settings on a smartphone-based voice monitor are described. A clinical case study demonstrates the functionality of the new relative frequency feedback capabilities. With new technical capabilities, 2 aspects of feedback are directly modifiable for AVB: relative frequency and summary feedback. Although reduced-frequency AVB was associated with improved carryover of a therapeutic vocal behavior (i.e., reduced vocal intensity) in a patient post-excision of vocal fold nodules, causation cannot be assumed. Timing and frequency of AVB schedules can be manipulated to empirically assess generalization of motor learning principles to vocal behavior modification and test the clinical effectiveness of AVB with various feedback schedules.
Mehta, Daryush D.; Petit, Robert J.; Sternad, Dagmar; Muise, Jason; Burns, James A.; Hillman, Robert E.
2017-01-01
Purpose Ambulatory voice biofeedback (AVB) has the potential to significantly improve voice therapy effectiveness by targeting one of the most challenging aspects of rehabilitation: carryover of desired behaviors outside of the therapy session. Although initial evidence indicates that AVB can alter vocal behavior in daily life, retention of the new behavior after biofeedback has not been demonstrated. Motor learning studies repeatedly have shown retention-related benefits when reducing feedback frequency or providing summary statistics. Therefore, novel AVB settings that are based on these concepts are developed and implemented. Method The underlying theoretical framework and resultant implementation of innovative AVB settings on a smartphone-based voice monitor are described. A clinical case study demonstrates the functionality of the new relative frequency feedback capabilities. Results With new technical capabilities, 2 aspects of feedback are directly modifiable for AVB: relative frequency and summary feedback. Although reduced-frequency AVB was associated with improved carryover of a therapeutic vocal behavior (i.e., reduced vocal intensity) in a patient post-excision of vocal fold nodules, causation cannot be assumed. Conclusions Timing and frequency of AVB schedules can be manipulated to empirically assess generalization of motor learning principles to vocal behavior modification and test the clinical effectiveness of AVB with various feedback schedules. PMID:28124070
A Chinese alligator in heliox: formant frequencies in a crocodilian
Reber, Stephan A.; Nishimura, Takeshi; Janisch, Judith; Robertson, Mark; Fitch, W. Tecumseh
2015-01-01
ABSTRACT Crocodilians are among the most vocal non-avian reptiles. Adults of both sexes produce loud vocalizations known as ‘bellows’ year round, with the highest rate during the mating season. Although the specific function of these vocalizations remains unclear, they may advertise the caller's body size, because relative size differences strongly affect courtship and territorial behaviour in crocodilians. In mammals and birds, a common mechanism for producing honest acoustic signals of body size is via formant frequencies (vocal tract resonances). To our knowledge, formants have to date never been documented in any non-avian reptile, and formants do not seem to play a role in the vocalizations of anurans. We tested for formants in crocodilian vocalizations by using playbacks to induce a female Chinese alligator (Alligator sinensis) to bellow in an airtight chamber. During vocalizations, the animal inhaled either normal air or a helium/oxygen mixture (heliox) in which the velocity of sound is increased. Although heliox allows normal respiration, it alters the formant distribution of the sound spectrum. An acoustic analysis of the calls showed that the source signal components remained constant under both conditions, but an upward shift of high-energy frequency bands was observed in heliox. We conclude that these frequency bands represent formants. We suggest that crocodilian vocalizations could thus provide an acoustic indication of body size via formants. Because birds and crocodilians share a common ancestor with all dinosaurs, a better understanding of their vocal production systems may also provide insight into the communication of extinct Archosaurians. PMID:26246611
Lin, Ya; Yamashita, Masaru; Zhang, Jingxian; Ling, Changying; Welham, Nathan V
2009-10-01
Disruption of the vocal fold extracellular matrix (ECM) can induce a profound and refractory dysphonia. Pulsed dye laser (PDL) irradiation has shown early promise as a treatment modality for disordered ECM in patients with chronic vocal fold scar; however, there are limited data addressing the mechanism by which this laser energy might induce cellular and extracellular changes in vocal fold tissues. In this study, we examined the inflammatory and ECM modulating effects of PDL irradiation on normal vocal fold tissues and cultured vocal fold fibroblasts (VFFs). We evaluated the effects of 585 nm PDL irradiation on inflammatory cytokine and collagen/collagenase gene transcription in normal rat vocal folds in vivo (3-168 hours following delivery of approximately 39.46 J/cm(2) fluence) and VFFs in vitro (3-72 hours following delivery of 4.82 or 9.64 J/cm(2) fluence). We also examined morphological vocal fold tissue changes 3 hours, 1 week, and 1 month post-irradiation. PDL irradiation altered inflammatory cytokine and procollagen/collagenase expression at the transcript level, both in vitro and in vivo. Additionally, PDL irradiation induced an inflammatory repair process in vivo that was completed by 1 month with preservation of normal tissue morphology. PDL irradiation can modulate ECM turnover in phenotypically normal vocal folds. Additional work is required to determine if these findings extend to disordered ECM, such as is seen in vocal fold scar. Lasers Surg. Med. 41:585-594, 2009. (c) 2009 Wiley-Liss, Inc.
Kvit, Anton A; Devine, Erin E; Jiang, Jack J; Vamos, Andrew C; Tao, Chao
2015-05-01
Vocal fold tissue is biphasic and consists of a solid extracellular matrix skeleton swelled with interstitial fluid. Interactions between the liquid and solid impact the material properties and stress response of the tissue. The objective of this study was to model the movement of liquid during vocal fold vibration and to estimate the volume of liquid accumulation and stress experienced by the tissue near the anterior-posterior midline, where benign lesions are observed to form. A three-dimensional biphasic finite element model of a single vocal fold was built to solve for the liquid velocity, pore pressure, and von Mises stress during and just after vibration using the commercial finite element software COMSOL Multiphysics (Version 4.3a, 2013, Structural Mechanics and Subsurface Flow Modules). Vibration was induced by applying direct load pressures to the subglottal and intraglottal surfaces. Pressure ranges, frequency, and material parameters were chosen based on those reported in the literature. Postprocessing included liquid velocity, pore pressure, and von Mises stress calculations as well as the frequency-stress and amplitude-stress relationships. Resulting time-averaged velocity vectors during vibration indicated liquid movement toward the midline of the fold, as well as upward movement in the inferior-superior direction. Pore pressure and von Misses stresses were higher in this region just after vibration. A linear relationship was found between the amplitude and pore pressure, whereas a nonlinear relationship was found between the frequency and pore pressure. Although this study had certain computational simplifications, it is the first biphasic finite element model to use a realistic geometry and demonstrate the ability to characterize liquid movement due to vibration. Results indicate that there is a significant amount of liquid that accumulates at the midline; however, the role of this accumulation still requires investigation. Further investigation of these mechanical factors may lend insight into the mechanism of benign lesion formation. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Kvit, Anton A.; Devine, Erin E.; Vamos, Andrew C.; Tao, Chao; Jiang, Jack J.
2015-01-01
OBJECTIVE Vocal fold tissue is biphasic and consists of a solid extracellular matric skeleton swelled with interstitial fluid. Interactions between the liquid and solid impact the material properties and stress response of the tissue. The objective of this study was to model the movement of liquid during vocal fold vibration and estimate the volume of liquid accumulation and stress experienced by the tissue near the anterior-posterior midline, where benign lesions are observed to form. METHODS A three-dimensional biphasic finite element model of a single vocal fold was built to solve for the liquid velocity, pore pressure, and von Mises stress during and just after vibration using the commercial finite element software COMSOL Multiphysics (Version 4.3a, 2013, Structural Mechanics and Subsurface Flow Modules). Vibration was induced by applying direct-load pressures to the subglottal and intraglottal surfaces. Pressure ranges, frequency and material parameters were chosen based on those reported in the literature. Post-processing included liquid velocity, pore pressure and von Mises stress calculations, as well as the frequency-stress and amplitude-stress relationships. RESULTS Resulting time-averaged velocity vectors during vibration indicated liquid movement towards the midline of the fold, as upwards movement in the inferior-superior direction. Pore pressure and von Misses stresses were higher in this region just following vibration. A linear relationship was found between the amplitude and pore pressure, while a nonlinear relationship was found between the frequency and pore pressure. CONCLUSIONS While this study had certain computational simplifications, it is the first biphasic finite element model to employ a realistic geometry and demonstrated the ability to characterize liquid movement due to vibration. Results indicate that there is a significant amount of liquid that accumulates at the midline, however the role of this accumulation still requires investigation. Further investigation of these mechanical factors may lend insight into the mechanism of benign lesion formation. PMID:25619469
Booming far: the long-range vocal strategy of a lekking bird.
Cornec, C; Hingrat, Y; Aubin, T; Rybak, F
2017-08-01
The pressures of selection acting on transmission of information by acoustic signals are particularly high in long-distance communication networks. Males of the North African houbara bustard ( Chlamydotis undulata undulata ) produce extremely low-frequency vocalizations called 'booms' as a component of their courtship displays. These displays are performed on sites separated by a distance of on average 550 m, constituting exploded leks. Here, we investigate the acoustic features of booms involved in species-specific identity. We first assessed the modifications of acoustic parameters during boom transmission at long range within the natural habitat of the species, finding that the frequency content of booms was reliably transmitted up to 600 m. Additionally, by testing males' behavioural responses to playbacks of modified signals, we found that the presence of the second harmonic and the frequency modulation are the key parameters for species identification, and also that a sequence of booms elicited stronger responses than a single boom. Thus, the coding-decoding process relies on redundant and propagation-resistant features, making the booms particularly well adapted for the long-range transmission of information between males. Moreover, by experimentally disentangling the presentation of visual and acoustic signals, we showed that during the booming phase of courtship, the two sensory modalities act in synergy. The acoustic component is dominant in the context of intra-sexual competition. While the visual component is not necessary to induce agonistic response, it acts as an amplifier and reduces the time of detection of the signaller. The utilization of these adaptive strategies allows houbara males to maximize the active space of vocalizations emitted in exploded leks.
Chan, Roger W; Siegmund, Thomas; Zhang, Kai
2009-12-01
Accurate characterization of biomechanical characteristics of the vocal fold is critical for understanding the regulation of vocal fundamental frequency (F(0)), which depends on the active control of the intrinsic laryngeal muscles as well as the passive biomechanical response of the vocal fold lamina propria. Specifically, the tissue stress-strain response and viscoelastic properties under cyclic tensile deformation are relevant, when the vocal folds are subjected to length and tension changes due to posturing. This paper describes a constitutive modeling approach quantifying the relationship between vocal fold stress and strain (or stretch), and establishes predictions of F(0) with the string model of phonation based on the constitutive parameters. Results indicated that transient and time-dependent changes in F(0), including global declinations in declarative sentences, as well as local F(0) overshoots and undershoots, can be partially attributed to the time-dependent viscoplastic response of the vocal fold cover.
Vocalization frequency and duration are coded in separate hindbrain nuclei.
Chagnaud, Boris P; Baker, Robert; Bass, Andrew H
2011-06-14
Temporal patterning is an essential feature of neural networks producing precisely timed behaviours such as vocalizations that are widely used in vertebrate social communication. Here we show that intrinsic and network properties of separate hindbrain neuronal populations encode the natural call attributes of frequency and duration in vocal fish. Intracellular structure/function analyses indicate that call duration is encoded by a sustained membrane depolarization in vocal prepacemaker neurons that innervate downstream pacemaker neurons. Pacemaker neurons, in turn, encode call frequency by rhythmic, ultrafast oscillations in their membrane potential. Pharmacological manipulations show prepacemaker activity to be independent of pacemaker function, thus accounting for natural variation in duration which is the predominant feature distinguishing call types. Prepacemaker neurons also innervate key hindbrain auditory nuclei thereby effectively serving as a call-duration corollary discharge. We propose that premotor compartmentalization of neurons coding distinct acoustic attributes is a fundamental trait of hindbrain vocal pattern generators among vertebrates.
Vocalization frequency and duration are coded in separate hindbrain nuclei
Chagnaud, Boris P.; Baker, Robert; Bass, Andrew H.
2011-01-01
Temporal patterning is an essential feature of neural networks producing precisely timed behaviours such as vocalizations that are widely used in vertebrate social communication. Here we show that intrinsic and network properties of separate hindbrain neuronal populations encode the natural call attributes of frequency and duration in vocal fish. Intracellular structure/function analyses indicate that call duration is encoded by a sustained membrane depolarization in vocal prepacemaker neurons that innervate downstream pacemaker neurons. Pacemaker neurons, in turn, encode call frequency by rhythmic, ultrafast oscillations in their membrane potential. Pharmacological manipulations show prepacemaker activity to be independent of pacemaker function, thus accounting for natural variation in duration which is the predominant feature distinguishing call types. Prepacemaker neurons also innervate key hindbrain auditory nuclei thereby effectively serving as a call-duration corollary discharge. We propose that premotor compartmentalization of neurons coding distinct acoustic attributes is a fundamental trait of hindbrain vocal pattern generators among vertebrates. PMID:21673667
NASA Astrophysics Data System (ADS)
Sisakun, Siphan
2000-12-01
The purpose of this study is to explore the ability of four acoustic parameters, mean fundamental frequency, jitter, shimmer, and harmonics-to-noise ratio, to detect vocal fatigue in student singers. The participants are 15 voice students, who perform two distinct tasks, data collection task and vocal fatiguing task. The data collection task includes the sustained vowel /a/, reading a standard passage, and self-rate on a vocal fatigue form. The vocal fatiguing task is the vocal practice of musical scores for a total of 45 minutes. The four acoustic parameters are extracted using the software EZVoicePlus. The data analyses are performed to answer eight research questions. The first four questions relate to correlations of the self-rating scale and each of the four parameters. The next four research questions relate to differences in the parameters over time using one-factor repeated measures analysis of variance (ANOVA). The result yields a proposed acoustic profile of vocal fatigue in student singers. This profile is characterized by increased fundamental frequency; slightly decreased jitter; slightly decreased shimmer; and slightly increased harmonics-to-noise ratio. The proposed profile requires further investigation.
Vocal development and auditory perception in CBA/CaJ mice
NASA Astrophysics Data System (ADS)
Radziwon, Kelly E.
Mice are useful laboratory subjects because of their small size, their modest cost, and the fact that researchers have created many different strains to study a variety of disorders. In particular, researchers have found nearly 100 naturally occurring mouse mutations with hearing impairments. For these reasons, mice have become an important model for studies of human deafness. Although much is known about the genetic makeup and physiology of the laboratory mouse, far less is known about mouse auditory behavior. To fully understand the effects of genetic mutations on hearing, it is necessary to determine the hearing abilities of these mice. Two experiments here examined various aspects of mouse auditory perception using CBA/CaJ mice, a commonly used mouse strain. The frequency difference limens experiment tested the mouse's ability to discriminate one tone from another based solely on the frequency of the tone. The mice had similar thresholds as wild mice and gerbils but needed a larger change in frequency than humans and cats. The second psychoacoustic experiment sought to determine which cue, frequency or duration, was more salient when the mice had to identify various tones. In this identification task, the mice overwhelmingly classified the tones based on frequency instead of duration, suggesting that mice are using frequency when differentiating one mouse vocalization from another. The other two experiments were more naturalistic and involved both auditory perception and mouse vocal production. Interest in mouse vocalizations is growing because of the potential for mice to become a model of human speech disorders. These experiments traced mouse vocal development from infant to adult, and they tested the mouse's preference for various vocalizations. This was the first known study to analyze the vocalizations of individual mice across development. Results showed large variation in calling rates among the three cages of adult mice but results were highly consistent across all infant vocalizations. Although the preference experiment did not reveal significant differences between various mouse vocalizations, suggestions are given for future attempts to identify mouse preferences for auditory stimuli.
Responses of auditory-cortex neurons to structural features of natural sounds.
Nelken, I; Rotman, Y; Bar Yosef, O
1999-01-14
Sound-processing strategies that use the highly non-random structure of natural sounds may confer evolutionary advantage to many species. Auditory processing of natural sounds has been studied almost exclusively in the context of species-specific vocalizations, although these form only a small part of the acoustic biotope. To study the relationships between properties of natural soundscapes and neuronal processing mechanisms in the auditory system, we analysed sound from a range of different environments. Here we show that for many non-animal sounds and background mixtures of animal sounds, energy in different frequency bands is coherently modulated. Co-modulation of different frequency bands in background noise facilitates the detection of tones in noise by humans, a phenomenon known as co-modulation masking release (CMR). We show that co-modulation also improves the ability of auditory-cortex neurons to detect tones in noise, and we propose that this property of auditory neurons may underlie behavioural CMR. This correspondence may represent an adaptation of the auditory system for the use of an attribute of natural sounds to facilitate real-world processing tasks.
Reconstructing the spectrotemporal modulations of real-life sounds from fMRI response patterns
Santoro, Roberta; Moerel, Michelle; De Martino, Federico; Valente, Giancarlo; Ugurbil, Kamil; Yacoub, Essa; Formisano, Elia
2017-01-01
Ethological views of brain functioning suggest that sound representations and computations in the auditory neural system are optimized finely to process and discriminate behaviorally relevant acoustic features and sounds (e.g., spectrotemporal modulations in the songs of zebra finches). Here, we show that modeling of neural sound representations in terms of frequency-specific spectrotemporal modulations enables accurate and specific reconstruction of real-life sounds from high-resolution functional magnetic resonance imaging (fMRI) response patterns in the human auditory cortex. Region-based analyses indicated that response patterns in separate portions of the auditory cortex are informative of distinctive sets of spectrotemporal modulations. Most relevantly, results revealed that in early auditory regions, and progressively more in surrounding regions, temporal modulations in a range relevant for speech analysis (∼2–4 Hz) were reconstructed more faithfully than other temporal modulations. In early auditory regions, this effect was frequency-dependent and only present for lower frequencies (<∼2 kHz), whereas for higher frequencies, reconstruction accuracy was higher for faster temporal modulations. Further analyses suggested that auditory cortical processing optimized for the fine-grained discrimination of speech and vocal sounds underlies this enhanced reconstruction accuracy. In sum, the present study introduces an approach to embed models of neural sound representations in the analysis of fMRI response patterns. Furthermore, it reveals that, in the human brain, even general purpose and fundamental neural processing mechanisms are shaped by the physical features of real-world stimuli that are most relevant for behavior (i.e., speech, voice). PMID:28420788
NASA Astrophysics Data System (ADS)
Volodin, Ilya A.; Volodina, Elena V.; Frey, Roland; Kirilyuk, Vadim E.; Naidenko, Sergey V.
2017-06-01
In neonate ruminants, the acoustic structure of vocalizations may depend on sex, vocal anatomy, hormonal profiles and body mass and on environmental factors. In neonate wild-living Mongolian gazelles Procapra gutturosa, hand-captured during biomedical monitoring in the Daurian steppes at the Russian-Mongolian border, we spectrographically analysed distress calls and measured body mass of 22 individuals (6 males, 16 females). For 20 (5 male, 15 female) of these individuals, serum testosterone levels were also analysed. In addition, we measured relevant dimensions of the vocal apparatus (larynx, vocal folds, vocal tract) in one stillborn male Mongolian gazelle specimen. Neonate distress calls of either sex were high in maximum fundamental frequency (800-900 Hz), but the beginning and minimum fundamental frequencies were significantly lower in males than in females. Body mass was larger in males than in females. The levels of serum testosterone were marginally higher in males. No correlations were found between either body mass or serum testosterone values and any acoustic variable for males and females analysed together or separately. We discuss that the high-frequency calls of neonate Mongolian gazelles are more typical for closed-habitat neonate ruminants, whereas other open-habitat neonate ruminants (goitred gazelle Gazella subgutturosa, saiga antelope Saiga tatarica and reindeer Rangifer tarandus) produce low-frequency (<200 Hz) distress calls. Proximate cause for the high fundamental frequency of distress calls of neonate Mongolian gazelles is their very short, atypical vocal folds (4 mm) compared to the 7-mm vocal folds of neonate goitred gazelles, producing distress calls as low as 120 Hz.
Viscoelastic properties of three vocal-fold injectable biomaterials at low audio frequencies.
Klemuk, Sarah A; Titze, Ingo R
2004-09-01
Previous measurements of viscoelastic properties of Zyderm were to be extended to low audio frequencies, and properties of two other biomaterials not previously measured, thiolated hyaluronic acid (HA-DTPH) and Cymetra, were obtained. Rheologic investigation. Oscillatory shear stress was applied to each sample using a controlled stress rheometer at frequencies between 0.01 and 100 Hz with a parallel plate apparatus. Versuscoelastic moduli were recorded at each frequency. The calculated resonance frequency of the machine and sample were then used to determine the maximum frequency at which reliable data existed. Extrapolation functions were fit to viscoelastic parameters, which predicted the properties up to 1,000 Hz. Frequency trends of Zyderm were similar to those previously reported, whereas magnitudes were different. The elastic moduli logarithmically increased with frequency, whereas dynamic viscosity demonstrated shear thinning, a condition of primary importance for humans to vocalize over a broad frequency range. Previous measurements were extended from 15 Hz up to 74 Hz. Differences in magnitude between a previous study and the present study were attributed to particulate orientation during testing. Cymetra was found to have nearly identical viscoelastic properties to those of bovine collagen, both in magnitude and frequency trend, with reliable measures extending up to 81 Hz. Rheologic properties of the hyaluronic acid gel were the closest match to cadaveric vocal fold mucosa in magnitude and frequency trend. Viscoelastic properties of Cymetra and Zyderm are nearly the same and are significantly greater than those of vocal fold mucosa. HA-DTPH possesses a good viscoelastic match to vocal fold mucosa and may be useful in future lamina propria repair.
Cicadas impact bird communication in a noisy tropical rainforest
Hall, Robert; Ray, William; Beck, Angela; Zook, James
2015-01-01
Many animals communicate through acoustic signaling, and “acoustic space” may be viewed as a limited resource that organisms compete for. If acoustic signals overlap, the information in them is masked, so there should be selection toward strategies that reduce signal overlap. The extent to which animals are able to partition acoustic space in acoustically diverse habitats such as tropical forests is poorly known. Here, we demonstrate that a single cicada species plays a major role in the frequency and timing of acoustic communication in a neotropical wet forest bird community. Using an automated acoustic monitor, we found that cicadas vary the timing of their signals throughout the day and that the frequency range and timing of bird vocalizations closely track these signals. Birds significantly avoid temporal overlap with cicadas by reducing and often shutting down vocalizations at the onset of cicada signals that utilize the same frequency range. When birds do vocalize at the same time as cicadas, the vocalizations primarily occur at nonoverlapping frequencies with cicada signals. Our results greatly improve our understanding of the community dynamics of acoustic signaling and reveal how patterns in biotic noise shape the frequency and timing of bird vocalizations in tropical forests. PMID:26023277
Two-dimensional model of vocal fold vibration for sound synthesis of voice and soprano singing
NASA Astrophysics Data System (ADS)
Adachi, Seiji; Yu, Jason
2005-05-01
Voiced sounds were simulated with a computer model of the vocal fold composed of a single mass vibrating both parallel and perpendicular to the airflow. Similarities with the two-mass model are found in the amplitudes of the glottal area and the glottal volume flow velocity, the variation in the volume flow waveform with the vocal tract shape, and the dependence of the oscillation amplitude upon the average opening area of the glottis, among other similar features. A few dissimilarities are also found in the more symmetric glottal and volume flow waveforms in the rising and falling phases. The major improvement of the present model over the two-mass model is that it yields a smooth transition between oscillations with an inductive load and a capacitive load of the vocal tract with no sudden jumps in the vibration frequency. Self-excitation is possible both below and above the first formant frequency of the vocal tract. By taking advantage of the wider continuous frequency range, the two-dimensional model can successfully be applied to the sound synthesis of a high-pitched soprano singing, where the fundamental frequency sometimes exceeds the first formant frequency. .
The vocal repertoire of Tibetan macaques (Macaca thibetana): A quantitative classification.
Bernstein, Sofia K; Sheeran, Lori K; Wagner, R Steven; Li, Jin-Hua; Koda, Hiroki
2016-09-01
Vocal repertoires are basic and essential components for describing vocal communication in animals. Studying the entire suite of vocal signals aids investigations on the variation of acoustic structure across social contexts, comparisons on the complexity of communication systems across taxa, and in exploration of the evolutionary origins of species-specific vocalizations. Here, we describe the vocal repertoire of the largest species in the macaque genus, Macaca thibetana. We extracted thirty acoustic parameters from call recordings. Post hoc validation through quantitative analyses of the a priori repertoire classified eleven call types: coo, squawk, squeal, noisy scream, growl, bark, compound squeak, leap coo, weeping, modulated tonal scream, and pant. In comparison to the rest of the genus, Tibetan macaques uttered a wider array of vocalizations in the context of copulations. Previous reports did not include modulated tonal screams and pants during harassment of copulatory dyads. Furthermore, in comparison to the rest of the genus, Tibetan macaque females emit acoustically distinct copulation calls. The vocal repertoire of Tibetan macaques contributes to the literature on the emergence of species-specific calls in the genus Macaca with potential insights from social, reproductive, and ecological comparisons across species. Am. J. Primatol. 78:937-949, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Esch, Barbara E; Carr, James E; Michael, Jack
2005-01-01
Many children with autism do not imitate adult vocalizations, an important skill in learning to talk. Pairing adult vocalizations with preferred stimuli has been shown to increase free-operant vocalizations but effects are temporary; thus, direct reinforcement may be necessary to establish durable vocal behaviors. In Experiment 1, directly reinforced echoic responses did not increase following stimulus-stimulus pairings in three children with autism. Similarly, pairings did not increase free-operant vocalizations in Experiment 2, a replication of Miguel et al. (2002). Experiment 3 demonstrated that shaping increased vowel frequency for one participant. Results suggest that variables are yet to be delineated that influence effectiveness of a stimulus-stimulus pairing procedure on vocalization frequency and acquisition of a verbal operant following such pairings. PMID:22477313
A two-layer composite model of the vocal fold lamina propria for fundamental frequency regulation.
Zhang, Kai; Siegmund, Thomas; Chan, Roger W
2007-08-01
The mechanical properties of the vocal fold lamina propria, including the vocal fold cover and the vocal ligament, play an important role in regulating the fundamental frequency of human phonation. This study examines the equilibrium hyperelastic tensile deformation behavior of cover and ligament specimens isolated from excised human larynges. Ogden's hyperelastic model is used to characterize the tensile stress-stretch behaviors at equilibrium. Several statistically significant differences in the mechanical response differentiating cover and ligament, as well as gender are found. Fundamental frequencies are predicted from a string model and a beam model, both accounting for the cover and the ligament. The beam model predicts nonzero F(0) for the unstretched state of the vocal fold. It is demonstrated that bending stiffness significantly contributes to the predicted F(0), with the ligament contributing to a higher F(0), especially in females. Despite the availability of only a small data set, the model predicts an age dependence of F(0) in males in agreement with experimental findings. Accounting for two mechanisms of fundamental frequency regulation--vocal fold posturing (stretching) and extended clamping--brings predicted F(0) close to the lower bound of the human phonatory range. Advantages and limitations of the current model are discussed.
NASA Astrophysics Data System (ADS)
Rendall, Drew; Kollias, Sophie; Ney, Christina; Lloyd, Peter
2005-02-01
Key voice features-fundamental frequency (F0) and formant frequencies-can vary extensively between individuals. Much of the variation can be traced to differences in the size of the larynx and vocal-tract cavities, but whether these differences in turn simply reflect differences in speaker body size (i.e., neutral vocal allometry) remains unclear. Quantitative analyses were therefore undertaken to test the relationship between speaker body size and voice F0 and formant frequencies for human vowels. To test the taxonomic generality of the relationships, the same analyses were conducted on the vowel-like grunts of baboons, whose phylogenetic proximity to humans and similar vocal production biology and voice acoustic patterns recommend them for such comparative research. For adults of both species, males were larger than females and had lower mean voice F0 and formant frequencies. However, beyond this, F0 variation did not track body-size variation between the sexes in either species, nor within sexes in humans. In humans, formant variation correlated significantly with speaker height but only in males and not in females. Implications for general vocal allometry are discussed as are implications for speech origins theories, and challenges to them, related to laryngeal position and vocal tract length. .
Francis, Clinton D.; Ortega, Catherine P.; Cruz, Alexander
2011-01-01
Anthropogenic noise is prevalent across the globe and can exclude birds from otherwise suitable habitat and negatively influence fitness; however, the mechanisms responsible for species' responses to noise are not always clear. One effect of noise is a reduction in effective acoustic communication through acoustic masking, yet some urban songbirds may compensate for masking by noise through altering their songs. Whether this vocal flexibility accounts for species persistence in noisy areas is unknown. Here, we investigated the influence of noise on habitat use and vocal frequency in two suboscine flycatchers using a natural experiment that isolated effects of noise from confounding stimuli common to urban habitats. With increased noise exposure, grey flycatcher (Empidonax wrightii) occupancy declined, but vocal frequency did not change. By contrast, ash-throated flycatcher (Myiarchus cinerascens) occupancy was uninfluenced by noise, but individuals in areas with greater noise amplitudes vocalized at a higher frequency, although the increase (≈200 kHz) may only marginally improve communication and may represent a secondary effect from increased vocal amplitude. Even so, the different flycatcher behavioural responses suggest that signal change may help some species persist in noisy areas and prompt important questions regarding which species will cope with an increasingly noisy world. PMID:21123268
Vampola, Tomáš; Horáček, Jaromír; Laukkanen, Anne-Maria; Švec, Jan G
2015-04-01
Resonance frequencies of the vocal tract have traditionally been modelled using one-dimensional models. These cannot accurately represent the events in the frequency region of the formant cluster around 2.5-4.5 kHz, however. Here, the vocal tract resonance frequencies and their mode shapes are studied using a three-dimensional finite element model obtained from computed tomography measurements of a subject phonating on vowel [a:]. Instead of the traditional five, up to eight resonance frequencies of the vocal tract were found below the prominent antiresonance around 4.7 kHz. The three extra resonances were found to correspond to modes which were axially asymmetric and involved the piriform sinuses, valleculae, and transverse vibrations in the oral cavity. The results therefore suggest that the phenomenon of speaker's and singer's formant clustering may be more complex than originally thought.
Dynamic Vibration Cooperates with Connective Tissue Growth Factor to Modulate Stem Cell Behaviors
Tong, Zhixiang; Zerdoum, Aidan B.; Duncan, Randall L.
2014-01-01
Vocal fold disorders affect 3–9% of the U.S. population. Tissue engineering offers an alternative strategy for vocal fold repair. Successful engineering of vocal fold tissues requires a strategic combination of therapeutic cells, biomimetic scaffolds, and physiologically relevant mechanical and biochemical factors. Specifically, we aim to create a vocal fold-like microenvironment to coax stem cells to adopt the phenotype of vocal fold fibroblasts (VFFs). Herein, high frequency vibratory stimulations and soluble connective tissue growth factor (CTGF) were sequentially introduced to mesenchymal stem cells (MSCs) cultured on a poly(ɛ-caprolactone) (PCL)-derived microfibrous scaffold for a total of 6 days. The initial 3-day vibratory culture resulted in an increased production of hyaluronic acids (HA), tenascin-C (TNC), decorin (DCN), and matrix metalloproteinase-1 (MMP1). The subsequent 3-day CTGF treatment further enhanced the cellular production of TNC and DCN, whereas CTGF treatment alone without the vibratory preconditioning significantly promoted the synthesis of collagen I (Col 1) and sulfated glycosaminoglycans (sGAGs). The highest level of MMP1, TNC, Col III, and DCN production was found for cells being exposed to the combined vibration and CTGF treatment. Noteworthy, the vibration and CTGF elicited a differential stimulatory effect on elastin (ELN), HA synthase 1 (HAS1), and fibroblast-specific protein-1 (FSP-1). The mitogenic activity of CTGF was only elicited in naïve cells without the vibratory preconditioning. The combined treatment had profound, but opposite effects on mitogen-activated protein kinase (MAPK) pathways, Erk1/2 and p38, and the Erk1/2 pathway was critical for the observed mechano-biochemical responses. Collectively, vibratory stresses and CTGF signals cooperatively coaxed MSCs toward a VFF-like phenotype and accelerated the synthesis and remodeling of vocal fold matrices. PMID:24456068
Harbor Seal (Phoca vitulina) Reproductive Advertisement Behavior and the Effects of Vessel Noise
NASA Astrophysics Data System (ADS)
Matthews, Leanna P.
Harbor seals (Phoca vitulina) are a widely distributed pinniped species that mate underwater. Similar to other aquatically mating pinnipeds, male harbor seals produce vocalizations during the breeding season that function in male-male interactions and possibly as an attractant for females. I investigated multiple aspects of these reproductive advertisement displays in a population of harbor seals in Glacier Bay National Park and Preserve, Alaska. First, I looked at vocal production as a function of environmental variables, including season, daylight, and tidal state. Vocalizations were highly seasonal and detection of these vocalizations peaked in June and July, which correspond with the estimated time of breeding. Vocalizations also varied with light, with the lowest probability of detection during the day and the highest probability of detection at night. The high probability of detection corresponded to when females are known to forage. These results are similar to the vocal behavior of previously studied populations. However, unlike previously studied populations, the detection of harbor seal breeding vocalizations did not vary with tidal state. This is likely due to the location of the hydrophone, as it was not near the haul out and depth was therefore not significantly influenced by changes in tidal height. I also investigated the source levels and call parameters of vocalizations, as well as call rate and territoriality. The average source level of harbor seal breeding vocalizations was 144 dB re 1 ?Pa at 1 m and measurements ranged from 129 to 149 dB re 1 ?Pa. Analysis of call parameters indicated that vocalizations of harbor seals in Glacier Bay were similar in duration to other populations, but were much lower in frequency. During the breeding season, there were two discrete calling areas that likely represent two individual males; the average call rate in these display areas was approximately 1 call per minute. The harbor seal breeding season also overlaps with peak tourism in Glacier Bay, and the majority of tourists visit the park on a motorized vessel. Because of this overlap, I investigated the impacts of vessel noise on the vocal behavior of individual males. In the presence of vessel noise, male harbor seals increase the amplitude of their vocalizations, decrease the duration, and increase the minimum frequency. These vocal shifts are similar to studies of noise impacts on other species across taxa, but it is unknown how this could impact the reproductive success of male harbor seals. Finally, I looked at the role of female preference for male vocalizations. Using playbacks of male vocalizations to captive female harbor seals, I found that females have a higher response to vocalizations that correspond to dominant males. Females were less responsive to subordinate male vocalizations, which had a shorter duration and a higher frequency. Given that male harbor seals decrease the duration and increase the frequency of vocalizations in the presence of noise, it is possible that these vocalizations become less attractive in noise.
The Human Voice in Speech and Singing
NASA Astrophysics Data System (ADS)
Lindblom, Björn; Sundberg, Johan
This chapter
The Human Voice in Speech and Singing
NASA Astrophysics Data System (ADS)
Lindblom, Björn; Sundberg, Johan
This chapter describes various aspects of the human voice as a means of communication in speech and singing. From the point of view of function, vocal sounds can be regarded as the end result of a three stage process: (1) the compression of air in the respiratory system, which produces an exhalatory airstream, (2) the vibrating vocal folds' transformation of this air stream to an intermittent or pulsating air stream, which is a complex tone, referred to as the voice source, and (3) the filtering of this complex tone in the vocal tract resonator. The main function of the respiratory system is to generate an overpressure of air under the glottis, or a subglottal pressure. Section 16.1 describes different aspects of the respiratory system of significance to speech and singing, including lung volume ranges, subglottal pressures, and how this pressure is affected by the ever-varying recoil forces. The complex tone generated when the air stream from the lungs passes the vibrating vocal folds can be varied in at least three dimensions: fundamental frequency, amplitude and spectrum. Section 16.2 describes how these properties of the voice source are affected by the subglottal pressure, the length and stiffness of the vocal folds and how firmly the vocal folds are adducted. Section 16.3 gives an account of the vocal tract filter, how its form determines the frequencies of its resonances, and Sect. 16.4 gives an account for how these resonance frequencies or formants shape the vocal sounds by imposing spectrum peaks separated by spectrum valleys, and how the frequencies of these peaks determine vowel and voice qualities. The remaining sections of the chapter describe various aspects of the acoustic signals used for vocal communication in speech and singing. The syllable structure is discussed in Sect. 16.5, the closely related aspects of rhythmicity and timing in speech and singing is described in Sect. 16.6, and pitch and rhythm aspects in Sect. 16.7. The impressive control of all these acoustic characteristics of vocal signals is discussed in Sect. 16.8, while Sect. 16.9 considers expressive aspects of vocal communication.
Blades, Brittany; Parks, Susan E.
2018-01-01
During the breeding season, male harbor seals (Phoca vitulina) make underwater acoustic displays using vocalizations known as roars. These roars have been shown to function in territory establishment in some breeding areas and have been hypothesized to be important for female choice, but the function of these sounds remains unresolved. This study consisted of a series of playback experiments in which captive female harbor seals were exposed to recordings of male roars to determine if females respond to recordings of male vocalizations and whether or not they respond differently to roars from categories with different acoustic characteristics. The categories included roars with characteristics of dominant males (longest duration, lowest frequency), subordinate males (shortest duration, highest frequency), combinations of call parameters from dominant and subordinate males (long duration, high frequency and short duration, low frequency), and control playbacks of water noise and water noise with tonal signals in the same frequency range as male signals. Results indicate that overall females have a significantly higher level of response to playbacks that imitate male vocalizations when compared to control playbacks of water noise. Specifically, there was a higher level of response to playbacks representing dominant male vocalization when compared to the control playbacks. For most individuals, there was a greater response to playbacks representing dominant male vocalizations compared to playbacks representing subordinate male vocalizations; however, there was no statistical difference between those two playback types. Additionally, there was no difference between the playbacks of call parameter combinations and the controls. Investigating female preference for male harbor seal vocalizations is a critical step in understanding the harbor seal mating system and further studies expanding on this captive study will help shed light on this important issue. PMID:29607261
Matthews, Leanna P; Blades, Brittany; Parks, Susan E
2018-01-01
During the breeding season, male harbor seals ( Phoca vitulina ) make underwater acoustic displays using vocalizations known as roars. These roars have been shown to function in territory establishment in some breeding areas and have been hypothesized to be important for female choice, but the function of these sounds remains unresolved. This study consisted of a series of playback experiments in which captive female harbor seals were exposed to recordings of male roars to determine if females respond to recordings of male vocalizations and whether or not they respond differently to roars from categories with different acoustic characteristics. The categories included roars with characteristics of dominant males (longest duration, lowest frequency), subordinate males (shortest duration, highest frequency), combinations of call parameters from dominant and subordinate males (long duration, high frequency and short duration, low frequency), and control playbacks of water noise and water noise with tonal signals in the same frequency range as male signals. Results indicate that overall females have a significantly higher level of response to playbacks that imitate male vocalizations when compared to control playbacks of water noise. Specifically, there was a higher level of response to playbacks representing dominant male vocalization when compared to the control playbacks. For most individuals, there was a greater response to playbacks representing dominant male vocalizations compared to playbacks representing subordinate male vocalizations; however, there was no statistical difference between those two playback types. Additionally, there was no difference between the playbacks of call parameter combinations and the controls. Investigating female preference for male harbor seal vocalizations is a critical step in understanding the harbor seal mating system and further studies expanding on this captive study will help shed light on this important issue.
Quantifying the Effects of Propagation on Classification of Cetacean Vocalizations
2014-09-30
vocalizations from bowhead and humpback whales , and measuring the received signals at a variety of ranges [6]. The transmitted and received signals will be...and humpback whale calls. Feature Description Duration Global mean subband decay time Local maximum subband decay time Frequency of... humpback vocalizations (extending down to approximately 50 Hz) as well as the higher frequencies used for the propagation experiments (1–4 kHz)1. • It
Bjørgesaeter, Anders; Ugland, Karl Inne; Bjørge, Arne
2004-10-01
The male harbor seal (Phoca vitulina) produces broadband nonharmonic vocalizations underwater during the breeding season. In total, 120 vocalizations from six colonies were analyzed to provide a description of the acoustic structure and for the presence of geographic variation. The complex harbor seal vocalizations may be described by how the frequency bandwidth varies over time. An algorithm that identifies the boundaries between noise and signal from digital spectrograms was developed in order to extract a frequency bandwidth contour. The contours were used as inputs for multivariate analysis. The vocalizations' sound types (e.g., pulsed sound, whistle, and broadband nonharmonic sound) were determined by comparing the vocalizations' spectrographic representations with sound waves produced by known sound sources. Comparison between colonies revealed differences in the frequency contours, as well as some geographical variation in use of sound types. The vocal differences may reflect a limited exchange of individuals between the six colonies due to long distances and strong site fidelity. Geographically different vocal repertoires have potential for identifying discrete breeding colonies of harbor seals, but more information is needed on the nature and extent of early movements of young, the degree of learning, and the stability of the vocal repertoire. A characteristic feature of many vocalizations in this study was the presence of tonal-like introductory phrases that fit into the categories pulsed sound and whistles. The functions of these phrases are unknown but may be important in distance perception and localization of the sound source. The potential behavioral consequences of the observed variability may be indicative of adaptations to different environmental properties influencing determination of distance and direction and plausible different male mating tactics.
D'haeseleer, Evelien; Claeys, Sofie; Bettens, Kim; Leemans, Laura; Van Calster, Ann-Sophie; Van Damme, Nina; Thijs, Zoë; Daelman, Julie; Leyns, Clara; Van Lierde, Kristiane
2017-07-01
The purpose of this study was to measure the objective and subjective vocal quality in women aged between 60 and 75 years. Secondly, the impact of a teaching or singing career on the vocal quality was investigated by comparing the vocal quality of retired women with different careers. This is a case-control study. Seventy-three retired women between 60 and 75 years (mean age: 67 years, standard deviation: 4.49) participated in the study and were divided into three groups: women with a teaching career (n = 21), choir singers with a singing career (n = 12), and women with a non-vocal career (n = 40). All subjects underwent the same assessment protocol consisting of objective (aerodynamic, maximum performance, vocal range, acoustic measurements, and the Dysphonia Severity Index) and subjective (the Voice Handicap Index, auditory-perceptual evaluations by three listeners) voice measurements. In all three groups, objective and perceptual voice analysis showed a mild dysphonia. No differences in the Dysphonia Severity Index were found between the three groups. The voices of choir singers with a singing career were perceived significantly less rough than voices of the women with a non-vocal career. Additionally, the lowest frequency of the frequency range was significantly lower in the retired teachers and choir singers than in the controls. The results of this study prudently suggest that a singing or a teaching career compared with a non-vocal career has a positive impact on the vocal frequency range, and that singing has a positive impact on the perceptual vocal quality of the older female voice. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Simola, Nicola; Paci, Elena; Serra, Marcello; Costa, Giulia; Morelli, Micaela
2018-01-01
Rats emit 50-kHz ultrasonic vocalizations (USVs) to communicate positive emotional states, and these USVs are increasingly being investigated in preclinical studies on reward and motivation. Although it is the activation of dopamine receptors that initiates the emission of 50-kHz USVs, non-dopaminergic mechanisms may modulate calling in the 50 kHz frequency band. To further elucidate these mechanisms, the present study investigated whether the pharmacological manipulation of glucocorticoid signaling influenced calling. Rats were administered corticosterone (1-5 mg/kg, s.c.), the glucocorticoid receptor antagonist mifepristone (40 or 100 mg/kg, s.c.), or the corticosterone synthesis inhibitor metyrapone (50 or 100 mg/kg, i.p.). The effects of these drugs on calling initiation and on calling recorded during nonaggressive social contacts or after the administration of amphetamine (0.25 or 1 mg/kg, i.p.) were then evaluated. Corticosterone failed to initiate the emission of 50-kHz USVs and did not influence pro-social and amphetamine-stimulated calling. Similarly, mifepristone and metyrapone did not initiate calling. However, metyrapone suppressed pro-social calling and calling stimulated by a moderate dose (1 mg/kg, i.p.) of amphetamine. Conversely, mifepristone attenuated calling stimulated by a low (0.25 mg/kg, i.p.), but not moderate (1 mg/kg, i.p.), dose of amphetamine and had no influence on pro-social calling. The present results demonstrate that glucocorticoid signaling modulates calling in the 50 kHz frequency band only in certain conditions and suggest that mechanisms different from the inhibition of corticosterone synthesis may participate in the suppression of calling by metyrapone. © The Author 2017. Published by Oxford University Press on behalf of CINP.
Gender and vocal production mode discrimination using the high frequencies for speech and singing
Monson, Brian B.; Lotto, Andrew J.; Story, Brad H.
2014-01-01
Humans routinely produce acoustical energy at frequencies above 6 kHz during vocalization, but this frequency range is often not represented in communication devices and speech perception research. Recent advancements toward high-definition (HD) voice and extended bandwidth hearing aids have increased the interest in the high frequencies. The potential perceptual information provided by high-frequency energy (HFE) is not well characterized. We found that humans can accomplish tasks of gender discrimination and vocal production mode discrimination (speech vs. singing) when presented with acoustic stimuli containing only HFE at both amplified and normal levels. Performance in these tasks was robust in the presence of low-frequency masking noise. No substantial learning effect was observed. Listeners also were able to identify the sung and spoken text (excerpts from “The Star-Spangled Banner”) with very few exposures. These results add to the increasing evidence that the high frequencies provide at least redundant information about the vocal signal, suggesting that its representation in communication devices (e.g., cell phones, hearing aids, and cochlear implants) and speech/voice synthesizers could improve these devices and benefit normal-hearing and hearing-impaired listeners. PMID:25400613
Analysis of ultrasonic vocalizations emitted by infant rodents.
Branchi, Igor; Santucci, Daniela; Alleva, Enrico
2006-01-01
Altricial rodent pups emit ultrasonic vocalizations (USVs), which are whistle-like sounds with frequencies between 30 and 90 kHz. These signals play an important communicative role in mother-offspring interaction because they elicit in the dam a prompt response as concerning care-giving behaviors. To investigate neurobehavioral development, the analysis of the number of USVs presents several advantages: (1) USVs are one of the few responses produced by very young rodents that can be quantitatively analyzed and elicited by quantifiable stimuli; (2) USV emission follows a clear ontogenetic profile from birth to the second to third week of life, thus allowing longitudinal analysis during very early post-natal ontogeny. The reported role played by several receptor agonists and antagonists in modulating the USV rate makes this measure highly informative in investigating the effects of toxicants and, more generally, psychoactive compounds on the development of selected brain systems.
A memory like a female Fur Seal: long-lasting recognition of pup's voice by mothers.
Mathevon, Nicolas; Charrier, Isabelle; Aubin, Thierry
2004-06-01
In colonial mammals like fur seals, mutual vocal recognition between mothers and their pup is of primary importance for breeding success. Females alternate feeding sea-trips with suckling periods on land, and when coming back from the ocean, they have to vocally find their offspring among numerous similar-looking pups. Young fur seals emit a 'mother-attraction call' that presents individual characteristics. In this paper, we review the perceptual process of pup's call recognition by Subantarctic Fur Seal Arctocephalus tropicalis mothers. To identify their progeny, females rely on the frequency modulation pattern and spectral features of this call. As the acoustic characteristics of a pup's call change throughout the lactation period due to the growing process, mothers have thus to refine their memorization of their pup's voice. Field experiments show that female Fur Seals are able to retain all the successive versions of their pup's call.
Differences between vocalization evoked by social stimuli in feral cats and house cats.
Yeon, Seong C; Kim, Young K; Park, Se J; Lee, Scott S; Lee, Seung Y; Suh, Euy H; Houpt, Katherine A; Chang, Hong H; Lee, Hee C; Yang, Byung G; Lee, Hyo J
2011-06-01
To investigate how socialization can affect the types and characteristics of vocalization produced by cats, feral cats (n=25) and house cats (n=13) were used as subjects, allowing a comparison between cats socialized to people and non-socialized cats. To record vocalization and assess the cats' responses to behavioural stimuli, five test situations were used: approach by a familiar caretaker, by a threatening stranger, by a large doll, by a stranger with a dog and by a stranger with a cat. Feral cats showed extremely aggressive and defensive behaviour in most test situations, and produced higher call rates than those of house cats in the test situations, which could be attributed to less socialization to other animals and to more sensitivity to fearful situations. Differences were observed in the acoustic parameters of feral cats in comparison to those of house cats. The feral cat produced significantly higher frequency in fundamental frequency, peak frequency, 1st quartile frequency, 3rd quartile frequency of growls and hisses in agonistic test situations. In contrast to the growls and hisses, in meow, all acoustic parameters like fundamental frequency, first formant, peak frequency, 1st quartile frequency, and 3rd quartile frequency of house cats were of significantly higher frequency than those of feral cats. Also, house cats produced calls of significantly shorter in duration than feral cats in agonistic test situations. These results support the conclusion that a lack of socialization may affect usage of types of vocalizations, and the vocal characteristics, so that the proper socialization of cat may be essential to be a suitable companion house cat. Copyright © 2011 Elsevier B.V. All rights reserved.
Babies in traffic: infant vocalizations and listener sex modulate auditory motion perception.
Neuhoff, John G; Hamilton, Grace R; Gittleson, Amanda L; Mejia, Adolfo
2014-04-01
Infant vocalizations and "looming sounds" are classes of environmental stimuli that are critically important to survival but can have dramatically different emotional valences. Here, we simultaneously presented listeners with a stationary infant vocalization and a 3D virtual looming tone for which listeners made auditory time-to-arrival judgments. Negatively valenced infant cries produced more cautious (anticipatory) estimates of auditory arrival time of the tone over a no-vocalization control. Positively valenced laughs had the opposite effect, and across all conditions, men showed smaller anticipatory biases than women. In Experiment 2, vocalization-matched vocoded noise stimuli did not influence concurrent auditory time-to-arrival estimates compared with a control condition. In Experiment 3, listeners estimated the egocentric distance of a looming tone that stopped before arriving. For distant stopping points, women estimated the stopping point as closer when the tone was presented with an infant cry than when it was presented with a laugh. For near stopping points, women showed no differential effect of vocalization type. Men did not show differential effects of vocalization type at either distance. Our results support the idea that both the sex of the listener and the emotional valence of infant vocalizations can influence auditory motion perception and can modulate motor responses to other behaviorally relevant environmental sounds. We also find support for previous work that shows sex differences in emotion processing are diminished under conditions of higher stress.
The perceptual features of vocal fatigue as self-reported by a group of actors and singers.
Kitch, J A; Oates, J
1994-09-01
Performers (10 actors/10 singers) rated via a self-report questionnaire the severity of their voice-related changes when vocally fatigued. Similar frequency patterns and perceptual features of vocal fatigue were found across subjects. Actors rated "power" aspects (e.g., voice projection) and singers rated vocal dynamic aspects (e.g., pitch range) of their voices as most affected when vocally fatigued. Vocal fatigue was evidenced by changes in kinesthetic/proprioceptive sensations and vocal dynamics. The causes and context of vocal fatigue were vocal misuse, being "run down," high performance demands, and using high pitch/volume levels. Further research is needed to delineate the perceptual features of "normal" levels of vocal fatigue and its possible causes.
Kobayasi, Kohta I; Hiryu, Shizuko; Shimozawa, Ryota; Riquimaroux, Hiroshi
2012-11-01
Although much is known about the echolocation of horseshoe bats (Rhinolophus spp.), little is known about the characteristics and function of their communication calls. This study focused on a stereotyped behavior of a bat approaching a companion animal in the colony, and examined their interaction and vocalization during this behavior. The bats emit echolocation-like vocalizations when approaching each other and these vocalizations contain a "buildup" pulse sequence, in which the frequency of the pulse increases gradually to normal echolocation pulse frequencies. The results suggest that the echolocation-like pulses serve an important role in communication within the colony.
Modal response of a computational vocal fold model with a substrate layer of adipose tissue.
Jones, Cameron L; Achuthan, Ajit; Erath, Byron D
2015-02-01
This study demonstrates the effect of a substrate layer of adipose tissue on the modal response of the vocal folds, and hence, on the mechanics of voice production. Modal analysis is performed on the vocal fold structure with a lateral layer of adipose tissue. A finite element model is employed, and the first six mode shapes and modal frequencies are studied. The results show significant changes in modal frequencies and substantial variation in mode shapes depending on the strain rate of the adipose tissue. These findings highlight the importance of considering adipose tissue in computational vocal fold modeling.
HUMAN SPEECH: A RESTRICTED USE OF THE MAMMALIAN LARYNX
Titze, Ingo R.
2016-01-01
Purpose Speech has been hailed as unique to human evolution. While the inventory of distinct sounds producible with vocal tract articulators is a great advantage in human oral communication, it is argued here that the larynx as a sound source in speech is limited in its range and capability because a low fundamental frequency is ideal for phonemic intelligibility and source-filter independence. Method Four existing data sets were combined to make an argument regarding exclusive use of the larynx for speech: (1) range of fundamental frequency, (2) laryngeal muscle activation, (3) vocal fold length in relation to sarcomere length of the major laryngeal muscles, and (4) vocal fold morphological development. Results Limited data support the notion that speech tends to produce a contracture of the larynx. The morphological design of the human vocal folds, like that of primates and other mammals, is optimized for vocal communication over distances for which higher fundamental frequency, higher intensity, and fewer unvoiced segments are utilized than in conversational speech. Conclusion The positive message is that raising one’s voice to call, shout, or sing, or executing pitch glides to stretch the vocal folds, can counteract this trend toward a contracted state. PMID:27397113
On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
Lammert, Adam C.; Narayanan, Shrikanth S.
2015-01-01
Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework for considering estimation methods is developed from the basic principles of vocal tract acoustics, and an estimation method is proposed that follows naturally from this framework. The proposed method is evaluated using acoustic characteristics of simulated vocal tracts ranging from 14 to 19 cm in length, as well as real-time magnetic resonance imaging data with synchronous audio from five speakers whose vocal tracts range from 14.5 to 18.0 cm in length. Evaluations show improvements in accuracy over previously proposed methods, with 0.631 and 1.277 cm root mean square error on simulated and human speech data, respectively. Empirical results show that the effectiveness of the proposed method is based on emphasizing higher formant frequencies, which seem less affected by speech articulation. Theoretical predictions of formant sensitivity reinforce this empirical finding. Moreover, theoretical insights are explained regarding the reason for differences in formant sensitivity. PMID:26177102
RIEDE, TOBIAS
2014-01-01
Rodents produce highly variable ultrasound whistles as communication signals unlike many other mammals, who employ flow-induced vocal fold oscillations to produce sound. The role of larynx muscles in controlling sound features across different call types in ultrasound vocalization (USV) was investigated using laryngeal muscle electromyographic (EMG) activity, subglottal pressure measurements and vocal sound output in awake and spontaneously behaving Sprague–Dawley rats. Results support the hypothesis that glottal shape determines fundamental frequency. EMG activities of thyroarytenoid and cricothyroid muscles were aligned with call duration. EMG intensity increased with fundamental frequency. Phasic activities of both muscles were aligned with fast changing fundamental frequency contours, for example in trills. Activities of the sternothyroid and sternohyoid muscles, two muscles involved in vocal production in other mammals, are not critical for the production of rat USV. To test how stereotypic laryngeal and respiratory activity are across call types and individuals, sets of ten EMG and subglottal pressure parameters were measured in six different call types from six rats. Using discriminant function analysis, on average 80% of parameter sets were correctly assigned to their respective call type. This was significantly higher than the chance level. Since fundamental frequency features of USV are tightly associated with stereotypic activity of intrinsic laryngeal muscles and muscles contributing to build-up of subglottal pressure, USV provide insight into the neurophysiological control of peripheral vocal motor patterns. PMID:23423862
Elie, Julie E.; Theunissen, Frédéric E.
2018-01-01
Although a universal code for the acoustic features of animal vocal communication calls may not exist, the thorough analysis of the distinctive acoustical features of vocalization categories is important not only to decipher the acoustical code for a specific species but also to understand the evolution of communication signals and the mechanisms used to produce and understand them. Here, we recorded more than 8,000 examples of almost all the vocalizations of the domesticated zebra finch, Taeniopygia guttata: vocalizations produced to establish contact, to form and maintain pair bonds, to sound an alarm, to communicate distress or to advertise hunger or aggressive intents. We characterized each vocalization type using complete representations that avoided any a priori assumptions on the acoustic code, as well as classical bioacoustics measures that could provide more intuitive interpretations. We then used these acoustical features to rigorously determine the potential information-bearing acoustical features for each vocalization type using both a novel regularized classifier and an unsupervised clustering algorithm. Vocalization categories are discriminated by the shape of their frequency spectrum and by their pitch saliency (noisy to tonal vocalizations) but not particularly by their fundamental frequency. Notably, the spectral shape of zebra finch vocalizations contains peaks or formants that vary systematically across categories and that would be generated by active control of both the vocal organ (source) and the upper vocal tract (filter). PMID:26581377
Stuttering: A novel bullfrog vocalization
NASA Astrophysics Data System (ADS)
Simmons, Andrea; Suggs, Dianne
2004-05-01
The advertisement call of male bullfrogs (Rana catesbeiana) consists of a series of individual croaks, each of which contains multiple harmonics with a missing or attenuated fundamental frequency of approximately 100 Hz. The envelope of individual croaks has typically been represented in the literature as smooth and unmodulated. From an analysis of 5251 advertisement calls from 17 different choruses over two mating seasons, we show that males add an extra modulation (around 4 Hz) to the envelope of individual croaks, following specific rules. We term these extra modulations stutters. Neither single croak calls nor the first croak in multiple croak calls contains stutters. When stuttering begins, it does so with a croak containing a single stutter, and the number of stutters increases linearly (plus or minus 1 stutter, up to 4 stutters) with the number of croaks. This pattern is stable across individual males (N=10). Playback experiments reveal that vocal responses to stuttered and nonstuttered calls vary with proximity to the stimulus. Close males respond with nonstuttered calls, while far males respond with stuttered calls. The data suggest that nonstuttered calls are used for aggressive or territorial purposes, while stuttered calls are used to attract females.
Kobayasi, Kohta I.; Hage, Steffen R.; Berquist, Sean; Feng, Jiang; Zhang, Shuyi; Metzner, Walter
2012-01-01
Mammalian vocalizations exhibit large variations in their spectrotemporal features, although it is still largely unknown which result from intrinsic biomechanical properties of the larynx and which are under direct neuromuscular control. Here we show that mere changes in laryngeal air flow yield several non-linear effects on sound production, in an isolated larynx preparation from horseshoe bats. Most notably, there are sudden jumps between two frequency bands used for either echolocation or communication in natural vocalizations. These jumps resemble changes in “registers” as in yodelling. In contrast, simulated contractions of the main larynx muscle produce linear frequency changes, but are limited to echolocation or communication frequencies. Only by combining non-linear and linear properties can this larynx therefore produce sounds covering the entire frequency range of natural calls. This may give behavioural meaning to yodelling-like vocal behaviour and reshape our thinking about how the brain controls the multitude of spectral vocal features in mammals. PMID:23149729
Sensory-motor interactions for vocal pitch monitoring in non-primary human auditory cortex.
Greenlee, Jeremy D W; Behroozmand, Roozbeh; Larson, Charles R; Jackson, Adam W; Chen, Fangxiang; Hansen, Daniel R; Oya, Hiroyuki; Kawasaki, Hiroto; Howard, Matthew A
2013-01-01
The neural mechanisms underlying processing of auditory feedback during self-vocalization are poorly understood. One technique used to study the role of auditory feedback involves shifting the pitch of the feedback that a speaker receives, known as pitch-shifted feedback. We utilized a pitch shift self-vocalization and playback paradigm to investigate the underlying neural mechanisms of audio-vocal interaction. High-resolution electrocorticography (ECoG) signals were recorded directly from auditory cortex of 10 human subjects while they vocalized and received brief downward (-100 cents) pitch perturbations in their voice auditory feedback (speaking task). ECoG was also recorded when subjects passively listened to playback of their own pitch-shifted vocalizations. Feedback pitch perturbations elicited average evoked potential (AEP) and event-related band power (ERBP) responses, primarily in the high gamma (70-150 Hz) range, in focal areas of non-primary auditory cortex on superior temporal gyrus (STG). The AEPs and high gamma responses were both modulated by speaking compared with playback in a subset of STG contacts. From these contacts, a majority showed significant enhancement of high gamma power and AEP responses during speaking while the remaining contacts showed attenuated response amplitudes. The speaking-induced enhancement effect suggests that engaging the vocal motor system can modulate auditory cortical processing of self-produced sounds in such a way as to increase neural sensitivity for feedback pitch error detection. It is likely that mechanisms such as efference copies may be involved in this process, and modulation of AEP and high gamma responses imply that such modulatory effects may affect different cortical generators within distinctive functional networks that drive voice production and control.
Sensory-Motor Interactions for Vocal Pitch Monitoring in Non-Primary Human Auditory Cortex
Larson, Charles R.; Jackson, Adam W.; Chen, Fangxiang; Hansen, Daniel R.; Oya, Hiroyuki; Kawasaki, Hiroto; Howard, Matthew A.
2013-01-01
The neural mechanisms underlying processing of auditory feedback during self-vocalization are poorly understood. One technique used to study the role of auditory feedback involves shifting the pitch of the feedback that a speaker receives, known as pitch-shifted feedback. We utilized a pitch shift self-vocalization and playback paradigm to investigate the underlying neural mechanisms of audio-vocal interaction. High-resolution electrocorticography (ECoG) signals were recorded directly from auditory cortex of 10 human subjects while they vocalized and received brief downward (−100 cents) pitch perturbations in their voice auditory feedback (speaking task). ECoG was also recorded when subjects passively listened to playback of their own pitch-shifted vocalizations. Feedback pitch perturbations elicited average evoked potential (AEP) and event-related band power (ERBP) responses, primarily in the high gamma (70–150 Hz) range, in focal areas of non-primary auditory cortex on superior temporal gyrus (STG). The AEPs and high gamma responses were both modulated by speaking compared with playback in a subset of STG contacts. From these contacts, a majority showed significant enhancement of high gamma power and AEP responses during speaking while the remaining contacts showed attenuated response amplitudes. The speaking-induced enhancement effect suggests that engaging the vocal motor system can modulate auditory cortical processing of self-produced sounds in such a way as to increase neural sensitivity for feedback pitch error detection. It is likely that mechanisms such as efference copies may be involved in this process, and modulation of AEP and high gamma responses imply that such modulatory effects may affect different cortical generators within distinctive functional networks that drive voice production and control. PMID:23577157
Temperature-dependent regulation of vocal pattern generator.
Yamaguchi, Ayako; Gooler, David; Herrold, Amy; Patel, Shailja; Pong, Winnie W
2008-12-01
Vocalizations of Xenopus laevis are generated by central pattern generators (CPGs). The advertisement call of male X. laevis is a complex biphasic motor rhythm consisting of fast and slow trills (a train of clicks). We found that the trill rate of these advertisement calls is sensitive to temperature and that this rate modification of the vocal rhythms originates in the central pattern generators. In vivo the rates of fast and slow trills increased linearly with an increase in temperature. In vitro a similar linear relation between temperature and compound action potential frequency in the laryngeal nerve was found when fictive advertisement calls were evoked in the isolated brain. Temperature did not limit the contractile properties of laryngeal muscles within the frequency range of vocalizations. We next took advantage of the temperature sensitivity of the vocal CPG in vitro to localize the source of the vocal rhythms. We focused on the dorsal tegmental area of the medulla (DTAM), a brain stem nucleus that is essential for vocal production. We found that bilateral cooling of DTAM reduced both fast and slow trill rates. Thus we conclude that DTAM is a source of biphasic vocal rhythms.
Biomechanical effects of hydration in vocal fold tissues.
Chan, Roger W; Tayama, Niro
2002-05-01
It has often been hypothesized, with little empirical support, that vocal fold hydration affects voice production by mediating changes in vocal fold tissue rheology. To test this hypothesis, we attempted in this study to quantify the effects of hydration on the viscoelastic shear properties of vocal fold tissues in vitro. Osmotic changes in hydration (dehydration and rehydration) of 5 excised canine larynges were induced by sequential incubation of the tissues in isotonic, hypertonic, and hypotonic solutions. Elastic shear modulus (G'), dynamic viscosity eta' and the damping ratio zeta of the vocal fold mucosa (lamina propria) were measured as a function of frequency (0.01 to 15 Hz) with a torsional rheometer. Vocal fold tissue stiffness (G') and viscosity (eta) increased significantly (by 4 to 7 times) with the osmotically induced dehydration, whereas they decreased by 22% to 38% on the induced rehydration. Damping ratio (zeta) also increased with dehydration and decreased with rehydration, but the detected differences were not statistically significant at all frequencies. These findings support the long-standing hypothesis that hydration affects vocal fold vibration by altering tissue rheologic (or viscoelastic) properties. Our results demonstrated the biomechanical importance of hydration in vocal fold tissues and suggested that hydration approaches may potentially improve the biomechanics of phonation in vocal fold lesions involving disordered fluid balance.
Chan, Roger W.
2018-01-01
Viscoelastic shear properties of human vocal fold tissues were previously quantified by the shear moduli (G′ and G″). Yet these small-strain linear measures were unable to describe any nonlinear tissue behavior. This study attempted to characterize the nonlinear viscoelastic response of the vocal fold lamina propria under large-amplitude oscillatory shear (LAOS) with a stress decomposition approach. Human vocal fold cover and vocal ligament specimens from eight subjects were subjected to LAOS rheometric testing with a simple-shear rheometer. The empirical total stress response was decomposed into elastic and viscous stress components, based on odd-integer harmonic decomposition approach with Fourier transform. Nonlinear viscoelastic measures derived from the decomposition were plotted in Pipkin space and as rheological fingerprints to observe the onset of nonlinearity and the type of nonlinear behavior. Results showed that both the vocal fold cover and the vocal ligament experienced intercycle strain softening, intracycle strain stiffening, as well as shear thinning both intercycle and intracycle. The vocal ligament appeared to demonstrate an earlier onset of nonlinearity at phonatory frequencies, and higher sensitivity to changes in frequency and strain. In summary, the stress decomposition approach provided much better insights into the nonlinear viscoelastic behavior of the vocal fold lamina propria than the traditional linear measures. PMID:29780189
Chan, Roger W
2018-05-01
Viscoelastic shear properties of human vocal fold tissues were previously quantified by the shear moduli ( G' and G″ ). Yet these small-strain linear measures were unable to describe any nonlinear tissue behavior. This study attempted to characterize the nonlinear viscoelastic response of the vocal fold lamina propria under large-amplitude oscillatory shear (LAOS) with a stress decomposition approach. Human vocal fold cover and vocal ligament specimens from eight subjects were subjected to LAOS rheometric testing with a simple-shear rheometer. The empirical total stress response was decomposed into elastic and viscous stress components, based on odd-integer harmonic decomposition approach with Fourier transform. Nonlinear viscoelastic measures derived from the decomposition were plotted in Pipkin space and as rheological fingerprints to observe the onset of nonlinearity and the type of nonlinear behavior. Results showed that both the vocal fold cover and the vocal ligament experienced intercycle strain softening, intracycle strain stiffening, as well as shear thinning both intercycle and intracycle. The vocal ligament appeared to demonstrate an earlier onset of nonlinearity at phonatory frequencies, and higher sensitivity to changes in frequency and strain. In summary, the stress decomposition approach provided much better insights into the nonlinear viscoelastic behavior of the vocal fold lamina propria than the traditional linear measures.
Szabo Portela, Annika; Granqvist, Svante; Ternström, Sten; Södersten, Maria
2018-01-01
This study aimed to assess vocal behavior in women with voice-intensive occupations to investigate differences between patients and controls and between work and leisure conditions with environmental noise level as an experimental factor. Patients with work-related voice disorders, 10 with phonasthenia and 10 with vocal nodules, were matched regarding age, profession, and workplace with 20 vocally healthy colleagues. The sound pressure level of environmental noise and the speakers' voice, fundamental frequency, and phonation ratio were registered from morning to night during 1 week with a voice accumulator. Voice data were assessed in low (≤55 dBA), moderate, and high (>70 dBA) environmental noise levels. The average environmental noise level was significantly higher during the work condition for patients with vocal nodules (73.9 dBA) and their controls (73.0 dBA) compared with patients with phonasthenia (68.3 dBA) and their controls (67.1 dBA). The average voice level and the fundamental frequency were also significantly higher during work for the patients with vocal nodules and their controls. During the leisure condition, there were no significant differences in average noise and voice level nor fundamental frequency between the groups. The patients with vocal nodules and their controls spent significantly more time and used their voices significantly more in high-environmental noise levels. High noise levels during work and demands from the occupation impact vocal behavior. Thus, assessment of voice ergonomics should be part of the work environmental management. To reduce environmental noise levels is important to improve voice ergonomic conditions in communication-intensive and vocally demanding workplaces. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
A versatile pitch tracking algorithm: from human speech to killer whale vocalizations.
Shapiro, Ari Daniel; Wang, Chao
2009-07-01
In this article, a pitch tracking algorithm [named discrete logarithmic Fourier transformation-pitch detection algorithm (DLFT-PDA)], originally designed for human telephone speech, was modified for killer whale vocalizations. The multiple frequency components of some of these vocalizations demand a spectral (rather than temporal) approach to pitch tracking. The DLFT-PDA algorithm derives reliable estimations of pitch and the temporal change of pitch from the harmonic structure of the vocal signal. Scores from both estimations are combined in a dynamic programming search to find a smooth pitch track. The algorithm is capable of tracking killer whale calls that contain simultaneous low and high frequency components and compares favorably across most signal to noise ratio ranges to the peak-picking and sidewinder algorithms that have been used for tracking killer whale vocalizations previously.
Murugan, Malavika; Harward, Stephen; Scharff, Constance; Mooney, Richard
2013-12-18
Mutations of the FOXP2 gene impair speech and language development in humans and shRNA-mediated suppression of the avian ortholog FoxP2 disrupts song learning in juvenile zebra finches. How diminished FoxP2 levels affect vocal control and alter the function of neural circuits important to learned vocalizations remains unclear. Here we show that FoxP2 knockdown in the songbird striatum disrupts developmental and social modulation of song variability. Recordings in anesthetized birds show that FoxP2 knockdown interferes with D1R-dependent modulation of activity propagation in a corticostriatal pathway important to song variability, an effect that may be partly attributable to reduced D1R and DARPP-32 protein levels. Furthermore, recordings in singing birds reveal that FoxP2 knockdown prevents social modulation of singing-related activity in this pathway. These findings show that reduced FoxP2 levels interfere with the dopaminergic modulation of vocal variability, which may impede song and speech development by disrupting reinforcement learning mechanisms. Copyright © 2013 Elsevier Inc. All rights reserved.
Murugan, Malavika; Harward, Stephen; Scharff, Constance; Mooney, Richard
2013-01-01
Summary Mutations of the FOXP2 gene impair speech and language development in humans and shRNA-mediated suppression of the avian orthologue FoxP2 disrupts song learning in juvenile zebra finches. How diminished FoxP2 levels affect vocal control and alter the function of neural circuits important to learned vocalizations remains unclear. Here we show that FoxP2 knockdown in the songbird striatum disrupts developmental and social modulation of song variability. Recordings in anaesthetized birds show that FoxP2 knockdown interferes with D1R-dependent modulation of activity propagation in a corticostriatal pathway important to song variability, an effect that may be partly attributable to reduced D1R and DARPP-32 protein levels. Furthermore, recordings in singing birds reveal that FoxP2 knockdown prevents social modulation of singing-related activity in this pathway. These findings show that reduced FoxP2 levels interfere with the dopaminergic modulation of vocal variability, which may impede song and speech development by disrupting reinforcement learning mechanisms. PMID:24268418
An acoustic glottal source for vocal tract physical models
NASA Astrophysics Data System (ADS)
Hannukainen, Antti; Kuortti, Juha; Malinen, Jarmo; Ojalammi, Antti
2017-11-01
A sound source is proposed for the acoustic measurement of physical models of the human vocal tract. The physical models are produced by fast prototyping, based on magnetic resonance imaging during prolonged vowel production. The sound source, accompanied by custom signal processing algorithms, is used for two kinds of measurements from physical models of the vocal tract: (i) amplitude frequency response and resonant frequency measurements, and (ii) signal reconstructions at the source output according to a target pressure waveform with measurements at the mouth position. The proposed source and the software are validated by computational acoustics experiments and measurements on a physical model of the vocal tract corresponding to the vowels [] of a male speaker.
Aerodynamically and acoustically driven modes of vibration in a physical model of the vocal folds.
Zhang, Zhaoyan; Neubauer, Juergen; Berry, David A
2006-11-01
In a single-layered, isotropic, physical model of the vocal folds, distinct phonation types were identified based on the medial surface dynamics of the vocal fold. For acoustically driven phonation, a single, in-phase, x-10 like eigenmode captured the essential dynamics, and coupled with one of the acoustic resonances of the subglottal tract. Thus, the fundamental frequency appeared to be determined primarily by a subglottal acoustic resonance. In contrast, aerodynamically driven phonation did not naturally appear in the single-layered model, but was facilitated by the introduction of a vertical constraint. For this phonation type, fundamental frequency was relatively independent of the acoustic resonances, and two eigenmodes were required to capture the essential dynamics of the vocal fold, including an out-of-phase x-11 like eigenmode and an in-phase x-10 like eigenmode, as described in earlier theoretical work. The two eigenmodes entrained to the same frequency, and were decoupled from subglottal acoustic resonances. With this independence from the acoustic resonances, vocal fold dynamics appeared to be determined primarily by near-field, fluid-structure interactions.
Acoustic correlates of body size and individual identity in banded penguins
Gamba, Marco; Gili, Claudia; Pessani, Daniela
2017-01-01
Animal vocalisations play a role in individual recognition and mate choice. In nesting penguins, acoustic variation in vocalisations originates from distinctiveness in the morphology of the vocal apparatus. Using the source-filter theory approach, we investigated vocal individuality cues and correlates of body size and mass in the ecstatic display songs the Humboldt and Magellanic penguins. We demonstrate that both fundamental frequency (f0) and formants (F1-F4) are essential vocal features to discriminate among individuals. However, we show that only duration and f0 are honest indicators of the body size and mass, respectively. We did not find any effect of body dimension on formants, formant dispersion nor estimated vocal tract length of the emitters. Overall, our findings provide the first evidence that the resonant frequencies of the vocal tract do not correlate with body size in penguins. Our results add important information to a growing body of literature on the role of the different vocal parameters in conveying biologically meaningful information in bird vocalisations. PMID:28199318
A biorobotic model of the human larynx.
Manti, M; Cianchetti, M; Nacci, A; Ursino, F; Laschi, C
2015-08-01
This work focuses on a physical model of the human larynx that replicates its main components and functions. The prototype reproduces the multilayer vocal folds and the ab/adduction movements. In particular, the vocal folds prototype is made with soft materials whose mechanical properties have been obtained to be similar to the natural tissue in terms of viscoelasticity. A computational model was used to study fluid-structure interaction between vocal folds and the airflow. This tool allowed us to make a comparison between theoretical and experimental results. Measurements were performed with this prototype in an experimental platform comprising a controlled air flow, pressure sensors and a high-speed camera for measuring vocal fold vibrations. Data included oscillation frequency at the onset pressure and glottal width. Results show that the combination between vocal fold geometry, mechanical properties and dimensions exhibits an oscillation frequency close to that of the human vocal fold. Moreover, computational results show a high correlation with the experimental one.
Source levels of foraging humpback whale calls.
Fournet, Michelle E H; Matthews, Leanna P; Gabriele, Christine M; Mellinger, David K; Klinck, Holger
2018-02-01
Humpback whales produce a wide range of low- to mid-frequency vocalizations throughout their migratory range. Non-song "calls" dominate this species' vocal repertoire while on high-latitude foraging grounds. The source levels of 426 humpback whale calls in four vocal classes were estimated using a four-element planar array deployed in Glacier Bay National Park and Preserve, Southeast Alaska. There was no significant difference in source levels between humpback whale vocal classes. The mean call source level was 137 dB RMS re 1 μPa @ 1 m in the bandwidth of the call (range 113-157 dB RMS re 1 μPa @ 1 m), where bandwidth is defined as the frequency range from the lowest to the highest frequency component of the call. These values represent a robust estimate of humpback whale source levels on foraging grounds and should append earlier estimates.
Hage, Steffen R.; Jiang, Tinglei; Berquist, Sean W.; Feng, Jiang; Metzner, Walter
2013-01-01
The Lombard effect, an involuntary rise in call amplitude in response to masking ambient noise, represents one of the most efficient mechanisms to optimize signal-to-noise ratio. The Lombard effect occurs in birds and mammals, including humans, and is often associated with several other vocal changes, such as call frequency and duration. Most studies, however, have focused on noise-dependent changes in call amplitude. It is therefore still largely unknown how the adaptive changes in call amplitude relate to associated vocal changes such as frequency shifts, how the underlying mechanisms are linked, and if auditory feedback from the changing vocal output is needed. Here, we examined the Lombard effect and the associated changes in call frequency in a highly vocal mammal, echolocating horseshoe bats. We analyzed how bandpass-filtered noise (BFN; bandwidth 20 kHz) affected their echolocation behavior when BFN was centered on different frequencies within their hearing range. Call amplitudes increased only when BFN was centered on the dominant frequency component of the bats’ calls. In contrast, call frequencies increased for all but one BFN center frequency tested. Both amplitude and frequency rises were extremely fast and occurred in the first call uttered after noise onset, suggesting that no auditory feedback was required. The different effects that varying the BFN center frequency had on amplitude and frequency rises indicate different neural circuits and/or mechanisms underlying these changes. PMID:23431172
Wild, J M; Krützfeldt, N E O
2012-02-15
During singing in songbirds, the extent of beak opening, like the extent of mouth opening in human singers, is partially correlated with the fundamental frequency of the sounds emitted. Since song in songbirds is under the control of "the song system" (a collection of interconnected forebrain nuclei dedicated to the learning and production of song), it might be expected that beak movements during singing would also be controlled by this system. However, direct neural connections between the telencephalic output of the song system and beak muscle motor neurons in the brainstem are conspicuous by their absence, leaving unresolved the question of how beak movements are affected during singing. By using standard tract tracing methods, we sought to answer this question by defining beak premotor neurons and examining their afferent projections. In the caudal medulla, jaw premotor cell bodies were located adjacent to the terminal field of the output of the song system, into which many premotor neurons extended their dendrites. The premotor neurons also received a novel input from the trigeminal ganglion and an overlapping input from a lateral arcopallial component of a trigeminal sensorimotor circuit that traverses the forebrain. The ganglionic input in songbirds, which is not present in doves and pigeons that vocalize with a closed beak, may modulate the activity of beak premotor neurons in concert with the output of the song system. These inputs to jaw premotor neurons could, together, affect beak movements as a means of modulating filter properties of the upper vocal tract during singing. Copyright © 2011 Wiley-Liss, Inc.
Wild, J.M.; Krützfeldt, N.E.O.
2014-01-01
During singing in songbirds, the extent of beak opening, like the extent of mouth opening in human singers, is partially correlated with the fundamental frequency of the sounds emitted. Since song in songbirds is under the control of “the song system” (a collection of interconnected forebrain nuclei dedicated to the learning and production of song), it might be expected that beak movements during singing would also be controlled by this system. However, direct neural connections between the telencephalic output of the song system and beak muscle motor neurons in the brainstem are conspicuous by their absence, leaving unresolved the question of how beak movements are affected during singing. By using standard tract tracing methods, we sought to answer this question by defining beak premotor neurons and examining their afferent projections. In the caudal medulla, jaw premotor cell bodies were located adjacent to the terminal field of the output of the song system, into which many premotor neurons extended their dendrites. The premotor neurons also received a novel input from the trigeminal ganglion and an overlapping input from a lateral arcopallial component of a trigeminal sensorimotor circuit that traverses the forebrain. The ganglionic input in songbirds, which is not present in doves and pigeons that vocalize with a closed beak, may modulate the activity of beak premotor neurons in concert with the output of the song system. These inputs to jaw premotor neurons could, together, affect beak movements as a means of modulating filter properties of the upper vocal tract during singing. PMID:21858818
Noise Pollution Filters Bird Communities Based on Vocal Frequency
Francis, Clinton D.; Ortega, Catherine P.; Cruz, Alexander
2011-01-01
Background Human-generated noise pollution now permeates natural habitats worldwide, presenting evolutionarily novel acoustic conditions unprecedented to most landscapes. These acoustics not only harm humans, but threaten wildlife, and especially birds, via changes to species densities, foraging behavior, reproductive success, and predator-prey interactions. Explanations for negative effects of noise on birds include disruption of acoustic communication through energetic masking, potentially forcing species that rely upon acoustic communication to abandon otherwise suitable areas. However, this hypothesis has not been adequately tested because confounding stimuli often co-vary with noise and are difficult to separate from noise exposure. Methodology/Principal Findings Using a natural experiment that controls for confounding stimuli, we evaluate whether species vocal features or urban-tolerance classifications explain their responses to noise measured through habitat use. Two data sets representing nesting and abundance responses reveal that noise filters bird communities nonrandomly. Signal duration and urban tolerance failed to explain species-specific responses, but birds with low-frequency signals that are more susceptible to masking from noise avoided noisy areas and birds with higher frequency vocalizations remained. Signal frequency was also negatively correlated with body mass, suggesting that larger birds may be more sensitive to noise due to the link between body size and vocal frequency. Conclusions/Significance Our findings suggest that acoustic masking by noise may be a strong selective force shaping the ecology of birds worldwide. Larger birds with lower frequency signals may be excluded from noisy areas, whereas smaller species persist via transmission of higher frequency signals. We discuss our findings as they relate to interspecific relationships among body size, vocal amplitude and frequency and suggest that they are immediately relevant to the global problem of increases in noise by providing critical insight as to which species traits influence tolerance of these novel acoustics. PMID:22096517
Fundamental frequency, phonation maximum time and vocal complaints in morbidly obese women
de SOUZA, Lourdes Bernadete Rocha; PEREIRA, Rayane Medeiros; dos SANTOS, Marquiony Marques; GODOY, Cynthia Meida de Almeida
2014-01-01
Background Obese people have abnormal deposition of fat in the vocal tract that can interfere with the acoustic voice. Aim To relate the fundamental frequency, the maximum phonation time and voice complaints from a group of morbidly obese women. Methods Observational, cross-sectional and descriptive study that included 44 morbidly obese women, mean age of 42.45 (±10.31) years old, observational group and 30 women without obesity, control group, with 33.79 (±4.51)years old. The voice recording was done in a quiet environment, on a laptop using the program ANAGRAF acoustic analysis of speech sounds. To extract the values of fundamental frequency the subjects were asked to produce vowel [a] at usual intensity for a period in average of three seconds. After the voice recording, participants were prompted to produce sustained vowel [ a] , [ i] and [ u] at usual intensity and height, using a stopwatch to measure the time that each participant could hold each vowel. Results The majority, 31(70.5%), had vocal complaints, with a higher percentage for complaints of vocal fatigue 20(64.51%) and voice failures 19(61.29%) followed by dryness of the throat in 15 (48.38%) and effort to speak 13(41.93%). There was no statistically significant difference regarding the mean fundamental frequency of the voice in both groups, but there was significance between the two groups regarding maximum phonation. Conclusion Increased adipose tissue in the vocal tract interfered in the vocal parameters. PMID:24676298
Selective attention modulates early human evoked potentials during emotional face-voice processing.
Ho, Hao Tam; Schröger, Erich; Kotz, Sonja A
2015-04-01
Recent findings on multisensory integration suggest that selective attention influences cross-sensory interactions from an early processing stage. Yet, in the field of emotional face-voice integration, the hypothesis prevails that facial and vocal emotional information interacts preattentively. Using ERPs, we investigated the influence of selective attention on the perception of congruent versus incongruent combinations of neutral and angry facial and vocal expressions. Attention was manipulated via four tasks that directed participants to (i) the facial expression, (ii) the vocal expression, (iii) the emotional congruence between the face and the voice, and (iv) the synchrony between lip movement and speech onset. Our results revealed early interactions between facial and vocal emotional expressions, manifested as modulations of the auditory N1 and P2 amplitude by incongruent emotional face-voice combinations. Although audiovisual emotional interactions within the N1 time window were affected by the attentional manipulations, interactions within the P2 modulation showed no such attentional influence. Thus, we propose that the N1 and P2 are functionally dissociated in terms of emotional face-voice processing and discuss evidence in support of the notion that the N1 is associated with cross-sensory prediction, whereas the P2 relates to the derivation of an emotional percept. Essentially, our findings put the integration of facial and vocal emotional expressions into a new perspective-one that regards the integration process as a composite of multiple, possibly independent subprocesses, some of which are susceptible to attentional modulation, whereas others may be influenced by additional factors.
ERIC Educational Resources Information Center
McKenna, Victoria S.; Llico, Andres F.; Mehta, Daryush D.; Perkell, Joseph S.; Stepp, Cara E.
2017-01-01
Purpose: This study examined the relationship between the magnitude of neck-surface vibration (NSV[subscript Mag]; transduced with an accelerometer) and intraoral estimates of subglottal pressure (P'[subscript sg]) during variations in vocal effort at 3 intensity levels. Method: Twelve vocally healthy adults produced strings of /p?/ syllables in 3…
Wang, Lei; Luo, Jinhong; Wang, Hongna; Ou, Wei; Jiang, Tinglei; Liu, Ying; Lyle, Dennis; Feng, Jiang
2014-02-01
Studying relationships between characteristics of sonar pulses and habitat clutter level is important for the understanding of signal design in bat echolocation. However, most studies have focused on overall spectral and temporal parameters of such vocalizations, with focus less on potential variation in frequency modulation rates (MRs) occurring within each pulse. In the current study, frequency modulation (FM) characteristics were examined in echolocation pulses recorded from big-footed myotis (Myotis macrodactylus) bats as these animals searched for prey in five habitats differing in relative clutter level. Pulses were analyzed using ten parameters, including four structure-related characters which were derived by dividing each pulse into three elements based on two knees in the FM sweep. Results showed that overall frequency, pulse duration, and MR all varied across habitat. The strongest effects were found for MR in the body of the pulse, implying that this particular component plays a major role as M. macrodactylus, and potentially other bat species, adjust to varying clutter levels in their foraging habitats.
Auditory-Motor Control of Vocal Production during Divided Attention: Behavioral and ERP Correlates.
Liu, Ying; Fan, Hao; Li, Jingting; Jones, Jeffery A; Liu, Peng; Zhang, Baofeng; Liu, Hanjun
2018-01-01
When people hear unexpected perturbations in auditory feedback, they produce rapid compensatory adjustments of their vocal behavior. Recent evidence has shown enhanced vocal compensations and cortical event-related potentials (ERPs) in response to attended pitch feedback perturbations, suggesting that this reflex-like behavior is influenced by selective attention. Less is known, however, about auditory-motor integration for voice control during divided attention. The present cross-modal study investigated the behavioral and ERP correlates of auditory feedback control of vocal pitch production during divided attention. During the production of sustained vowels, 32 young adults were instructed to simultaneously attend to both pitch feedback perturbations they heard and flashing red lights they saw. The presentation rate of the visual stimuli was varied to produce a low, intermediate, and high attentional load. The behavioral results showed that the low-load condition elicited significantly smaller vocal compensations for pitch perturbations than the intermediate-load and high-load conditions. As well, the cortical processing of vocal pitch feedback was also modulated as a function of divided attention. When compared to the low-load and intermediate-load conditions, the high-load condition elicited significantly larger N1 responses and smaller P2 responses to pitch perturbations. These findings provide the first neurobehavioral evidence that divided attention can modulate auditory feedback control of vocal pitch production.
Artiodactyl and Perissodactyl acoustics: Identifying distress calls by farm animals
NASA Astrophysics Data System (ADS)
Browning, David G.; Scheifele, Peter M.
2004-05-01
There is growing concern for the welfare of farm animals. Vocal signals are discernable in a herd, generally carry over relatively long ranges, and, as Jahns has shown, can be easily automatically detected. Analysis of vocalizations from the two principal farm animal families show, however, that only a few, a pig's squeal, for example, meet Morton's classic criteria for distress. In general, Artiodactyls (cows, sheep, goats, etc.) have tonal bellows or bleats where apparently one vocalization fits many emotional situations. Duration and repetition, as Grandin has suggested, may be the important criteria in indicating stress. In contrast, Perissodactyls vary frequency during some vocalizations, such as a horse whinny, but no direct connection between frequency change and stress has yet been determined. The apparent reliance of Perissodactyles (with keen eyesight) on visual detection of body language appears to limit to some degree the amount of vocalization.
Aeroelastic Model of Vocal-Fold Vibrating Element for Studying the Phonation Threshold
NASA Astrophysics Data System (ADS)
Horáček, J.; Švec, J. G.
2002-10-01
An original theoretical model for vibration onset of the vocal folds in the air-flow coming from the human subglottal tract is designed, which allows studying the influence of the physical properties of the vocal folds (e.g., geometrical shape, mass, viscosity) on their vibration characteristics (such as the natural frequencies, mode shapes of vibration and the thresholds of instability). The mathematical model of the vocal fold is designed as a simplified dynamic system of two degrees of freedom (rotation and translation) vibrating on an elastic foundation in the wall of a channel conveying air. An approximate unsteady one-dimensional flow theory for the inviscid incompressible fluid is presented for the phonatory air-flow. A generally defined shape of the vocal-fold surface is considered for expressing the unsteady aerodynamic forces in the glottis. The parameters of the mechanical part of the model, i.e., the mass, stiffness and damping matrices, are related to the geometry and material density of the vocal folds as well as to the fundamental natural frequency and damping known from experiments. The coupled numerical solution yields the vibration characteristics (natural frequencies, damping and mode shapes of vibration), including the instability thresholds of the aeroelastic system. The vibration characteristics obtained from the coupled numerical solution of the system appear to be in reasonable qualitative agreement with the physiological data and clinical observations. The model is particularly suitable for studying the phonation threshold, i.e., the onset of vibration of the vocal folds.
Multimodal modeling and validation of simplified vocal tract acoustics for sibilant /s/
NASA Astrophysics Data System (ADS)
Yoshinaga, T.; Van Hirtum, A.; Wada, S.
2017-12-01
To investigate the acoustic characteristics of sibilant /s/, multimodal theory is applied to a simplified vocal tract geometry derived from a CT scan of a single speaker for whom the sound spectrum was gathered. The vocal tract was represented by a concatenation of waveguides with rectangular cross-sections and constant width, and a sound source was placed either at the inlet of the vocal tract or downstream from the constriction representing the sibilant groove. The modeled pressure amplitude was validated experimentally using an acoustic driver or airflow supply at the vocal tract inlet. Results showed that the spectrum predicted with the source at the inlet and including higher-order modes matched the spectrum measured with the acoustic driver at the inlet. Spectra modeled with the source downstream from the constriction captured the first characteristic peak observed for the speaker at 4 kHz. By positioning the source near the upper teeth wall, the higher frequency peak observed for the speaker at 8 kHz was predicted with the inclusion of higher-order modes. At the frequencies of the characteristic peaks, nodes and antinodes of the pressure amplitude were observed in the simplified vocal tract when the source was placed downstream from the constriction. These results indicate that the multimodal approach enables to capture the amplitude and frequency of the peaks in the spectrum as well as the nodes and antinodes of the pressure distribution due to /s/ inside the vocal tract.
NASA Astrophysics Data System (ADS)
Yoshinaga, Tsukasa; Nozaki, Kazunori; Wada, Shigeo
2018-03-01
The sound generation mechanisms of sibilant fricatives were investigated with experimental measurements and large-eddy simulations using a simplified vocal tract model. The vocal tract geometry was simplified to a three-dimensional rectangular channel, and differences in the geometries while pronouncing fricatives /s/ and /∫/ were expressed by shifting the position of the tongue and its constricted flow channel. Experimental results showed that the characteristic peak frequency of the fricatives decreased when the distance between the tongue and teeth increased. Numerical simulations revealed that the jet flow generated from the constriction impinged on the upper teeth wall and caused the main sound source upstream and downstream from the gap between the teeth. While magnitudes of the sound source decreased with increments of the frequency, amplitudes of the pressure downstream from the constriction increased at the peak frequencies of the corresponding tongue position. These results indicate that the sound pressures at the peak frequencies increased by acoustic resonance in the channel downstream from the constriction, and the different frequency characteristics between /s/ and /∫/ were produced by changing the constriction and the acoustic node positions inside the vocal tract.
The Impact of Vocal Hyperfunction on Relative Fundamental Frequency during Voicing Offset and Onset
ERIC Educational Resources Information Center
Stepp, Cara E.; Hillman, Robert E.; Heaton, James T.
2010-01-01
Purpose: This study tested the hypothesis that individuals with vocal hyperfunction would show decreases in relative fundamental frequency (RFF) surrounding a voiceless consonant. Method: This retrospective study of 2 clinical databases used speech samples from 15 control participants and women with hyperfunction-related voice disorders: 82 prior…
Frequency Response of Synthetic Vocal Fold Models with Linear and Nonlinear Material Properties
ERIC Educational Resources Information Center
Shaw, Stephanie M.; Thomson, Scott L.; Dromey, Christopher; Smith, Simeon
2012-01-01
Purpose: The purpose of this study was to create synthetic vocal fold models with nonlinear stress-strain properties and to investigate the effect of linear versus nonlinear material properties on fundamental frequency (F[subscript 0]) during anterior-posterior stretching. Method: Three materially linear and 3 materially nonlinear models were…
ERIC Educational Resources Information Center
Murray, Elizabeth S. Heller; Lien, Yu-An S.; Van Stan, Jarrad H.; Mehta, Daryush D.; Hillman, Robert E.; Noordzij, J. Pieter; Stepp, Cara E.
2017-01-01
Purpose: The purpose of this article is to examine the ability of an acoustic measure, relative fundamental frequency (RFF), to distinguish between two subtypes of vocal hyperfunction (VH): phonotraumatic (PVH) and non-phonotraumatic (NPVH). Method: RFF values were compared among control individuals with typical voices (N = 49), individuals with…
Frequency Response of Synthetic Vocal Fold Models with Linear and Nonlinear Material Properties
Shaw, Stephanie M.; Thomson, Scott L.; Dromey, Christopher; Smith, Simeon
2014-01-01
Purpose The purpose of this study was to create synthetic vocal fold models with nonlinear stress-strain properties and to investigate the effect of linear versus nonlinear material properties on fundamental frequency during anterior-posterior stretching. Method Three materially linear and three materially nonlinear models were created and stretched up to 10 mm in 1 mm increments. Phonation onset pressure (Pon) and fundamental frequency (F0) at Pon were recorded for each length. Measurements were repeated as the models were relaxed in 1 mm increments back to their resting lengths, and tensile tests were conducted to determine the stress-strain responses of linear versus nonlinear models. Results Nonlinear models demonstrated a more substantial frequency response than did linear models and a more predictable pattern of F0 increase with respect to increasing length (although range was inconsistent across models). Pon generally increased with increasing vocal fold length for nonlinear models, whereas for linear models, Pon decreased with increasing length. Conclusions Nonlinear synthetic models appear to more accurately represent the human vocal folds than linear models, especially with respect to F0 response. PMID:22271874
Hearing through the noise: Biologically inspired noise reduction
NASA Astrophysics Data System (ADS)
Lee, Tyler Paul
Vocal communication in the natural world demands that a listener perform a remarkably complicated task in real-time. Vocalizations mix with all other sounds in the environment as they travel to the listener, arriving as a jumbled low-dimensional signal. A listener must then use this signal to extract the structure corresponding to individual sound sources. How this computation is implemented in the brain remains poorly understood, yet an accurate description of such mechanisms would impact a variety of medical and technological applications of sound processing. In this thesis, I describe initial work on how neurons in the secondary auditory cortex of the Zebra Finch extract song from naturalistic background noise. I then build on our understanding of the function of these neurons by creating an algorithm that extracts speech from natural background noise using spectrotemporal modulations. The algorithm, implemented as an artificial neural network, can be flexibly applied to any class of signal or noise and performs better than an optimal frequency-based noise reduction algorithm for a variety of background noises and signal-to-noise ratios. One potential drawback to using spectrotemporal modulations for noise reduction, though, is that analyzing the modulations present in an ongoing sound requires a latency set by the slowest temporal modulation computed. The algorithm avoids this problem by reducing noise predictively, taking advantage of the large amount of temporal structure present in natural sounds. This predictive denoising has ties to recent work suggesting that the auditory system uses attention to focus on predicted regions of spectrotemporal space when performing auditory scene analysis.
Vocal impact of a prolonged reading task in dysphonic versus normophonic female teachers.
Remacle, Angélique; Morsomme, Dominique; Berrué, Elise; Finck, Camille
2012-11-01
This study evaluates the effect of a 2-hour reading task between 70 and 75 dB(A) in 16 normophonic and 16 dysphonic female teachers with vocal nodules. Objective measurements (acoustic analysis, voice range measurements, and aerodynamic measurements) and subjective self-ratings were collected before and every 30 minutes during the reading to determine the voice evolution in both groups. Fundamental frequency, lowest frequency, highest frequency (F-High), highest intensity, and intensity range increase through the reading, whereas shimmer decreases. Maximum phonation time decreases after 30 minutes. Estimated subglottal pressure (ESP) and sound pressure level increase during the first hour. Afterward, ESP decreases. Self-ratings worsen through time. When comparing the normophonic and the dysphonic teachers, self-ratings reveal more complaints in the dysphonic group. Few differences in objective measurements are found between both groups: normophonic teachers show lower ESP, higher F-High, and greater frequency range. Frequency modifications from acoustic analysis and voice range measurements suggest an increased laryngeal tension during vocal load, while subjects perceive a worsening of voice. Aerodynamic parameters depict first a deterioration of voice efficiency and then an adaptation to the prolonged reading. The comparison between both groups shows a discrepancy between objective measurements and self-ratings, suggesting that both approaches are necessary to have a complete view of vocal load effects. Surprisingly, both groups behave similarly through vocal load, without more or quicker deterioration of voice in the dysphonic group. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Multi-component separation and analysis of bat echolocation calls.
DiCecco, John; Gaudette, Jason E; Simmons, James A
2013-01-01
The vast majority of animal vocalizations contain multiple frequency modulated (FM) components with varying amounts of non-linear modulation and harmonic instability. This is especially true of biosonar sounds where precise time-frequency templates are essential for neural information processing of echoes. Understanding the dynamic waveform design by bats and other echolocating animals may help to improve the efficacy of man-made sonar through biomimetic design. Bats are known to adapt their call structure based on the echolocation task, proximity to nearby objects, and density of acoustic clutter. To interpret the significance of these changes, a method was developed for component separation and analysis of biosonar waveforms. Techniques for imaging in the time-frequency plane are typically limited due to the uncertainty principle and interference cross terms. This problem is addressed by extending the use of the fractional Fourier transform to isolate each non-linear component for separate analysis. Once separated, empirical mode decomposition can be used to further examine each component. The Hilbert transform may then successfully extract detailed time-frequency information from each isolated component. This multi-component analysis method is applied to the sonar signals of four species of bats recorded in-flight by radiotelemetry along with a comparison of other common time-frequency representations.
Segmentation of Killer Whale Vocalizations Using the Hilbert-Huang Transform
NASA Astrophysics Data System (ADS)
Adam, Olivier
2008-12-01
The study of cetacean vocalizations is usually based on spectrogram analysis. The feature extraction is obtained from 2D methods like the edge detection algorithm. Difficulties appear when signal-to-noise ratios are weak or when more than one vocalization is simultaneously emitted. This is the case for acoustic observations in a natural environment and especially for the killer whales which swim in groups. To resolve this problem, we propose the use of the Hilbert-Huang transform. First, we illustrate how few modes (5) are satisfactory for the analysis of these calls. Then, we detail our approach which consists of combining the modes for extracting the time-varying frequencies of the vocalizations. This combination takes advantage of one of the empirical mode decomposition properties which is that the successive IMFs represent the original data broken down into frequency components from highest to lowest frequency. To evaluate the performance, our method is first applied on the simulated chirp signals. This approach allows us to link one chirp to one mode. Then we apply it on real signals emitted by killer whales. The results confirm that this method is a favorable alternative for the automatic extraction of killer whale vocalizations.
Three-month-old human infants use vocal cues of body size.
Pietraszewski, David; Wertz, Annie E; Bryant, Gregory A; Wynn, Karen
2017-06-14
Differences in vocal fundamental ( F 0 ) and average formant ( F n ) frequencies covary with body size in most terrestrial mammals, such that larger organisms tend to produce lower frequency sounds than smaller organisms, both between species and also across different sex and life-stage morphs within species. Here we examined whether three-month-old human infants are sensitive to the relationship between body size and sound frequencies. Using a violation-of-expectation paradigm, we found that infants looked longer at stimuli inconsistent with the relationship-that is, a smaller organism producing lower frequency sounds, and a larger organism producing higher frequency sounds-than at stimuli that were consistent with it. This effect was stronger for fundamental frequency than it was for average formant frequency. These results suggest that by three months of age, human infants are already sensitive to the biologically relevant covariation between vocalization frequencies and visual cues to body size. This ability may be a consequence of developmental adaptations for building a phenotype capable of identifying and representing an organism's size, sex and life-stage. © 2017 The Author(s).
Behroozmand, Roozbeh; Karvelis, Laura; Liu, Hanjun; Larson, Charles R.
2009-01-01
Objective The present study investigated whether self-vocalization enhances auditory neural responsiveness to voice pitch feedback perturbation and how this vocalization-induced neural modulation can be affected by the extent of the feedback deviation. Method Event related potentials (ERPs) were recorded in 15 subjects in response to +100, +200 and +500 cents pitch-shifted voice auditory feedback during active vocalization and passive listening to the playback of the self-produced vocalizations. Result The amplitude of the evoked P1 (latency: 73.51 ms) and P2 (latency: 199.55 ms) ERP components in response to feedback perturbation were significantly larger during vocalization than listening. The difference between P2 peak amplitudes during vocalization vs. listening was shown to be significantly larger for +100 than +500 cents stimulus. Conclusion Results indicate that the human auditory cortex is more responsive to voice F0 feedback perturbations during vocalization than passive listening. Greater vocalization-induced enhancement of the auditory responsiveness to smaller feedback perturbations may imply that the audio-vocal system detects and corrects for errors in vocal production that closely match the expected vocal output. Significance Findings of this study support previous suggestions regarding the enhanced auditory sensitivity to feedback alterations during self-vocalization, which may serve the purpose of feedback-based monitoring of one’s voice. PMID:19520602
Siupsinskiene, Nora; Lycke, Hugo
2011-07-01
This prospective cross-sectional study examines the effects of voice training on vocal capabilities in vocally healthy age and gender differentiated groups measured by voice range profile (VRP) and speech range profile (SRP). Frequency and intensity measurements of the VRP and SRP using standard singing and speaking voice protocols were derived from 161 trained choir singers (21 males, 59 females, and 81 prepubescent children) and from 188 nonsingers (38 males, 89 females, and 61 children). When compared with nonsingers, both genders of trained adult and child singers exhibited increased mean pitch range, highest frequency, and VRP area in high frequencies (P<0.05). Female singers and child singers also showed significantly increased mean maximum voice intensity, intensity range, and total VRP area. The logistic regression analysis showed that VRP pitch range, highest frequency, maximum voice intensity, and maximum-minimum intensity range, and SRP slope of speaking curve were the key predictors of voice training. Age, gender, and voice training differentiated norms of VRP and SRP parameters are presented. Significant positive effect of voice training on vocal capabilities, mostly singing voice, was confirmed. The presented norms for trained singers, with key parameters differentiated by gender and age, are suggested for clinical practice of otolaryngologists and speech-language pathologists. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Sex hormones and the female voice.
Abitbol, J; Abitbol, P; Abitbol, B
1999-09-01
In the following, the authors examine the relationship between hormonal climate and the female voice through discussion of hormonal biochemistry and physiology and informal reporting on a study of 197 women with either premenstrual or menopausal voice syndrome. These facts are placed in a larger historical and cultural context, which is inextricably bound to the understanding of the female voice. The female voice evolves from childhood to menopause, under the varied influences of estrogens, progesterone, and testosterone. These hormones are the dominant factor in determining voice changes throughout life. For example, a woman's voice always develops masculine characteristics after an injection of testosterone. Such a change is irreversible. Conversely, male castrati had feminine voices because they lacked the physiologic changes associated with testosterone. The vocal instrument is comprised of the vibratory body, the respiratory power source and the oropharyngeal resonating chambers. Voice is characterized by its intensity, frequency, and harmonics. The harmonics are hormonally dependent. This is illustrated by the changes that occur during male and female puberty: In the female, the impact of estrogens at puberty, in concert with progesterone, produces the characteristics of the female voice, with a fundamental frequency one third lower than that of a child. In the male, androgens released at puberty are responsible for the male vocal frequency, an octave lower than that of a child. Premenstrual vocal syndrome is characterized by vocal fatigue, decreased range, a loss of power and loss of certain harmonics. The syndrome usually starts some 4-5 days before menstruation in some 33% of women. Vocal professionals are particularly affected. Dynamic vocal exploration by televideoendoscopy shows congestion, microvarices, edema of the posterior third of the vocal folds and a loss of its vibratory amplitude. The authors studied 97 premenstrual women who were prescribed a treatment of multivitamins, venous tone stimulants (phlebotonics), and anti-edematous drugs. We obtained symptomatic improvement in 84 patients. The menopausal vocal syndrome is characterized by lowered vocal intensity, vocal fatigue, a decreased range with loss of the high tones and a loss of vocal quality. In a study of 100 menopausal women, 17 presented with a menopausal vocal syndrome. To rehabilitate their voices, and thus their professional lives, patients were prescribed hormone replacement therapy and multi-vitamins. All 97 women showed signs of vocal muscle atrophy, reduction in the thickness of the mucosa and reduced mobility in the cricoarytenoid joint. Multi-factorial therapy (hormone replacement therapy and multi-vitamins) has to be individually adjusted to each case depending on body type, vocal needs, and other factors.
Development of echolocation and communication vocalizations in the big brown bat, Eptesicus fuscus.
Monroy, Jenna A; Carter, Matthew E; Miller, Kimberly E; Covey, Ellen
2011-05-01
Big brown bats form large maternity colonies of up to 200 mothers and their pups. If pups are separated from their mothers, they can locate each other using vocalizations. The goal of this study was to systematically characterize the development of echolocation and communication calls from birth through adulthood to determine whether they develop from a common precursor at the same or different rates, or whether both types are present initially. Three females and their six pups were isolated from our captive breeding colony. We recorded vocal activity from postnatal day 1 to 35, both when the pups were isolated and when they were reunited with their mothers. At birth, pups exclusively emitted isolation calls, with a fundamental frequency range <20 kHz, and duration >30 ms. By the middle of week 1, different types of vocalizations began to emerge. Starting in week 2, pups in the presence of their mothers emitted sounds that resembled adult communication vocalizations, with a lower frequency range and longer durations than isolation calls or echolocation signals. During weeks 2 and 3, these vocalizations were extremely heterogeneous, suggesting that the pups went through a babbling stage before establishing a repertoire of stereotyped adult vocalizations around week 4. By week 4, vocalizations emitted when pups were alone were identical to adult echolocation signals. Echolocation and communication signals both appear to develop from the isolation call, diverging during week 2 and continuing to develop at different rates for several weeks until the adult vocal repertoire is established.
Frey, Roland; Volodin, Ilya; Volodina, Elena; Soldatova, Natalia V; Juldaschev, Erkin T
2011-01-01
Similar to male humans, Homo sapiens, the males of a few polygynous ruminants – red deer Cervus elaphus, fallow deer Dama dama and Mongolian gazelle Procapra gutturosa– have a more or less enlarged, low-resting larynx and are capable of additional dynamic vocal tract elongation by larynx retraction during their rutting calls. The vocal correlates of a large larynx and an elongated vocal tract, a low fundamental frequency and low vocal tract resonance frequencies, deter rival males and attract receptive females. The males of the polygynous goitred gazelle, Gazella subgutturosa, provide another, independently evolved, example of an enlarged and low-resting larynx of high mobility. Relevant aspects of the rutting behaviour of territorial wild male goitred gazelles are described. Video and audio recordings served to study the acoustic effects of the enlarged larynx and vocal tract elongation on male rutting calls. Three call types were discriminated: roars, growls and grunts. In addition, the adult male vocal anatomy during the emission of rutting calls is described and functionally discussed using a 2D-model of larynx retraction. The combined morphological, behavioural and acoustic data are discussed in relation to the hypothesis of sexual selection for male-specific deep voices, resulting in convergent features of vocal anatomy in a few polygynous ruminants and in human males. PMID:21413987
Takahashi, Eri; Hyomoto, Kiri; Riquimaroux, Hiroshi; Watanabe, Yoshiaki; Ohta, Tetsuo; Hiryu, Shizuko
2014-08-15
The echolocation behavior of Pipistrellus abramus during exposure to artificial jamming sounds during flight was investigated. Echolocation pulses emitted by the bats were recorded using a telemetry microphone mounted on the bats' backs, and their adaptation based on acoustic characteristics of emitted pulses was assessed in terms of jamming-avoidance responses (JARs). In experiment 1, frequency-modulated jamming sounds (3 ms duration) mimicking echolocation pulses of P. abramus were prepared. All bats showed significant increases in the terminal frequency of the frequency-modulated pulse by an average of 2.1-4.5 kHz when the terminal frequency of the jamming sounds was lower than the bats' own pulses. This frequency shift was not observed using jamming frequencies that overlapped with or were higher than the bats' own pulses. These findings suggest that JARs in P. abramus are sensitive to the terminal frequency of jamming pulses and that the bats' response pattern was dependent on the slight difference in stimulus frequency. In experiment 2, when bats were repeatedly exposed to a band-limited noise of 70 ms duration, the bats in flight more frequently emitted pulses during silent periods between jamming sounds, suggesting that the bats could actively change the timing of pulse emissions, even during flight, to avoid temporal overlap with jamming sounds. Our findings demonstrate that bats could adjust their vocalized frequency and emission timing during flight in response to acoustic jamming stimuli. © 2014. Published by The Company of Biologists Ltd.
Tchernichovski, Ofer; Marcus, Gary
2014-01-01
Studies of vocal learning in songbirds typically focus on the acquisition of sensory templates for song imitation and on the consequent process of matching song production to templates. However, functional vocal development also requires the capacity to adaptively diverge from sensory templates, and to flexibly assemble vocal units. Examples of adaptive divergence include the corrective imitation of abnormal songs, and the decreased tendency to copy overabundant syllables. Such frequency-dependent effects might mirror tradeoffs between the assimilation of group identity (culture) while establishing individual and flexibly expressive songs. Intriguingly, although the requirements for vocal plasticity vary across songbirds, and more so between birdsong and language, the capacity to flexibly assemble vocal sounds develops in a similar, stepwise manner across species. Therefore, universal features of vocal learning go well beyond the capacity to imitate. PMID:25005823
Hasiniaina, Alida F; Scheumann, Marina; Rina Evasoa, Mamy; Braud, Diane; Rasoloharijaona, Solofonirina; Randrianambinina, Blanchard; Zimmermann, Elke
2018-05-02
The critically endangered Claire's mouse lemur, only found in the evergreen rain forest of the National Park Lokobe (LNP) and a few lowland evergreen rain forest fragments of northern Madagascar, was described recently. The present study provides the first quantified information on vocal acoustics of calls, sound associated behavioral context, acoustic niche, and vocal activity of this species. We recorded vocal and social behavior of six male-female and six male-male dyads in a standardized social-encounter paradigm in June and July 2016 at the LNP, Nosy Bé island. Over six successive nights per dyad, we audio recorded and observed behaviors for 3 hr at the beginning of the activity period. Based on the visual inspection of spectrograms and standardized multiparametric sound analysis, we identified seven different call types. Call types can be discriminated based on a combination of harmonicity, fundamental frequency variation, call duration, and degree of tonality. Acoustic features of tonal call types showed that for communication, mouse lemurs use the cryptic, high frequency/ultrasonic frequency niche. Two call types, the Tsak and the Grunt call, were emitted most frequently. Significant differences in vocal activity of the Tsak call were found between male-female and male-male dyads, linked primarily to agonistic conflicts. Dominant mouse lemurs vocalized more than subdominant ones, suggesting that signaling may present an honest indicator of fitness. A comparison of our findings of the Claire's mouse lemur with published findings of five bioacoustically studied mouse lemur species points to the notion that a complex interplay between ecology, predation pressure, and phylogenetic relatedness may shape the evolution of acoustic divergence between species in this smallest-bodied primate radiation. Thus, comparative bioacoustic studies, using standardized procedures, are promising to unravel the role of vocalization for primate species diversity and evolution and for identifying candidates for vocalization-based non-invasive monitoring for conservation purposes. © 2018 Wiley Periodicals, Inc.
Contribution of the supraglottic larynx to the vocal product: imaging and acoustic analysis
NASA Astrophysics Data System (ADS)
Gracco, L. Carol
1996-04-01
Horizontal supraglottic laryngectomy is a surgical procedure to remove a mass lesion located in the region of the pharynx superior to the true vocal folds. In contrast to full or partial laryngectomy, patients who undergo horizontal supraglottic laryngectomy often present with little or nor involvement to the true vocal folds. This population provides an opportunity to examine the acoustic consequences of altering the pharynx while sparing the laryngeal sound source. Acoustic and magnetic resonance imaging (MRI) data were acquired in a group of four patients before and after supraglottic laryngectomy. Acoustic measures included the identification of vocal tract resonances and the fundamental frequency of the vocal fold vibration. 3D reconstruction of the pharyngeal portion of each subjects' vocal tract were made from MRIs taken during phonation and volume measures were obtained. These measures reveal a variable, but often dramatic difference in the surgically-altered area of the pharynx and changes in the formant frequencies of the vowel/i/post surgically. In some cases the presence of the tumor created a deviation from the expected formant values pre-operatively with post-operative values approaching normal. Patients who also underwent radiation treatment post surgically tended to have greater constriction in the pharyngeal area of the vocal tract.
Ishikawa, Camila Cristina; Pinheiro, Thais Gonçalves; Hachiya, Adriana; Montagnoli, Arlindo Neto; Tsuji, Domingos Hiroshi
2017-05-01
The aim of this study was to evaluate the effects of cricothyroid muscle contraction on vocal fold vibration, as evaluated with high-speed videoendoscopy, and to identify one or more aspects of vocal fold vibration that could be used as an irrefutable indicator of unilateral cricothyroid muscle paralysis. This was an experimental study employing excised human larynges. Twenty freshly excised human larynges were evaluated during artificially produced vibration. Each larynx was assessed in three situations: bilateral cricothyroid muscle contraction, unilateral cricothyroid muscle contraction, and no contraction of either cricothyroid muscle. The following parameters were evaluated by high-speed videoendoscopy: fundamental frequency, periodicity, amplitude of vocal fold vibration, and phase symmetry between the vocal folds. Although neither unilateral nor bilateral cricothyroid muscle contraction altered the periodicity of vibration or the occurrence of phase asymmetry, there was a significant decrease in fundamental frequency in parallel with decreasing longitudinal tension. We also found an increase in vibration amplitude of right and left vocal folds, which were similar in terms of their behavior for this parameter in the various situations studied. Our results suggest that differences in vibration amplitude and phase symmetry between vocal folds are not reliable indicators of unilateral cricothyroid muscle paralysis. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Manatee (Trichechus manatus) vocalization usage in relation to environmental noise levels.
Miksis-Olds, Jennifer L; Tyack, Peter L
2009-03-01
Noise can interfere with acoustic communication by masking signals that contain biologically important information. Communication theory recognizes several ways a sender can modify its acoustic signal to compensate for noise, including increasing the source level of a signal, its repetition, its duration, shifting frequency outside that of the noise band, or shifting the timing of signal emission outside of noise periods. The extent to which animals would be expected to use these compensation mechanisms depends on the benefit of successful communication, risk of failure, and the cost of compensation. Here we study whether a coastal marine mammal, the manatee, can modify vocalizations as a function of behavioral context and ambient noise level. To investigate whether and how manatees modify their vocalizations, natural vocalization usage and structure were examined in terms of vocalization rate, duration, frequency, and source level. Vocalizations were classified into two call types, chirps and squeaks, which were analyzed independently. In conditions of elevated noise levels, call rates decreased during feeding and social behaviors, and the duration of each call type was differently influenced by the presence of calves. These results suggest that ambient noise levels do have a detectable effect on manatee communication and that manatees modify their vocalizations as a function of noise in specific behavioral contexts.
Underwater audiogram of the California sea lion by the conditioned vocalization technique1
Schusterman, Ronald J.; Balliet, Richard F.; Nixon, James
1972-01-01
Conditioning techniques were developed demonstrating that pure tone frequencies under water can exert nearly perfect control over the underwater click vocalizations of the California sea lion (Zalophus californianus). Conditioned vocalizations proved to be a reliable way of obtaining underwater sound detection thresholds in Zalophus at 13 different frequencies, covering a frequency range of 250 to 64,000 Hz. The audiogram generated by these threshold measurements suggests that under water, the range of maximal sensitivity for Zalophus lies between one and 28 kHz with best sensitivity at 16 kHz. Between 28 and 36 kHz there is a loss in sensitivity of 60 dB/octave. However, with relatively intense acoustic signals (> 38 dB re 1 μb underwater), Zalophus will respond to frequencies at least as high as 192 kHz. These results are compared with the underwater hearing of other marine mammals. ImagesFig. 1. PMID:5033891
Vocal mechanics in Darwin's finches: correlation of beak gape and song frequency.
Podos, Jeffrey; Southall, Joel A; Rossi-Santos, Marcos R
2004-02-01
Recent studies of vocal mechanics in songbirds have identified a functional role for the beak in sound production. The vocal tract (trachea and beak) filters harmonic overtones from sounds produced by the syrinx, and birds can fine-tune vocal tract resonance properties through changes in beak gape. In this study, we examine patterns of beak gape during song production in seven species of Darwin's finches of the Galápagos Islands. Our principal goals were to characterize the relationship between beak gape and vocal frequency during song production and to explore the possible influence therein of diversity in beak morphology and body size. Birds were audio and video recorded (at 30 frames s(-1)) as they sang in the field, and 164 song sequences were analyzed. We found that song frequency regressed significantly and positively on beak gape for 38 of 56 individuals and for all seven species examined. This finding provides broad support for a resonance model of vocal tract function in Darwin's finches. Comparison among species revealed significant variation in regression y-intercept values. Body size correlated negatively with y-intercept values, although not at a statistically significant level. We failed to detect variation in regression slopes among finch species, although the regression slopes of Darwin's finch and two North American sparrow species were found to differ. Analysis within one species (Geospiza fortis) revealed significant inter-individual variation in regression parameters; these parameters did not correlate with song frequency features or plumage scores. Our results suggest that patterns of beak use during song production were conserved during the Darwin's finch adaptive radiation, despite the evolution of substantial variation in beak morphology and body size.
Darawsheh, Wesam B; Natour, Yaser S; Sada, Eve G
2018-07-01
This pilot study aimed to evaluate the internal consistency, convergent construct validity and criterion validity of Arabic version of the Vocal Tract Discomfort Scale (VTDS), and to investigate the correlation between the scores of the VTDS, the VHI and the acoustic measures of fundamental frequency (F0), shimmer, jitter and signal-to-noise ratio (SNR). A cross-sectional study where 97 participants participated (47 males and 50 females) (mean age 20.5 ± 2.1 years) (31 student singers and 66 other non-professional voice user students). Participants were without self-perceived voice disorders who completed the VTDS-Arab scale and the Voice Handicap Index (VHI-Arab), and recorded a vocal sample of/a:/at a comfortable level. A positive internal consistency that signifies reliability was confirmed by Cronbach's α = .884 and 0.874 for the VTDS-Arab frequency and severity subscales, respectively. A moderate positive correlation was found between the VTDS-Arab (frequency, severity, total) and the VHI-Arab total where values of Pearson's correlation coefficient were r= 0.459, 0.430 and 0.451, respectively. Weak correlations were found between all of the acoustic measures and the scores of the VTDS-Arab and VHI-Arab (total and subscales). The area under curve for the VTDS was AUC= 0.824, 0.804 and 0.817 for the VTDS frequency, VTDS severity and VTDS total, respectively. The VTDS-Arab is a valid and reliable tool in measuring vocal tract sensations and predicting the perception of vocal handicap in student singers and can be used to predict the vocal load among professional voice users.
Riede, Tobias; Goller, Franz
2010-10-01
Song production in songbirds is a model system for studying learned vocal behavior. As in humans, bird phonation involves three main motor systems (respiration, vocal organ and vocal tract). The avian respiratory mechanism uses pressure regulation in air sacs to ventilate a rigid lung. In songbirds sound is generated with two independently controlled sound sources, which reside in a uniquely avian vocal organ, the syrinx. However, the physical sound generation mechanism in the syrinx shows strong analogies to that in the human larynx, such that both can be characterized as myoelastic-aerodynamic sound sources. Similarities include active adduction and abduction, oscillating tissue masses which modulate flow rate through the organ and a layered structure of the oscillating tissue masses giving rise to complex viscoelastic properties. Differences in the functional morphology of the sound producing system between birds and humans require specific motor control patterns. The songbird vocal apparatus is adapted for high speed, suggesting that temporal patterns and fast modulation of sound features are important in acoustic communication. Rapid respiratory patterns determine the coarse temporal structure of song and maintain gas exchange even during very long songs. The respiratory system also contributes to the fine control of airflow. Muscular control of the vocal organ regulates airflow and acoustic features. The upper vocal tract of birds filters the sounds generated in the syrinx, and filter properties are actively adjusted. Nonlinear source-filter interactions may also play a role. The unique morphology and biomechanical system for sound production in birds presents an interesting model for exploring parallels in control mechanisms that give rise to highly convergent physical patterns of sound generation. More comparative work should provide a rich source for our understanding of the evolution of complex sound producing systems. Copyright © 2009 Elsevier Inc. All rights reserved.
Forti, Lucas Rodriguez; Foratto, Roseli Maria; Márquez, Rafael; Pereira, Vânia Rosa; Toledo, Luís Felipe
2018-01-01
Anuran vocalizations, such as advertisement and release calls, are informative for taxonomy because species recognition can be based on those signals. Thus, a proper acoustic description of the calls may support taxonomic decisions and may contribute to knowledge about amphibian phylogeny. Here we present a perspective on advertisement call descriptions of the frog subfamily Lophyohylinae, through a literature review and a spatial analysis presenting bioacoustic coldspots (sites with high diversity of species lacking advertisement call descriptions) for this taxonomic group. Additionally, we describe the advertisement and release calls of the still poorly known treefrog, Itapotihyla langsdorffii . We analyzed recordings of six males using the software Raven Pro 1.4 and calculated the coefficient of variation for classifying static and dynamic acoustic properties. We found that more than half of the species within the subfamily do not have their vocalizations described yet. Most of these species are distributed in the western and northern Amazon, where recording sampling effort should be strengthened in order to fill these gaps. The advertisement call of I. langsdorffii is composed of 3-18 short unpulsed notes (mean of 13 ms long), presents harmonic structure, and has a peak dominant frequency of about 1.4 kHz. This call usually presents amplitude modulation, with decreasing intensity along the sequence of notes. The release call is a simple unpulsed note with an average duration of 9 ms, and peak dominant frequency around 1.8 kHz. Temporal properties presented higher variations than spectral properties at both intra- and inter-individual levels. However, only peak dominant frequency was static at intra-individual level. High variability in temporal properties and lower variations related to spectral ones is usual for anurans; The first set of variables is determined by social environment or temperature, while the second is usually related to species-recognition process. Here we review and expand the acoustic knowledge of the subfamily Lophyohylinae, highlighting areas and species for future research.
Hodges-Simeon, Carolyn R; Gurven, Michael; Puts, David A; Gaulin, Steven J C
2014-07-01
Fundamental and formant frequencies influence perceived pitch and are sexually dimorphic in humans. The information content of these acoustic parameters can illuminate the forces of sexual selection shaping vocal sex differences as well as the mechanisms that ensure signal reliability. We use multiple regression to examine the relationships between somatic (height, adiposity, and strength) and acoustic (fundamental frequency [ F 0 ], formant position [ P f ], and fundamental frequency variation [ F 0 -SD]) characteristics in a sample of peripubertal Bolivian Tsimane. Results indicate that among males-but not females-strength is the strongest predictor of F 0 and P f and that F 0 and P f are independent predictors of strength when height and adiposity are controlled. These findings suggest that listeners may attend to vocal frequencies because they signal honest, nonredundant information about male strength and threat potential, which are strongly related to physical maturity and which cannot be ascertained from visual or other indicators of height or adiposity alone.
Computational model for vocal tract dynamics in a suboscine bird.
Assaneo, M F; Trevisan, M A
2010-09-01
In a recent work, active use of the vocal tract has been reported for singing oscines. The reconfiguration of the vocal tract during song serves to match its resonances to the syringeal fundamental frequency, demonstrating a precise coordination of the two main pieces of the avian vocal system for songbirds characterized by tonal songs. In this work we investigated the Great Kiskadee (Pitangus sulfuratus), a suboscine bird whose calls display a rich harmonic content. Using a recently developed mathematical model for the syrinx and a mobile vocal tract, we set up a computational model that provides a plausible reconstruction of the vocal tract movement using a few spectral features taken from the utterances. Moreover, synthetic calls were generated using the articulated vocal tract that accounts for all the acoustical features observed experimentally.
Hamaguchi, Kosuke; Mooney, Richard
2012-01-01
Complex brain functions, such as the capacity to learn and modulate vocal sequences, depend on activity propagation in highly distributed neural networks. To explore the synaptic basis of activity propagation in such networks, we made dual in vivo intracellular recordings in anesthetized zebra finches from the input (nucleus HVC) and output (lateral magnocellular nucleus of the anterior nidopallium (LMAN)) neurons of a songbird cortico-basal ganglia (BG) pathway necessary to the learning and modulation of vocal motor sequences. These recordings reveal evidence of bidirectional interactions, rather than only feedforward propagation of activity from HVC to LMAN, as had been previously supposed. A combination of dual and triple recording configurations and pharmacological manipulations was used to map out circuitry by which activity propagates from LMAN to HVC. These experiments indicate that activity travels to HVC through at least two independent ipsilateral pathways, one of which involves fast signaling through a midbrain dopaminergic cell group, reminiscent of recurrent mesocortical loops described in mammals. We then used in vivo pharmacological manipulations to establish that augmented LMAN activity is sufficient to restore high levels of sequence variability in adult birds, suggesting that recurrent interactions through highly distributed forebrain – midbrain pathways can modulate learned vocal sequences. PMID:22915110
Kisko, Theresa M; Himmler, Brett T; Himmler, Stephanie M; Euston, David R; Pellis, Sergio M
2015-02-01
During playful interactions, juvenile rats emit many 50-kHz ultrasonic vocalizations, which are associated with a positive affective state. In addition, these calls may also serve a communicative role - as play signals that promote playful contact. Consistent with this hypothesis, a previous study found that vocalizations are more frequent prior to playful contact than after contact is terminated. The present study uses devocalized rats to test three predictions arising from the play signals hypothesis. First, if vocalizations are used to facilitate contact, then in pairs of rats in which one is devocalized, the higher frequency of pre-contact calling should only be present when the intact rat is initiating the approach. Second, when both partners in a playing pair are devocalized, the frequency of play should be reduced and the typical pattern of playful wrestling disrupted. Finally, when given a choice to play with a vocal and a non-vocal partner, rats should prefer to play with the one able to vocalize. The second prediction was supported in that the frequency of playful interactions as well as some typical patterns of play was disrupted. Even though the data for the other two predictions did not produce the expected findings, they support the conclusion that, in rats, 50-kHz calls are likely to function to maintain a playful mood and for them to signal to one another during play fighting. Copyright © 2014 Elsevier B.V. All rights reserved.
Transgender Phonosurgery: A Systematic Review and Meta-analysis.
Song, Tara Elena; Jiang, Nancy
2017-05-01
Objectives Different surgical techniques have been described in the literature to increase vocal pitch. The purpose of this study is to systematically review these surgeries and perform a meta-analysis to determine which technique increases pitch the most. Data Sources CINAHL, Cochrane, Embase, Medline, PubMed, and Science Direct. Review Methods A systematic review and meta-analysis of the literature was performed using the CINAHL, Cochrane, Embase, Medline, PubMed, and Science Direct databases. Studies were eligible for inclusion if they evaluated pitch-elevating phonosurgical techniques in live humans and performed pre- and postoperative acoustic analysis. Data were gathered regarding surgical technique, pre- and postoperative fundamental frequencies, perioperative care measures, and complications. Results Twenty-nine studies were identified. After applying inclusion and exclusion criteria, a total of 13 studies were included in the meta-analysis. Mechanisms of pitch elevation included increasing vocal cord tension (cricothyroid approximation), shortening the vocal cord length (cold knife glottoplasty, laser-shortening glottoplasty), and decreasing mass (laser reduction glottoplasty). The most common interventions were shortening techniques and cricothyroid approximation (6 studies each). The largest increase in fundamental frequency was seen with techniques that shortened the vocal cords. Preoperative speech therapy, postoperative voice rest, and reporting of patient satisfaction were inconsistent. Many of the studies were limited by low power and short length of follow-up. Conclusions Multiple techniques for elevation of vocal pitch exist, but vocal cord shortening procedures appear to result in the largest increase in fundamental frequency.
Infant-Mother Vocalization Patterns: A Replication and Extension.
ERIC Educational Resources Information Center
Kilbourne, Brock K.; Ginsburg, Gerald P.
This study reports a replication of an earlier study by Kilbourne and Ginsberg (1980) which indicated the occurrence of a transition from predominantly coacting to predominantly alternating infant-mother vocalization patterns. In addition, the present study examined the modulating influences of nursing activity and mother's focus of attention upon…
Granqvist, Svante; Simberg, Susanna; Hertegård, Stellan; Holmqvist, Sofia; Larsson, Hans; Lindestad, Per-Åke; Södersten, Maria; Hammarberg, Britta
2015-10-01
Phonation into glass tubes ('resonance tubes'), keeping the free end of the tube in water, has been a frequently used voice therapy method in Finland and more recently also in other countries. The purpose of this exploratory study was to investigate what effects tube phonation with and without water has on the larynx. Two participants were included in the study. The methods used were high-speed imaging, electroglottographic observations of vocal fold vibrations, and measurements of oral pressure during tube phonation. Results showed that the fluctuation in the back pressure during tube phonation in water altered the vocal fold vibrations. In the high-speed imaging, effects were found in the open quotient and amplitude variation of the glottal opening. The open quotient increased with increasing water depth (from 2 cm to 6 cm). A modulation effect by the water bubbles on the vocal fold vibrations was seen both in the high-speed glottal area tracings and in the electroglottography signal. A second experiment revealed that the increased average oral pressure was largely determined by the water depth. The increased open quotient can possibly be explained by an increased abduction of the vocal folds and/or a reduced transglottal pressure. The back pressure of the bubbles also modulates glottal vibrations with a possible 'massage' effect on the vocal folds. This effect and the well-defined average pressure increase due to the known water depth are different from those of other methods using a semi-occluded vocal tract.
Shear properties of vocal fold mucosal tissues and their effect on vocal fold oscillation
NASA Astrophysics Data System (ADS)
Chan, Roger Wai Kai
Viscoelastic shear properties of vocal fold mucosal tissues and phonosurgical biomaterials were measured with a parallel-plate rotational rheometer. Elastic, viscous and damping properties were quantified as a function of frequency (0.01 Hz to 15 Hz) for human vocal fold mucosal tissues (N = 15), implantable biomaterials commonly used in the treatment of vocal fold paralysis (Teflon, gelatin, and collagen) (the non-mucosal group), and biomaterials currently or potentially useful in the treatment of vocal fold mucosal defects (adipose tissue or fat, hyaluronic acid, and fibronectin) (the mucosal group). It was found that intersubject differences as large as an order of magnitude were often observed for the shear properties of vocal fold mucosal tissues, part of which may be age- and gender-related. Shear properties of the non-mucosal group biomaterials were often much higher than those of the mucosal group biomaterials, which were relatively close to the shear properties of mucosal tissues. Viscoelastic and rheological modeling showed that shear properties of human vocal fold mucosa may be described by a quasi-linear viscoelastic theory and a statistical network theory, based upon which extrapolations to audio frequencies were possible. A theory of small-amplitude vocal fold oscillation was revisited to describe the effects of tissue shear properties on vocal fold oscillation and phonation threshold pressure, a measure of the 'ease' of phonation and an objective indication of vocal function. It was found that phonation threshold pressure is directly related to the viscous shear modulus or the 'effective damping modulus', a concept proposed to quantify the effective amount of damping in vocal fold oscillation. The mucosal group biomaterials were incorporated into the artificial vocal fold mucosa of a physical model in order to empirically assess their effects on phonation threshold pressure. Results showed that higher threshold pressures were consistently observed for higher concentrations of hyaluronic acid and for hyaluronic acid mixed with fibronectin, in correlation with their differences in viscous shear modulus and effective damping modulus. Implications for phonosurgery were discussed in terms of the choice of optimal biomaterials for the surgical management of vocal fold mucosal defects and lamina propria deficiencies.
A hypothesis on a role of oxytocin in the social mechanisms of speech and vocal learning.
Theofanopoulou, Constantina; Boeckx, Cedric; Jarvis, Erich D
2017-08-30
Language acquisition in humans and song learning in songbirds naturally happen as a social learning experience, providing an excellent opportunity to reveal social motivation and reward mechanisms that boost sensorimotor learning. Our knowledge about the molecules and circuits that control these social mechanisms for vocal learning and language is limited. Here we propose a hypothesis of a role for oxytocin (OT) in the social motivation and evolution of vocal learning and language. Building upon existing evidence, we suggest specific neural pathways and mechanisms through which OT might modulate vocal learning circuits in specific developmental stages. © 2017 The Authors.
A hypothesis on a role of oxytocin in the social mechanisms of speech and vocal learning
Jarvis, Erich D.
2017-01-01
Language acquisition in humans and song learning in songbirds naturally happen as a social learning experience, providing an excellent opportunity to reveal social motivation and reward mechanisms that boost sensorimotor learning. Our knowledge about the molecules and circuits that control these social mechanisms for vocal learning and language is limited. Here we propose a hypothesis of a role for oxytocin (OT) in the social motivation and evolution of vocal learning and language. Building upon existing evidence, we suggest specific neural pathways and mechanisms through which OT might modulate vocal learning circuits in specific developmental stages. PMID:28835557
Volodin, Ilya A; Zaytseva, Alexandra S; Ilchenko, Olga G; Volodina, Elena V; Chebotareva, Anastasia L
2012-08-15
Self-produced seismic vibrations have been found for some subterranean rodents but have not been reported for any Insectivora species, although seismic sensitivity has been confirmed for blind sand-dwelling chrysochlorid golden moles. Studying the vocal behaviour of captive piebald shrews, Diplomesodon pulchellum, we documented vibrations, apparently generated by the whole-body wall muscles, from 11 (5 male, 6 female) of 19 animals, placed singly on a drum membrane. The airborne waves of the vibratory drumming were digitally recorded and then analysed spectrographically. The mean frequency of vibration was 160.5 Hz. This frequency matched the periodicity of the deep sinusoidal frequency modulation (159.4 Hz) found in loud screech calls of the same subjects. The body vibration was not related to thermoregulation, hunger-related depletion of energy resources or fear, as it was produced by well-fed, calm animals, at warm ambient temperatures. We hypothesize that in the solitary, nocturnal, digging desert piebald shrew, body vibrations may be used for seismic exploration of substrate density, to avoid energy-costly digging of packed sand for burrowing and foraging. At the same time, the piercing quality of screech calls due to the deep sinusoidal frequency modulation, matching the periodicity of body vibration, may be important for agonistic communication in this species.
Changes in Infants' Vocalizations as a Function of Differential Acoustic Stimulation
ERIC Educational Resources Information Center
Webster, R. L.; And Others
1972-01-01
Results of this study indicated that the frequency of an auditory stimulus is a dimension to which infants differentially respond in terms of response rate and acoustic characteristics of their vocalizations. (Authors)
Acoustic, respiratory kinematic and electromyographic effects of vocal training
NASA Astrophysics Data System (ADS)
Mendes, Ana Paula De Brito Garcia
The longitudinal effects of vocal training on the respiratory, phonatory and articulatory systems were investigated in this study. During four semesters, fourteen voice major students were recorded while speaking and singing. Acoustic, temporal, respiratory kinematic and electromyographic parameters were measured to determine changes in the three systems as a function of vocal training. Acoustic measures of the speaking voice included fundamental frequency, sound pressure level (SPL), percent jitter and shimmer, and harmonic-to-noise ratio. Temporal measures included duration of sentences, diphthongs and the closure durations of stop consonants. Acoustic measures of the singing voice included fundamental frequency and sound pressure level of the phonational range, vibrato pulses per second, vibrato amplitude variation and the presence of the singer's formant. Analysis of the data revealed that vocal training had a significant effect on the singing voice. Fundamental frequency and SPL of the 90% level and 90--10% of the phonational range increased significantly during four semesters of vocal training. Physiological data was collected from four subjects during three semesters of vocal training. Respiratory kinematic measures included lung volume, rib cage and abdominal excursions extracted from spoken sung samples. Descriptive statistics revealed that rib cage and abdominal excursions increased from the 1st to the 2nd semester and decrease from the 2nd to the 3rd semester of vocal training. Electromyographic measures of the pectoralis major, rectus abdominis and external obliques muscles revealed that burst duration means decreased from the 1st to the 2nd semester and increased from the 2nd to the 3rd semester. Peak amplitude means increased from the 1st to the 2nd and decreased from the 2nd to the 3rd semester of vocal training. Chest wall excursions and muscle force generation of the three muscles increased as the demanding level and the length of the phonatory tasks increased.
NASA Astrophysics Data System (ADS)
Edwards, Sharry K.
2005-04-01
Over the past 20+ years the pioneering field of Human Bioacoustics, which includes voice spectral analysis, has begun to model the frequencies and architecture of human vocalizations to identify the innate mathematical templates found within the various system of the human body. Using the idea that the voice is a holographic representation of health and wellness, these non-invasive techniques are being advanced to the extent that a computerized Vocal Profile, using a system of Frequency Equivalents, can be used to accurately quantify, organize, interpret, define, and extrapolate biometric information from the human voice. This information, in turn, provides the opportunity to predict, direct, and maintain intrinsic form and function. This novel approach has provided an accumulation of significant data but until recently has been without an efficient biological framework of reference. The emerging Mathematical Model being assembled through Human Bioacoustic research likely has the potential to allow Vocal Profiling to be used to predict and monitor health issues from the very first cries of a newborn through the frequency foundations of disease and aging.
Frequency response of synthetic vocal fold models with linear and nonlinear material properties.
Shaw, Stephanie M; Thomson, Scott L; Dromey, Christopher; Smith, Simeon
2012-10-01
The purpose of this study was to create synthetic vocal fold models with nonlinear stress-strain properties and to investigate the effect of linear versus nonlinear material properties on fundamental frequency (F0) during anterior-posterior stretching. Three materially linear and 3 materially nonlinear models were created and stretched up to 10 mm in 1-mm increments. Phonation onset pressure (Pon) and F0 at Pon were recorded for each length. Measurements were repeated as the models were relaxed in 1-mm increments back to their resting lengths, and tensile tests were conducted to determine the stress-strain responses of linear versus nonlinear models. Nonlinear models demonstrated a more substantial frequency response than did linear models and a more predictable pattern of F0 increase with respect to increasing length (although range was inconsistent across models). Pon generally increased with increasing vocal fold length for nonlinear models, whereas for linear models, Pon decreased with increasing length. Nonlinear synthetic models appear to more accurately represent the human vocal folds than do linear models, especially with respect to F0 response.
The effect of voice amplification on occupational vocal dose in elementary school teachers.
Gaskill, Christopher S; O'Brien, Shenendoah G; Tinter, Sara R
2012-09-01
Two elementary school teachers, one with and one without a history of vocal complaints, wore a vocal dosimeter all day at school for a 3-week period. In the second week, each teacher wore a portable voice amplifier. Each teacher showed a reduction in vocal intensity during the week of amplification, with a larger effect for the teacher with vocal difficulties. This teacher also showed a decrease in hourly vocal fold distance dose as measured by the dosimeter despite incurring longer phonation times. Fundamental frequency and vocal fold cycle dose did not appear to be affected by the use of amplification during the teaching day. Both teachers showed evidence of a possible moderate effect of adjusting vocal intensity in the week after amplification, possibly as a means to recalibrate their perceived vocal loudness. This study demonstrates the usefulness of both vocal dosimetry and amplification in monitoring and modifying vocal dose in an occupational setting and reinforces previous data suggesting the effectiveness of amplification in reducing the vocal load in schoolteachers. Implications of the data for future research regarding prevention and treatment of occupational voice disorders are discussed. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Chen, Shiang-Fan; Jones, Gareth; Rossiter, Stephen J.
2009-01-01
The origin and maintenance of intraspecific variation in vocal signals is important for population divergence and speciation. Where vocalizations are transmitted by vertical cultural inheritance, similarity will reflect co-ancestry, and thus vocal divergence should reflect genetic structure. Horseshoe bats are characterized by echolocation calls dominated by a constant frequency component that is partly determined by maternal imprinting. Although previous studies showed that constant frequency calls are also influenced by some non-genetic factors, it is not known how frequency relates to genetic structure. To test this, we related constant frequency variation to genetic and non-genetic variables in the Formosan lesser horseshoe bat (Rhinolophus monoceros). Recordings of bats from across Taiwan revealed that females called at higher frequencies than males; however, we found no effect of environmental or morphological factors on call frequency. By comparison, variation showed clear population structure, with frequencies lower in the centre and east, and higher in the north and south. Within these regions, frequency divergence was directional and correlated with geographical distance, suggesting that call frequencies are subject to cultural drift. However, microsatellite clustering analysis showed that broad differences in constant frequency among populations corresponded to discontinuities in allele frequencies resulting from vicariant events. Our results provide evidence that the processes shaping genetic subdivision have concomitant consequences for divergence in echolocation call frequency. PMID:19692399
Chan, Roger W; Rodriguez, Maritza L
2008-08-01
Previous studies reporting the linear viscoelastic shear properties of the human vocal fold cover or mucosa have been based on torsional rheometry, with measurements limited to low audio frequencies, up to around 80 Hz. This paper describes the design and validation of a custom-built, controlled-strain, linear, simple-shear rheometer system capable of direct empirical measurements of viscoelastic shear properties at phonatory frequencies. A tissue specimen was subjected to simple shear between two parallel, rigid acrylic plates, with a linear motor creating a translational sinusoidal displacement of the specimen via the upper plate, and the lower plate transmitting the harmonic shear force resulting from the viscoelastic response of the specimen. The displacement of the specimen was measured by a linear variable differential transformer whereas the shear force was detected by a piezoelectric transducer. The frequency response characteristics of these system components were assessed by vibration experiments with accelerometers. Measurements of the viscoelastic shear moduli (G' and G") of a standard ANSI S2.21 polyurethane material and those of human vocal fold cover specimens were made, along with estimation of the system signal and noise levels. Preliminary results showed that the rheometer can provide valid and reliable rheometric data of vocal fold lamina propria specimens at frequencies of up to around 250 Hz, well into the phonatory range.
Relation between voice disorders and work in a group of Community Health Workers.
Cipriano, Fabiana Gonçalves; Ferreira, Léslie Piccolotto; Servilha, Emilse Aparecida Merlin; Marsiglia, Regina Maria Giffoni
2013-01-01
To analyze the relationship between voice disorders and work in a group of Community Health Agents (CHA). The subjects of this study were 65 CHA working in the city of São Paulo. Thefiinstrument used for data collection was an adaptation of the questionnaire named Conditions of Vocal Production - Teachers (CPV-P). The results were keyed in twice and submitted to statistical analysis, in order to verify: the self-reported frequency of voice disorder frequency of present vocal symptoms, the association among the three most frequently reported present symptoms, and environmental and organizational aspects of work. Of the 65 (100%) CHA in the study, 37 (56.9%) self-reported having present or past vocal disorders. The most frequently reported present symptoms were: dry throat, tiredness when speaking, and burning sensation in the throat. There was significant association between: taking work to home, having personal items stolen, police intervention, violence against employees and vocal symptom dry throat, not having enough time to complete all tasks, difficulty in leaving work, inadequate furniture, intense physical strain, objects stolen from the health unit, racism and vocal symptom tiredness when speaking, dust, job dissatisfaction, work stress, building destruction, drug issues, and vocal symptom burning in throat. Based on the obtained results, the initial hypothesis of association between the development of vocal disorders among the subjects and the adversities present in their work environment and organization was confirmed.
Relationship of the Cricothyroid Space with Vocal Range in Female Singers.
Pullon, Beverley
2017-01-01
This study aims to investigate the relationship between the anterior cricothyroid (CT) space at rest with vocal range in female singers. Potential associations with and between voice categories, age, ethnicity, anthropometric indices, neck dimensions, laryngeal dimensions, vocal data along with habitual speaking fundamental frequency were also explored. This is a cohort study. Laryngeal dimensions anterior CT space and heights of the thyroid and cricoid cartilages were measured using ultrasound in 43 healthy, classically trained, female singers during quiet respiration. Voice categories (soprano and mezzo-soprano), age, ethnicity, weight, height, body mass index, neck circumference and length, anterior thyroid and cricoid cartilage heights, practice and performance vocal range, lowest and highest practice and performance notes along with habitual speaking fundamental frequency were collected. The main finding was that mezzo-sopranos have a significantly wider resting CT space than sopranos (11.6 mm versus 10.4 mm; P = 0.007). Mezzo-sopranos also had significantly lower "lowest and highest" performance notes than sopranos. There was no significant correlation between the magnitudes of the anterior CT space with vocal range. The participants with the narrowest and widest anterior CT space had similar vocal ranges. These results suggest that the CT space is not the major determinant of performance vocal range. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Detection of the Vibration Signal from Human Vocal Folds Using a 94-GHz Millimeter-Wave Radar
Chen, Fuming; Li, Sheng; Zhang, Yang; Wang, Jianqi
2017-01-01
The detection of the vibration signal from human vocal folds provides essential information for studying human phonation and diagnosing voice disorders. Doppler radar technology has enabled the noncontact measurement of the human-vocal-fold vibration. However, existing systems must be placed in close proximity to the human throat and detailed information may be lost because of the low operating frequency. In this paper, a long-distance detection method, involving the use of a 94-GHz millimeter-wave radar sensor, is proposed for detecting the vibration signals from human vocal folds. An algorithm that combines empirical mode decomposition (EMD) and the auto-correlation function (ACF) method is proposed for detecting the signal. First, the EMD method is employed to suppress the noise of the radar-detected signal. Further, the ratio of the energy and entropy is used to detect voice activity in the radar-detected signal, following which, a short-time ACF is employed to extract the vibration signal of the human vocal folds from the processed signal. For validating the method and assessing the performance of the radar system, a vibration measurement sensor and microphone system are additionally employed for comparison. The experimental results obtained from the spectrograms, the vibration frequency of the vocal folds, and coherence analysis demonstrate that the proposed method can effectively detect the vibration of human vocal folds from a long detection distance. PMID:28282892
The Sound Broadcasting System of the Bullfrog
NASA Astrophysics Data System (ADS)
Purgue, Alejandro P.
1995-01-01
This work presents a comparison across selected species of several aspects of the mechanism of sound broadcasting in anuran amphibians. These studies indicate that all anuran species studied to date broadcast their calls through structures that resonate at the dominant frequency in their calls. Measurements of the magnitude of the transfer function of the radiating structures show that the structures responsible for radiating the bulk of the energy present in the call vary depending on the species considered. Bullfrogs (Rana catesbeiana) radiate most of the energy (89% sound level) present in their calls through their eardrums. In this species the transfer function of the eardrum displays several peaks coincident in frequency and amplitude with the energy distribution observed in the mating and release call of the species. The vocal sac and gular area contribute energy only in the lower band (150 to 400 Hz) of the call. The ears are responsible for radiating additional frequency bands to the ones being radiated through the gular area and vocal sacs. This condition appears to be derived. In Rana pipiens the ears also broadcast a significant portion of the energy present in the call (63% sound level) but the frequencies of the aural emissions are a subset of those frequencies radiated through the vocal sac and gular area. Character optimization suggests that this is the primitive condition for ranid frogs. Finally, the barking treefrog (Hyla gratiosa) appears to use two different structures to radiate different portions of the call. The low frequency band appears to be preferentially radiated through the lungs while the high frequency components of the call are radiated through the vocal sac.
Yamauchi, Akihito; Yokonishi, Hisayuki; Imagawa, Hiroshi; Sakakibara, Ken-Ichi; Nito, Takaharu; Tayama, Niro
2016-11-01
The goal of this work was to objectively elucidate the vibratory characteristics of vocal fold paralysis (VFP) using high-speed digital imaging (HSDI). HSDI was performed in 29 vocally healthy subjects (12 women and 17 men) and in 107 patients with VFP (40 women and 67 men). Then, the HSDI data were evaluated by visual-perceptual rating, single-line kymography, multiline kymography, laryngotopography, and glottal area waveform analysis. Patients with VFP compared with vocally healthy subjects revealed more frequent incomplete glottal closure, greater asymmetry in amplitude, mucosal wave, frequency, and phase, as well as larger open quotient, smaller speed index, larger maximal and minimal glottal area, and smaller glottal area difference. Paralyzed vocal folds in VFP revealed reduced mucosal wave than nonparalyzed vocal folds in VFP or in intact vocal folds in vocally healthy subjects. HSDI was effective in documenting the characteristics of vocal fold vibrations in patients with VFP and in exploring the vibratory disturbance for estimating the severity of dysphonia. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Kim, Geunyoung; Walden, Tedra A; Knieps, Linda J
2010-04-01
Studies of infant social referencing have indicated that infants might be more influenced by vocal information contained in emotional messages than by facial expression, especially during fearful message conditions. The present study investigated the characteristics of emotional channels that parents used during social referencing, and corresponding infants' behavioral changes. Results of Study 1 indicated that parents used more vocal information during positive message conditions. Unlike previous findings, infants' behavioral change was related to the frequency of vocal information during positive condition. For fearful messages, infants were more influenced by the number of multi-modal channels used and the frequency of visual information. Study 2 further showed that the intensity of vocal tone was related to infant regulation only during positive message conditions. The results imply that understanding of social context is important to make sense of parent-infant's emotional interaction. Copyright 2010 Elsevier Inc. All rights reserved.
Sigafoos, Jeff; Didden, Robert; O'Reilly, Mark
2003-01-01
We evaluated the role of digitized speech output on the maintenance of requesting and frequency of vocalizations in three children with developmental disabilities. The children were taught to request access to preferred objects using an augmentative communication speech-generating device (SGD). Following acquisition, rates of requesting and vocalizations were compared across two conditions (speech output on versus speech output off) that were alternated on a session-by-session basis. There were no major or consistent differences across the two conditions for the three children, suggesting that access to preferred objects was the critical variable maintaining use of the SGDs. The results also suggest that feedback in the form of digitized speech from the SGD did not inhibit vocalizations. One child began to speak single words during the latter part of the study, suggesting that in some cases AAC intervention involving SGDs may facilitate speech.
Xu, Chet C; Chan, Roger W; Sun, Han; Zhan, Xiaowei
2017-11-01
A mixed-effects model approach was introduced in this study for the statistical analysis of rheological data of vocal fold tissues, in order to account for the data correlation caused by multiple measurements of each tissue sample across the test frequency range. Such data correlation had often been overlooked in previous studies in the past decades. The viscoelastic shear properties of the vocal fold lamina propria of two commonly used laryngeal research animal species (i.e. rabbit, porcine) were measured by a linear, controlled-strain simple-shear rheometer. Along with published canine and human rheological data, the vocal fold viscoelastic shear moduli of these animal species were compared to those of human over a frequency range of 1-250Hz using the mixed-effects models. Our results indicated that tissues of the rabbit, canine and porcine vocal fold lamina propria were significantly stiffer and more viscous than those of human. Mixed-effects models were shown to be able to more accurately analyze rheological data generated from repeated measurements. Copyright © 2017 Elsevier Ltd. All rights reserved.
Berg Soto, Alvaro; Marsh, Helene; Everingham, Yvette; Smith, Joshua N; Parra, Guido J; Noad, Michael
2014-08-01
Australian snubfin and Indo-Pacific humpback dolphins co-occur throughout most of their range in coastal waters of tropical Australia. Little is known of their ecology or acoustic repertoires. Vocalizations from humpback and snubfin dolphins were recorded in two locations along the Queensland coast during 2008 and 2010 to describe their vocalizations and evaluate the acoustic differences between these two species. Broad vocalization types were categorized qualitatively. Both species produced click trains burst pulses and whistles. Principal component analysis of the nine acoustic variables extracted from the whistles produced nine principal components that were input into discriminant function analyses to classify 96% of humpback dolphin whistles and about 78% of snubfin dolphin calls correctly. Results indicate clear acoustic differences between the vocal whistle repertoires of these two species. A stepwise routine identified two principal components as significantly distinguishable between whistles of each species: frequency parameters and frequency trend ratio. The capacity to identify these species using acoustic monitoring techniques has the potential to provide information on presence/absence, habitat use and relative abundance for each species.
Schneider, Berit; Zumtobel, Michaela; Prettenhofer, Walter; Aichstill, Birgitta; Jocher, Werner
2010-03-01
Only limited data on normal vocal constitution and vocal capabilities in school-aged children are available. To take better care of children's voices, it might be helpful to know voice ranges and limits of not only vocally trained but also vocally untrained children. Goal of this study was the evaluation of singing voice capabilities of vocally healthy children with different social and vocal/musical backgrounds using voice range profile measurements (VRP). VRP percentiles that reflect constitutional aspects were suggested. In this cross-sectional study, 186 children (aged between seven and 10 years), attending five schools, were included. VRP measurements were performed under field conditions. Interviews and questionnaires regarding vocal strain and vocal training were applied; the answers were used for classification of singing activity and vocal training (KLASAK). All children reached a mean singing voice range of at least two octaves. By using the answers of interviews and questionnaires, the children could be classified according to vocal strain and vocal training. The groups showed no significant differences regarding VRP measurements. In the following step, percentiles were calculated. Twenty-five percent of all children (P25) reached a minimum voice range of almost two octaves, namely, 22 semitones (ST) from 220 to 784 Hz with soft and loud singing. Half of the children (P50) had a voice range of 24 ST (2 octaves), while soft singing and a larger voice range of 26 ST while loud singing. The measurements of third quartile (P75) revealed that 25% of children have even a larger voice range than 29 dB (from 196 Hz/g to 1047 Hz/c3) and can sing at most frequencies louder than 90 dB. P90 demonstrated that 10% of the children can sing even lower or higher than the frequency range between 196 Hz/g and 1319 Hz/e3 analyzed. The voice range seems not to be constrained by social but by voice/musical background: children of vocally/musically encouraged schools had wider voice ranges. This underlines the necessity of regular singing lessons already in primary schools. The percentile VRP introduced might help to evaluate the vocal constitution and vocal capabilities of a child. Copyright (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Luo, Huan; Wang, Yadong; Poeppel, David; Simon, Jonathan Z
2007-12-01
Complex natural sounds (e.g., animal vocalizations or speech) can be characterized by specific spectrotemporal patterns the components of which change in both frequency (FM) and amplitude (AM). The neural coding of AM and FM has been widely studied in humans and animals but typically with either pure AM or pure FM stimuli. The neural mechanisms employed to perceptually unify AM and FM acoustic features remain unclear. Using stimuli with simultaneous sinusoidal AM (at rate f(AM) = 37 Hz) and FM (with varying rates f(FM)), magnetoencephalography (MEG) is used to investigate the elicited auditory steady-state response (aSSR) at relevant frequencies (f(AM), f(FM), f(AM) + f(FM)). Previous work demonstrated that for sounds with slower FM dynamics (f(FM) < 5 Hz), the phase of the aSSR at f(AM) tracked the FM; in other words, AM and FM features were co-tracked and co-represented by "phase modulation" encoding. This study explores the neural coding mechanism for stimuli with faster FM dynamics (< or =30 Hz), demonstrating that at faster rates (f(FM) > 5 Hz), there is a transition from pure phase modulation encoding to a single-upper-sideband (SSB) response (at frequency f(AM) + f(FM)) pattern. We propose that this unexpected SSB response can be explained by the additional involvement of subsidiary AM encoding responses simultaneously to, and in quadrature with, the ongoing phase modulation. These results, using MEG to reveal a possible neural encoding of specific acoustic properties, demonstrate more generally that physiological tests of encoding hypotheses can be performed noninvasively on human subjects, complementing invasive, single-unit recordings in animals.
Luo, Xin; Fu, Qian-Jie; Galvin, John J.
2007-01-01
The present study investigated the ability of normal-hearing listeners and cochlear implant users to recognize vocal emotions. Sentences were produced by 1 male and 1 female talker according to 5 target emotions: angry, anxious, happy, sad, and neutral. Overall amplitude differences between the stimuli were either preserved or normalized. In experiment 1, vocal emotion recognition was measured in normal-hearing and cochlear implant listeners; cochlear implant subjects were tested using their clinically assigned processors. When overall amplitude cues were preserved, normal-hearing listeners achieved near-perfect performance, whereas listeners with cochlear implant recognized less than half of the target emotions. Removing the overall amplitude cues significantly worsened mean normal-hearing and cochlear implant performance. In experiment 2, vocal emotion recognition was measured in listeners with cochlear implant as a function of the number of channels (from 1 to 8) and envelope filter cutoff frequency (50 vs 400 Hz) in experimental speech processors. In experiment 3, vocal emotion recognition was measured in normal-hearing listeners as a function of the number of channels (from 1 to 16) and envelope filter cutoff frequency (50 vs 500 Hz) in acoustic cochlear implant simulations. Results from experiments 2 and 3 showed that both cochlear implant and normal-hearing performance significantly improved as the number of channels or the envelope filter cutoff frequency was increased. The results suggest that spectral, temporal, and overall amplitude cues each contribute to vocal emotion recognition. The poorer cochlear implant performance is most likely attributable to the lack of salient pitch cues and the limited functional spectral resolution. PMID:18003871
Spornick, Nicholas; Guptill, Virginia; Koziol, Deloris; Wesley, Robert; Finkel, Julia; Quezado, Zenaide M.N.
2012-01-01
Sine-wave electrical stimulation at frequencies 2000, 250, and 5 Hz to respectively evaluate Aβ, Aδ, and C sensory neurons has recently been added to the armamentarium used to evaluate sensory neurons. We developed an automated nociception assay using sine-wave stimulation methodology to determine current vocalization threshold in response to 2000, 250, and 5 Hz and examine the effects of sex, analgesics, and anesthetics in mice. At baseline, males had significantly higher mean current vocalization thresholds compared with female mice at 2000, 250, and 5 Hz (p ≤ 0.019). By 1 h after intrathecal injections of morphine there were significant increases in current vocalization threshold percent changes from baseline that varied with doses (p = 0.0001) and frequency used (p < 0.0001). Specifically, with increasing doses of morphine, there were significantly greater increases in current vocalization threshold percent changes from baseline in response to 5 Hz compared with 250 and 2000 Hz stimulation in a significantly ordered pattern: 5 Hz > 250 Hz (p < 0.0001) and 250 Hz > 2000 Hz (p = 0.0002). Forty-five minutes after exposure, there were no effects of isoflurane on current vocalization thresholds at any frequency. Therefore, our findings suggest that this automated nociception assay using sine-wave stimulation in mice, can be valuable for measurements of the effects of sex, opioids, and anesthetics on the response to electrical stimuli that preferentially stimulate Aβ, Aδ, and C-sensory fibers in vivo. This investigation suggests the validation of this assay and supports its use to examine mechanisms of nociception in mice. PMID:21864576
Narayanan, Shrikanth
2009-01-01
We describe a method for unsupervised region segmentation of an image using its spatial frequency domain representation. The algorithm was designed to process large sequences of real-time magnetic resonance (MR) images containing the 2-D midsagittal view of a human vocal tract airway. The segmentation algorithm uses an anatomically informed object model, whose fit to the observed image data is hierarchically optimized using a gradient descent procedure. The goal of the algorithm is to automatically extract the time-varying vocal tract outline and the position of the articulators to facilitate the study of the shaping of the vocal tract during speech production. PMID:19244005
Cazau, Dorian; Adam, Olivier; Aubin, Thierry; Laitman, Jeffrey T; Reidenberg, Joy S
2016-10-10
Although mammalian vocalizations are predominantly harmonically structured, they can exhibit an acoustic complexity with nonlinear vocal sounds, including deterministic chaos and frequency jumps. Such sounds are normative events in mammalian vocalizations, and can be directly traceable to the nonlinear nature of vocal-fold dynamics underlying typical mammalian sound production. In this study, we give qualitative descriptions and quantitative analyses of nonlinearities in the song repertoire of humpback whales from the Ste Marie channel (Madagascar) to provide more insight into the potential communication functions and underlying production mechanisms of these features. A low-dimensional biomechanical modeling of the whale's U-fold (vocal folds homolog) is used to relate specific vocal mechanisms to nonlinear vocal features. Recordings of living humpback whales were searched for occurrences of vocal nonlinearities (instabilities). Temporal distributions of nonlinearities were assessed within sound units, and between different songs. The anatomical production sources of vocal nonlinearities and the communication context of their occurrences in recordings are discussed. Our results show that vocal nonlinearities may be a communication strategy that conveys information about the whale's body size and physical fitness, and thus may be an important component of humpback whale songs.
NASA Astrophysics Data System (ADS)
Cazau, Dorian; Adam, Olivier; Aubin, Thierry; Laitman, Jeffrey T.; Reidenberg, Joy S.
2016-10-01
Although mammalian vocalizations are predominantly harmonically structured, they can exhibit an acoustic complexity with nonlinear vocal sounds, including deterministic chaos and frequency jumps. Such sounds are normative events in mammalian vocalizations, and can be directly traceable to the nonlinear nature of vocal-fold dynamics underlying typical mammalian sound production. In this study, we give qualitative descriptions and quantitative analyses of nonlinearities in the song repertoire of humpback whales from the Ste Marie channel (Madagascar) to provide more insight into the potential communication functions and underlying production mechanisms of these features. A low-dimensional biomechanical modeling of the whale’s U-fold (vocal folds homolog) is used to relate specific vocal mechanisms to nonlinear vocal features. Recordings of living humpback whales were searched for occurrences of vocal nonlinearities (instabilities). Temporal distributions of nonlinearities were assessed within sound units, and between different songs. The anatomical production sources of vocal nonlinearities and the communication context of their occurrences in recordings are discussed. Our results show that vocal nonlinearities may be a communication strategy that conveys information about the whale’s body size and physical fitness, and thus may be an important component of humpback whale songs.
Matthews, Leanna P; Parks, Susan E; Fournet, Michelle E H; Gabriele, Christine M; Womble, Jamie N; Klinck, Holger
2017-03-01
Source levels of harbor seal breeding vocalizations were estimated using a three-element planar hydrophone array near the Beardslee Islands in Glacier Bay National Park and Preserve, Alaska. The average source level for these calls was 144 dB RMS re 1 μPa at 1 m in the 40-500 Hz frequency band. Source level estimates ranged from 129 to 149 dB RMS re 1 μPa. Four call parameters, including minimum frequency, peak frequency, total duration, and pulse duration, were also measured. These measurements indicated that breeding vocalizations of harbor seals near the Beardslee Islands of Glacier Bay National Park are similar in duration (average total duration: 4.8 s, average pulse duration: 3.0 s) to previously reported values from other populations, but are 170-220 Hz lower in average minimum frequency (78 Hz).
Burgdorf, Jeffrey; Moskal, Joseph R; Brudzynski, Stefan M; Panksepp, Jaak
2013-08-15
Early childhood autism is characterized by deficits in social approach and play behaviors, socio-emotional relatedness, and communication/speech abnormalities, as well as repetitive behaviors. These core neuropsychological features of autism can be modeled in laboratory rats, and the results may be useful for drug discovery and therapeutic development. We review data that show that rats selectively bred for low rates of play-related pro-social ultrasonic vocalizations (USVs) can be used to model social deficit symptoms of autism. Low-line animals engage in less social contact time with conspecifics, show lower rates of play induced pro-social USVs, and show an increased proportion of non-frequency modulated (i.e. monotonous) ultrasonic vocalizations compared to non-selectively bred random-line animals. Gene expression patterns in the low-line animals show significant enrichment in autism-associated genes, and the NMDA receptor family was identified as a significant hub. Treatment of low-line animals with the NMDAR functional glycine site partial agonist, GLYX-13, rescued the deficits in play-induced pro-social 50-kHz USVs and reduced monotonous USVs. Since the NMDA receptor has been implicated in the genesis of autistic symptoms, it is possible that GLYX-13 may be of therapeutic value in the treatment of autism. Copyright © 2013 Elsevier B.V. All rights reserved.
Perceptual, auditory and acoustic vocal analysis of speech and singing in choir conductors.
Rehder, Maria Inês Beltrati Cornacchioni; Behlau, Mara
2008-01-01
the voice of choir conductors. to evaluate the vocal quality of choir conductors based on the production of a sustained vowel during singing and when speaking in order to observe auditory and acoustic differences. participants of this study were 100 choir conductors, with an equal distribution between genders. Participants were asked to produce the sustained vowel "é" using a singing and speaking voice. Speech samples were analyzed based on auditory-perceptive and acoustic parameters. The auditory-perceptive analysis was carried out by two speech-language pathologist, specialists in this field of knowledge. The acoustic analysis was carried out with the support of the computer software Doctor Speech (Tiger Electronics, SRD, USA, version 4.0), using the Real Analysis module. the auditory-perceptive analysis of the vocal quality indicated that most conductors have adapted voices, presenting more alterations in their speaking voice. The acoustic analysis indicated different values between genders and between the different production modalities. The fundamental frequency was higher in the singing voice, as well as the values for the first formant; the second formant presented lower values in the singing voice, with statistically significant results only for women. the voice of choir conductors is adapted, presenting fewer deviations in the singing voice when compared to the speaking voice. Productions differ based the voice modality, singing or speaking.
Ciucci, Michelle; Ma, Teh-Sheng; Fox, Cynthia; Kane, Jacqueline; Ramig, Lorraine; Schallert, Timothy
2007-01-01
The sensorimotor speech/voice deficits associated with Parkinson Disease have been well-documented in humans. They are largely resistant to pharmacological and surgical treatment, but respond to intensive speech treatment. The mechanisms underlying this phenomenon are not well understood and are difficult to systematically test in humans. Thus we turn to the rat as a model. The purpose of this study is to compare the ultrasonic vocalization (USV) of rats in three conditions: control, haloperidol-induced transient dopamine depletion, and unilateral 6-hydroxydopamine (6-OHDA) induced moderately-severe degeneration of dopamine neurons. It was hypothesized that both dopamine-altered conditions would lead to a change in the features of the USV acoustic signal. Results demonstrated that bandwidth decreased in the dopamine-altered rats. This is the first study to document a degradation of the acoustic signal of frequency-modulated 50-kHz calls as a result of interfering with dopamine synaptic transmission in rats. The data suggest that mild transient dopamine depletion with haloperidol or even unilateral degeneration of dopamine neurons is associated with changes in the USV acoustic signal. Thus, dopaminergic dysfunction appears to influence USV production. This study provides a foundation to examine the role of dopamine in sensorimotor processes underlying USV production and potentially to explore treatments for dopamine deficiency-related impaired vocal outcome. PMID:17397940
Burgdorf, Jeffrey; Moskal, Joseph R.; Brudzynski, Stefan M.; Panksepp, Jaak
2016-01-01
Early childhood autism is characterized by deficits in social approach and play behaviors, socio-emotional relatedness, and communication/speech abnormalities, as well as repetitive behaviors. These core neuropsychological features of autism can be modeled in laboratory rats, and the results may be useful for drug discovery and therapeutic development. We review data that show that rats selectively bred for low rates of play-related pro-social ultrasonic vocalizations (USVs) can be used to model social deficit symptoms of autism. Low-line animals engage in less social contact time with conspecifics, show lower rates of play induced pro-social USVs, and show an increased proportion of non-frequency modulated (i.e. monotonous) ultrasonic vocalizations compared to non-selectively bred random-line animals. Gene expression patterns in the low-line animals show significant enrichment in autism-associated genes, and the NMDA receptor family was identified as a significant hub. Treatment of low-line animals with the NMDAR functional glycine site partial agonist, GLYX-13, rescued the deficits in play-induced pro-social 50-kHz USVs and reduced monotonous USVs. Since the NMDA receptor has been implicated in the genesis of autistic symptoms, it is possible that GLYX-13 may be of therapeutic value in the treatment of autism. PMID:23623884
Physically Challenging Song Traits, Male Quality, and Reproductive Success in House Wrens
Cramer, Emily R. A.
2013-01-01
Physically challenging signals are likely to honestly indicate signaler quality. In trilled bird song two physically challenging parameters are vocal deviation (the speed of sound frequency modulation) and trill consistency (how precisely syllables are repeated). As predicted, in several species, they correlate with male quality, are preferred by females, and/or function in male-male signaling. Species may experience different selective pressures on their songs, however; for instance, there may be opposing selection between song complexity and song performance difficulty, such that in species where song complexity is strongly selected, there may not be strong selection on performance-based traits. I tested whether vocal deviation and trill consistency are signals of male quality in house wrens (Troglodytes aedon), a species with complex song structure. Males’ singing ability did not correlate with male quality, except that older males sang with higher trill consistency, and males with more consistent trills responded more aggressively to playback (although a previous study found no effect of stimulus trill consistency on males’ responses to playback). Males singing more challenging songs did not gain in polygyny, extra-pair paternity, or annual reproductive success. Moreover, none of the standard male quality measures I investigated correlated with mating or reproductive success. I conclude that vocal deviation and trill consistency do not signal male quality in this species. PMID:23527137
Liu, Ying; Hu, Huijing; Jones, Jeffery A; Guo, Zhiqiang; Li, Weifeng; Chen, Xi; Liu, Peng; Liu, Hanjun
2015-08-01
Speakers rapidly adjust their ongoing vocal productions to compensate for errors they hear in their auditory feedback. It is currently unclear what role attention plays in these vocal compensations. This event-related potential (ERP) study examined the influence of selective and divided attention on the vocal and cortical responses to pitch errors heard in auditory feedback regarding ongoing vocalisations. During the production of a sustained vowel, participants briefly heard their vocal pitch shifted up two semitones while they actively attended to auditory or visual events (selective attention), or both auditory and visual events (divided attention), or were not told to attend to either modality (control condition). The behavioral results showed that attending to the pitch perturbations elicited larger vocal compensations than attending to the visual stimuli. Moreover, ERPs were likewise sensitive to the attentional manipulations: P2 responses to pitch perturbations were larger when participants attended to the auditory stimuli compared to when they attended to the visual stimuli, and compared to when they were not explicitly told to attend to either the visual or auditory stimuli. By contrast, dividing attention between the auditory and visual modalities caused suppressed P2 responses relative to all the other conditions and caused enhanced N1 responses relative to the control condition. These findings provide strong evidence for the influence of attention on the mechanisms underlying the auditory-vocal integration in the processing of pitch feedback errors. In addition, selective attention and divided attention appear to modulate the neurobehavioral processing of pitch feedback errors in different ways. © 2015 Federation of European Neuroscience Societies and John Wiley & Sons Ltd.
Acoustic signatures of sound source-tract coupling.
Arneodo, Ezequiel M; Perl, Yonatan Sanz; Mindlin, Gabriel B
2011-04-01
Birdsong is a complex behavior, which results from the interaction between a nervous system and a biomechanical peripheral device. While much has been learned about how complex sounds are generated in the vocal organ, little has been learned about the signature on the vocalizations of the nonlinear effects introduced by the acoustic interactions between a sound source and the vocal tract. The variety of morphologies among bird species makes birdsong a most suitable model to study phenomena associated to the production of complex vocalizations. Inspired by the sound production mechanisms of songbirds, in this work we study a mathematical model of a vocal organ, in which a simple sound source interacts with a tract, leading to a delay differential equation. We explore the system numerically, and by taking it to the weakly nonlinear limit, we are able to examine its periodic solutions analytically. By these means we are able to explore the dynamics of oscillatory solutions of a sound source-tract coupled system, which are qualitatively different from those of a sound source-filter model of a vocal organ. Nonlinear features of the solutions are proposed as the underlying mechanisms of observed phenomena in birdsong, such as unilaterally produced "frequency jumps," enhancement of resonances, and the shift of the fundamental frequency observed in heliox experiments. ©2011 American Physical Society
Acoustic signatures of sound source-tract coupling
Arneodo, Ezequiel M.; Perl, Yonatan Sanz; Mindlin, Gabriel B.
2014-01-01
Birdsong is a complex behavior, which results from the interaction between a nervous system and a biomechanical peripheral device. While much has been learned about how complex sounds are generated in the vocal organ, little has been learned about the signature on the vocalizations of the nonlinear effects introduced by the acoustic interactions between a sound source and the vocal tract. The variety of morphologies among bird species makes birdsong a most suitable model to study phenomena associated to the production of complex vocalizations. Inspired by the sound production mechanisms of songbirds, in this work we study a mathematical model of a vocal organ, in which a simple sound source interacts with a tract, leading to a delay differential equation. We explore the system numerically, and by taking it to the weakly nonlinear limit, we are able to examine its periodic solutions analytically. By these means we are able to explore the dynamics of oscillatory solutions of a sound source-tract coupled system, which are qualitatively different from those of a sound source-filter model of a vocal organ. Nonlinear features of the solutions are proposed as the underlying mechanisms of observed phenomena in birdsong, such as unilaterally produced “frequency jumps,” enhancement of resonances, and the shift of the fundamental frequency observed in heliox experiments. PMID:21599213
Vocal fundamental and formant frequencies affect perceptions of speaker cooperativeness.
Knowles, Kristen K; Little, Anthony C
2016-01-01
In recent years, the perception of social traits in faces and voices has received much attention. Facial and vocal masculinity are linked to perceptions of trustworthiness; however, while feminine faces are generally considered to be trustworthy, vocal trustworthiness is associated with masculinized vocal features. Vocal traits such as pitch and formants have previously been associated with perceived social traits such as trustworthiness and dominance, but the link between these measurements and perceptions of cooperativeness have yet to be examined. In Experiment 1, cooperativeness ratings of male and female voices were examined against four vocal measurements: fundamental frequency (F0), pitch variation (F0-SD), formant dispersion (Df), and formant position (Pf). Feminine pitch traits (F0 and F0-SD) and masculine formant traits (Df and Pf) were associated with higher cooperativeness ratings. In Experiment 2, manipulated voices with feminized F0 were found to be more cooperative than voices with masculinized F0(,) among both male and female speakers, confirming our results from Experiment 1. Feminine pitch qualities may indicate an individual who is friendly and non-threatening, while masculine formant qualities may reflect an individual that is socially dominant or prestigious, and the perception of these associated traits may influence the perceived cooperativeness of the speakers.
Automated Assessment of Child Vocalization Development Using LENA.
Richards, Jeffrey A; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance
2017-07-12
To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Assessment was based on full-day audio recordings collected in a child's unrestricted, natural language environment. AVA estimates were derived using automatic speech recognition modeling techniques to categorize and quantify the sounds in child vocalizations (e.g., protophones and phonemes). These were expressed as phone and biphone frequencies, reduced to principal components, and inputted to age-based multiple linear regression models to predict independently collected criterion-expressive language scores. From these models, we generated vocal development AVA estimates as age-standardized scores and development age estimates. AVA estimates demonstrated strong statistical reliability and validity when compared with standard criterion expressive language assessments. Automated analysis of child vocalizations extracted from full-day recordings in natural settings offers a novel and efficient means to assess children's expressive vocal development. More research remains to identify specific mechanisms of operation.
Zhang, Zhaoyan
2015-01-01
Maintaining a small glottal opening across a large range of voice conditions is critical to normal voice production. This study investigated the effectiveness of vocal fold approximation and stiffening in regulating glottal opening and airflow during phonation, using a three-dimensional numerical model of phonation. The results showed that with increasing subglottal pressure the vocal folds were gradually pushed open, leading to increased mean glottal opening and flow rate. A small glottal opening and a mean glottal flow rate typical of human phonation can be maintained against increasing subglottal pressure by proportionally increasing the degree of vocal fold approximation for low to medium subglottal pressures and vocal fold stiffening at high subglottal pressures. Although sound intensity was primarily determined by the subglottal pressure, the results suggest that, to maintain small glottal opening as the sound intensity increases, one has to simultaneously tighten vocal fold approximation and/or stiffen the vocal folds, resulting in increased glottal resistance, vocal efficiency, and fundamental frequency. PMID:25698022
Pasch, Bret; Abbasi, Mustafa Z; Wilson, Macey; Zhao, Daniel; Searle, Jeremy B; Webster, Michael S; Rice, Aaron N
2016-04-01
Nutritional stress can have lasting impacts on the development of traits involved in vocal production. Cross-fostering experiments are often used to examine the propensity for vocal learning in a variety of taxa, but few studies assess the influence of malnourishment that can occur as a byproduct of this technique. In this study, we reciprocally cross-fostered sister taxa of voluble grasshopper mice (genus Onychomys) to explore their propensity for vocal learning. Vocalizations of Onychomys leucogaster did not differ between control and cross-fostered animals, but cross-fostered Onychomys arenicola produced vocalizations that were higher in frequency in a direction away from tutors. These same animals exhibited a transient reduction in body mass early in development, indicative of malnutrition. Our findings simultaneously refute vocal learning and support the developmental stress hypothesis to highlight the importance of early ontogeny on the production of vocalizations later in life. Copyright © 2016 Elsevier Inc. All rights reserved.
Low frequency dove coos vary across noise gradients in an urbanized environment.
Guo, Fengyi; Bonebrake, Timothy C; Dingle, Caroline
2016-08-01
Urbanization poses a challenge to bird communication due to signal masking by ambient noise and reflective surfaces that lead to signal degradation. Bird species (especially oscines) have been shown to alter their singing behaviour to increase signal efficiency in highly urbanized environments. However, few studies on the effects of noise on song structure have included birds with low frequency vocal signals which may be especially vulnerable to noise pollution due to significant frequency overlap of their signals with traffic noise. We compared the perch coos of spotted doves (Streptopelia chinensis), a species with very low frequency vocalizations, in different background noise levels across urban and peri-urban areas in Hong Kong. We documented a 10% upward shift in the minimum frequency of coos of spotted doves across the noise gradient (a relatively small but significant shift), and a reduced maximum frequency in urban habitats with a higher density of built up area. Hong Kong doves had significantly higher minimum and maximum frequencies than doves from throughout their range (from mostly rural sites). Our results indicate that urban species with extremely low sound frequencies such as doves can alter their vocalizations in response to variable urban acoustic environments. Copyright © 2016 Elsevier B.V. All rights reserved.
Acoustic Analysis and Electroglottography in Elite Vocal Performers.
Villafuerte-Gonzalez, Rocio; Valadez-Jimenez, Victor M; Sierra-Ramirez, Jose A; Ysunza, Pablo Antonio; Chavarria-Villafuerte, Karen; Hernandez-Lopez, Xochiquetzal
2017-05-01
Acoustic analysis of voice (AAV) and electroglottography (EGG) have been used for assessing vocal quality in patients with voice disorders. The effectiveness of these procedures for detecting mild disturbances in vocal quality in elite vocal performers has been controversial. To compare acoustic parameters obtained by AAV and EGG before and after vocal training to determine the effectiveness of these procedures for detecting vocal improvements in elite vocal performers. Thirty-three elite vocal performers were studied. The study group included 14 males and 19 females, ages 18-40 years, without a history of voice disorders. Acoustic parameters were obtained through AAV and EGG before and after vocal training using the Linklater method. Nonsignificant differences (P > 0.05) were found between values of fundamental frequency (F 0 ), shimmer, and jitter obtained by both procedures before vocal training. Mean F 0 was similar after vocal training. Jitter percentage as measured by AAV showed nonsignificant differences (P > 0.05) before and after vocal training. Shimmer percentage as measured by AAV demonstrated a significant reduction (P < 0.05) after vocal training. As measured by EGG after vocal training, shimmer and jitter were significantly reduced (P < 0.05); open quotient was significantly increased (P < 0.05); and irregularity was significantly reduced (P < 0.05). AAV and EGG were effective for detecting improvements in vocal function after vocal training in male and female elite vocal performers undergoing vocal training. EGG demonstrated better efficacy for detecting improvements and provided additional parameters as compared to AAV. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Guo, Zhiqiang; Wu, Xiuqin; Li, Weifeng; Jones, Jeffery A; Yan, Nan; Sheft, Stanley; Liu, Peng; Liu, Hanjun
2017-10-25
Although working memory (WM) is considered as an emergent property of the speech perception and production systems, the role of WM in sensorimotor integration during speech processing is largely unknown. We conducted two event-related potential experiments with female and male young adults to investigate the contribution of WM to the neurobehavioural processing of altered auditory feedback during vocal production. A delayed match-to-sample task that required participants to indicate whether the pitch feedback perturbations they heard during vocalizations in test and sample sequences matched, elicited significantly larger vocal compensations, larger N1 responses in the left middle and superior temporal gyrus, and smaller P2 responses in the left middle and superior temporal gyrus, inferior parietal lobule, somatosensory cortex, right inferior frontal gyrus, and insula compared with a control task that did not require memory retention of the sequence of pitch perturbations. On the other hand, participants who underwent extensive auditory WM training produced suppressed vocal compensations that were correlated with improved auditory WM capacity, and enhanced P2 responses in the left middle frontal gyrus, inferior parietal lobule, right inferior frontal gyrus, and insula that were predicted by pretraining auditory WM capacity. These findings indicate that WM can enhance the perception of voice auditory feedback errors while inhibiting compensatory vocal behavior to prevent voice control from being excessively influenced by auditory feedback. This study provides the first evidence that auditory-motor integration for voice control can be modulated by top-down influences arising from WM, rather than modulated exclusively by bottom-up and automatic processes. SIGNIFICANCE STATEMENT One outstanding question that remains unsolved in speech motor control is how the mismatch between predicted and actual voice auditory feedback is detected and corrected. The present study provides two lines of converging evidence, for the first time, that working memory cannot only enhance the perception of vocal feedback errors but also exert inhibitory control over vocal motor behavior. These findings represent a major advance in our understanding of the top-down modulatory mechanisms that support the detection and correction of prediction-feedback mismatches during sensorimotor control of speech production driven by working memory. Rather than being an exclusively bottom-up and automatic process, auditory-motor integration for voice control can be modulated by top-down influences arising from working memory. Copyright © 2017 the authors 0270-6474/17/3710324-11$15.00/0.
Great talent, excellent voices-no problem for pubertal girls?
Decoster, Wivine; Ghesquiere, Sofie; Van Steenberge, Sebastiaan
2008-01-01
This research on 17 girls (aged 9;9 y to 16;11 y) singing in an established choir was focused on two issues: 1) the variety in physical and vocal development using Gackle's model, and 2) the matching of vocal demands and abilities. Developmental and acoustical data on the speaking and singing voice revealed considerable variation between individual girl singers. The model was greatly applicable. However, all girls had a greater total singing range, mainly in favour of the lower tones, and 11 girls used a lower speaking fundamental frequency. A third of the girls met the vocal and developmental features of their stage at a younger age. Next the lower limit of the frequency range of all girls was several semitones below the lowest notes of the pieces being worked on at the time of the experiment. However the upper limit of the pieces coincided with or exceeded their upper frequency limit.
Musical Structure Modulates Semantic Priming in Vocal Music
ERIC Educational Resources Information Center
Poulin-Charronnat, Benedicte; Bigand, Emmanuel; Madurell, Francois; Peereman, Ronald
2005-01-01
It has been shown that harmonic structure may influence the processing of phonemes whatever the extent of participants' musical expertise [Bigand, E., Tillmann, B., Poulin, B., D'Adamo, D. A., & Madurell, F. (2001). The effect of harmonic context on phoneme monitoring in vocal music. "Cognition," 81, B11-B20]. The present study goes a step further…
Zeskind, Philip Sanford; McMurray, Matthew S.; Garber, Kristin A.; Neuspiel, Juliana M.; Cox, Elizabeth T.; Grewen, Karen M.; Mayes, Linda C.; Johns, Josephine M.
2011-01-01
The purpose of this article is to describe the development of translational methods by which spectrum analysis of human infant crying and rat pup ultrasonic vocalizations (USVs) can be used to assess potentially adverse effects of various prenatal conditions on early neurobehavioral development. The study of human infant crying has resulted in a rich set of measures that has long been used to assess early neurobehavioral insult due to non-optimal prenatal environments, even among seemingly healthy newborn and young infants. In another domain of study, the analysis of rat put USVs has been conducted via paradigms that allow for better experimental control over correlated prenatal conditions that may confound findings and conclusions regarding the effects of specific prenatal experiences. The development of translational methods by which cry vocalizations of both species can be analyzed may provide the opportunity for findings from the two approaches of inquiry to inform one another through their respective strengths. To this end, we present an enhanced taxonomy of a novel set of common measures of cry vocalizations of both human infants and rat pups based on a conceptual framework that emphasizes infant crying as a graded and dynamic acoustic signal. This set includes latency to vocalization onset, duration and repetition rate of expiratory components, duration of inter-vocalization-intervals and spectral features of the sound, including the frequency and amplitude of the fundamental and dominant frequencies. We also present a new set of classifications of rat pup USV waveforms that include qualitative shifts in fundamental frequency, similar to the presence of qualitative shifts in fundamental frequency that have previously been related to insults to neurobehavioral integrity in human infants. Challenges to the development of translational analyses, including the use of different terminologies, methods of recording, and spectral analyses are discussed, as well as descriptions of automated processes, software solutions, and pitfalls. PMID:22028695
Chan, R W
2001-09-01
Empirical data on the viscoelastic shear properties of human vocal-fold mucosa (cover) were recently reported at relatively low frequency (0.01-15 Hz). For the data to become relevant to voice production, attempts have been made to parametrize and extrapolate the data to higher frequencies using constitutive modeling [Chan and Titze, J. Acoust. Soc. Am. 107, 565-580 (2000)]. This study investigated the feasibility of an alternative approach for data extrapolation, namely the principle of time-temperature superposition (TTS). TTS is a hybrid theoretical-empirical approach widely used by rheologists to estimate the viscoelastic properties of polymeric systems at time or frequency scales not readily accessible experimentally. It is based on the observation that for many polymers, the molecular configurational changes that occur in a given time scale at a low temperature correspond to those that occur in a shorter time scale at a higher temperature. Using a rotational rheometer, the elastic shear modulus (G') and viscous shear modulus (G'') of vocal-fold cover (superficial layer of lamina propria) tissue samples were measured at 0.01-15 Hz at relatively low temperatures (5 degrees-37 degrees C). Data were empirically shifted according to TTS, yielding composite "master curves" for predicting the magnitude of the shear moduli at higher frequencies at 37 degrees C. Results showed that TTS may be a feasible approach for estimating the viscoelastic shear properties of vocal-fold tissues at frequencies of phonation (on the order of 100-1000 Hz).
Lopes, Leonardo Wanderley; de Oliveira Florencio, Vanessa; Silva, Priscila Oliveira Costa; da Nóbrega E Ugulino, Ana Celiane; Almeida, Anna Alice
2018-01-04
We aimed to correlate the Vocal Tract Discomfort Scale (VTDS) with the Voice Symptom Scale (VoiSS) for evaluation of patients with dysphonia. In addition, we aimed to compare vocal tract discomfort symptoms in patients with and without self-reported voice problem. This is a descriptive, cross-sectional, and retrospective study. We analyzed 143 women and 62 men with voice disorders, as confirmed by endoscopic larynx examination. All patients completed the VTDS and VoiSS at vocal evaluation. Descriptive statistics and the Spearman correlation test were applied to all variables. The degree of covariance of variables was noted. The Mann-Whitney U test was used to compare the average number of discomfort symptoms among patients with and without self-reported voice problems. A weak to moderate positive correlation was observed between the average number, frequency, and intensity of comfort symptom and the total score, physical domain score, and limitation domain score of the VoiSS. The vocal tract discomfort symptoms and the emotional domain score of the VoiSS were weakly correlated. Patients with self-reported voice problems had a higher number, frequency, and intensity of vocal tract discomfort symptoms. There is correlation between the VTDS and VoiSS scales, with greater references to vocal tract discomfort symptom in patients with self-reported voice problems. Therefore, the discomfort symptoms seem to influence the perception of the impact of a voice problem. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Unusual Volcanic Tremor Observations in Fogo Island, Cape Verde
NASA Astrophysics Data System (ADS)
Custodio, S. I.; Heleno, S. I.
2004-12-01
Volcanic tremor is a ground motion characterized by well-defined frequencies, and has traditionally been explained by the movement of fluids, namely magma, in conduits or cracks (Chouet, 1996). Thus tremor has the potential to reveal key aspects of volcanic structure and dynamics. Two types of previously unreported seismic signals have been observed in Fogo volcano: a) tide-modulated seismic noise and volcanic tremor, and b) high-frequency low-attenuation harmonic tremor. Amplitude modulation of seismic noise can be detected by simple eye-inspection of raw data in some stations of the VIGIL Network, Fogo Volcano. A more detailed analysis shows that certain frequency bands which we interpret as volcanic tremor, mainly in the range 2.0-3.0Hz, are preferentially modulated. The main frequency of modulation is 1.93 c.p.d., which corresponds to M2, the semi-diurnal lunar harmonic. Air pressure and temperature, which are continuously monitored in Fogo Island, have been analyzed and cannot explain the observed periodicity. Thus we conclude that seismic noise and tremor amplitudes are controlled by tides (Custodio et al., 2003). A relation between the tidal modulation and hydrothermal systems activity is suspected and under investigation. High-frequency (HF) tremor (5-20 Hz) has been recorded simultaneously in several stations in Fogo Island and even in different islands of the Cape Verde archipelago (up to distances of 120 km). In volcanic environments high-frequency motions are normally recorded in a small area close to the source, due to the strong attenuation of seismic waves. Non-volcanic origins for HF tremor were examined: cultural noise, whale vocalizations, ship noise, electronic/processing artifacts and path and/or site effects were all considered and dismissed. Emergent arrivals and strong site effects render source location a difficult task, but the analysis of wave polarizations and amplitude distributions seems to point to an offshore source. Two alternative mechanisms are presently being considered: a) propagation in the ocean sound channel of T-waves generated by resonance in a shallow conduit/chamber, and b) existence of a deep strong source, such as a large fluid-filled crack, capable of producing tremor with a complex pattern that propagates to large distances.
Higher songs of city birds may not be an individual response to noise.
Zollinger, Sue Anne; Slater, Peter J B; Nemeth, Erwin; Brumm, Henrik
2017-08-16
It has been observed in many songbird species that populations in noisy urban areas sing with a higher minimum frequency than do matched populations in quieter, less developed areas. However, why and how this divergence occurs is not yet understood. We experimentally tested whether chronic noise exposure during vocal learning results in songs with higher minimum frequencies in great tits ( Parus major ), the first species for which a correlation between anthropogenic noise and song frequency was observed. We also tested vocal plasticity of adult great tits in response to changing background noise levels by measuring song frequency and amplitude as we changed noise conditions. We show that noise exposure during ontogeny did not result in songs with higher minimum frequencies. In addition, we found that adult birds did not make any frequency or song usage adjustments when their background noise conditions were changed after song crystallization. These results challenge the common view of vocal adjustments by city birds, as they suggest that either noise itself is not the causal force driving the divergence of song frequency between urban and forest populations, or that noise induces population-wide changes over a time scale of several generations rather than causing changes in individual behaviour. © 2017 The Author(s).
Iyer, Suneeti Nathani; Oller, D. Kimbrough
2010-01-01
Little research has been conducted on the development of suprasegmental characteristics of vocalizations in typically developing infants (TDI) and the role of audition in the development of these characteristics. The purpose of the present study was to examine the longitudinal development of fundamental frequency (F0) in eight TDI and eight infants with severe-to-profound hearing loss matched for level of vocal development. Results revealed no significant changes in F0 with advances in pre-language vocal development for TDI. Infants with hearing loss, however, showed a statistically reliable higher variability of F0 than TDI, when age was accounted for as a covariate. The results suggest development of F0 may be strongly influenced by audition. PMID:19031191
Weaning reactions in beef cattle are adaptively adjusted to the state of the cow and the calf.
Stěhulová, I; Valníčková, B; Šárová, R; Špinka, M
2017-03-01
Abrupt weaning as practiced in beef cattle husbandry is stressful for both the cow and her offspring. However, the reaction to weaning varies among individuals. Based on the theory of maternal care allocation, we derived and tested the following hypotheses: 1) cow reaction to weaning will be stronger if the calf is young, if the calf is a female, and if the calf had higher daily weight gain; 2) cows in a higher parity and cows that are not concurrently pregnant will react more on weaning; and 3) young and female calves, and also calves with higher daily weight gain will respond more to weaning. We recorded frequency of vocalization and time spent moving in 50 cow-calf pairs (27 males and 23 females) immediately after weaning at 151 to 274 d of age. The recordings were made at 0 to 2 h, 6 to 8 h, and 24 to 26 h after the separation of the calves from the cows. Linear mixed models were used to test the predictions. In cows, age of the calf had the strongest effect with mothers of younger calves vocalizing more ( < 0.05). Frequency of vocalization was higher in mothers of calves with higher daily weight gain ( < 0.01) and in nonpregnant mothers ( < 0.01). Frequency of the moving was higher in younger cows ( < 0.05). Sex of the calf had no effect. In calves, females vocalized ( < 0.001) and moved ( < 0.01) more than males and calves with higher daily weight gain also called more ( < 0.01). The relationships between the 2 behaviors and their time courses were different in cows and calves. In cows, vocalization and movement were correlated ( < 0.001) and both increased until 6 to 8 h and then plateaued or declined ( < 0.001). In calves, vocalizations steadily increased until 24 to 26 h ( < 0.001) whereas movement remained unchanged in time and was uncorrelated with vocalizations. These differences indicate that vocalization may be a more sensitive indicator of weaning stress than movement. Our results document that the ability to adaptively adjust mother-young interactions has been preserved in domesticated beef cattle.
Occurrence Frequencies of Acoustic Patterns of Vocal Fry in American English Speakers.
Abdelli-Beruh, Nassima B; Drugman, Thomas; Red Owl, R H
2016-11-01
The goal of this study was to analyze the occurrence frequencies of three individual acoustic patterns (A, B, C) and of vocal fry overall (A + B + C) as a function of gender, word position in the sentence (Not Last Word vs. Last Word), and sentence length (number of words in a sentence). This is an experimental design. Twenty-five male and 29 female American English (AE) speakers read the Grandfather Passage. The recordings were processed by a Matlab toolbox designed for the analysis and detection of creaky segments, automatically identified using the Kane-Drugman algorithm. The experiment produced subsamples of outcomes, three that reflect a single, discrete acoustic pattern (A, B, or C) and the fourth that reflects the occurrence frequency counts of Vocal Fry Overall without regard to any specific pattern. Zero-truncated Poisson regression analyses were conducted with Gender and Word Position as predictors and Sentence Length as a covariate. The results of the present study showed that the occurrence frequencies of the three acoustic patterns and vocal fry overall (A + B + C) are greatest at the end of sentences but are unaffected by sentence length. The findings also reveal that AE female speakers exhibit Pattern C significantly more frequently than Pattern B, and the converse holds for AE male speakers. Future studies are needed to confirm such outcomes, assess the perceptual salience of these acoustic patterns, and determine the physiological correlates of these acoustic patterns. The findings have implications for the design of new excitation models of vocal fry. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Makagon, Maja M; Funayama, E Sumie; Owren, Michael J
2008-07-01
Relatively few empirical data are available concerning the role of auditory experience in nonverbal human vocal behavior, such as laughter production. This study compared the acoustic properties of laughter in 19 congenitally, bilaterally, and profoundly deaf college students and in 23 normally hearing control participants. Analyses focused on degree of voicing, mouth position, air-flow direction, temporal features, relative amplitude, fundamental frequency, and formant frequencies. Results showed that laughter produced by the deaf participants was fundamentally similar to that produced by the normally hearing individuals, which in turn was consistent with previously reported findings. Finding comparable acoustic properties in the sounds produced by deaf and hearing vocalizers confirms the presumption that laughter is importantly grounded in human biology, and that auditory experience with this vocalization is not necessary for it to emerge in species-typical form. Some differences were found between the laughter of deaf and hearing groups; the most important being that the deaf participants produced lower-amplitude and longer-duration laughs. These discrepancies are likely due to a combination of the physiological and social factors that routinely affect profoundly deaf individuals, including low overall rates of vocal fold use and pressure from the hearing world to suppress spontaneous vocalizations.
Razak, K A
2012-04-01
Frequency-modulated (FM) sweeps are common components of species-specific vocalizations. The intensity of FM sweeps can cover a wide range in the natural environment, but whether intensity affects neural selectivity for FM sweeps is unclear. Bats, such as the pallid bat, which use FM sweeps for echolocation, are suited to address this issue, because the intensity of echoes will vary with target distance. In this study, FM sweep rate selectivity of pallid bat auditory cortex neurons was measured using downward sweeps at different intensities. Neurons became more selective for FM sweep rates present in the bat's echolocation calls as intensity increased. Increased selectivity resulted from stronger inhibition of responses to slower sweep rates. The timing and bandwidth of inhibition generated by frequencies on the high side of the excitatory tuning curve [sideband high-frequency inhibition (HFI)] shape rate selectivity in cortical neurons in the pallid bat. To determine whether intensity-dependent changes in FM rate selectivity were due to altered inhibition, the timing and bandwidth of HFI were quantified at multiple intensities using the two-tone inhibition paradigm. HFI arrived faster relative to excitation as sound intensity increased. The bandwidth of HFI also increased with intensity. The changes in HFI predicted intensity-dependent changes in FM rate selectivity. These data suggest that neural selectivity for a sweep parameter is not static but shifts with intensity due to changes in properties of sideband inhibition.
Sulter, A M; Schutte, H K; Miller, D G
1996-06-01
To determine the influence of the factors gender, vocal training, sound intensity, pitch, and aging on vocal function, videolaryngostroboscopic images of 214 subjects, subdivided according to gender and status of vocal training, were evaluated by three judges with standardized rating scales, comprising aspects of laryngeal appearance (larynx/pharynx ratio; epiglottal shape; asymmetry arytenoid region; compensatory adjustments; thickness, width, length, and elasticity of vocal folds) and glottal functioning (amplitudes of excursion; duration, percentage, and type of vocal fold closure; phase differences; location of glottal chink). The video registrations were made while the subjects performed a set of phonatory tasks, comprising the utterance of the vowel /i/ at three levels of both fundamental frequency and sound intensity. Analysis of the rating scales showed generally sufficient agreement among judges. With the exception of more frequently observed complete closure and lateral phase differences of vocal fold excursions in trained subjects, no further differences were established between untrained and trained subjects. With an alpha level of p = 0.005, men differed from women with respect to laryngeal appearance (larynx/pharynx ratio, compensatory adjustments, and the presence of omega and deviant-shaped epiglottises), and their vocal folds were rated thicker in the vertical dimension, smaller in the lateral dimension, longer, and more tense, with smaller amplitudes of excursion during vibration. Glottal closure in male subjects was rated more complete, but briefer in duration. Significant effects of the factors pitch, sound intensity, and age on vocal fold appearance and glottal functioning were ascertained. Awareness of the influence of these factors, as well as the factor gender, on the rated scales is essential for an adequate evaluation of laryngostroboscopic images.
Body height, immunity, facial and vocal attractiveness in young men.
Skrinda, Ilona; Krama, Tatjana; Kecko, Sanita; Moore, Fhionna R; Kaasik, Ants; Meija, Laila; Lietuvietis, Vilnis; Rantala, Markus J; Krams, Indrikis
2014-12-01
Health, facial and vocal attributes and body height of men may affect a diverse range of social outcomes such as attractiveness to potential mates and competition for resources. Despite evidence that each parameter plays a role in mate choice, the relative role of each and inter-relationships between them, is still poorly understood. In this study, we tested relationships both between these parameters and with testosterone and immune function. We report positive relationships between testosterone with facial masculinity and attractiveness, and we found that facial masculinity predicted facial attractiveness and antibody response to a vaccine. Moreover, the relationship between antibody response to a hepatitis B vaccine and body height was found to be non-linear, with a positive relationship up to a height of 188 cm, but an inverse relationship in taller men. We found that vocal attractiveness was dependent upon vocal masculinity. The relationship between vocal attractiveness and body height was also non-linear, with a positive relationship of up to 178 cm, which then decreased in taller men. We did not find a significant relationship between body height and the fundamental frequency of vowel sounds provided by young men, while body height negatively correlated with the frequency of second formant. However, formant frequency was not associated with the strength of immune response. Our results demonstrate the potential of vaccination research to reveal costly traits that govern evolution of mate choice in humans and the importance of trade-offs among these traits.
Body height, immunity, facial and vocal attractiveness in young men
NASA Astrophysics Data System (ADS)
Skrinda, Ilona; Krama, Tatjana; Kecko, Sanita; Moore, Fhionna R.; Kaasik, Ants; Meija, Laila; Lietuvietis, Vilnis; Rantala, Markus J.; Krams, Indrikis
2014-12-01
Health, facial and vocal attributes and body height of men may affect a diverse range of social outcomes such as attractiveness to potential mates and competition for resources. Despite evidence that each parameter plays a role in mate choice, the relative role of each and inter-relationships between them, is still poorly understood. In this study, we tested relationships both between these parameters and with testosterone and immune function. We report positive relationships between testosterone with facial masculinity and attractiveness, and we found that facial masculinity predicted facial attractiveness and antibody response to a vaccine. Moreover, the relationship between antibody response to a hepatitis B vaccine and body height was found to be non-linear, with a positive relationship up to a height of 188 cm, but an inverse relationship in taller men. We found that vocal attractiveness was dependent upon vocal masculinity. The relationship between vocal attractiveness and body height was also non-linear, with a positive relationship of up to 178 cm, which then decreased in taller men. We did not find a significant relationship between body height and the fundamental frequency of vowel sounds provided by young men, while body height negatively correlated with the frequency of second formant. However, formant frequency was not associated with the strength of immune response. Our results demonstrate the potential of vaccination research to reveal costly traits that govern evolution of mate choice in humans and the importance of trade-offs among these traits.
Female Presence and Estrous State Influence Mouse Ultrasonic Courtship Vocalizations
Hanson, Jessica L.; Hurley, Laura M.
2012-01-01
The laboratory mouse is an emerging model for context-dependent vocal signaling and reception. Mouse ultrasonic vocalizations are robustly produced in social contexts. In adults, male vocalization during courtship has become a model of interest for signal-receiver interactions. These vocalizations can be grouped into syllable types that are consistently produced by different subspecies and strains of mice. Vocalizations are unique to individuals, vary across development, and depend on social housing conditions. The behavioral significance of different syllable types, including the contexts in which different vocalizations are made and the responses listeners have to different types of vocalizations, is not well understood. We examined the effect of female presence and estrous state on male vocalizations by exploring the use of syllable types and the parameters of syllables during courtship. We also explored correlations between vocalizations and other behaviors. These experimental manipulations produced four main findings: 1) vocalizations varied among males, 2) the production of USVs and an increase in the use of a specific syllable type were temporally related to mounting behavior, 3) the frequency (kHz), bandwidth, and duration of syllables produced by males were influenced by the estrous phase of female partners, and 4) syllable types changed when females were removed. These findings show that mouse ultrasonic courtship vocalizations are sensitive to changes in female phase and presence, further demonstrating the context-sensitivity of these calls. PMID:22815817
Horita, Haruhito; Kobayashi, Masahiko; Liu, Wan-chun; Oka, Kotaro; Jarvis, Erich D.; Wada, Kazuhiro
2012-01-01
Mechanisms for the evolution of convergent behavioral traits are largely unknown. Vocal learning is one such trait that evolved multiple times and is necessary in humans for the acquisition of spoken language. Among birds, vocal learning is evolved in songbirds, parrots, and hummingbirds. Each time similar forebrain song nuclei specialized for vocal learning and production have evolved. This finding led to the hypothesis that the behavioral and neuroanatomical convergences for vocal learning could be associated with molecular convergence. We previously found that the neural activity-induced gene dual specificity phosphatase 1 (dusp1) was up-regulated in non-vocal circuits, specifically in sensory-input neurons of the thalamus and telencephalon; however, dusp1 was not up-regulated in higher order sensory neurons or motor circuits. Here we show that song motor nuclei are an exception to this pattern. The song nuclei of species from all known vocal learning avian lineages showed motor-driven up-regulation of dusp1 expression induced by singing. There was no detectable motor-driven dusp1 expression throughout the rest of the forebrain after non-vocal motor performance. This pattern contrasts with expression of the commonly studied activity-induced gene egr1, which shows motor-driven expression in song nuclei induced by singing, but also motor-driven expression in adjacent brain regions after non-vocal motor behaviors. In the vocal non-learning avian species, we found no detectable vocalizing-driven dusp1 expression in the forebrain. These findings suggest that independent evolutions of neural systems for vocal learning were accompanied by selection for specialized motor-driven expression of the dusp1 gene in those circuits. This specialized expression of dusp1 could potentially lead to differential regulation of dusp1-modulated molecular cascades in vocal learning circuits. PMID:22876306
A simple frequency-scaling rule for animal communication
NASA Astrophysics Data System (ADS)
Fletcher, Neville H.
2004-05-01
Different animals use widely different frequencies for sound communication, and it is reasonable to assume that evolution has adapted these frequencies to give greatest conspecific communication distance for a given vocal effort. Acoustic analysis shows that the optimal communication frequency is inversely proportional to about the 0.4 power of the animal's body mass. Comparison with observational data indicates that this prediction is well supported in practice. For animals of a given class, for example mammals, the maximum communication distance varies about as the 0.6 power of the animal's mass. There is, however, a wide spread of observed results because of the different emphasis placed upon vocal effort in the evolution of different animal species.
Vocal warm-up practices and perceptions in vocalists: a pilot survey.
Gish, Allison; Kunduk, Melda; Sims, Loraine; McWhorter, Andrew J
2012-01-01
Investigated in a pilot study the type, duration, and frequency of vocal warm-up regimens in the singing community using a survey. One hundred seventeen participants completed an online survey. Participants included voice students from undergraduate, masters, and doctoral music programs and professional singers. Fifty-four percent of participants reported always using vocal warm-up before singing. Twenty-two percent of the participants used vocal cool down. The most preferred warm-up duration was of 5-10 minutes in duration. Despite using vocal warm-up, 26% of the participants reported experiencing voice problems. Females tended to use vocal warm-up more frequently than males. Females also tended to use longer warm-up sessions than males. Education of the participants did not appear to have any noticeable effect on the vocal warm-up practices. The most commonly used singing warm-up exercises were ascending/descending five-note scales, ascending/descending octave scales, legato arpeggios, and glissandi. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Effects of social games on infant vocalizations*.
Hsu, Hui-Chin; Iyer, Suneeti Nathani; Fogel, Alan
2014-01-01
The aim of the present study was to examine the contextual effects of social games on prelinguistic vocalizations. The two main goals were to (1) investigate the functions of vocalizations as symptoms of affective arousal and symbols of social understanding, and (2) explore form-function (de)coupling relations between vocalization types and game contexts. Seventy-one six-month-olds and sixty-four twelve-month-olds played with their mothers in normal and perturbed tickle and peek-a-boo games. The effects of infant age, game, game climax, and game perturbation on the frequency and types of infant vocalizations were examined. Results showed twelve-month-olds vocalized more mature canonical syllables during peek-a-boo and more primitive quasi-resonant nuclei during tickle than six-month-olds. Six- and twelve-month-olds increased their vocalizations from the set-up to climax during peek-a-boo, but they did not show such an increase during tickle. Findings support the symptom function of prelinguistic vocalizations reflecting affective arousal and the prevalence of form-function decoupling during the first year of life.
Chan, R W; Titze, I R
2000-01-01
The viscoelastic shear properties of human vocal fold mucosa (cover) were previously measured as a function of frequency [Chan and Titze, J. Acoust. Soc. Am. 106, 2008-2021 (1999)], but data were obtained only in a frequency range of 0.01-15 Hz, an order of magnitude below typical frequencies of vocal fold oscillation (on the order of 100 Hz). This study represents an attempt to extrapolate the data to higher frequencies based on two viscoelastic theories, (1) a quasilinear viscoelastic theory widely used for the constitutive modeling of the viscoelastic properties of biological tissues [Fung, Biomechanics (Springer-Verlag, New York, 1993), pp. 277-292], and (2) a molecular (statistical network) theory commonly used for the rheological modeling of polymeric materials [Zhu et al., J. Biomech. 24, 1007-1018 (1991)]. Analytical expressions of elastic and viscous shear moduli, dynamic viscosity, and damping ratio based on the two theories with specific model parameters were applied to curve-fit the empirical data. Results showed that the theoretical predictions matched the empirical data reasonably well, allowing for parametric descriptions of the data and their extrapolations to frequencies of phonation.
Trygonis, Vasilis; Gerstein, Edmund; Moir, Jim; McCulloch, Stephen
2013-12-01
Passive acoustic surveys were conducted to assess the vocal behavior of North Atlantic right whales (Eubalaena glacialis) in the designated critical calving habitat along the shallow coastal waters of southeastern United States. Underwater vocalizations were recorded using autonomous buoys deployed in close proximity to surface active groups (SAGs). Nine main vocalization types were identified with manual inspection of spectrograms, and standard acoustic descriptors were extracted. Classification trees were used to examine the distinguishing characteristics of calls and quantify their variability within the SAG vocal repertoire. The results show that descriptors of frequency, bandwidth, and spectral disorder are the most important parameters for partitioning the SAG repertoire, contrary to duration-related measures. The reported source levels and vocalization statistics provide sound production data vital to inform regional passive acoustic monitoring and conservation for this endangered species.
Automatic classification of killer whale vocalizations using dynamic time warping.
Brown, Judith C; Miller, Patrick J O
2007-08-01
A set of killer whale sounds from Marineland were recently classified automatically [Brown et al., J. Acoust. Soc. Am. 119, EL34-EL40 (2006)] into call types using dynamic time warping (DTW), multidimensional scaling, and kmeans clustering to give near-perfect agreement with a perceptual classification. Here the effectiveness of four DTW algorithms on a larger and much more challenging set of calls by Northern Resident whales will be examined, with each call consisting of two independently modulated pitch contours and having considerable overlap in contours for several of the perceptual call types. Classification results are given for each of the four algorithms for the low frequency contour (LFC), the high frequency contour (HFC), their derivatives, and weighted sums of the distances corresponding to LFC with HFC, LFC with its derivative, and HFC with its derivative. The best agreement with the perceptual classification was 90% attained by the Sakoe-Chiba algorithm for the low frequency contours alone.
Integrating perspectives on vocal performance and consistency
Sakata, Jon T.; Vehrencamp, Sandra L.
2012-01-01
SUMMARY Recent experiments in divergent fields of birdsong have revealed that vocal performance is important for reproductive success and under active control by distinct neural circuits. Vocal consistency, the degree to which the spectral properties (e.g. dominant or fundamental frequency) of song elements are produced consistently from rendition to rendition, has been highlighted as a biologically important aspect of vocal performance. Here, we synthesize functional, developmental and mechanistic (neurophysiological) perspectives to generate an integrated understanding of this facet of vocal performance. Behavioral studies in the field and laboratory have found that vocal consistency is affected by social context, season and development, and, moreover, positively correlated with reproductive success. Mechanistic investigations have revealed a contribution of forebrain and basal ganglia circuits and sex steroid hormones to the control of vocal consistency. Across behavioral, developmental and mechanistic studies, a convergent theme regarding the importance of vocal practice in juvenile and adult songbirds emerges, providing a basis for linking these levels of analysis. By understanding vocal consistency at these levels, we gain an appreciation for the various dimensions of song control and plasticity and argue that genes regulating the function of basal ganglia circuits and sex steroid hormones could be sculpted by sexual selection. PMID:22189763
Teachers' voice use in teaching environments: a field study using ambulatory phonation monitor.
Lyberg Åhlander, Viveka; Pelegrín García, David; Whitling, Susanna; Rydell, Roland; Löfqvist, Anders
2014-11-01
This case-control designed field study examines the vocal behavior in teachers with self-estimated voice problems (VP) and their age- and school-matched voice healthy (VH) colleagues. It was hypothesized that teachers with and teachers without VP use their voices differently regarding fundamental frequency, sound pressure level (SPL), and in relation to the background noise. Teachers with self-estimated VP (n = 14; two males and 12 females) were age and gender matched to VH school colleagues (n = 14; two males and 12 females). The subjects, recruited from an earlier study, had been examined in laryngeal, vocal, hearing, and psychosocial aspects. The fundamental frequency, SPL, and phonation time were recorded with an Ambulatory Phonation Monitor during one representative workday. The teachers reported their activities in a structured diary. The SPL (including teachers' and students' activity and ambient noise) was recorded with a sound level meter; the room temperature and air quality were measured simultaneously. The acoustic properties of the empty classrooms were measured. Teachers with VP behaved vocally different from their VH peers, in particular during teaching sessions. The phonation time was significantly higher in the group with VP, and the number of vibratory cycles differed between the female teachers. The F0 pattern, related to the vocal SPL and room acoustics, differed between the groups. The results suggest a different vocal behavior in subjects with subjective VP and a higher vocal load with fewer possibilities for vocal recovery. Copyright © 2014 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vocal Dose Measures: Quantifying Accumulated Vibration Exposure in Vocal Fold Tissues
Titze, Ingo R.; Švec, Jan G.; Popolo, Peter S.
2011-01-01
To measure the exposure to self-induced tissue vibration in speech, three vocal doses were defined and described: distance dose, which accumulates the distance that tissue particles of the vocal folds travel in an oscillatory trajectory; energy dissipation dose, which accumulates the total amount of heat dissipated over a unit volume of vocal fold tissues; and time dose, which accumulates the total phonation time. These doses were compared to a previously used vocal dose measure, the vocal loading index, which accumulates the number of vibration cycles of the vocal folds. Empirical rules for viscosity and vocal fold deformation were used to calculate all the doses from the fundamental frequency (F0) and sound pressure level (SPL) values of speech. Six participants were asked to read in normal, monotone, and exaggerated speech and the doses associated with these vocalizations were calculated. The results showed that large F0 and SPL variations in speech affected the dose measures, suggesting that accumulation of phonation time alone is insufficient. The vibration exposure of the vocal folds in normal speech was related to the industrial limits for hand-transmitted vibration, in which the safe distance dose was derived to be about 500 m. This limit was found rather low for vocalization; it was related to a comparable time dose of about 17 min of continuous vocalization, or about 35 min of continuous reading with normal breathing and unvoiced segments. The voicing pauses in normal speech and dialogue effectively prolong the safe time dose. The derived safety limits for vocalization will likely require refinement based on a more detailed knowledge of the differences in hand and vocal fold tissue morphology and their response to vibrational stress, and on the effect of recovery of the vocal fold tissue during voicing pauses. PMID:12959470
Vocal dose measures: quantifying accumulated vibration exposure in vocal fold tissues.
Titze, Ingo R; Svec, Jan G; Popolo, Peter S
2003-08-01
To measure the exposure to self-induced tissue vibration in speech, three vocal doses were defined and described: distance dose, which accumulates the distance that tissue particles of the vocal folds travel in an oscillatory trajectory; energy dissipation dose, which accumulates the total amount of heat dissipated over a unit volume of vocal fold tissues; and time dose, which accumulates the total phonation time. These doses were compared to a previously used vocal dose measure, the vocal loading index, which accumulates the number of vibration cycles of the vocal folds. Empirical rules for viscosity and vocal fold deformation were used to calculate all the doses from the fundamental frequency (F0) and sound pressure level (SPL) values of speech. Six participants were asked to read in normal, monotone, and exaggerated speech and the doses associated with these vocalizations were calculated. The results showed that large F0 and SPL variations in speech affected the dose measures, suggesting that accumulation of phonation time alone is insufficient. The vibration exposure of the vocal folds in normal speech was related to the industrial limits for hand-transmitted vibration, in which the safe distance dose was derived to be about 500 m. This limit was found rather low for vocalization; it was related to a comparable time dose of about 17 min of continuous vocalization, or about 35 min of continuous reading with normal breathing and unvoiced segments. The voicing pauses in normal speech and dialogue effectively prolong the safe time dose. The derived safety limits for vocalization will likely require refinement based on a more detailed knowledge of the differences in hand and vocal fold tissue morphology and their response to vibrational stress, and on the effect of recovery of the vocal fold tissue during voicing pauses.
Vocal warm-up and breathing training for teachers: randomized clinical trial
Pereira, Lílian Paternostro de Pina; Masson, Maria Lúcia Vaz; Carvalho, Fernando Martins
2015-01-01
OBJECTIVE To compare the effectiveness of two speech therapy interventions, vocal warm-up and breathing training, focusing on teachers’ voice quality. METHODS A single-blind, randomized, parallel clinical trial was conducted. The research included 31 20 to 60-year old teachers from a public school in Salvador, BA, Northeasatern Brazil, with minimum workloads of 20 hours a week, who have or have not reported having vocal alterations. The exclusion criteria were the following: being a smoker, excessive alcohol consumption, receiving additional speech therapy assistance while taking part in the study, being affected by upper respiratory tract infections, professional use of the voice in another activity, neurological disorders, and history of cardiopulmonary pathologies. The subjects were distributed through simple randomization in groups vocal warm-up (n = 14) and breathing training (n = 17). The teachers’ voice quality was subjectively evaluated through the Voice Handicap Index (Índice de Desvantagem Vocal, in the Brazilian version) and computerized voice analysis (average fundamental frequency, jitter, shimmer, noise, and glottal-to-noise excitation ratio) by speech therapists. RESULTS Before the interventions, the groups were similar regarding sociodemographic characteristics, teaching activities, and vocal quality. The variations before and after the intervention in self-assessment and acoustic voice indicators have not significantly differed between the groups. In the comparison between groups before and after the six-week interventions, significant reductions in the Voice Handicap Index of subjects in both groups were observed, as wells as reduced average fundamental frequencies in the vocal warm-up group and increased shimmer in the breathing training group. Subjects from the vocal warm-up group reported speaking more easily and having their voices more improved in a general way as compared to the breathing training group. CONCLUSIONS Both interventions were similar regarding their effects on the teachers’ voice quality. However, each contribution has individually contributed to improve the teachers’ voice quality, especially the vocal warm-up. PMID:26465664
Scheiner, Elisabeth; Hammerschmidt, Kurt; Jürgens, Uwe; Zwirner, Petra
2002-12-01
The nonverbal vocal utterances of seven normally hearing infants were studied within their first year of life with respect to age- and emotion-related changes. Supported by a multiparametric acoustic analysis it was possible to distinguish one inspiratory and eleven expiratory call types. Most of the call types appeared within the first two months; some emerged in the majority of infants not until the 5th ("laugh") or 7th month ("babble"). Age-related changes in acoustic structure were found in only 4 call types ("discomfort cry," "short discomfort cry," "wail," "moan"). The acoustic changes were characterized mainly by an increase in harmonic-to-noise ratio and homogeneity of the call, a decrease in frequency range and a downward shift of acoustic energy from higher to lower frequencies. Emotion-related differences were found in the acoustic structure of single call types as well as in the frequency of occurrence of different call types. A change from positive to negative emotional state was accompanied by an increase in call duration, frequency range, and peak frequency (frequency with the highest amplitude within the power spectrum). Negative emotions, in addition, were characterized by a significantly higher rate of "crying," "hic" and "ingressive vocalizations" than positive emotions, while positive emotions showed a significantly higher rate of "babble," "laugh," and "raspberry."
Fuentes-López, Eduardo; Fuente, Adrian; Contreras, Karem V
2017-12-18
The aim of this study is to determine possible associations between vocal hygiene habits and self-reported vocal symptoms in telemarketers. A cross-sectional study that included 79 operators from call centres in Chile was carried out. Their vocal hygiene habits and self-reported symptoms were investigated using a validated and reliable questionnaire created for the purposes of this study. Forty-five percent of telemarketers reported having one or more vocal symptoms. Among them, 16.46% reported that their voices tense up when talking and 10.13% needed to clear their throat to make their voices clearer. Five percent mentioned that they always talk without taking a break and 40.51% reported using their voices in noisy environments. The number of working hours per day and inadequate vocal hygiene habits were associated with the presence of self-reported symptoms. Additionally, an interaction between the use of the voice in noisy environments and not taking breaks during the day was observed. Finally, the frequency of inadequate vocal hygiene habits was associated with the number of symptoms reported. Using the voice in noisy environments and talking without taking breaks were both associated with the presence of specific vocal symptoms. This study provides some evidence about the interaction between these two inadequate vocal hygiene habits that potentiates vocal symptoms.
Scattoni, Maria Luisa; Crawley, Jacqueline; Ricceri, Laura
2009-01-01
In neonatal mice ultrasonic vocalizations have been studied both as an early communicative behavior of the pup-mother dyad and as a sign of an aversive affective state. Adult mice of both sexes produce complex ultrasonic vocalization patterns in different experimental/social contexts. All these vocalizations are becoming an increasingly valuable assay for behavioral phenotyping throughout the mouse life-span and alterations of the ultrasound patterns have been reported in several mouse models of neurodevelopmental disorders. Here we also show that the modulation of vocalizations by maternal cues (maternal potentiation paradigm) – originally identified and investigated in rats - can be measured in C57Bl/6 mouse pups with appropriate modifications of the rat protocol and can likely be applied to mouse behavioral phenotyping. In addition we suggest that a detailed qualitative evaluation of neonatal calls together with analysis of adult mouse vocalization patterns in both sexes in social settings, may lead to a greater understanding of the communication value of vocalizations in mice. Importantly, both neonatal and adult USV altered patterns can be determined during the behavioural phenotyping of mouse models of human neurodevelopmental and neuropsychiatric disorders, starting from those in which deficits in communication are a primary symptom. PMID:18771687
Patten, Elena; Belardi, Katie; Baranek, Grace T; Watson, Linda R; Labban, Jeffrey D; Oller, D Kimbrough
2014-10-01
Canonical babbling is a critical milestone for speech development and is usually well in place by 10 months. The possibility that infants with autism spectrum disorder (ASD) show late onset of canonical babbling has so far eluded evaluation. Rate of vocalization or "volubility" has also been suggested as possibly aberrant in infants with ASD. We conducted a retrospective video study examining vocalizations of 37 infants at 9-12 and 15-18 months. Twenty-three of the 37 infants were later diagnosed with ASD and indeed produced low rates of canonical babbling and low volubility by comparison with the 14 typically developing infants. The study thus supports suggestions that very early vocal patterns may prove to be a useful component of early screening and diagnosis of ASD.
Patten, Elena; Belardi, Katie; Baranek, Grace T.; Watson, Linda R.; Labban, Jeffrey D.; Oller, D. Kimbrough
2014-01-01
Canonical babbling is a critical milestone for speech development and is usually well in place by 10 months. The possibility that infants with ASD show late onset of canonical babbling has so far eluded evaluation. Rate of vocalization or “volubility” has also been suggested as possibly aberrant in infants with ASD. We conducted a retrospective video study examining vocalizations of 37 infants at 9–12 and 15–18 months. Twenty-three of the 37 infants were later diagnosed with ASD and indeed produced low rates of canonical babbling and low volubility by comparison with the 14 typically developing infants. The study thus supports suggestions that very early vocal patterns may prove to be a useful component of early screening and diagnosis of ASD. PMID:24482292
Reproduction of mouse-pup ultrasonic vocalizations by nanocrystalline silicon thermoacoustic emitter
NASA Astrophysics Data System (ADS)
Kihara, Takashi; Harada, Toshihiro; Kato, Masahiro; Nakano, Kiyoshi; Murakami, Osamu; Kikusui, Takefumi; Koshida, Nobuyoshi
2006-01-01
As one of the functional properties of ultrasound generator based on efficient thermal transfer at the nanocrystalline silicon (nc-Si) layer surface, its potential as an ultrasonic simulator of vocalization signals is demonstrated by using the acoustic data of mouse-pup calls. The device composed of a surface-heating thin-film electrode, an nc-Si layer, and a single-crystalline silicon (c-Si) wafer, exhibits an almost completely flat frequency response over a wide range without any mechanical surface vibration systems. It is shown that the fabricated emitter can reproduce digitally recorded ultrasonic mouse-pups vocalizations very accurately in terms of the call duration, frequency dispersion, and sound pressure level. The thermoacoustic nc-Si device provides a powerful physical means for the understanding of ultrasonic communication mechanisms in various living animals.
Viscoelasticity of rabbit vocal folds after injection augmentation.
Dahlqvist, Ake; Gärskog, Ola; Laurent, Claude; Hertegård, Stellan; Ambrosio, Luigi; Borzacchiello, Assunta
2004-01-01
Vocal fold function is related to the viscoelasticity of the vocal fold tissue. Augmentation substances used for injection treatment of voice insufficiency may alter the viscoelastic properties of vocal folds and their vibratory capacity. The objective was to compare the mechanical properties (viscoelasticity) of various injectable substances and the viscoelasticity of rabbit vocal folds, 6 months after injection with one of these substances. Animal model. Cross-linked collagen (Zyplast), double cross-linked hyaluronan (hylan B gel), dextranomers in hyaluronan (DHIA), and polytetrafluoroethylene (Teflon) were injected into rabbit vocal folds. Six months after the injection, the animals were killed and the right- and left-side vocal folds were removed. Dynamic viscosity of the injected substances and the vocal folds was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Hylan B gel and DiHA showed the lowest dynamic viscosity values, and vocal folds injected with these substances also showed the lowest dynamic viscosity (similar to noninjected control samples). Teflon (and vocal folds injected with Teflon) showed the highest dynamic viscosity values, followed by the collagen samples. Substances with low viscoelasticity alter the mechanical properties of the vocal fold to a lesser degree than substances with a high viscoelasticity. The data indicated that hylan B gel and DiHA render the most natural viscoelastic properties to the vocal folds. These substances seem to be appropriate for preserving or restoring the vibratory capacity of the vocal folds when glottal insufficiency is treated with augmentative injections.
Cooperative vocal control in marmoset monkeys via vocal feedback
Choi, Jung Yoon; Takahashi, Daniel Y.
2015-01-01
Humans adjust speech amplitude as a function of distance from a listener; we do so in a manner that would compensate for such distance. This ability is presumed to be the product of high-level sociocognitive skills. Nonhuman primates are thought to lack such socially related flexibility in vocal production. Using predictions from a simple arousal-based model whereby vocal feedback from a conspecific modulates the drive to produce a vocalization, we tested whether another primate exhibits this type of cooperative vocal control. We conducted a playback experiment with marmoset monkeys and simulated “far-away” and “nearby” conspecifics using contact calls that differed in sound intensity. We found that marmoset monkeys increased the amplitude of their contact calls and produced such calls with shorter response latencies toward more distant conspecifics. The same was not true in response to changing levels of background noise. To account for how simulated conspecific distance can change both the amplitude and timing of vocal responses, we developed a model that incorporates dynamic interactions between the auditory system and limbic “drive” systems. Overall, our data show that, like humans, marmoset monkeys cooperatively control the acoustics of their vocalizations according to changes in listener distance, increasing the likelihood that a conspecific will hear their call. However, we propose that such cooperative vocal control is a system property that does not necessitate any particularly advanced sociocognitive skill. At least in marmosets, this vocal control can be parsimoniously explained by the regulation of arousal states across two interacting individuals via vocal feedback. PMID:25925323
Conserved mechanisms of vocalization coding in mammalian and songbird auditory midbrain.
Woolley, Sarah M N; Portfors, Christine V
2013-11-01
The ubiquity of social vocalizations among animals provides the opportunity to identify conserved mechanisms of auditory processing that subserve communication. Identifying auditory coding properties that are shared across vocal communicators will provide insight into how human auditory processing leads to speech perception. Here, we compare auditory response properties and neural coding of social vocalizations in auditory midbrain neurons of mammalian and avian vocal communicators. The auditory midbrain is a nexus of auditory processing because it receives and integrates information from multiple parallel pathways and provides the ascending auditory input to the thalamus. The auditory midbrain is also the first region in the ascending auditory system where neurons show complex tuning properties that are correlated with the acoustics of social vocalizations. Single unit studies in mice, bats and zebra finches reveal shared principles of auditory coding including tonotopy, excitatory and inhibitory interactions that shape responses to vocal signals, nonlinear response properties that are important for auditory coding of social vocalizations and modulation tuning. Additionally, single neuron responses in the mouse and songbird midbrain are reliable, selective for specific syllables, and rely on spike timing for neural discrimination of distinct vocalizations. We propose that future research on auditory coding of vocalizations in mouse and songbird midbrain neurons adopt similar experimental and analytical approaches so that conserved principles of vocalization coding may be distinguished from those that are specialized for each species. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives". Copyright © 2013 Elsevier B.V. All rights reserved.
The Sound of Dominance: Vocal Precursors of Perceived Dominance during Interpersonal Influence.
ERIC Educational Resources Information Center
Tusing, Kyle James; Dillard, James Price
2000-01-01
Determines the effects of vocal cues on judgments of dominance in an interpersonal influence context. Indicates that mean amplitude and amplitude standard deviation were positively associated with dominance judgments, whereas speech rate was negatively associated with dominance judgments. Finds that mean fundamental frequency was positively…
Acoustic Characteristics of Simulated Respiratory-Induced Vocal Tremor
ERIC Educational Resources Information Center
Lester, Rosemary A.; Story, Brad H.
2013-01-01
Purpose: The purpose of this study was to investigate the relation of respiratory forced oscillation to the acoustic characteristics of vocal tremor. Method: Acoustical analyses were performed to determine the characteristics of the intensity and fundamental frequency (F[subscript 0]) for speech samples obtained by Farinella, Hixon, Hoit, Story,…
Dunlop, Rebecca A
Many theories and communication models developed from terrestrial studies focus on a simple dyadic exchange between a sender and receiver. During social interactions, the "frequency code" hypothesis suggests that frequency characteristics of vocal signals can simultaneously encode for static signaler attributes (size or sex) and dynamic information, such as motivation or emotional state. However, the additional presence of a bystander may result in a change of signaling behavior if the costs and benefits associated with the presence of this bystander are different from that of a simple dyad. In this study, two common humpback whale social calls ("wops" and "grumbles") were tested for differences related to group social behavior and the presence of bystanders. "Wop" parameters were stable with group social behavior, but were emitted at lower (14 dB) levels in the presence of a nearby singing whale compared to when a singing whale was not in the area. "Grumbles" were emitted at lower (30-39 Hz) fundamental frequencies in affiliative compared to non-affiliative groups and, in the presence of a nearby singing whale, were also emitted at lower (14 dB) levels. Vocal rates did not significantly change. The results suggest that, in humpbacks, the frequency in certain sound types relates to the social behavior of the vocalizing group, implying a frequency code system. The presence of a nearby audible bystander (a singing whale) had no effect on this frequency code, but by reducing their acoustic level, the signal-to-noise ratio at the singer would have been below 0, making it difficult for the singer to audibly detect the group. The frequency, duration, and amplitude parameters of humpback whale social vocalizations were tested between different social contexts: group social behavior (affiliating versus non-affiliating), the presence of a nearby singing whale, and the presence of a nearby non-singing group. "Grumbles" (commonly heard low-frequency unmodulated sounds) frequencies were lower in affiliating groups compared to non-affiliating groups, suggesting a change in group motivation (such as levels of aggression). "Wop" (another common sound type) structure (frequency and duration) was similar in affiliating and non-affiliating groups. In the presence of an audible bystander (a singing whale), both sound types were emitted at similar rates, but much lower amplitudes (14 dB), vastly reducing the detectability of these sounds by the singer. This suggests that these groups were acoustically avoiding the singing whale. They did not, however, acoustically respond to the presence of a nearby non-singing group.
The impact of perilaryngeal vibration on the self-perception of loudness and the Lombard effect.
Brajot, François-Xavier; Nguyen, Don; DiGiovanni, Jeffrey; Gracco, Vincent L
2018-04-05
The role of somatosensory feedback in speech and the perception of loudness was assessed in adults without speech or hearing disorders. Participants completed two tasks: loudness magnitude estimation of a short vowel and oral reading of a standard passage. Both tasks were carried out in each of three conditions: no-masking, auditory masking alone, and mixed auditory masking plus vibration of the perilaryngeal area. A Lombard effect was elicited in both masking conditions: speakers unconsciously increased vocal intensity. Perilaryngeal vibration further increased vocal intensity above what was observed for auditory masking alone. Both masking conditions affected fundamental frequency and the first formant frequency as well, but only vibration was associated with a significant change in the second formant frequency. An additional analysis of pure-tone thresholds found no difference in auditory thresholds between masking conditions. Taken together, these findings indicate that perilaryngeal vibration effectively masked somatosensory feedback, resulting in an enhanced Lombard effect (increased vocal intensity) that did not alter speakers' self-perception of loudness. This implies that the Lombard effect results from a general sensorimotor process, rather than from a specific audio-vocal mechanism, and that the conscious self-monitoring of speech intensity is not directly based on either auditory or somatosensory feedback.
NASA Astrophysics Data System (ADS)
O'Connell-Rodwell, Caitlin E.; Wood, Jason D.; Gunther, Roland; Klemperer, Simon; Rodwell, Timothy C.; Puria, Sunil; Sapolsky, Robert; Kinzley, Colleen; Arnason, Byron T.; Hart, Lynette A.
2004-05-01
Seismic correlates of low-frequency vocalizations in African and Asian elephants propagate in the ground at different velocities, with the potential of traveling farther than their airborne counterparts. A semblance technique applied to linear moveouts on narrow-bandpass-filtered data, coupled with forward modeling, demonstrates that the complex waves observed are the interference of an air wave and a Rayleigh wave traveling at the appropriate velocities. The Rayleigh wave appears to be generated at or close to the elephant, either by coupling through the elephant's body or through the air near the body to the ground. Low-frequency elephant vocalizations were reproduced seismically and played back to both a captive elephant and to elephant breeding herds in the wild, monitoring the elephants' behavioral responses, spacing between herd members and time spent at the water hole as an index of heightened vigilance. Breeding herds detected and responded appropriately to seismically transmitted elephant warning calls. The captive studies promise to elucidate a vibrotactile threshold of sensitivity for the elephant foot. Elephants may benefit from the exploitation of seismic cues as an additional communication modality, thus expanding their signaling repertoire and extending their range of potential communication and eavesdropping beyond that possible with airborne sound.
Vocal Tract Discomfort and Voice-Related Quality of Life in Wind Instrumentalists.
Cappellaro, Juliane; Beber, Bárbara Costa
2018-05-01
This study aimed to investigate vocal tract discomfort and quality of life in the voice of wind instrumentalists. It is a cross-sectional study. The sample was composed of 37 musicians of the orchestra of Caxias do Sul city, RS, Brazil. The participants answered a nonstandard questionnaire about demographic and professional information, the Voice-Related Quality of Life (V-RQOL), the Vocal Tract Discomfort (VTD) scale, and additional items about fatigue after playing the instrument and pain in the cervical muscles. Correlation analyses were performed using Spearman correlation test. The most frequent symptoms mentioned by musicians in the VTD, for both frequency and intensity of occurrence, were dryness, ache, irritability, and cervical muscle pain, in addition to the frequency of occurrence of fatigue after playing. The musicians showed high scores in the V-RQOL survey. Several symptoms evaluated by the VTD had a negative correlation with the musicians' years of orchestra membership and with V-RQOL scores. Symptoms of vocal tract discomfort are present in wind instrumentalists in low frequency and intensity of occurrence. However, these symptoms affect the musicians' voice-related quality of life, and they occur more in musicians with fewer years of orchestra membership. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Complex coevolution of wing, tail, and vocal sounds of courting male bee hummingbirds.
Clark, Christopher J; McGuire, Jimmy A; Bonaccorso, Elisa; Berv, Jacob S; Prum, Richard O
2018-03-01
Phenotypic characters with a complex physical basis may have a correspondingly complex evolutionary history. Males in the "bee" hummingbird clade court females with sound from tail-feathers, which flutter during display dives. On a phylogeny of 35 species, flutter sound frequency evolves as a gradual, continuous character on most branches. But on at least six internal branches fall two types of major, saltational changes: mode of flutter changes, or the feather that is the sound source changes, causing frequency to jump from one discrete value to another. In addition to their tail "instruments," males also court females with sound from their syrinx and wing feathers, and may transfer or switch instruments over evolutionary time. In support of this, we found a negative phylogenetic correlation between presence of wing trills and singing. We hypothesize this transference occurs because wing trills and vocal songs serve similar functions and are thus redundant. There are also three independent origins of self-convergence of multiple signals, in which the same species produces both a vocal (sung) frequency sweep, and a highly similar nonvocal sound. Moreover, production of vocal, learned song has been lost repeatedly. Male bee hummingbirds court females with a diverse, coevolving array of acoustic traits. © 2018 The Author(s). Evolution © 2018 The Society for the Study of Evolution.
Bálint, Anna; Faragó, Tamás; Miklósi, Ádám; Pongrácz, Péter
2016-11-01
Body size is an important feature that affects fighting ability; however, size-related parameters of agonistic vocalizations are difficult to manipulate because of anatomical constraints within the vocal production system. Rare examples of acoustic size modulation are due to specific features that enable the sender to steadily communicate exaggerated body size. However, one could argue that it would be more adaptive if senders could adjust their signaling behavior to the fighting potential of their actual opponent. So far there has been no experimental evidence for this possibility. We tested this hypothesis by exposing family dogs (Canis familiaris) to humans with potentially different fighting ability. In a within-subject experiment, 64 dogs of various breeds consecutively faced two threateningly approaching humans, either two men or two women of different stature, or a man and a woman of similar or different stature. We found that the dogs' vocal responses were affected by the gender of the threatening stranger and the dog owner's gender. Dogs with a female owner, or those dogs which came from a household where both genders were present, reacted with growls of lower values of the Pitch-Formant component (including deeper fundamental frequency and lower formant dispersion) to threatening men. Our results are the first to show that non-human animals react with dynamic alteration of acoustic parameters related to their individual indexical features (body size), depending on the level of threat in an agonistic encounter.
Acoustic analysis of trill sounds.
Dhananjaya, N; Yegnanarayana, B; Bhaskararao, Peri
2012-04-01
In this paper, the acoustic-phonetic characteristics of steady apical trills--trill sounds produced by the periodic vibration of the apex of the tongue--are studied. Signal processing methods, namely, zero-frequency filtering and zero-time liftering of speech signals, are used to analyze the excitation source and the resonance characteristics of the vocal tract system, respectively. Although it is natural to expect the effect of trilling on the resonances of the vocal tract system, it is interesting to note that trilling influences the glottal source of excitation as well. The excitation characteristics derived using zero-frequency filtering of speech signals are glottal epochs, strength of impulses at the glottal epochs, and instantaneous fundamental frequency of the glottal vibration. Analysis based on zero-time liftering of speech signals is used to study the dynamic resonance characteristics of vocal tract system during the production of trill sounds. Qualitative analysis of trill sounds in different vowel contexts, and the acoustic cues that may help spotting trills in continuous speech are discussed.
Dehling, J Maximilian; Matsui, Masafumi
2013-01-01
We describe a new species of Leptolalax from Gunung Mulu National Park in eastern Sarawak, Malaysian Borneo. The new species had been assigned to Leptolalax dringi and Leptolalax gracilis in the past. It is shown to differ from both these species and from all other species of the genus by a unique combination of morphological characters including large body size, rounded snout, interorbital distance being smaller than width of upper eyelid, bipartite subgular vocal sac in males, basal toe webbing, shagreened skin with tiny tubercles on dorsum and dorsal side of head, angled supratympanic fold, small pectoral glands, absence of supraaxillary glands and ventrolateral glandular ridges, spotted venter, advertisement call consisting of long series of 8-289 notes, each composed of three or four pulses, and dominant frequency at 7225-9190 Hz, with prominent frequency modulation.
You are only as old as you sound: auditory aftereffects in vocal age perception.
Zäske, Romi; Schweinberger, Stefan R
2011-12-01
High-level adaptation not only biases the perception of faces, but also causes transient distortions in auditory perception of non-linguistic voice information about gender, identity, and emotional intonation. Here we report a novel auditory aftereffect in perceiving vocal age: age estimates were elevated in age-morphed test voices when preceded by adaptor voices of young speakers (∼20 yrs), compared to old adaptor voices (∼70 yrs). This vocal age aftereffect (VAAE) complements a recently reported face aftereffect (Schweinberger et al., 2010) and points to selective neuronal coding of vocal age. Intriguingly, post-adaptation assessment revealed that VAAEs could persist for minutes after adaptation, although reduced in magnitude. As an important qualification, VAAEs during post-adaptation were modulated by gender congruency between speaker and listener. For both male and female listeners, VAAEs were much reduced for test voices of opposite gender. Overall, this study establishes a new auditory aftereffect in the perception of vocal age. We offer a tentative sociobiological explanation for the differential, gender-dependent recovery from vocal age adaptation. Copyright © 2011 Elsevier B.V. All rights reserved.
The siren song of vocal fundamental frequency for romantic relationships.
Weusthoff, Sarah; Baucom, Brian R; Hahlweg, Kurt
2013-01-01
A multitude of factors contribute to why and how romantic relationships are formed as well as whether they ultimately succeed or fail. Drawing on evolutionary models of attraction and speech production as well as integrative models of relationship functioning, this review argues that paralinguistic cues (more specifically the fundamental frequency of the voice) that are initially a strong source of attraction also increase couples' risk for relationship failure. Conceptual similarities and differences between the multiple operationalizations and interpretations of vocal fundamental frequency are discussed and guidelines are presented for understanding both convergent and non-convergent findings. Implications for clinical practice and future research are discussed.
NASA Astrophysics Data System (ADS)
Rupitsch, Stefan J.; Ilg, Jürgen; Sutor, Alexander; Lerch, Reinhard; Döllinger, Michael
2011-08-01
In order to obtain a deeper understanding of the human phonation process and the mechanisms generating sound, realistic setups are built up containing artificial vocal folds. Usually, these vocal folds consist of viscoelastic materials (e.g., polyurethane mixtures). Reliable simulation based studies on the setups require the mechanical properties of the utilized viscoelastic materials. The aim of this work is the identification of mechanical material parameters (Young's modulus, Poisson's ratio, and loss factor) for those materials. Therefore, we suggest a low-cost measurement setup, the so-called vibration transmission analyzer (VTA) enabling to analyze the transfer behavior of viscoelastic materials for propagating mechanical waves. With the aid of a mathematical Inverse Method, the material parameters are adjusted in a convenient way so that the simulation results coincide with the measurement results for the transfer behavior. Contrary to other works, we determine frequency dependent functions for the mechanical properties characterizing the viscoelastic material in the frequency range of human speech (100-250 Hz). The results for three different materials clearly show that the Poisson's ratio is close to 0.5 and that the Young's modulus increases with higher frequencies. For a frequency of 400 Hz, the Young's modulus of the investigated viscoelastic materials is approximately 80% higher than for the static case (0 Hz). We verify the identified mechanical properties with experiments on fabricated vocal fold models. Thereby, only small deviations between measurements and simulations occur.
Sisneros, Joseph A
2009-03-01
The plainfin midshipman fish (Porichthys notatus Girard, 1854) is a vocal species of batrachoidid fish that generates acoustic signals for intraspecific communication during social and reproductive activity and has become a good model for investigating the neural and endocrine mechanisms of vocal-acoustic communication. Reproductively active female plainfin midshipman fish use their auditory sense to detect and locate "singing" males, which produce a multiharmonic advertisement call to attract females for spawning. The seasonal onset of male advertisement calling in the midshipman fish coincides with an increase in the range of frequency sensitivity of the female's inner ear saccule, the main organ of hearing, thus leading to enhanced encoding of the dominant frequency components of male advertisement calls. Non-reproductive females treated with either testosterone or 17β-estradiol exhibit a dramatic increase in the inner ear's frequency sensitivity that mimics the reproductive female's auditory phenotype and leads to an increased detection of the male's advertisement call. This novel form of auditory plasticity provides an adaptable mechanism that enhances coupling between sender and receiver in vocal communication. This review focuses on recent evidence for seasonal reproductive-state and steroid-dependent plasticity of auditory frequency sensitivity in the peripheral auditory system of the midshipman fish. The potential steroid-dependent mechanism(s) that lead to this novel form of auditory and behavioral plasticity are also discussed. © 2009 ISZS, Blackwell Publishing and IOZ/CAS.
Integrating cues of social interest and voice pitch in men's preferences for women's voices.
Jones, Benedict C; Feinberg, David R; Debruine, Lisa M; Little, Anthony C; Vukovic, Jovana
2008-04-23
Most previous studies of vocal attractiveness have focused on preferences for physical characteristics of voices such as pitch. Here we examine the content of vocalizations in interaction with such physical traits, finding that vocal cues of social interest modulate the strength of men's preferences for raised pitch in women's voices. Men showed stronger preferences for raised pitch when judging the voices of women who appeared interested in the listener than when judging the voices of women who appeared relatively disinterested in the listener. These findings show that voice preferences are not determined solely by physical properties of voices and that men integrate information about voice pitch and the degree of social interest expressed by women when forming voice preferences. Women's preferences for raised pitch in women's voices were not modulated by cues of social interest, suggesting that the integration of cues of social interest and voice pitch when men judge the attractiveness of women's voices may reflect adaptations that promote efficient allocation of men's mating effort.
Ghassemi, Marzyeh; Van Stan, Jarrad H; Mehta, Daryush D; Zañartu, Matías; Cheyne, Harold A; Hillman, Robert E; Guttag, John V
2014-06-01
Voice disorders are medical conditions that often result from vocal abuse/misuse which is referred to generically as vocal hyperfunction. Standard voice assessment approaches cannot accurately determine the actual nature, prevalence, and pathological impact of hyperfunctional vocal behaviors because such behaviors can vary greatly across the course of an individual's typical day and may not be clearly demonstrated during a brief clinical encounter. Thus, it would be clinically valuable to develop noninvasive ambulatory measures that can reliably differentiate vocal hyperfunction from normal patterns of vocal behavior. As an initial step toward this goal we used an accelerometer taped to the neck surface to provide a continuous, noninvasive acceleration signal designed to capture some aspects of vocal behavior related to vocal cord nodules, a common manifestation of vocal hyperfunction. We gathered data from 12 female adult patients diagnosed with vocal fold nodules and 12 control speakers matched for age and occupation. We derived features from weeklong neck-surface acceleration recordings by using distributions of sound pressure level and fundamental frequency over 5-min windows of the acceleration signal and normalized these features so that intersubject comparisons were meaningful. We then used supervised machine learning to show that the two groups exhibit distinct vocal behaviors that can be detected using the acceleration signal. We were able to correctly classify 22 of the 24 subjects, suggesting that in the future measures of the acceleration signal could be used to detect patients with the types of aberrant vocal behaviors that are associated with hyperfunctional voice disorders.
Dunlop, Rebecca A.; Cato, Douglas H.; Noad, Michael J.
2010-01-01
High background noise is an important obstacle in successful signal detection and perception of an intended acoustic signal. To overcome this problem, many animals modify their acoustic signal by increasing the repetition rate, duration, amplitude or frequency range of the signal. An alternative method to ensure successful signal reception, yet to be tested in animals, involves the use of two different types of signal, where one signal type may enhance the other in periods of high background noise. Humpback whale communication signals comprise two different types: vocal signals, and surface-generated signals such as ‘breaching’ or ‘pectoral slapping’. We found that humpback whales gradually switched from primarily vocal to primarily surface-generated communication in increasing wind speeds and background noise levels, though kept both signal types in their repertoire. Vocal signals have the advantage of having higher information content but may have the disadvantage of loosing this information in a noisy environment. Surface-generated sounds have energy distributed over a greater frequency range and may be less likely to become confused in periods of high wind-generated noise but have less information content when compared with vocal sounds. Therefore, surface-generated sounds may improve detection or enhance the perception of vocal signals in a noisy environment. PMID:20392731
Cui, Jianguo; Tang, Yezhong; Narins, Peter M
2012-06-23
During female mate choice, both the male's phenotype and resources (e.g. his nest) contribute to the chooser's fitness. Animals other than humans are not known to advertise resource characteristics to potential mates through vocal communication; although in some species of anurans and birds, females do evaluate male qualities through vocal communication. Here, we demonstrate that calls of the male Emei music frog (Babina dauchina), vocalizing from male-built nests, reflect nest structure information that can be recognized by females. Inside-nest calls consisted of notes with energy concentrated at lower frequency ranges and longer note durations when compared with outside-nest calls. Centre frequencies and note durations of the inside calls positively correlate with the area of the burrow entrance and the depth of the burrow, respectively. When given a choice between outside and inside calls played back alternately, more than 70 per cent of the females (33/47) chose inside calls. These results demonstrate that males of this species faithfully advertise whether or not they possess a nest to potential mates by vocal communication, which probably facilitates optimal mate selection by females. These results revealed a novel function of advertisement calls, which is consistent with the wide variation in both call complexity and social behaviour within amphibians.
Makagon, Maja M.; Funayama, E. Sumie; Owren, Michael J.
2008-01-01
Relatively few empirical data are available concerning the role of auditory experience in nonverbal human vocal behavior, such as laughter production. This study compared the acoustic properties of laughter in 19 congenitally, bilaterally, and profoundly deaf college students and in 23 normally hearing control participants. Analyses focused on degree of voicing, mouth position, air-flow direction, temporal features, relative amplitude, fundamental frequency, and formant frequencies. Results showed that laughter produced by the deaf participants was fundamentally similar to that produced by the normally hearing individuals, which in turn was consistent with previously reported findings. Finding comparable acoustic properties in the sounds produced by deaf and hearing vocalizers confirms the presumption that laughter is importantly grounded in human biology, and that auditory experience with this vocalization is not necessary for it to emerge in species-typical form. Some differences were found between the laughter of deaf and hearing groups; the most important being that the deaf participants produced lower-amplitude and longer-duration laughs. These discrepancies are likely due to a combination of the physiological and social factors that routinely affect profoundly deaf individuals, including low overall rates of vocal fold use and pressure from the hearing world to suppress spontaneous vocalizations. PMID:18646991
The effect of sleep deprivation on vocal expression of emotion in adolescents and adults.
McGlinchey, Eleanor L; Talbot, Lisa S; Chang, Keng-Hao; Kaplan, Katherine A; Dahl, Ronald E; Harvey, Allison G
2011-09-01
Investigate the impact of sleep deprivation on vocal expression of emotion. Within-group repeated measures analysis involving sleep deprivation and rested conditions. Experimental laboratory setting. Fifty-five healthy participants (24 females), including 38 adolescents aged 11-15 y and 17 adults aged 30-60 y. A multimethod approach was used to examine vocal expression of emotion in interviews conducted at 22:30 and 06:30. On that night, participants slept a maximum of 2 h. Interviews were analyzed for vocal expression of emotion via computerized text analysis, human rater judgments, and computerized acoustic properties. Computerized text analysis and human rater judgments indicated decreases in positive emotion in all participants at 06:30 relative to 22:30, and adolescents displayed a significantly greater decrease in positive emotion via computerized text analysis relative to adults. Increases in negative emotion were observed among all participants using human rater judgments. Results for the computerized acoustic properties indicated decreases in pitch, bark energy (intensity) in certain high frequency bands, and vocal sharpness (reduction in high frequency bands > 1000 Hz). These findings support the importance of sleep for healthy emotional functioning in adults, and further suggest that adolescents are differentially vulnerable to the emotional consequences of sleep deprivation.
Dunlop, Rebecca A; Cato, Douglas H; Noad, Michael J
2010-08-22
High background noise is an important obstacle in successful signal detection and perception of an intended acoustic signal. To overcome this problem, many animals modify their acoustic signal by increasing the repetition rate, duration, amplitude or frequency range of the signal. An alternative method to ensure successful signal reception, yet to be tested in animals, involves the use of two different types of signal, where one signal type may enhance the other in periods of high background noise. Humpback whale communication signals comprise two different types: vocal signals, and surface-generated signals such as 'breaching' or 'pectoral slapping'. We found that humpback whales gradually switched from primarily vocal to primarily surface-generated communication in increasing wind speeds and background noise levels, though kept both signal types in their repertoire. Vocal signals have the advantage of having higher information content but may have the disadvantage of loosing this information in a noisy environment. Surface-generated sounds have energy distributed over a greater frequency range and may be less likely to become confused in periods of high wind-generated noise but have less information content when compared with vocal sounds. Therefore, surface-generated sounds may improve detection or enhance the perception of vocal signals in a noisy environment.
Roy, Nelson; Fetrow, Rebecca A; Merrill, Ray M; Dromey, Christopher
2016-10-01
Vocal hyperfunction, related to abnormal laryngeal muscle activity, is considered the proximal cause of primary muscle tension dysphonia (pMTD). Relative fundamental frequency (RFF) has been proposed as an objective acoustic marker of vocal hyperfunction. This study examined (a) the ability of RFF to track changes in vocal hyperfunction after treatment for pMTD and (b) the influence of dysphonia severity, among other factors, on the feasibility of RFF computation. RFF calculations and dysphonia severity ratings were derived from pre- and posttreatment recordings from 111 women with pMTD and 20 healthy controls. Three vowel-voiceless consonant-vowel stimuli were analyzed. RFF onset slope consistently varied as a function of group (pMTD vs. controls) and time (pretherapy vs. posttherapy). Significant correlations between RFF onset cycle 1 and dysphonia severity were observed. However, in many samples, RFF could not be computed, and adjusted odds ratios revealed that these unanalyzable data were linked to dysphonia severity, phonetic (vowel-voiceless consonant-vowel) context, and group (pMTD vs. control). RFF onset appears to be sensitive to the presence and degree of suspected vocal hyperfunction before and after therapy. The large number of unanalyzable samples (related especially to dysphonia severity in the pMTD group) represents an important limitation.
Lower Vocal Tract Morphologic Adjustments Are Relevant for Voice Timbre in Singing.
Mainka, Alexander; Poznyakovskiy, Anton; Platzek, Ivan; Fleischer, Mario; Sundberg, Johan; Mürbe, Dirk
2015-01-01
The vocal tract shape is crucial to voice production. Its lower part seems particularly relevant for voice timbre. This study analyzes the detailed morphology of parts of the epilaryngeal tube and the hypopharynx for the sustained German vowels /a/, /e/, /i/, /o/, and /u/ by thirteen male singer subjects who were at the beginning of their academic singing studies. Analysis was based on two different phonatory conditions: a natural, speech-like phonation and a singing phonation, like in classical singing. 3D models of the vocal tract were derived from magnetic resonance imaging and compared with long-term average spectrum analysis of audio recordings from the same subjects. Comparison of singing to the speech-like phonation, which served as reference, showed significant adjustments of the lower vocal tract: an average lowering of the larynx by 8 mm and an increase of the hypopharyngeal cross-sectional area (+ 21:9%) and volume (+ 16:8%). Changes in the analyzed epilaryngeal portion of the vocal tract were not significant. Consequently, lower larynx-to-hypopharynx area and volume ratios were found in singing compared to the speech-like phonation. All evaluated measures of the lower vocal tract varied significantly with vowel quality. Acoustically, an increase of high frequency energy in singing correlated with a wider hypopharyngeal area. The findings offer an explanation how classical male singers might succeed in producing a voice timbre with increased high frequency energy, creating a singer`s formant cluster.
Lower Vocal Tract Morphologic Adjustments Are Relevant for Voice Timbre in Singing
Mainka, Alexander; Poznyakovskiy, Anton; Platzek, Ivan; Fleischer, Mario; Sundberg, Johan; Mürbe, Dirk
2015-01-01
The vocal tract shape is crucial to voice production. Its lower part seems particularly relevant for voice timbre. This study analyzes the detailed morphology of parts of the epilaryngeal tube and the hypopharynx for the sustained German vowels /a/, /e/, /i/, /o/, and /u/ by thirteen male singer subjects who were at the beginning of their academic singing studies. Analysis was based on two different phonatory conditions: a natural, speech-like phonation and a singing phonation, like in classical singing. 3D models of the vocal tract were derived from magnetic resonance imaging and compared with long-term average spectrum analysis of audio recordings from the same subjects. Comparison of singing to the speech-like phonation, which served as reference, showed significant adjustments of the lower vocal tract: an average lowering of the larynx by 8 mm and an increase of the hypopharyngeal cross-sectional area (+ 21.9%) and volume (+ 16.8%). Changes in the analyzed epilaryngeal portion of the vocal tract were not significant. Consequently, lower larynx-to-hypopharynx area and volume ratios were found in singing compared to the speech-like phonation. All evaluated measures of the lower vocal tract varied significantly with vowel quality. Acoustically, an increase of high frequency energy in singing correlated with a wider hypopharyngeal area. The findings offer an explanation how classical male singers might succeed in producing a voice timbre with increased high frequency energy, creating a singer‘s formant cluster. PMID:26186691
Major depressive disorder discrimination using vocal acoustic features.
Taguchi, Takaya; Tachikawa, Hirokazu; Nemoto, Kiyotaka; Suzuki, Masayuki; Nagano, Toru; Tachibana, Ryuki; Nishimura, Masafumi; Arai, Tetsuaki
2018-01-01
The voice carries various information produced by vibrations of the vocal cords and the vocal tract. Though many studies have reported a relationship between vocal acoustic features and depression, including mel-frequency cepstrum coefficients (MFCCs) which applied to speech recognition, there have been few studies in which acoustic features allowed discrimination of patients with depressive disorder. Vocal acoustic features as biomarker of depression could make differential diagnosis of patients with depressive state. In order to achieve differential diagnosis of depression, in this preliminary study, we examined whether vocal acoustic features could allow discrimination between depressive patients and healthy controls. Subjects were 36 patients who met the criteria for major depressive disorder and 36 healthy controls with no current or past psychiatric disorders. Voices of reading out digits before and after verbal fluency task were recorded. Voices were analyzed using OpenSMILE. The extracted acoustic features, including MFCCs, were used for group comparison and discriminant analysis between patients and controls. The second dimension of MFCC (MFCC 2) was significantly different between groups and allowed the discrimination between patients and controls with a sensitivity of 77.8% and a specificity of 86.1%. The difference in MFCC 2 between the two groups reflected an energy difference of frequency around 2000-3000Hz. The MFCC 2 was significantly different between depressive patients and controls. This feature could be a useful biomarker to detect major depressive disorder. Sample size was relatively small. Psychotropics could have a confounding effect on voice. Copyright © 2017 Elsevier B.V. All rights reserved.
Wühr, Peter; Duthoo, Wout; Notebaert, Wim
2015-01-01
Three experiments investigated transfer of list-wide proportion congruent (LWPC) effects from a set of congruent and incongruent items with different frequency (inducer task) to a set of congruent and incongruent items with equal frequency (diagnostic task). Experiments 1 and 2 mixed items from horizontal and vertical Simon tasks. Tasks always involved different stimuli that varied on the same dimension (colour) in Experiment 1 and on different dimensions (colour, shape) in Experiment 2. Experiment 3 mixed trials from a manual Simon task with trials from a vocal Stroop task, with colour being the relevant stimulus in both tasks. There were two major results. First, we observed transfer of LWPC effects in Experiments 1 and 3, when tasks shared the relevant dimension, but not in Experiment 2. Second, sequential modulations of congruency effects transferred in Experiment 1 only. Hence, the different transfer patterns suggest that LWPC effects and sequential modulations arise from different mechanisms. Moreover, the observation of transfer supports an account of LWPC effects in terms of list-wide cognitive control, while being at odds with accounts in terms of stimulus-response (contingency) learning and item-specific control.
Optoreflectometry determination of the resonance properties of a vocal fold.
Garrel, Renaud; Nicollas, Richard; Giovanni, Antoine; Ouaknine, Maurice
2007-09-01
A new method of measuring the resonance properties of a vocal fold using electromagnetic excitation and laser optoreflectometry for response monitoring is described. Two resonance peaks were experimentally identified with one magnet stuck on the vocal fold at frequencies F0(1m)=54.7 Hz and F0'(1m)=35.8 Hz. The addition of a second magnet allowed calculation of the actual viscoelastic properties of the vocal fold: F0=71.8 Hz; quality factor Q=8.03; mass m=0.057 g; stiffness k=11.6 Nm; and damping zeta=0.0032 Nm(-1). A numerical simulation of a two-layered model verified the experimental data.
Ng, Manwa L; Yan, Nan; Chan, Venus; Chen, Yang; Lam, Paul K Y
2018-06-28
Previous studies of the laryngectomized vocal tract using formant frequencies reported contradictory findings. Imagining studies of the vocal tract in alaryngeal speakers are limited due to the possible radiation effect as well as the cost and time associated with the studies. The present study examined the vocal tract configuration of laryngectomized individuals using acoustic reflection technology. Thirty alaryngeal and 30 laryngeal male speakers of Cantonese participated in the study. A pharyngometer was used to obtain volumetric information of the vocal tract. All speakers were instructed to imitate the production of /a/ when the length and volume information of the oral cavity, pharyngeal cavity, and the entire vocal tract were obtained. The data of alaryngeal and laryngeal speakers were compared. Pharyngometric measurements revealed no significant difference in the vocal tract dimensions between laryngeal and alaryngeal speakers. Despite the removal of the larynx and a possible alteration in the pharyngeal cavity during total laryngectomy, the vocal tract configuration (length and volume) in laryngectomized individuals was not significantly different from laryngeal speakers. It is suggested that other factors might have affected formant measures in previous studies. © 2018 S. Karger AG, Basel.
A theoretical study of F0-F1 interaction with application to resonant speaking and singing voice.
Titze, Ingo R
2004-09-01
An interactive source-filter system, consisting of a three-mass body-cover model of the vocal folds and a wave reflection model of the vocal tract, was used to test the dependence of vocal fold vibration on the vocal tract. The degree of interaction is governed by the epilarynx tube, which raises the vocal tract impedance to match the impedance of the glottis. The key component of the impedance is inertive reactance. Whenever there is inertive reactance, the vocal tract assists the vocal folds in vibration. The amplitude of vibration and the glottal flow can more than double, and the oral radiated power can increase up to 10 dB. As F0 approaches F1, the first formant frequency, the interactive source-filter system loses its advantage (because inertive reactance changes to compliant reactance) and the noninteractive system produces greater vocal output. Thus, from a voice training and control standpoint, there may be reasons to operate the system in either interactive and noninteractive modes. The harmonics 2F0 and 3F0 can also benefit from being positioned slightly below F1.
Vocal function in introverts and extraverts during a psychological stress reactivity protocol.
Dietrich, Maria; Verdolini Abbott, Katherine
2012-06-01
To examine the proposal that introversion predictably influences extralaryngeal and vocal behavior in vocally healthy individuals compared with individuals with extraversion and whether differences are of a nature that may support a risk hypothesis for primary muscle tension dysphonia. Fifty-four vocally healthy female adults between the ages of 18 and 35 years were divided into 2 groups: introversion (n = 27) and extraversion (n = 27). All participants completed a psychological stress reactivity experiment. Before, during, and after the stressor (public speaking), participants were assessed on extralaryngeal muscle activity (surface electromyography: submental, infrahyoid; control site: tibialis anterior), perceived vocal effort, and vocal acoustics (fundamental frequency and intensity). Participants in the introversion group exhibited significantly greater infrahyoid muscle activity throughout the protocol and during perceived stress than participants in the extraversion group. For both groups, perceived vocal effort significantly increased during stress, and acoustic measures significantly decreased. Infrahyoid muscle activity during the stress phase was significantly correlated with introversion and Voice Handicap Index scores but not with vocal effort scores. The data provided evidence of distinct differences in extralaryngeal behavior between introverts and extraverts. The findings are consistent with the trait theory of voice disorders (Roy & Bless, 2000).
Forlano, Paul M; Sisneros, Joseph A
2016-01-01
The plainfin midshipman fish (Porichthys notatus) is a well-studied model to understand the neural and endocrine mechanisms underlying vocal-acoustic communication across vertebrates. It is well established that steroid hormones such as estrogen drive seasonal peripheral auditory plasticity in female Porichthys in order to better encode the male's advertisement call. However, little is known of the neural substrates that underlie the motivation and coordinated behavioral response to auditory social signals. Catecholamines, which include dopamine and noradrenaline, are good candidates for this function, as they are thought to modulate the salience of and reinforce appropriate behavior to socially relevant stimuli. This chapter summarizes our recent studies which aimed to characterize catecholamine innervation in the central and peripheral auditory system of Porichthys as well as test the hypotheses that innervation of the auditory system is seasonally plastic and catecholaminergic neurons are activated in response to conspecific vocalizations. Of particular significance is the discovery of direct dopaminergic innervation of the saccule, the main hearing end organ, by neurons in the diencephalon, which also robustly innervate the cholinergic auditory efferent nucleus in the hindbrain. Seasonal changes in dopamine innervation in both these areas appear dependent on reproductive state in females and may ultimately function to modulate the sensitivity of the peripheral auditory system as an adaptation to the seasonally changing soundscape. Diencephalic dopaminergic neurons are indeed active in response to exposure to midshipman vocalizations and are in a perfect position to integrate the detection and appropriate motor response to conspecific acoustic signals for successful reproduction.
Filippi, Piera; Congdon, Jenna V; Hoang, John; Bowling, Daniel L; Reber, Stephan A; Pašukonis, Andrius; Hoeschele, Marisa; Ocklenburg, Sebastian; de Boer, Bart; Sturdy, Christopher B; Newen, Albert; Güntürkün, Onur
2017-07-26
Writing over a century ago, Darwin hypothesized that vocal expression of emotion dates back to our earliest terrestrial ancestors. If this hypothesis is true, we should expect to find cross-species acoustic universals in emotional vocalizations. Studies suggest that acoustic attributes of aroused vocalizations are shared across many mammalian species, and that humans can use these attributes to infer emotional content. But do these acoustic attributes extend to non-mammalian vertebrates? In this study, we asked human participants to judge the emotional content of vocalizations of nine vertebrate species representing three different biological classes-Amphibia, Reptilia (non-aves and aves) and Mammalia. We found that humans are able to identify higher levels of arousal in vocalizations across all species. This result was consistent across different language groups (English, German and Mandarin native speakers), suggesting that this ability is biologically rooted in humans. Our findings indicate that humans use multiple acoustic parameters to infer relative arousal in vocalizations for each species, but mainly rely on fundamental frequency and spectral centre of gravity to identify higher arousal vocalizations across species. These results suggest that fundamental mechanisms of vocal emotional expression are shared among vertebrates and could represent a homologous signalling system. © 2017 The Author(s).
Lee, Kyung Sook; Shin, Yee Jin; Yoo, Hee Jeong; Lee, Gui Jong; Ryu, Jeong; Son, Oweol; Cho, Sook Whan
2018-05-01
This study aimed to examine the development of socializing and emotional expressions through vocalizations and joint attention (JA) behaviors in Korean-speaking children with autism spectrum disorder (ASD), compared to those with developmental delay (DD). Video samples were collected from 28 toddlers with ASD and 18 age-matched toddlers with DD, and vocalizations were each coded in detail for the purpose of this retrospective research. In addition to some statistical analysis, Computerized Language Analysis was conducted to obtain the final results. Although they produced a higher number of vocalizations than the DD group, the ASD group did not engage in emotional or social interactions with their caretakers, whereas the DD group did. The children with ASD used more atypical vocalizations and socially unengaged vocalizations than the children with DD did. JA using vocalizations in the ASD group, in particular, was largely dyadic, with triadic types occurring at a significantly lower frequency than those in the DD group. Results from this study indicate the importance of assessing early vocalizations in toddlers with ASD, suggesting that some common symptoms of ASD, such as lack of typical, emotional, and social functions in early vocalizations, could be used to develop screening and intervention programs related to ASD. © Copyright: Yonsei University College of Medicine 2018.
Day, Nancy F; Kimball, Todd Haswell; Aamodt, Caitlin M; Heston, Jonathan B; Hilliard, Austin T; Xiao, Xinshu; White, Stephanie A
2018-01-01
Human speech is one of the few examples of vocal learning among mammals yet ~half of avian species exhibit this ability. Its neurogenetic basis is largely unknown beyond a shared requirement for FoxP2 in both humans and zebra finches. We manipulated FoxP2 isoforms in Area X, a song-specific region of the avian striatopallidum analogous to human anterior striatum, during a critical period for song development. We delineate, for the first time, unique contributions of each isoform to vocal learning. Weighted gene coexpression network analysis of RNA-seq data revealed gene modules correlated to singing, learning, or vocal variability. Coexpression related to singing was found in juvenile and adult Area X whereas coexpression correlated to learning was unique to juveniles. The confluence of learning and singing coexpression in juvenile Area X may underscore molecular processes that drive vocal learning in young zebra finches and, by analogy, humans. PMID:29360038
Laukkanen, Anne-Maria; Pulakka, Hannu; Alku, Paavo; Vilkman, Erkki; Hertegård, Stellan; Lindestad, Per-Ake; Larsson, Hans; Granqvist, Svante
2007-01-01
Vocal exercises that increase the vocal tract impedance are widely used in voice training and therapy. The present study applies a versatile methodology to investigate phonation during varying artificial extension of the vocal tract. Two males and one female phonated into a hard-walled plastic tube (phi 2 cm), whose physical length was randomly pair-wise changed between 30 cm, 60 cm and 100 cm. High-speed image (1900 f/sec) sequences of the vocal folds were obtained via a rigid endoscope. Acoustic and electroglottographic signals (EGG) were recorded. Oral pressure during shuttering of the tube was used to give an estimate of subglottic pressure (Psub). The only trend observed was that with the two longer tubes compared to the shortest one, fundamental frequency was lower, open time of the glottis shorter, and Psub higher. The results may partly reflect increased vocal tract impedance as such and partly the increased vocal effort to compensate for it. In other parameters there were individual differences in tube length-related changes, suggesting complexity of the coupling between supraglottic space and the glottis.
Communication modality sampling for a toddler with Angelman syndrome.
Hyppa Martin, Jolene; Reichle, Joe; Dimian, Adele; Chen, Mo
2013-10-01
Vocal, gestural, and graphic communication modes were implemented concurrently with a toddler with Angelman syndrome to identify the most efficiently learned communication mode to emphasize in an initial augmentative communication system. Symbols representing preferred objects were introduced in vocal, gestural, and graphic communication modes using an alternating treatment single-subject experimental design. Conventionally accepted prompting strategies were used to teach symbols in each communication mode. Because the learner did not vocally imitate, vocal mode intervention focused on increasing vocal frequency as an initial step. When graphic and gestural mode performances were compared, the learner most accurately produced requests in graphic mode (percentage of nonoverlapping data = 96). Given the lack of success in prompting vocal productions, a comparison between vocal and the other two communication modes was not made. A growing body of evidence suggests that concurrent modality sampling is a promising low-inference, data-driven procedure that can be used to inform selection of a communication mode(s) for initial emphasis with young children. Concurrent modality sampling can guide clinical decisions regarding the allocation of treatment resources to promote success in building an initial communicative repertoire.
LaZerte, Stefanie E.; Slabbekoorn, Hans; Otter, Ken A.
2016-01-01
Urban noise can interfere with avian communication through masking, but birds can reduce this interference by altering their vocalizations. Although several experimental studies indicate that birds can rapidly change their vocalizations in response to sudden increases in ambient noise, none have investigated whether this is a learned response that depends on previous exposure. Black-capped chickadees (Poecile atricapillus) change the frequency of their songs in response to both fluctuating traffic noise and experimental noise. We investigated whether these responses to fluctuating noise depend on familiarity with noise. We confirmed that males in noisy areas sang higher-frequency songs than those in quiet areas, but found that only males in already-noisy territories shifted songs upwards in immediate response to experimental noise. Unexpectedly, males in more quiet territories shifted songs downwards in response to experimental noise. These results suggest that chickadees may require prior experience with fluctuating noise to adjust vocalizations in such a way as to minimize masking. Thus, learning to cope may be an important part of adjusting to acoustic life in the city. PMID:27358372
LaZerte, Stefanie E; Slabbekoorn, Hans; Otter, Ken A
2016-06-29
Urban noise can interfere with avian communication through masking, but birds can reduce this interference by altering their vocalizations. Although several experimental studies indicate that birds can rapidly change their vocalizations in response to sudden increases in ambient noise, none have investigated whether this is a learned response that depends on previous exposure. Black-capped chickadees (Poecile atricapillus) change the frequency of their songs in response to both fluctuating traffic noise and experimental noise. We investigated whether these responses to fluctuating noise depend on familiarity with noise. We confirmed that males in noisy areas sang higher-frequency songs than those in quiet areas, but found that only males in already-noisy territories shifted songs upwards in immediate response to experimental noise. Unexpectedly, males in more quiet territories shifted songs downwards in response to experimental noise. These results suggest that chickadees may require prior experience with fluctuating noise to adjust vocalizations in such a way as to minimize masking. Thus, learning to cope may be an important part of adjusting to acoustic life in the city. © 2016 The Author(s).
An approach for automatic classification of grouper vocalizations with passive acoustic monitoring.
Ibrahim, Ali K; Chérubin, Laurent M; Zhuang, Hanqi; Schärer Umpierre, Michelle T; Dalgleish, Fraser; Erdol, Nurgun; Ouyang, B; Dalgleish, A
2018-02-01
Grouper, a family of marine fishes, produce distinct vocalizations associated with their reproductive behavior during spawning aggregation. These low frequencies sounds (50-350 Hz) consist of a series of pulses repeated at a variable rate. In this paper, an approach is presented for automatic classification of grouper vocalizations from ambient sounds recorded in situ with fixed hydrophones based on weighted features and sparse classifier. Group sounds were labeled initially by humans for training and testing various feature extraction and classification methods. In the feature extraction phase, four types of features were used to extract features of sounds produced by groupers. Once the sound features were extracted, three types of representative classifiers were applied to categorize the species that produced these sounds. Experimental results showed that the overall percentage of identification using the best combination of the selected feature extractor weighted mel frequency cepstral coefficients and sparse classifier achieved 82.7% accuracy. The proposed algorithm has been implemented in an autonomous platform (wave glider) for real-time detection and classification of group vocalizations.
Nelson, Danielle V; Klinck, Holger; Carbaugh-Rutland, Alexander; Mathis, Codey L; Morzillo, Anita T; Garcia, Tiffany S
2017-01-01
Loss of acoustic habitat due to anthropogenic noise is a key environmental stressor for vocal amphibian species, a taxonomic group that is experiencing global population declines. The Pacific chorus frog ( Pseudacris regilla ) is the most common vocal species of the Pacific Northwest and can occupy human-dominated habitat types, including agricultural and urban wetlands. This species is exposed to anthropogenic noise, which can interfere with vocalizations during the breeding season. We hypothesized that Pacific chorus frogs would alter the spatial and temporal structure of their breeding vocalizations in response to road noise, a widespread anthropogenic stressor. We compared Pacific chorus frog call structure and ambient road noise levels along a gradient of road noise exposures in the Willamette Valley, Oregon, USA. We used both passive acoustic monitoring and directional recordings to determine source level (i.e., amplitude or volume), dominant frequency (i.e., pitch), call duration, and call rate of individual frogs and to quantify ambient road noise levels. Pacific chorus frogs were unable to change their vocalizations to compensate for road noise. A model of the active space and time ("spatiotemporal communication") over which a Pacific chorus frog vocalization could be heard revealed that in high-noise habitats, spatiotemporal communication was drastically reduced for an individual. This may have implications for the reproductive success of this species, which relies on specific call repertoires to portray relative fitness and attract mates. Using the acoustic call parameters defined by this study (frequency, source level, call rate, and call duration), we developed a simplified model of acoustic communication space-time for this species. This model can be used in combination with models that determine the insertion loss for various acoustic barriers to define the impact of anthropogenic noise on the radius of communication in threatened species. Additionally, this model can be applied to other vocal taxonomic groups provided the necessary acoustic parameters are determined, including the frequency parameters and perception thresholds. Reduction in acoustic habitat by anthropogenic noise may emerge as a compounding environmental stressor for an already sensitive taxonomic group.
Kendall, Katherine A; Leonard, Rebecca J
2011-01-01
Up to one-third of patients presenting with adductor spasmodic dysphonia will have an associated vocal tremor. These patients may not respond fully to treatment using thyroarytenoid (TA) muscle botulinum toxin (Botox) injection. Treatment failures are attributed to the involvement of multiple muscle groups in the tremor. This study evaluates the results of combined interarytenoid (IA) and TA muscle Botox injection in a group of 27 patients with adductor spasmodic dysphonia and vocal tremor and in four patients with severe vocal tremor alone. Patient-satisfaction data were reviewed retrospectively. Pre- and postinjection acoustic data were collected prospectively. Acoustic measures of fundamental frequency and cycle-by-cycle variability in frequency (jitter) and intensity (shimmer) were obtained from 15 patients' sustained vowel productions. Measures were collected after TA muscle injection, alone, and after combined TA and IA (TA+IA) muscle injections. In addition, two experienced voice clinicians blindly assessed tremor severity from recordings made for each patient in the two conditions. Patients were also queried regarding their satisfaction with the results of the injections and whether they desired to continue receiving TA+IA treatment. Significant improvement in all acoustic measures except for % jitter was observed after the TA+IA muscle injections. Listeners identified voice samples after TA+IA muscle injections as demonstrating less tremor in 73% of the paired comparisons. Sixty-seven percent of the patients with spasmodic dysphonia and vocal tremor wished to continue to receive IA muscle injections. Only one patient with severe vocal tremor wished to continue with injections. The addition of an IA muscle Botox injection to the treatment of patients with a combination adductor spasmodic dysphonia and vocal tremor may improve voice outcomes. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Forlano, Paul M; Marchaterre, Margaret; Deitcher, David L; Bass, Andrew H
2010-02-15
Across all major vertebrate groups, androgen receptors (ARs) have been identified in neural circuits that shape reproductive-related behaviors, including vocalization. The vocal control network of teleost fishes presents an archetypal example of how a vertebrate nervous system produces social, context-dependent sounds. We cloned a partial cDNA of AR that was used to generate specific probes to localize AR expression throughout the central nervous system of the vocal plainfin midshipman fish (Porichthys notatus). In the forebrain, AR mRNA is abundant in proposed homologs of the mammalian striatum and amygdala, and in anterior and posterior parvocellular and magnocellular nuclei of the preoptic area, nucleus preglomerulosus, and posterior, ventral and anterior tuberal nuclei of the hypothalamus. Many of these nuclei are part of the known vocal and auditory circuitry in midshipman. The midbrain periaqueductal gray, an essential link between forebrain and hindbrain vocal circuitry, and the lateral line recipient nucleus medialis in the rostral hindbrain also express abundant AR mRNA. In the caudal hindbrain-spinal vocal circuit, high AR mRNA is found in the vocal prepacemaker nucleus and along the dorsal periphery of the vocal motor nucleus congruent with the known pattern of expression of aromatase-containing glial cells. Additionally, abundant AR mRNA expression is shown for the first time in the inner ear of a vertebrate. The distribution of AR mRNA strongly supports the role of androgens as modulators of behaviorally defined vocal, auditory, and neuroendocrine circuits in teleost fish and vertebrates in general. 2009 Wiley-Liss, Inc.
Tervo, Outi M; Christoffersen, Mads F; Simon, Malene; Miller, Lee A; Jensen, Frants H; Parks, Susan E; Madsen, Peter T
2012-01-01
The low-frequency, powerful vocalizations of blue and fin whales may potentially be detected by conspecifics across entire ocean basins. In contrast, humpback and bowhead whales produce equally powerful, but more complex broadband vocalizations composed of higher frequencies that suffer from higher attenuation. Here we evaluate the active space of high frequency song notes of bowhead whales (Balaena mysticetus) in Western Greenland using measurements of song source levels and ambient noise. Four independent, GPS-synchronized hydrophones were deployed through holes in the ice to localize vocalizing bowhead whales, estimate source levels and measure ambient noise. The song had a mean apparent source level of 185±2 dB rms re 1 µPa @ 1 m and a high mean centroid frequency of 444±48 Hz. Using measured ambient noise levels in the area and Arctic sound spreading models, the estimated active space of these song notes is between 40 and 130 km, an order of magnitude smaller than the estimated active space of low frequency blue and fin whale songs produced at similar source levels and for similar noise conditions. We propose that bowhead whales spatially compensate for their smaller communication range through mating aggregations that co-evolved with broadband song to form a complex and dynamic acoustically mediated sexual display.
Tervo, Outi M.; Christoffersen, Mads F.; Simon, Malene; Miller, Lee A.; Jensen, Frants H.; Parks, Susan E.; Madsen, Peter T.
2012-01-01
The low-frequency, powerful vocalizations of blue and fin whales may potentially be detected by conspecifics across entire ocean basins. In contrast, humpback and bowhead whales produce equally powerful, but more complex broadband vocalizations composed of higher frequencies that suffer from higher attenuation. Here we evaluate the active space of high frequency song notes of bowhead whales (Balaena mysticetus) in Western Greenland using measurements of song source levels and ambient noise. Four independent, GPS-synchronized hydrophones were deployed through holes in the ice to localize vocalizing bowhead whales, estimate source levels and measure ambient noise. The song had a mean apparent source level of 185±2 dB rms re 1 µPa @ 1 m and a high mean centroid frequency of 444±48 Hz. Using measured ambient noise levels in the area and Arctic sound spreading models, the estimated active space of these song notes is between 40 and 130 km, an order of magnitude smaller than the estimated active space of low frequency blue and fin whale songs produced at similar source levels and for similar noise conditions. We propose that bowhead whales spatially compensate for their smaller communication range through mating aggregations that co-evolved with broadband song to form a complex and dynamic acoustically mediated sexual display. PMID:23300591
Voice Range Profiles of Singing Students: The Effects of Training Duration and Institution.
Lycke, Hugo; Siupsinskiene, Nora
2016-01-01
The aim of the study was to assess differences in voice parameters measured by the physiological voice range profile (VRP) in groups of vocally healthy subjects differentiated by the duration of vocal training and the training institution. Six basic frequency- and intensity-related VRP parameters and the frequency dip of the register transition zone were determined from VRP recordings of 162 females studying in individual singing lessons (1st-5th level) in Dutch, Belgian, English, and French public or private training facilities. Sixty-seven nonsinging female students served as controls. Singing students in more advanced singing classes demonstrated a significantly greater frequency range, particularly at high frequencies, than did first-year students. Students with private training showed a significantly increased mean intensity range in comparison to those in group classes, while students with musical theater training exhibited significantly increased frequency- and intensity-related VRP parameters in comparison to the students with classical training. When compared to nonsingers, all singing student subgroups showed significant increases in all basic VRP parameters. However, the register transition parameter was not influenced by training duration or institution. Our study suggests that the extension of physiological vocal limits might depend on training duration and institution. © 2016 S. Karger AG, Basel.
Fluid-acoustic interactions and their impact on pathological voiced speech
NASA Astrophysics Data System (ADS)
Erath, Byron D.; Zanartu, Matias; Peterson, Sean D.; Plesniak, Michael W.
2011-11-01
Voiced speech is produced by vibration of the vocal fold structures. Vocal fold dynamics arise from aerodynamic pressure loadings, tissue properties, and acoustic modulation of the driving pressures. Recent speech science advancements have produced a physiologically-realistic fluid flow solver (BLEAP) capable of prescribing asymmetric intraglottal flow attachment that can be easily assimilated into reduced order models of speech. The BLEAP flow solver is extended to incorporate acoustic loading and sound propagation in the vocal tract by implementing a wave reflection analog approach for sound propagation based on the governing BLEAP equations. This enhanced physiological description of the physics of voiced speech is implemented into a two-mass model of speech. The impact of fluid-acoustic interactions on vocal fold dynamics is elucidated for both normal and pathological speech through linear and nonlinear analysis techniques. Supported by NSF Grant CBET-1036280.
Experimental analysis of the characteristics of artificial vocal folds.
Misun, Vojtech; Svancara, Pavel; Vasek, Martin
2011-05-01
Specialized literature presents a number of models describing the function of the vocal folds. In most of those models, an emphasis is placed on the air flowing through the glottis and, further, on the effect of the parameters of the air alone (its mass, speed, and so forth). The article focuses on the constructional definition of artificial vocal folds and their experimental analysis. The analysis is conducted for voiced source voice phonation and for the changing mean value of the subglottal pressure. The article further deals with the analysis of the pressure of the airflow through the vocal folds, which is cut (separated) into individual pulses by the vibrating vocal folds. The analysis results show that air pulse characteristics are relevant to voice generation, as they are produced by the flowing air and vibrating vocal folds. A number of artificial vocal folds have been constructed to date, and the aforementioned view of their phonation is confirmed by their analysis. The experiments have confirmed that man is able to consciously affect only two parameters of the source voice, that is, its fundamental frequency and voice intensity. The main forces acting on the vocal folds during phonation are as follows: subglottal air pressure and elastic and inertia forces of the vocal folds' structure. The correctness of the function of the artificial vocal folds is documented by the experimental verification of the spectra of several types of artificial vocal folds. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Hsiao, Chun-Jen; Hsu, Chih-Hsiang; Lin, Ching-Lung; Wu, Chung-Hsin; Jen, Philip Hung-Sun
2016-08-17
Although echolocating bats and other mammals share the basic design of laryngeal apparatus for sound production and auditory system for sound reception, they have a specialized laryngeal mechanism for ultrasonic sound emissions as well as a highly developed auditory system for processing species-specific sounds. Because the sounds used by bats for echolocation and rodents for communication are quite different, there must be differences in the central nervous system devoted to producing and processing species-specific sounds between them. The present study examines the difference in the relative size of several brain structures and expression of auditory-related and vocal-related proteins in the central nervous system of echolocation bats and rodents. Here, we report that bats using constant frequency-frequency-modulated sounds (CF-FM bats) and FM bats for echolocation have a larger volume of midbrain nuclei (inferior and superior colliculi) and cerebellum relative to the size of the brain than rodents (mice and rats). However, the former have a smaller volume of the cerebrum and olfactory bulb, but greater expression of otoferlin and forkhead box protein P2 than the latter. Although the size of both midbrain colliculi is comparable in both CF-FM and FM bats, CF-FM bats have a larger cerebrum and greater expression of otoferlin and forkhead box protein P2 than FM bats. These differences in brain structure and protein expression are discussed in relation to their biologically relevant sounds and foraging behavior.
Short term effect of hubble-bubble smoking on voice.
Hamdan, A-L; Sibai, A; Mahfoud, L; Oubari, D; Ashkar, J; Fuleihan, N
2011-05-01
To investigate the short term effect of hubble-bubble smoking on voice. Prospective study. Eighteen non-dysphonic subjects (seven men and 11 women) with a history of hubble-bubble smoking and no history of cigarette smoking underwent acoustic analysis and laryngeal video-stroboscopic examination before and 30 minutes after hubble-bubble smoking. On laryngeal video-stroboscopy, none of the subjects had vocal fold erythema either before or after smoking. Five patients had mild vocal fold oedema both before and after smoking. After smoking, there was a slight increase in the number of subjects with thick mucus between the vocal folds (six, vs four before smoking) and with vocal fold vessel dilation (two, vs one before smoking). Acoustic analysis indicated a drop in habitual pitch, fundamental frequency and voice turbulence index after smoking, and an increase in noise-to-harmonics ratio. Even 30 minutes of hubble-bubble smoking can cause a drop in vocal pitch and an increase in laryngeal secretions and vocal fold vasodilation.
Aerodynamic and acoustic effects of abrupt frequency changes in excised larynges.
Alipour, Fariborz; Finnegan, Eileen M; Scherer, Ronald C
2009-04-01
To determine the aerodynamic and acoustic effects due to a sudden change from chest to falsetto register or vice versa. It was hypothesized that the continuous change in subglottal pressure and flow rate alone (pressure-flow sweep [PFS]) can trigger a mode change in the canine larynx. Ten canine larynges were each mounted over a tapered tube that supplied pressurized, heated, and humidified air. Glottographic signals were recorded during each PFS experiment, during which airflow was increased in a gradual manner for a period of 20-30 s. Abrupt changes in fundamental frequency (F(0)) and mode of vibration occurred during the PFS in the passive larynx without any change in adduction or elongation. The lower frequency mode of oscillation of the vocal folds, perceptually identified as the chest register, had relatively large amplitude oscillation, significant vocal fold contact, a rich spectral content, and a relatively loud audio signal. The higher frequency mode of oscillation, perceptually identified as falsetto, had little or no vocal fold contact and a dominant first partial. Relatively abrupt F(0) changes also occurred for gradual adduction changes, with the chest register corresponding to greater adduction, falsetto to less adduction.
NASA Astrophysics Data System (ADS)
Mindach, Debrah; Thomas, Jeanette
2005-09-01
Automated underwater recordings taken during the austral breeding season of the Weddell seal (Leptonychotes weddellii) in Antarctica also provided data on the vocalizations of predators in the area; leopard seals (Hydrurga leptonyx) and killer whales (Orcinus orca). Weddell seals inhabit fast ice areas to give birth, mate, and molt. Near the end of the breeding season in December the fast ice often breaks out and the two pack ice predators are able to move near the Weddell seal colonies and prey on them, especially pups. Recordings were taken continuously for a 2.5-min period each hour from mid-October 1977 and late-January 1978 at Hutton Cliffs and South Turtle Rock Crack, in McMurdo Sound. The leopard seals increased their trill calls when killer whales came into the area as evidenced by an increase in their frequency-modulated squeak calls. Weddell seals decreased their vocalization rate dramatically (~10 sounds/min) compared to during the peak of the breeding season (~75 sounds/min). Perhaps by being quiet, Weddell seals do not attract predators to their area.
Isolating N400 as neural marker of vocal anger processing in 6-11-year old children.
Chronaki, Georgia; Broyd, Samantha; Garner, Matthew; Hadwin, Julie A; Thompson, Margaret J J; Sonuga-Barke, Edmund J S
2012-04-01
Vocal anger is a salient social signal serving adaptive functions in typical child development. Despite recent advances in the developmental neuroscience of emotion processing with regard to visual stimuli, little remains known about the neural correlates of vocal anger processing in childhood. This study represents the first attempt to isolate a neural marker of vocal anger processing in children using electrophysiological methods. We compared ERP wave forms during the processing of non-word emotional vocal stimuli in a population sample of 55 6-11-year-old typically developing children. Children listened to three types of stimuli expressing angry, happy, and neutral prosody and completed an emotion identification task with three response options (angry, happy and neutral/'ok'). A distinctive N400 component which was modulated by emotional content of vocal stimulus was observed in children over parietal and occipital scalp regions-amplitudes were significantly attenuated to angry compared to happy and neutral voices. Findings of the present study regarding the N400 are compatible with adult studies showing reduced N400 amplitudes to negative compared to neutral emotional stimuli. Implications for studies of the neural basis of vocal anger processing in children are discussed. Copyright © 2011 Elsevier Ltd. All rights reserved.
Voice Use Among Music Theory Teachers: A Voice Dosimetry and Self-Assessment Study.
Schiller, Isabel S; Morsomme, Dominique; Remacle, Angélique
2017-07-25
This study aimed (1) to investigate music theory teachers' professional and extra-professional vocal loading and background noise exposure, (2) to determine the correlation between vocal loading and background noise, and (3) to determine the correlation between vocal loading and self-evaluation data. Using voice dosimetry, 13 music theory teachers were monitored for one workweek. The parameters analyzed were voice sound pressure level (SPL), fundamental frequency (F0), phonation time, vocal loading index (VLI), and noise SPL. Spearman correlation was used to correlate vocal loading parameters (voice SPL, F0, and phonation time) and noise SPL. Each day, the subjects self-assessed their voice using visual analog scales. VLI and self-evaluation data were correlated using Spearman correlation. Vocal loading parameters and noise SPL were significantly higher in the professional than in the extra-professional environment. Voice SPL, phonation time, and female subjects' F0 correlated positively with noise SPL. VLI correlated with self-assessed voice quality, vocal fatigue, and amount of singing and speaking voice produced. Teaching music theory is a profession with high vocal demands. More background noise is associated with increased vocal loading and may indirectly increase the risk for voice disorders. Correlations between VLI and self-assessments suggest that these teachers are well aware of their vocal demands and feel their effect on voice quality and vocal fatigue. Visual analog scales seem to represent a useful tool for subjective vocal loading assessment and associated symptoms in these professional voice users. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The Interaction of Surface Hydration and Vocal Loading on Voice Measures.
Fujiki, Robert Brinton; Chapleau, Abigail; Sundarrajan, Anusha; McKenna, Victoria; Sivasankar, M Preeti
2017-03-01
Vocal loading tasks provide insight regarding the mechanisms underlying healthy laryngeal function. Determining the manner in which the larynx can most efficiently be loaded is a complex task. The goal of this study was to determine if vocal loading could be achieved in 30 minutes by altering phonatory mode. Owing to the fact that surface hydration facilitates efficient vocal fold oscillation, the effects of environmental humidity on vocal loading were also examined. This study also investigated whether the detrimental effects of vocal loading could be attenuated by increasing environmental humidity. Sixteen vocally healthy adults (8 men, 8 women) completed a 30-minute vocal loading task in low and moderate humidity. The order of humidities was counterbalanced across subjects. The vocal loading task consisted of reading with elevated pitch and pressed vocal quality and low pitch and pressed and/or raspy vocal quality in the presence of 65 dB ambient, multi-talker babble noise. Significant effects were observed for (1) cepstral peak prominence on soft sustained phonation at 10th and 80th pitches, (2) perceived phonatory effort, and (3) perceived tiredness ratings. No loading effects were observed for cepstral peak prominence on the rainbow passage, although fundamental frequency on the rainbow passage increased post loading. No main effect was observed for humidity. Following a 30-minute vocal loading task involving altering laryngeal vibratory mode in combination with increased volume. Also, moderate environmental humidity did not significantly attenuate the negative effects of loading. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Learned Vocal Variation Is Associated with Abrupt Cryptic Genetic Change in a Parrot Species Complex
Ribot, Raoul F. H.; Buchanan, Katherine L.; Endler, John A.; Joseph, Leo; Bennett, Andrew T. D.; Berg, Mathew L.
2012-01-01
Contact zones between subspecies or closely related species offer valuable insights into speciation processes. A typical feature of such zones is the presence of clinal variation in multiple traits. The nature of these traits and the concordance among clines are expected to influence whether and how quickly speciation will proceed. Learned signals, such as vocalizations in species having vocal learning (e.g. humans, many birds, bats and cetaceans), can exhibit rapid change and may accelerate reproductive isolation between populations. Therefore, particularly strong concordance among clines in learned signals and population genetic structure may be expected, even among continuous populations in the early stages of speciation. However, empirical evidence for this pattern is often limited because differences in vocalisations between populations are driven by habitat differences or have evolved in allopatry. We tested for this pattern in a unique system where we may be able to separate effects of habitat and evolutionary history. We studied geographic variation in the vocalizations of the crimson rosella (Platycercus elegans) parrot species complex. Parrots are well known for their life-long vocal learning and cognitive abilities. We analysed contact calls across a ca 1300 km transect encompassing populations that differed in neutral genetic markers and plumage colour. We found steep clinal changes in two acoustic variables (fundamental frequency and peak frequency position). The positions of the two clines in vocal traits were concordant with a steep cline in microsatellite-based genetic variation, but were discordant with the steep clines in mtDNA, plumage and habitat. Our study provides new evidence that vocal variation, in a species with vocal learning, can coincide with areas of restricted gene flow across geographically continuous populations. Our results suggest that traits that evolve culturally can be strongly associated with reduced gene flow between populations, and therefore may promote speciation, even in the absence of other barriers. PMID:23227179
Vocal power and pressure–flow relationships in excised tiger larynges
Titze, Ingo R.; Fitch, W. Tecumseh; Hunter, Eric J.; Alipour, Fariborz; Montequin, Douglas; Armstrong, Douglas L.; McGee, JoAnn; Walsh, Edward J.
2010-01-01
Despite the functional importance of loud, low-pitched vocalizations in big cats of the genus Panthera, little is known about the physics and physiology of the mechanisms producing such calls. We investigated laryngeal sound production in the laboratory using an excised-larynx setup combined with sound-level measurements and pressure–flow instrumentation. The larynges of five tigers (three Siberian or Amur, one generic non-pedigreed tiger with Bengal ancestry and one Sumatran), which had died of natural causes, were provided by Omaha's Henry Doorly Zoo over a five-year period. Anatomical investigation indicated the presence of both a rigid cartilaginous plate in the arytenoid portion of the glottis, and a vocal fold fused with a ventricular fold. Both of these features have been confusingly termed ‘vocal pads’ in the previous literature. We successfully induced phonation in all of these larynges. Our results showed that aerodynamic power in the glottis was of the order of 1.0 W for all specimens, acoustic power radiated (without a vocal tract) was of the order of 0.1 mW, and fundamental frequency ranged between 20 and 100 Hz when a lung pressure in the range of 0–2.0 kPa was applied. The mean glottal airflow increased to the order of 1.0 l s–1 per 1.0 kPa of pressure, which is predictable from scaling human and canine larynges by glottal length and vibrational amplitude. Phonation threshold pressure was remarkably low, on the order of 0.3 kPa, which is lower than for human and canine larynges phonated without a vocal tract. Our results indicate that a vocal fold length approximately three times greater than that of humans is predictive of the low fundamental frequency, and the extraordinarily flat and broad medial surface of the vocal folds is predictive of the low phonation threshold pressure. PMID:21037066
Vocal power and pressure-flow relationships in excised tiger larynges.
Titze, Ingo R; Fitch, W Tecumseh; Hunter, Eric J; Alipour, Fariborz; Montequin, Douglas; Armstrong, Douglas L; McGee, Joann; Walsh, Edward J
2010-11-15
Despite the functional importance of loud, low-pitched vocalizations in big cats of the genus Panthera, little is known about the physics and physiology of the mechanisms producing such calls. We investigated laryngeal sound production in the laboratory using an excised-larynx setup combined with sound-level measurements and pressure-flow instrumentation. The larynges of five tigers (three Siberian or Amur, one generic non-pedigreed tiger with Bengal ancestry and one Sumatran), which had died of natural causes, were provided by Omaha's Henry Doorly Zoo over a five-year period. Anatomical investigation indicated the presence of both a rigid cartilaginous plate in the arytenoid portion of the glottis, and a vocal fold fused with a ventricular fold. Both of these features have been confusingly termed 'vocal pads' in the previous literature. We successfully induced phonation in all of these larynges. Our results showed that aerodynamic power in the glottis was of the order of 1.0 W for all specimens, acoustic power radiated (without a vocal tract) was of the order of 0.1 mW, and fundamental frequency ranged between 20 and 100 Hz when a lung pressure in the range of 0-2.0 kPa was applied. The mean glottal airflow increased to the order of 1.0 l s(-1) per 1.0 kPa of pressure, which is predictable from scaling human and canine larynges by glottal length and vibrational amplitude. Phonation threshold pressure was remarkably low, on the order of 0.3 kPa, which is lower than for human and canine larynges phonated without a vocal tract. Our results indicate that a vocal fold length approximately three times greater than that of humans is predictive of the low fundamental frequency, and the extraordinarily flat and broad medial surface of the vocal folds is predictive of the low phonation threshold pressure.
Fernández-Vargas, Marcela; Johnston, Robert E
2015-01-01
Vocal signaling is one of many behaviors that animals perform during social interactions. Vocalizations produced by both sexes before mating can communicate sex, identity and condition of the caller. Adult golden hamsters produce ultrasonic vocalizations (USV) after intersexual contact. To determine whether these vocalizations are sexually dimorphic, we analyzed the vocal repertoire for sex differences in: 1) calling rates, 2) composition (structural complexity, call types and nonlinear phenomena) and 3) acoustic structure. In addition, we examined it for individual variation in the calls. The vocal repertoire was mainly composed of 1-note simple calls and at least half of them presented some degree of deterministic chaos. The prevalence of this nonlinear phenomenon was confirmed by low values of harmonic-to-noise ratio for most calls. We found modest sexual differences between repertoires. Males were more likely than females to produce tonal and less chaotic calls, as well as call types with frequency jumps. Multivariate analysis of the acoustic features of 1-note simple calls revealed significant sex differences in the second axis represented mostly by entropy and bandwidth parameters. Male calls showed lower entropy and inter-quartile bandwidth than female calls. Because the variation of acoustic structure within individuals was higher than among individuals, USV could not be reliably assigned to the correct individual. Interestingly, however, this high variability, augmented by the prevalence of chaos and frequency jumps, could be the result of increased vocal effort. Hamsters motivated to produce high calling rates also produced longer calls of broader bandwidth. Thus, the sex differences found could be the result of different sex preferences but also of a sex difference in calling motivation or condition. We suggest that variable and complex USV may have been selected to increase responsiveness of a potential mate by communicating sexual arousal and preventing habituation to the caller.
Modulation-Frequency-Specific Adaptation in Awake Auditory Cortex
Beitel, Ralph E.; Vollmer, Maike; Heiser, Marc A.; Schreiner, Christoph E.
2015-01-01
Amplitude modulations are fundamental features of natural signals, including human speech and nonhuman primate vocalizations. Because natural signals frequently occur in the context of other competing signals, we used a forward-masking paradigm to investigate how the modulation context of a prior signal affects cortical responses to subsequent modulated sounds. Psychophysical “modulation masking,” in which the presentation of a modulated “masker” signal elevates the threshold for detecting the modulation of a subsequent stimulus, has been interpreted as evidence of a central modulation filterbank and modeled accordingly. Whether cortical modulation tuning is compatible with such models remains unknown. By recording responses to pairs of sinusoidally amplitude modulated (SAM) tones in the auditory cortex of awake squirrel monkeys, we show that the prior presentation of the SAM masker elicited persistent and tuned suppression of the firing rate to subsequent SAM signals. Population averages of these effects are compatible with adaptation in broadly tuned modulation channels. In contrast, modulation context had little effect on the synchrony of the cortical representation of the second SAM stimuli and the tuning of such effects did not match that observed for firing rate. Our results suggest that, although the temporal representation of modulated signals is more robust to changes in stimulus context than representations based on average firing rate, this representation is not fully exploited and psychophysical modulation masking more closely mirrors physiological rate suppression and that rate tuning for a given stimulus feature in a given neuron's signal pathway appears sufficient to engender context-sensitive cortical adaptation. PMID:25878263
Receiver bias and the acoustic ecology of aye-ayes (Daubentonia madagascariensis).
Ramsier, Marissa A; Dominy, Nathaniel J
2012-11-01
The aye-aye is a rare lemur from Madagascar that uses its highly specialized middle digit for percussive foraging. This acoustic behavior, also termed tap-scanning, produces dominant frequencies between 6 and 15 kHz. An enhanced auditory sensitivity to these frequencies raises the possibility that the acoustic and auditory specializations of aye-ayes have imposed constraints on the evolution of their vocal signals, especially their primary long-distance vocalization, the screech. Here we explore this concept, termed receiver bias, and suggest that the dominant frequency of the screech call (~2.7 kHz) represents an evolutionary compromise between the opposing adaptive advantages of long-distance sound propagation and enhanced detection by conspecific receivers.
Current Understanding and Future Directions for Vocal Fold Mechanobiology
Li, Nicole Y.K.; Heris, Hossein K.; Mongeau, Luc
2013-01-01
The vocal folds, which are located in the larynx, are the main organ of voice production for human communication. The vocal folds are under continuous biomechanical stress similar to other mechanically active organs, such as the heart, lungs, tendons and muscles. During speech and singing, the vocal folds oscillate at frequencies ranging from 20 Hz to 3 kHz with amplitudes of a few millimeters. The biomechanical stress associated with accumulated phonation is believed to alter vocal fold cell activity and tissue structure in many ways. Excessive phonatory stress can damage tissue structure and induce a cell-mediated inflammatory response, resulting in a pathological vocal fold lesion. On the other hand, phonatory stress is one major factor in the maturation of the vocal folds into a specialized tri-layer structure. One specific form of vocal fold oscillation, which involves low impact and large amplitude excursion, is prescribed therapeutically for patients with mild vocal fold injuries. Although biomechanical forces affect vocal fold physiology and pathology, there is little understanding of how mechanical forces regulate these processes at the cellular and molecular level. Research into vocal fold mechanobiology has burgeoned over the past several years. Vocal fold bioreactors are being developed in several laboratories to provide a biomimic environment that allows the systematic manipulation of physical and biological factors on the cells of interest in vitro. Computer models have been used to simulate the integrated response of cells and proteins as a function of phonation stress. The purpose of this paper is to review current research on the mechanobiology of the vocal folds as it relates to growth, pathogenesis and treatment as well as to propose specific research directions that will advance our understanding of this subject. PMID:24812638
Paternal kin recognition in the high frequency / ultrasonic range in a solitary foraging mammal
2012-01-01
Background Kin selection is a driving force in the evolution of mammalian social complexity. Recognition of paternal kin using vocalizations occurs in taxa with cohesive, complex social groups. This is the first investigation of paternal kin recognition via vocalizations in a small-brained, solitary foraging mammal, the grey mouse lemur (Microcebus murinus), a frequent model for ancestral primates. We analyzed the high frequency/ultrasonic male advertisement (courtship) call and alarm call. Results Multi-parametric analyses of the calls’ acoustic parameters and discriminant function analyses showed that advertisement calls, but not alarm calls, contain patrilineal signatures. Playback experiments controlling for familiarity showed that females paid more attention to advertisement calls from unrelated males than from their fathers. Reactions to alarm calls from unrelated males and fathers did not differ. Conclusions 1) Findings provide the first evidence of paternal kin recognition via vocalizations in a small-brained, solitarily foraging mammal. 2) High predation, small body size, and dispersed social systems may select for acoustic paternal kin recognition in the high frequency/ultrasonic ranges, thus limiting risks of inbreeding and eavesdropping by predators or conspecific competitors. 3) Paternal kin recognition via vocalizations in mammals is not dependent upon a large brain and high social complexity, but may already have been an integral part of the dispersed social networks from which more complex, kin-based sociality emerged. PMID:23198727
Roy, Sabyasachi; Zhao, Lingyun; Wang, Xiaoqin
2016-11-30
Although evidence from human studies has long indicated the crucial role of the frontal cortex in speech production, it has remained uncertain whether the frontal cortex in nonhuman primates plays a similar role in vocal communication. Previous studies of prefrontal and premotor cortices of macaque monkeys have found neural signals associated with cue- and reward-conditioned vocal production, but not with self-initiated or spontaneous vocalizations (Coudé et al., 2011; Hage and Nieder, 2013), which casts doubt on the role of the frontal cortex of the Old World monkeys in vocal communication. A recent study of marmoset frontal cortex observed modulated neural activities associated with self-initiated vocal production (Miller et al., 2015), but it did not delineate whether these neural activities were specifically attributed to vocal production or if they may result from other nonvocal motor activity such as orofacial motor movement. In the present study, we attempted to resolve these issues and examined single neuron activities in premotor cortex during natural vocal exchanges in the common marmoset (Callithrix jacchus), a highly vocal New World primate. Neural activation and suppression were observed both before and during self-initiated vocal production. Furthermore, by comparing neural activities between self-initiated vocal production and nonvocal orofacial motor movement, we identified a subpopulation of neurons in marmoset premotor cortex that was activated or suppressed by vocal production, but not by orofacial movement. These findings provide clear evidence of the premotor cortex's involvement in self-initiated vocal production in natural vocal behaviors of a New World primate. Human frontal cortex plays a crucial role in speech production. However, it has remained unclear whether the frontal cortex of nonhuman primates is involved in the production of self-initiated vocalizations during natural vocal communication. Using a wireless multichannel neural recording technique, we observed in the premotor cortex neural activation and suppression both before and during self-initiated vocalizations when marmosets, a highly vocal New World primate species, engaged in vocal exchanges with conspecifics. A novel finding of the present study is the discovery of a subpopulation of premotor cortex neurons that was activated by vocal production, but not by orofacial movement. These observations provide clear evidence of the premotor cortex's involvement in vocal production in a New World primate species. Copyright © 2016 the authors 0270-6474/16/3612168-12$15.00/0.
Network models of frequency modulated sweep detection.
Skorheim, Steven; Razak, Khaleel; Bazhenov, Maxim
2014-01-01
Frequency modulated (FM) sweeps are common in species-specific vocalizations, including human speech. Auditory neurons selective for the direction and rate of frequency change in FM sweeps are present across species, but the synaptic mechanisms underlying such selectivity are only beginning to be understood. Even less is known about mechanisms of experience-dependent changes in FM sweep selectivity. We present three network models of synaptic mechanisms of FM sweep direction and rate selectivity that explains experimental data: (1) The 'facilitation' model contains frequency selective cells operating as coincidence detectors, summing up multiple excitatory inputs with different time delays. (2) The 'duration tuned' model depends on interactions between delayed excitation and early inhibition. The strength of delayed excitation determines the preferred duration. Inhibitory rebound can reinforce the delayed excitation. (3) The 'inhibitory sideband' model uses frequency selective inputs to a network of excitatory and inhibitory cells. The strength and asymmetry of these connections results in neurons responsive to sweeps in a single direction of sufficient sweep rate. Variations of these properties, can explain the diversity of rate-dependent direction selectivity seen across species. We show that the inhibitory sideband model can be trained using spike timing dependent plasticity (STDP) to develop direction selectivity from a non-selective network. These models provide a means to compare the proposed synaptic and spectrotemporal mechanisms of FM sweep processing and can be utilized to explore cellular mechanisms underlying experience- or training-dependent changes in spectrotemporal processing across animal models. Given the analogy between FM sweeps and visual motion, these models can serve a broader function in studying stimulus movement across sensory epithelia.
Neck Circumference and Vocal Parameters in Women Before and After Bariatric Surgery.
de Souza, Lourdes Bernadete Rocha; Pernambuco, Leandro de Araújo; dos Santos, Marquiony Marques; Pereira, Rayane Medeiros
2016-03-01
Morbidly obese patients may suffer from vocal disorders, as vocal production is directly related to the volume of the vocal tract, and the large-scale accumulation of fat in this region may interfere with voice production. The aim of this study was to analyze the neck circumference, fundamental frequency, and maximum phonation time of a group of morbidly obese women before and after bariatric surgery. An observational, longitudinal, and descriptive study was performed with patients of the Obesity and Related Diseases Surgery Unit of a university hospital. A total of 21 morbidly obese women aged 28-68 years, with a mean age of 41.33 years, participated in the study. Neck circumference was measured using a tape measure. To obtain fundamental frequency values, the patient was asked to produce the vowel [a] at normal intensity and pitch for an average period of 3 s. After recording, the participants were asked to produce the sustained vowels [a], [i], and [u] at normal intensity and pitch, with a stopwatch used to measure maximum phonation time. Eight months after surgery, patients were reassessed using the same data collecting procedures as were carried out prior to surgery. After surgery, there was an increase in the average value of fundamental frequency and maximum phonation time for all the vowels and a reduction in neck circumference. The differences were statistically significant. Weight reduction and a consequent decrease in neck circumference affected the changes in maximum phonation time and fundamental frequency values in the voices of these patients, after weight loss.
Over-vibration induced blood perfusion and vascular permeability changes may lead to vocal edema.
Wang, Jiajia; Devine, Erin; Fang, Rui; Jiang, Jack J
2017-01-01
To observe blood perfusion and vascular permeability changes under varying vibration frequency exposures. Animal model. Blood perfusion was measured using laser Doppler flowmetry in eight rabbit auricular vessels (four rabbits) under nonvibration, and 62.5-Hz/1-mm, 125-Hz/1-mm, and 250-Hz/0.5-mm vibration frequency/amplitude exposures. Another 12 rabbits were randomly divided into vibration only and vibration with histamine groups. After 3 hours of continuous 125-Hz, 1-mm amplitude vibration of the auricle, vascular permeability was analyzed by absorbance of Evans blue-albumin complex. Significantly lower blood perfusion was observed in the vibration group, compared with no vibration exposure controls. Blood perfusion decreased 29 ± 16% as the vibration frequency was increased from 62.5 Hz to 125 Hz with the vibration amplitude constant at 1 mm. When the frequency was increased from 125 Hz to 250 Hz, while the amplitude was decreased from 1 mm to 0.5 mm, blood flow perfusion further decreased 29 ± 29%, and the decline tendency in blood perfusion showed no significant difference (P = .992). Meanwhile, in the vibration with histamine group, vascular permeability of the vibrated ears increased significantly compared to the nonvibrated ears (P = .005). Overvibration of the vocal folds due to voice overuse or abuse may significantly reduce blood perfusion, and increase vascular permeability in the vocal fold in inflammatory situations, which may lead to the formation of vocal edema. NA Laryngoscope, 127:148-152, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Kleber, Boris; Zeitouni, Anthony G; Friberg, Anders; Zatorre, Robert J
2013-04-03
Somatosensation plays an important role in the motor control of vocal functions, yet its neural correlate and relation to vocal learning is not well understood. We used fMRI in 17 trained singers and 12 nonsingers to study the effects of vocal-fold anesthesia on the vocal-motor singing network as a function of singing expertise. Tasks required participants to sing musical target intervals under normal conditions and after anesthesia. At the behavioral level, anesthesia altered pitch accuracy in both groups, but singers were less affected than nonsingers, indicating an experience-dependent effect of the intervention. At the neural level, this difference was accompanied by distinct patterns of decreased activation in singers (cortical and subcortical sensory and motor areas) and nonsingers (subcortical motor areas only) respectively, suggesting that anesthesia affected the higher-level voluntary (explicit) motor and sensorimotor integration network more in experienced singers, and the lower-level (implicit) subcortical motor loops in nonsingers. The right anterior insular cortex (AIC) was identified as the principal area dissociating the effect of expertise as a function of anesthesia by three separate sources of evidence. First, it responded differently to anesthesia in singers (decreased activation) and nonsingers (increased activation). Second, functional connectivity between AIC and bilateral A1, M1, and S1 was reduced in singers but augmented in nonsingers. Third, increased BOLD activity in right AIC in singers was correlated with larger pitch deviation under anesthesia. We conclude that the right AIC and sensory-motor areas play a role in experience-dependent modulation of feedback integration for vocal motor control during singing.
Valentinuzzi, Veronica S.; Zufiaurre, Emmanuel
2016-01-01
The underground environment poses particular communication challenges for subterranean rodents. Some loud and low-pitched acoustic signals that can travel long distances are appropriate for long-range underground communication and have been suggested to be territorial signals. Long-range vocalizations (LRVs) are important in long-distance communication in Ctenomys tuco-tucos. We characterized the LRV of the Anillaco Tuco-Tuco (Ctenomys sp.) using recordings from free-living individuals and described the behavioral context in which this vocalization was produced during laboratory staged encounters between individuals of both sexes. Long-range calls of Anillaco tuco-tucos are low-frequency, broad-band, loud, and long sounds composed by the repetition of two syllable types: series (formed by notes and soft-notes) and individual notes. All vocalizations were initiated with series, but not all had individual notes. Males were heavier than females and gave significantly lower-pitched vocalizations, but acoustic features were independent of body mass in males. The pronounced variation among individuals in the arrangement and number of syllables and the existence of three types of series (dyads, triads, and tetrads), created a diverse collection of syntactic patterns in vocalizations that would provide the opportunity to encode multiple types of information. The existence of complex syntactic patterns and the description of soft-notes represent new aspects of the vocal communication of Ctenomys. Long-distance vocalizations by Anillaco Tuco-Tucos appear to be territorial signals used mostly in male-male interactions. First, emission of LRVs resulted in de-escalation or space-keeping in male-male and male-female encounters in laboratory experiments. Second, these vocalizations were produced most frequently (in the field and in the lab) by males in our study population. Third, males produced LRVs with greater frequency during male-male encounters compared to male-female encounters. Finally, males appear to have larger home ranges that were more spatially segregated than those of females, suggesting that males may have greater need for long-distance signals that advertise their presence. Due to their apparent rarity, the function and acoustic features of LRV in female tuco-tucos remain inadequately known. PMID:27761344
A Bayesian Account of Vocal Adaptation to Pitch-Shifted Auditory Feedback
Hahnloser, Richard H. R.
2017-01-01
Motor systems are highly adaptive. Both birds and humans compensate for synthetically induced shifts in the pitch (fundamental frequency) of auditory feedback stemming from their vocalizations. Pitch-shift compensation is partial in the sense that large shifts lead to smaller relative compensatory adjustments of vocal pitch than small shifts. Also, compensation is larger in subjects with high motor variability. To formulate a mechanistic description of these findings, we adapt a Bayesian model of error relevance. We assume that vocal-auditory feedback loops in the brain cope optimally with known sensory and motor variability. Based on measurements of motor variability, optimal compensatory responses in our model provide accurate fits to published experimental data. Optimal compensation correctly predicts sensory acuity, which has been estimated in psychophysical experiments as just-noticeable pitch differences. Our model extends the utility of Bayesian approaches to adaptive vocal behaviors. PMID:28135267
Effects of Masking Noise on Laryngeal Resistance for Breathy, Normal, and Pressed Voice
ERIC Educational Resources Information Center
Grillo, Elizabeth U.; Abbott, Katherine Verdolini; Lee, Timothy D.
2010-01-01
Purpose: The purpose of the present study was to explore the effects of masking noise on laryngeal resistance for breathy, normal, and pressed voice in vocally trained women. Method: Eighteen vocally trained women produced breathy, normal, and pressed voice across 7 fundamental frequencies during a repeated CV utterance of /pi/ under normal and…
Spatial location influences vocal interactions in bullfrog choruses
Bates, Mary E.; Cropp, Brett F.; Gonchar, Marina; Knowles, Jeffrey; Simmons, James A.; Simmons, Andrea Megela
2010-01-01
A multiple sensor array was employed to identify the spatial locations of all vocalizing male bullfrogs (Rana catesbeiana) in five natural choruses. Patterns of vocal activity collected with this array were compared with computer simulations of chorus activity. Bullfrogs were not randomly spaced within choruses, but tended to cluster into closely spaced groups of two to five vocalizing males. There were nonrandom, differing patterns of vocal interactions within clusters of closely spaced males and between different clusters. Bullfrogs located within the same cluster tended to overlap or alternate call notes with two or more other males in that cluster. These near-simultaneous calling bouts produced advertisement calls with more pronounced amplitude modulation than occurred in nonoverlapping notes or calls. Bullfrogs located in different clusters more often alternated entire calls or overlapped only small segments of their calls. They also tended to respond sequentially to calls of their farther neighbors compared to their nearer neighbors. Results of computational analyses showed that the observed patterns of vocal interactions were significantly different than expected based on random activity. The use of a multiple sensor array provides a richer view of the dynamics of choruses than available based on single microphone techniques. PMID:20370047
Responses of male cricket frogs (Acris crepitans) to attenuated and degraded advertisement calls.
Venator, Kurt R; Ryan, Michael J; Wilczynski, Walter
2017-05-01
We examined the vocal and non-vocal responses of male cricket frogs ( Acris crepitans ) to conspecific advertisement calls that had been attenuated or degraded by reducing the depth of amplitude modulation (AM). Both are characteristic of changes to the call as it is transmitted through natural habitats. As stimulus calls became more intense or less degraded, male cricket frogs gradually decreased their call rate and increased the number of call groups and pulse groups in their calls, changes indicative of increased aggressive interactions. At the higher intensities and lower degradation levels, the probability that males would shift to one of two non-vocal behavioral responses, attacking the perceived intruder or ceasing calling and abandoning the call site, gradually increased. The results show that differences in signal attenuation and AM degradation levels are perceived by males and trigger both vocal and non-vocal behavioral responses consistent with their use in evaluating the distance to a challenging male. Furthermore, the results indicate that the male responses are graded, increasing as intensity rises and degradation falls, and hierarchical, with vocal responses preceding behavioral responses over the range of intensities and degradation levels presented.
Responses of male cricket frogs (Acris crepitans) to attenuated and degraded advertisement calls
Venator, Kurt R.; Ryan, Michael J.; Wilczynski, Walter
2017-01-01
We examined the vocal and non-vocal responses of male cricket frogs (Acris crepitans) to conspecific advertisement calls that had been attenuated or degraded by reducing the depth of amplitude modulation (AM). Both are characteristic of changes to the call as it is transmitted through natural habitats. As stimulus calls became more intense or less degraded, male cricket frogs gradually decreased their call rate and increased the number of call groups and pulse groups in their calls, changes indicative of increased aggressive interactions. At the higher intensities and lower degradation levels, the probability that males would shift to one of two non-vocal behavioral responses, attacking the perceived intruder or ceasing calling and abandoning the call site, gradually increased. The results show that differences in signal attenuation and AM degradation levels are perceived by males and trigger both vocal and non-vocal behavioral responses consistent with their use in evaluating the distance to a challenging male. Furthermore, the results indicate that the male responses are graded, increasing as intensity rises and degradation falls, and hierarchical, with vocal responses preceding behavioral responses over the range of intensities and degradation levels presented. PMID:28966421
A model of acoustic interspeaker variability based on the concept of formant-cavity affiliation
NASA Astrophysics Data System (ADS)
Apostol, Lian; Perrier, Pascal; Bailly, Gérard
2004-01-01
A method is proposed to model the interspeaker variability of formant patterns for oral vowels. It is assumed that this variability originates in the differences existing among speakers in the respective lengths of their front and back vocal-tract cavities. In order to characterize, from the spectral description of the acoustic speech signal, these vocal-tract differences between speakers, each formant is interpreted, according to the concept of formant-cavity affiliation, as a resonance of a specific vocal-tract cavity. Its frequency can thus be directly related to the corresponding cavity length, and a transformation model can be proposed from a speaker A to a speaker B on the basis of the frequency ratios of the formants corresponding to the same resonances. In order to minimize the number of sounds to be recorded for each speaker in order to carry out this speaker transformation, the frequency ratios are exactly computed only for the three extreme cardinal vowels [eye, aye, you] and they are approximated for the remaining vowels through an interpolation function. The method is evaluated through its capacity to transform the (F1,F2) formant patterns of eight oral vowels pronounced by five male speakers into the (F1,F2) patterns of the corresponding vowels generated by an articulatory model of the vocal tract. The resulting formant patterns are compared to those provided by normalization techniques published in the literature. The proposed method is found to be efficient, but a number of limitations are also observed and discussed. These limitations can be associated with the formant-cavity affiliation model itself or with a possible influence of speaker-specific vocal-tract geometry in the cross-sectional direction, which the model might not have taken into account.
Vocal Parameters of Elderly Female Choir Singers
Aquino, Fernanda Salvatico de; Ferreira, Léslie Piccolotto
2015-01-01
Introduction Due to increased life expectancy among the population, studying the vocal parameters of the elderly is key to promoting vocal health in old age. Objective This study aims to analyze the profile of the extension of speech of elderly female choristers, according to age group. Method The study counted on the participation of 25 elderly female choristers from the Choir of Messianic Church of São Paulo, with ages varying between 63 and 82 years, and an average of 71 years (standard deviation of 5.22). The elders were divided into two groups: G1 aged 63 to 71 years and G2 aged 72 to 82. We asked that each participant count from 20 to 30 in weak, medium, strong, and very strong intensities. Their speech was registered by the software Vocalgrama that allows the evaluation of the profile of speech range. We then submitted the parameters of frequency and intensity to descriptive analysis, both in minimum and maximum levels, and range of spoken voice. Results The average of minimum and maximum frequencies were respectively 134.82–349.96 Hz for G1 and 137.28–348.59 Hz for G2; the average for minimum and maximum intensities were respectively 40.28–95.50 dB for G1 and 40.63–94.35 dB for G2; the vocal range used in speech was 215.14 Hz for G1 and 211.30 Hz for G2. Conclusion The minimum and maximum frequencies, maximum intensity, and vocal range presented differences in favor of the younger elder group. PMID:26722341
How low can you go? Physical production mechanism of elephant infrasonic vocalizations.
Herbst, Christian T; Stoeger, Angela S; Frey, Roland; Lohscheller, Jörg; Titze, Ingo R; Gumpenberger, Michaela; Fitch, W Tecumseh
2012-08-03
Elephants can communicate using sounds below the range of human hearing ("infrasounds" below 20 hertz). It is commonly speculated that these vocalizations are produced in the larynx, either by neurally controlled muscle twitching (as in cat purring) or by flow-induced self-sustained vibrations of the vocal folds (as in human speech and song). We used direct high-speed video observations of an excised elephant larynx to demonstrate flow-induced self-sustained vocal fold vibration in the absence of any neural signals, thus excluding the need for any "purring" mechanism. The observed physical principles of voice production apply to a wide variety of mammals, extending across a remarkably large range of fundamental frequencies and body sizes, spanning more than five orders of magnitude.
THE EFFECTS OF MATCHED STIMULATION AND RESPONSE INTERRUPTION AND REDIRECTION ON VOCAL STEREOTYPY
Love, Jessica J; Miguel, Caio F; Fernand, Jonathan K; LaBrie, Jillian K
2012-01-01
Stereotypy has been classified as repetitive behavior that does not serve any apparent function. Two procedures that have been found to reduce rates of vocal stereotypy effectively are response interruption and redirection (RIRD) and noncontingent access to matched stimulation (MS). The purpose of the current study was to evaluate the effects of RIRD alone, MS alone, and MS combined with RIRD. One participant's results suggested similar suppressive effects on vocal stereotypy across treatment conditions. For the second participant, a slightly greater suppression of stereotypy was associated with MS + RIRD. In addition, both participants emitted a greater frequency of appropriate vocalizations in conditions with RIRD. Data suggest that the addition of MS might facilitate the implementation of RIRD in applied settings. PMID:23060668
Maternal Vocal Feedback to 9-Month-Old Infant Siblings of Children with ASD
Talbott, Meagan R.; Nelson, Charles A.; Tager-Flusberg, Helen
2016-01-01
Infant siblings of children with autism spectrum disorder display differences in early language and social communication skills beginning as early as the first year of life. While environmental influences on early language development are well documented in other infant populations, they have received relatively little attention inside of the infant sibling context. In this study, we analyzed home video diaries collected prospectively as part of a longitudinal study of infant siblings. Infant vowel and consonant-vowel vocalizations and maternal language-promoting and non-promoting verbal responses were scored for 30 infant siblings and 30 low risk control infants at 9 months of age. Analyses evaluated whether infant siblings or their mothers exhibited differences from low risk dyads in vocalization frequency or distribution, and whether mothers’ responses were associated with other features of the high risk context. Analyses were conducted with respect to both initial risk group and preliminary outcome classification. Overall, we found no differences in infants’ consonant-vowel vocalizations, the frequency of overall maternal utterances, or the distribution of mothers’ response types. Both groups of infants produced more vowel than consonant-vowel vocalizations, and both groups of mothers responded to consonant-vowel vocalizations with more language-promoting than non-promoting responses. These results indicate that as a group, mothers of high risk infants provide equally high quality linguistic input to their infants in the first year of life and suggest that impoverished maternal linguistic input does not contribute to high risk infants’ initial language difficulties. Implications for intervention strategies are also discussed. PMID:26174704
Kim, Keun Ho; Ku, Boncho; Kang, Namsik; Kim, Young-Su; Jang, Jun-Su; Kim, Jong Yeol
2012-01-01
The voice has been used to classify the four constitution types, and to recognize a subject's health condition by extracting meaningful physical quantities, in traditional Korean medicine. In this paper, we propose a method of selecting the reliable variables from various voice features, such as frequency derivative features, frequency band ratios, and intensity, from vowels and a sentence. Further, we suggest a process to extract independent variables by eliminating explanatory variables and reducing their correlation and remove outlying data to enable reliable discriminant analysis. Moreover, the suitable division of data for analysis, according to the gender and age of subjects, is discussed. Finally, the vocal features are applied to a discriminant analysis to classify each constitution type. This method of voice classification can be widely used in the u-Healthcare system of personalized medicine and for improving diagnostic accuracy. PMID:22529874
Flow-structure interaction simulation of voice production in a canine larynx
NASA Astrophysics Data System (ADS)
Jiang, Weili; Zheng, Xudong; Xue, Qian; Oren, Liran; Khosla, Sid
2017-11-01
Experimental measurements conducted on a hemi-larynx canine vocal fold showed that negative pressures formed in the glottis near the superior surface of the vocal fold in the closing phase even without a supra-glottal vocal tract. It was hypothesized that such negative pressures were due to intraglottal vortices caused by flow separation in a divergent vocal tract during vocal fold closing phase. This work aims to test this hypothesis from the numerical aspect. Flow-structure interaction simulations are performed in realistic canine laryngeal shapes. In the simulations, a sharp interface immersed boundary method based incompressible flow solver is utilized to model the air flow; a finite element based solid mechanics solver is utilized to model the vocal fold vibration. The geometric structure of the vocal fold and vocal tract are based on MRI scans of a mongrel canine. The vocal fold tissue is modeled as transversely isotropic nonlinear materials with a vertical stiffness gradient. Numerical indentation is first performed and compared with the experiment data to obtain the material properties. Simulation setup about the inlet and outlet pressure follows the setup in the experiment. Simulation results including the fundamental frequency, air flow rate, the divergent angle will be compared with the experimental data, providing the validation of the simulation approach. The relationship between flow separation, intra-glottal vortices, divergent angle and flow rate will be comprehensively analyzed.
Ultrasonic vocalization changes and FOXP2 expression after experimental stroke.
Doran, Sarah J; Trammel, Cassandra; Benashaski, Sharon E; Venna, Venugopal Reddy; McCullough, Louise D
2015-04-15
Speech impairments affect one in four stroke survivors. However, animal models of post-ischemic vocalization deficits are limited. Male mice vocalize at ultrasonic frequencies when exposed to an estrous female mouse. In this study we assessed vocalization patterns and quantity in male mice after cerebral ischemia. FOXP2, a gene associated with verbal dyspraxia in humans, with known roles in neurogenesis and synaptic plasticity, was also examined after injury. Using a transient middle cerebral artery occlusion (MCAO) model, we assessed correlates of vocal impairment at several time-points after stroke. Further, to identify possible lateralization of vocalization deficits induced by left and right hemispheric strokes were compared. Significant differences in vocalization quantity were observed between stroke and sham animals that persisted for a month after injury. Injury to the left hemisphere reduced early vocalizations more profoundly than those to the right hemisphere. Nuclear expression of Foxp2 was elevated early after stroke (at 6h), but significantly decreased 24h after injury in both the nucleus and the cytoplasm. Neuronal Foxp2 expression increased in stroke mice compared to sham animals 4 weeks after injury. This study demonstrates that quantifiable deficits in ultrasonic vocalizations (USVs) are seen after stroke. USV may be a useful tool to assess chronic behavioral recovery in murine models of stroke. Copyright © 2015 Elsevier B.V. All rights reserved.
Vocal Generalization Depends on Gesture Identity and Sequence
Sober, Samuel J.
2014-01-01
Generalization, the brain's ability to transfer motor learning from one context to another, occurs in a wide range of complex behaviors. However, the rules of generalization in vocal behavior are poorly understood, and it is unknown how vocal learning generalizes across an animal's entire repertoire of natural vocalizations and sequences. Here, we asked whether generalization occurs in a nonhuman vocal learner and quantified its properties. We hypothesized that adaptive error correction of a vocal gesture produced in one sequence would generalize to the same gesture produced in other sequences. To test our hypothesis, we manipulated the fundamental frequency (pitch) of auditory feedback in Bengalese finches (Lonchura striata var. domestica) to create sensory errors during vocal gestures (song syllables) produced in particular sequences. As hypothesized, error-corrective learning on pitch-shifted vocal gestures generalized to the same gestures produced in other sequential contexts. Surprisingly, generalization magnitude depended strongly on sequential distance from the pitch-shifted syllables, with greater adaptation for gestures produced near to the pitch-shifted syllable. A further unexpected result was that nonshifted syllables changed their pitch in the direction opposite from the shifted syllables. This apparently antiadaptive pattern of generalization could not be explained by correlations between generalization and the acoustic similarity to the pitch-shifted syllable. These findings therefore suggest that generalization depends on the type of vocal gesture and its sequential context relative to other gestures and may reflect an advantageous strategy for vocal learning and maintenance. PMID:24741046
Comparison of voice-use profiles between elementary classroom and music teachers.
Morrow, Sharon L; Connor, Nadine P
2011-05-01
Among teachers, music teachers are roughly four times more likely than classroom teachers to develop voice-related problems. Although it has been established that music teachers use their voices at high intensities and durations in the course of their workday, voice-use profiles concerning the amount and intensity of vocal use and vocal load have neither been quantified nor has vocal load for music teachers been compared with classroom teachers using these same voice-use parameters. In this study, total phonation time, fundamental frequency (F₀), and vocal intensity (dB SPL [sound pressure level]) were measured or estimated directly using a KayPENTAX Ambulatory Phonation Monitor (KayPENTAX, Lincoln Park, NJ). Vocal load was calculated as cycle and distance dose, as defined by Švec et al (2003), which integrates total phonation time, F₀, and vocal intensity. Twelve participants (n = 7 elementary music teachers and n = 5 elementary classroom teachers) were monitored during five full teaching days of one workweek to determine average vocal load for these two groups of teachers. Statistically significant differences in all measures were found between the two groups (P < 0.05) with large effect sizes for all parameters. These results suggest that typical vocal loads for music teachers are substantially higher than those experienced by classroom teachers (P < 0.01). This study suggests that reducing vocal load may have immediate clinical and educational benefits in vocal health in music teachers. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Echolocating bats rely on audiovocal feedback to adapt sonar signal design.
Luo, Jinhong; Moss, Cynthia F
2017-10-10
Many species of bat emit acoustic signals and use information carried by echoes reflecting from nearby objects to navigate and forage. It is widely documented that echolocating bats adjust the features of sonar calls in response to echo feedback; however, it remains unknown whether audiovocal feedback contributes to sonar call design. Audiovocal feedback refers to the monitoring of one's own vocalizations during call production and has been intensively studied in nonecholocating animals. Audiovocal feedback not only is a necessary component of vocal learning but also guides the control of the spectro-temporal structure of vocalizations. Here, we show that audiovocal feedback is directly involved in the echolocating bat's control of sonar call features. As big brown bats tracked targets from a stationary position, we played acoustic jamming signals, simulating calls of another bat, timed to selectively perturb audiovocal feedback or echo feedback. We found that the bats exhibited the largest call-frequency adjustments when the jamming signals occurred during vocal production. By contrast, bats did not show sonar call-frequency adjustments when the jamming signals coincided with the arrival of target echoes. Furthermore, bats rapidly adapted sonar call design in the first vocalization following the jamming signal, revealing a response latency in the range of 66 to 94 ms. Thus, bats, like songbirds and humans, rely on audiovocal feedback to structure sonar signal design.
Van Stan, Jarrad H.; Mehta, Daryush D.; Zeitels, Steven M.; Burns, James A.; Barbu, Anca M.; Hillman, Robert E.
2015-01-01
Objectives Clinical management of phonotraumatic vocal fold lesions (nodules, polyps) is based largely on assumptions that abnormalities in habitual levels of sound pressure level (SPL), fundamental frequency (f0), and/or amount of voice use play a major role in lesion development and chronic persistence. This study used ambulatory voice monitoring to evaluate if significant differences in voice use exist between patients with phonotraumatic lesions and normal matched controls. Methods Subjects were 70 adult females: 35 with vocal fold nodules or polyps and 35 age-, sex-, and occupation-matched normal individuals. Weeklong summary statistics of voice use were computed from anterior neck surface acceleration recorded using a smartphone-based ambulatory voice monitor. Results Paired t-tests and Kolmogorov-Smirnov tests resulted in no statistically significant differences between patients and matched controls regarding average measures of SPL, f0, vocal dose measures, and voicing/voice rest periods. Paired t-tests comparing f0 variability between the groups resulted in statistically significant differences with moderate effect sizes. Conclusions Individuals with phonotraumatic lesions did not exhibit differences in average ambulatory measures of vocal behavior when compared with matched controls. More refined characterizations of underlying phonatory mechanisms and other potentially contributing causes are warranted to better understand risk factors associated with phonotraumatic lesions. PMID:26024911
Effects of Asymmetric Superior Laryngeal Nerve Stimulation on Glottic Posture, Acoustics, Vibration
Chhetri, Dinesh K.; Neubauer, Juergen; Bergeron, Jennifer L.; Sofer, Elazar; Peng, Kevin A.; Jamal, Nausheen
2013-01-01
Objectives Evaluate the effects of asymmetric superior laryngeal nerve stimulation on the vibratory phase, laryngeal posture, and acoustics. Study Design Basic science study using an in vivo canine model. Methods The superior laryngeal nerves were symmetrically and asymmetrically stimulated over eight activation levels to mimic laryngeal asymmetries representing various levels of superior laryngeal nerve paresis and paralysis conditions. Glottal posture change, vocal fold speed, and vibration of these 64 distinct laryngeal activation conditions were evaluated by high speed video and concurrent acoustic and aerodynamic recordings. Assessments were made at phonation onset. Results Vibratory phase was symmetric in all symmetric activation conditions but consistent phase asymmetry towards the vocal fold with higher superior laryngeal nerve activation was observed. Superior laryngeal nerve paresis and paralysis conditions had reduced vocal fold strain and fundamental frequency. Superior laryngeal nerve activation increased vocal fold closure speed, but this effect was more pronounced for the ipsilateral vocal fold. Increasing asymmetry led to aperiodic and chaotic vibration. Conclusions This study directly links vocal fold tension asymmetry with vibratory phase asymmetry; in particular the side with greater tension leads in the opening phase. The clinical observations of vocal fold lag, reduced vocal range, and aperiodic voice in superior laryngeal paresis and paralysis is also supported. PMID:23712542
High-precision spatial localization of mouse vocalizations during social interaction.
Heckman, Jesse J; Proville, Rémi; Heckman, Gert J; Azarfar, Alireza; Celikel, Tansu; Englitz, Bernhard
2017-06-07
Mice display a wide repertoire of vocalizations that varies with age, sex, and context. Especially during courtship, mice emit ultrasonic vocalizations (USVs) of high complexity, whose detailed structure is poorly understood. As animals of both sexes vocalize, the study of social vocalizations requires attributing single USVs to individuals. The state-of-the-art in sound localization for USVs allows spatial localization at centimeter resolution, however, animals interact at closer ranges, involving tactile, snout-snout exploration. Hence, improved algorithms are required to reliably assign USVs. We develop multiple solutions to USV localization, and derive an analytical solution for arbitrary vertical microphone positions. The algorithms are compared on wideband acoustic noise and single mouse vocalizations, and applied to social interactions with optically tracked mouse positions. A novel, (frequency) envelope weighted generalised cross-correlation outperforms classical cross-correlation techniques. It achieves a median error of ~1.4 mm for noise and ~4-8.5 mm for vocalizations. Using this algorithms in combination with a level criterion, we can improve the assignment for interacting mice. We report significant differences in mean USV properties between CBA mice of different sexes during social interaction. Hence, the improved USV attribution to individuals lays the basis for a deeper understanding of social vocalizations, in particular sequences of USVs.
Van Lierde, Kristiane M; De Bodt, Marc; Dhaeseleer, Evelien; Wuyts, Floris; Claeys, Sofie
2010-05-01
The purpose of the present study is to measure the effectiveness of two treatment techniques--vocalization with abdominal breath support and manual circumlaryngeal therapy (MCT)--in patients with muscle tension dysphonia (MTD). The vocal quality before and after the two treatment techniques was measured by means of the dysphonia severity index (DSI), which is designed to establish an objective and quantitative correlate of the perceived vocal quality. The DSI is based on the weighted combination of the following set of voice measurements: maximum phonation time (MPT), highest frequency, lowest intensity, and jitter. The repeated-measures analysis of variance (ANOVA) revealed a significant difference between the objective overall vocal quality before and after MCT. No significant differences were measured between the objective overall vocal quality before and after vocalization with abdominal breath support. This study showed evidence that MCT is an effective treatment technique for patients with elevated laryngeal position, increased laryngeal muscle tension, and MTD. The precise way in which MCT has an effect on vocal quality has not been addressed in this experiment, but merits study. Further research into this topic could focus on electromyography (EMG) recordings in relation to vocal improvements with larger sample of subjects. (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Picciulin, Marta; Sebastianutto, Linda; Codarin, Antonio; Calcagno, Giuliana; Ferrero, Enrico A
2012-11-01
This study investigated whether or not boat noise causes variations in brown meagre (Sciaena umbra) vocalizations recorded in a nearshore Mediterranean marine reserve. Six nocturnal experimental sessions were carried out from June to September 2009. In each of them, a recreational boat passed over vocalizing fish 6 times with 1 boat passage every 10 min. For this purpose three different boats were used in random order: an 8.5-m cabin-cruiser (CC), a 5-m fiberglass boat (FB), and a 7-m inflatable boat (INF). In situ continuous acoustic recordings were collected using a self-standing sonobuoy. Because boat noise levels largely exceeded both background noise and S. umbra vocalizations in the species' hearing frequency range, masking of acoustic communication was assumed. Although no immediate effect was observed during a single boat passage, the S. umbra mean pulse rate increased over multiple boat passages in the experimental condition but not in the control condition, excluding that the observed effect was due to a natural rise in fish vocalizations. The observed vocal enhancement may result either from an increased density of callers or from an increased number of pulses/sounds produced by already acoustically active individuals, as a form of vocal compensation. These two explanations are discussed.
Vocal Fry Use in Adult Female Speakers Exposed to Two Languages.
Gibson, Todd A; Summers, Connie; Walls, Sydney
2017-07-01
Several studies have identified the widespread use of vocal fry among American women. Popular explanations for this phenomenon appeal to sociolinguistic purposes that likely take significant time for second language users to learn. The objective of this study was to determine if mere exposure to this vocal register, as opposed to nuanced sociolinguistic motivations, might explain its widespread use. This study used multigroup within- and between-subjects design. Fifty-eight women from one of three language background groups (functionally monolingual in English, functionally monolingual in Spanish, and Spanish-English bilinguals) living in El Paso, Texas, repeated a list of nonwords conforming to the sound rules of English and another list of nonwords conforming to the sound rules of Spanish. Perceptual analysis identified each episode of vocal fry. There were no statistically significant differences between groups in their frequency of vocal fry use despite large differences in their amount of English-language exposure. All groups produced more vocal fry when repeating English than when repeating Spanish nonwords. Because the human perceptual system encodes for vocal qualities even after minimal language experience, the widespread use of vocal fry among female residents in the United States likely is owing to mere exposure to English rather than nuanced sociolinguistic motivations. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vocal Accuracy and Neural Plasticity Following Micromelody-Discrimination Training
Zarate, Jean Mary; Delhommeau, Karine; Wood, Sean; Zatorre, Robert J.
2010-01-01
Background Recent behavioral studies report correlational evidence to suggest that non-musicians with good pitch discrimination sing more accurately than those with poorer auditory skills. However, other studies have reported a dissociation between perceptual and vocal production skills. In order to elucidate the relationship between auditory discrimination skills and vocal accuracy, we administered an auditory-discrimination training paradigm to a group of non-musicians to determine whether training-enhanced auditory discrimination would specifically result in improved vocal accuracy. Methodology/Principal Findings We utilized micromelodies (i.e., melodies with seven different interval scales, each smaller than a semitone) as the main stimuli for auditory discrimination training and testing, and we used single-note and melodic singing tasks to assess vocal accuracy in two groups of non-musicians (experimental and control). To determine if any training-induced improvements in vocal accuracy would be accompanied by related modulations in cortical activity during singing, the experimental group of non-musicians also performed the singing tasks while undergoing functional magnetic resonance imaging (fMRI). Following training, the experimental group exhibited significant enhancements in micromelody discrimination compared to controls. However, we did not observe a correlated improvement in vocal accuracy during single-note or melodic singing, nor did we detect any training-induced changes in activity within brain regions associated with singing. Conclusions/Significance Given the observations from our auditory training regimen, we therefore conclude that perceptual discrimination training alone is not sufficient to improve vocal accuracy in non-musicians, supporting the suggested dissociation between auditory perception and vocal production. PMID:20567521
A laryngographic and laryngoscopic study of Northern Vietnamese tones.
Brunelle, Marc; Nguyên, Duy Duong; Nguyên, Khac Hùng
2010-01-01
A laryngographic and laryngoscopic study of tone production in Northern Vietnamese, a language whose tones combine both fundamental frequency (f0) modulations and voice qualities (phonation types), was conducted with 5 male and 5 female speakers. Results show that the f0 contours of Northern Vietnamese tones are not only attributable to changes in vocal fold length and tension (partly through changes in larynx height), but that f0 drops are also largely caused by the glottal configurations responsible for the contrastive voice qualities associated with some of the tones. We also find that voice quality contrasts are mostly due to glottal constriction: they occasionally involve additional ventricular fold incursion and epiglottal constriction, but these articulations are usually absent. Copyright © 2010 S. Karger AG, Basel.
The Effectiveness of Low-Level Light Therapy in Attenuating Vocal Fatigue.
Kagan, Loraine Sydney; Heaton, James T
2017-05-01
Low-level light therapy (LLLT) is effective in reducing inflammation, promoting wound healing, and preventing tissue damage, but has not yet been studied in the treatment of voice disorders. The objective of this study was to investigate the possible effectiveness of LLLT in attenuating symptoms of vocal fatigue created by a vocal loading task as measured by acoustic, aerodynamic, and self-reported vocal effort. In a randomized, prospective study, 16 vocally healthy adults divided into four groups underwent a 1-hour vocal loading procedure, followed by infrared wavelength LLLT (828 nm), red wavelength LLLT (628 nm), heat, or no heat-light (control) treatment targeting the laryngeal region of the ventral neck surface. Phonation threshold pressure (PTP), relative fundamental frequency (RFF), and the inability to produce soft voice (IPSV) self-perceptual rating scale were recorded (1) at baseline, (2) immediately after vocal loading, (3) after treatment, and (4) 1 hour after treatment. Vocal loading significantly increased PTP and IPSV and decreased onset and offset RFFs, consistent with a shift toward vocal dysfunction. Red light significantly normalized the combination of PTP, IPSV, and RFF measures compared to other conditions. RFF is sensitive to a vocal loading task in conjunction with PTP and IPSV, and red LLLT may have a normalizing effect on objective and subjective measures of vocal fatigue. The results of this study lay the groundwork and rationale for future research to optimize LLLT wavelength combinations and overall dose. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Acoustic alterations of ultrasonic vocalization in rat pups induced by perinatal hypothyroidism.
Wada, Hiromi
2017-03-01
Perinatal hypothyroidism causes serious damage to auditory functions that are essential for vocalization development. In rat pups, perinatal hypothyroidism potentially affects the development of ultrasonic vocalization (USV) as a result of hearing deficits. This study examined the effect of perinatal hypothyroidism on the development of USVs in rat pups. Twelve pregnant rats were divided into three groups and treated with the anti-thyroid drug methimazole (MMI) via drinking water, from gestational day 15 to postnatal day (PND) 21. The MMI concentration (w/v) was 0% (control group), 0.01% (low-dose group), or 0.015% (high-dose group). After birth, the pups were individually separated from the dam and littermates on PNDs 5, 10, 15, and 20, and their USVs were recorded for 5min. On PNDs 5 and 10, compared with the control group, the low- and high-dose groups exhibited reductions of both frequency-modulated and downward USVs. On PND 15, however, the low- and high-dose groups displayed increases in number, duration, and amplitude of USVs compared with those in the control group. Lower body weights were observed for the low- and high-dose groups than for the control group. Total thyroxine concentrations in plasma were dose-dependently reduced. The onset of auditory functions appeared on PNDs 11-14. Thus, the rat pups were unable to hear externally produced USVs before PND 11. USVs emitted on PNDs 5 and 10 might have been spontaneous and independent of the pups' own or littermate-emitted USVs. The developmental retardation of vocalization-related organs or muscles might underlie the acoustic alterations of USVs on PNDs 5 and 10. The greater number, duration, and amplitude of USVs on PND 15, after which the hearing onset occurred, suggested that the elevation of auditory thresholds occurred as a result of hearing deficits in the low- and high-dose groups. Perinatal hypothyroidism appears to have caused acoustic alterations in the USV development. Copyright © 2016 Elsevier B.V. All rights reserved.
Cohen, Alex S; Dinzeo, Thomas J; Donovan, Neila J; Brown, Caitlin E; Morrison, Sean C
2015-03-30
Vocal expression reflects an integral component of communication that varies considerably within individuals across contexts and is disrupted in a range of neurological and psychiatric disorders. There is reason to suspect that variability in vocal expression reflects, in part, the availability of "on-line" resources (e.g., working memory, attention). Thus, understanding vocal expression is a potentially important biometric index of information processing, not only across but within individuals over time. A first step in this line of research involves establishing a link between vocal expression and information processing systems in healthy adults. The present study employed a dual attention experimental task where participants provided natural speech while simultaneously engaged in a baseline, medium or high nonverbal processing-load task. Objective, automated, and computerized analysis was employed to measure vocal expression in 226 adults. Increased processing load resulted in longer pauses, fewer utterances, greater silence overall and less variability in frequency and intensity levels. These results provide compelling evidence of a link between information processing resources and vocal expression, and provide important information for the development of an automated, inexpensive and uninvasive biometric measure of information processing. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Vocal tract resonances in speech, singing, and playing musical instruments
Wolfe, Joe; Garnier, Maëva; Smith, John
2009-01-01
In both the voice and musical wind instruments, a valve (vocal folds, lips, or reed) lies between an upstream and downstream duct: trachea and vocal tract for the voice; vocal tract and bore for the instrument. Examining the structural similarities and functional differences gives insight into their operation and the duct-valve interactions. In speech and singing, vocal tract resonances usually determine the spectral envelope and usually have a smaller influence on the operating frequency. The resonances are important not only for the phonemic information they produce, but also because of their contribution to voice timbre, loudness, and efficiency. The role of the tract resonances is usually different in brass and some woodwind instruments, where they modify and to some extent compete or collaborate with resonances of the instrument to control the vibration of a reed or the player’s lips, and∕or the spectrum of air flow into the instrument. We give a brief overview of oscillator mechanisms and vocal tract acoustics. We discuss recent and current research on how the acoustical resonances of the vocal tract are involved in singing and the playing of musical wind instruments. Finally, we compare techniques used in determining tract resonances and suggest some future developments. PMID:19649157
Effects of background noise on acoustic characteristics of Bengalese finch songs.
Shiba, Shintaro; Okanoya, Kazuo; Tachibana, Ryosuke O
2016-12-01
Online regulation of vocalization in response to auditory feedback is one of the essential issues for vocal communication. One such audio-vocal interaction is the Lombard effect, an involuntary increase in vocal amplitude in response to the presence of background noise. Along with vocal amplitude, other acoustic characteristics, including fundamental frequency (F0), also change in some species. Bengalese finches (Lonchura striata var. domestica) are a suitable model for comparative, ethological, and neuroscientific studies on audio-vocal interaction because they require real-time auditory feedback of their own songs to maintain normal singing. Here, the changes in amplitude and F0 with a focus on the distinct song elements (i.e., notes) of Bengalese finches under noise presentation are demonstrated. To accurately analyze these acoustic characteristics, two different bandpass-filtered noises at two levels of sound intensity were used. The results confirmed that the Lombard effect occurs at the note level of Bengalese finch song. Further, individually specific modes of changes in F0 are shown. These behavioral changes suggested the vocal control mechanisms on which the auditory feedback is based have a predictable effect on amplitude, but complex spectral effects on individual note production.
Vocal tract resonances in speech, singing, and playing musical instruments.
Wolfe, Joe; Garnier, Maëva; Smith, John
2009-01-01
IN BOTH THE VOICE AND MUSICAL WIND INSTRUMENTS, A VALVE (VOCAL FOLDS, LIPS, OR REED) LIES BETWEEN AN UPSTREAM AND DOWNSTREAM DUCT: trachea and vocal tract for the voice; vocal tract and bore for the instrument. Examining the structural similarities and functional differences gives insight into their operation and the duct-valve interactions. In speech and singing, vocal tract resonances usually determine the spectral envelope and usually have a smaller influence on the operating frequency. The resonances are important not only for the phonemic information they produce, but also because of their contribution to voice timbre, loudness, and efficiency. The role of the tract resonances is usually different in brass and some woodwind instruments, where they modify and to some extent compete or collaborate with resonances of the instrument to control the vibration of a reed or the player's lips, andor the spectrum of air flow into the instrument. We give a brief overview of oscillator mechanisms and vocal tract acoustics. We discuss recent and current research on how the acoustical resonances of the vocal tract are involved in singing and the playing of musical wind instruments. Finally, we compare techniques used in determining tract resonances and suggest some future developments.
Critical ratios of beluga whales (Delphinapterus leucas) and masked signal duration.
Erbe, Christine
2008-10-01
This article examines the masking of a complex beluga vocalization by natural and anthropogenic noise. The call consisted of six 150 ms pulses exhibiting spectral peaks between 800 Hz and 8 kHz. Comparing the spectra and spectrograms of the call and noises at detection threshold showed that the animal did not hear the entire call at threshold. It only heard parts of the call in frequency and time. From the masked hearing thresholds in broadband continuous noises, critical ratios were computed. Fletcher critical bands were narrower than either 15 or 111 of an octave at the low frequencies of the call (<2 kHz), depending on which frequency the animal cued on. From the masked hearing thresholds in intermittent noises, the audible signal duration at detection threshold was computed. The intermittent noises differed in gap length, gap number, and masking, but the total audible signal duration at threshold was the same: 660 ms. This observation supports a multiple-looks model. The two amplitude modulated noises exhibited weaker masking than the unmodulated noises hinting at a comodulation masking release.
Acoustic signals of baby black caimans.
Vergne, Amélie L; Aubin, Thierry; Taylor, Peter; Mathevon, Nicolas
2011-12-01
In spite of the importance of crocodilian vocalizations for the understanding of the evolution of sound communication in Archosauria and due to the small number of experimental investigations, information concerning the vocal world of crocodilians is limited. By studying black caimans Melanosuchus niger in their natural habitat, here we supply the experimental evidence that juvenile crocodilians can use a graded sound system in order to elicit adapted behavioral responses from their mother and siblings. By analyzing the acoustic structure of calls emitted in two different situations ('undisturbed context', during which spontaneous calls of juvenile caimans were recorded without perturbing the group, and a simulated 'predator attack', during which calls were recorded while shaking juveniles) and by testing their biological relevance through playback experiments, we reveal the existence of two functionally different types of juvenile calls that produce a different response from the mother and other siblings. Young black caimans can thus modulate the structure of their vocalizations along an acoustic continuum as a function of the emission context. Playback experiments show that both mother and juveniles discriminate between these 'distress' and 'contact' calls. Acoustic communication is thus an important component mediating relationships within family groups in caimans as it is in birds, their archosaurian relatives. Although probably limited, the vocal repertoire of young crocodilians is capable of transmitting the information necessary for allowing siblings and mother to modulate their behavior. Copyright © 2011 Elsevier GmbH. All rights reserved.
McGettigan, Carolyn; Eisner, Frank; Agnew, Zarinah K; Manly, Tom; Wisbey, Duncan; Scott, Sophie K
2014-01-01
Historically, the study of human identity perception has focused on faces, but the voice is also central to our expressions and experiences of identity (P. Belin, Fecteau, & Bedard, 2004). Our voices are highly flexible and dynamic; talkers speak differently depending on their health, emotional state, and the social setting, as well as extrinsic factors such as background noise. However, to date, there have been no studies of the neural correlates of identity modulation in speech production. In the current fMRI experiment, we measured the neural activity supporting controlled voice change in adult participants performing spoken impressions. We reveal that deliberate modulation of vocal identity recruits the left anterior insula and inferior frontal gyrus, supporting the planning of novel articulations. Bilateral sites in posterior superior temporal/inferior parietal cortex and a region in right mid/anterior superior temporal sulcus showed greater responses during the emulation of specific vocal identities than for impressions of generic accents. Using functional connectivity analyses, we describe roles for these three sites in their interactions with the brain regions supporting speech planning and production. Our findings mark a significant step toward understanding the neural control of vocal identity, with wider implications for the cognitive control of voluntary motor acts. PMID:23691984
The Effect of Syllable Repetition Rate on Vocal Characteristics
ERIC Educational Resources Information Center
Topbas, Oya; Orlikoff, Robert F.; St. Louis, Kenneth O.
2012-01-01
This study examined whether mean vocal fundamental frequency ("F"[subscript 0]) or speech sound pressure level (SPL) varies with changes in syllable repetition rate. Twenty-four young adults (12 M and 12 F) repeated the syllables/p[inverted v]/,/p[inverted v]t[schwa]/, and/p[inverted v]t[schwa]k[schwa]/at a modeled "slow" rate of approximately one…
The acoustic features of human laughter
NASA Astrophysics Data System (ADS)
Bachorowski, Jo-Anne; Owren, Michael J.
2002-05-01
Remarkably little is known about the acoustic features of laughter, despite laughter's ubiquitous role in human vocal communication. Outcomes are described for 1024 naturally produced laugh bouts recorded from 97 young adults. Acoustic analysis focused on temporal characteristics, production modes, source- and filter-related effects, and indexical cues to laugher sex and individual identity. The results indicate that laughter is a remarkably complex vocal signal, with evident diversity in both production modes and fundamental frequency characteristics. Also of interest was finding a consistent lack of articulation effects in supralaryngeal filtering. Outcomes are compared to previously advanced hypotheses and conjectures about this species-typical vocal signal.
Sousa-Lima, Renata S
2006-06-01
This letter concerns the paper "Intraspecific and geographic variation of West Indian manatee (Trichechus manatus spp.) vocalizations" [Nowacek et al., J. Acoust. Soc. Am. 114, 66-69 (2003)]. The purpose here is to correct the fundamental frequency range and information on intraindividual variation in the vocalizations of Amazonian manatees reported by Nowacek et al. (2003) in citing the paper "Signature information and individual recognition in the isolation calls of Amazonian manatees, Trichechus inunguis (Mammalia: Sirenia)" [Sousa-Lima et al., Anim. Behav. 63, 301-310 (2002)].
Wenstrup, J J
1999-11-01
The auditory cortex of the mustached bat (Pteronotus parnellii) displays some of the most highly developed physiological and organizational features described in mammalian auditory cortex. This study examines response properties and organization in the medial geniculate body (MGB) that may contribute to these features of auditory cortex. About 25% of 427 auditory responses had simple frequency tuning with single excitatory tuning curves. The remainder displayed more complex frequency tuning using two-tone or noise stimuli. Most of these were combination-sensitive, responsive to combinations of different frequency bands within sonar or social vocalizations. They included FM-FM neurons, responsive to different harmonic elements of the frequency modulated (FM) sweep in the sonar signal, and H1-CF neurons, responsive to combinations of the bat's first sonar harmonic (H1) and a higher harmonic of the constant frequency (CF) sonar signal. Most combination-sensitive neurons (86%) showed facilitatory interactions. Neurons tuned to frequencies outside the biosonar range also displayed combination-sensitive responses, perhaps related to analyses of social vocalizations. Complex spectral responses were distributed throughout dorsal and ventral divisions of the MGB, forming a major feature of this bat's analysis of complex sounds. The auditory sector of the thalamic reticular nucleus also was dominated by complex spectral responses to sounds. The ventral division was organized tonotopically, based on best frequencies of singly tuned neurons and higher best frequencies of combination-sensitive neurons. Best frequencies were lowest ventrolaterally, increasing dorsally and then ventromedially. However, representations of frequencies associated with higher harmonics of the FM sonar signal were reduced greatly. Frequency organization in the dorsal division was not tonotopic; within the middle one-third of MGB, combination-sensitive responses to second and third harmonic CF sonar signals (60-63 and 90-94 kHz) occurred in adjacent regions. In the rostral one-third, combination-sensitive responses to second, third, and fourth harmonic FM frequency bands predominated. These FM-FM neurons, thought to be selective for delay between an emitted pulse and echo, showed some organization of delay selectivity. The organization of frequency sensitivity in the MGB suggests a major rewiring of the output of the central nucleus of the inferior colliculus, by which collicular neurons tuned to the bat's FM sonar signals mostly project to the dorsal, not the ventral, division. Because physiological differences between collicular and MGB neurons are minor, a major role of the tecto-thalamic projection in the mustached bat may be the reorganization of responses to provide for cortical representations of sonar target features.
Deguchi, Shinji; Kawashima, Kazutaka; Washio, Seiichi
2008-12-01
The effect of artificially altered transglottal pressures on the voice fundamental frequency (F0) is known to be associated with vocal fold stiffness. Its measurement, though useful as a potential diagnostic tool for noncontact assessment of vocal fold stiffness, often requires manual and painstaking determination of an unstable F0 of voice. Here, we provide a computer-aided technique that enables one to carry out the determination easily and accurately. Human subjects vocalized in accordance with a series of reference sounds from a speaker controlled by a computer. Transglottal pressures were altered by means of a valve embedded in a mouthpiece. Time-varying vocal F0 was extracted, without manual procedures, from a specific range of the voice spectrum determined on the basis of the controlled reference sounds. The validity of the proposed technique was assessed for 11 healthy subjects. Fluctuating voice F0 was tracked automatically during experiments, providing the relationship between transglottal pressure change and F0 on the computer. The proposed technique overcomes the difficulty in automatic determination of the voice F0, which tends to be transient both in normal voice and in some types of pathological voice.
Acoustic characteristics of phonation in "wet voice" conditions.
Murugappan, Shanmugam; Boyce, Suzanne; Khosla, Sid; Kelchner, Lisa; Gutmark, Ephraim
2010-04-01
A perceptible change in phonation characteristics after a swallow has long been considered evidence that food and/or drink material has entered the laryngeal vestibule and is on the surface of the vocal folds as they vibrate. The current paper investigates the acoustic characteristics of phonation when liquid material is present on the vocal folds, using ex vivo porcine larynges as a model. Consistent with instrumental examinations of swallowing disorders or dysphagia in humans, three liquids of different Varibar viscosity ("thin liquid," "nectar," and "honey") were studied at constant volume. The presence of materials on the folds during phonation was generally found to suppress the higher frequency harmonics and generate intermittent additional frequencies in the low and high end of the acoustic spectrum. Perturbation measures showed a higher percentage of jitter and shimmer when liquid material was present on the folds during phonation, but they were unable to differentiate statistically between the three fluid conditions. The finite correlation dimension and positive Lyapunov exponent measures indicated that the presence of materials on the vocal folds excited a chaotic system. Further, these measures were able to reliably differentiate between the baseline and different types of liquid on the vocal folds.
Acoustic characteristics of phonation in “wet voice” conditions
Murugappan, Shanmugam; Boyce, Suzanne; Khosla, Sid; Kelchner, Lisa; Gutmark, Ephraim
2010-01-01
A perceptible change in phonation characteristics after a swallow has long been considered evidence that food and∕or drink material has entered the laryngeal vestibule and is on the surface of the vocal folds as they vibrate. The current paper investigates the acoustic characteristics of phonation when liquid material is present on the vocal folds, using ex vivo porcine larynges as a model. Consistent with instrumental examinations of swallowing disorders or dysphagia in humans, three liquids of different Varibar viscosity (“thin liquid,” “nectar,” and “honey”) were studied at constant volume. The presence of materials on the folds during phonation was generally found to suppress the higher frequency harmonics and generate intermittent additional frequencies in the low and high end of the acoustic spectrum. Perturbation measures showed a higher percentage of jitter and shimmer when liquid material was present on the folds during phonation, but they were unable to differentiate statistically between the three fluid conditions. The finite correlation dimension and positive Lyapunov exponent measures indicated that the presence of materials on the vocal folds excited a chaotic system. Further, these measures were able to reliably differentiate between the baseline and different types of liquid on the vocal folds. PMID:20370039
Automated extraction and classification of time-frequency contours in humpback vocalizations.
Ou, Hui; Au, Whitlow W L; Zurk, Lisa M; Lammers, Marc O
2013-01-01
A time-frequency contour extraction and classification algorithm was created to analyze humpback whale vocalizations. The algorithm automatically extracted contours of whale vocalization units by searching for gray-level discontinuities in the spectrogram images. The unit-to-unit similarity was quantified by cross-correlating the contour lines. A library of distinctive humpback units was then generated by applying an unsupervised, cluster-based learning algorithm. The purpose of this study was to provide a fast and automated feature selection tool to describe the vocal signatures of animal groups. This approach could benefit a variety of applications such as species description, identification, and evolution of song structures. The algorithm was tested on humpback whale song data recorded at various locations in Hawaii from 2002 to 2003. Results presented in this paper showed low probability of false alarm (0%-4%) under noisy environments with small boat vessels and snapping shrimp. The classification algorithm was tested on a controlled set of 30 units forming six unit types, and all the units were correctly classified. In a case study on humpback data collected in the Auau Chanel, Hawaii, in 2002, the algorithm extracted 951 units, which were classified into 12 distinctive types.
Phonation Threshold Pressure Measurement With a Semi-Occluded Vocal Tract
Titze, Ingo R.
2015-01-01
Purpose The purpose of this article was to determine if a semi-occluded vocal tract could be used to measure phonation threshold pressure. This is in contrast to the shutter technique, where an alternation between a fully occluded tract and an unoccluded tract is used. Method Five male and 5 female volunteers phonated through a thin straw held between the lips. Oral pressure behind the lips was measured. Mathematical predictions of phonation threshold pressures were compared to the measured ones over a range of frequencies. Results It was shown that, for a 2.5-mm diameter straw, phonation threshold pressures were obtainable over a 2-octave range of fundamental frequency by all volunteers. In magnitude, the pressures agreed with the 0.2–0.5 kPa values obtained in previous investigations. Sensitivity to viscoelastic and geometric properties of the vocal folds was generally not compromised with greater oral impedance, but some differences were predicted theoretically in contrast to an open mouth configuration. Conclusion Because phonation threshold pressure is always dependent on vocal tract interaction, it may be advantageous to choose an exact and fixed oral semi-occlusion for the measurement and interpret the results in light of the known acoustic load. PMID:19641082
Information content and acoustic structure of male African elephant social rumbles
Stoeger, Angela S.; Baotic, Anton
2016-01-01
Until recently, the prevailing theory about male African elephants (Loxodonta africana) was that, once adult and sexually mature, males are solitary and targeted only at finding estrous females. While this is true during the state of ‘musth’ (a condition characterized by aggressive behavior and elevated androgen levels), ‘non-musth’ males exhibit a social system seemingly based on companionship, dominance and established hierarchies. Research on elephant vocal communication has so far focused on females, and very little is known about the acoustic structure and the information content of male vocalizations. Using the source and filter theory approach, we analyzed social rumbles of 10 male African elephants. Our results reveal that male rumbles encode information about individuality and maturity (age and size), with formant frequencies and absolute fundamental frequency values having the most informative power. This first comprehensive study on male elephant vocalizations gives important indications on their potential functional relevance for male-male and male-female communication. Our results suggest that, similar to the highly social females, future research on male elephant vocal behavior will reveal a complex communication system in which social knowledge, companionship, hierarchy, reproductive competition and the need to communicate over long distances play key roles. PMID:27273586
Individual killer whale vocal variation during intra-group behavioral dynamics
NASA Astrophysics Data System (ADS)
Grebner, Dawn M.
The scientific goal of this dissertation was to carefully study the signal structure of killer whale communications and vocal complexity and link them to behavioral circumstances. The overall objective of this research sought to provide insight into killer whale call content and usage which may be conveying information to conspecifics in order to maintain group cohesion. Data were collected in the summers of 2006 and 2007 in Johnstone Strait, British Columbia. For both individuals and small groups, vocalizations were isolated using a triangular hydrophone array and the behavioral movement patterns were captured by a theodolite and video camera positioned on a cliff overlooking the hyrophone locations. This dissertation is divided into four analysis chapters. In Chapter 3, discriminant analysis was used to validate the four N04 call subtypes which were originally parsed due to variations in slope segments. The first two functions of the discriminant analysis explained 97% of the variability. Most of the variability for the N04 call was found in the front convex and the terminal portions of the call, while very little variability was found in the center region of the call. This research revealed that individual killer whales produced multiple subtypes of the N04 call. No correlations of behaviors to acoustic parameters obtained were found. The aim of the Chapter 4 was to determine if killer whale calling behavior varied prior to and after the animals had joined. Pulsed call rates were found to be greater pre- compared to post-joining events. Two-way vocal exchanges were more common occurring 74% of the time during pre-joining events. In Chapter 5, initiated and first response to calls varied between age/sex class groups when mothers were separated from an offspring. Solo mothers and calves initiated pulsed calls more often than they responded. Most of the no vocal responses were due to mothers who were foraging. Finally, observations of the frequency split in N04 calls discussed in Chapter 6 showed that the higher frequency component (HFC) was always associated with sideband 7 (SB7) of the lower frequency component (LFC). Insight into Northern Resident killer whale intra-group vocal dynamics would aid our understanding of vocal behaviors of many other marine mammal species that rely on vocal exchanges for prey capture, group movement or survival. This is the first study to focus on killer whale vocal content and usage as it pertains to intra-group dynamics for (1) mother and offspring separations and (2) for all individuals prior to joining events, as well as (3) individual usage in a diverging pulsed call. It is also the first time the N04 call has been parsed into subtypes.
Fernández, B; Alberti, I; Kitchen, I; Paz Viveros, M
1999-01-29
To address the existence of possible functional interactions between delta- and mu- receptors in relation to the affective component of pain, we have studied the effects of functional blockade of delta-receptors by a chronic treatment with naltrindole (1 mg/kg, 8 consecutive days) on antinociceptive responses to morphine (2 and 5 mg/kg) in the tail electric stimulation test, in adult male rats. The thresholds for the motor response (tail withdrawal), vocalization during stimulus and vocalization afterdischarge were assessed. These responses are considered to be integrated at spinal, medulla oblongata and diencephalon-rhinencephalon levels, respectively. The results show that the vocalization during stimulus and the vocalization afterdischarge were significantly affected by morphine in a dose dependent manner, the latter response being the most sensitive to the effects of the mu-opioid agonist. However, no significant effect was observed on motor responses at the doses used in this study. Chronic naltrindole treatment did not modify the inhibitory effect of morphine on the vocalization responses. Since the vocalization afterdischarge is related to the affective component of pain, the data suggest that the delta-opioid receptor is not involved in the supraspinal mechanisms at which these responses are organized and that there is not a mu-delta interaction in the modulation of the affective responses to noxious electrical stimulation.
Modeling coupled aerodynamics and vocal fold dynamics using immersed boundary methods.
Duncan, Comer; Zhai, Guangnian; Scherer, Ronald
2006-11-01
The penalty immersed boundary (PIB) method, originally introduced by Peskin (1972) to model the function of the mammalian heart, is tested as a fluid-structure interaction model of the closely coupled dynamics of the vocal folds and aerodynamics in phonation. Two-dimensional vocal folds are simulated with material properties chosen to result in self-oscillation and volume flows in physiological frequency ranges. Properties of the glottal flow field, including vorticity, are studied in conjunction with the dynamic vocal fold motion. The results of using the PIB method to model self-oscillating vocal folds for the case of 8 cm H20 as the transglottal pressure gradient are described. The volume flow at 8 cm H20, the transglottal pressure, and vortex dynamics associated with the self-oscillating model are shown. Volume flow is also given for 2, 4, and 12 cm H2O, illustrating the robustness of the model to a range of transglottal pressures. The results indicate that the PIB method applied to modeling phonation has good potential for the study of the interdependence of aerodynamics and vocal fold motion.
Computational Modeling of Fluid–Structure–Acoustics Interaction during Voice Production
Jiang, Weili; Zheng, Xudong; Xue, Qian
2017-01-01
The paper presented a three-dimensional, first-principle based fluid–structure–acoustics interaction computer model of voice production, which employed a more realistic human laryngeal and vocal tract geometries. Self-sustained vibrations, important convergent–divergent vibration pattern of the vocal folds, and entrainment of the two dominant vibratory modes were captured. Voice quality-associated parameters including the frequency, open quotient, skewness quotient, and flow rate of the glottal flow waveform were found to be well within the normal physiological ranges. The analogy between the vocal tract and a quarter-wave resonator was demonstrated. The acoustic perturbed flux and pressure inside the glottis were found to be at the same order with their incompressible counterparts, suggesting strong source–filter interactions during voice production. Such high fidelity computational model will be useful for investigating a variety of pathological conditions that involve complex vibrations, such as vocal fold paralysis, vocal nodules, and vocal polyps. The model is also an important step toward a patient-specific surgical planning tool that can serve as a no-risk trial and error platform for different procedures, such as injection of biomaterials and thyroplastic medialization. PMID:28243588
The neural network classification of false killer whale (Pseudorca crassidens) vocalizations.
Murray, S O; Mercado, E; Roitblat, H L
1998-12-01
This study reports the use of unsupervised, self-organizing neural network to categorize the repertoire of false killer whale vocalizations. Self-organizing networks are capable of detecting patterns in their input and partitioning those patterns into categories without requiring that the number or types of categories be predefined. The inputs for the neural networks were two-dimensional characterization of false killer whale vocalization, where each vocalization was characterized by a sequence of short-time measurements of duty cycle and peak frequency. The first neural network used competitive learning, where units in a competitive layer distributed themselves to recognize frequently presented input vectors. This network resulted in classes representing typical patterns in the vocalizations. The second network was a Kohonen feature map which organized the outputs topologically, providing a graphical organization of pattern relationships. The networks performed well as measured by (1) the average correlation between the input vectors and the weight vectors for each category, and (2) the ability of the networks to classify novel vocalizations. The techniques used in this study could easily be applied to other species and facilitate the development of objective, comprehensive repertoire models.
Kunduk, Melda; Vansant, Mathew B; Ikuma, Takeshi; McWhorter, Andrew
2017-03-01
This study investigated the effect of menstrual cycle on vocal fold vibratory characteristics in young women using high-speed digital imaging. This study examined the menstrual phase effect on five objective high-speed imaging parameters and two self-rated perceptual parameters. The effects of oral birth control use were also investigated. Thirteen subjects with no prior voice complaints were included in this study. All data were collected at three different time periods (premenses, postmenses, ovulation) over the course of one menstrual cycle. For five of the 13 subjects, data were collected for two consecutive cycles. Six of 13 subjects were oral birth control users. From high-speed imaging data, five objective parameters were computed: fundamental frequency, fundamental frequency deviation, harmonics-to-noise ratio, harmonic richness factor, and ratio of first and second harmonics. They were supplemented by two self-rated parameters: Reflux Severity Index and perceptual voice quality rating. Analysis included mixed model linear analysis with repeated measures. Results indicated no significant main effects for menstrual phase, between-cycle, or birth control use in the analysis for mean fundamental frequency, fundamental frequency deviation, harmonics-to-noise ratio, harmonic richness factor, first and second harmonics, Reflux Severity Index, and perceptual voice quality rating. Additionally, there were no interaction effects. Hormone fluctuations observed across the menstrual cycle do not appear to have direct effect on vocal fold vibratory characteristics in young women with no voice concerns. Birth control use, on the other hand, may have influence on spectral richness of vocal fold vibration. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Pulse register phonation in Diana monkey alarm calls
NASA Astrophysics Data System (ADS)
Riede, Tobias; Zuberbühler, Klaus
2003-05-01
The adult male Diana monkeys (Cercopithecus diana) produce predator-specific alarm calls in response to two of their predators, the crowned eagles and the leopards. The acoustic structure of these alarm calls is remarkable for a number of theoretical and empirical reasons. First, although pulsed phonation has been described in a variety of mammalian vocalizations, very little is known about the underlying production mechanism. Second, Diana monkey alarm calls are based almost exclusively on this vocal production mechanism to an extent that has never been documented in mammalian vocal behavior. Finally, the Diana monkeys' pulsed phonation strongly resembles the pulse register in human speech, where fundamental frequency is mainly controlled by subglottal pressure. Here, we report the results of a detailed acoustic analysis to investigate the production mechanism of Diana monkey alarm calls. Within calls, we found a positive correlation between the fundamental frequency and the pulse amplitude, suggesting that both humans and monkeys control fundamental frequency by subglottal pressure. While in humans pulsed phonation is usually considered pathological or artificial, male Diana monkeys rely exclusively on pulsed phonation, suggesting a functional adaptation. Moreover, we were unable to document any nonlinear phenomena, despite the fact that they occur frequently in the vocal repertoire of humans and nonhumans, further suggesting that the very robust Diana monkey pulse production mechanism has evolved for a particular functional purpose. We discuss the implications of these findings for the structural evolution of Diana monkey alarm calls and suggest that the restricted variability in fundamental frequency and robustness of the source signal gave rise to the formant patterns observed in Diana monkey alarm calls, used to convey predator information.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chera, Bhishamjit S.; Amdur, Robert J., E-mail: amdurr@shands.ufl.ed; Morris, Christopher G.
2010-08-01
Purpose: To compare radiation doses to carotid arteries among various radiotherapy techniques for treatment of early-stage squamous cell carcinoma (SCC) of the true vocal cords. Methods and Materials: Five patients were simulated using computed tomography (CT). Clinical and planning target volumes (PTV) were created for bilateral and unilateral stage T1 vocal cord cancers. Planning risk volumes for the carotid arteries and spinal cord were delineated. For each patient, three treatment plans were designed for bilateral and unilateral target volumes: opposed laterals (LATS), three-dimensional conformal radiotherapy (3DCRT), and intensity-modulated radiotherapy (IMRT), for a total of 30 plans. More than 95% ofmore » the PTV received the prescription dose (63Gy at 2.25 Gy per treatment). Results: Carotid dose was lowest with IMRT. With a bilateral vocal cord target, the median carotid dose was 10Gy with IMRT vs. 25 Gy with 3DCRT and 38 Gy with LATS (p < 0.05); with a unilateral target, the median carotid dose was 4 Gy with IMRT vs. 19 Gy with 3DCRT and 39 Gy with LATS (p < 0.05). The dosimetric tradeoff with IMRT is a small area of high dose in the PTV. The worst heterogeneity results were at a maximum point dose of 80 Gy (127%) in a unilateral target that was close to the carotid. Conclusions: There is no question that IMRT can reduce the dose to the carotid arteries in patients with early-stage vocal cord cancer. The question is whether the potential advantage of reducing the carotid dose outweighs the risk of tumor recurrence due to contouring errors and organ motion and the risk of complications from dose heterogeneity.« less
Differential coding of conspecific vocalizations in the ventral auditory cortical stream.
Fukushima, Makoto; Saunders, Richard C; Leopold, David A; Mishkin, Mortimer; Averbeck, Bruno B
2014-03-26
The mammalian auditory cortex integrates spectral and temporal acoustic features to support the perception of complex sounds, including conspecific vocalizations. Here we investigate coding of vocal stimuli in different subfields in macaque auditory cortex. We simultaneously measured auditory evoked potentials over a large swath of primary and higher order auditory cortex along the supratemporal plane in three animals chronically using high-density microelectrocorticographic arrays. To evaluate the capacity of neural activity to discriminate individual stimuli in these high-dimensional datasets, we applied a regularized multivariate classifier to evoked potentials to conspecific vocalizations. We found a gradual decrease in the level of overall classification performance along the caudal to rostral axis. Furthermore, the performance in the caudal sectors was similar across individual stimuli, whereas the performance in the rostral sectors significantly differed for different stimuli. Moreover, the information about vocalizations in the caudal sectors was similar to the information about synthetic stimuli that contained only the spectral or temporal features of the original vocalizations. In the rostral sectors, however, the classification for vocalizations was significantly better than that for the synthetic stimuli, suggesting that conjoined spectral and temporal features were necessary to explain differential coding of vocalizations in the rostral areas. We also found that this coding in the rostral sector was carried primarily in the theta frequency band of the response. These findings illustrate a progression in neural coding of conspecific vocalizations along the ventral auditory pathway.
Differential Coding of Conspecific Vocalizations in the Ventral Auditory Cortical Stream
Saunders, Richard C.; Leopold, David A.; Mishkin, Mortimer; Averbeck, Bruno B.
2014-01-01
The mammalian auditory cortex integrates spectral and temporal acoustic features to support the perception of complex sounds, including conspecific vocalizations. Here we investigate coding of vocal stimuli in different subfields in macaque auditory cortex. We simultaneously measured auditory evoked potentials over a large swath of primary and higher order auditory cortex along the supratemporal plane in three animals chronically using high-density microelectrocorticographic arrays. To evaluate the capacity of neural activity to discriminate individual stimuli in these high-dimensional datasets, we applied a regularized multivariate classifier to evoked potentials to conspecific vocalizations. We found a gradual decrease in the level of overall classification performance along the caudal to rostral axis. Furthermore, the performance in the caudal sectors was similar across individual stimuli, whereas the performance in the rostral sectors significantly differed for different stimuli. Moreover, the information about vocalizations in the caudal sectors was similar to the information about synthetic stimuli that contained only the spectral or temporal features of the original vocalizations. In the rostral sectors, however, the classification for vocalizations was significantly better than that for the synthetic stimuli, suggesting that conjoined spectral and temporal features were necessary to explain differential coding of vocalizations in the rostral areas. We also found that this coding in the rostral sector was carried primarily in the theta frequency band of the response. These findings illustrate a progression in neural coding of conspecific vocalizations along the ventral auditory pathway. PMID:24672012
Non-invasive In vivo measurement of the shear modulus of human vocal fold tissue
Kazemirad, Siavash; Bakhshaee, Hani; Mongeau, Luc; Kost, Karen
2014-01-01
Voice is the essential part of singing and speech communication. Voice disorders significantly affect the quality of life. The viscoelastic mechanical properties of the vocal fold mucosa determine the characteristics of the vocal folds oscillations, and thereby voice quality. In the present study, a non-invasive method was developed to determine the shear modulus of human vocal fold tissue in vivo via measurements of the mucosal wave propagation speed during phonation. Images of four human subjects’ vocal folds were captured using high speed digital imaging (HSDI) and magnetic resonance imaging (MRI) for different phonation pitches, specifically fundamental frequencies between 110 to 440 Hz. The MRI images were used to obtain the morphometric dimensions of each subject's vocal folds in order to determine the pixel size in the high-speed images. The mucosal wave propagation speed was determined for each subject and at each pitch value using an automated image processing algorithm. The transverse shear modulus of the vocal fold mucosa was then calculated from a surface (Rayleigh) wave propagation dispersion equation using the measured wave speeds. It was found that the mucosal wave propagation speed and therefore the shear modulus of the vocal fold tissue were generally greater at higher pitches. The results were in good agreement with those from other studies obtained via in vitro measurements, thereby supporting the validity of the proposed measurement method. This method offers the potential for in vivo clinical assessments of vocal folds viscoelasticity from HSDI. PMID:24433668
Mechanism of and Threshold Biomechanical Conditions for Falsetto Voice Onset
Deguchi, Shinji
2011-01-01
The sound source of a voice is produced by the self-excited oscillation of the vocal folds. In modal voice production, a drastic increase in transglottal pressure after vocal fold closure works as a driving force that develops self-excitation. Another type of vocal fold oscillation with less pronounced glottal closure observed in falsetto voice production has been accounted for by the mucosal wave theory. The classical theory assumes a quasi-steady flow, and the expected driving force onto the vocal folds under wavelike motion is derived from the Bernoulli effect. However, wavelike motion is not always observed during falsetto voice production. More importantly, the application of the quasi-steady assumption to a falsetto voice with a fundamental frequency of several hundred hertz is unsupported by experiments. These considerations suggested that the mechanism of falsetto voice onset may be essentially different from that explained by the mucosal wave theory. In this paper, an alternative mechanism is submitted that explains how self-excitation reminiscent of the falsetto voice could be produced independent of the glottal closure and wavelike motion. This new explanation is derived through analytical procedures by employing only general unsteady equations of motion for flow and solids. The analysis demonstrated that a convective acceleration of a flow induced by rapid wall movement functions as a negative damping force, leading to the self-excitation of the vocal folds. The critical subglottal pressure and volume flow are expressed as functions of vocal fold biomechanical properties, geometry, and voice fundamental frequency. The analytically derived conditions are qualitatively and quantitatively reasonable in view of reported measurement data of the thresholds required for falsetto voice onset. Understanding of the voice onset mechanism and the explicit mathematical descriptions of thresholds would be beneficial for the diagnosis and treatment of voice diseases and the development of artificial vocal folds. PMID:21408178
Zimmer-Nowicka, Joanna; Januszewska-Stańczyk, Henryka
2011-07-01
Upper respiratory tract infections (URTI) are among the major causes of dysphonia. There are only scarce data available on the incidence and predisposing factors of URTI in young singers, in particular, during a period of intense voice training. The data were obtained from medical records and a 43-item questionnaire distributed among 94 students of the vocal faculty (66 females and 28 males-age: 23.5±3.7 years) at all levels of their studies. The questions were divided into several categories, that is, personal, anthropometric, demographic, history of vocal education, and both general and singer-specific health risk factors. The rate of URTI showed a steady decrease during vocal studies. The strongest factor predisposing to infections in the multivariate regression model was nonadherence to vocal hygiene. There was also a weak protective effect of a regular holiday rest and negative effect of allergy. The prevalence of several recognized risk factors of URTI was exceptionally high in the group of vocal students, for example, passive smoking (42.5%), poor dental status (39.4%), frequent gastric complaints (44.7%), and allergy (50%). Despite the persistence of many risk factors throughout the vocal studies, the frequency of URTI significantly decreases most likely because of vocal hygiene education and growing professional experience. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Sexual Hearing: The influence of sex hormones on acoustic communication in frogs
Arch, Victoria S.; Narins, Peter M.
2009-01-01
The majority of anuran amphibians (frogs and toads) use acoustic communication to mediate sexual behavior and reproduction. Generally, females find and select their mates using acoustic cues provided by males in the form of conspicuous advertisement calls. In these species, vocal signal production and reception are intimately tied to successful reproduction. Research with anurans has demonstrated that acoustic communication is modulated by reproductive hormones, including gonadal steroids and peptide neuromodulators. Most of these studies have focused on the ways in which hormonal systems influence vocal signal production; however, here we will concentrate on a growing body of literature that examines hormonal modulation of call reception. This literature suggests that reproductive hormones contribute to the coordination of reproductive behaviors between signaler and receiver by modulating sensitivity and spectral filtering of the anuran auditory system. It has become evident that the hormonal systems that influence reproductive behaviors are highly conserved among vertebrate taxa, thus studying the endocrine and neuromodulatory bases of acoustic communication in frogs and toads can lead to insights of broader applicability to hormonal modulation of vertebrate sensory physiology and behavior. PMID:19272318
Signal analysis of the female singing voice: Features for perceptual singer identity
NASA Astrophysics Data System (ADS)
Mellody, Maureen
2001-07-01
Individual singing voices tend to be easy for a listener to identify, particularly when compared to the difficulty of identifying the performer of any other musical instrument. What cues does a listener use to identify a particular singing voice? This work seeks to identify a set of features with which one can synthesize notes with the vocal quality of a particular singer. Such analysis and synthesis influences computer music (in the creation of synthetic sounds with different timbre), vocal pedagogy (as a training tool to help singers understand properties of their own voice as well as different professional-quality voices), and vocal health (to identify improper behavior in vocal production). The problem of singer identification is approached in three phases: signal analysis, the development of low- order representations, and perceptual evaluation. To perform the signal analysis, a high-resolution time- frequency distribution is applied to vowel tokens from sopranos and mezzo-sopranos. From these results, low- order representations are created for each singer's notes, which are used to synthesize sounds with the timbral quality of that singer. Finally, these synthesized sounds, along with original recordings, are evaluated by trained listeners in a variety of perceptual experiments to determine the extent to which the vocal quality of the desired singer is captured. Results from the signal analysis show that amplitude and frequency estimates extracted from the time-frequency signal analysis can be used to re-create each signal with little degradation in quality and no loss of perceptual identity. Low-order representations derived from the signal analysis are used in clustering and classification, which successfully clusters signals with corresponding singer identity. Finally, perceptual results indicate that trained listeners are, surprisingly, only modestly successful at correctly identifying the singer of a recording, and find the task to be particularly difficult for certain voices and extremely easy for others. Listeners also indicate that the majority of sounds synthesized with the low-order representations sufficiently capture the desired vocal timbre. Again, the task is easy for certain voices and much more difficult when evaluating other singers, consistent with the results from the original recordings.
Vowels Development in Babbling of typically developing 6-to-12-month old Persian-learning Infants.
Fotuhi, Mina; Yadegari, Fariba; Teymouri, Robab
2017-10-01
Pre-linguistic vocalizations including early consonants, vowels, and their combinations into syllables are considered as important predictors of the speech and language development. The purpose of this study was to examine vowel development in babblings of normally developing Persian-learning infants. Eight typically developing 6-8-month-old Persian-learning infants (3 boys and 5 girls) participated in this 4-month longitudinal descriptive-analytic study. A weekly 30-60-minute audio- and video-recording was obtained at home from the comfort state vocalizations of infants and the mother-child interactions. A total of 74:02:03 hours of vocalizations were phonetically transcribed. Seven vowels comprising /i/,/e/,/a/,/u/,/o/,/ɑ/, and /ә/ were identified in the babblings. The inter-rater reliability was obtained for 20% of vocalizations. The data were analyzed by repeated measures ANOVA and Pearson's correlation coefficient using SPSS software version 20. The results showed that two vowels /a/ (46.04) and /e/ (23.60) were produced with the highest mean frequency of occurrence, respectively. Regarding front/back dimension, the front vowels were the most prominent ones (71.87); in terms of height, low (46.78) and mid (32.45) vowels occurred maximally. A good inter-rater reliability was obtained (0.99, P < .01). The increased frequency of occurrence of the low and mid front vowels in the current study was consistent with previous studies on the emergence of vowels in pre-linguistic vocalization in other languages.
Cochlear implanted children present vocal parameters within normal standards.
de Souza, Lourdes Bernadete Rocha; Bevilacqua, Maria Cecília; Brasolotto, Alcione Ghedini; Coelho, Ana Cristina
2012-08-01
to compare acoustic and perceptual parameters regarding the voice of cochlear implanted children, with normal hearing children. this is a cross-sectional, quantitative and qualitative study. Thirty six cochlear implanted children aged between 3 y and 3 m to 5 y and 9 m and 25 children with normal hearing, aged between 3 y and 11 m and 6 y and 6 m, participated in this study. The recordings and the acoustics analysis of the sustained vowel/a/and spontaneous speech were performed using the PRAAT program. The parameters analyzed for the sustained vowel were the mean of the fundamental frequency, jitter, shimmer and harmonic-to-noise ratio (HNR). For the spontaneous speech, the minimum and maximum frequencies and the number of semitones were extracted. The perceptual analysis of the speech material was analyzed using visual-analogical scales of 100 points, composing the aspects related to the overall severity of the vocal deviation, roughness, breathiness, strain, pitch, loudness and resonance deviation, and instability. This last parameter was only analyzed for the sustained vowel. The results demonstrated that the majority of the vocal parameters analyzed in the samples of the implanted children disclosed values similar to those obtained by the group of children with normal hearing. implanted children who participate in a (re) habilitation and follow-up program, can present vocal characteristics similar to those vocal characteristics of children with normal hearing. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
In Vivo Measurement of Pediatric Vocal Fold Motion Using Structured Light Laser Projection
Patel, Rita R.; Donohue, Kevin D.; Lau, Daniel; Unnikrishnan, Harikrishnan
2013-01-01
Summary Objective The aim of the study was to present the development of a miniature structured light laser projection endoscope and to quantify vocal fold length and vibratory features related to impact stress of the pediatric glottis using high-speed imaging. Study Design The custom-developed laser projection system consists of a green laser with a 4-mm diameter optics module at the tip of the endoscope, projecting 20 vertical laser lines on the glottis. Measurements of absolute phonatory vocal fold length, membranous vocal fold length, peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity were obtained in five children (6–9 years), two adult male and three adult female participants without voice disorders, and one child (10 years) with bilateral vocal fold nodules during modal phonation. Results Independent measurements made on the glottal length of a vocal fold phantom demonstrated a 0.13 mm bias error with a standard deviation of 0.23 mm, indicating adequate precision and accuracy for measuring vocal fold structures and displacement. First, in vivo measurements of amplitude-to-length ratio, peak closing velocity, and impact velocity during phonation in pediatric population and a child with vocal fold nodules are reported. Conclusion The proposed laser projection system can be used to obtain in vivo measurements of absolute length and vibratory features in children and adults. Children have large amplitude-to-length ratio compared with typically developing adults, whereas nodules result in larger peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity compared with typically developing children. PMID:23809569
Sound Rhythms Are Encoded by Postinhibitory Rebound Spiking in the Superior Paraolivary Nucleus
Felix, Richard A.; Fridberger, Anders; Leijon, Sara; Berrebi, Albert S.; Magnusson, Anna K.
2013-01-01
The superior paraolivary nucleus (SPON) is a prominent structure in the auditory brainstem. In contrast to the principal superior olivary nuclei with identified roles in processing binaural sound localization cues, the role of the SPON in hearing is not well understood. A combined in vitro and in vivo approach was used to investigate the cellular properties of SPON neurons in the mouse. Patch-clamp recordings in brain slices revealed that brief and well timed postinhibitory rebound spiking, generated by the interaction of two subthreshold-activated ion currents, is a hallmark of SPON neurons. The Ih current determines the timing of the rebound, whereas the T-type Ca2+ current boosts the rebound to spike threshold. This precisely timed rebound spiking provides a physiological explanation for the sensitivity of SPON neurons to sinusoidally amplitude-modulated (SAM) tones in vivo, where peaks in the sound envelope drive inhibitory inputs and SPON neurons fire action potentials during the waveform troughs. Consistent with this notion, SPON neurons display intrinsic tuning to frequency-modulated sinusoidal currents (1–15Hz) in vitro and discharge with strong synchrony to SAMs with modulation frequencies between 1 and 20 Hz in vivo. The results of this study suggest that the SPON is particularly well suited to encode rhythmic sound patterns. Such temporal periodicity information is likely important for detection of communication cues, such as the acoustic envelopes of animal vocalizations and speech signals. PMID:21880918
Sound rhythms are encoded by postinhibitory rebound spiking in the superior paraolivary nucleus.
Felix, Richard A; Fridberger, Anders; Leijon, Sara; Berrebi, Albert S; Magnusson, Anna K
2011-08-31
The superior paraolivary nucleus (SPON) is a prominent structure in the auditory brainstem. In contrast to the principal superior olivary nuclei with identified roles in processing binaural sound localization cues, the role of the SPON in hearing is not well understood. A combined in vitro and in vivo approach was used to investigate the cellular properties of SPON neurons in the mouse. Patch-clamp recordings in brain slices revealed that brief and well timed postinhibitory rebound spiking, generated by the interaction of two subthreshold-activated ion currents, is a hallmark of SPON neurons. The I(h) current determines the timing of the rebound, whereas the T-type Ca(2+) current boosts the rebound to spike threshold. This precisely timed rebound spiking provides a physiological explanation for the sensitivity of SPON neurons to sinusoidally amplitude-modulated (SAM) tones in vivo, where peaks in the sound envelope drive inhibitory inputs and SPON neurons fire action potentials during the waveform troughs. Consistent with this notion, SPON neurons display intrinsic tuning to frequency-modulated sinusoidal currents (1-15Hz) in vitro and discharge with strong synchrony to SAMs with modulation frequencies between 1 and 20 Hz in vivo. The results of this study suggest that the SPON is particularly well suited to encode rhythmic sound patterns. Such temporal periodicity information is likely important for detection of communication cues, such as the acoustic envelopes of animal vocalizations and speech signals.
Vocal specialization through tracheal elongation in an extinct Miocene pheasant from China.
Li, Zhiheng; Clarke, Julia A; Eliason, Chad M; Stidham, Thomas A; Deng, Tao; Zhou, Zhonghe
2018-05-25
Modifications to the upper vocal tract involving hyper-elongated tracheae have evolved many times within crown birds, and their evolution has been linked to a 'size exaggeration' hypothesis in acoustic signaling and communication, whereby smaller-sized birds can produce louder sounds. A fossil skeleton of a new extinct species of wildfowl (Galliformes: Phasianidae) from the late Miocene of China, preserves an elongated, coiled trachea that represents the oldest fossil record of this vocal modification in birds and the first documentation of its evolution within pheasants. The phylogenetic position of this species within Phasianidae has not been fully resolved, but appears to document a separate independent origination of this vocal modification within Galliformes. The fossil preserves a coiled section of the trachea and other remains supporting a tracheal length longer than the bird's body. This extinct species likely produced vocalizations with a lower fundamental frequency and reduced harmonics compared to similarly-sized pheasants. The independent evolution of this vocal feature in galliforms living in both open and closed habitats does not appear to be correlated with other factors of biology or its open savanna-like habitat. Features present in the fossil that are typically associated with sexual dimorphism suggest that sexual selection may have resulted in the evolution of both the morphology and vocalization mechanism in this extinct species.
Hyaluronic acid (with fibronectin) as a bioimplant for the vocal fold mucosa.
Chan, R W; Titze, I R
1999-07-01
To measure the viscoelastic shear properties of hyaluronic acid, with and without fibronectin, and to compare them with those of the human vocal fold mucosa and other phonosurgical biomaterials. Viscoelastic shear properties of various implantable biomaterials (Teflon, gelatin, collagen, fat, hyaluronic acid, and hyaluronic acid with fibronectin) were measured with a parallel-plate rotational rheometer. Elastic and viscous shear properties were quantified as a function of oscillation frequency (0.01-15 Hz) at 37 degrees C. The shear properties of hyaluronic acid were relatively close to those of human vocal fold mucosal tissues reported previously. Hyaluronic acid at specific concentrations (0.5%-1%), with or without fibronectin, was found to exhibit viscous shear properties (viscous shear modulus and dynamic viscosity) similar to those of the average male and female vocal fold mucosa. According to a theory that establishes the effects of tissue shear properties on vocal fold oscillation, phonation threshold pressure (a measure of the ease of phonation) is directly related to the viscous shear modulus of the vibrating vocal fold mucosa. Therefore, our findings suggest that hyaluronic acid, either by itself or mixed with fibronectin, may be a potentially optimal bioimplant for the surgical management of vocal fold mucosal defects and lamina propria deficiencies (e.g., scarring) from a biomechanical standpoint.
Feminization laryngoplasty: assessment of surgical pitch elevation.
Thomas, James P; Macmillan, Cody
2013-09-01
The aim of this study is to analyze change in pitch following feminization laryngoplasty, a technique to alter the vocal tract of male to female transgender patients. This is a retrospective review of 94 patients undergoing feminization laryngoplasty between June 2002 and April 2012 of which 76 individuals completed follow-up audio recordings. Feminization laryngoplasty is a procedure removing the anterior thyroid cartilage, collapsing the diameter of the larynx as well as shortening and tensioning the vocal folds to raise the pitch. Changes in comfortable speaking pitch, lowest vocal pitch and highest vocal pitch are assessed before and after surgery. Acoustic parameters of speaking pitch and vocal range were compared between pre- and postoperative results. The average comfortable speaking pitch preoperatively, C3# (139 Hz), was raised an average of six semitones to G3 (196 Hz), after surgical intervention. The lowest attainable pitch was raised an average of seven semitones and the highest attainable pitch decreased by an average of two semitones. One aspect of the procedure, thyrohyoid approximation (introduced in 2006 to alter resonance), did not affect pitch. Feminization laryngoplasty successfully increased the comfortable fundamental frequency of speech and removed the lowest notes from the patient's vocal range. It does not typically raise the upper limits of the vocal range.
Tyson, Reny B; Nowacek, Douglas P; Miller, Patrick J O
2007-09-01
Nonlinear phenomena or nonlinearities in animal vocalizations include features such as subharmonics, deterministic chaos, biphonation, and frequency jumps that until recently were generally ignored in acoustic analyses. Recent documentation of these phenomena in several species suggests that they may play a communicative role, though the exact function is still under investigation. Here, qualitative descriptions and quantitative analyses of nonlinearities in the vocalizations of killer whales (Orcinus orca) and North Atlantic right whales (Eubalaena glacialis) are provided. All four nonlinear features were present in both species, with at least one feature occurring in 92.4% of killer and 65.7% of right whale vocalizations analyzed. Occurrence of biphonation varied the most between species, being present in 89.0% of killer whale vocalizations and only 20.4% of right whale vocalizations. Because deterministic chaos is qualitatively and quantitatively different than random or Gaussian noise, a program (TISEAN) designed specifically to identify deterministic chaos to confirm the presence of this nonlinearity was used. All segments tested in this software indicate that both species do indeed exhibit deterministic chaos. The results of this study provide confirmation that such features are common in the vocalizations of cetacean species and lay the groundwork for future studies.
``Sub-prime" Biophysics: Acoustic assessment of animal stress
NASA Astrophysics Data System (ADS)
Browning, David
2007-10-01
Animal welfare is of increasing concern. Vocalizations can be easily monitored and for some animals, such as the ``yip'' of a dog, stress is easily discernible. Unfortunately for many important farm animals, such as cows, sheep, and horses, the impact of stress on vocalizations appears to be more subtle. Our work is presently focused on the frequency spectra of horse whinnies. A whinny is comprised of two components; a tonal structure, and a varying frequency component or ``call.'' Results to date are presented on whether a horse can control this ``call'' so that there is a significant difference between a ``stressed'' whinny and a ``happy'' whinny.
Yu, Chengzhu; Hansen, John H L
2017-03-01
Human physiology has evolved to accommodate environmental conditions, including temperature, pressure, and air chemistry unique to Earth. However, the environment in space varies significantly compared to that on Earth and, therefore, variability is expected in astronauts' speech production mechanism. In this study, the variations of astronaut voice characteristics during the NASA Apollo 11 mission are analyzed. Specifically, acoustical features such as fundamental frequency and phoneme formant structure that are closely related to the speech production system are studied. For a further understanding of astronauts' vocal tract spectrum variation in space, a maximum likelihood frequency warping based analysis is proposed to detect the vocal tract spectrum displacement during space conditions. The results from fundamental frequency, formant structure, as well as vocal spectrum displacement indicate that astronauts change their speech production mechanism when in space. Moreover, the experimental results for astronaut voice identification tasks indicate that current speaker recognition solutions are highly vulnerable to astronaut voice production variations in space conditions. Future recommendations from this study suggest that successful applications of speaker recognition during extended space missions require robust speaker modeling techniques that could effectively adapt to voice production variation caused by diverse space conditions.
NASA Astrophysics Data System (ADS)
Rendall, Drew; Owren, Michael J.; Weerts, Elise; Hienz, Robert D.
2004-01-01
This study quantifies sex differences in the acoustic structure of vowel-like grunt vocalizations in baboons (Papio spp.) and tests the basic perceptual discriminability of these differences to baboon listeners. Acoustic analyses were performed on 1028 grunts recorded from 27 adult baboons (11 males and 16 females) in southern Africa, focusing specifically on the fundamental frequency (F0) and formant frequencies. The mean F0 and the mean frequencies of the first three formants were all significantly lower in males than they were in females, more dramatically so for F0. Experiments using standard psychophysical procedures subsequently tested the discriminability of adult male and adult female grunts. After learning to discriminate the grunt of one male from that of one female, five baboon subjects subsequently generalized this discrimination both to new call tokens from the same individuals and to grunts from novel males and females. These results are discussed in the context of both the possible vocal anatomical basis for sex differences in call structure and the potential perceptual mechanisms involved in their processing by listeners, particularly as these relate to analogous issues in human speech production and perception.
Helium Speech: An Application of Standing Waves
NASA Astrophysics Data System (ADS)
Wentworth, Christopher D.
2011-04-01
Taking a breath of helium gas and then speaking or singing to the class is a favorite demonstration for an introductory physics course, as it usually elicits appreciative laughter, which serves to energize the class session. Students will usually report that the helium speech "raises the frequency" of the voice. A more accurate description of the phenomenon requires that we distinguish between the frequencies of sound produced by the larynx and the filtering of those frequencies by the vocal tract. We will describe here an experiment done by introductory physics students that uses helium speech as a context for learning about the human vocal system and as an application of the standing sound-wave concept. Modern acoustic analysis software easily obtained by instructors for student use allows data to be obtained and analyzed quickly.
Vibrational dynamics of vocal folds using nonlinear normal modes.
Pinheiro, Alan P; Kerschen, Gaëtan
2013-08-01
Many previous works involving physical models, excised and in vivo larynges have pointed out nonlinear vibration in vocal folds during voice production. Moreover, theoretical studies involving mechanical modeling of these folds have tried to gain a profound understanding of the observed nonlinear phenomena. In this context, the present work uses the nonlinear normal mode theory to investigate the nonlinear modal behavior of 16 subjects using a two-mass mechanical modeling of the vocal folds. The free response of the conservative system at different energy levels is considered to assess the impact of the structural nonlinearity of the vocal fold tissues. The results show very interesting and complex nonlinear phenomena including frequency-energy dependence, subharmonic regimes and, in some cases, modal interactions, entrainment and bifurcations. Copyright © 2012 IPEM. Published by Elsevier Ltd. All rights reserved.
Mooshammer, Christine
2010-01-01
This study uses acoustic and physiological measures to compare laryngeal reflexes of global changes in vocal effort to the effects of modulating such aspects of linguistic prominence as sentence accent, induced by focus variation, and word stress. Seven speakers were recorded by using a laryngograph. The laryngographic pulses were preprocessed to normalize time and amplitude. The laryngographic pulse shape was quantified using open and skewness quotients and also by applying a functional version of the principal component analysis. Acoustic measures included the acoustic open quotient and spectral balance in the vowel ∕e∕ during the test syllable. The open quotient and the laryngographic pulse shape indicated a significantly shorter open phase for loud speech than for soft speech. Similar results were found for lexical stress, suggesting that lexical stress and loud speech are produced with a similar voice source mechanism. Stressed syllables were distinguished from unstressed syllables by their open phase and pulse shape, even in the absence of sentence accent. Evidence for laryngeal involvement in signaling focus, independent of fundamental frequency changes, was not as consistent across speakers. Acoustic results on various spectral balance measures were generally much less consistent compared to results from laryngographic data. PMID:20136226
Numerical analysis of effects of transglottal pressure change on fundamental frequency of phonation.
Deguchi, Shinji; Matsuzaki, Yuji; Ikeda, Tadashige
2007-02-01
In humans, a decrease in transglottal pressure (Pt) causes an increase in the fundamental frequency of phonation (F0) only at a specific voice pitch within the modal register, the mechanism of which remains unclear. In the present study, numerical analyses were performed to investigate the mechanism of the voice pitch-dependent positive change of F0 due to Pt decrease. The airflow and the airway, including the vocal folds, were modeled in terms of mechanics of fluid and structure. Simulations of phonation using the numerical model indicated that Pt affects both the average position and the average amplitude magnitude of vocal fold self-excited oscillation in a non-monotonous manner. This effect results in voice pitch-dependent responses of F0 to Pt decreases, including the positive response of F0 as actually observed in humans. The findings of the present study highlight the importance of considering self-excited oscillation of the vocal folds in elucidation of the phonation mechanism.
Development of auditory sensitivity in budgerigars (Melopsittacus undulatus)
NASA Astrophysics Data System (ADS)
Brittan-Powell, Elizabeth F.; Dooling, Robert J.
2004-06-01
Auditory feedback influences the development of vocalizations in songbirds and parrots; however, little is known about the development of hearing in these birds. The auditory brainstem response was used to track the development of auditory sensitivity in budgerigars from hatch to 6 weeks of age. Responses were first obtained from 1-week-old at high stimulation levels at frequencies at or below 2 kHz, showing that budgerigars do not hear well at hatch. Over the next week, thresholds improved markedly, and responses were obtained for almost all test frequencies throughout the range of hearing by 14 days. By 3 weeks posthatch, birds' best sensitivity shifted from 2 to 2.86 kHz, and the shape of the auditory brainstem response (ABR) audiogram became similar to that of adult budgerigars. About a week before leaving the nest, ABR audiograms of young budgerigars are very similar to those of adult birds. These data complement what is known about vocal development in budgerigars and show that hearing is fully developed by the time that vocal learning begins.
Effect on LTAS of vocal loudness variation.
Nordenberg, Maria; Sundberg, Johan
2004-01-01
Long-term-average spectrum (LTAS) is an efficient method for voice analysis, revealing both voice source and formant characteristics. However, the LTAS contour is non-uniformly affected by vocal loudness. This variation was analyzed in 15 male and 16 female untrained voices reading a text 7 times at different degrees of vocal loudness, mean change in overall equivalent sound level (Leq) amounting to 27.9 dB and 28.4 dB for the female and male subjects. For all frequency values up to 4 kHz, spectrum level was strongly and linearly correlated with Leq for each subject. The gain factor, that is to say, the rate of level increase, varied with frequency, from about 0.5 at low frequencies to about 1.5 in the frequency range 1.5-3 kHz. Using the gain factors for a subject, LTAS contours could be predicted at any Leq within the measured range, with an average accuracy of 2-3 dB below 4 kHz. Mean LTAS calculated for an Leq of 70 dB for each subject showed considerable individual variation for both males and females, SD of the level varying between 7 dB and 4 dB depending on frequency. On the other hand, the results also suggest that meaningful comparisons of LTAS, recorded for example before and after voice therapy, can be made, provided that the documentation includes a set of recordings at different loudness levels from one recording session.
Bright, Leah; Secko, Michael; Mehta, Ninfa; Paladino, Lorenzo; Sinert, Richard
2014-01-01
Background: Ultrasound is a readily available, non-invasive technique to visualize airway dimensions at the patient's bedside and possibly predict difficult airways before invasively looking; however, it has rarely been used for emergency investigation of the larynx. There is limited literature on the sonographic measurements of true vocal cords in adults and normal parameters must be established before abnormal parameters can be accurately identified. Objectives: The primary objective of the following study is to identify the normal sonographic values of human true vocal cords in an adult population. A secondary objective is to determine if there is a difference in true vocal cord measurements in people with different body mass indices (BMIs). The third objective was to determine if there was a statistical difference in the measurements for both genders. Materials and Methods: True vocal cord measurements were obtained in healthy volunteers by ultrasound fellowship trained emergency medicine physicians using a high frequency linear transducer orientated transversely across the anterior surface of the neck at the level of the thyroid cartilage. The width of the true vocal cord was measured perpendicularly to the length of the cord at its mid-portion. This method was duplicated from a previous study to create a standard of measurement acquisition. Results: A total of 38 subjects were enrolled. The study demonstrated no correlation between vocal cord measurements and patient's characteristics of height, weight, or BMI's. When accounting for vocal cord measurements by gender, males had larger BMI's and larger vocal cord measurements compared with females subjects with a statistically significant different in right vocal cord measurements for females compared with male subjects. Conclusion: No correlation was seen between vocal cord measurements and person's BMIs. In the study group of normal volunteers, there was a difference in size between the male and female vocal cord size. PMID:24812456
Bright, Leah; Secko, Michael; Mehta, Ninfa; Paladino, Lorenzo; Sinert, Richard
2014-04-01
Ultrasound is a readily available, non-invasive technique to visualize airway dimensions at the patient's bedside and possibly predict difficult airways before invasively looking; however, it has rarely been used for emergency investigation of the larynx. There is limited literature on the sonographic measurements of true vocal cords in adults and normal parameters must be established before abnormal parameters can be accurately identified. The primary objective of the following study is to identify the normal sonographic values of human true vocal cords in an adult population. A secondary objective is to determine if there is a difference in true vocal cord measurements in people with different body mass indices (BMIs). The third objective was to determine if there was a statistical difference in the measurements for both genders. True vocal cord measurements were obtained in healthy volunteers by ultrasound fellowship trained emergency medicine physicians using a high frequency linear transducer orientated transversely across the anterior surface of the neck at the level of the thyroid cartilage. The width of the true vocal cord was measured perpendicularly to the length of the cord at its mid-portion. This method was duplicated from a previous study to create a standard of measurement acquisition. A total of 38 subjects were enrolled. The study demonstrated no correlation between vocal cord measurements and patient's characteristics of height, weight, or BMI's. When accounting for vocal cord measurements by gender, males had larger BMI's and larger vocal cord measurements compared with females subjects with a statistically significant different in right vocal cord measurements for females compared with male subjects. No correlation was seen between vocal cord measurements and person's BMIs. In the study group of normal volunteers, there was a difference in size between the male and female vocal cord size.
Voice amplification as a means of reducing vocal load for elementary music teachers.
Morrow, Sharon L; Connor, Nadine P
2011-07-01
Music teachers are over four times more likely than classroom teachers to develop voice disorders and greater than eight times more likely to have voice-related problems than the general public. Research has shown that individual voice-use parameters of phonation time, fundamental frequency and vocal intensity, as well as vocal load as calculated by cycle dose and distance dose are significantly higher for music teachers than their classroom teacher counterparts. Finding effective and inexpensive prophylactic measures to decrease vocal load for music teachers is an important aspect for voice preservation for this group of professional voice users. The purpose of this study was to determine the effects of voice amplification on vocal intensity and vocal load in the workplace as measured using a KayPENTAX Ambulatory Phonation Monitor (APM) (KayPENTAX, Lincoln Park, NJ). Seven music teachers were monitored for 1 workweek using an APM to determine average vocal intensity (dB sound pressure level [SPL]) and vocal load as calculated by cycle dose and distance dose. Participants were monitored a second week while using a voice amplification unit (Asyst ChatterVox; Asyst Communications Company, Inc., Indian Creek, IL). Significant decreases in mean vocal intensity of 7.00-dB SPL (P<0.001) were found using amplification, along with significant decreases (P=0.001) in cycle dose and distance dose. In addition, mean phonation time was found to decrease using amplification (P=0.023). These data suggest that voice amplification may be an effective intervention to decrease the potentially damaging vocal loads experienced by elementary music teachers in the classroom. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Frey, Roland; Gebler, Alban; Fritsch, Guido; Nygrén, Kaarlo; Weissengruber, Gerald E
2007-01-01
Laryngeal air sacs have evolved convergently in diverse mammalian lineages including insectivores, bats, rodents, pinnipeds, ungulates and primates, but their precise function has remained elusive. Among cervids, the vocal tract of reindeer has evolved an unpaired inflatable ventrorostral laryngeal air sac. This air sac is not present at birth but emerges during ontogenetic development. It protrudes from the laryngeal vestibulum via a short duct between the epiglottis and the thyroid cartilage. In the female the growth of the air sac stops at the age of 2–3 years, whereas in males it continues to grow up to the age of about 6 years, leading to a pronounced sexual dimorphism of the air sac. In adult females it is of moderate size (about 100 cm3), whereas in adult males it is large (3000–4000 cm3) and becomes asymmetric extending either to the left or to the right side of the neck. In both adult females and males the ventral air sac walls touch the integument. In the adult male the air sac is laterally covered by the mandibular portion of the sternocephalic muscle and the skin. Both sexes of reindeer have a double stylohyoid muscle and a thyroepiglottic muscle. Possibly these muscles assist in inflation of the air sac. Head-and-neck specimens were subjected to macroscopic anatomical dissection, computer tomographic analysis and skeletonization. In addition, isolated larynges were studied for comparison. Acoustic recordings were made during an autumn round-up of semi-domestic reindeer in Finland and in a small zoo herd. Male reindeer adopt a specific posture when emitting their serial hoarse rutting calls. Head and neck are kept low and the throat region is extended. In the ventral neck region, roughly corresponding to the position of the large air sac, there is a mane of longer hairs. Neck swelling and mane spreading during vocalization may act as an optical signal to other males and females. The air sac, as a side branch of the vocal tract, can be considered as an additional acoustic filter. Individual acoustic recognition may have been the primary function in the evolution of a size-variable air sac, and this function is retained in mother–young communication. In males sexual selection seems to have favoured a considerable size increase of the air sac and a switch to call series instead of single calls. Vocalization became restricted to the rutting period serving the attraction of females. We propose two possibilities for the acoustic function of the air sac in vocalization that do not exclude each other. The first assumes a coupling between air sac and the environment, resulting in an acoustic output that is a combination of the vocal tract resonance frequencies emitted via mouth and nostrils and the resonance frequencies of the air sac transmitted via the neck skin. The second assumes a weak coupling so that resonance frequencies of the air sac are lost to surrounding tissues by dissipation. In this case the resonance frequencies of the air sac solely influence the signal that is further filtered by the remaining vocal tract. According to our results one acoustic effect of the air sac in adult reindeer might be to mask formants of the vocal tract proper. In other cervid species, however, formants of rutting calls convey essential information on the quality of the sender, related to its potential reproductive success, to conspecifics. Further studies are required to solve this inconsistency. PMID:17310544
Mapping the Early Language Environment Using All-Day Recordings and Automated Analysis.
Gilkerson, Jill; Richards, Jeffrey A; Warren, Steven F; Montgomery, Judith K; Greenwood, Charles R; Kimbrough Oller, D; Hansen, John H L; Paul, Terrance D
2017-05-17
This research provided a first-generation standardization of automated language environment estimates, validated these estimates against standard language assessments, and extended on previous research reporting language behavior differences across socioeconomic groups. Typically developing children between 2 to 48 months of age completed monthly, daylong recordings in their natural language environments over a span of approximately 6-38 months. The resulting data set contained 3,213 12-hr recordings automatically analyzed by using the Language Environment Analysis (LENA) System to generate estimates of (a) the number of adult words in the child's environment, (b) the amount of caregiver-child interaction, and (c) the frequency of child vocal output. Child vocalization frequency and turn-taking increased with age, whereas adult word counts were age independent after early infancy. Child vocalization and conversational turn estimates predicted 7%-16% of the variance observed in child language assessment scores. Lower socioeconomic status (SES) children produced fewer vocalizations, engaged in fewer adult-child interactions, and were exposed to fewer daily adult words compared with their higher socioeconomic status peers, but within-group variability was high. The results offer new insight into the landscape of the early language environment, with clinical implications for identification of children at-risk for impoverished language environments.
Functional outcome of vocal fold medialization thyroplasty with a hydroxyapatite implant.
Storck, Claudio; Brockmann, Meike; Schnellmann, Elvira; Stoeckli, Sandro J; Schmid, Stephan
2007-06-01
Unilateral vocal fold paralysis can cause a persistent incomplete glottal closure during phonation, resulting in impaired voice function. The aim of this study was to evaluate functional results of medialization thyroplasty using a hydroxyapatite implant (VoCoM). Prospective observational cohort study. Between 1999 and 2003, a total of 26 patients (19 men, 7 women) undergoing medialization thyroplasty using a hydroxyapatite implant because of unilateral vocal fold paralysis were enrolled in the study. To evaluate voice function, the following parameters were measured preoperatively and postoperatively: mean fundamental frequency, mean sound pressure level, frequency and amplitude range (voice range profile), and maximum phonation time. A perceptual assessment of hoarseness was conducted using the Roughness, Breathiness, Hoarseness scale. Furthermore, the magnitude of voice related impairment of the patient's communication skills was rated on a 7-point scale. A combined parameter called the Voice Dysfunction Index (VDI) was used to rate vocal performance. All patients showed a statistically significant improvement in the VDI, in perceptual voice analysis, in maximum phonation time, and in the dynamic range of voice. One patient experienced a postoperative wound hemorrhage as a minor complication. No further complications or implant extrusions were observed. Medialization thyroplasty using a hydroxyapatite implant is a secure and efficient phonosurgical procedure. Voice quality and patient satisfaction improve significantly after treatment.
Chhetri, Dinesh K.; Neubauer, Juergen; Sofer, Elazar
2015-01-01
Objectives/Hypothesis Evaluate the influence of asymmetric recurrent laryngeal nerve (RLN) stimulation on the vibratory phase, acoustics and aerodynamics of phonation. Study Design Basic science study using an in vivo canine model. Methods The RLNs were symmetrically and asymmetrically stimulated over eight graded levels to test a range of vocal fold activation conditions from subtle paresis to paralysis. Vibratory phase, fundamental frequency (F0), subglottal pressure, and airflow were noted at phonation onset. The evaluations were repeated for three levels of symmetric superior laryngeal nerve (SLN) stimulation. Results Asymmetric laryngeal adductor activation from asymmetric left-right RLN stimulation led to a consistent pattern of vibratory phase asymmetry, with the more activated vocal fold leading in the opening phase of the glottal cycle and in mucosal wave amplitude. Vibratory amplitude asymmetry was also observed, with more lateral excursion of the glottis of the less activated side. Onset fundamental frequency was higher with asymmetric activation because the two RLNs were synergistic in decreasing F0, glottal width, and strain. Phonation onset pressure increased and airflow decreased with symmetric RLN activation. Conclusion Asymmetric laryngeal activation from RLN paresis and paralysis has consistent effects on vocal fold vibration, acoustics, and aerodynamics. This information may be useful in diagnosis and management of vocal fold paresis. PMID:24913182
Chhetri, Dinesh K; Neubauer, Juergen; Sofer, Elazar
2014-11-01
Evaluate the influence of asymmetric recurrent laryngeal nerve (RLN) stimulation on the vibratory phase, acoustics and aerodynamics of phonation. Basic science study using an in vivo canine model. The RLNs were symmetrically and asymmetrically stimulated over eight graded levels to test a range of vocal fold activation conditions from subtle paresis to paralysis. Vibratory phase, fundamental frequency (F0 ), subglottal pressure, and airflow were noted at phonation onset. The evaluations were repeated for three levels of symmetric superior laryngeal nerve (SLN) stimulation. Asymmetric laryngeal adductor activation from asymmetric left-right RLN stimulation led to a consistent pattern of vibratory phase asymmetry, with the more activated vocal fold leading in the opening phase of the glottal cycle and in mucosal wave amplitude. Vibratory amplitude asymmetry was also observed, with more lateral excursion of the glottis of the less activated side. Onset fundamental frequency was higher with asymmetric activation because the two RLNs were synergistic in decreasing F0 , glottal width, and strain. Phonation onset pressure increased and airflow decreased with symmetric RLN activation. Asymmetric laryngeal activation from RLN paresis and paralysis has consistent effects on vocal fold vibration, acoustics, and aerodynamics. This information may be useful in diagnosis and management of vocal fold paresis. N/A. © 2014 The American Laryngological, Rhinological and Otological Society, Inc.
Larrouy-Maestri, Pauline; Magis, David; Morsomme, Dominique
2014-05-01
The operatic singing technique is frequently used in classical music. Several acoustical parameters of this specific technique have been studied but how these parameters combine remains unclear. This study aims to further characterize the Western operatic singing technique by observing the effects of melody and technique on acoustical and musical parameters of the singing voice. Fifty professional singers performed two contrasting melodies (popular song and romantic melody) with two vocal techniques (with and without operatic singing technique). The common quality parameters (energy distribution, vibrato rate, and extent), perturbation parameters (standard deviation of the fundamental frequency, signal-to-noise ratio, jitter, and shimmer), and musical features (fundamental frequency of the starting note, average tempo, and sound pressure level) of the 200 sung performances were analyzed. The results regarding the effect of melody and technique on the acoustical and musical parameters show that the choice of melody had a limited impact on the parameters observed, whereas a particular vocal profile appeared depending on the vocal technique used. This study confirms that vocal technique affects most of the parameters examined. In addition, the observation of quality, perturbation, and musical parameters contributes to a better understanding of the Western operatic singing technique. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Pitch bending and glissandi on the clarinet: roles of the vocal tract and partial tone hole closure.
Chen, Jer-Ming; Smith, John; Wolfe, Joe
2009-09-01
Clarinettists combine non-standard fingerings with particular vocal tract configurations to achieve pitch bending, i.e., sounding pitches that can deviate substantially from those of standard fingerings. Impedance spectra were measured in the mouth of expert clarinettists while they played normally and during pitch bending, using a measurement head incorporated within a functioning clarinet mouthpiece. These were compared with the input impedance spectra of the clarinet for the fingerings used. Partially uncovering a tone hole by sliding a finger raises the frequency of clarinet impedance peaks, thereby allowing smooth increases in sounding pitch over some of the range. To bend notes in the second register and higher, however, clarinettists produce vocal tract resonances whose impedance maxima have magnitudes comparable with those of the bore resonance, which then may influence or determine the sounding frequency. It is much easier to bend notes down than up because of the phase relations of the bore and tract resonances, and the compliance of the reed. Expert clarinettists performed the glissando opening of Gershwin's 'Rhapsody in Blue'. Here, players coordinate the two effects: They slide their fingers gradually over open tone holes, while simultaneously adjusting a strong vocal tract resonance to the desired pitch.
The vocal monotony of monogamy
NASA Astrophysics Data System (ADS)
Thomas, Jeanette
2003-04-01
There are four phocids in waters around Antarctica: Weddell, leopard, crabeater, and Ross seals. These four species provide a unique opportunity to examine underwater vocal behavior in species sharing the same ecosystem. Some species live in pack ice, others in factice, but all are restricted to the Antarctic or sub-Antarctic islands. All breed and produce vocalizations under water. Social systems range from polygyny in large breeding colonies, to serial monogamy, to solitary species. The type of mating system influences the number of underwater vocalizations in the repertoire, with monogamous seals producing only a single call, polygynous species producing up to 35 calls, and solitary species an intermediate number of about 10 calls. Breeding occurs during the austral spring and each species carves-out an acoustic niche for communicating, with species using different frequency ranges, temporal patterns, and amplitude changes to convey their species-specific calls and presumably reduce acoustic competition. Some species exhibit geographic variations in their vocalizations around the continent, which may reflect discrete breeding populations. Some seals become silent during a vulnerable time of predation by killer whales, perhaps to avoid detection. Overall, vocalizations of these seals exhibit adaptive characteristics that reflect the co-evolution among species in the same ecosystem.
Performance of a reduced-order FSI model for flow-induced vocal fold vibration
NASA Astrophysics Data System (ADS)
Chang, Siyuan; Luo, Haoxiang; Luo's lab Team
2016-11-01
Vocal fold vibration during speech production involves a three-dimensional unsteady glottal jet flow and three-dimensional nonlinear tissue mechanics. A full 3D fluid-structure interaction (FSI) model is computationally expensive even though it provides most accurate information about the system. On the other hand, an efficient reduced-order FSI model is useful for fast simulation and analysis of the vocal fold dynamics, which is often needed in procedures such as optimization and parameter estimation. In this work, we study the performance of a reduced-order model as compared with the corresponding full 3D model in terms of its accuracy in predicting the vibration frequency and deformation mode. In the reduced-order model, we use a 1D flow model coupled with a 3D tissue model. Two different hyperelastic tissue behaviors are assumed. In addition, the vocal fold thickness and subglottal pressure are varied for systematic comparison. The result shows that the reduced-order model provides consistent predictions as the full 3D model across different tissue material assumptions and subglottal pressures. However, the vocal fold thickness has most effect on the model accuracy, especially when the vocal fold is thin. Supported by the NSF.
Niebudek-Bogusz, Ewa; Sliwińska-Kowalska, Mariola
2006-01-01
An assessment of the vocal system, as a part of the medical certification of occupational diseases, should be objective and reliable. Therefore, interest in the method of acoustic voice analysis enabling objective assessment of voice parameters is still growing. The aim of the present study was to evaluate the applicability of acoustic analysis with vocal loading test to the diagnostics of occupational voice disorders. The results of acoustic voice analysis were compared using IRIS software for phoniatrics, before and after a 30-min vocal loading test in 35 female teachers with diagnosed occupational voice disorders (group I) and in 31 female teachers with functional dysphonia (group II). In group I, vocal effort produced significant abnormalities in voice acoustic parameters, compared to group II. These included significantly increased mean fundamental frequency (Fo) value (by 11 Hz) and worsened jitter, shimmer and NHR parameters. Also, the percentage of subjects showing abnormalities in voice acoustic analysis was higher in this group. Conducting voice acoustic analysis before and after the vocal loading test makes it possible to objectively confirm irreversible voice impairments in persons with work-related pathologies of the larynx, which is essential for medical certification of occupational voice diseases.
Factors associated with vocal fold pathologies in teachers.
Souza, Carla Lima de; Carvalho, Fernando Martins; Araújo, Tânia Maria de; Reis, Eduardo José Farias Borges Dos; Lima, Verônica Maria Cadena; Porto, Lauro Antonio
2011-10-01
To analyze factors associated with the prevalence of the medical diagnosis of vocal fold pathologies in teachers. A census-based epidemiological, cross-sectional study was conducted with 4,495 public primary and secondary school teachers in the city of Salvador, Northeastern Brazil, between March and April 2006. The dependent variable was the self-reported medical diagnosis of vocal fold pathologies and the independent variables were sociodemographic characteristics; professional activity; work organization/interpersonal relationships; physical work environment characteristics; frequency of common mental disorders, measured by the Self-Reporting Questionnaire-20 (SRQ-20 >7); and general health conditions. Descriptive statistical, bivariate and multiple logistic regression analysis techniques were used. The prevalence of self-reported medical diagnosis of vocal fold pathologies was 18.9%. In the logistic regression analysis, the variables that remained associated with this medical diagnosis were as follows: being female, having worked as a teacher for more than seven years, excessive voice use, reporting more than five unfavorable physical work environment characteristics and presence of common mental disorders. The presence of self-reported vocal fold pathologies was associated with factors that point out the need of actions that promote teachers' vocal health and changes in their work structure and organization.
An Acoustic Analysis of the Genus Microhyla (Anura: Microhylidae) of Sri Lanka.
Wijayathilaka, Nayana; Meegaskumbura, Madhava
2016-01-01
Vocalizing behavior of frogs and toads, once quantified, is useful for systematics, rapid species identification, behavioral experimentation and conservation monitoring. But yet, for many lineages vocalizations remain unknown or poorly quantified, especially in diversity rich tropical regions. Here we provide a quantitative acoustical analysis for all four Sri Lankan congeners of the genus Microhyla. Three of these species are endemic to the island, but Microhyla ornata is regionally widespread. Two of these endemics, M. karunaratnei (Critically Endangered) and M. zeylanica (Endangered), are highly threatened montane isolates; the other, M. mihintalei, is relatively common across the dry lowlands. We recorded and analyzed 100 advertisement calls from five calling males for each species, except for M. zeylanica, which only had 53 calls from three males suitable for analyses. All four species call in choruses and their vocal repertoires are simple compared to most frogs. Their calls contain multiple pulses and no frequency modulation. We quantified eight call characters. Call duration and number of pulses were higher for the two montane isolates (inhabiting cooler habitats at higher altitudes) compared to their lowland congeners. Microhyla zeylanica has the longest call duration (of 1.8 ± 0.12 s) and the highest number of pulses (of 61-92 pulses). The smallest of the species, Microhyla karunaratnei (16.2-18.3 mm), has the highest mean dominant frequency (3.3 ± 0.14 kHz) and pulse rate (77 ± 5.8 pulses per second). The calls separate well in the Principal Component space: PC1 axis is mostly explained by the number of pulses per call and call duration; PC2 is mostly explained by the pulse rate. A canonical means plot of a Discriminant Function analysis shows non-overlapping 95% confidence ellipses. This suggests that some call parameters can be used to distinguish these species effectively. We provide detailed descriptions for eight call properties and compare these with congeners for which data is available. This work provides a foundation for comparative bioacoustic analyses and species monitoring while facilitating the systematics of Microhyla across its range.
Acoustic characteristics used by Japanese macaques for individual discrimination.
Furuyama, Takafumi; Kobayasi, Kohta I; Riquimaroux, Hiroshi
2017-10-01
The vocalizations of primates contain information about speaker individuality. Many primates, including humans, are able to distinguish conspecifics based solely on vocalizations. The purpose of this study was to investigate the acoustic characteristics used by Japanese macaques in individual vocal discrimination. Furthermore, we tested human subjects using monkey vocalizations to evaluate species specificity with respect to such discriminations. Two monkeys and five humans were trained to discriminate the coo calls of two unfamiliar monkeys. We created a stimulus continuum between the vocalizations of the two monkeys as a set of probe stimuli (whole morph). We also created two sets of continua in which only one acoustic parameter, fundamental frequency ( f 0 ) or vocal tract characteristic (VTC), was changed from the coo call of one monkey to that of another while the other acoustic feature remained the same ( f 0 morph and VTC morph, respectively). According to the results, the reaction times both of monkeys and humans were correlated with the morph proportion under the whole morph and f 0 morph conditions. The reaction time to the VTC morph was correlated with the morph proportion in both monkeys, whereas the reaction time in humans, on average, was not correlated with morph proportion. Japanese monkeys relied more consistently on VTC than did humans for discriminating monkey vocalizations. Our results support the idea that the auditory system of primates is specialized for processing conspecific vocalizations and suggest that VTC is a significant acoustic feature used by Japanese macaques to discriminate conspecific vocalizations. © 2017. Published by The Company of Biologists Ltd.
Acoustic and Auditory Perception Effects of the Voice Therapy Technique Finger Kazoo in Adult Women.
Christmann, Mara Keli; Cielo, Carla Aparecida
2017-05-01
This study aimed to verify and to correlate acoustic and auditory-perceptual measures of glottic source after the performance of finger kazoo (FK) technique. This is an experimental, cross-sectional, and qualitative study. We made an analysis of the vowel [a:] in 46 adult women with neither vocal complaints nor laryngeal alterations, through the Multi-Dimensional Voice Program Advanced and RASATI scale, before and immediately after performing three series of FK and 5 minutes after a period of silence. Kappa, Friedman, Wilcoxon, and Spearman tests were used. We found significant increase in fundamental frequency, reduction of amplitude variation, and degree of sub-harmonics immediately after performing FK. Positive correlations were measures of frequency and its perturbation, measures of amplitude, of soft phonation index, of degree and number of unvoiced segments with aspects of RASATI. Negative correlations were voice turbulence index, measures of frequency and its perturbation, and measures of soft phonation index with aspects of RASATI. There was fundamental frequency increase, within normal limits, and reduction of acoustic measures related to presence of noise and instability. In general, acoustic measures, suggestive of noise and instability, were reduced according to the decrease of perceptive-auditory aspects of vocal alteration. It shows that both instruments are complementary and that the acoustic vocal effect was positive. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vibration stimulates vocal mucosa-like matrix expression by hydrogel-encapsulated fibroblasts.
Kutty, Jaishankar K; Webb, Ken
2010-01-01
The composition and organization of the vocal fold extracellular matrix (ECM) provide the viscoelastic mechanical properties that are required to sustain high-frequency vibration during voice production. Although vocal injury and pathology are known to produce alterations in matrix physiology, the mechanisms responsible for the development and maintenance of vocal fold ECM are poorly understood. The objective of this study was to investigate the effect of physiologically relevant vibratory stimulation on ECM gene expression and synthesis by fibroblasts encapsulated within hyaluronic acid hydrogels that approximate the viscoelastic properties of vocal mucosa. Relative to static controls, samples exposed to vibration exhibited significant increases in mRNA expression levels of HA synthase 2, decorin, fibromodulin and MMP-1, while collagen and elastin expression were relatively unchanged. Expression levels exhibited a temporal response, with maximum increases observed after 3 and 5 days of vibratory stimulation and significant downregulation observed at 10 days. Quantitative assays of matrix accumulation confirmed significant increases in sulphated glycosaminoglycans and significant decreases in collagen after 5 and 10 days of vibratory culture, relative to static controls. Cellular remodelling and hydrogel viscosity were affected by vibratory stimulation and were influenced by varying the encapsulated cell density. These results indicate that vibration is a critical epigenetic factor regulating vocal fold ECM and suggest that rapid restoration of the phonatory microenvironment may provide a basis for reducing vocal scarring, restoring native matrix composition and improving vocal quality. 2009 John Wiley & Sons, Ltd.
Building and Verifying a Predictive Model of Interruption Resumption
2012-03-01
field, the vocal module speaks, the motor module moves the body, and the con- figural and manipulative modules perform spatial proces- sing [14]–[16...person cannot remember themselves. As described earlier, the model depends critically upon the basic properties of declarative memories. When a...success because the model’s ability to re- trieve an episodic code depends critically on the amount of time spent on the interruption. Also recall that
Gelotophobia and the Challenges of Implementing Laughter into Virtual Agents Interactions
Ruch, Willibald F.; Platt, Tracey; Hofmann, Jennifer; Niewiadomski, Radosław; Urbain, Jérôme; Mancini, Maurizio; Dupont, Stéphane
2014-01-01
This study investigated which features of AVATAR laughter are perceived threatening for individuals with a fear of being laughed at (gelotophobia), and individuals with no gelotophobia. Laughter samples were systematically varied (e.g., intensity, laughter pitch, and energy for the voice, intensity of facial actions of the face) in three modalities: animated facial expressions, synthesized auditory laughter vocalizations, and motion capture generated puppets displaying laughter body movements. In the online study 123 adults completed, the GELOPH <15 > (Ruch and Proyer, 2008a,b) and rated randomly presented videos of the three modalities for how malicious, how friendly, how real the laughter was (0 not at all to 8 extremely). Additionally, an open question asked which markers led to the perception of friendliness/maliciousness. The current study identified features in all modalities of laughter stimuli that were perceived as malicious in general, and some that were gelotophobia specific. For facial expressions of AVATARS, medium intensity laughs triggered highest maliciousness in the gelotophobes. In the auditory stimuli, the fundamental frequency modulations and the variation in intensity were indicative of maliciousness. In the body, backwards and forward movements and rocking vs. jerking movements distinguished the most malicious from the least malicious laugh. From the open answers, the shape and appearance of the lips curling induced feelings that the expression was malicious for non-gelotophobes and that the movement round the eyes, elicited the face to appear as friendly. This was opposite for gelotophobes. Gelotophobia savvy AVATARS should be of high intensity, containing lip and eye movements and be fast, non-repetitive voiced vocalization, variable and of short duration. It should not contain any features that indicate a down-regulation in the voice or body, or indicate voluntary/cognitive modulation. PMID:25477803
Hayase, Shin; Wada, Kazuhiro
2018-06-23
Learned vocalization, including birdsong and human speech, is acquired through self-motivated vocal practice during the sensitive period of vocal learning. The zebra finch (Taeniopygia guttata) develops a song characterized by vocal variability and crystalizes a defined song pattern as adulthood. However, it remains unknown how vocal variability is regulated with diurnal singing during the sensorimotor learning period. Here, we investigated the expression of activity-dependent neuroplasticity-related gene Arc during the early plastic song phase to examine its potential association with vocal plasticity. We first confirmed that multiple acoustic features of syllables in the plastic song were dramatically and simultaneously modulated during the first 3 hours of singing in a day and the altered features were maintained until sleep. Concurrently, Arc was intensely induced during morning singing and a subsequent attenuation during afternoon singing in the robust nucleus of the arcopallium (RA) and the interfacial nucleus of the nidopallium (NIf). The singing-driven Arc expression was not altered by circadian rhythm, but rather reduced during the day as juveniles produced more songs. Song stabilization accelerated by testosterone administration in juveniles was accompanied with attenuation of Arc induction in RA and NIf. In contrast, although early-deafened birds produced highly unstable song even at adulthood, singing-driven Arc expression was not different between intact and early-deafened adults. These results suggest a potential functional link between Arc expression in RA and NIf and vocal plasticity during the sensorimotor phase of song learning. Nonetheless, Arc expression did not reflect the quality of bird's own song or auditory feedback. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Merullo, Devin P; Cordes, Melissa A; Susan DeVries, M; Stevenson, Sharon A; Riters, Lauren V
2015-11-01
Vocalizations coordinate social interactions in many species and often are important for behaviors such as mate attraction or territorial defense. Although the neural circuitry underlying vocal communication is well-known for some animal groups, such as songbirds, the motivational processes that regulate vocal signals are not as clearly understood. Neurotensin (NT) is a neuropeptide implicated in motivation that can modulate the activity of dopaminergic neurons. Dopaminergic projections from the ventral tegmental area (VTA) are key to mediating highly motivated, goal-directed behaviors, including sexually-motivated birdsong. However, the role of NT in modifying vocal communication or other social behaviors has not been well-studied. Here in European starlings (Sturnus vulgaris) we analyzed relationships between sexually-motivated song and NT and NT1 receptor (NTSR1) expression in VTA. Additionally, we examined NT and NTSR1 expression in four regions that receive dopaminergic projections from VTA and are involved in courtship song: the medial preoptic nucleus (POM), the lateral septum (LS), Area X, and HVC. Relationships between NT and NTSR1 expression and non-vocal courtship and agonistic behaviors were also examined. NT expression in Area X positively related to sexually-motivated song production. NT expression in POM positively correlated with non-vocal courtship behavior and agonistic behavior. NT expression in POM was greatest in males owning nesting sites, and the opposite pattern was observed for NTSR1 expression in LS. These results are the first to implicate NT in Area X in birdsong, and further highlight NT as a potential neuromodulator for the control of vocal communication and other social behaviors. Copyright © 2015 Elsevier Inc. All rights reserved.
In Vivo measurement of pediatric vocal fold motion using structured light laser projection.
Patel, Rita R; Donohue, Kevin D; Lau, Daniel; Unnikrishnan, Harikrishnan
2013-07-01
The aim of the study was to present the development of a miniature structured light laser projection endoscope and to quantify vocal fold length and vibratory features related to impact stress of the pediatric glottis using high-speed imaging. The custom-developed laser projection system consists of a green laser with a 4-mm diameter optics module at the tip of the endoscope, projecting 20 vertical laser lines on the glottis. Measurements of absolute phonatory vocal fold length, membranous vocal fold length, peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity were obtained in five children (6-9 years), two adult male and three adult female participants without voice disorders, and one child (10 years) with bilateral vocal fold nodules during modal phonation. Independent measurements made on the glottal length of a vocal fold phantom demonstrated a 0.13mm bias error with a standard deviation of 0.23mm, indicating adequate precision and accuracy for measuring vocal fold structures and displacement. First, in vivo measurements of amplitude-to-length ratio, peak closing velocity, and impact velocity during phonation in pediatric population and a child with vocal fold nodules are reported. The proposed laser projection system can be used to obtain in vivo measurements of absolute length and vibratory features in children and adults. Children have large amplitude-to-length ratio compared with typically developing adults, whereas nodules result in larger peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity compared with typically developing children. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Automatic and quantitative measurement of laryngeal video stroboscopic images.
Kuo, Chung-Feng Jeffrey; Kuo, Joseph; Hsiao, Shang-Wun; Lee, Chi-Lung; Lee, Jih-Chin; Ke, Bo-Han
2017-01-01
The laryngeal video stroboscope is an important instrument for physicians to analyze abnormalities and diseases in the glottal area. Stroboscope has been widely used around the world. However, without quantized indices, physicians can only make subjective judgment on glottal images. We designed a new laser projection marking module and applied it onto the laryngeal video stroboscope to provide scale conversion reference parameters for glottal imaging and to convert the physiological parameters of glottis. Image processing technology was used to segment the important image regions of interest. Information of the glottis was quantified, and the vocal fold image segmentation system was completed to assist clinical diagnosis and increase accuracy. Regarding image processing, histogram equalization was used to enhance glottis image contrast. The center weighted median filters image noise while retaining the texture of the glottal image. Statistical threshold determination was used for automatic segmentation of a glottal image. As the glottis image contains saliva and light spots, which are classified as the noise of the image, noise was eliminated by erosion, expansion, disconnection, and closure techniques to highlight the vocal area. We also used image processing to automatically identify an image of vocal fold region in order to quantify information from the glottal image, such as glottal area, vocal fold perimeter, vocal fold length, glottal width, and vocal fold angle. The quantized glottis image database was created to assist physicians in diagnosing glottis diseases more objectively.
Multidimensional vocal assessment after laser treatment for recurrent respiratory papillomatosis.
Kono, Takeyuki; Yabe, Haruna; Uno, Kosuke; Saito, Koichiro; Ogawa, Kaoru
2017-03-01
Recurrent respiratory papillomatosis (RRP) is a benign epithelial tumor that exhibits a high frequency of recurrence. This study assesses the vocal function after laser treatment for RRP, particularly in relation to the frequency of surgery. Retrospective study. Thirty RRP patients who underwent laser surgery that controlled the tumor were included. Preoperative and postoperative Grade, Roughness, Breathiness, Asthenia, and Strain Scale, videostroboscopic findings, aerodynamic and acoustic parameters, and self-assessment questionnaires were measured and compared with an age- and sex-matched control group. Subsequently, to evaluate the association between postoperative voice quality and the number of surgeries, the patients were divided into three groups (group 1: single surgery, group 2: 2-5 surgeries, group3: >6 surgeries), and comparative multidimensional vocal assessments were performed. The mean number of surgeries was 3.4 (range, 1-8). Although all patients exhibited poorer vocal function than the control group preoperatively, they showed improvement in postoperative subjective and objective parameters. However, four patients who underwent one surgery with relatively aggressive ablation exhibited vocal cord scarring and deteriorated objective parameters. All remaining patients showed voice quality that was on par with the control group. Subgroup analysis proved no association between post-therapeutic voice quality and the patient characteristics, including preoperative staging and the number of surgical treatments performed. RRP patients can achieve a close to normal voice with high satisfaction even after recurrent surgical treatment when ablation of a subepithelial lesion using sufficient laser energy is adequate. 3b Laryngoscope, 127:679-684, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Effects of adventitious acute vocal trauma: Relative fundamental frequency and listener perception
Murray, Elizabeth Heller; Hands, Gabrielle L.; Calabrese, Carolyn R.; Stepp, Cara E.
2015-01-01
Objective High voice users (individuals who demonstrate excessive or loud vocal use) are at risk for developing voice disorders. The objective of this study was to examine, both acoustically and perceptually, vocal changes in healthy speakers following an acute period of high voice use. Methods Members of a university women’s volleyball team (N=12) were recorded a week prior (Pre) and week following (Post) the 10-week spring season; N=6 control speakers were recorded over the same time period for comparison. Speakers read four sentences, which were analyzed for relative fundamental frequency (RFF). Eight naïve listeners participated in an auditory-perceptual visual sort and rate (VSR) task, in which they rated each voice sample’s overall severity and strain. Results No significant differences were found as a function of time point in the VSR ratings for the volleyball group. Onset cycle 1 RFF values were significantly lower (p = 0.04) in the Post recordings of the volleyball participants compared to Pre recordings, but there was no significant difference (p = 0.20) in offset cycle 10 RFF values. Receiver operating characteristic analyses indicated moderate sensitivity and specificity of onset cycle 1 RFF for discrimination between the volleyball and control participants. Changes were not apparent in the control group as a function of time for either, onset cycle 1 RFF, offset cycle 10 FF, or either vocal attribute. Conclusion Onset cycle 1 RFF may be an effective marker for detecting vocal changes over an acute high voice use period of time before perceptual changes are noted. PMID:26028369
Complex vibratory patterns in an elephant larynx.
Herbst, Christian T; Svec, Jan G; Lohscheller, Jörg; Frey, Roland; Gumpenberger, Michaela; Stoeger, Angela S; Fitch, W Tecumseh
2013-11-01
Elephants' low-frequency vocalizations are produced by flow-induced self-sustaining oscillations of laryngeal tissue. To date, little is known in detail about the vibratory phenomena in the elephant larynx. Here, we provide a first descriptive report of the complex oscillatory features found in the excised larynx of a 25 year old female African elephant (Loxodonta africana), the largest animal sound generator ever studied experimentally. Sound production was documented with high-speed video, acoustic measurements, air flow and sound pressure level recordings. The anatomy of the larynx was studied with computed tomography (CT) and dissections. Elephant CT vocal anatomy data were further compared with the anatomy of an adult human male. We observed numerous unusual phenomena, not typically reported in human vocal fold vibrations. Phase delays along both the inferior-superior and anterior-posterior (A-P) dimension were commonly observed, as well as transverse travelling wave patterns along the A-P dimension, previously not documented in the literature. Acoustic energy was mainly created during the instant of glottal opening. The vestibular folds, when adducted, participated in tissue vibration, effectively increasing the generated sound pressure level by 12 dB. The complexity of the observed phenomena is partly attributed to the distinct laryngeal anatomy of the elephant larynx, which is not simply a large-scale version of its human counterpart. Travelling waves may be facilitated by low fundamental frequencies and increased vocal fold tension. A travelling wave model is proposed, to account for three types of phenomena: A-P travelling waves, 'conventional' standing wave patterns, and irregular vocal fold vibration.
Fillis, Michelle Moreira Abujamra; Andrade, Selma Maffei de; González, Alberto Durán; Melanda, Francine Nesello; Mesas, Arthur Eumann
2016-01-01
This study aimed to estimate the prevalence of self-reported vocal problems among primary schoolteachers and to identify associated occupational factors, using a cross-sectional design and face-to-face interviews with 967 teachers in 20 public schools in Londrina, Paraná State, Brazil. Prevalence of self-reported vocal problems was 25.7%. Adjusted analyses showed associations with characteristics of the employment relationship (workweek ≥ 40 hours and poor perception of salaries and health benefits), characteristics of the work environment (number of students per class and exposure to chalk dust and microorganisms), psychological factors (low job satisfaction, limited opportunities to express opinions, worse relationship with superiors, and poor balance between professional and personal life), and violence (insults and bullying). Vocal disorders affected one in four primary schoolteachers and were associated with various characteristics of the teaching profession (both structural and work-related).
2015-01-01
The goal of this study was to analyse perceptually and acoustically the voices of patients with Unilateral Vocal Fold Paralysis (UVFP) and compare them to the voices of normal subjects. These voices were analysed perceptually with the GRBAS scale and acoustically using the following parameters: mean fundamental frequency (F0), standard-deviation of F0, jitter (ppq5), shimmer (apq11), mean harmonics-to-noise ratio (HNR), mean first (F1) and second (F2) formants frequency, and standard-deviation of F1 and F2 frequencies. Statistically significant differences were found in all of the perceptual parameters. Also the jitter, shimmer, HNR, standard-deviation of F0, and standard-deviation of the frequency of F2 were statistically different between groups, for both genders. In the male data differences were also found in F1 and F2 frequencies values and in the standard-deviation of the frequency of F1. This study allowed the documentation of the alterations resulting from UVFP and addressed the exploration of parameters with limited information for this pathology. PMID:26557690
The program complex for vocal recognition
NASA Astrophysics Data System (ADS)
Konev, Anton; Kostyuchenko, Evgeny; Yakimuk, Alexey
2017-01-01
This article discusses the possibility of applying the algorithm of determining the pitch frequency for the note recognition problems. Preliminary study of programs-analogues were carried out for programs with function “recognition of the music”. The software package based on the algorithm for pitch frequency calculation was implemented and tested. It was shown that the algorithm allows recognizing the notes in the vocal performance of the user. A single musical instrument, a set of musical instruments, and a human voice humming a tune can be the sound source. The input file is initially presented in the .wav format or is recorded in this format from a microphone. Processing is performed by sequentially determining the pitch frequency and conversion of its values to the note. According to test results, modification of algorithms used in the complex was planned.
[Comparative evaluation of mastoidoplasty results in application of various plastic materials].
Zaporoshchenko, A Iu; Kravchenko, S V
2015-01-01
The results of surgical treatment of 62 patients, suffering chronic purulent middle otitis, were analyzed. The structure of mastoid processus and attic constitutes a base for choice of middle ear surgical sanation. Sanation operation with preservation or reconstruction of external acoustical meatus posterior wall was finished with combined mastoidoplasty using autobone, spongioid bone bioimplant Tutoplast or bioceramic material "Sintekost". Achievement of a steady sanating effect have promoted in late postoperative period a trustworthy lowering of the perception threshold of the bone--conducted sounds as on vocal, and also on high frequencies, while of the air--conducted sounds--on vocal frequencies. This permits in perspective to perform a hearing--improving operations with good functional result.
Lewis, James W.; Talkington, William J.; Walker, Nathan A.; Spirou, George A.; Jajosky, Audrey; Frum, Chris
2009-01-01
The ability to detect and rapidly process harmonic sounds, which in nature are typical of animal vocalizations and speech, can be critical for communication among conspecifics and for survival. Single-unit studies have reported neurons in auditory cortex sensitive to specific combinations of frequencies (e.g. harmonics), theorized to rapidly abstract or filter for specific structures of incoming sounds, where large ensembles of such neurons may constitute spectral templates. We studied the contribution of harmonic structure to activation of putative spectral templates in human auditory cortex by using a wide variety of animal vocalizations, as well as artificially constructed iterated rippled noises (IRNs). Both the IRNs and vocalization sounds were quantitatively characterized by calculating a global harmonics-to-noise ratio (HNR). Using fMRI we identified HNR-sensitive regions when presenting either artificial IRNs and/or recordings of natural animal vocalizations. This activation included regions situated between functionally defined primary auditory cortices and regions preferential for processing human non-verbal vocalizations or speech sounds. These results demonstrate that the HNR of sound reflects an important second-order acoustic signal attribute that parametrically activates distinct pathways of human auditory cortex. Thus, these results provide novel support for putative spectral templates, which may subserve a major role in the hierarchical processing of vocalizations as a distinct category of behaviorally relevant sound. PMID:19228981
The singing/acting mature adult--singing instruction perspective.
Westerman Gregg, J
1997-06-01
Complete knowledge of anatomy and physiology of the vocal mechanism and tract is essential for the voice teacher to be maximally effective. Possible contributing factors to vocal attrition in the mature singer/actor are outlined: poor posture, inadequate respiratory function, lack of adequate hydration, phonatory hyperfunction, habitual speaking pitch at too low a frequency, lack of resonance, tongue tension affecting phonation, resonation, and articulation. Techniques for rehabilitation of the damaged voice are recommended.
2018-01-01
This study tested the hypothesis that object-based attention modulates the discrimination of level increments in stop-consonant noise bursts. With consonant-vowel-consonant (CvC) words consisting of an ≈80-dB vowel (v), a pre-vocalic (Cv) and a post-vocalic (vC) stop-consonant noise burst (≈60-dB SPL), we measured discrimination thresholds (LDTs) for level increments (ΔL) in the noise bursts presented either in CvC context or in isolation. In the 2-interval 2-alternative forced-choice task, each observation interval presented a CvC word (e.g., /pæk/ /pæk/), and normal-hearing participants had to discern ΔL in the Cv or vC burst. Based on the linguistic word labels, the auditory events of each trial were perceived as two auditory objects (Cv-v-vC and Cv-v-vC) that group together the bursts and vowels, hindering selective attention to ΔL. To discern ΔL in Cv or vC, the events must be reorganized into three auditory objects: the to-be-attended pre-vocalic (Cv–Cv) or post-vocalic burst pair (vC–vC), and the to-be-ignored vowel pair (v–v). Our results suggest that instead of being automatic this reorganization requires training, in spite of using familiar CvC words. Relative to bursts in isolation, bursts in context always produced inferior ΔL discrimination accuracy (a context effect), which depended strongly on the acoustic separation between the bursts and the vowel, being much keener for the object apart from (post-vocalic) than for the object adjoining (pre-vocalic) the vowel (a temporal-position effect). Variability in CvC dimensions that did not alter the noise-burst perceptual grouping had minor effects on discrimination accuracy. In addition to being robust and persistent, these effects are relatively general, evincing in forced-choice tasks with one or two observation intervals, with or without variability in the temporal position of ΔL, and with either fixed or roving CvC standards. The results lend support to the hypothesis. PMID:29364931
Surgery and proton pump inhibitors for treatment of vocal process granulomas.
Hong-Gang, Duan; He-Juan, Jin; Chun-Quan, Zheng; Guo-Kang, Fan
2013-11-01
The aim of this study was to analyze the outcomes of vocal process granulomas treated with surgery and proton pump inhibitors and to specify related factors of recurrence. The medical records of patients with diagnosis of vocal process granuloma between 2000 and 2012 were reviewed. All patients were treated with surgery and proton pump inhibitors for at least 1 month. Forty-one patients were reviewed; mean follow-up time was 45 months. There was no recurrence among the patients who had a recent history of intubation. The recurrence rates of contact granuloma was 38.7 %, and significantly related to the frequency of surgery (P = 0.042), but was not significantly associated with the history of acid reflux (P = 0.676) and vocal abuse (P = 0.447), lesion size (P = 0.203) or surgical techniques (P = 0.331). Surgery combined with proton pump inhibitors was partially effective for the vocal process granulomas, especially with intubated patients. However, repeat surgery for recurrent contact granuloma should be preceded with caution due to high recurrence rates.
Combining Multiobjective Optimization and Cluster Analysis to Study Vocal Fold Functional Morphology
Palaparthi, Anil; Riede, Tobias
2017-01-01
Morphological design and the relationship between form and function have great influence on the functionality of a biological organ. However, the simultaneous investigation of morphological diversity and function is difficult in complex natural systems. We have developed a multiobjective optimization (MOO) approach in association with cluster analysis to study the form-function relation in vocal folds. An evolutionary algorithm (NSGA-II) was used to integrate MOO with an existing finite element model of the laryngeal sound source. Vocal fold morphology parameters served as decision variables and acoustic requirements (fundamental frequency, sound pressure level) as objective functions. A two-layer and a three-layer vocal fold configuration were explored to produce the targeted acoustic requirements. The mutation and crossover parameters of the NSGA-II algorithm were chosen to maximize a hypervolume indicator. The results were expressed using cluster analysis and were validated against a brute force method. Results from the MOO and the brute force approaches were comparable. The MOO approach demonstrated greater resolution in the exploration of the morphological space. In association with cluster analysis, MOO can efficiently explore vocal fold functional morphology. PMID:24771563
Acoustic characterization of ultrasonic vocalizations by a nocturnal primate Tarsius syrichta.
Gursky-Doyen, Sharon
2013-07-01
This preliminary study characterizes the ultrasonic vocalizations produced by Philippine tarsiers, Tarsius syrichta. Data were collected at the Philippine Tarsier Foundation Sanctuary in Corella, Bohol, Philippines, from July through October 2010. Recordings were made on a Wildlife Acoustics Ultrasonic Song Meter 2 BAT from 29 wild, free-living adult resident T. syrichta (23 females and six males). A total of 10,309 USVs were recorded. These vocalizations fell into three main categories: chirps, twitters, and whistles. Chirps were the most frequent, followed by twitters and whistles. Whereas chirps and twitters were emitted by both male and female Philippine tarsiers, whistles were only emitted by adult males. Given that vocalizations reported in this study were exclusively recorded during capture and handling, it is very likely that these vocalizations function as distress calls. However, as the long whistle was only given by adult males who were captured at the same time as the female or the group's infant, the function of the long whistle might be slightly different than the function of the other relatively lower-frequency USVs.
What can vortices tell us about vocal fold vibration and voice production.
Khosla, Sid; Murugappan, Shanmugam; Gutmark, Ephraim
2008-06-01
Much clinical research on laryngeal airflow has assumed that airflow is unidirectional. This review will summarize what additional knowledge can be obtained about vocal fold vibration and voice production by studying rotational motion, or vortices, in laryngeal airflow. Recent work suggests two types of vortices that may strongly contribute to voice quality. The first kind forms just above the vocal folds during glottal closing, and is formed by flow separation in the glottis; these flow separation vortices significantly contribute to rapid closing of the glottis, and hence, to producing loudness and high frequency harmonics in the acoustic spectrum. The second is a group of highly three-dimensional and coherent supraglottal vortices, which can produce sound by interaction with structures in the vocal tract. Present work is also described that suggests that certain laryngeal pathologies, such as asymmetric vocal fold tension, will significantly modify both types of vortices, with adverse impact on sound production: decreased rate of glottal closure, increased broadband noise, and a decreased signal to noise ratio. Recent research supports the hypothesis that glottal airflow contains certain vortical structures that significantly contribute to voice quality.
Modulating Phonation Through Alteration of Vocal Fold Medial Surface Contour
Mau, Ted; Muhlestein, Joseph; Callahan, Sean; Chan, Roger W.
2012-01-01
Objectives 1. To test whether alteration of the vocal fold medial surface contour can improve phonation. 2. To demonstrate that implant material properties affect vibration even when implant is deep to the vocal fold lamina propria. Study Design Induced phonation of excised human larynges. Methods Thirteen larynges were harvested within 24 hours post-mortem. Phonation threshold pressure (PTP) and flow (PTF) were measured before and after vocal fold injections using either calcium hydroxylapatite (CaHA) or hyaluronic acid (HA). Small-volume injections (median 0.0625 mL) were targeted to the infero-medial aspect of the thyroarytenoid (TA) muscle. Implant locations were assessed histologically. Results The effect of implantation on PTP was material-dependent. CaHA tended to increase PTP, whereas HA tended to decrease PTP (Wilcoxon test P = 0.00013 for onset). In contrast, the effect of implantation on PTF was similar, with both materials tending to decrease PTF (P = 0.16 for onset). Histology confirmed implant presence in the inferior half of the vocal fold vertical thickness. Conclusions Taken together, these data suggested the implants may have altered the vocal fold medial surface contour, potentially resulting in a less convergent or more rectangular glottal geometry as a means to improve phonation. An implant with a closer viscoelastic match to vocal fold cover is desirable for this purpose, as material properties can affect vibration even when the implant is not placed within the lamina propria. This result is consistent with theoretical predictions and implies greater need for surgical precision in implant placement and care in material selection. PMID:22865592
Error-dependent modulation of speech-induced auditory suppression for pitch-shifted voice feedback.
Behroozmand, Roozbeh; Larson, Charles R
2011-06-06
The motor-driven predictions about expected sensory feedback (efference copies) have been proposed to play an important role in recognition of sensory consequences of self-produced motor actions. In the auditory system, this effect was suggested to result in suppression of sensory neural responses to self-produced voices that are predicted by the efference copies during vocal production in comparison with passive listening to the playback of the identical self-vocalizations. In the present study, event-related potentials (ERPs) were recorded in response to upward pitch shift stimuli (PSS) with five different magnitudes (0, +50, +100, +200 and +400 cents) at voice onset during active vocal production and passive listening to the playback. Results indicated that the suppression of the N1 component during vocal production was largest for unaltered voice feedback (PSS: 0 cents), became smaller as the magnitude of PSS increased to 200 cents, and was almost completely eliminated in response to 400 cents stimuli. Findings of the present study suggest that the brain utilizes the motor predictions (efference copies) to determine the source of incoming stimuli and maximally suppresses the auditory responses to unaltered feedback of self-vocalizations. The reduction of suppression for 50, 100 and 200 cents and its elimination for 400 cents pitch-shifted voice auditory feedback support the idea that motor-driven suppression of voice feedback leads to distinctly different sensory neural processing of self vs. non-self vocalizations. This characteristic may enable the audio-vocal system to more effectively detect and correct for unexpected errors in the feedback of self-produced voice pitch compared with externally-generated sounds.
Infant Cries Rattle Adult Cognition.
Dudek, Joanna; Faress, Ahmed; Bornstein, Marc H; Haley, David W
2016-01-01
The attention-grabbing quality of the infant cry is well recognized, but how the emotional valence of infant vocal signals affects adult cognition and cortical activity has heretofore been unknown. We examined the effects of two contrasting infant vocalizations (cries vs. laughs) on adult performance on a Stroop task using a cross-modal distraction paradigm in which infant distractors were vocal and targets were visual. Infant vocalizations were presented before (Experiment 1) or during each Stroop trial (Experiment 2). To evaluate the influence of infant vocalizations on cognitive control, neural responses to the Stroop task were obtained by measuring electroencephalography (EEG) and event-related potentials (ERPs) in Experiment 1. Based on the previously demonstrated existence of negative arousal bias, we hypothesized that cry vocalizations would be more distracting and invoke greater conflict processing than laugh vocalizations. Similarly, we expected participants to have greater difficulty shifting attention from the vocal distractors to the target task after hearing cries vs. after hearing laughs. Behavioral results from both experiments showed a cry interference effect, in which task performance was slower with cry than with laugh distractors. Electrophysiology data further revealed that cries more than laughs reduced attention to the task (smaller P200) and increased conflict processing (larger N450), albeit differently for incongruent and congruent trials. Results from a correlation analysis showed that the amplitudes of P200 and N450 were inversely related, suggesting a reciprocal relationship between attention and conflict processing. The findings suggest that cognitive control processes contribute to an attention bias to infant signals, which is modulated in part by the valence of the infant vocalization and the demands of the cognitive task. The findings thus support the notion that infant cries elicit a negative arousal bias that is distracting; they also identify, for the first time, the neural dynamics underlying the unique influence that infant cries and laughs have on cognitive control.
Penna, Mario; Velásquez, Nelson; Solís, Rigoberto
2008-04-01
Thresholds for evoked vocal responses and thresholds of multiunit midbrain auditory responses to pure tones and synthetic calls were investigated in males of Pleurodema thaul, as behavioral thresholds well above auditory sensitivity have been reported for other anurans. Thresholds for evoked vocal responses to synthetic advertisement calls played back at increasing intensity averaged 43 dB RMS SPL (range 31-52 dB RMS SPL), measured at the subjects' position. Number of pulses increased with stimulus intensities, reaching a plateau at about 18-39 dB above threshold and decreased at higher intensities. Latency to call followed inverse trends relative to number of pulses. Neural audiograms yielded an average best threshold in the high frequency range of 46.6 dB RMS SPL (range 41-51 dB RMS SPL) and a center frequency of 1.9 kHz (range 1.7-2.6 kHz). Auditory thresholds for a synthetic call having a carrier frequency of 2.1 kHz averaged 44 dB RMS SPL (range 39-47 dB RMS SPL). The similarity between thresholds for advertisement calling and auditory thresholds for the advertisement call indicates that male P. thaul use the full extent of their auditory sensitivity in acoustic interactions, likely an evolutionary adaptation allowing chorusing activity in low-density aggregations.
Keough, Dwayne; Jones, Jeffery A.
2009-01-01
Singing requires accurate control of the fundamental frequency (F0) of the voice. This study examined trained singers’ and untrained singers’ (nonsingers’) sensitivity to subtle manipulations in auditory feedback and the subsequent effect on the mapping between F0 feedback and vocal control. Participants produced the consonant-vowel ∕ta∕ while receiving auditory feedback that was shifted up and down in frequency. Results showed that singers and nonsingers compensated to a similar degree when presented with frequency-altered feedback (FAF); however, singers’ F0 values were consistently closer to the intended pitch target. Moreover, singers initiated their compensatory responses when auditory feedback was shifted up or down 6 cents or more, compared to nonsingers who began compensating when feedback was shifted up 26 cents and down 22 cents. Additionally, examination of the first 50 ms of vocalization indicated that participants commenced subsequent vocal utterances, during FAF, near the F0 value on previous shift trials. Interestingly, nonsingers commenced F0 productions below the pitch target and increased their F0 until they matched the note. Thus, singers and nonsingers rely on an internal model to regulate voice F0, but singers’ models appear to be more sensitive in response to subtle discrepancies in auditory feedback. PMID:19640048
Keough, Dwayne; Jones, Jeffery A
2009-08-01
Singing requires accurate control of the fundamental frequency (F0) of the voice. This study examined trained singers' and untrained singers' (nonsingers') sensitivity to subtle manipulations in auditory feedback and the subsequent effect on the mapping between F0 feedback and vocal control. Participants produced the consonant-vowel /ta/ while receiving auditory feedback that was shifted up and down in frequency. Results showed that singers and nonsingers compensated to a similar degree when presented with frequency-altered feedback (FAF); however, singers' F0 values were consistently closer to the intended pitch target. Moreover, singers initiated their compensatory responses when auditory feedback was shifted up or down 6 cents or more, compared to nonsingers who began compensating when feedback was shifted up 26 cents and down 22 cents. Additionally, examination of the first 50 ms of vocalization indicated that participants commenced subsequent vocal utterances, during FAF, near the F0 value on previous shift trials. Interestingly, nonsingers commenced F0 productions below the pitch target and increased their F0 until they matched the note. Thus, singers and nonsingers rely on an internal model to regulate voice F0, but singers' models appear to be more sensitive in response to subtle discrepancies in auditory feedback.
Vocal tract resonances in singing: The soprano voice
NASA Astrophysics Data System (ADS)
Joliveau, Elodie; Smith, John; Wolfe, Joe
2004-10-01
The vocal tract resonances of trained soprano singers were measured while they sang a range of vowels softly at different pitches. The measurements were made by broad band acoustic excitation at the mouth, which allowed the resonances of the tract to be measured simultaneously with and independently from the harmonics of the voice. At low pitch, when the lowest resonance frequency R1 exceeded f0, the values of the first two resonances R1 and R2 varied little with frequency and had values consistent with normal speech. At higher pitches, however, when f0 exceeded the value of R1 observed at low pitch, R1 increased with f0 so that R1 was approximately equal to f0. R2 also increased over this high pitch range, probably as an incidental consequence of the tuning of R1. R3 increased slightly but systematically, across the whole pitch range measured. There was no evidence that any resonances are tuned close to harmonics of the pitch frequency except for R1 at high pitch. The variations in R1 and R2 at high pitch mean that vowels move, converge, and overlap their positions on the vocal plane (R2,R1) to an extent that implies loss of intelligibility. .
TauG-guidance of transients in expressive musical performance.
Schogler, Benjaman; Pepping, Gert-Jan; Lee, David N
2008-08-01
The sounds in expressive musical performance, and the movements that produce them, offer insight into temporal patterns in the brain that generate expression. To gain understanding of these brain patterns, we analyzed two types of transient sounds, and the movements that produced them, during a vocal duet and a bass solo. The transient sounds studied were inter-tone f (0)(t)-glides (the continuous change in fundamental frequency, f (0)(t), when gliding from one tone to the next), and attack intensity-glides (the continuous rise in sound intensity when attacking, or initiating, a tone). The temporal patterns of the inter-tone f (0)(t)-glides and attack intensity-glides, and of the movements producing them, all conformed to the mathematical function, tau (G)(t) (called tauG), predicted by General Tau Theory, and assumed to be generated in the brain. The values of the parameters of the tau (G)(t) function were modulated by the performers when they modulated musical expression. Thus the tau (G)(t) function appears to be a fundamental of brain activity entailed in the generation of expressive temporal patterns of movement and sound.
How bodies and voices interact in early emotion perception.
Jessen, Sarah; Obleser, Jonas; Kotz, Sonja A
2012-01-01
Successful social communication draws strongly on the correct interpretation of others' body and vocal expressions. Both can provide emotional information and often occur simultaneously. Yet their interplay has hardly been studied. Using electroencephalography, we investigated the temporal development underlying their neural interaction in auditory and visual perception. In particular, we tested whether this interaction qualifies as true integration following multisensory integration principles such as inverse effectiveness. Emotional vocalizations were embedded in either low or high levels of noise and presented with or without video clips of matching emotional body expressions. In both, high and low noise conditions, a reduction in auditory N100 amplitude was observed for audiovisual stimuli. However, only under high noise, the N100 peaked earlier in the audiovisual than the auditory condition, suggesting facilitatory effects as predicted by the inverse effectiveness principle. Similarly, we observed earlier N100 peaks in response to emotional compared to neutral audiovisual stimuli. This was not the case in the unimodal auditory condition. Furthermore, suppression of beta-band oscillations (15-25 Hz) primarily reflecting biological motion perception was modulated 200-400 ms after the vocalization. While larger differences in suppression between audiovisual and audio stimuli in high compared to low noise levels were found for emotional stimuli, no such difference was observed for neutral stimuli. This observation is in accordance with the inverse effectiveness principle and suggests a modulation of integration by emotional content. Overall, results show that ecologically valid, complex stimuli such as joined body and vocal expressions are effectively integrated very early in processing.
Performance of a reduced-order FSI model for flow-induced vocal fold vibration
NASA Astrophysics Data System (ADS)
Luo, Haoxiang; Chang, Siyuan; Chen, Ye; Rousseau, Bernard; PhonoSim Team
2017-11-01
Vocal fold vibration during speech production involves a three-dimensional unsteady glottal jet flow and three-dimensional nonlinear tissue mechanics. A full 3D fluid-structure interaction (FSI) model is computationally expensive even though it provides most accurate information about the system. On the other hand, an efficient reduced-order FSI model is useful for fast simulation and analysis of the vocal fold dynamics, which can be applied in procedures such as optimization and parameter estimation. In this work, we study performance of a reduced-order model as compared with the corresponding full 3D model in terms of its accuracy in predicting the vibration frequency and deformation mode. In the reduced-order model, we use a 1D flow model coupled with a 3D tissue model that is the same as in the full 3D model. Two different hyperelastic tissue behaviors are assumed. In addition, the vocal fold thickness and subglottal pressure are varied for systematic comparison. The result shows that the reduced-order model provides consistent predictions as the full 3D model across different tissue material assumptions and subglottal pressures. However, the vocal fold thickness has most effect on the model accuracy, especially when the vocal fold is thin.
Training of Working Memory Impacts Neural Processing of Vocal Pitch Regulation
Li, Weifeng; Guo, Zhiqiang; Jones, Jeffery A.; Huang, Xiyan; Chen, Xi; Liu, Peng; Chen, Shaozhen; Liu, Hanjun
2015-01-01
Working memory training can improve the performance of tasks that were not trained. Whether auditory-motor integration for voice control can benefit from working memory training, however, remains unclear. The present event-related potential (ERP) study examined the impact of working memory training on the auditory-motor processing of vocal pitch. Trained participants underwent adaptive working memory training using a digit span backwards paradigm, while control participants did not receive any training. Before and after training, both trained and control participants were exposed to frequency-altered auditory feedback while producing vocalizations. After training, trained participants exhibited significantly decreased N1 amplitudes and increased P2 amplitudes in response to pitch errors in voice auditory feedback. In addition, there was a significant positive correlation between the degree of improvement in working memory capacity and the post-pre difference in P2 amplitudes. Training-related changes in the vocal compensation, however, were not observed. There was no systematic change in either vocal or cortical responses for control participants. These findings provide evidence that working memory training impacts the cortical processing of feedback errors in vocal pitch regulation. This enhanced cortical processing may be the result of increased neural efficiency in the detection of pitch errors between the intended and actual feedback. PMID:26553373
NASA Astrophysics Data System (ADS)
Ringenberg, Hunter; Rogers, Dylan; Wei, Nathaniel; Krane, Michael; Wei, Timothy
2017-11-01
The objective of this study is to apply experimental data to theoretical framework of Krane (2013) in which the principal aeroacoustic source is expressed in terms of vocal fold drag, glottal jet dynamic head, and glottal exit volume flow, reconciling formal theoretical aeroacoustic descriptions of phonation with more traditional lumped-element descriptions. These quantities appear in the integral equations of motion for phonatory flow. In this way time resolved velocity field measurements can be used to compute time-resolved estimates of the relevant terms in the integral equations of motion, including phonation aeroacoustic source strength. A simplified 10x scale vocal fold model from Krane, et al. (2007) was used to examine symmetric, i.e. `healthy', oscillatory motion of the vocal folds. By using water as the working fluid, very high spatial and temporal resolution was achieved. Temporal variation of transglottal pressure was simultaneously measured with flow on the vocal fold model mid-height. Experiments were dynamically scaled to examine a range of frequencies corresponding to male and female voice. The simultaneity of the pressure and flow provides new insights into the aeroacoustics associated with vocal fold oscillations. Supported by NIH Grant No. 2R01 DC005642-11.