Audio-vocal interaction in single neurons of the monkey ventrolateral prefrontal cortex.
Hage, Steffen R; Nieder, Andreas
2015-05-06
Complex audio-vocal integration systems depend on a strong interconnection between the auditory and the vocal motor system. To gain cognitive control over audio-vocal interaction during vocal motor control, the PFC needs to be involved. Neurons in the ventrolateral PFC (VLPFC) have been shown to separately encode the sensory perceptions and motor production of vocalizations. It is unknown, however, whether single neurons in the PFC reflect audio-vocal interactions. We therefore recorded single-unit activity in the VLPFC of rhesus monkeys (Macaca mulatta) while they produced vocalizations on command or passively listened to monkey calls. We found that 12% of randomly selected neurons in VLPFC modulated their discharge rate in response to acoustic stimulation with species-specific calls. Almost three-fourths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of vocalization. Based on these audio-vocal interactions, the VLPFC might be well positioned to combine higher order auditory processing with cognitive control of the vocal motor output. Such audio-vocal integration processes in the VLPFC might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech. Copyright © 2015 the authors 0270-6474/15/357030-11$15.00/0.
Cooperative vocal control in marmoset monkeys via vocal feedback
Choi, Jung Yoon; Takahashi, Daniel Y.
2015-01-01
Humans adjust speech amplitude as a function of distance from a listener; we do so in a manner that would compensate for such distance. This ability is presumed to be the product of high-level sociocognitive skills. Nonhuman primates are thought to lack such socially related flexibility in vocal production. Using predictions from a simple arousal-based model whereby vocal feedback from a conspecific modulates the drive to produce a vocalization, we tested whether another primate exhibits this type of cooperative vocal control. We conducted a playback experiment with marmoset monkeys and simulated “far-away” and “nearby” conspecifics using contact calls that differed in sound intensity. We found that marmoset monkeys increased the amplitude of their contact calls and produced such calls with shorter response latencies toward more distant conspecifics. The same was not true in response to changing levels of background noise. To account for how simulated conspecific distance can change both the amplitude and timing of vocal responses, we developed a model that incorporates dynamic interactions between the auditory system and limbic “drive” systems. Overall, our data show that, like humans, marmoset monkeys cooperatively control the acoustics of their vocalizations according to changes in listener distance, increasing the likelihood that a conspecific will hear their call. However, we propose that such cooperative vocal control is a system property that does not necessitate any particularly advanced sociocognitive skill. At least in marmosets, this vocal control can be parsimoniously explained by the regulation of arousal states across two interacting individuals via vocal feedback. PMID:25925323
Neuroendocrine control of seasonal plasticity in the auditory and vocal systems of fish
Forlano, Paul M.; Sisneros, Joseph A.; Rohmann, Kevin N.; Bass, Andrew H.
2014-01-01
Seasonal changes in reproductive-related vocal behavior are widespread among fishes. This review highlights recent studies of the vocal plainfin midshipman fish, Porichthys notatus, a neuroethological model system used for the past two decades to explore neural and endocrine mechanisms of vocal-acoustic social behaviors shared with tetrapods. Integrative approaches combining behavior, neurophysiology, neuropharmacology, neuroanatomy, and gene expression methodologies have taken advantage of simple, stereotyped and easily quantifiable behaviors controlled by discrete neural networks in this model system to enable discoveries such as the first demonstration of adaptive seasonal plasticity in the auditory periphery of a vertebrate as well as rapid steroid and neuropeptide effects on vocal physiology and behavior. This simple model system has now revealed cellular and molecular mechanisms underlying seasonal and steroid-driven auditory and vocal plasticity in the vertebrate brain. PMID:25168757
Riede, Tobias; Goller, Franz
2010-10-01
Song production in songbirds is a model system for studying learned vocal behavior. As in humans, bird phonation involves three main motor systems (respiration, vocal organ and vocal tract). The avian respiratory mechanism uses pressure regulation in air sacs to ventilate a rigid lung. In songbirds sound is generated with two independently controlled sound sources, which reside in a uniquely avian vocal organ, the syrinx. However, the physical sound generation mechanism in the syrinx shows strong analogies to that in the human larynx, such that both can be characterized as myoelastic-aerodynamic sound sources. Similarities include active adduction and abduction, oscillating tissue masses which modulate flow rate through the organ and a layered structure of the oscillating tissue masses giving rise to complex viscoelastic properties. Differences in the functional morphology of the sound producing system between birds and humans require specific motor control patterns. The songbird vocal apparatus is adapted for high speed, suggesting that temporal patterns and fast modulation of sound features are important in acoustic communication. Rapid respiratory patterns determine the coarse temporal structure of song and maintain gas exchange even during very long songs. The respiratory system also contributes to the fine control of airflow. Muscular control of the vocal organ regulates airflow and acoustic features. The upper vocal tract of birds filters the sounds generated in the syrinx, and filter properties are actively adjusted. Nonlinear source-filter interactions may also play a role. The unique morphology and biomechanical system for sound production in birds presents an interesting model for exploring parallels in control mechanisms that give rise to highly convergent physical patterns of sound generation. More comparative work should provide a rich source for our understanding of the evolution of complex sound producing systems. Copyright © 2009 Elsevier Inc. All rights reserved.
A Brain for Speech. Evolutionary Continuity in Primate and Human Auditory-Vocal Processing
Aboitiz, Francisco
2018-01-01
In this review article, I propose a continuous evolution from the auditory-vocal apparatus and its mechanisms of neural control in non-human primates, to the peripheral organs and the neural control of human speech. Although there is an overall conservatism both in peripheral systems and in central neural circuits, a few changes were critical for the expansion of vocal plasticity and the elaboration of proto-speech in early humans. Two of the most relevant changes were the acquisition of direct cortical control of the vocal fold musculature and the consolidation of an auditory-vocal articulatory circuit, encompassing auditory areas in the temporoparietal junction and prefrontal and motor areas in the frontal cortex. This articulatory loop, also referred to as the phonological loop, enhanced vocal working memory capacity, enabling early humans to learn increasingly complex utterances. The auditory-vocal circuit became progressively coupled to multimodal systems conveying information about objects and events, which gradually led to the acquisition of modern speech. Gestural communication accompanies the development of vocal communication since very early in human evolution, and although both systems co-evolved tightly in the beginning, at some point speech became the main channel of communication. PMID:29636657
Central Nervous System Control of Voice and Swallowing
Ludlow, Christy L.
2015-01-01
This review of the central nervous control systems for voice and swallowing has suggested that the traditional concepts of a separation between cortical and limbic and brain stem control should be refined and more integrative. For voice production, a separation of the non-human vocalization system from the human learned voice production system has been posited based primarily on studies of non-human primates. However, recent humans studies of emotionally based vocalizations and human volitional voice production has shown more integration between these two systems than previously proposed. Recent human studies have shown that reflexive vocalization as well as learned voice production not involving speech, involve a common integrative system. On the other hand, recent studies of non-human primates have provided evidence of some cortical activity during vocalization and cortical changes with training during vocal behavior. For swallowing, evidence from the macaque and functional brain imaging in humans indicates that the control for the pharyngeal phase of swallowing is not primarily under brain stem mechanisms as previously proposed. Studies suggest that the initiation and patterning of swallowing for the pharyngeal phase is also under active cortical control for both spontaneous as well as volitional swallowing in awake humans and non-human primates. PMID:26241238
Exploring the anatomical encoding of voice with a mathematical model of the vocal system.
Assaneo, M Florencia; Sitt, Jacobo; Varoquaux, Gael; Sigman, Mariano; Cohen, Laurent; Trevisan, Marcos A
2016-11-01
The faculty of language depends on the interplay between the production and perception of speech sounds. A relevant open question is whether the dimensions that organize voice perception in the brain are acoustical or depend on properties of the vocal system that produced it. One of the main empirical difficulties in answering this question is to generate sounds that vary along a continuum according to the anatomical properties the vocal apparatus that produced them. Here we use a mathematical model that offers the unique possibility of synthesizing vocal sounds by controlling a small set of anatomically based parameters. In a first stage the quality of the synthetic voice was evaluated. Using specific time traces for sub-glottal pressure and tension of the vocal folds, the synthetic voices generated perceptual responses, which are indistinguishable from those of real speech. The synthesizer was then used to investigate how the auditory cortex responds to the perception of voice depending on the anatomy of the vocal apparatus. Our fMRI results show that sounds are perceived as human vocalizations when produced by a vocal system that follows a simple relationship between the size of the vocal folds and the vocal tract. We found that these anatomical parameters encode the perceptual vocal identity (male, female, child) and show that the brain areas that respond to human speech also encode vocal identity. On the basis of these results, we propose that this low-dimensional model of the vocal system is capable of generating realistic voices and represents a novel tool to explore the voice perception with a precise control of the anatomical variables that generate speech. Furthermore, the model provides an explanation of how auditory cortices encode voices in terms of the anatomical parameters of the vocal system. Copyright © 2016 Elsevier Inc. All rights reserved.
Early experience shapes vocal neural coding and perception in songbirds
Woolley, Sarah M. N.
2012-01-01
Songbirds, like humans, are highly accomplished vocal learners. The many parallels between speech and birdsong and conserved features of mammalian and avian auditory systems have led to the emergence of the songbird as a model system for studying the perceptual mechanisms of vocal communication. Laboratory research on songbirds allows the careful control of early life experience and high-resolution analysis of brain function during vocal learning, production and perception. Here, I review what songbird studies have revealed about the role of early experience in the development of vocal behavior, auditory perception and the processing of learned vocalizations by auditory neurons. The findings of these studies suggest general principles for how exposure to vocalizations during development and into adulthood influences the perception of learned vocal signals. PMID:22711657
Vocal development in a Waddington landscape
Teramoto, Yayoi; Takahashi, Daniel Y; Holmes, Philip; Ghazanfar, Asif A
2017-01-01
Vocal development is the adaptive coordination of the vocal apparatus, muscles, the nervous system, and social interaction. Here, we use a quantitative framework based on optimal control theory and Waddington’s landscape metaphor to provide an integrated view of this process. With a biomechanical model of the marmoset monkey vocal apparatus and behavioral developmental data, we show that only the combination of the developing vocal tract, vocal apparatus muscles and nervous system can fully account for the patterns of vocal development. Together, these elements influence the shape of the monkeys’ vocal developmental landscape, tilting, rotating or shifting it in different ways. We can thus use this framework to make quantitative predictions regarding how interfering factors or experimental perturbations can change the landscape within a species, or to explain comparative differences in vocal development across species DOI: http://dx.doi.org/10.7554/eLife.20782.001 PMID:28092262
The respiratory-vocal system of songbirds: anatomy, physiology, and neural control.
Schmidt, Marc F; Martin Wild, J
2014-01-01
This wide-ranging review presents an overview of the respiratory-vocal system in songbirds, which are the only other vertebrate group known to display a degree of respiratory control during song rivalling that of humans during speech; this despite the fact that the peripheral components of both the respiratory and vocal systems differ substantially in the two groups. We first provide a brief description of these peripheral components in songbirds (lungs, air sacs and respiratory muscles, vocal organ (syrinx), upper vocal tract) and then proceed to a review of the organization of central respiratory-related neurons in the spinal cord and brainstem, the latter having an organization fundamentally similar to that of the ventral respiratory group of mammals. The second half of the review describes the nature of the motor commands generated in a specialized "cortical" song control circuit and how these might engage brainstem respiratory networks to shape the temporal structure of song. We also discuss a bilaterally projecting "respiratory-thalamic" pathway that links the respiratory system to "cortical" song control nuclei. This necessary pathway for song originates in the brainstem's primary inspiratory center and is hypothesized to play a vital role in synchronizing song motor commands both within and across hemispheres. © 2014 Elsevier B.V. All rights reserved.
The respiratory-vocal system of songbirds: Anatomy, physiology, and neural control
Schmidt, Marc F.; Wild, J. Martin
2015-01-01
This wide-ranging review presents an overview of the respiratory-vocal system in songbirds, which are the only other vertebrate group known to display a degree of respiratory control during song rivalling that of humans during speech; this despite the fact that the peripheral components of both the respiratory and vocal systems differ substantially in the two groups. We first provide a brief description of these peripheral components in songbirds (lungs, air sacs and respiratory muscles, vocal organ (syrinx), upper vocal tract) and then proceed to a review of the organization of central respiratory-related neurons in the spinal cord and brainstem, the latter having an organization fundamentally similar to that of the ventral respiratory group of mammals. The second half of the review describes the nature of the motor commands generated in a specialized “cortical” song control circuit and how these might engage brainstem respiratory networks to shape the temporal structure of song. We also discuss a bilaterally projecting “respiratory-thalamic” pathway that links the respiratory system to “cortical” song control nuclei. This necessary pathway for song originates in the brainstem’s primary inspiratory center and is hypothesized to play a vital role in synchronizing song motor commands both within and across hemispheres. PMID:25194204
Schmidt, Marc F.; McLean, Judith; Goller, Franz
2011-01-01
The production of vocalizations is intimately linked to the respiratory system. Despite our understanding of neural circuits that generate normal respiratory patterns, very little is understood regarding how these ponto-medullary circuits become engaged during vocal production. Songbirds offer a potentially powerful model system for addressing this relationship. Songs dramatically alter the respiratory pattern in ways that are often highly predictable and songbirds have a specialized telencephalic vocal motor circuit that provides massive innervation to a brainstem respiratory network that shares many similarities with its mammalian counterpart. In this review, we highlight interactions between the song motor circuit and the respiratory system, describing how both systems likely interact to produce the complex respiratory patterns that are observed during vocalization. We also discuss how the respiratory system, through its bilateral bottom-up projections to thalamus, might play a key role in sending precisely timed signals that synchronize premotor activity in both hemispheres. PMID:21984733
A robotic voice simulator and the interactive training for hearing-impaired people.
Sawada, Hideyuki; Kitani, Mitsuki; Hayashi, Yasumori
2008-01-01
A talking and singing robot which adaptively learns the vocalization skill by means of an auditory feedback learning algorithm is being developed. The robot consists of motor-controlled vocal organs such as vocal cords, a vocal tract and a nasal cavity to generate a natural voice imitating a human vocalization. In this study, the robot is applied to the training system of speech articulation for the hearing-impaired, because the robot is able to reproduce their vocalization and to teach them how it is to be improved to generate clear speech. The paper briefly introduces the mechanical construction of the robot and how it autonomously acquires the vocalization skill in the auditory feedback learning by listening to human speech. Then the training system is described, together with the evaluation of the speech training by auditory impaired people.
Knockout of Foxp2 disrupts vocal development in mice.
Castellucci, Gregg A; McGinley, Matthew J; McCormick, David A
2016-03-16
The FOXP2 gene is important for the development of proper speech motor control in humans. However, the role of the gene in general vocal behavior in other mammals, including mice, is unclear. Here, we track the vocal development of Foxp2 heterozygous knockout (Foxp2+/-) mice and their wildtype (WT) littermates from juvenile to adult ages, and observe severe abnormalities in the courtship song of Foxp2+/- mice. In comparison to their WT littermates, Foxp2+/- mice vocalized less, produced shorter syllable sequences, and possessed an abnormal syllable inventory. In addition, Foxp2+/- song also exhibited irregular rhythmic structure, and its development did not follow the consistent trajectories observed in WT vocalizations. These results demonstrate that the Foxp2 gene is critical for normal vocal behavior in juvenile and adult mice, and that Foxp2 mutant mice may provide a tractable model system for the study of the gene's role in general vocal motor control.
Scheerer, Nichole E; Jones, Jeffery A
2014-12-01
Speech production requires the combined effort of a feedback control system driven by sensory feedback, and a feedforward control system driven by internal models. However, the factors that dictate the relative weighting of these feedback and feedforward control systems are unclear. In this event-related potential (ERP) study, participants produced vocalisations while being exposed to blocks of frequency-altered feedback (FAF) perturbations that were either predictable in magnitude (consistently either 50 or 100 cents) or unpredictable in magnitude (50- and 100-cent perturbations varying randomly within each vocalisation). Vocal and P1-N1-P2 ERP responses revealed decreases in the magnitude and trial-to-trial variability of vocal responses, smaller N1 amplitudes, and shorter vocal, P1 and N1 response latencies following predictable FAF perturbation magnitudes. In addition, vocal response magnitudes correlated with N1 amplitudes, vocal response latencies, and P2 latencies. This pattern of results suggests that after repeated exposure to predictable FAF perturbations, the contribution of the feedforward control system increases. Examination of the presentation order of the FAF perturbations revealed smaller compensatory responses, smaller P1 and P2 amplitudes, and shorter N1 latencies when the block of predictable 100-cent perturbations occurred prior to the block of predictable 50-cent perturbations. These results suggest that exposure to large perturbations modulates responses to subsequent perturbations of equal or smaller size. Similarly, exposure to a 100-cent perturbation prior to a 50-cent perturbation within a vocalisation decreased the magnitude of vocal and N1 responses, but increased P1 and P2 latencies. Thus, exposure to a single perturbation can affect responses to subsequent perturbations. © 2014 Federation of European Neuroscience Societies and John Wiley & Sons Ltd.
A theoretical study of F0-F1 interaction with application to resonant speaking and singing voice.
Titze, Ingo R
2004-09-01
An interactive source-filter system, consisting of a three-mass body-cover model of the vocal folds and a wave reflection model of the vocal tract, was used to test the dependence of vocal fold vibration on the vocal tract. The degree of interaction is governed by the epilarynx tube, which raises the vocal tract impedance to match the impedance of the glottis. The key component of the impedance is inertive reactance. Whenever there is inertive reactance, the vocal tract assists the vocal folds in vibration. The amplitude of vibration and the glottal flow can more than double, and the oral radiated power can increase up to 10 dB. As F0 approaches F1, the first formant frequency, the interactive source-filter system loses its advantage (because inertive reactance changes to compliant reactance) and the noninteractive system produces greater vocal output. Thus, from a voice training and control standpoint, there may be reasons to operate the system in either interactive and noninteractive modes. The harmonics 2F0 and 3F0 can also benefit from being positioned slightly below F1.
NASA Astrophysics Data System (ADS)
Coudé, Gino
2016-03-01
This comment will be focused on the role of monkey vocal control in the evolution of language. I will essentially reiterate the observations expressed in a commentary [1] about the book ;How the brain got language: the mirror system hypothesis;, written by Arbib [2]. I will hopefully clarify our suggestion that non-human primates vocal communication, in conjunction with gestures, could have had an active role in the emergence of the first voluntary forms of utterances that will later shape protospeech. This suggestion is mainly rooted in neurophysiological data about vocal control in monkey. I will very briefly summarize how neurophysiological data allowed us to suggest a possible role for monkey vocalization in language evolution. We conducted a study [3] in which we recorded from ventral premotor cortex (PMv) of macaques trained to emit vocalizations (i.e. coo-calls). The results showed that the rostro-lateral part of PMv contains neurons that fire during conditioned vocalization. The involvement of PMv in vocalization production was further supported by electrical microstimulation of the cortical sector where some of the vocalization neurons were found. Microstimulation elicited in some cases a combination of jaw, tongue and larynx movements. To us, the evolutionary implications of those results were obvious: a partial voluntary vocal control was already taking place in the primate PMv cortex some 25 million years ago.
Gestures, vocalizations, and memory in language origins.
Aboitiz, Francisco
2012-01-01
THIS ARTICLE DISCUSSES THE POSSIBLE HOMOLOGIES BETWEEN THE HUMAN LANGUAGE NETWORKS AND COMPARABLE AUDITORY PROJECTION SYSTEMS IN THE MACAQUE BRAIN, IN AN ATTEMPT TO RECONCILE TWO EXISTING VIEWS ON LANGUAGE EVOLUTION: one that emphasizes hand control and gestures, and the other that emphasizes auditory-vocal mechanisms. The capacity for language is based on relatively well defined neural substrates whose rudiments have been traced in the non-human primate brain. At its core, this circuit constitutes an auditory-vocal sensorimotor circuit with two main components, a "ventral pathway" connecting anterior auditory regions with anterior ventrolateral prefrontal areas, and a "dorsal pathway" connecting auditory areas with parietal areas and with posterior ventrolateral prefrontal areas via the arcuate fasciculus and the superior longitudinal fasciculus. In humans, the dorsal circuit is especially important for phonological processing and phonological working memory, capacities that are critical for language acquisition and for complex syntax processing. In the macaque, the homolog of the dorsal circuit overlaps with an inferior parietal-premotor network for hand and gesture selection that is under voluntary control, while vocalizations are largely fixed and involuntary. The recruitment of the dorsal component for vocalization behavior in the human lineage, together with a direct cortical control of the subcortical vocalizing system, are proposed to represent a fundamental innovation in human evolution, generating an inflection point that permitted the explosion of vocal language and human communication. In this context, vocal communication and gesturing have a common history in primate communication.
Arriaga, Gustavo; Zhou, Eric P.; Jarvis, Erich D.
2012-01-01
Humans and song-learning birds communicate acoustically using learned vocalizations. The characteristic features of this social communication behavior include vocal control by forebrain motor areas, a direct cortical projection to brainstem vocal motor neurons, and dependence on auditory feedback to develop and maintain learned vocalizations. These features have so far not been found in closely related primate and avian species that do not learn vocalizations. Male mice produce courtship ultrasonic vocalizations with acoustic features similar to songs of song-learning birds. However, it is assumed that mice lack a forebrain system for vocal modification and that their ultrasonic vocalizations are innate. Here we investigated the mouse song system and discovered that it includes a motor cortex region active during singing, that projects directly to brainstem vocal motor neurons and is necessary for keeping song more stereotyped and on pitch. We also discovered that male mice depend on auditory feedback to maintain some ultrasonic song features, and that sub-strains with differences in their songs can match each other's pitch when cross-housed under competitive social conditions. We conclude that male mice have some limited vocal modification abilities with at least some neuroanatomical features thought to be unique to humans and song-learning birds. To explain our findings, we propose a continuum hypothesis of vocal learning. PMID:23071596
Knockout of Foxp2 disrupts vocal development in mice
Castellucci, Gregg A.; McGinley, Matthew J.; McCormick, David A.
2016-01-01
The FOXP2 gene is important for the development of proper speech motor control in humans. However, the role of the gene in general vocal behavior in other mammals, including mice, is unclear. Here, we track the vocal development of Foxp2 heterozygous knockout (Foxp2+/−) mice and their wildtype (WT) littermates from juvenile to adult ages, and observe severe abnormalities in the courtship song of Foxp2+/− mice. In comparison to their WT littermates, Foxp2+/− mice vocalized less, produced shorter syllable sequences, and possessed an abnormal syllable inventory. In addition, Foxp2+/− song also exhibited irregular rhythmic structure, and its development did not follow the consistent trajectories observed in WT vocalizations. These results demonstrate that the Foxp2 gene is critical for normal vocal behavior in juvenile and adult mice, and that Foxp2 mutant mice may provide a tractable model system for the study of the gene’s role in general vocal motor control. PMID:26980647
Auditory and audio-vocal responses of single neurons in the monkey ventral premotor cortex.
Hage, Steffen R
2018-03-20
Monkey vocalization is a complex behavioral pattern, which is flexibly used in audio-vocal communication. A recently proposed dual neural network model suggests that cognitive control might be involved in this behavior, originating from a frontal cortical network in the prefrontal cortex and mediated via projections from the rostral portion of the ventral premotor cortex (PMvr) and motor cortex to the primary vocal motor network in the brainstem. For the rapid adjustment of vocal output to external acoustic events, strong interconnections between vocal motor and auditory sites are needed, which are present at cortical and subcortical levels. However, the role of the PMvr in audio-vocal integration processes remains unclear. In the present study, single neurons in the PMvr were recorded in rhesus monkeys (Macaca mulatta) while volitionally producing vocalizations in a visual detection task or passively listening to monkey vocalizations. Ten percent of randomly selected neurons in the PMvr modulated their discharge rate in response to acoustic stimulation with species-specific calls. More than four-fifths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of the vocalization. Based on these audio-vocal interactions, the PMvr might be well positioned to mediate higher order auditory processing with cognitive control of the vocal motor output to the primary vocal motor network. Such audio-vocal integration processes in the premotor cortex might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech. Copyright © 2018 Elsevier B.V. All rights reserved.
Central pattern generators for social vocalization: Androgen-dependent neurophysiological mechanisms
Bass, Andrew H.; Remage-Healey, Luke
2008-01-01
Historically, most studies of vertebrate central pattern generators (CPGs) have focused on mechanisms for locomotion and respiration. Here, we highlight new results for ectothermic vertebrates, namely teleost fish and amphibians, showing how androgenic steroids can influence the temporal patterning of CPGs for social vocalization. Investigations of vocalizing teleosts show how androgens can rapidly (within minutes) modulate the neurophysiological output of the vocal CPG (fictive vocalizations that mimic the temporal properties of natural vocalizations) inclusive of their divergent actions between species, as well as intraspecific differences between male reproductive morphs. Studies of anuran amphibians (frogs) demonstrate that long-term steroid treatments (wks) can masculinize the fictive vocalizations of females, inclusive of its sensitivity to rapid modulation by serotonin. Given the conserved organization of vocal control systems across vertebrate groups, the vocal CPGs of fish and amphibians provide tractable models for identifying androgen-dependent events that are fundamental to the mechanisms of vocal motor patterning. These basic mechanisms can also inform our understanding of the more complex CPGs for vocalization, and social behaviors in general, that have evolved among birds and mammals. PMID:18262186
A Mechanism for Frequency Modulation in Songbirds Shared with Humans
Margoliash, Daniel
2013-01-01
In most animals that vocalize, control of fundamental frequency is a key element for effective communication. In humans, subglottal pressure controls vocal intensity but also influences fundamental frequency during phonation. Given the underlying similarities in the biomechanical mechanisms of vocalization in humans and songbirds, songbirds offer an attractive opportunity to study frequency modulation by pressure. Here, we present a novel technique for dynamic control of subsyringeal pressure in zebra finches. By regulating the opening of a custom-built fast valve connected to the air sac system, we achieved partial or total silencing of specific syllables, and could modify syllabic acoustics through more complex manipulations of air sac pressure. We also observed that more nuanced pressure variations over a limited interval during production of a syllable concomitantly affected the frequency of that syllable segment. These results can be explained in terms of a mathematical model for phonation that incorporates a nonlinear description for the vocal source capable of generating the observed frequency modulations induced by pressure variations. We conclude that the observed interaction between pressure and frequency was a feature of the source, not a result of feedback control. Our results indicate that, beyond regulating phonation or its absence, regulation of pressure is important for control of fundamental frequencies of vocalizations. Thus, although there are separate brainstem pathways for syringeal and respiratory control of song production, both can affect airflow and frequency. We hypothesize that the control of pressure and frequency is combined holistically at higher levels of the vocalization pathways. PMID:23825417
A mechanism for frequency modulation in songbirds shared with humans.
Amador, Ana; Margoliash, Daniel
2013-07-03
In most animals that vocalize, control of fundamental frequency is a key element for effective communication. In humans, subglottal pressure controls vocal intensity but also influences fundamental frequency during phonation. Given the underlying similarities in the biomechanical mechanisms of vocalization in humans and songbirds, songbirds offer an attractive opportunity to study frequency modulation by pressure. Here, we present a novel technique for dynamic control of subsyringeal pressure in zebra finches. By regulating the opening of a custom-built fast valve connected to the air sac system, we achieved partial or total silencing of specific syllables, and could modify syllabic acoustics through more complex manipulations of air sac pressure. We also observed that more nuanced pressure variations over a limited interval during production of a syllable concomitantly affected the frequency of that syllable segment. These results can be explained in terms of a mathematical model for phonation that incorporates a nonlinear description for the vocal source capable of generating the observed frequency modulations induced by pressure variations. We conclude that the observed interaction between pressure and frequency was a feature of the source, not a result of feedback control. Our results indicate that, beyond regulating phonation or its absence, regulation of pressure is important for control of fundamental frequencies of vocalizations. Thus, although there are separate brainstem pathways for syringeal and respiratory control of song production, both can affect airflow and frequency. We hypothesize that the control of pressure and frequency is combined holistically at higher levels of the vocalization pathways.
Temporal Lobe Epilepsy Alters Auditory-motor Integration For Voice Control
Li, Weifeng; Chen, Ziyi; Yan, Nan; Jones, Jeffery A.; Guo, Zhiqiang; Huang, Xiyan; Chen, Shaozhen; Liu, Peng; Liu, Hanjun
2016-01-01
Temporal lobe epilepsy (TLE) is the most common drug-refractory focal epilepsy in adults. Previous research has shown that patients with TLE exhibit decreased performance in listening to speech sounds and deficits in the cortical processing of auditory information. Whether TLE compromises auditory-motor integration for voice control, however, remains largely unknown. To address this question, event-related potentials (ERPs) and vocal responses to vocal pitch errors (1/2 or 2 semitones upward) heard in auditory feedback were compared across 28 patients with TLE and 28 healthy controls. Patients with TLE produced significantly larger vocal responses but smaller P2 responses than healthy controls. Moreover, patients with TLE exhibited a positive correlation between vocal response magnitude and baseline voice variability and a negative correlation between P2 amplitude and disease duration. Graphical network analyses revealed a disrupted neuronal network for patients with TLE with a significant increase of clustering coefficients and path lengths as compared to healthy controls. These findings provide strong evidence that TLE is associated with an atypical integration of the auditory and motor systems for vocal pitch regulation, and that the functional networks that support the auditory-motor processing of pitch feedback errors differ between patients with TLE and healthy controls. PMID:27356768
Measurements of vocal fold tissue viscoelasticity: Approaching the male phonatory frequency range
NASA Astrophysics Data System (ADS)
Chan, Roger W.
2004-06-01
Viscoelastic shear properties of human vocal fold tissues have been reported previously. However, data have only been obtained at very low frequencies (<=15 Hz). This necessitates data extrapolation to the frequency range of phonation based on constitutive modeling and time-temperature superposition. This study attempted to obtain empirical measurements at higher frequencies with the use of a controlled strain torsional rheometer, with a design of directly controlling input strain that introduced significantly smaller system inertial errors compared to controlled stress rheometry. Linear viscoelastic shear properties of the vocal fold mucosa (cover) from 17 canine larynges were quantified at frequencies of up to 50 Hz. Consistent with previous data, results showed that the elastic shear modulus (G'), viscous shear modulus (G''), and damping ratio (ζ) of the vocal fold mucosa were relatively constant across 0.016-50 Hz, whereas the dynamic viscosity (ɛ') decreased monotonically with frequency. Constitutive characterization of the empirical data by a quasilinear viscoelastic model and a statistical network model demonstrated trends of viscoelastic behavior at higher frequencies generally following those observed at lower frequencies. These findings supported the use of controlled strain rheometry for future investigations of the viscoelasticity of vocal fold tissues and phonosurgical biomaterials at phonatory frequencies.
Keough, Dwayne
2011-01-01
Research on the control of visually guided limb movements indicates that the brain learns and continuously updates an internal model that maps the relationship between motor commands and sensory feedback. A growing body of work suggests that an internal model that relates motor commands to sensory feedback also supports vocal control. There is evidence from arm-reaching studies that shows that when provided with a contextual cue, the motor system can acquire multiple internal models, which allows an animal to adapt to different perturbations in diverse contexts. In this study we show that trained singers can rapidly acquire multiple internal models regarding voice fundamental frequency (F0). These models accommodate different perturbations to ongoing auditory feedback. Participants heard three musical notes and reproduced each one in succession. The musical targets could serve as a contextual cue to indicate which direction (up or down) feedback would be altered on each trial; however, participants were not explicitly instructed to use this strategy. When participants were gradually exposed to altered feedback adaptation was observed immediately following vocal onset. Aftereffects were target specific and did not influence vocal productions on subsequent trials. When target notes were no longer a contextual cue, adaptation occurred during altered feedback trials and evidence for trial-by-trial adaptation was found. These findings indicate that the brain is exceptionally sensitive to the deviations between auditory feedback and the predicted consequence of a motor command during vocalization. Moreover, these results indicate that, with contextual cues, the vocal control system may maintain multiple internal models that are capable of independent modification during different tasks or environments. PMID:21346208
Makiyama, Kiyoshi; Yoshihashi, Hidetaka; Mogitate, Manabu; Kida, Akinori
2005-04-01
To determine the role of the adjustment of expiratory effort in the control of vocal intensity. An intensity-loading test was performed by using the airway interruption method. Three groups of subjects were used: a control group thought to resemble normal vocal fold closure, a group of patients with Reinke's edema thought to represent increased mass at the level of the vocal folds, and a group with vocal fold paralysis that was thought to represent a group with lack of adequate vocal fold closure. In the control group, expiratory lung pressure and airway resistance slightly increased. In the patients with Reinke's edema, expiratory lung pressure, and airway resistance significantly increased. In this group, the voice intensity was controlled by laryngeal adjustment, but a greater expiratory effort was needed because of a greater increase in glottal resistance. In the patients with vocal cord paralysis, airway resistance did not increase even with a high-intensity voice. Vocal intensity was controlled by expiratory effort. If there is sufficient ability for laryngeal adjustment, vocal intensity is controlled primarily by laryngeal adjustment and by expiratory adjustment in response to increased glottal resistance. However, vocal intensity is controlled by expiratory effort when laryngeal adjustment ability is poor.
Riede, Tobias; Tokuda, Isao T.; Farmer, C. G.
2011-01-01
SUMMARY Vocalization is rare among non-avian reptiles, with the exception of the crocodilians, the sister taxon of birds. Crocodilians have a complex vocal repertoire. Their vocal and respiratory system is not well understood but appears to consist of a combination of features that are also found in the extremely vocal avian and mammalian taxa. Anatomical studies suggest that the alligator larynx is able to abduct and adduct the vocal folds, but not to elongate or shorten them, and is therefore lacking a key regulator of frequency, yet alligators can modulate fundamental frequency remarkably well. We investigated the morphological and physiological features of sound production in alligators. Vocal fold length scales isometrically across a wide range of alligator body sizes. The relationship between fundamental frequency and subglottal pressure is significant in some individuals at some isolated points, such as call onset and position of maximum fundamental frequency. The relationship is not consistent over large segments of the call. Fundamental frequency can change faster than expected by pressure changes alone, suggesting an active motor pattern controls frequency and is intrinsic to the larynx. We utilized a two-mass vocal fold model to test whether abduction and adduction could generate this motor pattern. The fine-tuned interplay between subglottal pressure and glottal adduction can achieve frequency modulations much larger than those resulting from subglottal pressure variations alone and of similar magnitude, as observed in alligator calls. We conclude that the alligator larynx represents a sound source with only two control parameters (subglottal pressure and vocal fold adduction) in contrast to the mammalian larynx in which three parameters can be altered to modulate frequency (subglottal pressure, vocal fold adduction and length/tension). PMID:21865521
Tokarev, Kirill; Tiunova, Anna; Scharff, Constance; Anokhin, Konstantin
2011-01-01
Specialized neural pathways, the song system, are required for acquiring, producing, and perceiving learned avian vocalizations. Birds that do not learn to produce their vocalizations lack telencephalic song system components. It is not known whether the song system forebrain regions are exclusively evolved for song or whether they also process information not related to song that might reflect their 'evolutionary history'. To address this question we monitored the induction of two immediate-early genes (IEGs) c-Fos and ZENK in various regions of the song system in zebra finches (Taeniopygia guttata) in response to an aversive food learning paradigm; this involves the association of a food item with a noxious stimulus that affects the oropharyngeal-esophageal cavity and tongue, causing subsequent avoidance of that food item. The motor response results in beak and head movements but not vocalizations. IEGs have been extensively used to map neuro-molecular correlates of song motor production and auditory processing. As previously reported, neurons in two pallial vocal motor regions, HVC and RA, expressed IEGs after singing. Surprisingly, c-Fos was induced equivalently also after food aversion learning in the absence of singing. The density of c-Fos positive neurons was significantly higher than that of birds in control conditions. This was not the case in two other pallial song nuclei important for vocal plasticity, LMAN and Area X, although singing did induce IEGs in these structures, as reported previously. Our results are consistent with the possibility that some of the song nuclei may participate in non-vocal learning and the populations of neurons involved in the two tasks show partial overlap. These findings underscore the previously advanced notion that the specialized forebrain pre-motor nuclei controlling song evolved from circuits involved in behaviors related to feeding.
Vocal Pitch Discrimination in the Motor System
ERIC Educational Resources Information Center
D'Ausilio, Alessandro; Bufalari, Ilaria; Salmas, Paola; Busan, Pierpaolo; Fadiga, Luciano
2011-01-01
Speech production can be broadly separated into two distinct components: Phonation and Articulation. These two aspects require the efficient control of several phono-articulatory effectors. Speech is indeed generated by the vibration of the vocal-folds in the larynx (F0) followed by "filtering" by articulators, to select certain resonant…
A new approach to geometrical measurements in an animal model of vocal fold scar.
Jabbour, Noel; Krishna, Priya D; Osborne, James; Rosen, Clark A
2009-01-01
A standard method for quantifying the geometric properties of vocal folds has not been widely adopted. An ideal method of geometrical measurement should effectively quantify the dimensions of the medial vibratory portion of the vocal fold, should be easily performed, should yield consistent results, and should be readily available at little to no cost. We have developed a new approach for geometrical measurements to meet these goals. The objective of this study is to describe this new approach and to assess its effectiveness in a canine model of vocal fold scar. One hundred thirty-five mid-membranous coronal sections of vocal folds from 10 canines (five with unilateral surgical scarring) were examined by light microscopy; digital images were captured. ImageJ was used to measure a variety of described parameters. Comparison between scarred vocal folds and control vocal folds was made. At least 20% of the slides for each vocal fold were randomly selected (n=42) for repeat measurements of interrater and intrarater reliability. A statistically significant difference between scarred and control vocal folds was obtained for horizontal distance (P<0.001), vertical distance (P=0.005), area (P<0.001), mean optical density (OD) (P<0.001), and OD at defined points along the length of the vocal fold (P< or =0.009). Reliability calculations for intrarater and interrater measurements ranged from r=0.845 to r=0.994 and from r=0.734 to r=0.976, respectively. The proposed approach for geometrical measurements meets the intended objectives in a canine model of vocal fold scar. Future work is needed to apply this approach to other model systems.
Tissue engineering therapies for the vocal fold lamina propria.
Kutty, Jaishankar K; Webb, Ken
2009-09-01
The vocal folds are laryngeal connective tissues with complex matrix composition/organization that provide the viscoelastic mechanical properties required for voice production. Vocal fold injury results in alterations in tissue structure and corresponding changes in tissue biomechanics that reduce vocal quality. Recent work has begun to elucidate the biochemical changes underlying injury-induced pathology and to apply tissue engineering principles to the prevention and reversal of vocal fold scarring. Based on the extensive history of injectable biomaterials in laryngeal surgery, a major focus of regenerative therapies has been the development of novel scaffolds with controlled in vivo residence time and viscoelastic properties approximating the native tissue. Additional strategies have included cell transplantation and delivery of the antifibrotic cytokine hepatocyte growth factor, as well as investigation of the effects of the unique vocal fold vibratory microenvironment using in vitro dynamic culture systems. Recent achievements of significant reductions in fibrosis and improved recovery of native tissue viscoelasticity and vibratory/functional performance in animal models are rapidly moving vocal fold tissue engineering toward clinical application.
Forlano, Paul M; Marchaterre, Margaret; Deitcher, David L; Bass, Andrew H
2010-02-15
Across all major vertebrate groups, androgen receptors (ARs) have been identified in neural circuits that shape reproductive-related behaviors, including vocalization. The vocal control network of teleost fishes presents an archetypal example of how a vertebrate nervous system produces social, context-dependent sounds. We cloned a partial cDNA of AR that was used to generate specific probes to localize AR expression throughout the central nervous system of the vocal plainfin midshipman fish (Porichthys notatus). In the forebrain, AR mRNA is abundant in proposed homologs of the mammalian striatum and amygdala, and in anterior and posterior parvocellular and magnocellular nuclei of the preoptic area, nucleus preglomerulosus, and posterior, ventral and anterior tuberal nuclei of the hypothalamus. Many of these nuclei are part of the known vocal and auditory circuitry in midshipman. The midbrain periaqueductal gray, an essential link between forebrain and hindbrain vocal circuitry, and the lateral line recipient nucleus medialis in the rostral hindbrain also express abundant AR mRNA. In the caudal hindbrain-spinal vocal circuit, high AR mRNA is found in the vocal prepacemaker nucleus and along the dorsal periphery of the vocal motor nucleus congruent with the known pattern of expression of aromatase-containing glial cells. Additionally, abundant AR mRNA expression is shown for the first time in the inner ear of a vertebrate. The distribution of AR mRNA strongly supports the role of androgens as modulators of behaviorally defined vocal, auditory, and neuroendocrine circuits in teleost fish and vertebrates in general. 2009 Wiley-Liss, Inc.
Fee, Michale S.
2011-01-01
Learned motor behaviors require descending forebrain control to be coordinated with midbrain and brainstem motor systems. In songbirds, such as the zebra finch, regular breathing is controlled by brainstem centers, but when the adult songbird begins to sing, its breathing becomes tightly coordinated with forebrain-controlled vocalizations. The periods of silence (gaps) between song syllables are typically filled with brief breaths, allowing the bird to sing uninterrupted for many seconds. While substantial progress has been made in identifying the brain areas and pathways involved in vocal and respiratory control, it is not understood how respiratory and vocal control is coordinated by forebrain motor circuits. Here we combine a recently developed technique for localized brain cooling, together with recordings of thoracic air sac pressure, to examine the role of cortical premotor nucleus HVC (proper name) in respiratory-vocal coordination. We found that HVC cooling, in addition to slowing all song timescales as previously reported, also increased the duration of expiratory pulses (EPs) and inspiratory pulses (IPs). Expiratory pulses, like song syllables, were stretched uniformly by HVC cooling, but most inspiratory pulses exhibited non-uniform stretch of pressure waveform such that the majority of stretch occurred late in the IP. Indeed, some IPs appeared to change duration by the earlier or later truncation of an underlying inspiratory event. These findings are consistent with the idea that during singing the temporal structure of EPs is under the direct control of forebrain circuits, whereas that of IPs can be strongly influenced by circuits downstream of HVC, likely in the brainstem. An analysis of the temporal jitter of respiratory and vocal structure suggests that IPs may be initiated by HVC at the end of each syllable and terminated by HVC immediately before the onset of the next syllable. PMID:21980466
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bouchard, Kristofer E.; Conant, David F.; Anumanchipalli, Gopala K.
A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial-especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship acrossmore » speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics.« less
Anumanchipalli, Gopala K.; Dichter, Benjamin; Chaisanguanthum, Kris S.; Johnson, Keith; Chang, Edward F.
2016-01-01
A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial—especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship across speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics. PMID:27019106
Bouchard, Kristofer E.; Conant, David F.; Anumanchipalli, Gopala K.; ...
2016-03-28
A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial-especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship acrossmore » speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics.« less
Iconicity can ground the creation of vocal symbols.
Perlman, Marcus; Dale, Rick; Lupyan, Gary
2015-08-01
Studies of gestural communication systems find that they originate from spontaneously created iconic gestures. Yet, we know little about how people create vocal communication systems, and many have suggested that vocalizations do not afford iconicity beyond trivial instances of onomatopoeia. It is unknown whether people can generate vocal communication systems through a process of iconic creation similar to gestural systems. Here, we examine the creation and development of a rudimentary vocal symbol system in a laboratory setting. Pairs of participants generated novel vocalizations for 18 different meanings in an iterative 'vocal' charades communication game. The communicators quickly converged on stable vocalizations, and naive listeners could correctly infer their meanings in subsequent playback experiments. People's ability to guess the meanings of these novel vocalizations was predicted by how close the vocalization was to an iconic 'meaning template' we derived from the production data. These results strongly suggest that the meaningfulness of these vocalizations derived from iconicity. Our findings illuminate a mechanism by which iconicity can ground the creation of vocal symbols, analogous to the function of iconicity in gestural communication systems.
Two organizing principles of vocal production: Implications for nonhuman and human primates.
Owren, Michael J; Amoss, R Toby; Rendall, Drew
2011-06-01
Vocal communication in nonhuman primates receives considerable research attention, with many investigators arguing for similarities between this calling and speech in humans. Data from development and neural organization show a central role of affect in monkey and ape sounds, however, suggesting that their calls are homologous to spontaneous human emotional vocalizations while having little relation to spoken language. Based on this evidence, we propose two principles that can be useful in evaluating the many and disparate empirical findings that bear on the nature of vocal production in nonhuman and human primates. One principle distinguishes production-first from reception-first vocal development, referring to the markedly different role of auditory-motor experience in each case. The second highlights a phenomenon dubbed dual neural pathways, specifically that when a species with an existing vocal system evolves a new functionally distinct vocalization capability, it occurs through emergence of a second parallel neural pathway rather than through expansion of the extant circuitry. With these principles as a backdrop, we review evidence of acoustic modification of calling associated with background noise, conditioning effects, audience composition, and vocal convergence and divergence in nonhuman primates. Although each kind of evidence has been interpreted to show flexible cognitively mediated control over vocal production, we suggest that most are more consistent with affectively grounded mechanisms. The lone exception is production of simple, novel sounds in great apes, which is argued to reveal at least some degree of volitional vocal control. If also present in early hominins, the cortically based circuitry surmised to be associated with these rudimentary capabilities likely also provided the substrate for later emergence of the neural pathway allowing volitional production in modern humans. © 2010 Wiley-Liss, Inc.
Blades, Brittany; Parks, Susan E.
2018-01-01
During the breeding season, male harbor seals (Phoca vitulina) make underwater acoustic displays using vocalizations known as roars. These roars have been shown to function in territory establishment in some breeding areas and have been hypothesized to be important for female choice, but the function of these sounds remains unresolved. This study consisted of a series of playback experiments in which captive female harbor seals were exposed to recordings of male roars to determine if females respond to recordings of male vocalizations and whether or not they respond differently to roars from categories with different acoustic characteristics. The categories included roars with characteristics of dominant males (longest duration, lowest frequency), subordinate males (shortest duration, highest frequency), combinations of call parameters from dominant and subordinate males (long duration, high frequency and short duration, low frequency), and control playbacks of water noise and water noise with tonal signals in the same frequency range as male signals. Results indicate that overall females have a significantly higher level of response to playbacks that imitate male vocalizations when compared to control playbacks of water noise. Specifically, there was a higher level of response to playbacks representing dominant male vocalization when compared to the control playbacks. For most individuals, there was a greater response to playbacks representing dominant male vocalizations compared to playbacks representing subordinate male vocalizations; however, there was no statistical difference between those two playback types. Additionally, there was no difference between the playbacks of call parameter combinations and the controls. Investigating female preference for male harbor seal vocalizations is a critical step in understanding the harbor seal mating system and further studies expanding on this captive study will help shed light on this important issue. PMID:29607261
Matthews, Leanna P; Blades, Brittany; Parks, Susan E
2018-01-01
During the breeding season, male harbor seals ( Phoca vitulina ) make underwater acoustic displays using vocalizations known as roars. These roars have been shown to function in territory establishment in some breeding areas and have been hypothesized to be important for female choice, but the function of these sounds remains unresolved. This study consisted of a series of playback experiments in which captive female harbor seals were exposed to recordings of male roars to determine if females respond to recordings of male vocalizations and whether or not they respond differently to roars from categories with different acoustic characteristics. The categories included roars with characteristics of dominant males (longest duration, lowest frequency), subordinate males (shortest duration, highest frequency), combinations of call parameters from dominant and subordinate males (long duration, high frequency and short duration, low frequency), and control playbacks of water noise and water noise with tonal signals in the same frequency range as male signals. Results indicate that overall females have a significantly higher level of response to playbacks that imitate male vocalizations when compared to control playbacks of water noise. Specifically, there was a higher level of response to playbacks representing dominant male vocalization when compared to the control playbacks. For most individuals, there was a greater response to playbacks representing dominant male vocalizations compared to playbacks representing subordinate male vocalizations; however, there was no statistical difference between those two playback types. Additionally, there was no difference between the playbacks of call parameter combinations and the controls. Investigating female preference for male harbor seal vocalizations is a critical step in understanding the harbor seal mating system and further studies expanding on this captive study will help shed light on this important issue.
Universal mechanisms of sound production and control in birds and mammals
Elemans, C.P.H; Rasmussen, J.H.; Herbst, C.T.; Düring, D.N.; Zollinger, S.A.; Brumm, H.; Srivastava, K.; Svane, N.; Ding, M.; Larsen, O.N.; Sober, S.J.; Švec, J.G.
2015-01-01
As animals vocalize, their vocal organ transforms motor commands into vocalizations for social communication. In birds, the physical mechanisms by which vocalizations are produced and controlled remain unresolved because of the extreme difficulty in obtaining in vivo measurements. Here, we introduce an ex vivo preparation of the avian vocal organ that allows simultaneous high-speed imaging, muscle stimulation and kinematic and acoustic analyses to reveal the mechanisms of vocal production in birds across a wide range of taxa. Remarkably, we show that all species tested employ the myoelastic-aerodynamic (MEAD) mechanism, the same mechanism used to produce human speech. Furthermore, we show substantial redundancy in the control of key vocal parameters ex vivo, suggesting that in vivo vocalizations may also not be specified by unique motor commands. We propose that such motor redundancy can aid vocal learning and is common to MEAD sound production across birds and mammals, including humans. PMID:26612008
Universal mechanisms of sound production and control in birds and mammals.
Elemans, C P H; Rasmussen, J H; Herbst, C T; Düring, D N; Zollinger, S A; Brumm, H; Srivastava, K; Svane, N; Ding, M; Larsen, O N; Sober, S J; Švec, J G
2015-11-27
As animals vocalize, their vocal organ transforms motor commands into vocalizations for social communication. In birds, the physical mechanisms by which vocalizations are produced and controlled remain unresolved because of the extreme difficulty in obtaining in vivo measurements. Here, we introduce an ex vivo preparation of the avian vocal organ that allows simultaneous high-speed imaging, muscle stimulation and kinematic and acoustic analyses to reveal the mechanisms of vocal production in birds across a wide range of taxa. Remarkably, we show that all species tested employ the myoelastic-aerodynamic (MEAD) mechanism, the same mechanism used to produce human speech. Furthermore, we show substantial redundancy in the control of key vocal parameters ex vivo, suggesting that in vivo vocalizations may also not be specified by unique motor commands. We propose that such motor redundancy can aid vocal learning and is common to MEAD sound production across birds and mammals, including humans.
Iconicity can ground the creation of vocal symbols
Perlman, Marcus; Dale, Rick; Lupyan, Gary
2015-01-01
Studies of gestural communication systems find that they originate from spontaneously created iconic gestures. Yet, we know little about how people create vocal communication systems, and many have suggested that vocalizations do not afford iconicity beyond trivial instances of onomatopoeia. It is unknown whether people can generate vocal communication systems through a process of iconic creation similar to gestural systems. Here, we examine the creation and development of a rudimentary vocal symbol system in a laboratory setting. Pairs of participants generated novel vocalizations for 18 different meanings in an iterative ‘vocal’ charades communication game. The communicators quickly converged on stable vocalizations, and naive listeners could correctly infer their meanings in subsequent playback experiments. People's ability to guess the meanings of these novel vocalizations was predicted by how close the vocalization was to an iconic ‘meaning template’ we derived from the production data. These results strongly suggest that the meaningfulness of these vocalizations derived from iconicity. Our findings illuminate a mechanism by which iconicity can ground the creation of vocal symbols, analogous to the function of iconicity in gestural communication systems. PMID:26361547
Adapted to Roar: Functional Morphology of Tiger and Lion Vocal Folds
Klemuk, Sarah A.; Riede, Tobias; Walsh, Edward J.; Titze, Ingo R.
2011-01-01
Vocal production requires active control of the respiratory system, larynx and vocal tract. Vocal sounds in mammals are produced by flow-induced vocal fold oscillation, which requires vocal fold tissue that can sustain the mechanical stress during phonation. Our understanding of the relationship between morphology and vocal function of vocal folds is very limited. Here we tested the hypothesis that vocal fold morphology and viscoelastic properties allow a prediction of fundamental frequency range of sounds that can be produced, and minimal lung pressure necessary to initiate phonation. We tested the hypothesis in lions and tigers who are well-known for producing low frequency and very loud roaring sounds that expose vocal folds to large stresses. In histological sections, we found that the Panthera vocal fold lamina propria consists of a lateral region with adipocytes embedded in a network of collagen and elastin fibers and hyaluronan. There is also a medial region that contains only fibrous proteins and hyaluronan but no fat cells. Young's moduli range between 10 and 2000 kPa for strains up to 60%. Shear moduli ranged between 0.1 and 2 kPa and differed between layers. Biomechanical and morphological data were used to make predictions of fundamental frequency and subglottal pressure ranges. Such predictions agreed well with measurements from natural phonation and phonation of excised larynges, respectively. We assume that fat shapes Panthera vocal folds into an advantageous geometry for phonation and it protects vocal folds. Its primary function is probably not to increase vocal fold mass as suggested previously. The large square-shaped Panthera vocal fold eases phonation onset and thereby extends the dynamic range of the voice. PMID:22073246
Wang, Weian; Lu, Rong
2013-06-01
To investigate the effect of laryngoscopic surgery combined with nasal endoscopic system for the treatment of vocal cords benign lesions. Fifty-two patients admitted to our department with vocal cords benign lesions (including vocal polyps, vocal nodules, vocal cord cyst) underwent laryngoscopic surgery combined with nasal endoscopic system. All patients were treated successfully once and for all without any significant postoperative complication. The laryngoscopic surgery combined with nasal endoscopic system is a safe, minimally invasive and simple method for the treatment of benign lesions of vocal cords.
Effects of Voice Harmonic Complexity on ERP Responses to Pitch-Shifted Auditory Feedback
Behroozmand, Roozbeh; Korzyukov, Oleg; Larson, Charles R.
2011-01-01
Objective The present study investigated the neural mechanisms of voice pitch control for different levels of harmonic complexity in the auditory feedback. Methods Event-related potentials (ERPs) were recorded in response to +200 cents pitch perturbations in the auditory feedback of self-produced natural human vocalizations, complex and pure tone stimuli during active vocalization and passive listening conditions. Results During active vocal production, ERP amplitudes were largest in response to pitch shifts in the natural voice, moderately large for non-voice complex stimuli and smallest for the pure tones. However, during passive listening, neural responses were equally large for pitch shifts in voice and non-voice complex stimuli but still larger than that for pure tones. Conclusions These findings suggest that pitch change detection is facilitated for spectrally rich sounds such as natural human voice and non-voice complex stimuli compared with pure tones. Vocalization-induced increase in neural responses for voice feedback suggests that sensory processing of naturally-produced complex sounds such as human voice is enhanced by means of motor-driven mechanisms (e.g. efference copies) during vocal production. Significance This enhancement may enable the audio-vocal system to more effectively detect and correct for vocal errors in the feedback of natural human vocalizations to maintain an intended vocal output for speaking. PMID:21719346
Imaizumi, Mitsuyoshi; Thibeault, Susan L; Leydon, Ciara
2014-11-01
Extent of vocal fold injury impacts the nature and timing of wound healing and voice outcomes. However, depth and extent of the lesion created to study wound healing in animal models vary across studies, likely contributing to different outcomes. Our goal was to create a surgery classification system to enable comparison of postoperative outcomes across animal vocal fold wound-healing studies. Prospective, controlled animal study. Rats underwent one of three types of unilateral vocal fold surgeries classified by depth and length of resection. The surgeries were: for subepithelial injury, resection of epithelium and superficial layer of the lamina propria at the midmembranous portion of the vocal fold; for transmucosal injury, resection of epithelium and lamina propria; and for transmuscular injury, resection of epithelium, lamina propria, and superficial portion of the vocalis muscle. Wound healing was evaluated histologically at various time points up to 35 days postinjury. Complete healing occurred by 14 days postsurgery for subepithelial injury, and by day 35 for transmucosal injury. Injury remained present at day 35 for transmuscular injury. Timing and completeness of healing varied by extent and depth of resection. Scarless healing occurred rapidly following subepithelial injury, whereas scarring was observed at 5 weeks after transmuscular injury. The proposed classification system may facilitate comparison of surgical outcomes across vocal fold wound-healing studies. N/A. © 2014 The American Laryngological, Rhinological and Otological Society, Inc.
The Human Voice in Speech and Singing
NASA Astrophysics Data System (ADS)
Lindblom, Björn; Sundberg, Johan
This chapter
The Human Voice in Speech and Singing
NASA Astrophysics Data System (ADS)
Lindblom, Björn; Sundberg, Johan
This chapter describes various aspects of the human voice as a means of communication in speech and singing. From the point of view of function, vocal sounds can be regarded as the end result of a three stage process: (1) the compression of air in the respiratory system, which produces an exhalatory airstream, (2) the vibrating vocal folds' transformation of this air stream to an intermittent or pulsating air stream, which is a complex tone, referred to as the voice source, and (3) the filtering of this complex tone in the vocal tract resonator. The main function of the respiratory system is to generate an overpressure of air under the glottis, or a subglottal pressure. Section 16.1 describes different aspects of the respiratory system of significance to speech and singing, including lung volume ranges, subglottal pressures, and how this pressure is affected by the ever-varying recoil forces. The complex tone generated when the air stream from the lungs passes the vibrating vocal folds can be varied in at least three dimensions: fundamental frequency, amplitude and spectrum. Section 16.2 describes how these properties of the voice source are affected by the subglottal pressure, the length and stiffness of the vocal folds and how firmly the vocal folds are adducted. Section 16.3 gives an account of the vocal tract filter, how its form determines the frequencies of its resonances, and Sect. 16.4 gives an account for how these resonance frequencies or formants shape the vocal sounds by imposing spectrum peaks separated by spectrum valleys, and how the frequencies of these peaks determine vowel and voice qualities. The remaining sections of the chapter describe various aspects of the acoustic signals used for vocal communication in speech and singing. The syllable structure is discussed in Sect. 16.5, the closely related aspects of rhythmicity and timing in speech and singing is described in Sect. 16.6, and pitch and rhythm aspects in Sect. 16.7. The impressive control of all these acoustic characteristics of vocal signals is discussed in Sect. 16.8, while Sect. 16.9 considers expressive aspects of vocal communication.
Tokarev, Kirill; Tiunova, Anna
2011-01-01
Background Specialized neural pathways, the song system, are required for acquiring, producing, and perceiving learned avian vocalizations. Birds that do not learn to produce their vocalizations lack telencephalic song system components. It is not known whether the song system forebrain regions are exclusively evolved for song or whether they also process information not related to song that might reflect their ‘evolutionary history’. Methodology/Principal Findings To address this question we monitored the induction of two immediate-early genes (IEGs) c-Fos and ZENK in various regions of the song system in zebra finches (Taeniopygia guttata) in response to an aversive food learning paradigm; this involves the association of a food item with a noxious stimulus that affects the oropharyngeal-esophageal cavity and tongue, causing subsequent avoidance of that food item. The motor response results in beak and head movements but not vocalizations. IEGs have been extensively used to map neuro-molecular correlates of song motor production and auditory processing. As previously reported, neurons in two pallial vocal motor regions, HVC and RA, expressed IEGs after singing. Surprisingly, c-Fos was induced equivalently also after food aversion learning in the absence of singing. The density of c-Fos positive neurons was significantly higher than that of birds in control conditions. This was not the case in two other pallial song nuclei important for vocal plasticity, LMAN and Area X, although singing did induce IEGs in these structures, as reported previously. Conclusions/Significance Our results are consistent with the possibility that some of the song nuclei may participate in non-vocal learning and the populations of neurons involved in the two tasks show partial overlap. These findings underscore the previously advanced notion that the specialized forebrain pre-motor nuclei controlling song evolved from circuits involved in behaviors related to feeding. PMID:21695176
Zhou, Zhou; Ge, Pingjiang; Liu, Qian; Liu, Ming; Zhang, Wei
2015-08-01
To investigate the applicability of the eysphonia severity index (DSI) in evaluating effects of surgery between before and after groups of vocal polyp patients. Analyses of measurement data pre and pro-surgery of 70 vocal polyp patients and 35 no voice disorders volunteers (control group). The voice quality was measured subjectively with the voice handicap index (VHI), the GRBAS and fiber electronic laryngoscopy. Measures of maximum phonation time (MPT), shimmer and jitter were obtained for each subject by using DiVAS 2.30 (XION, Germany). The DiVAS 2.30 had spotanenously calculate the scores of DSI. Using SPSS 17.0 to find the differences of DSI scores among the three groups by one-way ANOVA variance analysis. And finding out of the correlation with DSI scores and VHI scores, GRBAS, MPT, jitter and shimmer. DSI improved significantly after surgery in the vocal polyps group (mean difference DSI -2.92 and 1.87, respectively) and also in the control group (mean difference DSI -2.92 and 2.30, respectively). However, no significant difference between the control group and the after surgery group. By using Pearson correlation analysis, this study observed a strong correlation between the DSI scores and the VHI scores, the values of GRBAS, shimmer (P < 0.01). DSI is an effective and high accuracy multi-parameter system for evaluation of vocal cord polyp patients as an independent assessment of dysphonia. DSI also can be used in evaluation of the effects of the vocal polyps surgery.
Neural Correlates of the Lombard Effect in Primate Auditory Cortex
Eliades, Steven J.
2012-01-01
Speaking is a sensory-motor process that involves constant self-monitoring to ensure accurate vocal production. Self-monitoring of vocal feedback allows rapid adjustment to correct perceived differences between intended and produced vocalizations. One important behavior in vocal feedback control is a compensatory increase in vocal intensity in response to noise masking during vocal production, commonly referred to as the Lombard effect. This behavior requires mechanisms for continuously monitoring auditory feedback during speaking. However, the underlying neural mechanisms are poorly understood. Here we show that when marmoset monkeys vocalize in the presence of masking noise that disrupts vocal feedback, the compensatory increase in vocal intensity is accompanied by a shift in auditory cortex activity toward neural response patterns seen during vocalizations under normal feedback condition. Furthermore, we show that neural activity in auditory cortex during a vocalization phrase predicts vocal intensity compensation in subsequent phrases. These observations demonstrate that the auditory cortex participates in self-monitoring during the Lombard effect, and may play a role in the compensation of noise masking during feedback-mediated vocal control. PMID:22855821
Collagen Content Limits Optical Coherence Tomography Image Depth in Porcine Vocal Fold Tissue.
Garcia, Jordan A; Benboujja, Fouzi; Beaudette, Kathy; Rogers, Derek; Maurer, Rie; Boudoux, Caroline; Hartnick, Christopher J
2016-11-01
Vocal fold scarring, a condition defined by increased collagen content, is challenging to treat without a method of noninvasively assessing vocal fold structure in vivo. The goal of this study was to observe the effects of vocal fold collagen content on optical coherence tomography imaging to develop a quantifiable marker of disease. Excised specimen study. Massachusetts Eye and Ear Infirmary. Porcine vocal folds were injected with collagenase to remove collagen from the lamina propria. Optical coherence tomography imaging was performed preinjection and at 0, 45, 90, and 180 minutes postinjection. Mean pixel intensity (or image brightness) was extracted from images of collagenase- and control-treated hemilarynges. Texture analysis of the lamina propria at each injection site was performed to extract image contrast. Two-factor repeated measure analysis of variance and t tests were used to determine statistical significance. Picrosirius red staining was performed to confirm collagenase activity. Mean pixel intensity was higher at injection sites of collagenase-treated vocal folds than control vocal folds (P < .0001). Fold change in image contrast was significantly increased in collagenase-treated vocal folds than control vocal folds (P = .002). Picrosirius red staining in control specimens revealed collagen fibrils most prominent in the subepithelium and above the thyroarytenoid muscle. Specimens treated with collagenase exhibited a loss of these structures. Collagen removal from vocal fold tissue increases image brightness of underlying structures. This inverse relationship may be useful in treating vocal fold scarring in patients. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Impact of call center work in subjective voice symptoms and complaints--an analytic study.
Rechenberg, Leila; Goulart, Bárbara Niegia Garcia de; Roithmann, Renato
2011-12-01
To estimate the prevalence of vocal symptoms, occupational risk factors, associated symptoms and their impact on the professional activity of the telemarketers. Cross-section analytical study with 124 telemarketers and 109 administrative workers (control group) selected from a random sample stratified by gender. The subjects answered an anonymous self-administered questionnaire involving issues related to the presence of vocal symptoms, potential risk factors for dysphonia, and vocal impact of symptoms in professional activity. The presence of one or more voice symptoms that occurred daily or weekly was considered positive for the presence of vocal symptoms. The prevalence of vocal symptoms was found in 33% of telemarketers and in 21% of the control group, indicating an association between vocal symptoms and the activity of the telemarketer. When adjusted for confounders, this association remained in the sense of risk. In telemarketers, the sensation of dry air, ambient noise, and lack of vocal rest were the most frequently reported complaints reported by those presenting vocal symptoms. Almost 70% of telemarketers with vocal symptoms reported that these symptoms interfere with their professional activity. The rate of absenteeism by vocal symptoms in this group was 29%. Vocal symptoms are common in most telemarketers when compared to their peer controls, and significantly affect their job performance.
Laughter as an approach to vocal evolution: The bipedal theory.
Provine, Robert R
2017-02-01
Laughter is a simple, stereotyped, innate, human play vocalization that is ideal for the study of vocal evolution. The basic approach of describing the act of laughter and when we do it has revealed a variety of phenomena of social, linguistic, and neurological significance. Findings include the acoustic structure of laughter, the minimal voluntary control of laughter, the punctuation effect (which describes the placement of laughter in conversation and indicates the dominance of speech over laughter), and the role of laughter in human matching and mating. Especially notable is the use of laughter to discover why humans can speak and other apes cannot. Quadrupeds, including our primate ancestors, have a 1:1 relation between breathing and stride because their thorax must absorb forelimb impacts during running. The direct link between breathing and locomotion limits vocalizations to short, simple utterances, such as the characteristic panting chimpanzee laugh (one sound per inward or outward breath). The evolution of bipedal locomotion freed the respiration system of its support function during running, permitting greater breath control and the selection for human-type laughter (a parsed exhalation), and subsequently the virtuosic, sustained, expiratory vocalization of speech. This is the basis of the bipedal theory of speech evolution.
Contingent imitation increases verbal interaction in children with autism spectrum disorders.
Ishizuka, Yuka; Yamamoto, Jun-Ichi
2016-11-01
Several studies have suggested that contingent adult imitation increase nonverbal communication, such as attention and proximity to adults, in children with autism spectrum disorders. However, few studies have shown the effect of contingent imitation on verbal communication. This study examined whether children with autism were able to promote verbal interaction such as vocal imitation, vocalization, and vocal turn-taking via contingent imitation. We used an alternating treatment design composed of the conditions of contingent imitation and control for six children with autism (aged 33-63 months). For contingent imitation condition, adults imitated children's vocalization immediately. For control condition, adults did not imitate but gave a vocal response immediately. Results showed that in contingent imitation condition, all children increased the number of vocal imitations and vocal turn-takings compared with control condition. The number of vocalizations increased in both condition for all children. Overall, it is suggested that all children promote verbal interaction via contingent imitation. © The Author(s) 2016.
Effects of voice harmonic complexity on ERP responses to pitch-shifted auditory feedback.
Behroozmand, Roozbeh; Korzyukov, Oleg; Larson, Charles R
2011-12-01
The present study investigated the neural mechanisms of voice pitch control for different levels of harmonic complexity in the auditory feedback. Event-related potentials (ERPs) were recorded in response to+200 cents pitch perturbations in the auditory feedback of self-produced natural human vocalizations, complex and pure tone stimuli during active vocalization and passive listening conditions. During active vocal production, ERP amplitudes were largest in response to pitch shifts in the natural voice, moderately large for non-voice complex stimuli and smallest for the pure tones. However, during passive listening, neural responses were equally large for pitch shifts in voice and non-voice complex stimuli but still larger than that for pure tones. These findings suggest that pitch change detection is facilitated for spectrally rich sounds such as natural human voice and non-voice complex stimuli compared with pure tones. Vocalization-induced increase in neural responses for voice feedback suggests that sensory processing of naturally-produced complex sounds such as human voice is enhanced by means of motor-driven mechanisms (e.g. efference copies) during vocal production. This enhancement may enable the audio-vocal system to more effectively detect and correct for vocal errors in the feedback of natural human vocalizations to maintain an intended vocal output for speaking. Copyright © 2011 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Multifunctional and Context-Dependent Control of Vocal Acoustics by Individual Muscles
Srivastava, Kyle H.; Elemans, Coen P.H.
2015-01-01
The relationship between muscle activity and behavioral output determines how the brain controls and modifies complex skills. In vocal control, ensembles of muscles are used to precisely tune single acoustic parameters such as fundamental frequency and sound amplitude. If individual vocal muscles were dedicated to the control of single parameters, then the brain could control each parameter independently by modulating the appropriate muscle or muscles. Alternatively, if each muscle influenced multiple parameters, a more complex control strategy would be required to selectively modulate a single parameter. Additionally, it is unknown whether the function of single muscles is fixed or varies across different vocal gestures. A fixed relationship would allow the brain to use the same changes in muscle activation to, for example, increase the fundamental frequency of different vocal gestures, whereas a context-dependent scheme would require the brain to calculate different motor modifications in each case. We tested the hypothesis that single muscles control multiple acoustic parameters and that the function of single muscles varies across gestures using three complementary approaches. First, we recorded electromyographic data from vocal muscles in singing Bengalese finches. Second, we electrically perturbed the activity of single muscles during song. Third, we developed an ex vivo technique to analyze the biomechanical and acoustic consequences of single-muscle perturbations. We found that single muscles drive changes in multiple parameters and that the function of single muscles differs across vocal gestures, suggesting that the brain uses a complex, gesture-dependent control scheme to regulate vocal output. PMID:26490859
Voice disorder in systemic lupus erythematosus
de Macedo, Milena S. F. C.; da Silva Filho, Manoel
2017-01-01
Systemic lupus erythematosus (SLE) is a chronic disease characterized by progressive tissue damage. In recent decades, novel treatments have greatly extended the life span of SLE patients. This creates a high demand for identifying the overarching symptoms associated with SLE and developing therapies that improve their life quality under chronic care. We hypothesized that SLE patients would present dysphonic symptoms. Given that voice disorders can reduce life quality, identifying a potential SLE-related dysphonia could be relevant for the appraisal and management of this disease. We measured objective vocal parameters and perceived vocal quality with the GRBAS (Grade, Roughness, Breathiness, Asthenia, Strain) scale in SLE patients and compared them to matched healthy controls. SLE patients also filled a questionnaire reporting perceived vocal deficits. SLE patients had significantly lower vocal intensity and harmonics to noise ratio, as well as increased jitter and shimmer. All subjective parameters of the GRBAS scale were significantly abnormal in SLE patients. Additionally, the vast majority of SLE patients (29/36) reported at least one perceived vocal deficit, with the most prevalent deficits being vocal fatigue (19/36) and hoarseness (17/36). Self-reported voice deficits were highly correlated with altered GRBAS scores. Additionally, tissue damage scores in different organ systems correlated with dysphonic symptoms, suggesting that some features of SLE-related dysphonia are due to tissue damage. Our results show that a large fraction of SLE patients suffers from perceivable dysphonia and may benefit from voice therapy in order to improve quality of life. PMID:28414781
Vocal learning in elephants: neural bases and adaptive context
Stoeger, Angela S; Manger, Paul
2014-01-01
In the last decade clear evidence has accumulated that elephants are capable of vocal production learning. Examples of vocal imitation are documented in African (Loxodonta africana) and Asian (Elephas maximus) elephants, but little is known about the function of vocal learning within the natural communication systems of either species. We are also just starting to identify the neural basis of elephant vocalizations. The African elephant diencephalon and brainstem possess specializations related to aspects of neural information processing in the motor system (affecting the timing and learning of trunk movements) and the auditory and vocalization system. Comparative interdisciplinary (from behavioral to neuroanatomical) studies are strongly warranted to increase our understanding of both vocal learning and vocal behavior in elephants. PMID:25062469
Effect of high-dose vocal fold injection of cidofovir and bevacizumab in a porcine model.
Ahmed, Mostafa M; Connor, Matthew P; Palazzolo, Mitzi; Thompson, Michelle E; Lospinoso, Josh; O'Connor, Peter; Howard, N Scott; Maturo, Stephen C
2017-03-01
Perform a follow-up study to investigate the histologic impact of high-dose intralaryngeal cidofovir injections in porcine vocal cords, either alone or in combination with bevacizumab, and compared to saline controls. This was an in vivo study involving 24 pigs with blinded pathologist review of specimens. Six groups were created, with four subjects in each group. Each subject received 10 or 20 mg of either cidofovir or bevacizumab alone, or in combination, injected into the right vocal cord. The left vocal fold was used as a saline control. Three separate injections were made at 2-week intervals. Larynges were harvested at 8 and 12 weeks, stained with hematoxylin and eosin and trichrome stain, and reviewed for histologic changes by two blinded pathologists. Minimal inflammation, edema, and atypia were noted with all treatments. Increased glandular inflammation was noted with 10 mg bevacizumab (P < 0.05), which decreased when combined with 10 mg cidofovir (P < 0.05). No lamina propria or muscle fibrosis was observed. Drug duration had no statistically significant histologic impact. High-dose cidofovir and bevacizumab do not induce detrimental vocal fold changes. Combination cidofovir and bevacizumab do not cause vocal fold scarring. Further work is needed to assess systemic concentration with this high-dose combination in humans. N/A. Laryngoscope, 127:671-675, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
González-García, Nadia; Rendón, Pablo L
2017-05-23
The neural correlates of consonance and dissonance perception have been widely studied, but not the neural correlates of consonance and dissonance production. The most straightforward manner of musical production is singing, but, from an imaging perspective, it still presents more challenges than listening because it involves motor activity. The accurate singing of musical intervals requires integration between auditory feedback processing and vocal motor control in order to correctly produce each note. This protocol presents a method that permits the monitoring of neural activations associated with the vocal production of consonant and dissonant intervals. Four musical intervals, two consonant and two dissonant, are used as stimuli, both for an auditory discrimination test and a task that involves first listening to and then reproducing given intervals. Participants, all female vocal students at the conservatory level, were studied using functional Magnetic Resonance Imaging (fMRI) during the performance of the singing task, with the listening task serving as a control condition. In this manner, the activity of both the motor and auditory systems was observed, and a measure of vocal accuracy during the singing task was also obtained. Thus, the protocol can also be used to track activations associated with singing different types of intervals or with singing the required notes more accurately. The results indicate that singing dissonant intervals requires greater participation of the neural mechanisms responsible for the integration of external feedback from the auditory and sensorimotor systems than does singing consonant intervals.
Hoarseness and vocal tract discomfort and associated risk factors in air traffic controllers.
Korn, Gustavo Polacow; Villar, Anna Carolina; Azevedo, Renata Rangel
2018-04-05
An air traffic controller is a professional who performs air traffic control functions in air traffic control units and is responsible for controlling the various stages of a flight. To compare hoarseness and vocal tract discomfort and their risk factors among air traffic controllers in the approach control of São Paulo. In a cross-sectional survey, a voice self-evaluation adapted from to self-evaluation prepared by the Brazilian Ministry of Labor for teachers was administered to 76 air traffic controllers at approach control of São Paulo, Brazil. The percentage of hoarseness and vocal tract discomfort was 19.7% and 38.2%, respectively. In relation to air pollution, the percentages of hoarseness and vocal tract discomfort were higher among those who consider their working environment to be intolerable than among those in a comfortable or disturbing environment. The percentage of hoarseness was higher among those who seek medical advice due to vocal complaints and among those who experience difficulty using their voice at work than among those who experience mild or no difficulty. The percentage of vocal tract discomfort was higher among those in a very tense and stressful environment than among those who consider their work environment to be mild or moderately tense and stressful. The percentage of vocal tract discomfort was higher among those who describe themselves as very tense and stressed or tense and stressed than among those who describe themselves as calm. Additionally, the percentage of vocal tract discomfort was higher among those who care about their health. Among air traffic controllers, the percentage of vocal tract discomfort was almost twice that of hoarseness. Both symptoms are prevalent among air traffic controllers who considered their workplace intolerable in terms of air pollution. Vocal tract discomfort was related to a tense and stressful environment, and hoarseness was related to difficulty using the voice at work. Copyright © 2018 Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial. Published by Elsevier Editora Ltda. All rights reserved.
Life Experience of Patients With Unilateral Vocal Fold Paralysis.
Francis, David O; Sherman, Ariel E; Hovis, Kristen L; Bonnet, Kemberlee; Schlundt, David; Garrett, C Gaelyn; Davies, Louise
2018-05-01
Clinicians and patients benefit when they have a clear understanding of how medical conditions influence patients' life experiences. Patients' perspectives on life with unilateral vocal fold paralysis have not been well described. To promote patient-centered care by characterizing the patient experiences of living with unilateral vocal fold paralysis. This study used mixed methods: surveys using the voice and dysphagia handicap indexes (VHI and DHI) and semistructured interviews with adults with unilateral vocal cord paralysis recruited from a tertiary voice center. Recorded interviews were transcribed, coded using a hierarchical coding system, and analyzed using an iterative inductive-deductive approach. Symptom domains of the patient experience. In 36 patients (26 [72%] were female, and the median age and interquartile range [IQR] were 63 years [48-68 years]; median interview duration, 42 minutes), median VHI and DHI scores were 96 (IQR, 77-108) and 55.5 (IQR, 35-89) at the time of interviews, respectively. Frustration, isolation, fear, and altered self-identity were primary themes permeating patients' experiences. Frustrations related to limitations in communication, employment, and the medical system. Sources of fear included a loss of control, fear of further dysfunction or permanent disability, concern for health consequences (eg, aspiration pneumonia), and/or an inability to call for help in emergency situations. These experiences were modified by the following factors: resilience, self-efficacy, perceived sense of control, and social support systems. Effects of unilateral vocal fold paralysis extend beyond impaired voice and other somatic symptoms. Awareness of the extent to which these patients experience frustration, isolation, fear, and altered self-identity is important. A patient-centered approach to optimizing unilateral vocal fold paralysis treatment is enhanced by an understanding of both the physical dimension of this condition and how patients cope with the considerable emotional and social consequences. Recognizing the psychosocial dimensions of disease allows clinicians to communicate more effectively, be more empathetic, and to better personalize treatment plans, which may lead to improved patient care and patient satisfaction.
Oral breathing challenge in participants with vocal attrition.
Sivasankar, Mahalakshmi; Fisher, Kimberly V
2003-12-01
Vocal folds undergo osmotic challenge by mouth breathing during singing, exercising, and loud speaking. Just 15 min of obligatory oral breathing, to dry the vocal folds, increases phonation threshold pressure (Pth) and expiratory vocal effort in healthy speakers (M. Sivasankar & K. Fisher, 2002). We questioned whether oral breathing is more detrimental to phonation in healthy participants with a history of temporary vocal attrition. The effects of a 15-min oral or nasal breathing challenge on Pth and perceived expiratory vocal effort were compared for participants reporting symptoms of vocal attrition (N = 18, ages 19-38 years) and normal controls (N = 20, ages 19-33 years). Post-challenge-prechallenge differences in Pth (deltaPth) and effort (deltaEffort) revealed that oral breathing, but not nasal breathing, increased Pth (p < .001 ) and effort (p < .001) at low, comfortable, and high pitch. deltaPth was significantly greater in participants with vocal attrition than in normal controls (p < .001). Nasal breathing reduced Pth for all controls but not for all participants reporting vocal attrition. deltaPth was significantly and linearly correlated with deltaEffort (rvocal attrition = .81, p < .001; rcontrol = .84, p < .001). We speculate that the greater increases in Pth in participants reporting vocal attrition may result from delayed or inadequate compensatory response to superficial laryngeal dehydration. Obligatory oral breathing may place voice users at risk for exacerbating vocal attrition. That sol layer depletion by obligatory oral breathing increased Pth and vocal effort provides support for the role of superficial hydration in maintaining ease of phonation.
Integrating perspectives on vocal performance and consistency
Sakata, Jon T.; Vehrencamp, Sandra L.
2012-01-01
SUMMARY Recent experiments in divergent fields of birdsong have revealed that vocal performance is important for reproductive success and under active control by distinct neural circuits. Vocal consistency, the degree to which the spectral properties (e.g. dominant or fundamental frequency) of song elements are produced consistently from rendition to rendition, has been highlighted as a biologically important aspect of vocal performance. Here, we synthesize functional, developmental and mechanistic (neurophysiological) perspectives to generate an integrated understanding of this facet of vocal performance. Behavioral studies in the field and laboratory have found that vocal consistency is affected by social context, season and development, and, moreover, positively correlated with reproductive success. Mechanistic investigations have revealed a contribution of forebrain and basal ganglia circuits and sex steroid hormones to the control of vocal consistency. Across behavioral, developmental and mechanistic studies, a convergent theme regarding the importance of vocal practice in juvenile and adult songbirds emerges, providing a basis for linking these levels of analysis. By understanding vocal consistency at these levels, we gain an appreciation for the various dimensions of song control and plasticity and argue that genes regulating the function of basal ganglia circuits and sex steroid hormones could be sculpted by sexual selection. PMID:22189763
Sohrabji, F; Nordeen, E J; Nordeen, K W
1990-01-01
Area X, a large sexually dimorphic nucleus in the avian ventral forebrain, is part of a highly discrete system of interconnected nuclei that have been implicated in either song learning or adult song production. Previously, this nucleus has been included in the song system because of its substantial connections with other vocal control nuclei, and because its volume is positively correlated with the capacity for song. In order to directly assess the role of Area X in song behavior, this nucleus was bilaterally lesioned in both juvenile and adult zebra finches, using ibotenic acid. We report here that lesioning Area X disrupts normal song development in juvenile birds, but does not affect the production of stereotyped song by adult birds. Although juvenile-lesioned birds were consistently judged as being in earlier stages of vocal development than age-matched controls, they continued to produce normal song-like vocalizations. Thus, unlike the lateral magnocellular nucleus of the anterior neostriatum, another avian forebrain nucleus implicated in song learning, Area X does not seem to be necessary for sustaining production of juvenile song. Rather, the behavioral results suggest Area X is important for either the acquisition of a song model or the improvement of song through vocal practice.
Phonetogram changes for trained singers over a nine-month period of vocal training.
LeBorgne, Wendy DeLeo; Weinrich, Barbara D
2002-03-01
Professional vocalists encounter demands requiring voluntary control of phonation, while utilizing a considerable range of frequency and intensity. These quantifiable acoustic events can be measured and represented in a phonetogram. Previous research has compared the phonetograms of trained and untrained voices and found significant differences between these groups. This study was designed to assess the effects of vocal training for singers over a period of nine months. Phonetogram contour changes were examined, with the primary focus on expansion of frequency range and/or intensity control. Twenty-one first-year, master's level, vocal music students, who were engaged in an intensive vocal performance curriculum, participated in this study. Following nine months of vocal training, significant differences were revealed in the subjects' mean frequency range and minimum vocal intensity across frequency levels. There was no significant difference for the mean maximum vocal intensity across frequency levels following vocal training.
Vocal coordination and vocal imitation: a role for mirror neurons?
Newman, John D
2014-04-01
Some birds and mammals have vocal communication systems in which coordination between individuals is important. Examples would include duetting or antiphonal calling in some birds and mammals, rapid exchanges of the same vocalization, and vocal exchanges between paired individuals and other nearby pairs. Mirror neurons may play a role in such systems but become functional only after experience.
Tardif, Carole; Lainé, France; Rodriguez, Mélissa; Gepner, Bruno
2007-09-01
This study examined the effects of slowing down presentation of facial expressions and their corresponding vocal sounds on facial expression recognition and facial and/or vocal imitation in children with autism. Twelve autistic children and twenty-four normal control children were presented with emotional and non-emotional facial expressions on CD-Rom, under audio or silent conditions, and under dynamic visual conditions (slowly, very slowly, at normal speed) plus a static control. Overall, children with autism showed lower performance in expression recognition and more induced facial-vocal imitation than controls. In the autistic group, facial expression recognition and induced facial-vocal imitation were significantly enhanced in slow conditions. Findings may give new perspectives for understanding and intervention for verbal and emotional perceptive and communicative impairments in autistic populations.
Atypical neural responses to vocal anger in attention-deficit/hyperactivity disorder.
Chronaki, Georgia; Benikos, Nicholas; Fairchild, Graeme; Sonuga-Barke, Edmund J S
2015-04-01
Deficits in facial emotion processing, reported in attention-deficit/hyperactivity disorder (ADHD), have been linked to both early perceptual and later attentional components of event-related potentials (ERPs). However, the neural underpinnings of vocal emotion processing deficits in ADHD have yet to be characterised. Here, we report the first ERP study of vocal affective prosody processing in ADHD. Event-related potentials of 6-11-year-old children with ADHD (n = 25) and typically developing controls (n = 25) were recorded as they completed a task measuring recognition of vocal prosodic stimuli (angry, happy and neutral). Audiometric assessments were conducted to screen for hearing impairments. Children with ADHD were less accurate than controls at recognising vocal anger. Relative to controls, they displayed enhanced N100 and attenuated P300 components to vocal anger. The P300 effect was reduced, but remained significant, after controlling for N100 effects by rebaselining. Only the N100 effect was significant when children with ADHD and comorbid conduct disorder (n = 10) were excluded. This study provides the first evidence linking ADHD to atypical neural activity during the early perceptual stages of vocal anger processing. These effects may reflect preattentive hyper-vigilance to vocal anger in ADHD. © 2014 Association for Child and Adolescent Mental Health.
Vocal Hyperfunction in Parents of Children With Attention Deficit Hyperactivity Disorder.
Teresa, Garcia-Real; Díaz-Román, Tomás M
2016-05-01
The objective of this study was to evaluate the presence of habits and symptoms of vocal hyperfunction in the parents of children with attention deficit hyperactivity disorder (ADHD). Parents of 24 children with ADHD and 30 children of a control group completed a specific questionnaire to detect the hyperfunctional use of the voice (excessive talking, excessive loudness, talking too fast, and shouting), hoarseness, vocal fatigue, mental and physical fatigue, and the degree of parental concern for the vocal health of their child. Parents of children with ADHD spoke more often, faster, and stronger than the parents of the control group; in addition, they also used a louder volume than they usually used when they spoke to their children. The parents manifested more vocal, mental, and physical fatigue than the parents of the control group. There was a significant correlation between the "concern" for the vocal health of their children with respect to vocal symptoms of the children, the habits of vocal hyperfunctioning, and the symptoms suffered by the parents. These results suggest that the parents of children with ADHD change their vocal attitude when communicating with their children. Most likely, the increased concern of parents with ADHD children and their respective level of stress lead to hyperfunctional vocal usage. This subsequently leads to symptoms of vocal, physical, and mental fatigue at the end of the day. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Szabo Portela, Annika; Granqvist, Svante; Ternström, Sten; Södersten, Maria
2018-01-01
This study aimed to assess vocal behavior in women with voice-intensive occupations to investigate differences between patients and controls and between work and leisure conditions with environmental noise level as an experimental factor. Patients with work-related voice disorders, 10 with phonasthenia and 10 with vocal nodules, were matched regarding age, profession, and workplace with 20 vocally healthy colleagues. The sound pressure level of environmental noise and the speakers' voice, fundamental frequency, and phonation ratio were registered from morning to night during 1 week with a voice accumulator. Voice data were assessed in low (≤55 dBA), moderate, and high (>70 dBA) environmental noise levels. The average environmental noise level was significantly higher during the work condition for patients with vocal nodules (73.9 dBA) and their controls (73.0 dBA) compared with patients with phonasthenia (68.3 dBA) and their controls (67.1 dBA). The average voice level and the fundamental frequency were also significantly higher during work for the patients with vocal nodules and their controls. During the leisure condition, there were no significant differences in average noise and voice level nor fundamental frequency between the groups. The patients with vocal nodules and their controls spent significantly more time and used their voices significantly more in high-environmental noise levels. High noise levels during work and demands from the occupation impact vocal behavior. Thus, assessment of voice ergonomics should be part of the work environmental management. To reduce environmental noise levels is important to improve voice ergonomic conditions in communication-intensive and vocally demanding workplaces. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Spoken language and arm gestures are controlled by the same motor control system.
Gentilucci, Maurizio; Dalla Volta, Riccardo
2008-06-01
Arm movements can influence language comprehension much as semantics can influence arm movement planning. Arm movement itself can be used as a linguistic signal. We reviewed neurophysiological and behavioural evidence that manual gestures and vocal language share the same control system. Studies of primate premotor cortex and, in particular, of the so-called "mirror system", including humans, suggest the existence of a dual hand/mouth motor command system involved in ingestion activities. This may be the platform on which a combined manual and vocal communication system was constructed. In humans, speech is typically accompanied by manual gesture, speech production itself is influenced by executing or observing transitive hand actions, and manual actions play an important role in the development of speech, from the babbling stage onwards. Behavioural data also show reciprocal influence between word and symbolic gestures. Neuroimaging and repetitive transcranial magnetic stimulation (rTMS) data suggest that the system governing both speech and gesture is located in Broca's area. In general, the presented data support the hypothesis that the hand motor-control system is involved in higher order cognition.
Campbell's monkeys concatenate vocalizations into context-specific call sequences
Ouattara, Karim; Lemasson, Alban; Zuberbühler, Klaus
2009-01-01
Primate vocal behavior is often considered irrelevant in modeling human language evolution, mainly because of the caller's limited vocal control and apparent lack of intentional signaling. Here, we present the results of a long-term study on Campbell's monkeys, which has revealed an unrivaled degree of vocal complexity. Adult males produced six different loud call types, which they combined into various sequences in highly context-specific ways. We found stereotyped sequences that were strongly associated with cohesion and travel, falling trees, neighboring groups, nonpredatory animals, unspecific predatory threat, and specific predator classes. Within the responses to predators, we found that crowned eagles triggered four and leopards three different sequences, depending on how the caller learned about their presence. Callers followed a number of principles when concatenating sequences, such as nonrandom transition probabilities of call types, addition of specific calls into an existing sequence to form a different one, or recombination of two sequences to form a third one. We conclude that these primates have overcome some of the constraints of limited vocal control by combinatorial organization. As the different sequences were so tightly linked to specific external events, the Campbell's monkey call system may be the most complex example of ‘proto-syntax’ in animal communication known to date. PMID:20007377
Effect of a Single Musical Cakra Activation Manoeuvre on Body Temperature: An Exploratory Study
Sumathy, Sundar; Parmar, Parin N
2016-01-01
Cakra activation/balancing and music therapy are part of the traditional Indian healing system. Little is known about effect of musical (vocal) technique of cakra activation on body temperature. We conducted a single-session exploratory study to evaluate effects of a single musical (vocal) cakra activation manoeuvre on body temperature in controlled settings. Seven healthy adults performed a single musical (vocal) cakra activation manoeuvre for approximately 12 minutes in controlled environmental conditions. Pre- and post-manoeuvre body temperatures were recorded with a clinical mercury thermometer. After a single manoeuvre, increase in body temperature was recorded in all seven subjects. The range of increase in body temperature was from 0.2°F to 1.4°F; with mean temperature rise being 0.5°F and median temperature rise being 0.4°F. We conclude that a single session of musical (vocal) technique of cakra activation elevated body temperatures in all 7 subjects. Further research is required to study effects of various cakra activation techniques on body temperature and other physiological parameters. PMID:28182030
Effect of a Single Musical Cakra Activation Manoeuvre on Body Temperature: An Exploratory Study.
Sumathy, Sundar; Parmar, Parin N
2016-01-01
Cakra activation/balancing and music therapy are part of the traditional Indian healing system. Little is known about effect of musical (vocal) technique of cakra activation on body temperature. We conducted a single-session exploratory study to evaluate effects of a single musical (vocal) cakra activation manoeuvre on body temperature in controlled settings. Seven healthy adults performed a single musical (vocal) cakra activation manoeuvre for approximately 12 minutes in controlled environmental conditions. Pre- and post-manoeuvre body temperatures were recorded with a clinical mercury thermometer. After a single manoeuvre, increase in body temperature was recorded in all seven subjects. The range of increase in body temperature was from 0.2°F to 1.4°F; with mean temperature rise being 0.5°F and median temperature rise being 0.4°F. We conclude that a single session of musical (vocal) technique of cakra activation elevated body temperatures in all 7 subjects. Further research is required to study effects of various cakra activation techniques on body temperature and other physiological parameters.
Neural Correlates of Vocal Production and Motor Control in Human Heschl's Gyrus
Oya, Hiroyuki; Nourski, Kirill V.; Kawasaki, Hiroto; Larson, Charles R.; Brugge, John F.; Howard, Matthew A.; Greenlee, Jeremy D.W.
2016-01-01
The present study investigated how pitch frequency, a perceptually relevant aspect of periodicity in natural human vocalizations, is encoded in Heschl's gyrus (HG), and how this information may be used to influence vocal pitch motor control. We recorded local field potentials from multicontact depth electrodes implanted in HG of 14 neurosurgical epilepsy patients as they vocalized vowel sounds and received brief (200 ms) pitch perturbations at 100 Cents in their auditory feedback. Event-related band power responses to vocalizations showed sustained frequency following responses that tracked voice fundamental frequency (F0) and were significantly enhanced in posteromedial HG during speaking compared with when subjects listened to the playback of their own voice. In addition to frequency following responses, a transient response component within the high gamma frequency band (75–150 Hz) was identified. When this response followed the onset of vocalization, the magnitude of the response was the same for the speaking and playback conditions. In contrast, when this response followed a pitch shift, its magnitude was significantly enhanced during speaking compared with playback. We also observed that, in anterolateral HG, the power of high gamma responses to pitch shifts correlated with the magnitude of compensatory vocal responses. These findings demonstrate a functional parcellation of HG with neural activity that encodes pitch in natural human voice, distinguishes between self-generated and passively heard vocalizations, detects discrepancies between the intended and heard vocalization, and contains information about the resulting behavioral vocal compensations in response to auditory feedback pitch perturbations. SIGNIFICANCE STATEMENT The present study is a significant contribution to our understanding of sensor-motor mechanisms of vocal production and motor control. The findings demonstrate distinct functional parcellation of core and noncore areas within human auditory cortex on Heschl's gyrus that process natural human vocalizations and pitch perturbations in the auditory feedback. In addition, our data provide evidence for distinct roles of high gamma neural oscillations and frequency following responses for processing periodicity in human vocalizations during vocal production and motor control. PMID:26888939
Coetzee, Johann F; Lubbers, Brian V; Toerber, Scott E; Gehring, Ronette; Thomson, Daniel U; White, Bradley J; Apley, Michael D
2008-06-01
To evaluate plasma concentrations of substance P (SP) and cortisol in calves after castration or simulated castration. 10 Angus-crossbred calves. Calves were acclimated for 5 days, assigned to a block on the basis of scrotal circumference, and randomly assigned to a castrated or simulated-castrated (control) group. Blood samples were collected twice before, at the time of (0 hours), and at several times points after castration or simulated castration. Vocalization and attitude scores were determined at time of castration or simulated castration. Plasma concentrations of SP and cortisol were determined by use of competitive and chemiluminescent enzyme immunoassays, respectively. Data were analyzed by use of repeated-measures analysis with a mixed model. Mean +/- SEM cortisol concentration in castrated calves (78.88+/-10.07 nmol/L) was similar to that in uncastrated control calves (73.01+/-10.07 nmol/L). However, mean SP concentration in castrated calves (506.43+/-38.11 pg/mL) was significantly higher than the concentration in control calves (386.42+/-40.09 pg/mL). Mean cortisol concentration in calves with vocalization scores of 0 was not significantly different from the concentration in calves with vocalization scores of 3. However, calves with vocalization scores of 3 had significantly higher SP concentrations, compared with SP concentrations for calves with vocalization scores of 0. Similar cortisol concentrations were measured in castrated and control calves. A significant increase in plasma concentrations of SP after castration suggested a likely association with nociception. These results may affect assessment of animal well-being in livestock production systems.
Bicarbonate availability for vocal fold epithelial defense to acidic challenge.
Durkes, Abigail; Sivasankar, M Preeti
2014-01-01
Bicarbonate is critical for acid-base tissue homeostasis. In this study we investigated the role of bicarbonate ion transport in vocal fold epithelial defense to acid challenges. Acidic insults to the larynx are common in gastric reflux, carcinogenesis and metastasis, and acute inflammation. Ion transport was measured in viable porcine vocal fold epithelium. First, 18 vocal folds were exposed to either the carbonic anhydrase antagonist acetazolamide or to vehicle. Second, 32 vocal folds were exposed to either a control buffer or a bicarbonate-free buffer on their luminal or basolateral surface or both. Third, 32 vocal folds were challenged with acid in the presence of bicarbonate-free or control buffer. The vocal fold transepithelial resistance was greater than 300 Ω*cm(2), suggesting robust barrier integrity. Ion transport did not change after exposure to acetazolamide (p > 0.05). Exposure to bicarbonate-free buffer did not compromise vocal fold ion transport (p > 0.05). Ion transport increased after acid challenge. This increase approached statistical significance and was the greatest for the control buffer and for the bicarbonate-free buffer applied to the basolateral surface. Bicarbonate secretion may contribute to vocal fold defense against acid challenge. Our data offer a potential novel role for bicarbonate as a therapeutic agent to reduce pH abnormalities in the larynx and prevent associated pathological changes.
Neuronal Control of Mammalian Vocalization, with Special Reference to the Squirrel Monkey
NASA Astrophysics Data System (ADS)
Jürgens, Uwe
Squirrel monkey vocalization can be considered as a suitable model for the study in humans of the neurobiological basis of nonverbal emotional vocal utterances, such as laughing, crying, and groaning. Evaluation of electrical and chemical brain stimulation data, lesioning studies, single-neurone recordings, and neuroanatomical tracing work leads to the following conclusions: The periaqueductal gray and laterally bordering tegmentum of the midbrain represent a crucial area for the production of vocalization. This area collects the various vocalization-triggering stimuli, such as auditory, visual, and somatosensory input from diverse sensory-processing structures, motivation-controlling input from some limbic structures, and volitional impulses from the anterior cingulate cortex. Destruction of this area causes mutism. It is still under dispute whether the periaqueductal region harbors the vocal pattern generator or merely couples vocalization-triggering information to motor-coordinating structures further downward in the brainstem. The periaqueductal region is connected with the phonatory motoneuron pools indirectly via one or several interneurons. The nucleus retroambiguus represents a crucial relay station for the laryngeal and expiratory component of vocalization. The articulatory component reaches the orofacial motoneuron pools via the parvocellular reticular formation. Essential proprioceptive feedback from the larynx and lungs enter the vocal-controlling network via the solitary tract nucleus.
Piristine, Hande C; Choetso, Tenzin; Gobes, Sharon M H
2016-11-01
Sensory feedback is essential for acquiring and maintaining complex motor behaviors, including birdsong. In zebra finches, auditory feedback reaches the song control circuits primarily through the nucleus interfacialis nidopalii (Nif), which provides excitatory input to HVC (proper name)-a premotor region essential for the production of learned vocalizations. Despite being one of the major inputs to the song control pathway, the role of Nif in generating vocalizations is not well understood. To address this, we transiently inactivated Nif in late juvenile zebra finches. Upon Nif inactivation (in both hemispheres or on one side only), birds went from singing stereotyped zebra finch song to uttering highly variable and unstructured vocalizations resembling sub-song, an early juvenile song form driven by a basal ganglia circuit. Simultaneously inactivating Nif and LMAN (lateral magnocellular nucleus of the anterior nidopallium), the output nucleus of a basal ganglia circuit, inhibited song production altogether. These results suggest that Nif is required for generating the premotor drive for song. Permanent Nif lesions, in contrast, have only transient effects on vocal production, with song recovering within a day. The sensorimotor nucleus Nif thus produces a premotor drive to the motor pathway that is acutely required for generating learned vocalizations, but once permanently removed, the song system can compensate for its absence. © 2016 Wiley Periodicals, Inc. Develop Neurobiol 76: 1213-1225, 2016. © 2016 Wiley Periodicals, Inc.
A rabbit vocal fold laser scarring model for testing lamina propria tissue engineering therapies
Mau, Ted; Du, Mindy; Xu, Chet C.
2015-01-01
Objectives/Hypothesis To develop a vocal fold scarring model using an ablative laser in the rabbit as a platform for testing bioengineered therapies for missing or damaged lamina propria. Study Design Prospective controlled animal study. Methods An optimal laser energy level was first determined by assessing the depths of vocal fold injury created by a Holmium:YAG laser at various energy levels on fresh cadaveric rabbit larynges. The selected energy level was then used to create controlled unilateral injuries in vocal folds of New Zealand white rabbits, with the contralateral folds serving as uninjured controls. After 4 weeks, the larynges were harvested and subjected to excised-larynx phonation with high-speed imaging and immunohistochemical staining for collagen types I and III, elastin, and hyaluronic acid (HA) with quantitative histological analysis. Results 1.8 joules produced full-thickness injury of the lamina propria without extensive muscle injury. After 4 weeks, the injured vocal folds vibrated with reduced amplitude (P = 0.036) in excised-larynx phonation compared to normal vocal folds. The injured vocal folds contained a higher relative density of collagen type I (P = 0.004), higher elastin (P = 0.022), and lower HA (P = 0.030) compared to normal controls. Collagen type III was unchanged. Conclusions With its potential for higher precision of injury, this laser vocal fold scarring model may serve as an alternative to scarring produced by cold instruments for studying the effects of vocal fold lamina propria bioengineered therapies. Level of Evidence N/A. PMID:24715695
Singing ability is rooted in vocal-motor control of pitch.
Hutchins, Sean; Larrouy-Maestri, Pauline; Peretz, Isabelle
2014-11-01
The inability to vocally match a pitch can be caused by poor pitch perception or by poor vocal-motor control. Although previous studies have tried to examine the relationship between pitch perception and vocal production, they have failed to control for the timbre of the target to be matched. In the present study, we compare pitch-matching accuracy with an unfamiliar instrument (the slider) and with the voice, designed such that the slider plays back recordings of the participant's own voice. We also measured pitch accuracy in singing a familiar melody ("Happy Birthday") to assess the relationship between single-pitch-matching tasks and melodic singing. Our results showed that participants (all nonmusicians) were significantly better at matching recordings of their own voices with the slider than with their voice, indicating that vocal-motor control is an important limiting factor on singing ability. We also found significant correlations between the ability to sing a melody in tune and vocal pitch matching, but not pitch matching on the slider. Better melodic singers also tended to have higher quality voices (as measured by acoustic variables). These results provide important evidence about the role of vocal-motor control in poor singing ability and demonstrate that single-pitch-matching tasks can be useful in measuring general singing abilities.
Zhang, Lucy T.; Yang, Jubiao
2017-01-01
In this work we explore the aerodynamics flow characteristics of a coupled fluid-structure interaction system using a generalized Bernoulli equation derived directly from the Cauchy momentum equations. Unlike the conventional Bernoulli equation where incompressible, inviscid, and steady flow conditions are assumed, this generalized Bernoulli equation includes the contributions from compressibility, viscous, and unsteadiness, which could be essential in defining aerodynamic characteristics. The application of the derived Bernoulli’s principle is on a fully-coupled fluid-structure interaction simulation of the vocal folds vibration. The coupled system is simulated using the immersed finite element method where compressible Navier-Stokes equations are used to describe the air and an elastic pliable structure to describe the vocal fold. The vibration of the vocal fold works to open and close the glottal flow. The aerodynamics flow characteristics are evaluated using the derived Bernoulli’s principles for a vibration cycle in a carefully partitioned control volume based on the moving structure. The results agree very well to experimental observations, which validate the strategy and its use in other types of flow characteristics that involve coupled fluid-structure interactions. PMID:29527541
Zhang, Lucy T; Yang, Jubiao
2016-12-01
In this work we explore the aerodynamics flow characteristics of a coupled fluid-structure interaction system using a generalized Bernoulli equation derived directly from the Cauchy momentum equations. Unlike the conventional Bernoulli equation where incompressible, inviscid, and steady flow conditions are assumed, this generalized Bernoulli equation includes the contributions from compressibility, viscous, and unsteadiness, which could be essential in defining aerodynamic characteristics. The application of the derived Bernoulli's principle is on a fully-coupled fluid-structure interaction simulation of the vocal folds vibration. The coupled system is simulated using the immersed finite element method where compressible Navier-Stokes equations are used to describe the air and an elastic pliable structure to describe the vocal fold. The vibration of the vocal fold works to open and close the glottal flow. The aerodynamics flow characteristics are evaluated using the derived Bernoulli's principles for a vibration cycle in a carefully partitioned control volume based on the moving structure. The results agree very well to experimental observations, which validate the strategy and its use in other types of flow characteristics that involve coupled fluid-structure interactions.
Female mice ultrasonically interact with males during courtship displays
Neunuebel, Joshua P; Taylor, Adam L; Arthur, Ben J; Egnor, SE Roian
2015-01-01
During courtship males attract females with elaborate behaviors. In mice, these displays include ultrasonic vocalizations. Ultrasonic courtship vocalizations were previously attributed to the courting male, despite evidence that both sexes produce virtually indistinguishable vocalizations. Because of this similarity, and the difficulty of assigning vocalizations to individuals, the vocal contribution of each individual during courtship is unknown. To address this question, we developed a microphone array system to localize vocalizations from socially interacting, individual adult mice. With this system, we show that female mice vocally interact with males during courtship. Males and females jointly increased their vocalization rates during chases. Furthermore, a female's participation in these vocal interactions may function as a signal that indicates a state of increased receptivity. Our results reveal a novel form of vocal communication during mouse courtship, and lay the groundwork for a mechanistic dissection of communication during social behavior. DOI: http://dx.doi.org/10.7554/eLife.06203.001 PMID:26020291
Dual Neural Network Model for the Evolution of Speech and Language.
Hage, Steffen R; Nieder, Andreas
2016-12-01
Explaining the evolution of speech and language poses one of the biggest challenges in biology. We propose a dual network model that posits a volitional articulatory motor network (VAMN) originating in the prefrontal cortex (PFC; including Broca's area) that cognitively controls vocal output of a phylogenetically conserved primary vocal motor network (PVMN) situated in subcortical structures. By comparing the connections between these two systems in human and nonhuman primate brains, we identify crucial biological preadaptations in monkeys for the emergence of a language system in humans. This model of language evolution explains the exclusiveness of non-verbal communication sounds (e.g., cries) in infants with an immature PFC, as well as the observed emergence of non-linguistic vocalizations in adults after frontal lobe pathologies. Copyright © 2016 Elsevier Ltd. All rights reserved.
FoxP2 Expression in a Highly Vocal Teleost Fish with Comparisons to Tetrapods.
Pengra, Ian G G; Marchaterre, Margaret A; Bass, Andrew H
2018-04-19
Motivated by studies of speech deficits in humans, several studies over the past two decades have investigated the potential role of a forkhead domain transcription factor, FoxP2, in the central control of acoustic signaling/vocalization among vertebrates. Comparative neuroanatomical studies that mainly include mammalian and avian species have mapped the distribution of FoxP2 expression in multiple brain regions that imply a greater functional significance beyond vocalization that might be shared broadly across vertebrate lineages. To date, reports for teleost fish have been limited in number and scope to nonvocal species. Here, we map the neuroanatomical distribution of FoxP2 mRNA expression in a highly vocal teleost, the plainfin midshipman (Porichthys notatus). We report an extensive overlap between FoxP2 expression and vocal, auditory, and steroid-signaling systems with robust expression at multiple sites in the telencephalon, the preoptic area, the diencephalon, and the midbrain. Label was far more restricted in the hindbrain though robust in one region of the reticular formation. A comparison with other teleosts and tetrapods suggests an evolutionarily conserved FoxP2 phenotype important to vocal-acoustic and, more broadly, sensorimotor function among vertebrates. © 2018 S. Karger AG, Basel.
The role of auditory and kinaesthetic feedback mechanisms on phonatory stability in children.
Rathna Kumar, S B; Azeem, Suhail; Choudhary, Abhishek Kumar; Prakash, S G R
2013-12-01
Auditory feedback plays an important role in phonatory control. When auditory feedback is disrupted, various changes are observed in vocal motor control. Vocal intensity and fundamental frequency (F0) levels tend to increase in response to auditory masking. Because of the close reflexive links between the auditory and phonatory systems, it is likely that phonatory stability may be disrupted when auditory feedback is disrupted or altered. However, studies on phonatory stability under auditory masking condition in adult subjects showed that most of the subjects maintained normal levels of phonatory stability. The authors in the earlier investigations suggested that auditory feedback is not the sole contributor to vocal motor control and phonatory stability, a complex neuromuscular reflex system known as kinaesthetic feedback may play a role in controlling phonatory stability when auditory feedback is disrupted or lacking. This proposes the need to further investigate this phenomenon as to whether children show similar patterns of phonatory stability under auditory masking since their neuromotor systems are still at developmental stage, less mature and are less resistant to altered auditory feedback than adults. A total of 40 normal hearing and speaking children (20 male and 20 female) between the age group of 6 and 8 years participated as subjects. The acoustic parameters such as shimmer, jitter and harmonic-to-noise ratio (HNR) were measures and compared between no masking condition (0 dB ML) and masking condition (90 dB ML). Despite the neuromotor systems being less mature in children and less resistant than adults to altered auditory feedback, most of the children in the study demonstrated increased phonatory stability which was reflected by reduced shimmer, jitter and increased HNR values. This study implicates that most of the children demonstrate well established patterns of kinaesthetic feedback, which might have allowed them to maintain normal levels of vocal motor control even in the presence of disturbed auditory feedback. Hence, it can be concluded that children also exhibit kinaesthetic feedback mechanism to control phonatory stability when auditory feedback is disrupted which in turn highlights the importance of kinaesthetic feedback to be included in the therapeutic/intervention approaches for children with hearing and neurogenic speech deficits.
The Relations of Mothers' Controlling Vocalizations to Children's Intrinsic Motivation.
ERIC Educational Resources Information Center
Deci, Edward L.; And Others
1993-01-01
Coded maternal vocalizations during videotaped play sessions of mothers and their six- or seven-year-old children. Children's intrinsic motivation was assessed by observing children's play when they were alone in a room. Found a negative relationship between maternal controlling vocalizations and children's intrinsic motivation. (MM)
Lee, Shao-Hsuan; Hsiao, Tzu-Yu; Lee, Guo-She
2015-06-01
Sustained vocalizations of vowels [a], [i], and syllable [mə] were collected in twenty normal-hearing individuals. On vocalizations, five conditions of different audio-vocal feedback were introduced separately to the speakers including no masking, wearing supra-aural headphones only, speech-noise masking, high-pass noise masking, and broad-band-noise masking. Power spectral analysis of vocal fundamental frequency (F0) was used to evaluate the modulations of F0 and linear-predictive-coding was used to acquire first two formants. The results showed that while the formant frequencies were not significantly shifted, low-frequency modulations (<3 Hz) of F0 significantly increased with reduced audio-vocal feedback across speech sounds and were significantly correlated with auditory awareness of speakers' own voices. For sustained speech production, the motor speech controls on F0 may depend on a feedback mechanism while articulation should rely more on a feedforward mechanism. Power spectral analysis of F0 might be applied to evaluate audio-vocal control for various hearing and neurological disorders in the future. Copyright © 2015 Elsevier B.V. All rights reserved.
Hansen, Jennifer K; Thibeault, Susan L; Walsh, Jennifer F; Shu, Xiao Zheng; Prestwich, Glenn D
2005-09-01
A prospective, controlled animal study was performed to determine whether the use of injectable, chemically modified hyaluronic acid (HA) derivatives at the time of intentional vocal fold resection might facilitate wound repair and preserve the unique viscoelastic properties of the vocal fold extracellular matrix. We performed bilateral vocal fold biopsies on 33 rabbits. Two groups of rabbits were unilaterally treated with 2 different HA derivatives--Carbylan-SX and HA-DTPH-PEGDA--at the time of resection. Saline was injected as a control into the contralateral fold. The animals were painlessly sacrificed 3 weeks after biopsy and injection. The outcomes measured included histologic fibrosis level, tissue HA level, and tissue viscosity and elasticity. The Carbylan-SX-treated vocal folds were found to have significantly less fibrosis than the saline-treated controls. The levels of HA in the treated vocal folds were not significantly different from those in the controls at 3 weeks as measured by enzyme-linked immunosorbent assay. The Carbylan-SX-treated vocal folds had significantly improved biomechanical properties of elasticity and viscosity. The HA-DTPH-PEGDA injections yielded significantly improved viscosity, but not elasticity. Prophylactic in vivo manipulation of the extracellular matrix with an injectable Carbylan-SX hydrogel appears to induce vocal fold tissue regeneration to yield optimal tissue composition and biomechanical properties favorable for phonation.
Sielska-Badurek, Ewelina; Osuch-Wójcikiewicz, Ewa; Sobol, Maria; Kazanecka, Ewa; Niemczyk, Kazimierz
2017-01-01
This study investigated vocal function knowledge and vocal tract sensorimotor self-awareness and the impact of functional voice rehabilitation on vocal function knowledge and self-awareness. This is a prospective, randomized study. Twenty singers (study group [SG]) completed a questionnaire before and after functional voice rehabilitation. Twenty additional singers, representing the control group, also completed the questionnaire without functional voice rehabilitation at a 3-month interval. The questionnaire consisted of three parts. The first part evaluated the singers' attitude to the anatomical and physiological knowledge of the vocal tract and their self-esteem of the knowledge level. The second part assessed the theoretical knowledge of the singers' vocal tract physiology. The third part of the questionnaire assessed singers' sensorimotor self-awareness of the vocal tract. The results showed that most singers indicated that knowledge of the vocal tract's anatomy and physiology is useful (59% SG, 67% control group). However, 75% of all participants defined their knowledge of the vocal tract's anatomy and physiology as weak or inadequate. In the SG, vocal function knowledge at the first assessment was 45%. After rehabilitation, the level increased to 67.7%. Vocal tract sensorimotor self-awareness initially was 38.9% in SG but rose to 66.7%. Findings of the study suggest that classical singers lack knowledge about the physiology of the vocal mechanism, especially the breathing patterns. In addition, they have low sensorimotor self-awareness of their vocal tract. The results suggest that singers would benefit from receiving services from phoniatrists and speech-language pathologists during their voice training. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Fujimaki, Yoko; Tsunoda, Koichi; Kobayashi, Rika; Tonghyo, Chong; Tanaka, Fujinobu; Kuroda, Hiroyuki; Numata, Tsutomu; Ishii, Toyota; Kuroda, Reiko; Masuda, Sawako; Hashimoto, Sho; Misawa, Hayato; Shindo, Naoko; Mori, Takahiro; Mori, Hiroko; Uchiyama, Naoki; Kamei, Yuichirou; Tanaka, Masashi; Hamaya, Hironobu; Funatsuki, Shingo; Usui, Satoko; Ito, Ikuno; Hamada, Kohei; Shindo, Akihito; Tokumaru, Yutaka; Morita, Yoko; Ueha, Rumi; Nito, Takaharu; Kikuta, Shu; Sekimoto, Sotaro; Kondo, Kenji; Sakamoto, Takashi; Itoh, Kenji; Yamasoba, Tatsuya; Matsumoto, Sumio
2017-08-01
To evaluate the effect of a self-controlled vocal exercise in elderly people with glottal closure insufficiency. Parallel-arm, individual randomized controlled trial. Patients who visited one of 10 medical centers under the National Hospital Organization group in Japan for the first time, aged 60 years or older, complaining of aspiration or hoarseness, and endoscopically confirmed to have glottal closure insufficiency owing to vocal cord atrophy, were enrolled in this study. They were randomly assigned to an intervention or a control group. The patients of the intervention group were given guidance and a DVD about a self-controlled vocal exercise. The maximum phonation time which is a measure of glottal closure was evaluated, and the number of patients who developed pneumonia during the six months was compared between the two groups. Of the 543 patients enrolled in this trial, 259 were allocated into the intervention group and 284 into the control; 60 of the intervention group and 75 of the control were not able to continue the trial. A total of 199 patients (age 73.9 ±7.25 years) in the intervention group and 209 (73.3 ±6.68 years) in the control completed the six-month trial. Intervention of the self-controlled vocal exercise extended the maximum phonation time significantly ( p < 0.001). There were two hospitalizations for pneumonia in the intervention group and 18 in the control group, representing a significant difference ( p < 0.001). The self-controlled vocal exercise allowed patients to achieve vocal cord adduction and improve glottal closure insufficiency, which reduced the rate of hospitalization for pneumonia significantly. gov Identifier-UMIN000015567.
Hertegård, S; Dahlqvist, A; Goodyer, E
2006-07-01
The scarring model resulted in significant damage and elevated viscoelasticity of the lamina propria. Hyaluronan preparations may alter viscoelasticity in scarred rabbit vocal folds. Vocal fold scarring results in stiffness of the lamina propria and severe voice problems. The aims of this study were to examine the degree of scarring achieved in the experiment and to measure the viscoelastic properties after injection of hyaluronan in rabbit vocal folds. Twenty-two vocal folds from 15 New Zealand rabbits were scarred, 8 vocal folds were controls. After 8 weeks 12 of the scarred vocal folds received injections with 2 types of cross-linked hyaluronan products and 10 scarred folds were injected with saline. After 11 more weeks the animals were sacrificed. After dissection, 15 vocal folds were frozen for viscoelastic measurements, whereas 14 vocal folds were prepared and stained. Measurements were made of the lamina propria thickness. Viscoelasticity was measured on intact vocal folds with a linear skin rheometer (LSR) adapted to laryngeal measurements. Measurements on the digitized slides showed a thickened lamina propria in the scarred samples as compared with the normal vocal folds (p<0.05). The viscoelastic analysis showed a tendency to stiffening of the scarred vocal folds as compared with the normal controls (p=0.05). There was large variation in stiffness between the two injected hyaluronan products.
Reduced auditory processing capacity during vocalization in children with Selective Mutism.
Arie, Miri; Henkin, Yael; Lamy, Dominique; Tetin-Schneider, Simona; Apter, Alan; Sadeh, Avi; Bar-Haim, Yair
2007-02-01
Because abnormal Auditory Efferent Activity (AEA) is associated with auditory distortions during vocalization, we tested whether auditory processing is impaired during vocalization in children with Selective Mutism (SM). Participants were children with SM and abnormal AEA, children with SM and normal AEA, and normally speaking controls, who had to detect aurally presented target words embedded within word lists under two conditions: silence (single task), and while vocalizing (dual task). To ascertain specificity of auditory-vocal deficit, effects of concurrent vocalizing were also examined during a visual task. Children with SM and abnormal AEA showed impaired auditory processing during vocalization relative to children with SM and normal AEA, and relative to control children. This impairment is specific to the auditory modality and does not reflect difficulties in dual task per se. The data extends previous findings suggesting that deficient auditory processing is involved in speech selectivity in SM.
Physical mechanisms may be as important as brain mechanisms in evolution of speech.
de Boer, Bart; Perlman, Marcus
2014-12-01
We present two arguments why physical adaptations for vocalization may be as important as neural adaptations. First, fine control over vocalization is not easy for physical reasons, and modern humans may be exceptional. Second, we present an example of a gorilla that shows rudimentary voluntary control over vocalization, indicating that some neural control is already shared with great apes.
Bicarbonate Availability for Vocal Fold Epithelial Defense to Acidic Challenge
Durkes, Abigail; Sivasankar, M. Preeti
2014-01-01
Objectives Bicarbonate is critical for acid-base tissue homeostasis. In this study we investigated the role of bicarbonate ion transport in vocal fold epithelial defense to acid challenges. Acidic insults to the larynx are common in gastric reflux, carcinogenesis and metastasis, and acute inflammation. Methods Ion transport was measured in viable, porcine vocal fold epithelium. First, 18 vocal folds were exposed to either the carbonic anhydrase antagonist acetazolamide or to vehicle. Second, 32 vocal folds were exposed to either a control buffer or a bicarbonate-free buffer on their luminal or basolateral surface or both. Third, vocal folds were challenged with acid in the presence of bicarbonate-free or control buffer. Results The vocal fold transepithelial resistance was greater than 300 Ω*cm2, suggesting robust barrier integrity. Ion transport did not change after exposure to acetazolamide (p > 0.05). Exposure to bicarbonate-free buffer did not compromise vocal fold ion transport (p > 0.05). Ion transport increased after acid challenge. This increase approached statistical significance and was the greatest for the control buffer and for the bicarbonate-free buffer applied to the basolateral surface. Conclusions Bicarbonate secretion may contribute to vocal fold defense against acid challenge. Our data offer a potential novel role for bicarbonate as a therapeutic agent to reduce pH abnormalities in the larynx and prevent associated pathological changes. PMID:24574427
Hemispheric processing of vocal emblem sounds.
Neumann-Werth, Yael; Levy, Erika S; Obler, Loraine K
2013-01-01
Vocal emblems, such as shh and brr, are speech sounds that have linguistic and nonlinguistic features; thus, it is unclear how they are processed in the brain. Five adult dextral individuals with left-brain damage and moderate-severe Wernicke's aphasia, five adult dextral individuals with right-brain damage, and five Controls participated in two tasks: (1) matching vocal emblems to photographs ('picture task') and (2) matching vocal emblems to verbal translations ('phrase task'). Cross-group statistical analyses on items on which the Controls performed at ceiling revealed lower accuracy by the group with left-brain damage (than by Controls) on both tasks, and lower accuracy by the group with right-brain damage (than by Controls) on the picture task. Additionally, the group with left-brain damage performed significantly less accurately than the group with right-brain damage on the phrase task only. Findings suggest that comprehension of vocal emblems recruits more left- than right-hemisphere processing.
Modification of spectral features by nonhuman primates
Weiss, Daniel J.; Hotchkin, Cara F.; Parks, Susan E.
2017-01-01
Ackermann et al. discuss the lack of evidence for vocal control in nonhuman primates. We suggest that nonhuman primates may be capable of achieving greater vocal control than previously supposed. In support of this assertion, we discuss new evidence that nonhuman primates are capable of modifying spectral features in their vocalizations. PMID:25514964
The Effect of Vocal Training Methods on Improving Turkish Accent Defects
ERIC Educational Resources Information Center
Aycan, Kivanc; Evren, Gul Fahriye
2016-01-01
Problem Statement: Despite analyses of how vocal training methods can correct or improve Turkish-language accent defects, for most voice educators, the most important methods continue to be breathe management control and correct vocalization exercises. We therefore sought to demonstrate the relationship of song lyrics to breathe control, accent…
Primate feedstock for the evolution of consonants.
Lameira, Adriano R; Maddieson, Ian; Zuberbühler, Klaus
2014-02-01
The evolution of speech remains an elusive scientific problem. A widespread notion is that vocal learning, underlined by vocal-fold control, is a key prerequisite for speech evolution. Although present in birds and non-primate mammals, vocal learning is ostensibly absent in non-human primates. Here we argue that the main road to speech evolution has been through controlling the supralaryngeal vocal tract, for which we find evidence for evolutionary continuity within the great apes. Copyright © 2013 Elsevier Ltd. All rights reserved.
Audio-vocal system regulation in children with autism spectrum disorders.
Russo, Nicole; Larson, Charles; Kraus, Nina
2008-06-01
Do children with autism spectrum disorders (ASD) respond similarly to perturbations in auditory feedback as typically developing (TD) children? Presentation of pitch-shifted voice auditory feedback to vocalizing participants reveals a close coupling between the processing of auditory feedback and vocal motor control. This paradigm was used to test the hypothesis that abnormalities in the audio-vocal system would negatively impact ASD compensatory responses to perturbed auditory feedback. Voice fundamental frequency (F(0)) was measured while children produced an /a/ sound into a microphone. The voice signal was fed back to the subjects in real time through headphones. During production, the feedback was pitch shifted (-100 cents, 200 ms) at random intervals for 80 trials. Averaged voice F(0) responses to pitch-shifted stimuli were calculated and correlated with both mental and language abilities as tested via standardized tests. A subset of children with ASD produced larger responses to perturbed auditory feedback than TD children, while the other children with ASD produced significantly lower response magnitudes. Furthermore, robust relationships between language ability, response magnitude and time of peak magnitude were identified. Because auditory feedback helps to stabilize voice F(0) (a major acoustic cue of prosody) and individuals with ASD have problems with prosody, this study identified potential mechanisms of dysfunction in the audio-vocal system for voice pitch regulation in some children with ASD. Objectively quantifying this deficit may inform both the assessment of a subgroup of ASD children with prosody deficits, as well as remediation strategies that incorporate pitch training.
Benichov, Jonathan I; Globerson, Eitan; Tchernichovski, Ofer
2016-01-01
Humans and oscine songbirds share the rare capacity for vocal learning. Songbirds have the ability to acquire songs and calls of various rhythms through imitation. In several species, birds can even coordinate the timing of their vocalizations with other individuals in duets that are synchronized with millisecond-accuracy. It is not known, however, if songbirds can perceive rhythms holistically nor if they are capable of spontaneous entrainment to complex rhythms, in a manner similar to humans. Here we review emerging evidence from studies of rhythm generation and vocal coordination across songbirds and humans. In particular, recently developed experimental methods have revealed neural mechanisms underlying the temporal structure of song and have allowed us to test birds' abilities to predict the timing of rhythmic social signals. Surprisingly, zebra finches can readily learn to anticipate the calls of a "vocal robot" partner and alter the timing of their answers to avoid jamming, even in reference to complex rhythmic patterns. This capacity resembles, to some extent, human predictive motor response to an external beat. In songbirds, this is driven, at least in part, by the forebrain song system, which controls song timing and is essential for vocal learning. Building upon previous evidence for spontaneous entrainment in human and non-human vocal learners, we propose a comparative framework for future studies aimed at identifying shared mechanism of rhythm production and perception across songbirds and humans.
ERIC Educational Resources Information Center
Riede, Tobias; Goller, Franz
2010-01-01
Song production in songbirds is a model system for studying learned vocal behavior. As in humans, bird phonation involves three main motor systems (respiration, vocal organ and vocal tract). The avian respiratory mechanism uses pressure regulation in air sacs to ventilate a rigid lung. In songbirds sound is generated with two independently…
Klemuk, Sarah A.; Chen, Xia; Quinchia Johnson, Beatriz H.
2009-01-01
Objectives To determine if the utilization of injectable chemically-modified hyaluronan (HA) derivative at the time of intentional vocal fold resection may facilitate wound repair and preserve the unique viscoelastic properties of the extracellular matrix and lamina propria 6 months after treatment. Study Design Prospective, controlled animal study. Methods Twelve rabbit vocal folds were biopsied bilaterally, and the left side of vocal fold was treated with Extracel, an injectable, chemically-modified HA derivative, and the right side of vocal fold was injected with saline as control at the time of resection. Animals were sacrificed six months after biopsy and injection. Outcomes measured include transcription levels for procollagen, fibronectin, fibromodulin, TGF-β1, hyaluronan synthase and hyaluronidase and tissue biomechanics -- viscosity and elasticity. Results Extracel treated vocal folds were found to have significantly less fibrosis than saline treated controls. Extracel treated vocal folds had significantly improved biomechanical properties of elasticity and viscosity. Significantly decreased levels of fibronectin, fibromodulin, TGF-β1, procollagen I and hyaluronan synthase were measured. Conclusions Prophylactic in vivo manipulation of the extracellular matrix with an injectable HA hydrogel appears to induce vocal fold tissue regeneration to yield improved tissue composition and biomechanical properties at 6 months. PMID:20456912
Monkey vocal tracts are speech-ready.
Fitch, W Tecumseh; de Boer, Bart; Mathur, Neil; Ghazanfar, Asif A
2016-12-01
For four decades, the inability of nonhuman primates to produce human speech sounds has been claimed to stem from limitations in their vocal tract anatomy, a conclusion based on plaster casts made from the vocal tract of a monkey cadaver. We used x-ray videos to quantify vocal tract dynamics in living macaques during vocalization, facial displays, and feeding. We demonstrate that the macaque vocal tract could easily produce an adequate range of speech sounds to support spoken language, showing that previous techniques based on postmortem samples drastically underestimated primate vocal capabilities. Our findings imply that the evolution of human speech capabilities required neural changes rather than modifications of vocal anatomy. Macaques have a speech-ready vocal tract but lack a speech-ready brain to control it.
Guo, Zhiqiang; Wu, Xiuqin; Li, Weifeng; Jones, Jeffery A; Yan, Nan; Sheft, Stanley; Liu, Peng; Liu, Hanjun
2017-10-25
Although working memory (WM) is considered as an emergent property of the speech perception and production systems, the role of WM in sensorimotor integration during speech processing is largely unknown. We conducted two event-related potential experiments with female and male young adults to investigate the contribution of WM to the neurobehavioural processing of altered auditory feedback during vocal production. A delayed match-to-sample task that required participants to indicate whether the pitch feedback perturbations they heard during vocalizations in test and sample sequences matched, elicited significantly larger vocal compensations, larger N1 responses in the left middle and superior temporal gyrus, and smaller P2 responses in the left middle and superior temporal gyrus, inferior parietal lobule, somatosensory cortex, right inferior frontal gyrus, and insula compared with a control task that did not require memory retention of the sequence of pitch perturbations. On the other hand, participants who underwent extensive auditory WM training produced suppressed vocal compensations that were correlated with improved auditory WM capacity, and enhanced P2 responses in the left middle frontal gyrus, inferior parietal lobule, right inferior frontal gyrus, and insula that were predicted by pretraining auditory WM capacity. These findings indicate that WM can enhance the perception of voice auditory feedback errors while inhibiting compensatory vocal behavior to prevent voice control from being excessively influenced by auditory feedback. This study provides the first evidence that auditory-motor integration for voice control can be modulated by top-down influences arising from WM, rather than modulated exclusively by bottom-up and automatic processes. SIGNIFICANCE STATEMENT One outstanding question that remains unsolved in speech motor control is how the mismatch between predicted and actual voice auditory feedback is detected and corrected. The present study provides two lines of converging evidence, for the first time, that working memory cannot only enhance the perception of vocal feedback errors but also exert inhibitory control over vocal motor behavior. These findings represent a major advance in our understanding of the top-down modulatory mechanisms that support the detection and correction of prediction-feedback mismatches during sensorimotor control of speech production driven by working memory. Rather than being an exclusively bottom-up and automatic process, auditory-motor integration for voice control can be modulated by top-down influences arising from working memory. Copyright © 2017 the authors 0270-6474/17/3710324-11$15.00/0.
Effects of speech style, room acoustics, and vocal fatigue on vocal effort
Bottalico, Pasquale; Graetzer, Simone; Hunter, Eric J.
2016-01-01
Vocal effort is a physiological measure that accounts for changes in voice production as vocal loading increases. It has been quantified in terms of sound pressure level (SPL). This study investigates how vocal effort is affected by speaking style, room acoustics, and short-term vocal fatigue. Twenty subjects were recorded while reading a text at normal and loud volumes in anechoic, semi-reverberant, and reverberant rooms in the presence of classroom babble noise. The acoustics in each environment were modified by creating a strong first reflection in the talker position. After each task, the subjects answered questions addressing their perception of the vocal effort, comfort, control, and clarity of their own voice. Variation in SPL for each subject was measured per task. It was found that SPL and self-reported effort increased in the loud style and decreased when the reflective panels were present and when reverberation time increased. Self-reported comfort and control decreased in the loud style, while self-reported clarity increased when panels were present. The lowest magnitude of vocal fatigue was experienced in the semi-reverberant room. The results indicate that early reflections may be used to reduce vocal effort without modifying reverberation time. PMID:27250179
Effects of Operant Discrimination Training on the Vocalizations of Nonverbal Children with Autism
ERIC Educational Resources Information Center
Lepper, Tracy L.; Petursdottir, Anna Ingeborg; Esch, Barbara E.
2013-01-01
We evaluated the effects of operant discrimination training (ODT) on the vocalizations of 3 boys with autism. We compared ODT to a stimulus-stimulus pairing (SSP) condition and a control condition in an adapted alternating-treatments design. ODT increased the target vocalizations of all participants compared to the control condition, and its…
A Computational Study of Vocal Fold Dehydration During Phonation.
Wu, Liang; Zhang, Zhaoyan
2017-12-01
While vocal fold dehydration is often considered an important factor contributing to vocal fatigue, it still remains unclear whether vocal fold vibration alone is able to induce severe dehydration that has a noticeable effect on phonation and perceived vocal effort. A three-dimensional model was developed to investigate vocal fold systemic dehydration and surface dehydration during phonation. Based on the linear poroelastic theory, the model considered water resupply from blood vessels through the lateral boundary, water movement within the vocal folds, water exchange between the vocal folds and the surface liquid layer through the epithelium, and surface fluid accumulation and discharge to the glottal airway. Parametric studies were conducted to investigate water loss within the vocal folds and from the surface after a 5-min sustained phonation under different permeability and vibration conditions. The results showed that the dehydration generally increased with increasing vibration amplitude, increasing epithelial permeability, and reduced water resupply. With adequate water resupply, a large-amplitude vibration can induce an overall systemic dehydration as high as 3%. The distribution of water loss within the vocal folds was non-uniform, and a local dehydration higher than 5% was observed even under conditions of a low overall systemic dehydration (<1%). Such high level of water loss may severely affect tissue properties, muscular functions, and phonations characteristics. In contrast, water loss of the surface liquid layer was generally an order of magnitude higher than water loss inside the vocal folds, indicating that the surface dehydration level is likely not a good indicator of the systemic dehydration.
Liu, Hanjun; Wang, Emily Q.; Chen, Zhaocong; Liu, Peng; Larson, Charles R.; Huang, Dongfeng
2010-01-01
The purpose of this cross-language study was to examine whether the online control of voice fundamental frequency (F0) during vowel phonation is influenced by language experience. Native speakers of Cantonese and Mandarin, both tonal languages spoken in China, participated in the experiments. Subjects were asked to vocalize a vowel sound ∕u∕ at their comfortable habitual F0, during which their voice pitch was unexpectedly shifted (±50, ±100, ±200, or ±500 cents, 200 ms duration) and fed back instantaneously to them over headphones. The results showed that Cantonese speakers produced significantly smaller responses than Mandarin speakers when the stimulus magnitude varied from 200 to 500 cents. Further, response magnitudes decreased along with the increase in stimulus magnitude in Cantonese speakers, which was not observed in Mandarin speakers. These findings suggest that online control of voice F0 during vocalization is sensitive to language experience. Further, systematic modulations of vocal responses across stimulus magnitude were observed in Cantonese speakers but not in Mandarin speakers, which indicates that this highly automatic feedback mechanism is sensitive to the specific tonal system of each language. PMID:21218905
A Multidimensional Study of Vocal Function Following Radiation Therapy for Laryngeal Cancers.
Angadi, Vrushali; Dressler, Emily; Stemple, Joseph
2017-06-01
Radiation therapy (XRT) has proven to be an effective curative modality in the treatment of laryngeal cancers. However, XRT also has deleterious effects on vocal function. To demonstrate the multidimensional nature of deficits in vocal function as a result of radiation therapy for laryngeal cancer. Cohort study. Vocal function parameters were chosen from the 5 domains of voice assessment to complete a multidimensional assessment battery. Adults irradiated (XRT group) for laryngeal cancers were compared to a control group of individuals with no history of head and neck cancers or radiation therapy. The control group was matched in age, sex, and pack years of smoking. Eighteen participants were recruited for the study. The XRT group demonstrated significantly worse clinical values as compared to the control group across select parameters in the each of the 5 domains of voice assessment. Radiation therapy for laryngeal cancers results in multidimensional deficits in vocal function. Notably, these deficits persist long term. In the present study sample, multidimensional deficits were persistent 2 to 7 years following completion of XRT. The observed multidimensional persistent vocal difficulties highlight the importance of vocal rehabilitation in the irradiated larynx cancer population.
Birds, primates, and spoken language origins: behavioral phenotypes and neurobiological substrates
Petkov, Christopher I.; Jarvis, Erich D.
2012-01-01
Vocal learners such as humans and songbirds can learn to produce elaborate patterns of structurally organized vocalizations, whereas many other vertebrates such as non-human primates and most other bird groups either cannot or do so to a very limited degree. To explain the similarities among humans and vocal-learning birds and the differences with other species, various theories have been proposed. One set of theories are motor theories, which underscore the role of the motor system as an evolutionary substrate for vocal production learning. For instance, the motor theory of speech and song perception proposes enhanced auditory perceptual learning of speech in humans and song in birds, which suggests a considerable level of neurobiological specialization. Another, a motor theory of vocal learning origin, proposes that the brain pathways that control the learning and production of song and speech were derived from adjacent motor brain pathways. Another set of theories are cognitive theories, which address the interface between cognition and the auditory-vocal domains to support language learning in humans. Here we critically review the behavioral and neurobiological evidence for parallels and differences between the so-called vocal learners and vocal non-learners in the context of motor and cognitive theories. In doing so, we note that behaviorally vocal-production learning abilities are more distributed than categorical, as are the auditory-learning abilities of animals. We propose testable hypotheses on the extent of the specializations and cross-species correspondences suggested by motor and cognitive theories. We believe that determining how spoken language evolved is likely to become clearer with concerted efforts in testing comparative data from many non-human animal species. PMID:22912615
Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update
Mehta, Daryush D.; Van Stan, Jarrad H.; Zañartu, Matías; Ghassemi, Marzyeh; Guttag, John V.; Espinoza, Víctor M.; Cortés, Juan P.; Cheyne, Harold A.; Hillman, Robert E.
2015-01-01
Many common voice disorders are chronic or recurring conditions that are likely to result from inefficient and/or abusive patterns of vocal behavior, referred to as vocal hyperfunction. The clinical management of hyperfunctional voice disorders would be greatly enhanced by the ability to monitor and quantify detrimental vocal behaviors during an individual’s activities of daily life. This paper provides an update on ongoing work that uses a miniature accelerometer on the neck surface below the larynx to collect a large set of ambulatory data on patients with hyperfunctional voice disorders (before and after treatment) and matched-control subjects. Three types of analysis approaches are being employed in an effort to identify the best set of measures for differentiating among hyperfunctional and normal patterns of vocal behavior: (1) ambulatory measures of voice use that include vocal dose and voice quality correlates, (2) aerodynamic measures based on glottal airflow estimates extracted from the accelerometer signal using subject-specific vocal system models, and (3) classification based on machine learning and pattern recognition approaches that have been used successfully in analyzing long-term recordings of other physiological signals. Preliminary results demonstrate the potential for ambulatory voice monitoring to improve the diagnosis and treatment of common hyperfunctional voice disorders. PMID:26528472
Yosida, Shigeto; Okanoya, Kazuo
2012-02-01
Generation of the motor patterns of emotional sounds in mammals occurs in the periaqueductal gray matter of the midbrain and is not directly controlled by the cortex. The medial frontal cortex indirectly controls vocalizations, based on the recognition of social context. We examined whether the medial frontal cortex was responsible for antiphonal vocalization, or turn-taking, in naked mole-rats. In normal turn-taking, naked mole-rats vocalize more frequently to dominant individuals than to subordinate ones. Bilateral lesions of the medial frontal cortex disrupted differentiation of call rates to the stimulus animals, which had varied social relationships to the subject. However, medial frontal cortex lesions did not affect either the acoustic properties of the vocalizations or the timing of the vocal exchanges. This suggests that the medial frontal cortex may be involved in social cognition or decision making during turn-taking, while other regions of the brain regulate when animals vocalize and the vocalizations themselves.
A Quantum Approach to Multi-Agent Systems (MAS), Organizations, and Control
2003-06-01
interdependent interactions between individuals represented approximately as vocal harmonic I resonators. Then the growth rate of an organization fits ...A quantum approach to multi-agent systems (MAS), organizations , and control W.F. Lawless Paine College 1235 15th Street Augusta, GA 30901...AND SUBTITLE A quantum approach to multi-agent systems (MAS), organizations , and control 5a. CONTRACT NUMBER 5b. GRANT NUMBER 5c. PROGRAM ELEMENT
Vocalization Subsystem Responses to a Temporarily Induced Unilateral Vocal Fold Paralysis
ERIC Educational Resources Information Center
Croake, Daniel J.; Andreatta, Richard D.; Stemple, Joseph C.
2018-01-01
Purpose: The purpose of this study is to quantify the interactions of the 3 vocalization subsystems of respiration, phonation, and resonance before, during, and after a perturbation to the larynx (temporarily induced unilateral vocal fold paralysis) in 10 vocally healthy participants. Using dynamic systems theory as a guide, we hypothesized that…
Ellis, Jesse M S; Riters, Lauren V
2012-01-01
Transmitting information via communicative signals is integral to interacting with conspecifics, and some species achieve this task by varying vocalizations to reflect context. Although signal variation is critical to social interactions, the underlying neural control has not been studied. In response to a predator, black-capped chickadees (Poecile atricapilla) produce mobbing calls (chick-a-dee calls) with various parameters, some of which convey information about the threat stimulus. We predicted that vocal parameters indicative of threat would be associated with distinct patterns of neuronal activity within brain areas involved in social behavior and those involved in the sensorimotor control of vocal production. To test this prediction, we measured the syntax and structural aspects of chick-a-dee call production in response to a hawk model and assessed the protein product of the immediate early gene FOS in brain regions implicated in context-specific vocal and social behavior. These regions include the medial preoptic area (POM) and lateral septum (LS), as well as regions involved in vocal motor control, including the dorsomedial nucleus of the intercollicular complex and the HVC. We found correlations linking call rate (previously demonstrated to reflect threat) to labeling in the POM and LS. Labeling in the HVC correlated with the number of D notes per call, which may also signal threat level. Labeling in the call control region dorsomedial nucleus was associated with the structure of D notes and the overall number of notes, but not call rate or type of notes produced. These results suggest that the POM and LS may influence attributes of vocalizations produced in response to predators and that the brain region implicated in song control, the HVC, also influences call production. Because variation in chick-a-dee call rate indicates predator threat, we speculate that these areas could integrate with motor control regions to imbue mobbing signals with additional information about threat level. Copyright © 2011 S. Karger AG, Basel.
[Temperament of children with vocal fold nodules].
Wei, Youhua; Wang, Zhinan; Xu, Zhongqiang; Chen, Ping; Hao, Lili
2009-11-01
To examine the temperament of children with vocal fold nodules. To compare the temperament dimension and temperamental types of 42 children with vocal fold nodules with 46 vocally normal children, using Chinese children's Temperament Problem Screening system (CCTPSs). The children with vocal fold nodules differed significantly from the comparison group in their temperament dimension's adaptability, intensity of reaction, mood value, persistency and temperamental types. There are more difficult and slow-to-warm-up children in patients with vocal fold nodules than vocally normal children.
Konishi, Hanako; Mizota, Toshiyuki; Fukuda, Kazuhiko
2015-06-01
We report a case of persistent bilateral vocal cord paralysis which developed after spine surgery under general anesthesia in a patient with multiple system atrophy. A 64-year-old woman was scheduled to receive spinal fusion surgery for kyphoscoliosis. She did not have apparent symptoms of vocal cord paralysis such as hoarseness before surgery. The surgery was performed smoothly under general anesthesia with endotracheal intubation. However, immediately after extubation, the patient developed severe upper airway obstruction and was re-intubated. Fiberoptic laryngoscopy revealed bilateral vocal cord abductor paralysis. Vocal cord paralysis did not improve and she received tracheotomy on the 12th day after surgery. She also showed symptoms of autonomic nervous system dysfunction and cerebellar ataxia, and was diagnosed as multiple system atrophy on postoperative day 64. We discuss differential diagnosis of persistent vocal cord paralysis after general anesthesia, and anesthetic management of a patient with multiple system atrophy.
Fukushima, Makoto; Saunders, Richard C; Fujii, Naotaka; Averbeck, Bruno B; Mishkin, Mortimer
2014-01-01
Vocal production is an example of controlled motor behavior with high temporal precision. Previous studies have decoded auditory evoked cortical activity while monkeys listened to vocalization sounds. On the other hand, there have been few attempts at decoding motor cortical activity during vocal production. Here we recorded cortical activity during vocal production in the macaque with a chronically implanted electrocorticographic (ECoG) electrode array. The array detected robust activity in motor cortex during vocal production. We used a nonlinear dynamical model of the vocal organ to reduce the dimensionality of `Coo' calls produced by the monkey. We then used linear regression to evaluate the information in motor cortical activity for this reduced representation of calls. This simple linear model accounted for circa 65% of the variance in the reduced sound representations, supporting the feasibility of using the dynamical model of the vocal organ for decoding motor cortical activity during vocal production.
Thibeault, Susan L; Klemuk, Sarah A; Chen, Xia; Quinchia Johnson, Beatriz H
2011-03-01
To determine if the utilization of injectable chemically modified hyaluronan (HA) derivative at the time of intentional vocal fold resection may facilitate wound repair and preserve the unique viscoelastic properties of the extracellular matrix (ECM) and lamina propria 6 months after treatment. Prospective, controlled animal study. Twelve rabbit vocal folds were biopsied bilaterally, and the left side of vocal fold was treated with Extracel, an injectable, chemically modified HA derivative, and the right side of vocal fold was injected with saline as control at the time of resection. Animals were sacrificed 6 months after biopsy and injection. Outcomes measured include transcription levels for procollagen, fibronectin, fibromodulin, transforming growth factor beta one (TGF-β1), HA synthase, and hyaluronidase, and tissue biomechanics-viscosity and elasticity. Extracel-treated vocal folds were found to have significantly less fibrosis than saline-treated controls. Extracel-treated vocal folds had significantly improved biomechanical properties of elasticity and viscosity. Significantly decreased levels of fibronectin, fibromodulin, TGF-β1, procollagen I, and HA synthase were measured. Prophylactic in vivo manipulation of the ECM with an injectable HA hydrogel appears to induce vocal fold tissue regeneration to yield improved tissue composition and biomechanical properties at 6 months. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Auditory-Motor Control of Vocal Production during Divided Attention: Behavioral and ERP Correlates.
Liu, Ying; Fan, Hao; Li, Jingting; Jones, Jeffery A; Liu, Peng; Zhang, Baofeng; Liu, Hanjun
2018-01-01
When people hear unexpected perturbations in auditory feedback, they produce rapid compensatory adjustments of their vocal behavior. Recent evidence has shown enhanced vocal compensations and cortical event-related potentials (ERPs) in response to attended pitch feedback perturbations, suggesting that this reflex-like behavior is influenced by selective attention. Less is known, however, about auditory-motor integration for voice control during divided attention. The present cross-modal study investigated the behavioral and ERP correlates of auditory feedback control of vocal pitch production during divided attention. During the production of sustained vowels, 32 young adults were instructed to simultaneously attend to both pitch feedback perturbations they heard and flashing red lights they saw. The presentation rate of the visual stimuli was varied to produce a low, intermediate, and high attentional load. The behavioral results showed that the low-load condition elicited significantly smaller vocal compensations for pitch perturbations than the intermediate-load and high-load conditions. As well, the cortical processing of vocal pitch feedback was also modulated as a function of divided attention. When compared to the low-load and intermediate-load conditions, the high-load condition elicited significantly larger N1 responses and smaller P2 responses to pitch perturbations. These findings provide the first neurobehavioral evidence that divided attention can modulate auditory feedback control of vocal pitch production.
Järvinen, Anna; Ng, Rowena; Crivelli, Davide; Neumann, Dirk; Arnold, Andrew J; Woo-VonHoogenstyn, Nicholas; Lai, Philip; Trauner, Doris; Bellugi, Ursula
2016-01-01
Both Williams syndrome (WS) and autism spectrum disorders (ASD) are associated with unusual auditory phenotypes with respect to processing vocal and musical stimuli, which may be shaped by the atypical social profiles that characterize the syndromes. Autonomic nervous system (ANS) reactivity to vocal and musical emotional stimuli was examined in 12 children with WS, 17 children with ASD, and 20 typically developing (TD) children, and related to their level of social functioning. The results of this small-scale study showed that after controlling for between-group differences in cognitive ability, all groups showed similar emotion identification performance across conditions. Additionally, in ASD, lower autonomic reactivity to human voice, and in TD, to musical emotion, was related to more normal social functioning. Compared to TD, both clinical groups showed increased arousal to vocalizations. A further result highlighted uniquely increased arousal to music in WS, contrasted with a decrease in arousal in ASD and TD. The ASD and WS groups exhibited arousal patterns suggestive of diminished habituation to the auditory stimuli. The results are discussed in the context of the clinical presentation of WS and ASD. © 2015 Wiley Periodicals, Inc.
Choi, Jeong-Seok; Kim, Nahn Ju; Klemuk, Sarah; Jang, Yun Ho; Park, In Suh; Ahn, Kyung Hyun; Lim, Jae-Yol; Kim, Young-Mo
2012-09-01
To compare the rheological characteristics of structurally different hyaluronic acid (HA)-based biomaterials that are presently used for phonosurgery and to investigate their influence on the viscoelastic properties of vocal folds after implantation in an in vivo rabbit model. In vitro and in vivo rheometric investigation. Experimental laboratory, Inha and Seoul National Universities. Viscoelastic shear properties of 3 HA-based biomaterials (Rofilan, Restylane, and Reviderm) were measured with a strain-controlled rheometer. These biomaterials were injected into the deep layers of rabbit vocal folds, and viscoelastic moduli of the injected vocal folds were determined 2 months after the injection. The vocal fold specimens were observed using a light microscope and a transmission electron microscope. All HA-based biomaterials showed similar levels of shear viscosity, which were slightly higher than that of human vocal folds reported in previous studies. Compared with noninjected control vocal folds, there were no significant differences in the magnitudes of both elastic shear modulus (G') and viscous modulus (G") of injected vocal folds among all of the materials. Light microscopic images showed that all materials were observed in the deep layers of vocal folds and electron scanning images revealed that injected HA particles were homogeneously distributed in regions of collagenous fibers. HA-based biomaterials could preserve the viscoelastic properties of the vocal folds, when they were injected into vocal folds in an in vivo rabbit model. However, further studies on the influence of the biomaterials on the viscoelasticity of human vocal folds in ECM surroundings are still needed.
Convergent Differential Regulation of Parvalbumin in the Brains of Vocal Learners
Hara, Erina; Rivas, Miriam V.; Ward, James M.; Okanoya, Kazuo; Jarvis, Erich D.
2012-01-01
Spoken language and learned song are complex communication behaviors found in only a few species, including humans and three groups of distantly related birds – songbirds, parrots, and hummingbirds. Despite their large phylogenetic distances, these vocal learners show convergent behaviors and associated brain pathways for vocal communication. However, it is not clear whether this behavioral and anatomical convergence is associated with molecular convergence. Here we used oligo microarrays to screen for genes differentially regulated in brain nuclei necessary for producing learned vocalizations relative to adjacent brain areas that control other behaviors in avian vocal learners versus vocal non-learners. A top candidate gene in our screen was a calcium-binding protein, parvalbumin (PV). In situ hybridization verification revealed that PV was expressed significantly higher throughout the song motor pathway, including brainstem vocal motor neurons relative to the surrounding brain regions of all distantly related avian vocal learners. This differential expression was specific to PV and vocal learners, as it was not found in avian vocal non-learners nor for control genes in learners and non-learners. Similar to the vocal learning birds, higher PV up-regulation was found in the brainstem tongue motor neurons used for speech production in humans relative to a non-human primate, macaques. These results suggest repeated convergent evolution of differential PV up-regulation in the brains of vocal learners separated by more than 65–300 million years from a common ancestor and that the specialized behaviors of learned song and speech may require extra calcium buffering and signaling. PMID:22238614
Precise Motor Control Enables Rapid Flexibility in Vocal Behavior of Marmoset Monkeys.
Pomberger, Thomas; Risueno-Segovia, Cristina; Löschner, Julia; Hage, Steffen R
2018-03-05
Investigating the evolution of human speech is difficult and controversial because human speech surpasses nonhuman primate vocal communication in scope and flexibility [1-3]. Monkey vocalizations have been assumed to be largely innate, highly affective, and stereotyped for over 50 years [4, 5]. Recently, this perception has dramatically changed. Current studies have revealed distinct learning mechanisms during vocal development [6-8] and vocal flexibility, allowing monkeys to cognitively control when [9, 10], where [11], and what to vocalize [10, 12, 13]. However, specific call features (e.g., duration, frequency) remain surprisingly robust and stable in adult monkeys, resulting in rather stereotyped and discrete call patterns [14]. Additionally, monkeys seem to be unable to modulate their acoustic call structure under reinforced conditions beyond natural constraints [15, 16]. Behavioral experiments have shown that monkeys can stop sequences of calls immediately after acoustic perturbation but cannot interrupt ongoing vocalizations, suggesting that calls consist of single impartible pulses [17, 18]. Using acoustic perturbation triggered by the vocal behavior itself and quantitative measures of resulting vocal adjustments, we show that marmoset monkeys are capable of producing calls with durations beyond the natural boundaries of their repertoire by interrupting ongoing vocalizations rapidly after perturbation onset. Our results indicate that marmosets are capable of interrupting vocalizations only at periodic time points throughout calls, further supported by the occurrence of periodically segmented phees. These ideas overturn decades-old concepts on primate vocal pattern generation, indicating that vocalizations do not consist of one discrete call pattern but are built of many sequentially uttered units, like human speech. Copyright © 2018 The Author(s). Published by Elsevier Ltd.. All rights reserved.
A budget of energy transfer in a sustained vocal folds vibration in glottis
NASA Astrophysics Data System (ADS)
Zhang, Lucy; Yang, Jubiao; Krane, Michael
2016-11-01
A set of force and energy balance equations using the control volume approach is derived based on the first principles of physics for a sustained vocal folds vibration in glottis. The control volume analysis is done for compressible airflow in a moving and deforming control volume in the vicinity of the vocal folds. The interaction between laryngeal airflow and vocal folds are successfully simulated using the modified Immersed Finite Element Method (mIFEM), a fully coupled approach to simulate fluid-structure interactions. Detailed mathematical terms are separated out for deeper physical understanding and utilization of mechanical energy is quantified with the derived equation. The results show that majority of energy input is consumed for driving laryngeal airflow, while a smaller portion is for compensating viscous losses in and sustaining the vibration of the vocal folds. We acknowledge the funding support of NIH 2R01DC005642-10A1.
Kuo, Chung-Feng Jeffrey; Chu, Yueng-Hsiang; Wang, Po-Chun; Lai, Chun-Yu; Chu, Wen-Lin; Leu, Yi-Shing; Wang, Hsing-Won
2013-12-01
The human larynx is an important organ for voice production and respiratory mechanisms. The vocal cord is approximated for voice production and open for breathing. The videolaryngoscope is widely used for vocal cord examination. At present, physicians usually diagnose vocal cord diseases by manually selecting the image of the vocal cord opening to the largest extent (abduction), thus maximally exposing the vocal cord lesion. On the other hand, the severity of diseases such as vocal palsy, atrophic vocal cord is largely dependent on the vocal cord closing to the smallest extent (adduction). Therefore, diseases can be assessed by the image of the vocal cord opening to the largest extent, and the seriousness of breathy voice is closely correlated to the gap between vocal cords when closing to the smallest extent. The aim of the study was to design an automatic vocal cord image selection system to improve the conventional selection process by physicians and enhance diagnosis efficiency. Also, due to the unwanted fuzzy images resulting from examination process caused by human factors as well as the non-vocal cord images, texture analysis is added in this study to measure image entropy to establish a screening and elimination system to effectively enhance the accuracy of selecting the image of the vocal cord closing to the smallest extent. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Core and Shell Song Systems Unique to the Parrot Brain
Chakraborty, Mukta; Walløe, Solveig; Nedergaard, Signe; Fridel, Emma E.; Dabelsteen, Torben; Pakkenberg, Bente; Bertelsen, Mads F.; Dorrestein, Gerry M.; Brauth, Steven E.; Durand, Sarah E.; Jarvis, Erich D.
2015-01-01
The ability to imitate complex sounds is rare, and among birds has been found only in parrots, songbirds, and hummingbirds. Parrots exhibit the most advanced vocal mimicry among non-human animals. A few studies have noted differences in connectivity, brain position and shape in the vocal learning systems of parrots relative to songbirds and hummingbirds. However, only one parrot species, the budgerigar, has been examined and no differences in the presence of song system structures were found with other avian vocal learners. Motivated by questions of whether there are important differences in the vocal systems of parrots relative to other vocal learners, we used specialized constitutive gene expression, singing-driven gene expression, and neural connectivity tracing experiments to further characterize the song system of budgerigars and/or other parrots. We found that the parrot brain uniquely contains a song system within a song system. The parrot “core” song system is similar to the song systems of songbirds and hummingbirds, whereas the “shell” song system is unique to parrots. The core with only rudimentary shell regions were found in the New Zealand kea, representing one of the only living species at a basal divergence with all other parrots, implying that parrots evolved vocal learning systems at least 29 million years ago. Relative size differences in the core and shell regions occur among species, which we suggest could be related to species differences in vocal and cognitive abilities. PMID:26107173
Arneodo, Ezequiel M; Perl, Yonatan Sanz; Goller, Franz; Mindlin, Gabriel B
2012-01-01
Because of the parallels found with human language production and acquisition, birdsong is an ideal animal model to study general mechanisms underlying complex, learned motor behavior. The rich and diverse vocalizations of songbirds emerge as a result of the interaction between a pattern generator in the brain and a highly nontrivial nonlinear periphery. Much of the complexity of this vocal behavior has been understood by studying the physics of the avian vocal organ, particularly the syrinx. A mathematical model describing the complex periphery as a nonlinear dynamical system leads to the conclusion that nontrivial behavior emerges even when the organ is commanded by simple motor instructions: smooth paths in a low dimensional parameter space. An analysis of the model provides insight into which parameters are responsible for generating a rich variety of diverse vocalizations, and what the physiological meaning of these parameters is. By recording the physiological motor instructions elicited by a spontaneously singing muted bird and computing the model on a Digital Signal Processor in real-time, we produce realistic synthetic vocalizations that replace the bird's own auditory feedback. In this way, we build a bio-prosthetic avian vocal organ driven by a freely behaving bird via its physiologically coded motor commands. Since it is based on a low-dimensional nonlinear mathematical model of the peripheral effector, the emulation of the motor behavior requires light computation, in such a way that our bio-prosthetic device can be implemented on a portable platform.
The Vocal Tract Organ: A New Musical Instrument Using 3-D Printed Vocal Tracts.
Howard, David M
2017-10-27
The advent and now increasingly widespread availability of 3-D printers is transforming our understanding of the natural world by enabling observations to be made in a tangible manner. This paper describes the use of 3-D printed models of the vocal tract for different vowels that are used to create an acoustic output when stimulated with an appropriate sound source in a new musical instrument: the Vocal Tract Organ. The shape of each printed vocal tract is recovered from magnetic resonance imaging. It sits atop a loudspeaker to which is provided an acoustic L-F model larynx input signal that is controlled by the notes played on a musical instrument digital interface device such as a keyboard. The larynx input is subject to vibrato with extent and frequency adjustable as desired within the ranges usually found for human singing. Polyphonic inputs for choral singing textures can be applied via a single loudspeaker and vocal tract, invoking the approximation of linearity in the voice production system, thereby making multiple vowel stops a possibility while keeping the complexity of the instrument in reasonable check. The Vocal Tract Organ offers a much more human and natural sounding result than the traditional Vox Humana stops found in larger pipe organs, offering the possibility of enhancing pipe organs of the future as well as becoming the basis for a "multi-vowel" chamber organ in its own right. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Tu, Joyce C.
2006-01-01
In the present study, joint-control training was applied when teaching manded selection responses to children with autism. Four vocal children with autism participated in the first experiment, two males (ages seven and eight) and two females (ages seven and nine). The results showed that it was only after object-word naming was trained under joint…
Kleber, Boris; Veit, Ralf; Moll, Christina Valérie; Gaser, Christian; Birbaumer, Niels; Lotze, Martin
2016-06-01
In contrast to instrumental musicians, professional singers do not train on a specific instrument but perfect a motor system that has already been extensively trained during speech motor development. Previous functional imaging studies suggest that experience with singing is associated with enhanced somatosensory-based vocal motor control. However, experience-dependent structural plasticity in vocal musicians has rarely been studied. We investigated voxel-based morphometry (VBM) in 27 professional classical singers and compared gray matter volume in regions of the "singing-network" to an age-matched group of 28 healthy volunteers with no special singing experience. We found right hemispheric volume increases in professional singers in ventral primary somatosensory cortex (larynx S1) and adjacent rostral supramarginal gyrus (BA40), as well as in secondary somatosensory (S2) and primary auditory cortices (A1). Moreover, we found that earlier commencement with vocal training correlated with increased gray-matter volume in S1. However, in contrast to studies with instrumental musicians, this correlation only emerged in singers who began their formal training after the age of 14years, when speech motor development has reached its first plateau. Structural data thus confirm and extend previous functional reports suggesting a pivotal role of somatosensation in vocal motor control with increased experience in singing. Results furthermore indicate a sensitive period for developing additional vocal skills after speech motor coordination has matured. Copyright © 2016 Elsevier Inc. All rights reserved.
Fenzl, Thomas; Schuller, Gerd
2005-01-01
Background Echolocating bats emit vocalizations that can be classified either as echolocation calls or communication calls. Neural control of both types of calls must govern the same pool of motoneurons responsible for vocalizations. Electrical microstimulation in the periaqueductal gray matter (PAG) elicits both communication and echolocation calls, whereas stimulation of the paralemniscal area (PLA) induces only echolocation calls. In both the PAG and the PLA, the current thresholds for triggering natural vocalizations do not habituate to stimuli and remain low even for long stimulation periods, indicating that these structures have relative direct access to the final common pathway for vocalization. This study intended to clarify whether echolocation calls and communication calls are controlled differentially below the level of the PAG via separate vocal pathways before converging on the motoneurons used in vocalization. Results Both structures were probed simultaneously in a single experimental approach. Two stimulation electrodes were chronically implanted within the PAG in order to elicit either echolocation or communication calls. Blockade of the ipsilateral PLA site with iontophoretically application of the glutamate antagonist kynurenic acid did not impede either echolocation or communication calls elicited from the PAG. However, blockade of the contralateral PLA suppresses PAG-elicited echolocation calls but not communication calls. In both cases the blockade was reversible. Conclusion The neural control of echolocation and communication calls seems to be differentially organized below the level of the PAG. The PLA is an essential functional unit for echolocation call control before the descending pathways share again the final common pathway for vocalization. PMID:16053533
Automated Assessment of Child Vocalization Development Using LENA
ERIC Educational Resources Information Center
Richards, Jeffrey A.; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance
2017-01-01
Purpose: To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Method: Assessment was based on full-day audio…
Adipose stromal cells improve healing of vocal fold scar: Morphological and functional evidences.
de Bonnecaze, Guillaume; Chaput, Benoit; Woisard, Virginie; Uro-Coste, Emmanuelle; Swider, Pascal; Vergez, Sebastien; Serrano, Elie; Casteilla, Louis; Planat-Benard, Valerie
2016-08-01
Adipose derived stromal cells (ASCs) are abundant and easy to prepare. Such cells may be useful for treating severe vocal disturbance caused by acute vocal fold scars. Prospective animal experiments with controls. Twenty New-Zealand white rabbits were used in the present study. We evaluated vocal fold healing, with or without injection of autologous ASCs, after acute scarring. A defined lesion was created and the ASCs were immediately injected. Vocal fold regeneration was evaluated histomorphometrically and via viscoelastic analysis using an electrodynamic shaker. Six weeks after ASC injection, vocal folds exhibited significantly less inflammation than control folds (P < 0.005). In addition, hypertrophy of the lamina propria and fibrosis were significantly reduced upon ASC injection (P < 0.02). The decrease in viscoelastic parameters was less important in the ASC injected group compared to the noninjected group (P = 0.08). Injection of autologous ASCs improved vocal fold healing in our preclinical model. Further studies are needed, but this method may be useful in humans. NA. Laryngoscope, 126:E278-E285, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Phonology and Vocal Behavior in Toddlers with Autism Spectrum Disorders
Schoen, Elizabeth; Paul, Rhea; Chawarska, Katyrzyna
2011-01-01
Scientific Abstract The purpose of this study is to examine the phonological and other vocal productions of children, 18-36 months, with autism spectrum disorder (ASD) and to compare these productions to those of age-matched and language-matched controls. Speech samples were obtained from 30 toddlers with ASD, 11 age-matched toddlers and 23 language-matched toddlers during either parent-child or clinician-child play sessions. Samples were coded for a variety of speech-like and non-speech vocalization productions. Toddlers with ASD produced speech-like vocalizations similar to those of language-matched peers, but produced significantly more atypical non-speech vocalizations when compared to both control groups.Toddlers with ASD show speech-like sound production that is linked to their language level, in a manner similar to that seen in typical development. The main area of difference in vocal development in this population is in the production of atypical vocalizations. Findings suggest that toddlers with autism spectrum disorders might not tune into the language model of their environment. Failure to attend to the ambient language environment negatively impacts the ability to acquire spoken language. PMID:21308998
Hertegård, S; Cedervall, J; Svensson, B; Forsberg, K; Maurer, F H J; Vidovska, D; Olivius, P; Ahrlund-Richter, L; Le Blanc, K
2006-07-01
The aim of this study was to analyze the short-term viscoelastic and histologic properties of scarred rabbit vocal folds after injection of human mesenchymal stem cells (MSC) as well as the degree of MSC survival. Because MSCs are antiinflammatory and regenerate mesenchymal tissues, can MSC injection reduce vocal fold scarring after injury? Twelve vocal folds from 10 New Zealand rabbits were scarred by a localized resection and injected with human MSC or saline. Eight vocal folds were left as controls. After 4 weeks, 10 larynges were stained for histology and evaluation of the lamina propria thickness. Collagen type I content was analyzed from six rabbits. MSC survival was analyzed by fluorescent in situ hybridization staining from three rabbits. Viscoelasticity for 10 vocal folds was analyzed in a parallel-plate rheometer. The rheometry on fresh-frozen samples showed decreased dynamic viscosity and lower elastic modulus (P<.01) in the scarred samples injected with MSC as compared with the untreated scarred group. Normal controls had lower dynamic viscosity and elastic modulus as compared with the scarred untreated and treated vocal folds (P<.01). Histologic analysis showed a higher content of collagen type 1 in the scarred samples as compared with the normal vocal folds and with the scarred folds treated with MSC. MSCs remained in all samples analyzed. The treated scarred vocal folds showed persistent MSC. Injection of scarred rabbit vocal folds with MSC rendered improved viscoelastic parameters and less signs of scarring expressed as collagen content in comparison to the untreated scarred vocal folds.
Training of Working Memory Impacts Neural Processing of Vocal Pitch Regulation
Li, Weifeng; Guo, Zhiqiang; Jones, Jeffery A.; Huang, Xiyan; Chen, Xi; Liu, Peng; Chen, Shaozhen; Liu, Hanjun
2015-01-01
Working memory training can improve the performance of tasks that were not trained. Whether auditory-motor integration for voice control can benefit from working memory training, however, remains unclear. The present event-related potential (ERP) study examined the impact of working memory training on the auditory-motor processing of vocal pitch. Trained participants underwent adaptive working memory training using a digit span backwards paradigm, while control participants did not receive any training. Before and after training, both trained and control participants were exposed to frequency-altered auditory feedback while producing vocalizations. After training, trained participants exhibited significantly decreased N1 amplitudes and increased P2 amplitudes in response to pitch errors in voice auditory feedback. In addition, there was a significant positive correlation between the degree of improvement in working memory capacity and the post-pre difference in P2 amplitudes. Training-related changes in the vocal compensation, however, were not observed. There was no systematic change in either vocal or cortical responses for control participants. These findings provide evidence that working memory training impacts the cortical processing of feedback errors in vocal pitch regulation. This enhanced cortical processing may be the result of increased neural efficiency in the detection of pitch errors between the intended and actual feedback. PMID:26553373
Characterization of chronic vocal fold scarring in a rabbit model.
Rousseau, Bernard; Hirano, Shigeru; Chan, Roger W; Welham, Nathan V; Thibeault, Susan L; Ford, Charles N; Bless, Diane M
2004-03-01
The purpose of the current study was to assess the histologic and rheologic properties of the scarred vocal fold lamina propria during a chronic phase of wound repair in a rabbit model. Eighteen rabbit larynges were scarred using a procedure that involved stripping the vocal fold lamina propria down to the thyroarytenoid muscle, using 3-mm microforceps. The approximate dimension of injury to the vocal fold was 3 x 1.5 x 0.5 mm [length x width x depth]. At 6 months postoperatively, histologic analysis of the scarred and control lamina propria in eight of these rabbits was completed for collagen, procollagen, elastin, and hyaluronic acid. Compared with control samples, scarred tissue samples revealed fragmented and disorganized elastin fibers. Additionally, collagen was significantly increased, organized, and formed thick bundles in the scarred vocal fold lamina propria. Measurements of the viscoelastic shear properties of the scarred and control lamina propria in the remaining 10 rabbits revealed increased elastic shear modulus (G') in 8 of 10 scarred samples and increased dynamic viscosity (eta') in 9 of 10 scarred samples. Although rheologic differences were not statistically significant, they revealed that on average, scarred samples were stiffer and more viscous than the normal controls. Histologic data are interpreted as indicating that by 6 months postinjury, the scarred rabbit vocal fold has reached a mature phase of wound repair, characterized by an increased, organized, and thick bundle collagen matrix. Rheologic data are interpreted as providing support for the potential role of increased, thick bundle collagen, and a disorganized elastin network on shear stiffness and dynamic viscosity in the chronic vocal fold scar. Based on these results, a 6-month postoperative time frame is proposed for future studies of chronic vocal fold scarring using the rabbit animal model.
ERIC Educational Resources Information Center
Tardif, Carole; Laine, France; Rodriguez, Melissa; Gepner, Bruno
2007-01-01
This study examined the effects of slowing down presentation of facial expressions and their corresponding vocal sounds on facial expression recognition and facial and/or vocal imitation in children with autism. Twelve autistic children and twenty-four normal control children were presented with emotional and non-emotional facial expressions on…
IMRT for Image-Guided Single Vocal Cord Irradiation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Osman, Sarah O.S., E-mail: s.osman@erasmusmc.nl; Astreinidou, Eleftheria; Boer, Hans C.J. de
2012-02-01
Purpose: We have been developing an image-guided single vocal cord irradiation technique to treat patients with stage T1a glottic carcinoma. In the present study, we compared the dose coverage to the affected vocal cord and the dose delivered to the organs at risk using conventional, intensity-modulated radiotherapy (IMRT) coplanar, and IMRT non-coplanar techniques. Methods and Materials: For 10 patients, conventional treatment plans using two laterally opposed wedged 6-MV photon beams were calculated in XiO (Elekta-CMS treatment planning system). An in-house IMRT/beam angle optimization algorithm was used to obtain the coplanar and non-coplanar optimized beam angles. Using these angles, the IMRTmore » plans were generated in Monaco (IMRT treatment planning system, Elekta-CMS) with the implemented Monte Carlo dose calculation algorithm. The organs at risk included the contralateral vocal cord, arytenoids, swallowing muscles, carotid arteries, and spinal cord. The prescription dose was 66 Gy in 33 fractions. Results: For the conventional plans and coplanar and non-coplanar IMRT plans, the population-averaged mean dose {+-} standard deviation to the planning target volume was 67 {+-} 1 Gy. The contralateral vocal cord dose was reduced from 66 {+-} 1 Gy in the conventional plans to 39 {+-} 8 Gy and 36 {+-} 6 Gy in the coplanar and non-coplanar IMRT plans, respectively. IMRT consistently reduced the doses to the other organs at risk. Conclusions: Single vocal cord irradiation with IMRT resulted in good target coverage and provided significant sparing of the critical structures. This has the potential to improve the quality-of-life outcomes after RT and maintain the same local control rates.« less
Effect on long-term average spectrum of pop singers' vocal warm-up with vocal function exercises.
Guzman, Marco; Angulo, Mabel; Muñoz, Daniel; Mayerhoff, Ross
2013-04-01
Abstract This case-control study aimed to investigate if there is any change on the spectral slope declination immediately after vocal function exercises (VFE) vs traditional vocal warm-up exercises in normal singers. Thirty-eight pop singers with perceptually normal voices were divided into two groups: an experimental group (n = 20) and a control group (n = 18). One single session with VFE for the experimental group and traditional singing warm-up exercises for the control group was applied. Voice was recorded before and after the exercises. The recorded tasks were to read a phonetically balanced text and to sing a song. Long-term average spectrum (LTAS) analysis included alpha ratio, L1-L0 ratio, and singing power ratio (SPR). Acoustic parameters of voice samples pre- and post-training were compared. Comparison between VFE and control group was also performed. Significant changes after treatment included the alpha ratio and singing power ratio in speaking voice, and SPR in the singing voice for VFE group. The traditional vocal warm-up of the control group also showed pre-post changes. Significant differences between VFE group and control group for alpha ratio and SPR were found in speaking voice samples. This study demonstrates that VFE have an immediate effect on the spectrum of the voice, specifically a decrease on the spectral slope declination. The results of this study provide support for the advantageous effect of VFE as vocal warm-up on voice quality.
Neural imaging in songbirds using fiber optic fluorescence microscopy
NASA Astrophysics Data System (ADS)
Nooshabadi, Fatemeh; Hearn, Gentry; Lints, Thierry; Maitland, Kristen C.
2012-02-01
The song control system of juvenile songbirds is an important model for studying the developmental acquisition and generation of complex learned vocal motor sequences, two processes that are fundamental to human speech and language. To understand the neural mechanisms underlying song production, it is critical to characterize the activity of identified neurons in the song control system when the bird is singing. Neural imaging in unrestrained singing birds, although technically challenging, will advance our understanding of neural ensemble coding mechanisms in this system. We are exploring the use of a fiber optic microscope for functional imaging in the brain of behaving and singing birds in order to better understand the contribution of a key brain nucleus (high vocal center nucleus; HVC) to temporal aspects of song motor control. We have constructed a fluorescence microscope with LED illumination, a fiber bundle for transmission of fluorescence excitation and emission light, a ~2x GRIN lens, and a CCD for image acquisition. The system has 2 μm resolution, 375 μm field of view, 200 μm working distance, and 1 mm outer diameter. As an initial characterization of this setup, neurons in HVC were imaged using the fiber optic microscope after injection of quantum dots or fluorescent retrograde tracers into different song nuclei. A Lucid Vivascope confocal microscope was used to confirm the imaging results. Long-term imaging of the activity of these neurons in juvenile birds during singing may lead us to a better understanding of the central motor codes for song and the central mechanism by which auditory experience modifies song motor commands to enable vocal learning and imitation.
Viscoelastic properties of rabbit vocal folds after augmentation.
Hertegård, Stellan; Dahlqvist, Ake; Laurent, Claude; Borzacchiello, Assunta; Ambrosio, Luigi
2003-03-01
Vocal fold function is closely related to tissue viscoelasticity. Augmentation substances may alter the viscoelastic properties of vocal fold tissues and hence their vibratory capacity. We sought to investigate the viscoelastic properties of rabbit vocal folds in vitro after injections of various augmentation substances. Polytetrafluoroethylene (Teflon), cross-linked collagen (Zyplast), and cross-linked hyaluronan, hylan b gel (Hylaform) were injected into the lamina propria and the thyroarytenoid muscle of rabbit vocal folds. Dynamic viscosity of the injected vocal fold as a function of frequency was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Vocal fold samples injected with hylan b gel showed the lowest dynamic viscosity, quite close to noninjected control samples. Vocal folds injected with polytetrafluoroethylene showed the highest dynamic viscosity followed by the collagen samples. The data indicated that hylan b gel in short-term renders the most natural viscoelastic properties to the vocal fold among the substances tested. This is of importance to restore/preserve the vibratory capacity of the vocal folds when glottal insufficiency is treated with injections.
Chan, Roger W; Rodriguez, Maritza L
2008-08-01
Previous studies reporting the linear viscoelastic shear properties of the human vocal fold cover or mucosa have been based on torsional rheometry, with measurements limited to low audio frequencies, up to around 80 Hz. This paper describes the design and validation of a custom-built, controlled-strain, linear, simple-shear rheometer system capable of direct empirical measurements of viscoelastic shear properties at phonatory frequencies. A tissue specimen was subjected to simple shear between two parallel, rigid acrylic plates, with a linear motor creating a translational sinusoidal displacement of the specimen via the upper plate, and the lower plate transmitting the harmonic shear force resulting from the viscoelastic response of the specimen. The displacement of the specimen was measured by a linear variable differential transformer whereas the shear force was detected by a piezoelectric transducer. The frequency response characteristics of these system components were assessed by vibration experiments with accelerometers. Measurements of the viscoelastic shear moduli (G' and G") of a standard ANSI S2.21 polyurethane material and those of human vocal fold cover specimens were made, along with estimation of the system signal and noise levels. Preliminary results showed that the rheometer can provide valid and reliable rheometric data of vocal fold lamina propria specimens at frequencies of up to around 250 Hz, well into the phonatory range.
Glottal Adduction and Subglottal Pressure in Singing.
Herbst, Christian T; Hess, Markus; Müller, Frank; Švec, Jan G; Sundberg, Johan
2015-07-01
Previous research suggests that independent variation of vocal loudness and glottal configuration (type and degree of vocal fold adduction) does not occur in untrained speech production. This study investigated whether these factors can be varied independently in trained singing and how subglottal pressure is related to average glottal airflow, voice source properties, and sound level under these conditions. A classically trained baritone produced sustained phonations on the endoscopic vowel [i:] at pitch D4 (approximately 294 Hz), exclusively varying either (a) vocal register; (b) phonation type (from "breathy" to "pressed" via cartilaginous adduction); or (c) vocal loudness, while keeping the others constant. Phonation was documented by simultaneous recording of videokymographic, electroglottographic, airflow and voice source data, and by percutaneous measurement of relative subglottal pressure. Register shifts were clearly marked in the electroglottographic wavegram display. Compared with chest register, falsetto was produced with greater pulse amplitude of the glottal flow, H1-H2, mean airflow, and with lower maximum flow declination rate (MFDR), subglottal pressure, and sound pressure. Shifts of phonation type (breathy/flow/neutral/pressed) induced comparable systematic changes. Increase of vocal loudness resulted in increased subglottal pressure, average flow, sound pressure, MFDR, glottal flow pulse amplitude, and H1-H2. When changing either vocal register or phonation type, subglottal pressure and mean airflow showed an inverse relationship, that is, variation of glottal flow resistance. The direct relation between subglottal pressure and airflow when varying only vocal loudness demonstrated independent control of vocal loudness and glottal configuration. Achieving such independent control of phonatory control parameters would be an important target in vocal pedagogy and in voice therapy. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Proton density-weighted laryngeal magnetic resonance imaging in systemically dehydrated rats.
Oleson, Steven; Lu, Kun-Han; Liu, Zhongming; Durkes, Abigail C; Sivasankar, M Preeti
2018-06-01
Dehydrated vocal folds are inefficient sound generators. Although systemic dehydration of the body is believed to induce vocal fold dehydration, this causative relationship has not been demonstrated in vivo. Here we investigate the feasibility of using in vivo proton density (PD)-weighted magnetic resonance imaging (MRI) to demonstrate hydration changes in vocal fold tissue following systemic dehydration in rats. Animal study. Sprague-Dawley rats (n = 10) were imaged at baseline and following a 10% reduction in body weight secondary to withholding water. In vivo, high-field (7 T), PD-weighted MRI was used to successfully resolve vocal fold and salivary gland tissue structures. Normalized signal intensities within the vocal fold decreased postdehydration by an average of 11.38% ± 3.95% (mean ± standard error of the mean [SEM], P = .0098) as compared to predehydration levels. The salivary glands experienced a similar decrease in normalized signal intensity by an average of 10.74% ± 4.14% (mean ± SEM, P = .0195) following dehydration. The correlation coefficient (percent change from dehydration) between vocal folds and salivary glands was 0.7145 (P = .0202). Ten percent systemic dehydration induced vocal fold dehydration as assessed by PD-weighted MRI. Changes in the hydration state of vocal fold tissue were highly correlated with that of the salivary glands in dehydrated rats in vivo. These preliminary findings demonstrate the feasibility of using PD-weighted MRI to quantify hydration states of the vocal folds and lay the foundation for further studies that explore more routine and realistic magnitudes of systemic dehydration and rehydration. NA. Laryngoscope, 128:E222-E227, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
Effects of Systemic Hydration on Vocal Acoustics of 18- to 35-Year-Old Females
ERIC Educational Resources Information Center
Franca, Maria Claudia; Simpson, Kenneth O.
2012-01-01
The influence of body hydration and vocal acoustics was investigated in this study. Effects of two levels of hydration on objective measures of vocal acoustics were explored. In an attempt to reduce variability in the degree of systemic hydration and to induce a state of systemic dehydration, participants were instructed to refrain from ingestion…
Effectiveness of a Voice Training Program for Student Teachers on Vocal Health.
Richter, Bernhard; Nusseck, Manfred; Spahn, Claudia; Echternach, Matthias
2016-07-01
The effectiveness of a preventive training program on vocal health for German student teachers was investigated on specific vocal parameters. The voice quality as described by the Dysphonia Severity Index of 204 student teachers (training group: n = 123; control group: n = 81) was measured at the beginning and at the end of the student teachers training period (duration 1.5 years). Additionally, for investigating the voice-carrying capacity, a vocal loading test (VLT) was performed. Finally, participants had to provide a subjective judgment of a possible Voice Handicap Index. The training program improved the voice quality of the trained group compared with that of the control group, whose voice quality declined. The trained group was also able to better sustain their voice quality across the VLT than the control group. Both groups, however, reported a similar increase in subjective vocal strain. The presented training program clearly showed a positive impact on the voice quality and the vocal capacity. The results maintain the importance of such a training program to be integrated in the education and occupational routine of teachers. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Mouse vocal communication system: are ultrasounds learned or innate?
Arriaga, Gustavo; Jarvis, Erich D.
2013-01-01
Mouse ultrasonic vocalizations (USVs) are often used as behavioral readouts of internal states, to measure effects of social and pharmacological manipulations, and for behavioral phenotyping of mouse models for neuropsychiatric and neurodegenerative disorders. However, little is known about the neurobiological mechanisms of rodent USV production. Here we discuss the available data to assess whether male mouse song behavior and the supporting brain circuits resemble those of known vocal non-learning or vocal learning species. Recent neurobiology studies have demonstrated that the mouse USV brain system includes motor cortex and striatal regions, and that the vocal motor cortex sends a direct sparse projection to the brainstem vocal motor nucleus ambiguous, a projection thought be unique to humans among mammals. Recent behavioral studies have reported opposing conclusions on mouse vocal plasticity, including vocal ontogeny changes in USVs over early development that might not be explained by innate maturation processes, evidence for and against a role for auditory feedback in developing and maintaining normal mouse USVs, and evidence for and against limited vocal imitation of song pitch. To reconcile these findings, we suggest that the trait of vocal learning may not be dichotomous but encompass a broad set of behavioral and neural traits we call the continuum hypothesis, and that mice possess some of the traits associated with a capacity for limited vocal learning. PMID:23295209
Arneodo, Ezequiel M.; Perl, Yonatan Sanz; Goller, Franz; Mindlin, Gabriel B.
2012-01-01
Because of the parallels found with human language production and acquisition, birdsong is an ideal animal model to study general mechanisms underlying complex, learned motor behavior. The rich and diverse vocalizations of songbirds emerge as a result of the interaction between a pattern generator in the brain and a highly nontrivial nonlinear periphery. Much of the complexity of this vocal behavior has been understood by studying the physics of the avian vocal organ, particularly the syrinx. A mathematical model describing the complex periphery as a nonlinear dynamical system leads to the conclusion that nontrivial behavior emerges even when the organ is commanded by simple motor instructions: smooth paths in a low dimensional parameter space. An analysis of the model provides insight into which parameters are responsible for generating a rich variety of diverse vocalizations, and what the physiological meaning of these parameters is. By recording the physiological motor instructions elicited by a spontaneously singing muted bird and computing the model on a Digital Signal Processor in real-time, we produce realistic synthetic vocalizations that replace the bird's own auditory feedback. In this way, we build a bio-prosthetic avian vocal organ driven by a freely behaving bird via its physiologically coded motor commands. Since it is based on a low-dimensional nonlinear mathematical model of the peripheral effector, the emulation of the motor behavior requires light computation, in such a way that our bio-prosthetic device can be implemented on a portable platform. PMID:22761555
From mouth to hand: gesture, speech, and the evolution of right-handedness.
Corballis, Michael C
2003-04-01
The strong predominance of right-handedness appears to be a uniquely human characteristic, whereas the left-cerebral dominance for vocalization occurs in many species, including frogs, birds, and mammals. Right-handedness may have arisen because of an association between manual gestures and vocalization in the evolution of language. I argue that language evolved from manual gestures, gradually incorporating vocal elements. The transition may be traced through changes in the function of Broca's area. Its homologue in monkeys has nothing to do with vocal control, but contains the so-called "mirror neurons," the code for both the production of manual reaching movements and the perception of the same movements performed by others. This system is bilateral in monkeys, but predominantly left-hemispheric in humans, and in humans is involved with vocalization as well as manual actions. There is evidence that Broca's area is enlarged on the left side in Homo habilis, suggesting that a link between gesture and vocalization may go back at least two million years, although other evidence suggests that speech may not have become fully autonomous until Homo sapiens appeared some 170,000 years ago, or perhaps even later. The removal of manual gesture as a necessary component of language may explain the rapid advance of technology, allowing late migrations of Homo sapiens from Africa to replace all other hominids in other parts of the world, including the Neanderthals in Europe and Homo erectus in Asia. Nevertheless, the long association of vocalization with manual gesture left us a legacy of right-handedness.
Fergus, Daniel J.; Bass, Andrew H.
2013-01-01
Estrogens play a salient role in the development and maintenance of both male and female nervous systems and behaviors. The plainfin midshipman (Porichthys notatus), a teleost fish, has two male reproductive morphs that follow alternative mating tactics and diverge in multiple somatic, hormonal and neural traits, including the central control of morph-specific vocal behaviors. After we identified duplicate estrogen receptors (ERβ1 and ERβ2) in midshipman, we developed antibodies to localize protein expression in the central vocal-acoustic networks and saccule, the auditory division of the inner ear. As in other teleost species, ERβ1 and ERβ2 were robustly expressed in the telencephalon and hypothalamus in vocal-acoustic and other brain regions shown previously to exhibit strong expression of ERα and aromatase (estrogen synthetase, CYP19) in midshipman. Like aromatase, ERβ1 label co-localized with glial fibrillary acidic protein (GFAP) in telencephalic radial glial cells. Quantitative PCR revealed similar patterns of transcript abundance across reproductive morphs for ERβ1, ERβ2, ERα and aromatase in the forebrain and saccule. In contrast, transcript abundance for ERs and aromatase varied significantly between morphs in and around the sexually polymorphic vocal motor nucleus (VMN). Together, the results suggest that VMN is the major estrogen target within the estrogen-sensitive hindbrain vocal network that directly determines the duration, frequency and amplitude of morph-specific vocalizations. Comparable regional differences in steroid receptor abundances likely regulate morph-specific behaviors in males and females of other species exhibiting alternative reproductive tactics. PMID:23460422
Determination of West Indian manatee vocalization levels and rate
NASA Astrophysics Data System (ADS)
Phillips, Richard; Niezrecki, Christopher; Beusse, Diedrich
2004-05-01
The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of a growing number of collisions with boats. A system to warn boaters of the presence of manatees, based upon the vocalizations of manatees, could potentially reduce these boat collisions. The feasibility of this warning system would depend mainly upon two factors: the rate at which manatees vocalize and the distance in which the manatees can be detected. The research presented in this paper verifies that the average vocalization rate of the West Indian manatee is approximately one to two times per 5-min period. Several different manatee vocalization recordings were broadcast to the manatees and their response was observed. It was found that during the broadcast periods, the vocalization rates for the manatees increased substantially when compared with the average vocalization rates during nonbroadcast periods. An array of four hydrophones was used while recording the manatees. This allowed for position estimation techniques to be used to determine the location of the vocalizing manatee. Knowing the position of the manatee, the source level was determined and it was found that the mean source level of the manatee vocalizations is approximately 112 dB (re:1 Pa) @ 1 m.
Determination of West Indian manatee vocalization levels and rate
NASA Astrophysics Data System (ADS)
Phillips, Richard; Niezrecki, Christopher; Beusse, Diedrich O.
2004-01-01
The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of a growing number of collisions with boats. A system to warn boaters of the presence of manatees, based upon the vocalizations of manatees, could potentially reduce these boat collisions. The feasibility of this warning system would depend mainly upon two factors: the rate at which manatees vocalize and the distance in which the manatees can be detected. The research presented in this paper verifies that the average vocalization rate of the West Indian manatee is approximately one to two times per 5-min period. Several different manatee vocalization recordings were broadcast to the manatees and their response was observed. It was found that during the broadcast periods, the vocalization rates for the manatees increased substantially when compared with the average vocalization rates during nonbroadcast periods. An array of four hydrophones was used while recording the manatees. This allowed for position estimation techniques to be used to determine the location of the vocalizing manatee. Knowing the position of the manatee, the source level was determined and it was found that the mean source level of the manatee vocalizations is approximately 112 dB (re 1 μPa) @ 1 m.
Wang, Gang; Harpole, Clifford E; Paulose, Jiffin; Cassone, Vincent M
2014-04-01
Temperate zone birds are highly seasonal in many aspects of their physiology. In mammals, but not in birds, the pineal gland is an important component regulating seasonal patterns of primary gonadal functions. Pineal melatonin in birds instead affects seasonal changes in brain song control structures, suggesting the pineal gland regulates seasonal song behavior. The present study tests the hypothesis that the pineal gland transduces photoperiodic information to the control of seasonal song behavior to synchronize this important behavior to the appropriate phenology. House sparrows, Passer domesticus, expressed a rich array of vocalizations ranging from calls to multisyllabic songs and motifs of songs that varied under a regimen of different photoperiodic conditions that were simulated at different times of year. Control (SHAM) birds exhibited increases in song behavior when they were experimentally transferred from short days, simulating winter, to equinoctial and long days, simulating summer, and decreased vocalization when they were transferred back to short days. When maintained in long days for longer periods, the birds became reproductively photorefractory as measured by the yellowing of the birds' bills; however, song behavior persisted in the SHAM birds, suggesting a dissociation of reproduction from the song functions. Pinealectomized (PINX) birds expressed larger, more rapid increases in daily vocal rate and song repertoire size than did the SHAM birds during the long summer days. These increases gradually declined upon the extension of the long days and did not respond to the transfer to short days as was observed in the SHAM birds, suggesting that the pineal gland conveys photoperiodic information to the vocal control system, which in turn regulates song behavior. Copyright © 2014 Elsevier Inc. All rights reserved.
Repairing the vibratory vocal fold.
Long, Jennifer L
2018-01-01
A vibratory vocal fold replacement would introduce a new treatment paradigm for structural vocal fold diseases such as scarring and lamina propria loss. This work implants a tissue-engineered replacement for vocal fold lamina propria and epithelium in rabbits and compares histology and function to injured controls and orthotopic transplants. Hypotheses were that the cell-based implant would engraft and control the wound response, reducing fibrosis and restoring vibration. Translational research. Rabbit adipose-derived mesenchymal stem cells (ASC) were embedded within a three-dimensional fibrin gel, forming the cell-based outer vocal fold replacement (COVR). Sixteen rabbits underwent unilateral resection of vocal fold epithelium and lamina propria, as well as reconstruction with one of three treatments: fibrin glue alone with healing by secondary intention, replantation of autologous resected vocal fold cover, or COVR implantation. After 4 weeks, larynges were examined histologically and with phonation. Fifteen rabbits survived. All tissues incorporated well after implantation. After 1 month, both graft types improved histology and vibration relative to injured controls. Extracellular matrix (ECM) of the replanted mucosa was disrupted, and ECM of the COVR implants remained immature. Immune reaction was evident when male cells were implanted into female rabbits. Best histologic and short-term vibratory outcomes were achieved with COVR implants containing male cells implanted into male rabbits. Vocal fold cover replacement with a stem cell-based tissue-engineered construct is feasible and beneficial in acute rabbit implantation. Wound-modifying behavior of the COVR implant is judged to be an important factor in preventing fibrosis. NA. Laryngoscope, 128:153-159, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
Assessing Vocal Development in Infants and Toddlers
Nathani, Suneeti; Ertmer, David J.
2012-01-01
The purpose of this study was to examine changes in prelinguistic vocal productions during the first 20 months of life. Vocalizations were classified into 23 mutually exclusive and exhaustive types, and grouped into five ascending levels using the Stark Assessment of Early Vocal Development-Revised (SAEVD-R). Data from 30 typically developing infants, aged 0–20 months, show that older infants attained higher developmental levels on the SAEVD-R than younger infants. Infants 0–2, 3–5, and 6–8 months of age primarily produced vocalizations from Levels 1 (Reflexive), 2 (Control of Phonation), and 3 (Expansion). Infants 9–20 months of age also produced vocalizations from Level 4 (Basic Canonical Syllables). Only infants from 16–20 months of age produced Level 5 (Advanced Forms) vocalizations in significant quantities. The outcomes indicate that the SAEVD-R is a valuable instrument for evaluating prelinguistic vocal development. PMID:16728333
Behroozmand, Roozbeh; Ibrahim, Nadine; Korzyukov, Oleg; Robin, Donald A.; Larson, Charles R.
2014-01-01
The ability to process auditory feedback for vocal pitch control is crucial during speaking and singing. Previous studies have suggested that musicians with absolute pitch (AP) develop specialized left-hemisphere mechanisms for pitch processing. The present study adopted an auditory feedback pitch perturbation paradigm combined with ERP recordings to test the hypothesis whether the neural mechanisms of the left-hemisphere enhance vocal pitch error detection and control in AP musicians compared with relative pitch (RP) musicians and non-musicians (NM). Results showed a stronger N1 response to pitch-shifted voice feedback in the right-hemisphere for both AP and RP musicians compared with the NM group. However, the left-hemisphere P2 component activation was greater in AP and RP musicians compared with NMs and also for the AP compared with RP musicians. The NM group was slower in generating compensatory vocal reactions to feedback pitch perturbation compared with musicians, and they failed to re-adjust their vocal pitch after the feedback perturbation was removed. These findings suggest that in the earlier stages of cortical neural processing, the right hemisphere is more active in musicians for detecting pitch changes in voice feedback. In the later stages, the left-hemisphere is more active during the processing of auditory feedback for vocal motor control and seems to involve specialized mechanisms that facilitate pitch processing in the AP compared with RP musicians. These findings indicate that the left hemisphere mechanisms of AP ability are associated with improved auditory feedback pitch processing during vocal pitch control in tasks such as speaking or singing. PMID:24355545
Limiting parental feedback disrupts vocal development in marmoset monkeys
Gultekin, Yasemin B.; Hage, Steffen R.
2017-01-01
Vocalizations of human infants undergo dramatic changes across the first year by becoming increasingly mature and speech-like. Human vocal development is partially dependent on learning by imitation through social feedback between infants and caregivers. Recent studies revealed similar developmental processes being influenced by parental feedback in marmoset monkeys for apparently innate vocalizations. Marmosets produce infant-specific vocalizations that disappear after the first postnatal months. However, it is yet unclear whether parental feedback is an obligate requirement for proper vocal development. Using quantitative measures to compare call parameters and vocal sequence structure we show that, in contrast to normally raised marmosets, marmosets that were separated from parents after the third postnatal month still produced infant-specific vocal behaviour at subadult stages. These findings suggest a significant role of social feedback on primate vocal development until the subadult stages and further show that marmoset monkeys are a compelling model system for early human vocal development. PMID:28090084
Major depressive disorder discrimination using vocal acoustic features.
Taguchi, Takaya; Tachikawa, Hirokazu; Nemoto, Kiyotaka; Suzuki, Masayuki; Nagano, Toru; Tachibana, Ryuki; Nishimura, Masafumi; Arai, Tetsuaki
2018-01-01
The voice carries various information produced by vibrations of the vocal cords and the vocal tract. Though many studies have reported a relationship between vocal acoustic features and depression, including mel-frequency cepstrum coefficients (MFCCs) which applied to speech recognition, there have been few studies in which acoustic features allowed discrimination of patients with depressive disorder. Vocal acoustic features as biomarker of depression could make differential diagnosis of patients with depressive state. In order to achieve differential diagnosis of depression, in this preliminary study, we examined whether vocal acoustic features could allow discrimination between depressive patients and healthy controls. Subjects were 36 patients who met the criteria for major depressive disorder and 36 healthy controls with no current or past psychiatric disorders. Voices of reading out digits before and after verbal fluency task were recorded. Voices were analyzed using OpenSMILE. The extracted acoustic features, including MFCCs, were used for group comparison and discriminant analysis between patients and controls. The second dimension of MFCC (MFCC 2) was significantly different between groups and allowed the discrimination between patients and controls with a sensitivity of 77.8% and a specificity of 86.1%. The difference in MFCC 2 between the two groups reflected an energy difference of frequency around 2000-3000Hz. The MFCC 2 was significantly different between depressive patients and controls. This feature could be a useful biomarker to detect major depressive disorder. Sample size was relatively small. Psychotropics could have a confounding effect on voice. Copyright © 2017 Elsevier B.V. All rights reserved.
2014-01-01
Background Whether listening to background music enhances verbal learning performance is still a matter of dispute. In this study we investigated the influence of vocal and instrumental background music on verbal learning. Methods 226 subjects were randomly assigned to one of five groups (one control group and 4 experimental groups). All participants were exposed to a verbal learning task. One group served as control group while the 4 further groups served as experimental groups. The control group learned without background music while the 4 experimental groups were exposed to vocal or instrumental musical pieces during learning with different subjective intensity and valence. Thus, we employed 4 music listening conditions (vocal music with high intensity: VOC_HIGH, vocal music with low intensity: VOC_LOW, instrumental music with high intensity: INST_HIGH, instrumental music with low intensity: INST_LOW) and one control condition (CONT) during which the subjects learned the word lists. Since it turned out that the high and low intensity groups did not differ in terms of the rated intensity during the main experiment these groups were lumped together. Thus, we worked with 3 groups: one control group and two groups, which were exposed to background music (vocal and instrumental) during verbal learning. As dependent variable, the number of learned words was used. Here we measured immediate recall during five learning sessions (recall 1 – recall 5) and delayed recall for 15 minutes (recall 6) and 14 days (recall 7) after the last learning session. Results Verbal learning improved during the first 5 recall sessions without any strong difference between the control and experimental groups. Also the delayed recalls were similar for the three groups. There was only a trend for attenuated verbal learning for the group passively listened to vocals. This learning attenuation diminished during the following learning sessions. Conclusions The exposure to vocal or instrumental background music during encoding did not influence verbal learning. We suggest that the participants are easily able to cope with this background stimulation by ignoring this information channel in order to focus on the verbal learning task. PMID:24670048
Vassiliki, Kalodimou; Irini, Messini; Nikolaos, Psychalakis; Karampela, Eleftheria; Apostolos, Papalois
2016-01-01
Background. The aim of the study was to assess the histological effects of autologous infusion of adipose-derived stem cells (ADSC) on a chronic vocal fold scar in a rabbit model as compared to an untreated scar as well as in injection of hyaluronic acid. Study Design. Animal experiment. Method. We used 74 New Zealand rabbits. Sixteen of them were used as control/normal group. We created a bilateral vocal fold wound in the remaining 58 rabbits. After 18 months we separated our population into three groups. The first group served as control/scarred group. The second one was injected with hyaluronic acid in the vocal folds, and the third received an autologous adipose-derived stem cell infusion in the scarred vocal folds (ADSC group). We measured the variation of thickness of the lamina propria of the vocal folds and analyzed histopathologic changes in each group after three months. Results. The thickness of the lamina propria was significantly reduced in the group that received the ADSC injection, as compared to the normal/scarred group. The collagen deposition, the hyaluronic acid, the elastin levels, and the organization of elastic fibers tend to return to normal after the injection of ADSC. Conclusions. Autologous injection of adipose-derived stem cells on a vocal fold chronic scar enhanced the healing of the vocal folds and the reduction of the scar tissue, even when compared to other treatments. PMID:26933440
Horita, Haruhito; Kobayashi, Masahiko; Liu, Wan-chun; Oka, Kotaro; Jarvis, Erich D.; Wada, Kazuhiro
2012-01-01
Mechanisms for the evolution of convergent behavioral traits are largely unknown. Vocal learning is one such trait that evolved multiple times and is necessary in humans for the acquisition of spoken language. Among birds, vocal learning is evolved in songbirds, parrots, and hummingbirds. Each time similar forebrain song nuclei specialized for vocal learning and production have evolved. This finding led to the hypothesis that the behavioral and neuroanatomical convergences for vocal learning could be associated with molecular convergence. We previously found that the neural activity-induced gene dual specificity phosphatase 1 (dusp1) was up-regulated in non-vocal circuits, specifically in sensory-input neurons of the thalamus and telencephalon; however, dusp1 was not up-regulated in higher order sensory neurons or motor circuits. Here we show that song motor nuclei are an exception to this pattern. The song nuclei of species from all known vocal learning avian lineages showed motor-driven up-regulation of dusp1 expression induced by singing. There was no detectable motor-driven dusp1 expression throughout the rest of the forebrain after non-vocal motor performance. This pattern contrasts with expression of the commonly studied activity-induced gene egr1, which shows motor-driven expression in song nuclei induced by singing, but also motor-driven expression in adjacent brain regions after non-vocal motor behaviors. In the vocal non-learning avian species, we found no detectable vocalizing-driven dusp1 expression in the forebrain. These findings suggest that independent evolutions of neural systems for vocal learning were accompanied by selection for specialized motor-driven expression of the dusp1 gene in those circuits. This specialized expression of dusp1 could potentially lead to differential regulation of dusp1-modulated molecular cascades in vocal learning circuits. PMID:22876306
Microstructural and mechanical characterization of scarred vocal folds.
Heris, Hossein K; Miri, Amir K; Ghattamaneni, Nageswara R; Li, Nicole Y K; Thibeault, Susan L; Wiseman, Paul W; Mongeau, Luc
2015-02-26
The goal of this study was to characterize the vocal folds microstructure and elasticity using nonlinear laser scanning microscopy and atomic force microscopy-based indentation, respectively. As a pilot study, the vocal folds of fourteen rats were unilaterally injured by full removal of lamina propria; the uninjured folds of the same animals served as controls. The area fraction of collagen fibrils was found to be greater in scarred tissues two months after injury than the uninjured controls. A novel mathematical model was also proposed to relate collagen concentration and tissue bulk modulus. This work presents a first step towards systematic investigation of microstructural and mechanical characteristics in scarred vocal fold tissue. Copyright © 2015 Elsevier Ltd. All rights reserved.
Warren, Megan R; Sangiamo, Daniel T; Neunuebel, Joshua P
2018-03-01
An integral component in the assessment of vocal behavior in groups of freely interacting animals is the ability to determine which animal is producing each vocal signal. This process is facilitated by using microphone arrays with multiple channels. Here, we made important refinements to a state-of-the-art microphone array based system used to localize vocal signals produced by freely interacting laboratory mice. Key changes to the system included increasing the number of microphones as well as refining the methodology for localizing and assigning vocal signals to individual mice. We systematically demonstrate that the improvements in the methodology for localizing mouse vocal signals led to an increase in the number of signals detected as well as the number of signals accurately assigned to an animal. These changes facilitated the acquisition of larger and more comprehensive data sets that better represent the vocal activity within an experiment. Furthermore, this system will allow more thorough analyses of the role that vocal signals play in social communication. We expect that such advances will broaden our understanding of social communication deficits in mouse models of neurological disorders. Copyright © 2018 Elsevier B.V. All rights reserved.
Predicting Achievable Fundamental Frequency Ranges in Vocalization Across Species
Titze, Ingo; Riede, Tobias; Mau, Ted
2016-01-01
Vocal folds are used as sound sources in various species, but it is unknown how vocal fold morphologies are optimized for different acoustic objectives. Here we identify two main variables affecting range of vocal fold vibration frequency, namely vocal fold elongation and tissue fiber stress. A simple vibrating string model is used to predict fundamental frequency ranges across species of different vocal fold sizes. While average fundamental frequency is predominantly determined by vocal fold length (larynx size), range of fundamental frequency is facilitated by (1) laryngeal muscles that control elongation and by (2) nonlinearity in tissue fiber tension. One adaptation that would increase fundamental frequency range is greater freedom in joint rotation or gliding of two cartilages (thyroid and cricoid), so that vocal fold length change is maximized. Alternatively, tissue layers can develop to bear a disproportionate fiber tension (i.e., a ligament with high density collagen fibers), increasing the fundamental frequency range and thereby vocal versatility. The range of fundamental frequency across species is thus not simply one-dimensional, but can be conceptualized as the dependent variable in a multi-dimensional morphospace. In humans, this could allow for variations that could be clinically important for voice therapy and vocal fold repair. Alternative solutions could also have importance in vocal training for singing and other highly-skilled vocalizations. PMID:27309543
Application of motion analysis in the study of the effect of botulinum toxin to rat vocal folds
NASA Astrophysics Data System (ADS)
Saadah, Abdul K.; Galatsanos, Nikolas P.; Inagi, K.; Bless, D.
1997-05-01
In the past we have proposed a system that measures the deformations of the vocal folds from videostroboscopic images of the larynx, in that system: (1) we extract the boundaries of the vocal folds, (2) we register elastically the vocal fold boundaries in successive frames. This yields the displacement vector field (DVF) between adjacent frames, and (3) we fit using a least-squares approach an affine transformation model to succinctly describe the deformations between adjacent frames. In this paper, we present as an example of the capabilities of this system, an initial study of the deformation changes in rat vocal folds pre and post injection with Botulinum toxin. For this application the generated DVF was segmented into right DVF and left DVF and the deformation of each segment is studied separately.
Evaluation of interventions to reduce multiply controlled vocal stereotypy.
Scalzo, Rachel; Henry, Kelsey; Davis, Tonya N; Amos, Kally; Zoch, Tamara; Turchan, Sarah; Wagner, Tara
2015-07-01
This study examined four interventions targeted at decreasing multiply controlled vocal stereotypy for a 12-year-old boy diagnosed with autism spectrum disorder and a severe intellectual disability. These interventions included Noncontingent Music, Differential Reinforcement of Other Behaviors, Self-Recording, and Functional Communication Training (FCT). In addition to measuring vocal stereotypy during each condition, task engagement and challenging behavior were also monitored. Across conditions, vocal stereotypy did not vary significantly from baseline except in FCT, when it decreased significantly. Task engagement was higher in this condition as well. It is hypothesized that FCT provided an enriched environment by increasing social interaction and access to desired items as well as removal of less preferred activities. For these reasons, there was a decrease in the need for the participant to engage in vocal stereotypy and challenging behavior and increase in his ability to engage in a task. © The Author(s) 2015.
NASA Astrophysics Data System (ADS)
Zhang, Lucy; Yu, Feimi; Krane, Michael
2017-11-01
The control volume analysis of power flow during sustained phonation is performed using results of a fully-coupled aeroelastic-aeroacoustic simulation. The control volumes consist of the laryngeal region, and the larynx and the vocal tract. Two cases are considered: an effectively infinite length vocal tract, where sound produced in the larynx radiates away and is not reflected back, and a constant-area vocal tract of normal adult human dimensions, in which phonatory sound resonates before radiating from the mouth opening. In both cases the lungs are modeled to absorb all incident sound, while providing a constant volume flow toward the larynx. Control of the acoustic boundary conditions is accomplished using perfectly matched- layers, and flow from the lungs is provided by a source distribution near the entrance to the trachea region. For both cases the power flow for the larynx and larynx plus vocal tract control volumes are computed using the integral form of the mechanical energy equation, expanded to consider power exchanges between slightly compressible flow in the larynx and the acoustic fields in the vocal tract and trachea. The funding from NIH 2R01DC005642-10A1 is greatly acknowledged.
Inferring speaker attributes in adductor spasmodic dysphonia: ratings from unfamiliar listeners.
Isetti, Derek; Xuereb, Linnea; Eadie, Tanya L
2014-05-01
To determine whether unfamiliar listeners' perceptions of speakers with adductor spasmodic dysphonia (ADSD) differ from control speakers on the parameters of relative age, confidence, tearfulness, and vocal effort and are related to speaker-rated vocal effort or voice-specific quality of life. Twenty speakers with ADSD (including 6 speakers with ADSD plus tremor) and 20 age- and sex-matched controls provided speech recordings, completed a voice-specific quality-of-life instrument (Voice Handicap Index; Jacobson et al., 1997), and rated their own vocal effort. Twenty listeners evaluated speech samples for relative age, confidence, tearfulness, and vocal effort using rating scales. Listeners judged speakers with ADSD as sounding significantly older, less confident, more tearful, and more effortful than control speakers (p < .01). Increased vocal effort was strongly associated with decreased speaker confidence (rs = .88-.89) and sounding more tearful (rs = .83-.85). Self-rated speaker effort was moderately related (rs = .45-.52) to listener impressions. Listeners' perceptions of confidence and tearfulness were also moderately associated with higher Voice Handicap Index scores (rs = .65-.70). Unfamiliar listeners judge speakers with ADSD more negatively than control speakers, with judgments extending beyond typical clinical measures. The results have implications for counseling and understanding the psychosocial effects of ADSD.
Babies in traffic: infant vocalizations and listener sex modulate auditory motion perception.
Neuhoff, John G; Hamilton, Grace R; Gittleson, Amanda L; Mejia, Adolfo
2014-04-01
Infant vocalizations and "looming sounds" are classes of environmental stimuli that are critically important to survival but can have dramatically different emotional valences. Here, we simultaneously presented listeners with a stationary infant vocalization and a 3D virtual looming tone for which listeners made auditory time-to-arrival judgments. Negatively valenced infant cries produced more cautious (anticipatory) estimates of auditory arrival time of the tone over a no-vocalization control. Positively valenced laughs had the opposite effect, and across all conditions, men showed smaller anticipatory biases than women. In Experiment 2, vocalization-matched vocoded noise stimuli did not influence concurrent auditory time-to-arrival estimates compared with a control condition. In Experiment 3, listeners estimated the egocentric distance of a looming tone that stopped before arriving. For distant stopping points, women estimated the stopping point as closer when the tone was presented with an infant cry than when it was presented with a laugh. For near stopping points, women showed no differential effect of vocalization type. Men did not show differential effects of vocalization type at either distance. Our results support the idea that both the sex of the listener and the emotional valence of infant vocalizations can influence auditory motion perception and can modulate motor responses to other behaviorally relevant environmental sounds. We also find support for previous work that shows sex differences in emotion processing are diminished under conditions of higher stress.
Early life manipulations of vasopressin-family peptides alter vocal learning.
Baran, Nicole M; Peck, Samantha C; Kim, Tabitha H; Goldstein, Michael H; Adkins-Regan, Elizabeth
2017-07-26
Vocal learning from social partners is crucial for the successful development of communication in a wide range of species. Social interactions organize attention and enhance motivation to learn species-typical behaviour. However, the neurobiological mechanisms connecting social motivation and vocal learning are unknown. Using zebra finches ( Taeniopygia guttata ), a ubiquitous model for vocal learning, we show that manipulations of nonapeptide hormones in the vasopressin family (arginine vasotocin, AVT) early in development can promote or disrupt both song and social motivation. Young male zebra finches, like human infants, are socially gregarious and require interactive feedback from adult tutors to learn mature vocal forms. To investigate the role of social motivational mechanisms in song learning, in two studies, we injected hatchling males with AVT or Manning compound (MC, a nonapeptide receptor antagonist) on days 2-8 post-hatching and recorded song at maturity. In both studies, MC males produced a worse match to tutor song than controls. In study 2, which experimentally controlled for tutor and genetic factors, AVT males also learned song significantly better compared with controls. Furthermore, song similarity correlated with several measures of social motivation throughout development. These findings provide the first evidence that nonapeptides are critical to the development of vocal learning. © 2017 The Author(s).
Mechanical properties of the vocal fold. Stress-strain studies.
Haji, T; Mori, K; Omori, K; Isshiki, N
1992-01-01
The viscoelasticity of the vocal and ventricular folds was experimentally assessed by analyzing the stress-strain relationships obtained using a newly developed measuring system. The degree of stiffness of the mid-membranous portion of the vocal fold was less than that near the anterior commissure or the vocal process. The ventricular fold was much less stiff and significantly more viscous than the vocal fold. At the membranous portion of the vocal fold, the degree of stiffness was less and that of viscosity greater at 2 mm above and below the free margin than at the free margin itself.
Genetic identification of a hindbrain nucleus essential for innate vocalization.
Hernandez-Miranda, Luis Rodrigo; Ruffault, Pierre-Louis; Bouvier, Julien C; Murray, Andrew J; Morin-Surun, Marie-Pierre; Zampieri, Niccolò; Cholewa-Waclaw, Justyna B; Ey, Elodie; Brunet, Jean-Francois; Champagnat, Jean; Fortin, Gilles; Birchmeier, Carmen
2017-07-25
Vocalization in young mice is an innate response to isolation or mechanical stimulation. Neuronal circuits that control vocalization and breathing overlap and rely on motor neurons that innervate laryngeal and expiratory muscles, but the brain center that coordinates these motor neurons has not been identified. Here, we show that the hindbrain nucleus tractus solitarius (NTS) is essential for vocalization in mice. By generating genetically modified newborn mice that specifically lack excitatory NTS neurons, we show that they are both mute and unable to produce the expiratory drive required for vocalization. Furthermore, the muteness of these newborns results in maternal neglect. We also show that neurons of the NTS directly connect to and entrain the activity of spinal (L1) and nucleus ambiguus motor pools located at positions where expiratory and laryngeal motor neurons reside. These motor neurons control expiratory pressure and laryngeal tension, respectively, thereby establishing the essential biomechanical parameters used for vocalization. In summary, our work demonstrates that the NTS is an obligatory component of the neuronal circuitry that transforms breaths into calls.
Pereira, Marcelo Charles; Repka, Carlos Domingues; Camargo, Paulo Antonio Monteiro; Rispoli, Daniel Zeni; Campos, Antônio Carlos Ligocki; Matias, Jorge Eduardo Fouto
2009-07-01
To compare the effects of topical mitomycin-C at different concentrations on submucosal collagen deposition on the vocal folds of swine. The animals were divided into three groups according to the composition of the topical solution to be applied to the vocal folds: 0.9% saline solution (control group); 4 mg/ml mitomycin-C (group 1) and 8 mg/ml mitomycin-C (group 2). Thirty days after the application, all animals were sacrificed, their vocal folds were collected and stained by the picrosirius red technique, and submucosal collagen deposition areas were estimated by the Image Pro Plus 4.5 software. Mann-Whitney test was used to compare differences between parameters of each group. The means of the areas of submucosal collagen deposits on vocal folds were 3110.44 square micrometers (microm(2)), 3115.98 microm(2) and 3105.78 microm(2) for groups control, 1 and 2, respectively. There were no statistical differences across the three groups (p>0.05). Mitomycin-C topically applied to intact vocal folds of swine did not alter submucosal collagen deposition.
Kingyon, J; Behroozmand, R; Kelley, R; Oya, H; Kawasaki, H; Narayanan, N S; Greenlee, J D W
2015-10-01
The neural basis of human speech is unclear. Intracranial electrophysiological recordings have revealed that high-gamma band oscillations (70-150Hz) are observed in the frontal lobe during speech production and in the temporal lobe during speech perception. Here, we tested the hypothesis that the frontal and temporal brain regions had high-gamma coherence during speech. We recorded electrocorticography (ECoG) from the frontal and temporal cortices of five humans who underwent surgery for medically intractable epilepsy, and studied coherence between the frontal and temporal cortex during vocalization and playback of vocalization. We report two novel results. First, we observed high-gamma band as well as theta (4-8Hz) coherence between frontal and temporal lobes. Second, both high-gamma and theta coherence were stronger when subjects were actively vocalizing as compared to playback of the same vocalizations. These findings provide evidence that coupling between sensory-motor networks measured by high-gamma coherence plays a key role in feedback-based monitoring and control of vocal output for human vocalization. Copyright © 2015 IBRO. Published by Elsevier Ltd. All rights reserved.
Bunijevac, Mila; Petrović-Lazić, Mirjana; Jovanović-Simić, Nadica; Vuković, Mile
2016-02-01
The major role of larynx in speech, respiration and swallowing makes carcinomas of this region and their treatment very influential for patients' life quality. The aim of this study was to assess the importance of voice therapy in patients after open surgery on vocal cords. This study included 21 male patients and the control group of 19 subjects. The vowel (A) was recorded and analyzed for each examinee. All the patients were recorded twice: firstly, when they contacted the clinic and secondly, after a three-month vocal therapy, which was held twiceper week on an outpatient basis. The voice analysis was carried out in the Ear, Nose and Throat (ENT) Clinic, Clinical Hospital Center "Zvezdara" in Belgrade. The values of the acoustic parameters in the patients submitted to open surgery on the vocal cords before vocal rehabilitation and the control group subjects were significantly different in all specified parameters. These results suggest that the voice of the patients was damaged before vocal rehabilitation. The results of the acoustic parameters of the vowel (A) before and after vocal rehabilitation of the patients with open surgery on vocal cords were statistically significantly different. Among the parameters--Jitter (%), Shimmer (%)--the observed difference was highly statistically significant (p < 0.01). The voice turbulence index and the noise/harmonic ratio were also notably improved, and the observed difference was statistically significant (p < 0.05). The analysis of the tremor intensity index showed no significant improvement and the observed difference was not statistically significant (p > 0.05 ). CONCLUSION. There was a significant improvement of the acoustic parameters of the vowel (A) in the study subjects three months following vocal therapy. Only one out of five representative parameters showed no significant improvement.
D'haeseleer, Evelien; Claeys, Sofie; Bettens, Kim; Leemans, Laura; Van Calster, Ann-Sophie; Van Damme, Nina; Thijs, Zoë; Daelman, Julie; Leyns, Clara; Van Lierde, Kristiane
2017-07-01
The purpose of this study was to measure the objective and subjective vocal quality in women aged between 60 and 75 years. Secondly, the impact of a teaching or singing career on the vocal quality was investigated by comparing the vocal quality of retired women with different careers. This is a case-control study. Seventy-three retired women between 60 and 75 years (mean age: 67 years, standard deviation: 4.49) participated in the study and were divided into three groups: women with a teaching career (n = 21), choir singers with a singing career (n = 12), and women with a non-vocal career (n = 40). All subjects underwent the same assessment protocol consisting of objective (aerodynamic, maximum performance, vocal range, acoustic measurements, and the Dysphonia Severity Index) and subjective (the Voice Handicap Index, auditory-perceptual evaluations by three listeners) voice measurements. In all three groups, objective and perceptual voice analysis showed a mild dysphonia. No differences in the Dysphonia Severity Index were found between the three groups. The voices of choir singers with a singing career were perceived significantly less rough than voices of the women with a non-vocal career. Additionally, the lowest frequency of the frequency range was significantly lower in the retired teachers and choir singers than in the controls. The results of this study prudently suggest that a singing or a teaching career compared with a non-vocal career has a positive impact on the vocal frequency range, and that singing has a positive impact on the perceptual vocal quality of the older female voice. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Irino, Toshio; Patterson, Roy
2005-04-01
We hear vowels produced by men, women, and children as approximately the same although there is considerable variability in glottal pulse rate and vocal tract length. At the same time, we can identify the speaker group. Recent experiments show that it is possible to identify vowels even when the glottal pulse rate and vocal tract length are condensed or expanded beyond the range of natural vocalization. This suggests that the auditory system has an automatic process to segregate information about shape and size of the vocal tract. Recently we proposed that the auditory system uses some form of Stabilized, Wavelet-Mellin Transform (SWMT) to analyze scale information in bio-acoustic sounds as a general framework for auditory processing from cochlea to cortex. This talk explains the theoretical background of the model and how the vocal information is normalized in the representation. [Work supported by GASR(B)(2) No. 15300061, JSPS.
ERIC Educational Resources Information Center
Church, Jessica A.; Fair, Damien A.; Dosenbach, Nico U. F.; Cohen, Alexander L.; Miezin, Francis M.; Petersen, Steven E.; Schlaggar, Bradley L.
2009-01-01
Tourette syndrome (TS) is a developmental disorder characterized by unwanted, repetitive behaviours that manifest as stereotyped movements and vocalizations called "tics". Operating under the hypothesis that the brain's control systems may be impaired in TS, we measured resting-state functional connectivity MRI (rs-fcMRI) between 39 previously…
Wild, J M; Krützfeldt, N E O
2012-02-15
During singing in songbirds, the extent of beak opening, like the extent of mouth opening in human singers, is partially correlated with the fundamental frequency of the sounds emitted. Since song in songbirds is under the control of "the song system" (a collection of interconnected forebrain nuclei dedicated to the learning and production of song), it might be expected that beak movements during singing would also be controlled by this system. However, direct neural connections between the telencephalic output of the song system and beak muscle motor neurons in the brainstem are conspicuous by their absence, leaving unresolved the question of how beak movements are affected during singing. By using standard tract tracing methods, we sought to answer this question by defining beak premotor neurons and examining their afferent projections. In the caudal medulla, jaw premotor cell bodies were located adjacent to the terminal field of the output of the song system, into which many premotor neurons extended their dendrites. The premotor neurons also received a novel input from the trigeminal ganglion and an overlapping input from a lateral arcopallial component of a trigeminal sensorimotor circuit that traverses the forebrain. The ganglionic input in songbirds, which is not present in doves and pigeons that vocalize with a closed beak, may modulate the activity of beak premotor neurons in concert with the output of the song system. These inputs to jaw premotor neurons could, together, affect beak movements as a means of modulating filter properties of the upper vocal tract during singing. Copyright © 2011 Wiley-Liss, Inc.
Behroozmand, Roozbeh; Ibrahim, Nadine; Korzyukov, Oleg; Robin, Donald A; Larson, Charles R
2014-02-01
The ability to process auditory feedback for vocal pitch control is crucial during speaking and singing. Previous studies have suggested that musicians with absolute pitch (AP) develop specialized left-hemisphere mechanisms for pitch processing. The present study adopted an auditory feedback pitch perturbation paradigm combined with ERP recordings to test the hypothesis whether the neural mechanisms of the left-hemisphere enhance vocal pitch error detection and control in AP musicians compared with relative pitch (RP) musicians and non-musicians (NM). Results showed a stronger N1 response to pitch-shifted voice feedback in the right-hemisphere for both AP and RP musicians compared with the NM group. However, the left-hemisphere P2 component activation was greater in AP and RP musicians compared with NMs and also for the AP compared with RP musicians. The NM group was slower in generating compensatory vocal reactions to feedback pitch perturbation compared with musicians, and they failed to re-adjust their vocal pitch after the feedback perturbation was removed. These findings suggest that in the earlier stages of cortical neural processing, the right hemisphere is more active in musicians for detecting pitch changes in voice feedback. In the later stages, the left-hemisphere is more active during the processing of auditory feedback for vocal motor control and seems to involve specialized mechanisms that facilitate pitch processing in the AP compared with RP musicians. These findings indicate that the left hemisphere mechanisms of AP ability are associated with improved auditory feedback pitch processing during vocal pitch control in tasks such as speaking or singing. Copyright © 2013 Elsevier Inc. All rights reserved.
Sensory-motor interactions for vocal pitch monitoring in non-primary human auditory cortex.
Greenlee, Jeremy D W; Behroozmand, Roozbeh; Larson, Charles R; Jackson, Adam W; Chen, Fangxiang; Hansen, Daniel R; Oya, Hiroyuki; Kawasaki, Hiroto; Howard, Matthew A
2013-01-01
The neural mechanisms underlying processing of auditory feedback during self-vocalization are poorly understood. One technique used to study the role of auditory feedback involves shifting the pitch of the feedback that a speaker receives, known as pitch-shifted feedback. We utilized a pitch shift self-vocalization and playback paradigm to investigate the underlying neural mechanisms of audio-vocal interaction. High-resolution electrocorticography (ECoG) signals were recorded directly from auditory cortex of 10 human subjects while they vocalized and received brief downward (-100 cents) pitch perturbations in their voice auditory feedback (speaking task). ECoG was also recorded when subjects passively listened to playback of their own pitch-shifted vocalizations. Feedback pitch perturbations elicited average evoked potential (AEP) and event-related band power (ERBP) responses, primarily in the high gamma (70-150 Hz) range, in focal areas of non-primary auditory cortex on superior temporal gyrus (STG). The AEPs and high gamma responses were both modulated by speaking compared with playback in a subset of STG contacts. From these contacts, a majority showed significant enhancement of high gamma power and AEP responses during speaking while the remaining contacts showed attenuated response amplitudes. The speaking-induced enhancement effect suggests that engaging the vocal motor system can modulate auditory cortical processing of self-produced sounds in such a way as to increase neural sensitivity for feedback pitch error detection. It is likely that mechanisms such as efference copies may be involved in this process, and modulation of AEP and high gamma responses imply that such modulatory effects may affect different cortical generators within distinctive functional networks that drive voice production and control.
Sensory-Motor Interactions for Vocal Pitch Monitoring in Non-Primary Human Auditory Cortex
Larson, Charles R.; Jackson, Adam W.; Chen, Fangxiang; Hansen, Daniel R.; Oya, Hiroyuki; Kawasaki, Hiroto; Howard, Matthew A.
2013-01-01
The neural mechanisms underlying processing of auditory feedback during self-vocalization are poorly understood. One technique used to study the role of auditory feedback involves shifting the pitch of the feedback that a speaker receives, known as pitch-shifted feedback. We utilized a pitch shift self-vocalization and playback paradigm to investigate the underlying neural mechanisms of audio-vocal interaction. High-resolution electrocorticography (ECoG) signals were recorded directly from auditory cortex of 10 human subjects while they vocalized and received brief downward (−100 cents) pitch perturbations in their voice auditory feedback (speaking task). ECoG was also recorded when subjects passively listened to playback of their own pitch-shifted vocalizations. Feedback pitch perturbations elicited average evoked potential (AEP) and event-related band power (ERBP) responses, primarily in the high gamma (70–150 Hz) range, in focal areas of non-primary auditory cortex on superior temporal gyrus (STG). The AEPs and high gamma responses were both modulated by speaking compared with playback in a subset of STG contacts. From these contacts, a majority showed significant enhancement of high gamma power and AEP responses during speaking while the remaining contacts showed attenuated response amplitudes. The speaking-induced enhancement effect suggests that engaging the vocal motor system can modulate auditory cortical processing of self-produced sounds in such a way as to increase neural sensitivity for feedback pitch error detection. It is likely that mechanisms such as efference copies may be involved in this process, and modulation of AEP and high gamma responses imply that such modulatory effects may affect different cortical generators within distinctive functional networks that drive voice production and control. PMID:23577157
Vocal Imitations of Non-Vocal Sounds
Houix, Olivier; Voisin, Frédéric; Misdariis, Nicolas; Susini, Patrick
2016-01-01
Imitative behaviors are widespread in humans, in particular whenever two persons communicate and interact. Several tokens of spoken languages (onomatopoeias, ideophones, and phonesthemes) also display different degrees of iconicity between the sound of a word and what it refers to. Thus, it probably comes at no surprise that human speakers use a lot of imitative vocalizations and gestures when they communicate about sounds, as sounds are notably difficult to describe. What is more surprising is that vocal imitations of non-vocal everyday sounds (e.g. the sound of a car passing by) are in practice very effective: listeners identify sounds better with vocal imitations than with verbal descriptions, despite the fact that vocal imitations are inaccurate reproductions of a sound created by a particular mechanical system (e.g. a car driving by) through a different system (the voice apparatus). The present study investigated the semantic representations evoked by vocal imitations of sounds by experimentally quantifying how well listeners could match sounds to category labels. The experiment used three different types of sounds: recordings of easily identifiable sounds (sounds of human actions and manufactured products), human vocal imitations, and computational “auditory sketches” (created by algorithmic computations). The results show that performance with the best vocal imitations was similar to the best auditory sketches for most categories of sounds, and even to the referent sounds themselves in some cases. More detailed analyses showed that the acoustic distance between a vocal imitation and a referent sound is not sufficient to account for such performance. Analyses suggested that instead of trying to reproduce the referent sound as accurately as vocally possible, vocal imitations focus on a few important features, which depend on each particular sound category. These results offer perspectives for understanding how human listeners store and access long-term sound representations, and sets the stage for the development of human-computer interfaces based on vocalizations. PMID:27992480
Infant Cries Rattle Adult Cognition.
Dudek, Joanna; Faress, Ahmed; Bornstein, Marc H; Haley, David W
2016-01-01
The attention-grabbing quality of the infant cry is well recognized, but how the emotional valence of infant vocal signals affects adult cognition and cortical activity has heretofore been unknown. We examined the effects of two contrasting infant vocalizations (cries vs. laughs) on adult performance on a Stroop task using a cross-modal distraction paradigm in which infant distractors were vocal and targets were visual. Infant vocalizations were presented before (Experiment 1) or during each Stroop trial (Experiment 2). To evaluate the influence of infant vocalizations on cognitive control, neural responses to the Stroop task were obtained by measuring electroencephalography (EEG) and event-related potentials (ERPs) in Experiment 1. Based on the previously demonstrated existence of negative arousal bias, we hypothesized that cry vocalizations would be more distracting and invoke greater conflict processing than laugh vocalizations. Similarly, we expected participants to have greater difficulty shifting attention from the vocal distractors to the target task after hearing cries vs. after hearing laughs. Behavioral results from both experiments showed a cry interference effect, in which task performance was slower with cry than with laugh distractors. Electrophysiology data further revealed that cries more than laughs reduced attention to the task (smaller P200) and increased conflict processing (larger N450), albeit differently for incongruent and congruent trials. Results from a correlation analysis showed that the amplitudes of P200 and N450 were inversely related, suggesting a reciprocal relationship between attention and conflict processing. The findings suggest that cognitive control processes contribute to an attention bias to infant signals, which is modulated in part by the valence of the infant vocalization and the demands of the cognitive task. The findings thus support the notion that infant cries elicit a negative arousal bias that is distracting; they also identify, for the first time, the neural dynamics underlying the unique influence that infant cries and laughs have on cognitive control.
Modulation of voice related to tremor and vibrato
NASA Astrophysics Data System (ADS)
Lester, Rosemary Anne
Modulation of voice is a result of physiologic oscillation within one or more components of the vocal system including the breathing apparatus (i.e., pressure supply), the larynx (i.e. sound source), and the vocal tract (i.e., sound filter). These oscillations may be caused by pathological tremor associated with neurological disorders like essential tremor or by volitional production of vibrato in singers. Because the acoustical characteristics of voice modulation specific to each component of the vocal system and the effect of these characteristics on perception are not well-understood, it is difficult to assess individuals with vocal tremor and to determine the most effective interventions for reducing the perceptual severity of the disorder. The purpose of the present studies was to determine how the acoustical characteristics associated with laryngeal-based vocal tremor affect the perception of the magnitude of voice modulation, and to determine if adjustments could be made to the voice source and vocal tract filter to alter the acoustic output and reduce the perception of modulation. This research was carried out using both a computational model of speech production and trained singers producing vibrato to simulate laryngeal-based vocal tremor with different voice source characteristics (i.e., vocal fold length and degree of vocal fold adduction) and different vocal tract filter characteristics (i.e., vowel shapes). It was expected that, by making adjustments to the voice source and vocal tract filter that reduce the amplitude of the higher harmonics, the perception of magnitude of voice modulation would be reduced. The results of this study revealed that listeners' perception of the magnitude of modulation of voice was affected by the degree of vocal fold adduction and the vocal tract shape with the computational model, but only by the vocal quality (corresponding to the degree of vocal fold adduction) with the female singer. Based on regression analyses, listeners' judgments were predicted by modulation information in both low and high frequency bands. The findings from these studies indicate that production of a breathy vocal quality might be a useful compensatory strategy for reducing the perceptual severity of modulation of voice for individuals with tremor affecting the larynx.
Tong, Zhixiang; Duncan, Randall L.
2013-01-01
We are interested in the in vitro engineering of artificial vocal fold tissues via the strategic combination of multipotent mesenchymal stem cells (MSCs), physiologically relevant mechanical stimulations, and biomimetic artificial matrices. We have constructed a vocal fold bioreactor that is capable of imposing vibratory stimulations on the cultured cells at human phonation frequencies. Separately, fibrous poly (ɛ-caprolactone) (PCL) scaffolds emulating the ligamentous structure of the vocal fold were prepared by electrospinning, were incorporated in the vocal fold bioreactor, and were driven into a wave-like motion in an axisymmetrical fashion by the oscillating air. MSC-laden PCL scaffolds were subjected to vibrations at 200 Hz with a normal center displacement of ∼40 μm for a total of 7 days. A continuous (CT) or a 1 h-on-1 h-off (OF) regime with a total dynamic culture time of 12 h per day was applied. The dynamic loading did not cause any physiological trauma to the cells. Immunohistotochemical staining revealed the reinforcement of the actin filament and the enhancement of α5β1 integrin expression under selected dynamic culture conditions. Cellular expression of essential vocal fold extracellular matrix components, such as elastin, hyaluronic acid, and matrix metalloproteinase-1, was significantly elevated as compared with the static controls, and the OF regime is more conducive to matrix production than the CT vibration mode. Analyses of genes of typical fibroblast hallmarks (tenascin-C, collagen III, and procollagen I) as well as markers for MSC differentiation into nonfibroblastic lineages confirmed MSCs' adaptation of fibroblastic behaviors. Overall, the high-frequency vibratory stimulation, when combined with a synthetic fibrous scaffold, serves as a potent modulator of MSC functions. The novel bioreactor system presented here, as a versatile, yet well-controlled model, offers an in vitro platform for understanding vibration-induced mechanotransduction and for engineering of functional vocal fold tissues. PMID:23516973
Johnson, Sarah A.; Farrington, Michelle J.; Murphy, Claire R.; McAllister, Leif A.; Kaur, Sarabjit; Chun, Catherine; Ortega, Madison T.; Marshall, Brittney L.; Hoffmann, Frauke; Ellersieck, Mark R.; Schenk, A. Katrin
2018-01-01
Rodent pups use vocalizations to communicate with one or both parents in biparental species, such as California mice (Peromyscus californicus). Previous studies have shown California mice developmentally exposed to endocrine disrupting chemicals, bisphenol A (BPA) or ethinyl estradiol (EE), demonstrate later compromised parental behaviors. Reductions in F1 parental behaviors might also be due to decreased emissions of F2 pup vocalizations. Thus, vocalizations of F2 male and female California mice pups born to F1 parents developmentally exposed to BPA, EE, or controls were examined. Postnatal days (PND) 2–4 were considered early postnatal period, PND 7 and 14 were defined as mid-postnatal period, and PND 21 and 28 were classified as late postnatal period. EE pups showed increased latency to emit the first syllable compared to controls. BPA female pups had decreased syllable duration compared to control and EE female pups during the early postnatal period but enhanced responses compared to controls at late postnatal period; whereas, male BPA and EE pups showed greater syllable duration compared to controls during early postnatal period. In mid-postnatal period, F2 BPA and EE pups emitted greater number of phrases than F2 control pups. Results indicate aspects of vocalizations were disrupted in F2 pups born to F1 parents developmentally exposed to BPA or EE, but their responses were not always identical, suggesting BPA might not activate estrogen receptors to the same extent as EE. Changes in vocalization patterns by F2 pups may be due to multigenerational exposure to BPA or EE and/or reduced parental care received. PMID:29912934
Zhang, Zhaoyan
2015-01-01
Maintaining a small glottal opening across a large range of voice conditions is critical to normal voice production. This study investigated the effectiveness of vocal fold approximation and stiffening in regulating glottal opening and airflow during phonation, using a three-dimensional numerical model of phonation. The results showed that with increasing subglottal pressure the vocal folds were gradually pushed open, leading to increased mean glottal opening and flow rate. A small glottal opening and a mean glottal flow rate typical of human phonation can be maintained against increasing subglottal pressure by proportionally increasing the degree of vocal fold approximation for low to medium subglottal pressures and vocal fold stiffening at high subglottal pressures. Although sound intensity was primarily determined by the subglottal pressure, the results suggest that, to maintain small glottal opening as the sound intensity increases, one has to simultaneously tighten vocal fold approximation and/or stiffen the vocal folds, resulting in increased glottal resistance, vocal efficiency, and fundamental frequency. PMID:25698022
Pasch, Bret; Abbasi, Mustafa Z; Wilson, Macey; Zhao, Daniel; Searle, Jeremy B; Webster, Michael S; Rice, Aaron N
2016-04-01
Nutritional stress can have lasting impacts on the development of traits involved in vocal production. Cross-fostering experiments are often used to examine the propensity for vocal learning in a variety of taxa, but few studies assess the influence of malnourishment that can occur as a byproduct of this technique. In this study, we reciprocally cross-fostered sister taxa of voluble grasshopper mice (genus Onychomys) to explore their propensity for vocal learning. Vocalizations of Onychomys leucogaster did not differ between control and cross-fostered animals, but cross-fostered Onychomys arenicola produced vocalizations that were higher in frequency in a direction away from tutors. These same animals exhibited a transient reduction in body mass early in development, indicative of malnutrition. Our findings simultaneously refute vocal learning and support the developmental stress hypothesis to highlight the importance of early ontogeny on the production of vocalizations later in life. Copyright © 2016 Elsevier Inc. All rights reserved.
Wallez, Catherine; Schaeffer, Jennifer; Meguerditchian, Adrien; Vauclair, Jacques; Schapiro, Steven J.; Hopkins, William D.
2013-01-01
Studies involving oro-facial asymmetries in nonhuman primates have largely demonstrated a right hemispheric dominance for communicative signals and conveyance of emotional information. A recent study on chimpanzee reported the first evidence of significant left-hemispheric dominance when using attention-getting sounds and rightward bias for species-typical vocalizations (Losin, Russell, Freeman, Meguerditchian, Hopkins & Fitch, 2008). The current study sought to extend the findings from Losin et al. (2008) with additional oro-facial assessment in a new colony of chimpanzees. When combining the two populations, the results indicated a consistent leftward bias for attention-getting sounds and a right lateralization for species-typical vocalizations. Collectively, the results suggest that both voluntary- controlled oro-facial and gestural communication might share the same left-hemispheric specialization and might have coevolved into a single integrated system present in a common hominid ancestor. PMID:22867751
Vocal Accuracy and Neural Plasticity Following Micromelody-Discrimination Training
Zarate, Jean Mary; Delhommeau, Karine; Wood, Sean; Zatorre, Robert J.
2010-01-01
Background Recent behavioral studies report correlational evidence to suggest that non-musicians with good pitch discrimination sing more accurately than those with poorer auditory skills. However, other studies have reported a dissociation between perceptual and vocal production skills. In order to elucidate the relationship between auditory discrimination skills and vocal accuracy, we administered an auditory-discrimination training paradigm to a group of non-musicians to determine whether training-enhanced auditory discrimination would specifically result in improved vocal accuracy. Methodology/Principal Findings We utilized micromelodies (i.e., melodies with seven different interval scales, each smaller than a semitone) as the main stimuli for auditory discrimination training and testing, and we used single-note and melodic singing tasks to assess vocal accuracy in two groups of non-musicians (experimental and control). To determine if any training-induced improvements in vocal accuracy would be accompanied by related modulations in cortical activity during singing, the experimental group of non-musicians also performed the singing tasks while undergoing functional magnetic resonance imaging (fMRI). Following training, the experimental group exhibited significant enhancements in micromelody discrimination compared to controls. However, we did not observe a correlated improvement in vocal accuracy during single-note or melodic singing, nor did we detect any training-induced changes in activity within brain regions associated with singing. Conclusions/Significance Given the observations from our auditory training regimen, we therefore conclude that perceptual discrimination training alone is not sufficient to improve vocal accuracy in non-musicians, supporting the suggested dissociation between auditory perception and vocal production. PMID:20567521
A Mutation Associated with Stuttering Alters Mouse Pup Ultrasonic Vocalizations.
Barnes, Terra D; Wozniak, David F; Gutierrez, Joanne; Han, Tae-Un; Drayna, Dennis; Holy, Timothy E
2016-04-13
A promising approach to understanding the mechanistic basis of speech is to study disorders that affect speech without compromising other cognitive or motor functions. Stuttering, also known as stammering, has been linked to mutations in the lysosomal enzyme-targeting pathway, but how this remarkably specific speech deficit arises from mutations in a family of general "cellular housekeeping" genes is unknown. To address this question, we asked whether a missense mutation associated with human stuttering causes vocal or other abnormalities in mice. We compared vocalizations from mice engineered to carry a mutation in the Gnptab (N-acetylglucosamine-1-phosphotransferase subunits alpha/beta) gene with wild-type littermates. We found significant differences in the vocalizations of pups with the human Gnptab stuttering mutation compared to littermate controls. Specifically, we found that mice with the mutation emitted fewer vocalizations per unit time and had longer pauses between vocalizations and that the entropy of the temporal sequence was significantly reduced. Furthermore, Gnptab missense mice were similar to wild-type mice on an extensive battery of non-vocal behaviors. We then used the same language-agnostic metrics for auditory signal analysis of human speech. We analyzed speech from people who stutter with mutations in this pathway and compared it to control speech and found abnormalities similar to those found in the mouse vocalizations. These data show that mutations in the lysosomal enzyme-targeting pathway produce highly specific effects in mouse pup vocalizations and establish the mouse as an attractive model for studying this disorder. Copyright © 2016 Elsevier Ltd. All rights reserved.
How to bootstrap a human communication system.
Fay, Nicolas; Arbib, Michael; Garrod, Simon
2013-01-01
How might a human communication system be bootstrapped in the absence of conventional language? We argue that motivated signs play an important role (i.e., signs that are linked to meaning by structural resemblance or by natural association). An experimental study is then reported in which participants try to communicate a range of pre-specified items to a partner using repeated non-linguistic vocalization, repeated gesture, or repeated non-linguistic vocalization plus gesture (but without using their existing language system). Gesture proved more effective (measured by communication success) and more efficient (measured by the time taken to communicate) than non-linguistic vocalization across a range of item categories (emotion, object, and action). Combining gesture and vocalization did not improve performance beyond gesture alone. We experimentally demonstrate that gesture is a more effective means of bootstrapping a human communication system. We argue that gesture outperforms non-linguistic vocalization because it lends itself more naturally to the production of motivated signs. © 2013 Cognitive Science Society, Inc.
Wingstrand, Vibe Lindeblad; Jensen, David H.; Bork, Kristian; Sebbesen, Lars; Balle, Jesper; Fischer-Nielsen, Anne; von Buchwald, Christian
2016-01-01
Objectives Therapy with mesenchymal stem cells exhibits potential for the development of novel interventions for many diseases and injuries. The use of mesenchymal stem cells in regenerative therapy for vocal fold scarring exhibited promising results to reduce stiffness and enhance the biomechanical properties of injured vocal folds. This study evaluated the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring. Data Sources PubMed, Embase, the Cochrane Library and Google Scholar were searched. Methods Controlled studies that assessed the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring were included. Primary outcomes were viscoelastic properties and mucosal wave amplitude. Results Seven preclinical animal studies (n = 152 single vocal folds) were eligible for inclusion. Evaluation of viscoelastic parameters revealed a decreased dynamic viscosity (η’) and elastic modulus (G’), i.e., decreased resistance and stiffness, in scarred vocal folds treated with mesenchymal stem cells compared to non-treated scarred vocal folds. Mucosal wave amplitude was increased in scarred vocal folds treated with mesenchymal stem cells vs. non-treated scarred vocal folds. Conclusion The results from these studies suggest an increased regenerative effect of therapy with mesenchymal stem cells for scarred vocal folds and are encouraging for further clinical studies. PMID:27631373
Wingstrand, Vibe Lindeblad; Grønhøj Larsen, Christian; Jensen, David H; Bork, Kristian; Sebbesen, Lars; Balle, Jesper; Fischer-Nielsen, Anne; von Buchwald, Christian
2016-01-01
Therapy with mesenchymal stem cells exhibits potential for the development of novel interventions for many diseases and injuries. The use of mesenchymal stem cells in regenerative therapy for vocal fold scarring exhibited promising results to reduce stiffness and enhance the biomechanical properties of injured vocal folds. This study evaluated the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring. PubMed, Embase, the Cochrane Library and Google Scholar were searched. Controlled studies that assessed the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring were included. Primary outcomes were viscoelastic properties and mucosal wave amplitude. Seven preclinical animal studies (n = 152 single vocal folds) were eligible for inclusion. Evaluation of viscoelastic parameters revealed a decreased dynamic viscosity (η') and elastic modulus (G'), i.e., decreased resistance and stiffness, in scarred vocal folds treated with mesenchymal stem cells compared to non-treated scarred vocal folds. Mucosal wave amplitude was increased in scarred vocal folds treated with mesenchymal stem cells vs. non-treated scarred vocal folds. The results from these studies suggest an increased regenerative effect of therapy with mesenchymal stem cells for scarred vocal folds and are encouraging for further clinical studies.
NASA Astrophysics Data System (ADS)
Lucero, Jorge C.; Koenig, Laura L.
2005-03-01
In this study we use a low-dimensional laryngeal model to reproduce temporal variations in oral airflow produced by speakers in the vicinity of an abduction gesture. It attempts to characterize these temporal patterns in terms of biomechanical parameters such as glottal area, vocal fold stiffness, subglottal pressure, and gender differences in laryngeal dimensions. A two-mass model of the vocal folds coupled to a two-tube approximation of the vocal tract is fitted to oral airflow records measured in men and women during the production of /aha/ utterances, using the subglottal pressure, glottal width, and Q factor as control parameters. The results show that the model is capable of reproducing the airflow records with good approximation. A nonlinear damping characteristics is needed, to reproduce the flow variation at glottal abduction. Devoicing is achieved by the combined action of vocal fold abduction, the decrease of subglottal pressure, and the increase of vocal fold tension. In general, the female larynx has a more restricted region of vocal fold oscillation than the male one. This would explain the more frequent devoicing in glottal abduction-adduction gestures for /h/ in running speech by women, compared to men. .
Vocal warm-up increases phonation threshold pressure in soprano singers at high pitch.
Motel, Tamara; Fisher, Kimberly V; Leydon, Ciara
2003-06-01
Vocal warm-up is thought to optimize singing performance. We compared effects of short-term, submaximal, vocal warm-up exercise with those of vocal rest on the soprano voice (n = 10, ages 19-21 years). Dependent variables were the minimum subglottic air pressure required for vocal fold oscillation to occur (phonation threshold pressure, Pth), and the maximum and minimum phonation fundamental frequency. Warm-up increased Pth for high pitch phonation (p = 0.033), but not for comfortable (p = 0.297) or low (p = 0.087) pitch phonation. No significant difference in the maximum phonation frequency (p = 0.193) or minimum frequency (p = 0.222) was observed. An elevated Pth at controlled high pitch, but an unchanging maximum and minimum frequency production suggests that short-term vocal exercise may increase the viscosity of the vocal fold and thus serve to stabilize the high voice.
Chan, Roger W; Siegmund, Thomas; Zhang, Kai
2009-12-01
Accurate characterization of biomechanical characteristics of the vocal fold is critical for understanding the regulation of vocal fundamental frequency (F(0)), which depends on the active control of the intrinsic laryngeal muscles as well as the passive biomechanical response of the vocal fold lamina propria. Specifically, the tissue stress-strain response and viscoelastic properties under cyclic tensile deformation are relevant, when the vocal folds are subjected to length and tension changes due to posturing. This paper describes a constitutive modeling approach quantifying the relationship between vocal fold stress and strain (or stretch), and establishes predictions of F(0) with the string model of phonation based on the constitutive parameters. Results indicated that transient and time-dependent changes in F(0), including global declinations in declarative sentences, as well as local F(0) overshoots and undershoots, can be partially attributed to the time-dependent viscoplastic response of the vocal fold cover.
A new measure of child vocal reciprocity in children with autism spectrum disorder.
Harbison, Amy L; Woynaroski, Tiffany G; Tapp, Jon; Wade, Joshua W; Warlaumont, Anne S; Yoder, Paul J
2018-06-01
Children's vocal development occurs in the context of reciprocal exchanges with a communication partner who models "speechlike" productions. We propose a new measure of child vocal reciprocity, which we define as the degree to which an adult vocal response increases the probability of an immediately following child vocal response. Vocal reciprocity is likely to be associated with the speechlikeness of vocal communication in young children with autism spectrum disorder (ASD). Two studies were conducted to test the utility of the new measure. The first used simulated vocal samples with randomly sequenced child and adult vocalizations to test the accuracy of the proposed index of child vocal reciprocity. The second was an empirical study of 21 children with ASD who were preverbal or in the early stages of language development. Daylong vocal samples collected in the natural environment were computer analyzed to derive the proposed index of child vocal reciprocity, which was highly stable when derived from two daylong vocal samples and was associated with speechlikeness of vocal communication. This association was significant even when controlling for chance probability of child vocalizations to adult vocal responses, probability of adult vocalizations, or probability of child vocalizations. A valid measure of children's vocal reciprocity might eventually improve our ability to predict which children are on track to develop useful speech and/or are most likely to respond to language intervention. A link to a free, publicly-available software program to derive the new measure of child vocal reciprocity is provided. Autism Res 2018, 11: 903-915. © 2018 International Society for Autism Research, Wiley Periodicals, Inc. Children and adults often engage in back-and-forth vocal exchanges. The extent to which they do so is believed to support children's early speech and language development. Two studies tested a new measure of child vocal reciprocity using computer-generated and real-life vocal samples of young children with autism collected in natural settings. The results provide initial evidence of accuracy, test-retest reliability, and validity of the new measure of child vocal reciprocity. A sound measure of children's vocal reciprocity might improve our ability to predict which children are on track to develop useful speech and/or are most likely to respond to language intervention. A free, publicly-available software program and manuals are provided. © 2018 International Society for Autism Research, Wiley Periodicals, Inc.
Harold, Meredith Poore; Barlow, Steven M.
2012-01-01
The vocalizations and jaw kinematics of 30 infants aged 6–8 months were recorded using a Motion Analysis System and audiovisual technologies. This study represents the first attempt to determine the effect of play environment on infants’ rate of vocalization and jaw movement. Four play conditions were compared: watching videos, social contingent reinforcement and vocal modeling with an adult, playing alone with small toys, and playing alone with large toys. The fewest vocalizations and spontaneous movement were observed when infants were watching videos or interacting with an adult. Infants vocalized most when playing with large toys. The small toys, which naturally elicited gross motor movement (e.g., waving, banging, shaking), educed fewer vocalizations. This study was also the first to quantify the kinematics of vocalized and non-vocalized jaw movements of 6–8 month-old infants. Jaw kinematics did not differentiate infants who produced canonical syllables from those who did not. All infants produced many jaw movements without vocalization. However, during vocalization, infants were unlikely to move their jaw. This contradicts current theories that infant protophonic vocalizations are jaw dominant. Results of the current study can inform socio-linguistic and kinematic theories of canonical babbling. PMID:23261792
Gender differences affecting vocal health of women in vocally demanding careers
Hunter, Eric J.; Smith, Marshall E.; Tanner, Kristine
2012-01-01
Studies suggest that occupational voice users have a greater incidence of vocal issues than the general population. Women have been found to experience vocal health problems more frequently than men, regardless of their occupation. Traditionally, it has been assumed that differences in the laryngeal system are the cause of this disproportion. Nevertheless, it is valuable to identify other potential gender distinctions which may make women more vulnerable to voice disorders. A search of the literature was conducted for gender-specific characteristics which might impact the vocal health of women. This search can be used by healthcare practitioners to help female patients avoid serious vocal health injuries, as well as to better treat women who already suffer from such vocal health issues. PMID:21722077
Yamada, Kazunori; Nakamichi, Masayuki
2017-01-01
We investigated the use of vocalizations called “grunts,” “girneys,” and “coos” accompanied by post-conflict affiliative interaction between former opponents (reconciliation) in Japanese macaques (Macaca fuscata). Although reconciliation functions to repair bonds, such interactions sometimes entail risks of receiving further aggression. Vocalizations can be used at a distance from the former opponent; thus, we predict that vocalizations are used particularly by victims of a conflict, and are frequently used in situations of uncertainty when it is difficult for them to estimate whether the former opponent will resume aggression. In addition, we predict that vocalizations are effective in preventing further aggression. To test these hypotheses, we conducted observations of post-conflict and matched-control situations in female Japanese macaques living in a free-ranging group. We found that former opponents tended to be attracted to each other within the first minute following a conflict, thus demonstrating reconciliation behavior. Vocalizations were more frequently used by the victims in post-conflict interactions than under control situations; however, this tendency was not found in aggressors. When affiliation with the former opponent occurred, victims were more likely to use vocalizations towards less familiar opponents. These findings suggest that Japanese macaques used vocalizations more often when interacting with less predictable former opponents. Victims were more likely to receive aggression from former aggressors when engaged in affiliations with them than under no such affiliations. No significant differences were found in the probability of the victims receiving aggression, regardless of whether they used vocalizations; thus, whether the victim benefits from using vocalizations in these contexts remains unclear. Japanese macaques form despotic societies and therefore, further aggression was inevitable, to some degree, after a conflict. The use of vocalizations by a victim was found to depend on the nature of their relationship with the aggressor; however, the effectiveness of this behavior requires further investigation. PMID:28558070
Katsu, Noriko; Yamada, Kazunori; Nakamichi, Masayuki
2017-01-01
We investigated the use of vocalizations called "grunts," "girneys," and "coos" accompanied by post-conflict affiliative interaction between former opponents (reconciliation) in Japanese macaques (Macaca fuscata). Although reconciliation functions to repair bonds, such interactions sometimes entail risks of receiving further aggression. Vocalizations can be used at a distance from the former opponent; thus, we predict that vocalizations are used particularly by victims of a conflict, and are frequently used in situations of uncertainty when it is difficult for them to estimate whether the former opponent will resume aggression. In addition, we predict that vocalizations are effective in preventing further aggression. To test these hypotheses, we conducted observations of post-conflict and matched-control situations in female Japanese macaques living in a free-ranging group. We found that former opponents tended to be attracted to each other within the first minute following a conflict, thus demonstrating reconciliation behavior. Vocalizations were more frequently used by the victims in post-conflict interactions than under control situations; however, this tendency was not found in aggressors. When affiliation with the former opponent occurred, victims were more likely to use vocalizations towards less familiar opponents. These findings suggest that Japanese macaques used vocalizations more often when interacting with less predictable former opponents. Victims were more likely to receive aggression from former aggressors when engaged in affiliations with them than under no such affiliations. No significant differences were found in the probability of the victims receiving aggression, regardless of whether they used vocalizations; thus, whether the victim benefits from using vocalizations in these contexts remains unclear. Japanese macaques form despotic societies and therefore, further aggression was inevitable, to some degree, after a conflict. The use of vocalizations by a victim was found to depend on the nature of their relationship with the aggressor; however, the effectiveness of this behavior requires further investigation.
Short bouts of vocalization induce long lasting fast gamma oscillations in a sensorimotor nucleus
Lewandowski, Brian; Schmidt, Marc
2011-01-01
Performance evaluation is a critical feature of motor learning. In the vocal system, it requires the integration of auditory feedback signals with vocal motor commands. The network activity that supports such integration is unknown, but it has been proposed that vocal performance evaluation occurs offline. Recording from NIf, a sensorimotor structure in the avian song system, we show that short bouts of singing in adult male zebra finches (Taeniopygia guttata) induce persistent increases in firing activity and coherent oscillations in the fast gamma range (90–150 Hz). Single units are strongly phase-locked to these oscillations, which can last up to 30 s, often outlasting vocal activity by an order of magnitude. In other systems, oscillations often are triggered by events or behavioral tasks but rarely outlast the event that triggered them by more than 1 second. The present observations are the longest reported gamma oscillations triggered by an isolated behavioral event. In mammals, gamma oscillations have been associated with memory consolidation and are hypothesized to facilitate communication between brain regions. We suggest that the timing and persistent nature of NIf’s fast gamma oscillations make them well suited to facilitate the integration of auditory and vocal motor traces associated with vocal performance evaluation. PMID:21957255
The Role of Lexical Stress on the Use of Vocal Fry in Young Adult Female Speakers.
Gibson, Todd A
2017-01-01
Vocal fry is a voice register often used by young adult women for sociolinguistic purposes. Some acoustic correlates of lexical stress, however, appear incompatible with the use of vocal fry. The objective of this study was to systematically examine the role of lexical stress in the use of vocal fry by young adult women. This is a semi-randomized controlled laboratory study. Fifty female undergraduate students were recorded repeating one-, two-, three-, and four-syllable nonwords that conformed to English phonotactics. Nonwords were presented in order from shorter to longer lengths, with stimuli randomized within syllable length. Perceptual analyses of recordings were augmented by acoustic analyses to identify each syllable in which vocal fry occurred. Eighty-six percent of participants produced at least one episode of vocal fry. Vocal fry was more likely to occur in unstressed than stressed position, and the likelihood increased as distance from the stressed syllable increased. There was considerable variability in the use of vocal fry. Frequent and infrequent users varied on the degree to which they used vocal fry in single-syllable nonwords. Vocal fry use persists among young adult women even in the absence of syntactic and pragmatic influences. Lexical stress appeared to dramatically reduce the use of vocal fry. Patterns of vocal fry use appeared to be different for frequent and infrequent users of this vocal register. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Garcia-Albea, Elena; Reeve, Sharon A.; Brothers, Kevin J.; Reeve, Kenneth F.
2014-01-01
Script-fading procedures have been shown to be effective for teaching children with autism to initiate and participate in social interactions without vocal prompts from adults. In previous script and script-fading research, however, there has been no demonstration of a generalized repertoire of vocal interactions under the control of naturally…
Luegmair, Georg; Mehta, Daryush D.; Kobler, James B.; Döllinger, Michael
2015-01-01
Vocal fold kinematics and its interaction with aerodynamic characteristics play a primary role in acoustic sound production of the human voice. Investigating the temporal details of these kinematics using high-speed videoendoscopic imaging techniques has proven challenging in part due to the limitations of quantifying complex vocal fold vibratory behavior using only two spatial dimensions. Thus, we propose an optical method of reconstructing the superior vocal fold surface in three spatial dimensions using a high-speed video camera and laser projection system. Using stereo-triangulation principles, we extend the camera-laser projector method and present an efficient image processing workflow to generate the three-dimensional vocal fold surfaces during phonation captured at 4000 frames per second. Initial results are provided for airflow-driven vibration of an ex vivo vocal fold model in which at least 75% of visible laser points contributed to the reconstructed surface. The method captures the vertical motion of the vocal folds at a high accuracy to allow for the computation of three-dimensional mucosal wave features such as vibratory amplitude, velocity, and asymmetry. PMID:26087485
Multiple Coordination Patterns in Infant and Adult Vocalizations
Abney, Drew H.; Warlaumont, Anne S.; Oller, D. Kimbrough; Wallot, Sebastian; Kello, Christopher T.
2017-01-01
The study of vocal coordination between infants and adults has led to important insights into the development of social, cognitive, emotional and linguistic abilities. We used an automatic system to identify vocalizations produced by infants and adults over the course of the day for fifteen infants studied longitudinally during the first two years of life. We measured three different types of vocal coordination: coincidence-based, rate-based, and cluster-based. Coincidence-based and rate-based coordination are established measures in the developmental literature. Cluster-based coordination is new and measures the strength of matching in the degree to which vocalization events occur in hierarchically nested clusters. We investigated whether various coordination patterns differ as a function of vocalization type, whether different coordination patterns provide unique information about the dynamics of vocal interaction, and how the various coordination patterns each relate to infant age. All vocal coordination patterns displayed greater coordination for infant speech-related vocalizations, adults adapted the hierarchical clustering of their vocalizations to match that of infants, and each of the three coordination patterns had unique associations with infant age. Altogether, our results indicate that vocal coordination between infants and adults is multifaceted, suggesting a complex relationship between vocal coordination and the development of vocal communication. PMID:29375276
Behroozmand, Roozbeh; Karvelis, Laura; Liu, Hanjun; Larson, Charles R.
2009-01-01
Objective The present study investigated whether self-vocalization enhances auditory neural responsiveness to voice pitch feedback perturbation and how this vocalization-induced neural modulation can be affected by the extent of the feedback deviation. Method Event related potentials (ERPs) were recorded in 15 subjects in response to +100, +200 and +500 cents pitch-shifted voice auditory feedback during active vocalization and passive listening to the playback of the self-produced vocalizations. Result The amplitude of the evoked P1 (latency: 73.51 ms) and P2 (latency: 199.55 ms) ERP components in response to feedback perturbation were significantly larger during vocalization than listening. The difference between P2 peak amplitudes during vocalization vs. listening was shown to be significantly larger for +100 than +500 cents stimulus. Conclusion Results indicate that the human auditory cortex is more responsive to voice F0 feedback perturbations during vocalization than passive listening. Greater vocalization-induced enhancement of the auditory responsiveness to smaller feedback perturbations may imply that the audio-vocal system detects and corrects for errors in vocal production that closely match the expected vocal output. Significance Findings of this study support previous suggestions regarding the enhanced auditory sensitivity to feedback alterations during self-vocalization, which may serve the purpose of feedback-based monitoring of one’s voice. PMID:19520602
Viscoelasticity of rabbit vocal folds after injection augmentation.
Dahlqvist, Ake; Gärskog, Ola; Laurent, Claude; Hertegård, Stellan; Ambrosio, Luigi; Borzacchiello, Assunta
2004-01-01
Vocal fold function is related to the viscoelasticity of the vocal fold tissue. Augmentation substances used for injection treatment of voice insufficiency may alter the viscoelastic properties of vocal folds and their vibratory capacity. The objective was to compare the mechanical properties (viscoelasticity) of various injectable substances and the viscoelasticity of rabbit vocal folds, 6 months after injection with one of these substances. Animal model. Cross-linked collagen (Zyplast), double cross-linked hyaluronan (hylan B gel), dextranomers in hyaluronan (DHIA), and polytetrafluoroethylene (Teflon) were injected into rabbit vocal folds. Six months after the injection, the animals were killed and the right- and left-side vocal folds were removed. Dynamic viscosity of the injected substances and the vocal folds was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Hylan B gel and DiHA showed the lowest dynamic viscosity values, and vocal folds injected with these substances also showed the lowest dynamic viscosity (similar to noninjected control samples). Teflon (and vocal folds injected with Teflon) showed the highest dynamic viscosity values, followed by the collagen samples. Substances with low viscoelasticity alter the mechanical properties of the vocal fold to a lesser degree than substances with a high viscoelasticity. The data indicated that hylan B gel and DiHA render the most natural viscoelastic properties to the vocal folds. These substances seem to be appropriate for preserving or restoring the vibratory capacity of the vocal folds when glottal insufficiency is treated with augmentative injections.
Vibration stimulates vocal mucosa-like matrix expression by hydrogel-encapsulated fibroblasts.
Kutty, Jaishankar K; Webb, Ken
2010-01-01
The composition and organization of the vocal fold extracellular matrix (ECM) provide the viscoelastic mechanical properties that are required to sustain high-frequency vibration during voice production. Although vocal injury and pathology are known to produce alterations in matrix physiology, the mechanisms responsible for the development and maintenance of vocal fold ECM are poorly understood. The objective of this study was to investigate the effect of physiologically relevant vibratory stimulation on ECM gene expression and synthesis by fibroblasts encapsulated within hyaluronic acid hydrogels that approximate the viscoelastic properties of vocal mucosa. Relative to static controls, samples exposed to vibration exhibited significant increases in mRNA expression levels of HA synthase 2, decorin, fibromodulin and MMP-1, while collagen and elastin expression were relatively unchanged. Expression levels exhibited a temporal response, with maximum increases observed after 3 and 5 days of vibratory stimulation and significant downregulation observed at 10 days. Quantitative assays of matrix accumulation confirmed significant increases in sulphated glycosaminoglycans and significant decreases in collagen after 5 and 10 days of vibratory culture, relative to static controls. Cellular remodelling and hydrogel viscosity were affected by vibratory stimulation and were influenced by varying the encapsulated cell density. These results indicate that vibration is a critical epigenetic factor regulating vocal fold ECM and suggest that rapid restoration of the phonatory microenvironment may provide a basis for reducing vocal scarring, restoring native matrix composition and improving vocal quality. 2009 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Mosko, J. D.; Stevens, K. N.; Griffin, G. R.
1983-08-01
Acoustical analyses were conducted of words produced by four speakers in a motion stress-inducing situation. The aim of the analyses was to document the kinds of changes that occur in the vocal utterances of speakers who are exposed to motion stress and to comment on the implications of these results for the design and development of voice interactive systems. The speakers differed markedly in the types and magnitudes of the changes that occurred in their speech. For some speakers, the stress-inducing experimental condition caused an increase in fundamental frequency, changes in the pattern of vocal fold vibration, shifts in vowel production and changes in the relative amplitudes of sounds containing turbulence noise. All speakers showed greater variability in the experimental condition than in more relaxed control situation. The variability was manifested in the acoustical characteristics of individual phonetic elements, particularly in speech sound variability observed serve to unstressed syllables. The kinds of changes and variability observed serve to emphasize the limitations of speech recognition systems based on template matching of patterns that are stored in the system during a training phase. There is need for a better understanding of these phonetic modifications and for developing ways of incorporating knowledge about these changes within a speech recognition system.
Communicative aspects and coping strategies in patients with Parkinson's disease.
Costa, Flávia Pereira da; Diaféria, Giovana; Behlau, Mara
2016-01-01
To investigate, in patients with Parkinson's disease (PD), the coping strategies; the most reported vocal symptoms; and the relation between coping, voice symptoms, and communicative aspects. Seventy-three subjects were included in the sample, 33 of which were participants in the experimental group (EG) with diagnosis of PD and 40 were control subjects, that is, healthy and without vocal complaints. They underwent the following procedures: application of Voice Symptoms Scale (VoiSS), Brazilian Version; Voice Disability Coping Questionnaire (VDCQ), Brazilian Version; and the questionnaire Living with Dysarthria (LwD). The EG presented deviations in all protocols: VDCQ, with the most frequently coping strategy being "self-control," VoiSS, with "Impairment" as the most prevalent domain, and LwD, presenting changes in all sections. Vocal signs and symptoms and communicative aspects were shown to have a regular correlation with coping. The correlation between vocal symptoms and communicative aspects was as follows: the greater the impairment in communication, the greater the VoiSS emotional scores and the more they complaint of voice-related signs and symptoms. Patients with PD use all kinds of coping strategies, but prefer using self-control. They present several vocal signs and symptoms, and "Impairment" was the most prevalent domain. There are difficulties in all aspects of communication. The higher the occurrence of vocal signs and symptoms, the more the patient reports the difficulties of living with dysarthria, particularly when deviations affect the emotional domain.
Fournet, Michelle E; Szabo, Andy; Mellinger, David K
2015-01-01
On low-latitude breeding grounds, humpback whales produce complex and highly stereotyped songs as well as a range of non-song sounds associated with breeding behaviors. While on their Southeast Alaskan foraging grounds, humpback whales produce a range of previously unclassified non-song vocalizations. This study investigates the vocal repertoire of Southeast Alaskan humpback whales from a sample of 299 non-song vocalizations collected over a 3-month period on foraging grounds in Frederick Sound, Southeast Alaska. Three classification systems were used, including aural spectrogram analysis, statistical cluster analysis, and discriminant function analysis, to describe and classify vocalizations. A hierarchical acoustic structure was identified; vocalizations were classified into 16 individual call types nested within four vocal classes. The combined classification method shows promise for identifying variability in call stereotypy between vocal groupings and is recommended for future classification of broad vocal repertoires.
Identification of a motor to auditory pathway important for vocal learning
Roberts, Todd F.; Hisey, Erin; Tanaka, Masashi; Kearney, Matthew; Chattree, Gaurav; Yang, Cindy F.; Shah, Nirao M.; Mooney, Richard
2017-01-01
Summary Learning to vocalize depends on the ability to adaptively modify the temporal and spectral features of vocal elements. Neurons that convey motor-related signals to the auditory system are theorized to facilitate vocal learning, but the identity and function of such neurons remain unknown. Here we identify a previously unknown neuron type in the songbird brain that transmits vocal motor signals to the auditory cortex. Genetically ablating these neurons in juveniles disrupted their ability to imitate features of an adult tutor’s song. Ablating these neurons in adults had little effect on previously learned songs, but interfered with their ability to adaptively modify the duration of vocal elements and largely prevented the degradation of song’s temporal features normally caused by deafening. These findings identify a motor to auditory circuit essential to vocal imitation and to the adaptive modification of vocal timing. PMID:28504672
LANGUAGE DEVELOPMENT. The developmental dynamics of marmoset monkey vocal production.
Takahashi, D Y; Fenley, A R; Teramoto, Y; Narayanan, D Z; Borjon, J I; Holmes, P; Ghazanfar, A A
2015-08-14
Human vocal development occurs through two parallel interactive processes that transform infant cries into more mature vocalizations, such as cooing sounds and babbling. First, natural categories of sounds change as the vocal apparatus matures. Second, parental vocal feedback sensitizes infants to certain features of those sounds, and the sounds are modified accordingly. Paradoxically, our closest living ancestors, nonhuman primates, are thought to undergo few or no production-related acoustic changes during development, and any such changes are thought to be impervious to social feedback. Using early and dense sampling, quantitative tracking of acoustic changes, and biomechanical modeling, we showed that vocalizations in infant marmoset monkeys undergo dramatic changes that cannot be solely attributed to simple consequences of growth. Using parental interaction experiments, we found that contingent parental feedback influences the rate of vocal development. These findings overturn decades-old ideas about primate vocalizations and show that marmoset monkeys are a compelling model system for early vocal development in humans. Copyright © 2015, American Association for the Advancement of Science.
Wild, J.M.; Krützfeldt, N.E.O.
2014-01-01
During singing in songbirds, the extent of beak opening, like the extent of mouth opening in human singers, is partially correlated with the fundamental frequency of the sounds emitted. Since song in songbirds is under the control of “the song system” (a collection of interconnected forebrain nuclei dedicated to the learning and production of song), it might be expected that beak movements during singing would also be controlled by this system. However, direct neural connections between the telencephalic output of the song system and beak muscle motor neurons in the brainstem are conspicuous by their absence, leaving unresolved the question of how beak movements are affected during singing. By using standard tract tracing methods, we sought to answer this question by defining beak premotor neurons and examining their afferent projections. In the caudal medulla, jaw premotor cell bodies were located adjacent to the terminal field of the output of the song system, into which many premotor neurons extended their dendrites. The premotor neurons also received a novel input from the trigeminal ganglion and an overlapping input from a lateral arcopallial component of a trigeminal sensorimotor circuit that traverses the forebrain. The ganglionic input in songbirds, which is not present in doves and pigeons that vocalize with a closed beak, may modulate the activity of beak premotor neurons in concert with the output of the song system. These inputs to jaw premotor neurons could, together, affect beak movements as a means of modulating filter properties of the upper vocal tract during singing. PMID:21858818
Computational model for vocal tract dynamics in a suboscine bird.
Assaneo, M F; Trevisan, M A
2010-09-01
In a recent work, active use of the vocal tract has been reported for singing oscines. The reconfiguration of the vocal tract during song serves to match its resonances to the syringeal fundamental frequency, demonstrating a precise coordination of the two main pieces of the avian vocal system for songbirds characterized by tonal songs. In this work we investigated the Great Kiskadee (Pitangus sulfuratus), a suboscine bird whose calls display a rich harmonic content. Using a recently developed mathematical model for the syrinx and a mobile vocal tract, we set up a computational model that provides a plausible reconstruction of the vocal tract movement using a few spectral features taken from the utterances. Moreover, synthetic calls were generated using the articulated vocal tract that accounts for all the acoustical features observed experimentally.
Singing modulates parvalbumin interneurons throughout songbird forebrain vocal control circuitry
Zengin-Toktas, Yildiz
2017-01-01
Across species, the performance of vocal signals can be modulated by the social environment. Zebra finches, for example, adjust their song performance when singing to females (‘female-directed’ or FD song) compared to when singing in isolation (‘undirected’ or UD song). These changes are salient, as females prefer the FD song over the UD song. Despite the importance of these performance changes, the neural mechanisms underlying this social modulation remain poorly understood. Previous work in finches has established that expression of the immediate early gene EGR1 is increased during singing and modulated by social context within the vocal control circuitry. Here, we examined whether particular neural subpopulations within those vocal control regions exhibit similar modulations of EGR1 expression. We compared EGR1 expression in neurons expressing parvalbumin (PV), a calcium buffer that modulates network plasticity and homeostasis, among males that performed FD song, males that produced UD song, or males that did not sing. We found that, overall, singing but not social context significantly affected EGR1 expression in PV neurons throughout the vocal control nuclei. We observed differences in EGR1 expression between two classes of PV interneurons in the basal ganglia nucleus Area X. Additionally, we found that singing altered the amount of PV expression in neurons in HVC and Area X and that distinct PV interneuron types in Area X exhibited different patterns of modulation by singing. These data indicate that throughout the vocal control circuitry the singing-related regulation of EGR1 expression in PV neurons may be less influenced by social context than in other neuron types and raise the possibility of cell-type specific differences in plasticity and calcium buffering. PMID:28235074
NASA Astrophysics Data System (ADS)
Doyle, Laurance R.; McCowan, Brenda; Hanser, Sean F.
2002-01-01
Information theory allows a quantification of the complexity of a given signaling system. We are applying information theory to dolphin whistle vocalizations, humpback whale songs, squirrel monkey chuck calls, and several other animal communication systems' in order to develop a quantitative and objective way to compare inter species communication systems' complexity. Once signaling units have been correctly classified the communication system must obey certain statistical distributions in order to contain complexity whether it is human languages, dolphin whistle vocalizations, or even a system of communication signals received from an extraterrestrial source.
Learned vocal and breathing behavior in an enculturated gorilla.
Perlman, Marcus; Clark, Nathaniel
2015-09-01
We describe the repertoire of learned vocal and breathing-related behaviors (VBBs) performed by the enculturated gorilla Koko. We examined a large video corpus of Koko and observed 439 VBBs spread across 161 bouts. Our analysis shows that Koko exercises voluntary control over the performance of nine distinctive VBBs, which involve variable coordination of her breathing, larynx, and supralaryngeal articulators like the tongue and lips. Each of these behaviors is performed in the context of particular manual action routines and gestures. Based on these and other findings, we suggest that vocal learning and the ability to exercise volitional control over vocalization, particularly in a multimodal context, might have figured relatively early into the evolution of language, with some rudimentary capacity in place at the time of our last common ancestor with great apes.
Imaizumi, Mitsuyoshi; Li-Jessen, Nicole Y K; Sato, Yuka; Yang, David T; Thibeault, Susan L
2017-04-01
One prospective treatment option for vocal fold scarring is regeneration with an engineered scaffold containing induced pluripotent stem cells (iPS). In the present study, we investigated the feasibility of utilizing an injectable hyaluronic acid (HA) scaffold encapsulated with human-iPS cell (hiPS) for regeneration of vocal folds. Thirty athymic nude rats underwent unilateral vocal fold injury. Contralateral vocal folds served as uninjured controls. Hyaluronic acid hydrogel scaffold, HA hydrogel scaffold containing hiPS, and HA hydrogel scaffold containing hiPS with epidermal growth factor (EGF) were injected in both vocal folds immediately after surgery. One and 2 weeks after injection, larynges were excised for histology, immunohistochemistry, and fluorescence in situ hybridization (FISH). Presence of HA hydrogel was confirmed in vocal folds 1 and 2 weeks post injection. The FISH analysis confirmed the presence and viability of hiPS in the injected vocal folds. Histological results demonstrated that vocal folds injected with HA hydrogel scaffold containing EGF demonstrated less fibrosis than those with HA hydrogel only. Human-iPS survived in injured rat vocal folds. The HA hydrogel with hiPS and EGF ameliorated the fibrotic response. Additional work is necessary to optimize hiPS differentiation and further confirm the safety of hiPS for clinical applications.
Elie, Julie E.; Theunissen, Frédéric E.
2018-01-01
Although a universal code for the acoustic features of animal vocal communication calls may not exist, the thorough analysis of the distinctive acoustical features of vocalization categories is important not only to decipher the acoustical code for a specific species but also to understand the evolution of communication signals and the mechanisms used to produce and understand them. Here, we recorded more than 8,000 examples of almost all the vocalizations of the domesticated zebra finch, Taeniopygia guttata: vocalizations produced to establish contact, to form and maintain pair bonds, to sound an alarm, to communicate distress or to advertise hunger or aggressive intents. We characterized each vocalization type using complete representations that avoided any a priori assumptions on the acoustic code, as well as classical bioacoustics measures that could provide more intuitive interpretations. We then used these acoustical features to rigorously determine the potential information-bearing acoustical features for each vocalization type using both a novel regularized classifier and an unsupervised clustering algorithm. Vocalization categories are discriminated by the shape of their frequency spectrum and by their pitch saliency (noisy to tonal vocalizations) but not particularly by their fundamental frequency. Notably, the spectral shape of zebra finch vocalizations contains peaks or formants that vary systematically across categories and that would be generated by active control of both the vocal organ (source) and the upper vocal tract (filter). PMID:26581377
Harold, Meredith Poore; Barlow, Steven M
2013-02-01
The vocalizations and jaw kinematics of 30 infants aged 6-8 months were recorded using a Motion Analysis System and audiovisual technologies. This study represents the first attempt to determine the effect of play environment on infants' rate of vocalization and jaw movement. Four play conditions were compared: watching videos, social contingent reinforcement and vocal modeling with an adult, playing alone with small toys, and playing alone with large toys. The fewest vocalizations and spontaneous movement were observed when infants were watching videos or interacting with an adult. Infants vocalized most when playing with large toys. The small toys, which naturally elicited gross motor movement (e.g., waving, banging, shaking), educed fewer vocalizations. This study was also the first to quantify the kinematics of vocalized and non-vocalized jaw movements of 6-8 month-old infants. Jaw kinematics did not differentiate infants who produced canonical syllables from those who did not. All infants produced many jaw movements without vocalization. However, during vocalization, infants were unlikely to move their jaw. This contradicts current theories that infant protophonic vocalizations are jaw-dominant. Results of the current study can inform socio-linguistic and kinematic theories of canonical babbling. Copyright © 2012 Elsevier Inc. All rights reserved.
Ghassemi, Marzyeh; Van Stan, Jarrad H; Mehta, Daryush D; Zañartu, Matías; Cheyne, Harold A; Hillman, Robert E; Guttag, John V
2014-06-01
Voice disorders are medical conditions that often result from vocal abuse/misuse which is referred to generically as vocal hyperfunction. Standard voice assessment approaches cannot accurately determine the actual nature, prevalence, and pathological impact of hyperfunctional vocal behaviors because such behaviors can vary greatly across the course of an individual's typical day and may not be clearly demonstrated during a brief clinical encounter. Thus, it would be clinically valuable to develop noninvasive ambulatory measures that can reliably differentiate vocal hyperfunction from normal patterns of vocal behavior. As an initial step toward this goal we used an accelerometer taped to the neck surface to provide a continuous, noninvasive acceleration signal designed to capture some aspects of vocal behavior related to vocal cord nodules, a common manifestation of vocal hyperfunction. We gathered data from 12 female adult patients diagnosed with vocal fold nodules and 12 control speakers matched for age and occupation. We derived features from weeklong neck-surface acceleration recordings by using distributions of sound pressure level and fundamental frequency over 5-min windows of the acceleration signal and normalized these features so that intersubject comparisons were meaningful. We then used supervised machine learning to show that the two groups exhibit distinct vocal behaviors that can be detected using the acceleration signal. We were able to correctly classify 22 of the 24 subjects, suggesting that in the future measures of the acceleration signal could be used to detect patients with the types of aberrant vocal behaviors that are associated with hyperfunctional voice disorders.
The brain of opera singers: experience-dependent changes in functional activation.
Kleber, B; Veit, R; Birbaumer, N; Gruzelier, J; Lotze, M
2010-05-01
Several studies have shown that motor-skill training over extended time periods results in reorganization of neural networks and changes in brain morphology. Yet, little is known about training-induced adaptive changes in the vocal system, which is largely subserved by intrinsic reflex mechanisms. We investigated highly accomplished opera singers, conservatory level vocal students, and laymen during overt singing of an Italian aria in a neuroimaging experiment. We provide the first evidence that the training of vocal skills is accompanied by increased functional activation of bilateral primary somatosensory cortex representing articulators and larynx. Opera singers showed additional activation in right primary sensorimotor cortex. Further training-related activation comprised the inferior parietal lobe and bilateral dorsolateral prefrontal cortex. At the subcortical level, expert singers showed increased activation in the basal ganglia, the thalamus, and the cerebellum. A regression analysis of functional activation with accumulated singing practice confirmed that vocal skills training correlates with increased activity of a cortical network for enhanced kinesthetic motor control and sensorimotor guidance together with increased involvement of implicit motor memory areas at the subcortical and cerebellar level. Our findings may have ramifications for both voice rehabilitation and deliberate practice of other implicit motor skills that require interoception.
Detection of the Vibration Signal from Human Vocal Folds Using a 94-GHz Millimeter-Wave Radar
Chen, Fuming; Li, Sheng; Zhang, Yang; Wang, Jianqi
2017-01-01
The detection of the vibration signal from human vocal folds provides essential information for studying human phonation and diagnosing voice disorders. Doppler radar technology has enabled the noncontact measurement of the human-vocal-fold vibration. However, existing systems must be placed in close proximity to the human throat and detailed information may be lost because of the low operating frequency. In this paper, a long-distance detection method, involving the use of a 94-GHz millimeter-wave radar sensor, is proposed for detecting the vibration signals from human vocal folds. An algorithm that combines empirical mode decomposition (EMD) and the auto-correlation function (ACF) method is proposed for detecting the signal. First, the EMD method is employed to suppress the noise of the radar-detected signal. Further, the ratio of the energy and entropy is used to detect voice activity in the radar-detected signal, following which, a short-time ACF is employed to extract the vibration signal of the human vocal folds from the processed signal. For validating the method and assessing the performance of the radar system, a vibration measurement sensor and microphone system are additionally employed for comparison. The experimental results obtained from the spectrograms, the vibration frequency of the vocal folds, and coherence analysis demonstrate that the proposed method can effectively detect the vibration of human vocal folds from a long detection distance. PMID:28282892
Effect of artificially lengthened vocal tract on vocal fold oscillation's fundamental frequency.
Hanamitsu, Masakazu; Kataoka, Hideyuki
2004-06-01
The fundamental frequency of vocal fold oscillation (F(0)) is controlled by laryngeal mechanics and aerodynamic properties. F(0) change per unit change of transglottal pressure (dF/dP) using a shutter valve has been studied and found to have nonlinear, V-shaped relationship with F(0). On the other hand, the vocal tract is also known to affect vocal fold oscillation. This study examined the effect of artificially lengthened vocal tract length on dF/dP. dF/dP was measured in six men using two mouthpieces of different lengths. The dF/dP graph for the longer vocal tract was shifted leftward relative to the shorter one. Using the one-mass model, the nadir of the "V" on the dF/dP graph was strongly influenced by the resonance around the first formant frequency. However, a more precise model is needed to account for the effects of viscosity and turbulence.
Behavior-Linked FoxP2 Regulation Enables Zebra Finch Vocal Learning
Heston, Jonathan B.
2015-01-01
Mutations in the FOXP2 transcription factor cause an inherited speech and language disorder, but how FoxP2 contributes to learning of these vocal communication signals remains unclear. FoxP2 is enriched in corticostriatal circuits of both human and songbird brains. Experimental knockdown of this enrichment in song control neurons of the zebra finch basal ganglia impairs tutor song imitation, indicating that adequate FoxP2 levels are necessary for normal vocal learning. In unmanipulated birds, vocal practice acutely downregulates FoxP2, leading to increased vocal variability and dynamic regulation of FoxP2 target genes. To determine whether this behavioral regulation is important for song learning, here, we used viral-driven overexpression of FoxP2 to counteract its downregulation. This manipulation disrupted the acute effects of song practice on vocal variability and caused inaccurate song imitation. Together, these findings indicate that dynamic behavior-linked regulation of FoxP2, rather than absolute levels, is critical for vocal learning. PMID:25698728
Bagley, Amy D.; Abramowitz, Carolyn S.; Kosson, David S.
2010-01-01
Deficits in emotion processing have been widely reported to be central to psychopathy. However, few prior studies have examined vocal affect recognition in psychopaths, and these studies suffer from significant methodological limitations. Moreover, prior studies have yielded conflicting findings regarding the specificity of psychopaths’ affect recognition deficits. This study examined vocal affect recognition in 107 male inmates under conditions requiring isolated prosodic vs. semantic analysis of affective cues and compared subgroups of offenders identified via cluster analysis on vocal affect recognition. Psychopaths demonstrated deficits in vocal affect recognition under conditions requiring use of semantic cues and conditions requiring use of prosodic cues. Moreover, both primary and secondary psychopaths exhibited relatively similar emotional deficits in the semantic analysis condition compared to nonpsychopathic control participants. This study demonstrates that psychopaths’ vocal affect recognition deficits are not due to methodological limitations of previous studies and provides preliminary evidence that primary and secondary psychopaths exhibit generally similar deficits in vocal affect recognition. PMID:19413412
Visual classification of feral cat Felis silvestris catus vocalizations.
Owens, Jessica L; Olsen, Mariana; Fontaine, Amy; Kloth, Christopher; Kershenbaum, Arik; Waller, Sara
2017-06-01
Cat vocal behavior, in particular, the vocal and social behavior of feral cats, is poorly understood, as are the differences between feral and fully domestic cats. The relationship between feral cat social and vocal behavior is important because of the markedly different ecology of feral and domestic cats, and enhanced comprehension of the repertoire and potential information content of feral cat calls can provide both better understanding of the domestication and socialization process, and improved welfare for feral cats undergoing adoption. Previous studies have used conflicting classification schemes for cat vocalizations, often relying on onomatopoeic or popular descriptions of call types (e.g., "miow"). We studied the vocalizations of 13 unaltered domestic cats that complied with our behavioral definition used to distinguish feral cats from domestic. A total of 71 acoustic units were extracted and visually analyzed for the construction of a hierarchical classification of vocal sounds, based on acoustic properties. We identified 3 major categories (tonal, pulse, and broadband) that further breakdown into 8 subcategories, and show a high degree of reliability when sounds are classified blindly by independent observers (Fleiss' Kappa K = 0.863). Due to the limited behavioral contexts in this study, additional subcategories of cat vocalizations may be identified in the future, but our hierarchical classification system allows for the addition of new categories and new subcategories as they are described. This study shows that cat vocalizations are diverse and complex, and provides an objective and reliable classification system that can be used in future studies.
NASA Astrophysics Data System (ADS)
Scheifele, Peter Martin
2003-06-01
Noise pollution has only recently become recognized as a potential danger to marine mammals in general, and to the Beluga Whale (Delphinapterus leucas) in particular. These small gregarious Odontocetes make extensive use of sound for social communication and pod cohesion. The St. Lawrence River Estuary is habitat to a small, critically endangered population of about 700 Beluga whales who congregate in four different sites in its upper estuary. The population is believed to be threatened by the stress of high-intensity, low frequency noise. One way to determine whether noise is having an effect on an animal's auditory ability might be to observe a natural and repeatable response of the auditory and vocal systems to varying noise levels. This can be accomplished by observing changes in animal vocalizations in response to auditory feedback. A response such as this observed in humans and some animals is known as the Lombard Vocal Response, which represents a reaction of the auditory system directly manifested by changes in vocalization level. In this research this population of Beluga Whales was tested to determine whether a vocalization-as-a-function-of-noise phenomenon existed by using Hidden Markhov "classified" vocalizations as targets for acoustical analyses. Correlation and regression analyses indicated that the phenomenon does exist and results of a human subjects experiment along with results from other animal species known to exhibit the response strongly implicate the Lombard Vocal Response in the Beluga.
Measurement of Lombard-like response in the beluga whale
NASA Astrophysics Data System (ADS)
Scheifele, Peter M.
2004-05-01
Noise pollution has become recognized as a potential danger to marine mammals in general, and to the St. Lawrence beluga (Delphinapterus leucas) in particular. One method to determine whether noise is having an effect on an animals auditory ability is to observe a natural and repeatable response of the auditory and vocal systems to varying noise levels. This can be accomplished by observing changes in animal vocalizations in response to auditory feedback. A response such as this observed in humans and some animals is known as the Lombard vocal response, which represents a reaction of the auditory system directly manifested by changes in vocalization level. This response is known in humans, songbirds, and some primates. In this research a population of belugas in the St. Lawrence River Estuary was tested to determine whether a vocalization-as-a-function-of-noise phenomenon existed by using hidden Markhov classified vocalizations as targets for acoustical analyses. Correlation and regression analyses of signals and noise indicated that the phenomenon does exist and results of a human subjects experiment along with results from other animal species known to exhibit the response strongly implicate the Lombard vocal response in the St. Lawrence population of beluga.
Vocal emotion of humanoid robots: a study from brain mechanism.
Wang, Youhui; Hu, Xiaohua; Dai, Weihui; Zhou, Jie; Kuo, Taitzong
2014-01-01
Driven by rapid ongoing advances in humanoid robot, increasing attention has been shifted into the issue of emotion intelligence of AI robots to facilitate the communication between man-machines and human beings, especially for the vocal emotion in interactive system of future humanoid robots. This paper explored the brain mechanism of vocal emotion by studying previous researches and developed an experiment to observe the brain response by fMRI, to analyze vocal emotion of human beings. Findings in this paper provided a new approach to design and evaluate the vocal emotion of humanoid robots based on brain mechanism of human beings.
ERIC Educational Resources Information Center
Yunusova, Yana; Weismer, Gary G.; Lindstrom, Mary J.
2011-01-01
Purpose: In this study, the authors classified vocalic segments produced by control speakers (C) and speakers with dysarthria due to amyotrophic lateral sclerosis (ALS) or Parkinson's disease (PD); classification was based on movement measures. The researchers asked the following questions: (a) Can vowels be classified on the basis of selected…
Behroozmand, Roozbeh; Ibrahim, Nadine; Korzyukov, Oleg; Robin, Donald A.; Larson, Charles R.
2015-01-01
The answer to the question of how the brain incorporates sensory feedback and links it with motor function to achieve goal-directed movement during vocalization remains unclear. We investigated the mechanisms of voice pitch motor control by examining the spectro-temporal dynamics of EEG signals when non-musicians (NM), relative pitch (RP), and absolute pitch (AP) musicians maintained vocalizations of a vowel sound and received randomized ± 100 cents pitch-shift stimuli in their auditory feedback. We identified a phase-synchronized (evoked) fronto-central activation within the theta band (5–8 Hz) that temporally overlapped with compensatory vocal responses to pitch-shifted auditory feedback and was significantly stronger in RP and AP musicians compared with non-musicians. A second component involved a non-phase-synchronized (induced) frontal activation within the delta band (1–4 Hz) that emerged at approximately 1 s after the stimulus onset. The delta activation was significantly stronger in the NM compared with RP and AP groups and correlated with the pitch rebound error (PRE), indicating the degree to which subjects failed to re-adjust their voice pitch to baseline after the stimulus offset. We propose that the evoked theta is a neurophysiological marker of enhanced pitch processing in musicians and reflects mechanisms by which humans incorporate auditory feedback to control their voice pitch. We also suggest that the delta activation reflects adaptive neural processes by which vocal production errors are monitored and used to update the state of sensory-motor networks for driving subsequent vocal behaviors. This notion is corroborated by our findings showing that larger PREs were associated with greater delta band activity in the NM compared with RP and AP groups. These findings provide new insights into the neural mechanisms of auditory feedback processing for vocal pitch motor control. PMID:25873858
Heimovics, Sarah A; Riters, Lauren V
2005-12-01
In some species, such as songbirds, much is known about how the brain regulates vocal learning, production, and perception. What remains a mystery is what regulates the motivation to communicate. European starlings (Sturnus vulgaris) sing throughout most of the year, but the social and environmental factors that motivate singing behavior differ seasonally. Male song is highly sexually motivated during, but not outside of, the breeding season. Brain areas outside the song control system, such as the medial preoptic nucleus (POM) and ventral tegmental area (VTA), have been implicated in regulating sexually motivated behaviors in birds, including song. The present study was designed to explore whether these regions, as well as three song control nuclei [area X, the high vocal center (HVC), and the robust nucleus of the arcopallium (RA)], might be involved differentially in song produced within compared to outside of a breeding context. We recorded the behavioral responses of breeding and nonbreeding condition male starlings to the introduction of a female conspecific. Males did not show context-dependent differences in the overall amount of song sung. However, immunocytochemistry for the protein product of the immediate early gene cFOS revealed a positive linear relationship between the total amount of songs sung and number of cFOS-labeled cells in POM, VTA, HVC, and RA for birds singing during, but not outside of, a breeding context. These results suggest that these regions differentially regulate male song production depending on reproductive context. Overall the data support the hypothesis that the POM and VTA interact with the song control system, specifically HVC and RA, to regulate sexually motivated vocal communication in songbirds.
Soderstrom, Ken; Tian, Qiyu
2008-01-01
CB1 cannabinoid receptors are distinctly expressed at high density within several regions of zebra finch telencephalon including those known to be involved in song learning (lMAN and Area X) and production (HVC and RA). Because: (1) exposure to cannabinoid agonists during developmental periods of auditory and sensory-motor song learning alters song patterns produced later in adulthood and; (2) densities of song region expression of CB1 waxes-and-wanes during song learning, it is becoming clear that CB1 receptor-mediated signaling is important to normal processes of vocal development. To better understand mechanisms involved in cannabinoid modulation of vocal behavior we have investigated the dose-response relationship between systemic cannabinoid exposure and changes in neuronal activity (as indicated by expression of the transcription factor, c-Fos) within telencephalic brain regions with established involvement in song learning and/or control. In adults we have found that low doses (0.1 mg/kg) of the cannabinoid agonist WIN-55212-2 decrease neuronal activity (as indicated by densities of c-fos-expressing nuclei) within vocal motor regions of caudal telencephalon (HVC and RA) while higher doses (3 mg/kg) stimulate activity. Both effects were reversed by pretreatment with the CB1-selective antagonist rimonabant. Interestingly, no effects of cannabinoid treatment were observed within the rostral song regions lMAN and Area X, despite distinct and dense CB1 receptor expression within these areas. Overall, our results demonstrate that, depending on dosage, CB1 agonism can both inhibit and stimulate neuronal activity within brain regions controlling adult vocal motor output, implicating involvement of multiple CB1-sensitive neuronal circuits. PMID:18509622
Van Stan, Jarrad H.; Mehta, Daryush D.; Zeitels, Steven M.; Burns, James A.; Barbu, Anca M.; Hillman, Robert E.
2015-01-01
Objectives Clinical management of phonotraumatic vocal fold lesions (nodules, polyps) is based largely on assumptions that abnormalities in habitual levels of sound pressure level (SPL), fundamental frequency (f0), and/or amount of voice use play a major role in lesion development and chronic persistence. This study used ambulatory voice monitoring to evaluate if significant differences in voice use exist between patients with phonotraumatic lesions and normal matched controls. Methods Subjects were 70 adult females: 35 with vocal fold nodules or polyps and 35 age-, sex-, and occupation-matched normal individuals. Weeklong summary statistics of voice use were computed from anterior neck surface acceleration recorded using a smartphone-based ambulatory voice monitor. Results Paired t-tests and Kolmogorov-Smirnov tests resulted in no statistically significant differences between patients and matched controls regarding average measures of SPL, f0, vocal dose measures, and voicing/voice rest periods. Paired t-tests comparing f0 variability between the groups resulted in statistically significant differences with moderate effect sizes. Conclusions Individuals with phonotraumatic lesions did not exhibit differences in average ambulatory measures of vocal behavior when compared with matched controls. More refined characterizations of underlying phonatory mechanisms and other potentially contributing causes are warranted to better understand risk factors associated with phonotraumatic lesions. PMID:26024911
Animal models of speech and vocal communication deficits associated with psychiatric disorders
Konopka, Genevieve; Roberts, Todd F.
2015-01-01
Disruptions in speech, language and vocal communication are hallmarks of several neuropsychiatric disorders, most notably autism spectrum disorders. Historically, the use of animal models to dissect molecular pathways and connect them to behavioral endophenotypes in cognitive disorders has proven to be an effective approach for developing and testing disease-relevant therapeutics. The unique aspects of human language when compared to vocal behaviors in other animals make such an approach potentially more challenging. However, the study of vocal learning in species with analogous brain circuits to humans may provide entry points for understanding this human-specific phenotype and diseases. Here, we review animal models of vocal learning and vocal communication, and specifically link phenotypes of psychiatric disorders to relevant model systems. Evolutionary constraints in the organization of neural circuits and synaptic plasticity result in similarities in the brain mechanisms for vocal learning and vocal communication. Comparative approaches and careful consideration of the behavioral limitations among different animal models can provide critical avenues for dissecting the molecular pathways underlying cognitive disorders that disrupt speech, language and vocal communication. PMID:26232298
The Distribution and Severity of Tremor in Speech Structures of Persons with Vocal Tremor.
Hemmerich, Abby L; Finnegan, Eileen M; Hoffman, Henry T
2017-05-01
Vocal tremor may be associated with cyclic oscillations in the pulmonary, laryngeal, velopharyngeal, or oral regions. This study aimed to correlate the overall severity of vocal tremor with the distribution and severity of tremor in structures involved. Endoscopic and clinical examinations were completed on 20 adults with vocal tremor and two age-matched controls during sustained phonation. Two judges rated the severity of vocal tremor and the severity of tremor affecting each of 13 structures. Participants with mild vocal tremor typically presented with tremor in three laryngeal structures, moderate vocal tremor in five structures (laryngeal and another region), and severe vocal tremor in eight structures affecting all regions. The severity of tremor was lowest (mean = 1.2 out of 3) in persons with mild vocal tremor and greater in persons with moderate (mean = 1.5) and severe vocal tremor (mean = 1.4). Laryngeal structures were most frequently (95%) and severely (1.7 out of 3) affected, followed by velopharynx (40% occurrence, 1.3 severity), pulmonary (40% occurrence, 1.1 severity), and oral (40% occurrence, 1.0 severity) regions. Regression analyses indicated tremor severity of the supraglottic structures, and vertical laryngeal movement contributed most to vocal tremor severity during sustained phonation (r = 0.77, F = 16.17, P < 0.0001). A strong positive correlation (r = 0.72) was found between the Tremor Index and the severity of the vocal tremor during sustained phonation. It is useful to obtain a wide endoscopic view of the larynx to visualize tremor, which is rarely isolated to the true vocal folds alone. Published by Elsevier Inc.
The importance of hyaluronic acid in vocal fold biomechanics.
Chan, R W; Gray, S D; Titze, I R
2001-06-01
This study examined the influence of hyaluronic acid (HA) on the biomechanical properties of the human vocal fold cover (the superficial layer of the lamina propria). Vocal fold tissues were freshly excised from 5 adult male cadavers and were treated with bovine testicular hyaluronidase to selectively remove HA from the lamina propria extracellular matrix (ECM). Linear viscoelastic shear properties (elastic shear modulus and dynamic viscosity) of the tissue samples before and after enzymatic treatment were quantified as a function of frequency (0.01 to 15 Hz) by a parallel-plate rotational rheometer at 37 degrees C. On removing HA from the vocal fold ECM, the elastic shear modulus (G' ) or stiffness of the vocal fold cover decreased by an average of around 35%, while the dynamic viscosity (eta') increased by 70% at higher frequencies (>1 Hz). The results suggested that HA plays an important role in determining the biomechanical properties of the vocal fold cover. As a highly hydrated glycosaminoglycan in the vocal fold ECM, it likely contributes to the maintenance of an optimal tissue viscosity that may facilitate phonation, and an optimal tissue stiffness that may be important for vocal fundamental frequency control. HA has been proposed as a potential bioimplant for the surgical repair of vocal fold ECM defects (eg, vocal fold scarring and sulcus vocalis). Our results suggested that such clinical use may be potentially optimal for voice production from a biomechanical perspective.
Zheng, Zane Z.; Vicente-Grabovetsky, Alejandro; MacDonald, Ewen N.; Munhall, Kevin G.; Cusack, Rhodri; Johnsrude, Ingrid S.
2013-01-01
The everyday act of speaking involves the complex processes of speech motor control. An important component of control is monitoring, detection and processing of errors when auditory feedback does not correspond to the intended motor gesture. Here we show, using fMRI and converging operations within a multi-voxel pattern analysis framework, that this sensorimotor process is supported by functionally differentiated brain networks. During scanning, a real-time speech-tracking system was employed to deliver two acoustically different types of distorted auditory feedback or unaltered feedback while human participants were vocalizing monosyllabic words, and to present the same auditory stimuli while participants were passively listening. Whole-brain analysis of neural-pattern similarity revealed three functional networks that were differentially sensitive to distorted auditory feedback during vocalization, compared to during passive listening. One network of regions appears to encode an ‘error signal’ irrespective of acoustic features of the error: this network, including right angular gyrus, right supplementary motor area, and bilateral cerebellum, yielded consistent neural patterns across acoustically different, distorted feedback types, only during articulation (not during passive listening). In contrast, a fronto-temporal network appears sensitive to the speech features of auditory stimuli during passive listening; this preference for speech features was diminished when the same stimuli were presented as auditory concomitants of vocalization. A third network, showing a distinct functional pattern from the other two, appears to capture aspects of both neural response profiles. Taken together, our findings suggest that auditory feedback processing during speech motor control may rely on multiple, interactive, functionally differentiated neural systems. PMID:23467350
Automated Assessment of Child Vocalization Development Using LENA.
Richards, Jeffrey A; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance
2017-07-12
To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Assessment was based on full-day audio recordings collected in a child's unrestricted, natural language environment. AVA estimates were derived using automatic speech recognition modeling techniques to categorize and quantify the sounds in child vocalizations (e.g., protophones and phonemes). These were expressed as phone and biphone frequencies, reduced to principal components, and inputted to age-based multiple linear regression models to predict independently collected criterion-expressive language scores. From these models, we generated vocal development AVA estimates as age-standardized scores and development age estimates. AVA estimates demonstrated strong statistical reliability and validity when compared with standard criterion expressive language assessments. Automated analysis of child vocalizations extracted from full-day recordings in natural settings offers a novel and efficient means to assess children's expressive vocal development. More research remains to identify specific mechanisms of operation.
Auditory Signal Processing in Communication: Perception and Performance of Vocal Sounds
Prather, Jonathan F.
2013-01-01
Learning and maintaining the sounds we use in vocal communication require accurate perception of the sounds we hear performed by others and feedback-dependent imitation of those sounds to produce our own vocalizations. Understanding how the central nervous system integrates auditory and vocal-motor information to enable communication is a fundamental goal of systems neuroscience, and insights into the mechanisms of those processes will profoundly enhance clinical therapies for communication disorders. Gaining the high-resolution insight necessary to define the circuits and cellular mechanisms underlying human vocal communication is presently impractical. Songbirds are the best animal model of human speech, and this review highlights recent insights into the neural basis of auditory perception and feedback-dependent imitation in those animals. Neural correlates of song perception are present in auditory areas, and those correlates are preserved in the auditory responses of downstream neurons that are also active when the bird sings. Initial tests indicate that singing-related activity in those downstream neurons is associated with vocal-motor performance as opposed to the bird simply hearing itself sing. Therefore, action potentials related to auditory perception and action potentials related to vocal performance are co-localized in individual neurons. Conceptual models of song learning involve comparison of vocal commands and the associated auditory feedback to compute an error signal that is used to guide refinement of subsequent song performances, yet the sites of that comparison remain unknown. Convergence of sensory and motor activity onto individual neurons points to a possible mechanism through which auditory and vocal-motor signals may be linked to enable learning and maintenance of the sounds used in vocal communication. PMID:23827717
Lester, Rosemary A.; Story, Brad H.
2015-01-01
The purpose of this study was to determine if adjustments to the voice source [i.e., fundamental frequency (F0), degree of vocal fold adduction] or vocal tract filter (i.e., vocal tract shape for vowels) reduce the perception of simulated laryngeal vocal tremor and to determine if listener perception could be explained by characteristics of the acoustical modulations. This research was carried out using a computational model of speech production that allowed for precise control and manipulation of the glottal and vocal tract configurations. Forty-two healthy adults participated in a perceptual study involving pair-comparisons of the magnitude of “shakiness” with simulated samples of laryngeal vocal tremor. Results revealed that listeners perceived a higher magnitude of voice modulation when simulated samples had a higher mean F0, greater degree of vocal fold adduction, and vocal tract shape for /i/ vs /ɑ/. However, the effect of F0 was significant only when glottal noise was not present in the acoustic signal. Acoustical analyses were performed with the simulated samples to determine the features that affected listeners' judgments. Based on regression analyses, listeners' judgments were predicted to some extent by modulation information present in both low and high frequency bands. PMID:26328711
Towards a computer-aided diagnosis system for vocal cord diseases.
Verikas, A; Gelzinis, A; Bacauskiene, M; Uloza, V
2006-01-01
The objective of this work is to investigate a possibility of creating a computer-aided decision support system for an automated analysis of vocal cord images aiming to categorize diseases of vocal cords. The problem is treated as a pattern recognition task. To obtain a concise and informative representation of a vocal cord image, colour, texture, and geometrical features are used. The representation is further analyzed by a pattern classifier categorizing the image into healthy, diffuse, and nodular classes. The approach developed was tested on 785 vocal cord images collected at the Department of Otolaryngology, Kaunas University of Medicine, Lithuania. A correct classification rate of over 87% was obtained when categorizing a set of unseen images into the aforementioned three classes. Bearing in mind the high similarity of the decision classes, the results obtained are rather encouraging and the developed tools could be very helpful for assuring objective analysis of the images of laryngeal diseases.
Human mutant huntingtin disrupts vocal learning in transgenic songbirds.
Liu, Wan-Chun; Kohn, Jessica; Szwed, Sarah K; Pariser, Eben; Sepe, Sharon; Haripal, Bhagwattie; Oshimori, Naoki; Marsala, Martin; Miyanohara, Atsushi; Lee, Ramee
2015-11-01
Speech and vocal impairments characterize many neurological disorders. However, the neurogenetic mechanisms of these disorders are not well understood, and current animal models do not have the necessary circuitry to recapitulate vocal learning deficits. We developed germline transgenic songbirds, zebra finches (Taneiopygia guttata) expressing human mutant huntingtin (mHTT), a protein responsible for the progressive deterioration of motor and cognitive function in Huntington's disease (HD). Although generally healthy, the mutant songbirds had severe vocal disorders, including poor vocal imitation, stuttering, and progressive syntax and syllable degradation. Their song abnormalities were associated with HD-related neuropathology and dysfunction of the cortical-basal ganglia (CBG) song circuit. These transgenics are, to the best of our knowledge, the first experimentally created, functional mutant songbirds. Their progressive and quantifiable vocal disorder, combined with circuit dysfunction in the CBG song system, offers a model for genetic manipulation and the development of therapeutic strategies for CBG-related vocal and motor disorders.
The role of allergy in the etiology of Reinke's edema on vocal folds.
Kravos, Alenka; Zupevc, Avgust; Cizmarevic, Bogdan; Hocevar-Boltezar, Irena
2010-05-01
To identify the role of allergy in the occurrence of Reinke's edema on vocal folds. This was a prospective study in patients with Reinke's edema on vocal folds (n = 80) and a control group of patients without Reinke's edema (n = 80). All the patients were examined by the same ENT specialist at an ENT outpatient center in the period 2001-2003. The control group comprised 80 successive patients from the same clinic who were willing to cooperate in the study and were not already being treated for allergic disease by any specialist other than an ENT specialist. Patients from both groups were examined by rigid laryngoscope, skin prick allergic testing (type I allergy) and laboratory determination of serum thyroid-stimulating hormone level. The prevalence of allergy was not significantly different between the two groups: 20.0% in the patients with Reinke's edema and 23.8% in the control group (P = 0.576). No significant difference was detected in the prevalence of allergy when comparing patients with recurrent Reinke's edema after previous surgical treatment (n = 10) with patients with a first occurrence (n = 70) (P = 1.000). However, we found significantly more type I allergy in the patients with Reinke's edema and chronic hyperplastic rhinitis than in the patients with Reinke's edema without chronic hyperplastic rhinitis. All other risk factors (reflux of gastric content to the larynx, vocal abuse or misuse, unfavorable microclimate conditions at work, thyroid diseases) appeared significantly more often in the group with Reinke's edema on vocal folds than in the control group. Type I allergy is not a crucial factor in the pathogenesis of Reinke's edema on vocal folds or in recurrence of the disease after surgical treatment.
ERIC Educational Resources Information Center
Coppoolse, Jiska M. S.; Van Kooten, T. G.; Heris, Hossein K.; Mongeau, Luc; Li, Nicole Y. K.; Thibeault, Susan L.; Pitaro, Jacob; Akinpelu, Olubunm; Daniel, Sam J.
2014-01-01
Purpose: The objective of this study was to investigate local injection with a hierarchically microstructured hyaluronic acid-gelatin (HA-Ge) hydrogel for the treatment of acute vocal fold injury using a rat model. Method: Vocal fold stripping was performed unilaterally in 108 Sprague-Dawley rats. A volume of 25 µl saline (placebo controls),…
Meaux, Lauren T; Mitchell, Kyle R; Cohen, Alex S
2018-05-01
Patients with schizophrenia are consistently rated by clinicians as having high levels of blunted vocal affect and alogia. However, objective technologies have often failed to substantiate these abnormalities. It could be the case that negative symptoms are context-dependent. The present study examined speech elicited under conditions demonstrated to exacerbate thought disorder. The Rorschach Test was administered to 36 outpatients with schizophrenia and 25 nonpatient controls. Replies to separate "perceptual" and "memory" phases were analyzed using validated acoustic analytic methods. Compared to nonpatient controls, schizophrenia patients did not display abnormal speech expression on objective measure of blunted vocal affect or alogia. Moreover, clinical ratings of negative symptoms were not significantly correlated with objective measures. These findings suggest that in patients with schizophrenia, vocal affect/alogia is generally unremarkable under ambiguous conditions. Clarifying the nature of blunted vocal affect and alogia, and how objective measures correspond to what clinicians attend to when making clinical ratings are important directions for future research. Copyright © 2018 Elsevier Inc. All rights reserved.
Vocal repertoire of the social giant otter.
Leuchtenberger, Caroline; Sousa-Lima, Renata; Duplaix, Nicole; Magnusson, William E; Mourão, Guilherme
2014-11-01
According to the "social intelligence hypothesis," species with complex social interactions have more sophisticated communication systems. Giant otters (Pteronura brasiliensis) live in groups with complex social interactions. It is likely that the vocal communication of giant otters is more sophisticated than previous studies suggest. The objectives of the current study were to describe the airborne vocal repertoire of giant otters in the Pantanal area of Brazil, to analyze call types within different behavioral contexts, and to correlate vocal complexity with level of sociability of mustelids to verify whether or not the result supports the social intelligence hypothesis. The behavior of nine giant otters groups was observed. Vocalizations recorded were acoustically and statistically analyzed to describe the species' repertoire. The repertoire was comprised by 15 sound types emitted in different behavioral contexts. The main behavioral contexts of each sound type were significantly associated with the acoustic variable ordination of different sound types. A strong correlation between vocal complexity and sociability was found for different species, suggesting that the communication systems observed in the family mustelidae support the social intelligence hypothesis.
Roy, Nelson; Fetrow, Rebecca A; Merrill, Ray M; Dromey, Christopher
2016-10-01
Vocal hyperfunction, related to abnormal laryngeal muscle activity, is considered the proximal cause of primary muscle tension dysphonia (pMTD). Relative fundamental frequency (RFF) has been proposed as an objective acoustic marker of vocal hyperfunction. This study examined (a) the ability of RFF to track changes in vocal hyperfunction after treatment for pMTD and (b) the influence of dysphonia severity, among other factors, on the feasibility of RFF computation. RFF calculations and dysphonia severity ratings were derived from pre- and posttreatment recordings from 111 women with pMTD and 20 healthy controls. Three vowel-voiceless consonant-vowel stimuli were analyzed. RFF onset slope consistently varied as a function of group (pMTD vs. controls) and time (pretherapy vs. posttherapy). Significant correlations between RFF onset cycle 1 and dysphonia severity were observed. However, in many samples, RFF could not be computed, and adjusted odds ratios revealed that these unanalyzable data were linked to dysphonia severity, phonetic (vowel-voiceless consonant-vowel) context, and group (pMTD vs. control). RFF onset appears to be sensitive to the presence and degree of suspected vocal hyperfunction before and after therapy. The large number of unanalyzable samples (related especially to dysphonia severity in the pMTD group) represents an important limitation.
Optogenetic control of contractile function in skeletal muscle
Bruegmann, Tobias; van Bremen, Tobias; Vogt, Christoph C.; Send, Thorsten; Fleischmann, Bernd K.; Sasse, Philipp
2015-01-01
Optogenetic stimulation allows activation of cells with high spatial and temporal precision. Here we show direct optogenetic stimulation of skeletal muscle from transgenic mice expressing the light-sensitive channel Channelrhodopsin-2 (ChR2). Largest tetanic contractions are observed with 5-ms light pulses at 30 Hz, resulting in 84% of the maximal force induced by electrical stimulation. We demonstrate the utility of this approach by selectively stimulating with a light guide individual intralaryngeal muscles in explanted larynges from ChR2-transgenic mice, which enables selective opening and closing of the vocal cords. Furthermore, systemic injection of adeno-associated virus into wild-type mice provides sufficient ChR2 expression for optogenetic opening of the vocal cords. Thus, direct optogenetic stimulation of skeletal muscle generates large force and provides the distinct advantage of localized and cell-type-specific activation. This technology could be useful for therapeutic purposes, such as restoring the mobility of the vocal cords in patients suffering from laryngeal paralysis. PMID:26035411
Zhang, Zhaoyan
2016-01-01
The goal of this study is to better understand the cause-effect relation between vocal fold physiology and the resulting vibration pattern and voice acoustics. Using a three-dimensional continuum model of phonation, the effects of changes in vocal fold stiffness, medial surface thickness in the vertical direction, resting glottal opening, and subglottal pressure on vocal fold vibration and different acoustic measures are investigated. The results show that the medial surface thickness has dominant effects on the vertical phase difference between the upper and lower margins of the medial surface, closed quotient, H1-H2, and higher-order harmonics excitation. The main effects of vocal fold approximation or decreasing resting glottal opening are to lower the phonation threshold pressure, reduce noise production, and increase the fundamental frequency. Increasing subglottal pressure is primarily responsible for vocal intensity increase but also leads to significant increase in noise production and an increased fundamental frequency. Increasing AP stiffness significantly increases the fundamental frequency and slightly reduces noise production. The interaction among vocal fold thickness, stiffness, approximation, and subglottal pressure in the control of F0, vocal intensity, and voice quality is discussed. PMID:27106298
Vocal function in introverts and extraverts during a psychological stress reactivity protocol.
Dietrich, Maria; Verdolini Abbott, Katherine
2012-06-01
To examine the proposal that introversion predictably influences extralaryngeal and vocal behavior in vocally healthy individuals compared with individuals with extraversion and whether differences are of a nature that may support a risk hypothesis for primary muscle tension dysphonia. Fifty-four vocally healthy female adults between the ages of 18 and 35 years were divided into 2 groups: introversion (n = 27) and extraversion (n = 27). All participants completed a psychological stress reactivity experiment. Before, during, and after the stressor (public speaking), participants were assessed on extralaryngeal muscle activity (surface electromyography: submental, infrahyoid; control site: tibialis anterior), perceived vocal effort, and vocal acoustics (fundamental frequency and intensity). Participants in the introversion group exhibited significantly greater infrahyoid muscle activity throughout the protocol and during perceived stress than participants in the extraversion group. For both groups, perceived vocal effort significantly increased during stress, and acoustic measures significantly decreased. Infrahyoid muscle activity during the stress phase was significantly correlated with introversion and Voice Handicap Index scores but not with vocal effort scores. The data provided evidence of distinct differences in extralaryngeal behavior between introverts and extraverts. The findings are consistent with the trait theory of voice disorders (Roy & Bless, 2000).
Aeroelastic Model of Vocal-Fold Vibrating Element for Studying the Phonation Threshold
NASA Astrophysics Data System (ADS)
Horáček, J.; Švec, J. G.
2002-10-01
An original theoretical model for vibration onset of the vocal folds in the air-flow coming from the human subglottal tract is designed, which allows studying the influence of the physical properties of the vocal folds (e.g., geometrical shape, mass, viscosity) on their vibration characteristics (such as the natural frequencies, mode shapes of vibration and the thresholds of instability). The mathematical model of the vocal fold is designed as a simplified dynamic system of two degrees of freedom (rotation and translation) vibrating on an elastic foundation in the wall of a channel conveying air. An approximate unsteady one-dimensional flow theory for the inviscid incompressible fluid is presented for the phonatory air-flow. A generally defined shape of the vocal-fold surface is considered for expressing the unsteady aerodynamic forces in the glottis. The parameters of the mechanical part of the model, i.e., the mass, stiffness and damping matrices, are related to the geometry and material density of the vocal folds as well as to the fundamental natural frequency and damping known from experiments. The coupled numerical solution yields the vibration characteristics (natural frequencies, damping and mode shapes of vibration), including the instability thresholds of the aeroelastic system. The vibration characteristics obtained from the coupled numerical solution of the system appear to be in reasonable qualitative agreement with the physiological data and clinical observations. The model is particularly suitable for studying the phonation threshold, i.e., the onset of vibration of the vocal folds.
Visual classification of feral cat Felis silvestris catus vocalizations
Owens, Jessica L.; Olsen, Mariana; Fontaine, Amy; Kloth, Christopher; Kershenbaum, Arik
2017-01-01
Abstract Cat vocal behavior, in particular, the vocal and social behavior of feral cats, is poorly understood, as are the differences between feral and fully domestic cats. The relationship between feral cat social and vocal behavior is important because of the markedly different ecology of feral and domestic cats, and enhanced comprehension of the repertoire and potential information content of feral cat calls can provide both better understanding of the domestication and socialization process, and improved welfare for feral cats undergoing adoption. Previous studies have used conflicting classification schemes for cat vocalizations, often relying on onomatopoeic or popular descriptions of call types (e.g., “miow”). We studied the vocalizations of 13 unaltered domestic cats that complied with our behavioral definition used to distinguish feral cats from domestic. A total of 71 acoustic units were extracted and visually analyzed for the construction of a hierarchical classification of vocal sounds, based on acoustic properties. We identified 3 major categories (tonal, pulse, and broadband) that further breakdown into 8 subcategories, and show a high degree of reliability when sounds are classified blindly by independent observers (Fleiss’ Kappa K = 0.863). Due to the limited behavioral contexts in this study, additional subcategories of cat vocalizations may be identified in the future, but our hierarchical classification system allows for the addition of new categories and new subcategories as they are described. This study shows that cat vocalizations are diverse and complex, and provides an objective and reliable classification system that can be used in future studies. PMID:29491992
Vocal Emotion of Humanoid Robots: A Study from Brain Mechanism
Wang, Youhui; Hu, Xiaohua; Zhou, Jie; Kuo, Taitzong
2014-01-01
Driven by rapid ongoing advances in humanoid robot, increasing attention has been shifted into the issue of emotion intelligence of AI robots to facilitate the communication between man-machines and human beings, especially for the vocal emotion in interactive system of future humanoid robots. This paper explored the brain mechanism of vocal emotion by studying previous researches and developed an experiment to observe the brain response by fMRI, to analyze vocal emotion of human beings. Findings in this paper provided a new approach to design and evaluate the vocal emotion of humanoid robots based on brain mechanism of human beings. PMID:24587712
Acoustic signatures of sound source-tract coupling.
Arneodo, Ezequiel M; Perl, Yonatan Sanz; Mindlin, Gabriel B
2011-04-01
Birdsong is a complex behavior, which results from the interaction between a nervous system and a biomechanical peripheral device. While much has been learned about how complex sounds are generated in the vocal organ, little has been learned about the signature on the vocalizations of the nonlinear effects introduced by the acoustic interactions between a sound source and the vocal tract. The variety of morphologies among bird species makes birdsong a most suitable model to study phenomena associated to the production of complex vocalizations. Inspired by the sound production mechanisms of songbirds, in this work we study a mathematical model of a vocal organ, in which a simple sound source interacts with a tract, leading to a delay differential equation. We explore the system numerically, and by taking it to the weakly nonlinear limit, we are able to examine its periodic solutions analytically. By these means we are able to explore the dynamics of oscillatory solutions of a sound source-tract coupled system, which are qualitatively different from those of a sound source-filter model of a vocal organ. Nonlinear features of the solutions are proposed as the underlying mechanisms of observed phenomena in birdsong, such as unilaterally produced "frequency jumps," enhancement of resonances, and the shift of the fundamental frequency observed in heliox experiments. ©2011 American Physical Society
Acoustic signatures of sound source-tract coupling
Arneodo, Ezequiel M.; Perl, Yonatan Sanz; Mindlin, Gabriel B.
2014-01-01
Birdsong is a complex behavior, which results from the interaction between a nervous system and a biomechanical peripheral device. While much has been learned about how complex sounds are generated in the vocal organ, little has been learned about the signature on the vocalizations of the nonlinear effects introduced by the acoustic interactions between a sound source and the vocal tract. The variety of morphologies among bird species makes birdsong a most suitable model to study phenomena associated to the production of complex vocalizations. Inspired by the sound production mechanisms of songbirds, in this work we study a mathematical model of a vocal organ, in which a simple sound source interacts with a tract, leading to a delay differential equation. We explore the system numerically, and by taking it to the weakly nonlinear limit, we are able to examine its periodic solutions analytically. By these means we are able to explore the dynamics of oscillatory solutions of a sound source-tract coupled system, which are qualitatively different from those of a sound source-filter model of a vocal organ. Nonlinear features of the solutions are proposed as the underlying mechanisms of observed phenomena in birdsong, such as unilaterally produced “frequency jumps,” enhancement of resonances, and the shift of the fundamental frequency observed in heliox experiments. PMID:21599213
Warhurst, Samantha; McCabe, Patricia; Heard, Rob; Yiu, Edwin; Wang, Gaowu; Madill, Catherine
2014-01-01
Purpose Acoustic and perceptual studies show a number of differences between the voices of radio performers and controls. Despite this, the vocal fold kinematics underlying these differences are largely unknown. Using high-speed videoendoscopy, this study sought to determine whether the vocal vibration features of radio performers differed from those of non-performing controls. Method Using high-speed videoendoscopy, recordings of a mid-phonatory/i/ in 16 male radio performers (aged 25–52 years) and 16 age-matched controls (aged 25–52 years) were collected. Videos were extracted and analysed semi-automatically using High-Speed Video Program, obtaining measures of fundamental frequency (f0), open quotient and speed quotient. Post-hoc analyses of sound pressure level (SPL) were also performed (n = 19). Pearson's correlations were calculated between SPL and both speed and open quotients. Results Male radio performers had a significantly higher speed quotient than their matched controls (t = 3.308, p = 0.005). No significant differences were found for f0 or open quotient. No significant correlation was found between either open or speed quotient with SPL. Discussion A higher speed quotient in male radio performers suggests that their vocal fold vibration was characterised by a higher ratio of glottal opening to closing times than controls. This result may explain findings of better voice quality, higher equivalent sound level and greater spectral tilt seen in previous research. Open quotient was not significantly different between groups, indicating that the durations of complete vocal fold closure were not different between the radio performers and controls. Further validation of these results is required to determine the aetiology of the higher speed quotient result and its implications for voice training and clinical management in performers. PMID:24971625
Indication of a Lombard vocal response in the St. Lawrence River beluga
NASA Astrophysics Data System (ADS)
Scheifele, P. M.; Andrew, S.; Cooper, R. A.; Darre, M.; Musiek, F. E.; Max, L.
2005-03-01
Noise pollution is recognized as a potential danger to marine mammals in general, and to the St. Lawrence beluga in particular. One method of determining the impacts of noise on an animal's communication is to observe a natural and repeatable response of the vocal system to variations in noise level. This is accomplished by observing intensity changes in animal vocalizations in response to environmental noise. One such response observed in humans, songbirds, and some primates is the Lombard vocal response. This response represents a vocal system reaction manifested by changes in vocalization level in direct response to changes in the noise field. In this research, a population of belugas in the St. Lawrence River Estuary was tested to determine whether a Lombard response existed by using hidden Markhov-classified vocalizations as targets for acoustical analyses. Correlation and regression analyses of signals and noise indicated that the phenomenon does exist. Further, results of human subjects experiments [Egan, J. J. (1966), Ph.D. dissertation; Scheifele, P. M. (2003), Ph.D. dissertation], along with previously reported data from other animal species, are similar to those exhibited by the belugas. Overall, findings suggest that typical noise levels in the St. Lawrence River Estuary have a detectable effect on the communication of the beluga. .
Temporal signatures of processing voiceness and emotion in sound
Gunter, Thomas C.
2017-01-01
Abstract This study explored the temporal course of vocal and emotional sound processing. Participants detected rare repetitions in a stimulus stream comprising neutral and surprised non-verbal exclamations and spectrally rotated control sounds. Spectral rotation preserved some acoustic and emotional properties of the vocal originals. Event-related potentials elicited to unrepeated sounds revealed effects of voiceness and emotion. Relative to non-vocal sounds, vocal sounds elicited a larger centro-parietally distributed N1. This effect was followed by greater positivity to vocal relative to non-vocal sounds beginning with the P2 and extending throughout the recording epoch (N4, late positive potential) with larger amplitudes in female than in male listeners. Emotion effects overlapped with the voiceness effects but were smaller and differed topographically. Voiceness and emotion interacted only for the late positive potential, which was greater for vocal-emotional as compared with all other sounds. Taken together, these results point to a multi-stage process in which voiceness and emotionality are represented independently before being integrated in a manner that biases responses to stimuli with socio-emotional relevance. PMID:28338796
Temporal signatures of processing voiceness and emotion in sound.
Schirmer, Annett; Gunter, Thomas C
2017-06-01
This study explored the temporal course of vocal and emotional sound processing. Participants detected rare repetitions in a stimulus stream comprising neutral and surprised non-verbal exclamations and spectrally rotated control sounds. Spectral rotation preserved some acoustic and emotional properties of the vocal originals. Event-related potentials elicited to unrepeated sounds revealed effects of voiceness and emotion. Relative to non-vocal sounds, vocal sounds elicited a larger centro-parietally distributed N1. This effect was followed by greater positivity to vocal relative to non-vocal sounds beginning with the P2 and extending throughout the recording epoch (N4, late positive potential) with larger amplitudes in female than in male listeners. Emotion effects overlapped with the voiceness effects but were smaller and differed topographically. Voiceness and emotion interacted only for the late positive potential, which was greater for vocal-emotional as compared with all other sounds. Taken together, these results point to a multi-stage process in which voiceness and emotionality are represented independently before being integrated in a manner that biases responses to stimuli with socio-emotional relevance. © The Author (2017). Published by Oxford University Press.
Behavior-linked FoxP2 regulation enables zebra finch vocal learning.
Heston, Jonathan B; White, Stephanie A
2015-02-18
Mutations in the FOXP2 transcription factor cause an inherited speech and language disorder, but how FoxP2 contributes to learning of these vocal communication signals remains unclear. FoxP2 is enriched in corticostriatal circuits of both human and songbird brains. Experimental knockdown of this enrichment in song control neurons of the zebra finch basal ganglia impairs tutor song imitation, indicating that adequate FoxP2 levels are necessary for normal vocal learning. In unmanipulated birds, vocal practice acutely downregulates FoxP2, leading to increased vocal variability and dynamic regulation of FoxP2 target genes. To determine whether this behavioral regulation is important for song learning, here, we used viral-driven overexpression of FoxP2 to counteract its downregulation. This manipulation disrupted the acute effects of song practice on vocal variability and caused inaccurate song imitation. Together, these findings indicate that dynamic behavior-linked regulation of FoxP2, rather than absolute levels, is critical for vocal learning. Copyright © 2015 the authors 0270-6474/15/352885-10$15.00/0.
The Effectiveness of Low-Level Light Therapy in Attenuating Vocal Fatigue.
Kagan, Loraine Sydney; Heaton, James T
2017-05-01
Low-level light therapy (LLLT) is effective in reducing inflammation, promoting wound healing, and preventing tissue damage, but has not yet been studied in the treatment of voice disorders. The objective of this study was to investigate the possible effectiveness of LLLT in attenuating symptoms of vocal fatigue created by a vocal loading task as measured by acoustic, aerodynamic, and self-reported vocal effort. In a randomized, prospective study, 16 vocally healthy adults divided into four groups underwent a 1-hour vocal loading procedure, followed by infrared wavelength LLLT (828 nm), red wavelength LLLT (628 nm), heat, or no heat-light (control) treatment targeting the laryngeal region of the ventral neck surface. Phonation threshold pressure (PTP), relative fundamental frequency (RFF), and the inability to produce soft voice (IPSV) self-perceptual rating scale were recorded (1) at baseline, (2) immediately after vocal loading, (3) after treatment, and (4) 1 hour after treatment. Vocal loading significantly increased PTP and IPSV and decreased onset and offset RFFs, consistent with a shift toward vocal dysfunction. Red light significantly normalized the combination of PTP, IPSV, and RFF measures compared to other conditions. RFF is sensitive to a vocal loading task in conjunction with PTP and IPSV, and red LLLT may have a normalizing effect on objective and subjective measures of vocal fatigue. The results of this study lay the groundwork and rationale for future research to optimize LLLT wavelength combinations and overall dose. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vocal Fold Epithelial Barrier in Health and Injury A Research Review
Levendoski, Elizabeth Erickson; Leydon, Ciara; Thibeault, Susan L.
2015-01-01
Purpose Vocal fold epithelium is composed of layers of individual epithelial cells joined by junctional complexes constituting a unique interface with the external environment. This barrier provides structural stability to the vocal folds and protects underlying connective tissue from injury while being nearly continuously exposed to potentially hazardous insults including environmental or systemic-based irritants such as pollutants and reflux, surgical procedures, and vibratory trauma. Small disruptions in the epithelial barrier may have a large impact on susceptibility to injury and overall vocal health. The purpose of this article is to provide a broad-based review of our current knowledge of the vocal fold epithelial barrier. Methods A comprehensive review of the literature was conducted. Details of the structure of the vocal fold epithelial barrier are presented and evaluated in the context of function in injury and pathology. The importance of the epithelial-associated vocal fold mucus barrier is also introduced. Results/Conclusions Information presented in this review is valuable for clinicians and researchers as it highlights the importance of this understudied portion of the vocal folds to overall vocal health and disease. Prevention and treatment of injury to the epithelial barrier is a significant area awaiting further investigation. PMID:24686981
Saranathan, Vinodkumar; Hamilton, Deborah; Powell, George V N; Kroodsma, Donald E; Prum, Richard O
2007-09-01
Vocal learning is thought to have evolved in three clades of birds (parrots, hummingbirds, and oscine passerines), and three clades of mammals (whales, bats, and primates). Behavioural data indicate that, unlike other suboscine passerines, the three-wattled bellbird Procnias tricarunculata (Cotingidae) is capable of vocal learning. Procnias tricarunculata shows conspicuous vocal ontogeny, striking geographical variation in song, and rapid temporal change in song within a population. Deprivation studies of vocal development in P. tricarunculata are impractical. Here, we report evidence from mitochondrial DNA sequences and nuclear microsatellite loci that genetic variation within and among the four allopatric breeding populations of P. tricarunculata is not congruent with variation in vocal behaviour. Sequences of the mitochondrial DNA control region document extensive haplotype sharing among localities and song types, and no phylogenetic resolution of geographical populations or behavioural groups. The vocally differentiated, allopatric breeding populations of P. tricarunculata are only weakly genetically differentiated populations, and are not distinct taxa. Mitochondrial DNA and microsatellite variation show small (2.9% and 13.5%, respectively) but significant correlation with geographical distance, but no significant residual variation by song type. Estimates of the strength of selection that would be needed to maintain the observed geographical pattern in vocal differentiation if songs were genetically based are unreasonably high, further discrediting the hypothesis of a genetic origin of vocal variation. These data support a fourth, phylogenetically independent origin of avian vocal learning in Procnias. Geographical variations in P. tricarunculata vocal behaviour are likely culturally evolved dialects.
Interaction between telencephalic signals and respiratory dynamics in songbirds
Méndez, Jorge M.; Mindlin, Gabriel B.
2012-01-01
The mechanisms by which telencephalic areas affect motor activities are largely unknown. They could either take over motor control from downstream motor circuits or interact with the intrinsic dynamics of these circuits. Both models have been proposed for telencephalic control of respiration during learned vocal behavior in birds. The interactive model postulates that simple signals from the telencephalic song control areas are sufficient to drive the nonlinear respiratory network into producing complex temporal sequences. We tested this basic assumption by electrically stimulating telencephalic song control areas and analyzing the resulting respiratory patterns in zebra finches and in canaries. We found strong evidence for interaction between the rhythm of stimulation and the intrinsic respiratory rhythm, including naturally emerging subharmonic behavior and integration of lateralized telencephalic input. The evidence for clear interaction in our experimental paradigm suggests that telencephalic vocal control also uses a similar mechanism. Furthermore, species differences in the response of the respiratory system to stimulation show parallels to differences in the respiratory patterns of song, suggesting that the interactive production of respiratory rhythms is manifested in species-specific specialization of the involved circuitry. PMID:22402649
A hypothesis on a role of oxytocin in the social mechanisms of speech and vocal learning.
Theofanopoulou, Constantina; Boeckx, Cedric; Jarvis, Erich D
2017-08-30
Language acquisition in humans and song learning in songbirds naturally happen as a social learning experience, providing an excellent opportunity to reveal social motivation and reward mechanisms that boost sensorimotor learning. Our knowledge about the molecules and circuits that control these social mechanisms for vocal learning and language is limited. Here we propose a hypothesis of a role for oxytocin (OT) in the social motivation and evolution of vocal learning and language. Building upon existing evidence, we suggest specific neural pathways and mechanisms through which OT might modulate vocal learning circuits in specific developmental stages. © 2017 The Authors.
A hypothesis on a role of oxytocin in the social mechanisms of speech and vocal learning
Jarvis, Erich D.
2017-01-01
Language acquisition in humans and song learning in songbirds naturally happen as a social learning experience, providing an excellent opportunity to reveal social motivation and reward mechanisms that boost sensorimotor learning. Our knowledge about the molecules and circuits that control these social mechanisms for vocal learning and language is limited. Here we propose a hypothesis of a role for oxytocin (OT) in the social motivation and evolution of vocal learning and language. Building upon existing evidence, we suggest specific neural pathways and mechanisms through which OT might modulate vocal learning circuits in specific developmental stages. PMID:28835557
Dissociable Effects on Birdsong of Androgen Signaling in Cortex-Like Brain Regions of Canaries
2017-01-01
The neural basis of how learned vocalizations change during development and in adulthood represents a major challenge facing cognitive neuroscience. This plasticity in the degree to which learned vocalizations can change in both humans and songbirds is linked to the actions of sex steroid hormones during ontogeny but also in adulthood in the context of seasonal changes in birdsong. We investigated the role of steroid hormone signaling in the brain on distinct features of birdsong using adult male canaries (Serinus canaria), which show extensive seasonal vocal plasticity as adults. Specifically, we bilaterally implanted the potent androgen receptor antagonist flutamide in two key brain regions that control birdsong. We show that androgen signaling in the motor cortical-like brain region, the robust nucleus of the arcopallium (RA), controls syllable and trill bandwidth stereotypy, while not significantly affecting higher order features of song such syllable-type usage (i.e., how many times each syllable type is used) or syllable sequences. In contrast, androgen signaling in the premotor cortical-like brain region, HVC (proper name), controls song variability by increasing the variability of syllable-type usage and syllable sequences, while having no effect on syllable or trill bandwidth stereotypy. Other aspects of song, such as the duration of trills and the number of syllables per song, were also differentially affected by androgen signaling in HVC versus RA. These results implicate androgens in regulating distinct features of complex motor output in a precise and nonredundant manner. SIGNIFICANCE STATEMENT Vocal plasticity is linked to the actions of sex steroid hormones, but the precise mechanisms are unclear. We investigated this question in adult male canaries (Serinus canaria), which show extensive vocal plasticity throughout their life. We show that androgens in two cortex-like vocal control brain regions regulate distinct aspects of vocal plasticity. For example, in HVC (proper name), androgens regulate variability in syntax but not phonology, whereas androgens in the robust nucleus of the arcopallium (RA) regulate variability in phonology but not syntax. Temporal aspects of song were also differentially affected by androgen signaling in HVC versus RA. Thus, androgen signaling may reduce vocal plasticity by acting in a nonredundant and precise manner in the brain. PMID:28821656
Microscopie non-lineaire pour l'imagerie des cordes vocales
NASA Astrophysics Data System (ADS)
Deterre, Romain
The vocal cords are two folds of epithelial tissues located in the larynx and are involved in production of the human voice. Despite their apparent simplicity, their internal structure is complex. Each fold can be divided into several layers with different mechanical properties. The gold standard for studying their structure - histology - has the inconvenience of being very invasive. Non-linear microscopy is an optical imaging technique which allows images to be taken in depth within samples in a non invasive manner. It also offers intrinsic contrasts, allowing the identification of certain fibrous proteins - elastin and collagen - which are responsible for the mechanical properties of epithelious tissues. The main goal of this research project was to assess nonlinear microscopy's performances for vocal fold imaging. The study has been broken down in two separate tasks. The first one was to evaluate the nonlinear modalities contrast against histology. For that purpose, we chose to first take images of thin samples and compare them to the corresponding histological slides. The second task was to make tests to transcribe the results obtained to in vivo imaging. A custom-built nonlinear imaging system was used for these experiments. It was developed to allow acquisition of wide-field images. A C++ based software was developped to control the microscope and allow treatment and visualization of the images. After being built, the system was further tested to check its performances in comparison with the theoretical limit as described in the literature. Thin slices of vocal folds were obtained from the team of Pr Christopher J. Hartnick from Massachusetts Eye and Ear Infirmary, Harvard Medical School. Specialists from his team analysed the histological samples to extract structural data from the vocal folds. A good correlation was measured between histological and nonlinear data. A first step in evaluating the possibility for translating these results towards in vivo imaging was performed during this project. A swine's larynx was obtained, and vocal folds were extracted for imaging purposes. This experiment showed that it is indeed possible to localize various macrostructures of the tissues with nonlinear microscopy.
Improvement of a Vocal Fold Imaging System
DOE Office of Scientific and Technical Information (OSTI.GOV)
Krauter, K. G.
Medical professionals can better serve their patients through continual update of their imaging tools. A wide range of pathologies and disease may afflict human vocal cords or, as they’re also known, vocal folds. These diseases can affect human speech hampering the ability of the patient to communicate. Vocal folds must be opened for breathing and the closed to produce speech. Currently methodologies to image markers of potential pathologies are difficult to use and often fail to detect early signs of disease. These current methodologies rely on a strobe light and slower frame rate camera in an attempt to obtain imagesmore » as the vocal folds travel over the full extent of their motion.« less
A portable high-speed camera system for vocal fold examinations.
Hertegård, Stellan; Larsson, Hans
2014-11-01
In this article, we present a new portable low-cost system for high-speed examinations of the vocal folds. Analysis of glottal vibratory parameters from the high-speed recordings is compared with videostroboscopic recordings. The high-speed system is built around a Fastec 1 monochrome camera, which is used with newly developed software, High-Speed Studio (HSS). The HSS has options for video/image recording, contains a database, and has a set of analysis options. The Fastec/HSS system has been used clinically since 2011 in more than 2000 patient examinations and recordings. The Fastec 1 camera has sufficient time resolution (≥4000 frames/s) and light sensitivity (ISO 3200) to produce images for detailed analyses of parameters pertinent to vocal fold function. The camera can be used with both rigid and flexible endoscopes. The HSS software includes options for analyses of glottal vibrations, such as kymogram, phase asymmetry, glottal area variation, open and closed phase, and angle of vocal fold abduction. It can also be used for separate analysis of the left and vocal fold movements, including maximum speed during opening and closing, a parameter possibly related to vocal fold elasticity. A blinded analysis of 32 patients with various voice disorders examined with both the Fastec/HSS system and videostroboscopy showed that the high-speed recordings were significantly better for the analysis of glottal parameters (eg, mucosal wave and vibration asymmetry). The monochrome high-speed system can be used in daily clinical work within normal clinical time limits for patient examinations. A detailed analysis can be made of voice disorders and laryngeal pathology at a relatively low cost. Copyright © 2014 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Arterbery, Adam S; Deitcher, David L; Bass, Andrew H
2010-05-15
The vocalizing midshipman fish, Porichthys notatus, has two male morphs that exhibit alternative mating tactics. Only territorial males acoustically court females with long duration (minutes to >1h) calls, whereas sneaker males attempt to steal fertilizations. During the breeding season, morph-specific tactics are paralleled by a divergence in relative testis and vocal muscle size, plasma levels of the androgen 11-ketotestosterone (11KT) and the glucocorticoid cortisol, and mRNA expression levels in the central nervous system (CNS) of the steroid-synthesizing enzyme aromatase (estrogen synthase). Here, we tested the hypothesis that the midshipman's two male morphs would further differ in the CNS, as well as in the testis and vocal muscle, in mRNA abundance for the enzymes 11beta-hydroxylase (11betaH) and 11beta-hydroxysteroid dehydrogenase (11betaHSD) that directly regulate both 11KT and cortisol synthesis. Quantitative real-time PCR demonstrated male morph-specific profiles for both enzymes. Territorial males had higher 11betaH and 11betaHSD mRNA levels in testis and vocal muscle. By contrast, sneaker males had the higher CNS expression, especially for 11betaHSD, in the region containing an expansive vocal pacemaker circuit that directly determines the temporal attributes of natural calls. We propose for territorial males that higher enzyme expression in testis underlies its greater plasma 11KT levels, which in vocal muscle provides both gluconeogenic and androgenic support for its long duration calling. We further propose for sneaker males that higher enzyme expression in the vocal CNS contributes to known cortisol-specific effects on its vocal physiology. Copyright 2010 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wood, R.; Springston, S.; Mechoso, C. R.
2011-01-21
The VAMOS Ocean-Cloud-Atmosphere-Land Study Regional Experiment (VOCALS-REx) was an international field program designed to make observations of poorly understood but critical components of the coupled climate system of the southeast Pacific. This region is characterized by strong coastal upwelling, the coolest SSTs in the tropical belt, and is home to the largest subtropical stratocumulus deck on Earth. The field intensive phase of VOCALS-REx took place during October and November 2008 and constitutes a critical part of a broader CLIVAR program (VOCALS) designed to develop and promote scientific activities leading to improved understanding, model simulations, and predictions of the southeastern Pacificmore » (SEP) coupled ocean-atmosphere-land system, on diurnal to interannual timescales. The other major components of VOCALS are a modeling program with a model hierarchy ranging from the local to global scales, and a suite of extended observations from regular research cruises, instrumented moorings, and satellites. The two central themes of VOCALS-REx focus upon (a) links between aerosols, clouds and precipitation and their impacts on marine stratocumulus radiative properties, and (b) physical and chemical couplings between the upper ocean and the lower atmosphere, including the role that mesoscale ocean eddies play. A set of hypotheses designed to be tested with the combined field, monitoring and modeling work in VOCALS is presented here. A further goal of VOCALS-REx is to provide datasets for the evaluation and improvement of large-scale numerical models. VOCALS-REx involved five research aircraft, two ships and two surface sites in northern Chile. We describe the instrument payloads and key mission strategies for these platforms and give a summary of the missions conducted.« less
A biorobotic model of the human larynx.
Manti, M; Cianchetti, M; Nacci, A; Ursino, F; Laschi, C
2015-08-01
This work focuses on a physical model of the human larynx that replicates its main components and functions. The prototype reproduces the multilayer vocal folds and the ab/adduction movements. In particular, the vocal folds prototype is made with soft materials whose mechanical properties have been obtained to be similar to the natural tissue in terms of viscoelasticity. A computational model was used to study fluid-structure interaction between vocal folds and the airflow. This tool allowed us to make a comparison between theoretical and experimental results. Measurements were performed with this prototype in an experimental platform comprising a controlled air flow, pressure sensors and a high-speed camera for measuring vocal fold vibrations. Data included oscillation frequency at the onset pressure and glottal width. Results show that the combination between vocal fold geometry, mechanical properties and dimensions exhibits an oscillation frequency close to that of the human vocal fold. Moreover, computational results show a high correlation with the experimental one.
A circular model for song motor control in Serinus canaria
Alonso, Rodrigo G.; Trevisan, Marcos A.; Amador, Ana; Goller, Franz; Mindlin, Gabriel B.
2015-01-01
Song production in songbirds is controlled by a network of nuclei distributed across several brain regions, which drives respiratory and vocal motor systems to generate sound. We built a model for birdsong production, whose variables are the average activities of different neural populations within these nuclei of the song system. We focus on the predictions of respiratory patterns of song, because these can be easily measured and therefore provide a validation for the model. We test the hypothesis that it is possible to construct a model in which (1) the activity of an expiratory related (ER) neural population fits the observed pressure patterns used by canaries during singing, and (2) a higher forebrain neural population, HVC, is sparsely active, simultaneously with significant motor instances of the pressure patterns. We show that in order to achieve these two requirements, the ER neural population needs to receive two inputs: a direct one, and its copy after being processed by other areas of the song system. The model is capable of reproducing the measured respiratory patterns and makes specific predictions on the timing of HVC activity during their production. These results suggest that vocal production is controlled by a circular network rather than by a simple top-down architecture. PMID:25904860
In the ear of the beholder: how age shapes emotion processing in nonverbal vocalizations.
Lima, César F; Alves, Tiago; Scott, Sophie K; Castro, São Luís
2014-02-01
It is well established that emotion recognition of facial expressions declines with age, but evidence for age-related differences in vocal emotions is more limited. This is especially true for nonverbal vocalizations such as laughter, sobs, or sighs. In this study, 43 younger adults (M = 22 years) and 43 older ones (M = 61.4 years) provided multiple emotion ratings of nonverbal emotional vocalizations. Contrasting with previous research, which often includes only one positive emotion (happiness) versus several negative ones, we examined 4 positive and 4 negative emotions: achievement/triumph, amusement, pleasure, relief, anger, disgust, fear, and sadness. We controlled for hearing loss and assessed general cognitive decline, cognitive control, verbal intelligence, working memory, current affect, emotion regulation, and personality. Older adults were less sensitive than younger ones to the intended vocal emotions, as indicated by decrements in ratings on the intended emotion scales and accuracy. These effects were similar for positive and negative emotions, and they were independent of age-related differences in cognitive, affective, and personality measures. Regression analyses revealed that younger and older participants' responses could be predicted from the acoustic properties of the temporal, intensity, fundamental frequency, and spectral profile of the vocalizations. The two groups were similarly efficient in using the acoustic cues, but there were differences in the patterns of emotion-specific predictors. This study suggests that ageing produces specific changes on the processing of nonverbal vocalizations. That decrements were not attenuated for positive emotions indicates that they cannot be explained by a positivity effect in older adults. PsycINFO Database Record (c) 2014 APA, all rights reserved.
RIEDE, TOBIAS
2014-01-01
Rodents produce highly variable ultrasound whistles as communication signals unlike many other mammals, who employ flow-induced vocal fold oscillations to produce sound. The role of larynx muscles in controlling sound features across different call types in ultrasound vocalization (USV) was investigated using laryngeal muscle electromyographic (EMG) activity, subglottal pressure measurements and vocal sound output in awake and spontaneously behaving Sprague–Dawley rats. Results support the hypothesis that glottal shape determines fundamental frequency. EMG activities of thyroarytenoid and cricothyroid muscles were aligned with call duration. EMG intensity increased with fundamental frequency. Phasic activities of both muscles were aligned with fast changing fundamental frequency contours, for example in trills. Activities of the sternothyroid and sternohyoid muscles, two muscles involved in vocal production in other mammals, are not critical for the production of rat USV. To test how stereotypic laryngeal and respiratory activity are across call types and individuals, sets of ten EMG and subglottal pressure parameters were measured in six different call types from six rats. Using discriminant function analysis, on average 80% of parameter sets were correctly assigned to their respective call type. This was significantly higher than the chance level. Since fundamental frequency features of USV are tightly associated with stereotypic activity of intrinsic laryngeal muscles and muscles contributing to build-up of subglottal pressure, USV provide insight into the neurophysiological control of peripheral vocal motor patterns. PMID:23423862
A Primary Role for Nucleus Accumbens and Related Limbic Network in Vocal Tics.
McCairn, Kevin W; Nagai, Yuji; Hori, Yukiko; Ninomiya, Taihei; Kikuchi, Erika; Lee, Ju-Young; Suhara, Tetsuya; Iriki, Atsushi; Minamimoto, Takafumi; Takada, Masahiko; Isoda, Masaki; Matsumoto, Masayuki
2016-01-20
Inappropriate vocal expressions, e.g., vocal tics in Tourette syndrome, severely impact quality of life. Neural mechanisms underlying vocal tics remain unexplored because no established animal model representing the condition exists. We report that unilateral disinhibition of the nucleus accumbens (NAc) generates vocal tics in monkeys. Whole-brain PET imaging identified prominent, bilateral limbic cortico-subcortical activation. Local field potentials (LFPs) developed abnormal spikes in the NAc and the anterior cingulate cortex (ACC). Vocalization could occur without obvious LFP spikes, however, when phase-phase coupling of alpha oscillations were accentuated between the NAc, ACC, and the primary motor cortex. These findings contrasted with myoclonic motor tics induced by disinhibition of the dorsolateral putamen, where PET activity was confined to the ipsilateral sensorimotor system and LFP spikes always preceded motor tics. We propose that vocal tics emerge as a consequence of dysrhythmic alpha coupling between critical nodes in the limbic and motor networks. VIDEO ABSTRACT. Copyright © 2016 Elsevier Inc. All rights reserved.
Woynaroski, Tiffany; Oller, D. Kimbrough; Keceli-Kaysili, Bahar; Xu, Dongxin; Richards, Jeffrey A.; Gilkerson, Jill; Gray, Sharmistha; Yoder, Paul
2017-01-01
Theory and research suggest that vocal development predicts “useful speech” in preschoolers with autism spectrum disorder (ASD), but conventional methods for measurement of vocal development are costly and time consuming. This longitudinal correlational study examines the reliability and validity of several automated indices of vocalization development relative to an index derived from human coded, conventional communication samples in a sample of preverbal preschoolers with ASD. Automated indices of vocal development were derived using software that is presently “in development” and/or only available for research purposes and using commercially available Language ENvironment Analysis (LENA) software. Indices of vocal development that could be derived using the software available for research purposes: (a) were highly stable with a single day-long audio recording, (b) predicted future spoken vocabulary to a degree that was nonsignificantly different from the index derived from conventional communication samples, and (c) continued to predict future spoken vocabulary even after controlling for concurrent vocabulary in our sample. The score derived from standard LENA software was similarly stable, but was not significantly correlated with future spoken vocabulary. Findings suggest that automated vocal analysis is a valid and reliable alternative to time intensive and expensive conventional communication samples for measurement of vocal development of preverbal preschoolers with ASD in research and clinical practice. PMID:27459107
Picciulin, Marta; Sebastianutto, Linda; Codarin, Antonio; Calcagno, Giuliana; Ferrero, Enrico A
2012-11-01
This study investigated whether or not boat noise causes variations in brown meagre (Sciaena umbra) vocalizations recorded in a nearshore Mediterranean marine reserve. Six nocturnal experimental sessions were carried out from June to September 2009. In each of them, a recreational boat passed over vocalizing fish 6 times with 1 boat passage every 10 min. For this purpose three different boats were used in random order: an 8.5-m cabin-cruiser (CC), a 5-m fiberglass boat (FB), and a 7-m inflatable boat (INF). In situ continuous acoustic recordings were collected using a self-standing sonobuoy. Because boat noise levels largely exceeded both background noise and S. umbra vocalizations in the species' hearing frequency range, masking of acoustic communication was assumed. Although no immediate effect was observed during a single boat passage, the S. umbra mean pulse rate increased over multiple boat passages in the experimental condition but not in the control condition, excluding that the observed effect was due to a natural rise in fish vocalizations. The observed vocal enhancement may result either from an increased density of callers or from an increased number of pulses/sounds produced by already acoustically active individuals, as a form of vocal compensation. These two explanations are discussed.
Borzacchiello, A; Mayol, L; Gärskog, O; Dahlqvist, A; Ambrosio, L
2005-06-01
The viscoelastic properties of vocal folds after injection of hyaluronic acid (hyaluronan, HA) based materials have been studied in an animal model (rabbit) six months after injection. The results indicate that the viscoelastic properties of the vocal folds injected with the HA based materials are similar to the healthy vocal folds (non-injected samples) used as control. Histological analysis has been also performed to investigate on the fate of the injected materials after six months from the implant. The HA based materials remain up to six months and they recruited fibroblasts that induce the ingrowth of new connective tissue resulting in an endogenous soft tissue augmentation. The HA based compounds are good candidate for further studies aimed at restoring/preserving the vibratory capacity of the vocal folds with injection treatment in glottal insufficiency.
Neural coding of syntactic structure in learned vocalizations in the songbird.
Fujimoto, Hisataka; Hasegawa, Taku; Watanabe, Dai
2011-07-06
Although vocal signals including human languages are composed of a finite number of acoustic elements, complex and diverse vocal patterns can be created from combinations of these elements, linked together by syntactic rules. To enable such syntactic vocal behaviors, neural systems must extract the sequence patterns from auditory information and establish syntactic rules to generate motor commands for vocal organs. However, the neural basis of syntactic processing of learned vocal signals remains largely unknown. Here we report that the basal ganglia projecting premotor neurons (HVC(X) neurons) in Bengalese finches represent syntactic rules that generate variable song sequences. When vocalizing an alternative transition segment between song elements called syllables, sparse burst spikes of HVC(X) neurons code the identity of a specific syllable type or a specific transition direction among the alternative trajectories. When vocalizing a variable repetition sequence of the same syllable, HVC(X) neurons not only signal the initiation and termination of the repetition sequence but also indicate the progress and state-of-completeness of the repetition. These different types of syntactic information are frequently integrated within the activity of single HVC(X) neurons, suggesting that syntactic attributes of the individual neurons are not programmed as a basic cellular subtype in advance but acquired in the course of vocal learning and maturation. Furthermore, some auditory-vocal mirroring type HVC(X) neurons display transition selectivity in the auditory phase, much as they do in the vocal phase, suggesting that these songbirds may extract syntactic rules from auditory experience and apply them to form their own vocal behaviors.
Respiratory and Laryngeal Changes with Vocal Loading in Younger and Older Individuals
ERIC Educational Resources Information Center
Sundarrajan, Anusha; Huber, Jessica E.; Sivasankar, M. Preeti
2017-01-01
Purpose: The objective of the current study was to investigate the effects of age and vocal loading on the respiratory and laryngeal systems. Method: Fourteen younger (M = 20 years) and 13 older (M = 75 years) healthy individuals participated in a 40-min vocal loading challenge in the presence of 70-dB background noise. Respiratory kinematic and…
The vocal monotony of monogamy
NASA Astrophysics Data System (ADS)
Thomas, Jeanette
2003-04-01
There are four phocids in waters around Antarctica: Weddell, leopard, crabeater, and Ross seals. These four species provide a unique opportunity to examine underwater vocal behavior in species sharing the same ecosystem. Some species live in pack ice, others in factice, but all are restricted to the Antarctic or sub-Antarctic islands. All breed and produce vocalizations under water. Social systems range from polygyny in large breeding colonies, to serial monogamy, to solitary species. The type of mating system influences the number of underwater vocalizations in the repertoire, with monogamous seals producing only a single call, polygynous species producing up to 35 calls, and solitary species an intermediate number of about 10 calls. Breeding occurs during the austral spring and each species carves-out an acoustic niche for communicating, with species using different frequency ranges, temporal patterns, and amplitude changes to convey their species-specific calls and presumably reduce acoustic competition. Some species exhibit geographic variations in their vocalizations around the continent, which may reflect discrete breeding populations. Some seals become silent during a vulnerable time of predation by killer whales, perhaps to avoid detection. Overall, vocalizations of these seals exhibit adaptive characteristics that reflect the co-evolution among species in the same ecosystem.
Early development of turn-taking with parents shapes vocal acoustics in infant marmoset monkeys
Takahashi, Daniel Y.; Fenley, Alicia R.; Ghazanfar, Asif A.
2016-01-01
In humans, vocal turn-taking is a ubiquitous form of social interaction. It is a communication system that exhibits the properties of a dynamical system: two individuals become coupled to each other via acoustic exchanges and mutually affect each other. Human turn-taking develops during the first year of life. We investigated the development of vocal turn-taking in infant marmoset monkeys, a New World species whose adult vocal behaviour exhibits the same universal features of human turn-taking. We find that marmoset infants undergo the same trajectory of change for vocal turn-taking as humans, and do so during the same life-history stage. Our data show that turn-taking by marmoset infants depends on the development of self-monitoring, and that contingent parental calls elicit more mature-sounding calls from infants. As in humans, there was no evidence that parental feedback affects the rate of turn-taking maturation. We conclude that vocal turn-taking by marmoset monkeys and humans is an instance of convergent evolution, possibly as a result of pressures on both species to adopt a cooperative breeding strategy and increase volubility. PMID:27069047
Oral and vocal fold diadochokinesis in dysphonic women.
Louzada, Talita; Beraldinelle, Roberta; Berretin-Felix, Giédre; Brasolotto, Alcione Ghedini
2011-01-01
The evaluation of oral and vocal fold diadochokinesis (DDK) in individuals with voice disorders may contribute to the understanding of factors that affect the balanced vocal production. Scientific studies that make use of this assessment tool support the knowledge advance of this area, reflecting the development of more appropriate therapeutic planning. To compare the results of oral and vocal fold DDK in dysphonic women and in women without vocal disorders. For this study, 28 voice recordings of women from 19 to 54 years old, diagnosed with dysphonia and submitted to a voice assessment from speech pathologist and otorhinolaryngologist, were used. The control group included 30 nondysphonic women evaluated in prior research from normal adults. The analysis parameters like number and duration of emissions, as well as the regularity of the repetition of syllables "pa", "ta", "ka" and the vowels "a" and "i," were provided by the Advanced Motor Speech Profile program (MSP) Model-5141, version-2.5.2 (KayPentax). The DDK sequence "pataka" was analyzed quantitatively through the Sound Forge 7.0 program, as well as manually with the audio-visual help of sound waves. Average values of oral and vocal fold DDK dysphonic and nondysphonic women were compared using the "t Student" test and were considered significant when p<0.05. The findings showed no significant differences between populations; however, the coefficient of variation of period (CvP) and jitter of period (JittP) average of the "ka," "a" and "i" emissions, respectively, were higher in dysphonic women (CvP=10.42%, 12.79%, 12.05%; JittP=2.05%, 6.05%, 3.63%) compared to the control group (CvP=8.86%; 10.95%, 11.20%; JittP=1.82%, 2.98%, 3.15%). Although the results do not indicate any difficulties in oral and laryngeal motor control in the dysphonic group, the largest instability in vocal fold DDK in the experimental group should be considered, and studies of this ability in individuals with communication disorders must be intensified.
Nocturnal "humming" vocalizations: adding a piece to the puzzle of giraffe vocal communication.
Baotic, Anton; Sicks, Florian; Stoeger, Angela S
2015-09-09
Recent research reveals that giraffes (Giraffa camelopardalis sp.) exhibit a socially structured, fission-fusion system. In other species possessing this kind of society, information exchange is important and vocal communication is usually well developed. But is this true for giraffes? Giraffes are known to produce sounds, but there is no evidence that they use vocalizations for communication. Reports on giraffe vocalizations are mainly anecdotal and the missing acoustic descriptions make it difficult to establish a call nomenclature. Despite inconclusive evidence to date, it is widely assumed that giraffes produce infrasonic vocalizations similar to elephants. In order to initiate a more detailed investigation of the vocal communication in giraffes, we collected data of captive individuals during day and night. We particularly focussed on detecting tonal, infrasonic or sustained vocalizations. We collected over 947 h of audio material in three European zoos and quantified the spectral and temporal components of acoustic signals to obtain an accurate set of acoustic parameters. Besides the known burst, snorts and grunts, we detected harmonic, sustained and frequency-modulated "humming" vocalizations during night recordings. None of the recorded vocalizations were within the infrasonic range. These results show that giraffes do produce vocalizations, which, based on their acoustic structure, might have the potential to function as communicative signals to convey information about the physical and motivational attributes of the caller. The data further reveal that the assumption of infrasonic communication in giraffes needs to be considered with caution and requires further investigations in future studies.
The relation of mothers' controlling vocalizations to children's intrinsic motivation.
Deci, E L; Driver, R E; Hotchkiss, L; Robbins, R J; Wilson, I M
1993-04-01
Twenty-six mother-child dyads played together in a laboratory setting. Play sessions were surreptitiously videotaped (with mothers' permission), and each maternal vocalization was transcribed and coded, first into 1 of 24 categories and then ipso facto into one of three supercategories--namely, controlling, autonomy supportive, and neutral. The degree of mothers' controllingness was calculated as the percentage of vocalizations coded as controlling. This index was correlated with the intrinsic motivation of their 6- or 7-year-old children, as assessed primarily by the free-choice behavioral measure and secondarily by a child self-report measure of interest and liking for the task. Both correlations were significantly negative, thereby suggesting that the robust laboratory findings of a negative relation between controlling contexts and individuals' intrinsic motivation are directly generalizable to the domain of parenting. Results are discussed in terms of the processes that undermine intrinsic motivation and the means through which parental controllingness is communicated.
Vasconcelos, Raquel O.; Fonseca, Paulo J.; Amorim, M. Clara P.; Ladich, Friedrich
2011-01-01
Many fishes rely on their auditory skills to interpret crucial information about predators and prey, and to communicate intraspecifically. Few studies, however, have examined how complex natural sounds are perceived in fishes. We investigated the representation of conspecific mating and agonistic calls in the auditory system of the Lusitanian toadfish Halobatrachus didactylus, and analysed auditory responses to heterospecific signals from ecologically relevant species: a sympatric vocal fish (meagre Argyrosomus regius) and a potential predator (dolphin Tursiops truncatus). Using auditory evoked potential (AEP) recordings, we showed that both sexes can resolve fine features of conspecific calls. The toadfish auditory system was most sensitive to frequencies well represented in the conspecific vocalizations (namely the mating boatwhistle), and revealed a fine representation of duration and pulsed structure of agonistic and mating calls. Stimuli and corresponding AEP amplitudes were highly correlated, indicating an accurate encoding of amplitude modulation. Moreover, Lusitanian toadfish were able to detect T. truncatus foraging sounds and A. regius calls, although at higher amplitudes. We provide strong evidence that the auditory system of a vocal fish, lacking accessory hearing structures, is capable of resolving fine features of complex vocalizations that are probably important for intraspecific communication and other relevant stimuli from the auditory scene. PMID:20861044
Johansson, Kerstin; Strömbergsson, Sofia; Robieux, Camille; McAllister, Anita
2017-01-01
Reduced respiratory function following lower cervical spinal cord injuries (CSCIs) may indirectly result in vocal dysfunction. Although self-reports indicate voice change and limitations following CSCI, earlier efforts using global perceptual ratings to distinguish speakers with CSCI from noninjured speakers have not been very successful. We investigate the use of an audience response system-based approach to distinguish speakers with CSCI from noninjured speakers, and explore whether specific vocal traits can be identified as characteristic for speakers with CSCI. Fourteen speech-language pathologists participated in a web-based perceptual task, where their overt reactions to vocal dysfunction were registered during the continuous playback of recordings of 36 speakers (18 with CSCI, and 18 matched controls). Dysphonic events were identified through manual perceptual analysis, to allow the exploration of connections between dysphonic events and listener reactions. More dysphonic events, and more listener reactions, were registered for speakers with CSCI than for noninjured speakers. Strain (particularly in phrase-final position) and creak (particularly in nonphrase-final position) distinguish speakers with CSCI from noninjured speakers. For the identification of intermittent and subtle signs of vocal dysfunction, an approach where the temporal distribution of symptoms is registered offers a viable means to distinguish speakers affected by voice dysfunction from non-affected speakers. In speakers with CSCI, clinicians should listen for presence of final strain and nonfinal creak, and pay attention to self-reported voice function and voice problems, to identify individuals in need for clinical assessment and intervention. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Rheometric properties of canine vocal fold tissues: Variation with anatomic location
Kimura, Miwako; Mau, Ted; Chan, Roger W.
2010-01-01
Objective To evaluate the in vitro rheometric properties of the canine vocal fold lamina propria and muscle at phonatory frequencies, and their changes with anatomic location. Methods Six canine larynges were harvested immediately postmortem. Viscoelastic shear properties of anterior, middle, and posterior portions of the vocal fold cover (lamina propria) as well as those of the medial thyroarytenoid (TA) muscle (vocalis muscle) were quantified by a linear, controlled-strain simple-shear rheometer. Measurements of elastic shear modulus (G’) and dynamic viscosity (η’) of the specimens were conducted with small-amplitude sinusoidal shear deformation over a frequency range of 1 Hz to 250 Hz. Results All specimens showed similar frequency dependence of the viscoelastic functions, with G’ gradually increasing with frequency and η’ decreasing with frequency monotonically. G’ and η’ of the canine vocalis muscle were significantly higher than those of the canine vocal fold cover, and η’ of the canine vocal fold cover was significantly higher than that of the human vocal fold cover. There were no significant differences in G’ and in η’ between different portions of the canine vocal fold cover. Conclusion These preliminary data based on the canine model suggested that the vocalis muscle, while in a relaxed state in vitro, is significantly stiffer and more viscous than the vocal fold cover during vibration at phonatory frequencies. For large-amplitude vocal fold vibration involving the medial portion of the TA muscle, such distinct differences in viscoelastic properties of different layers of the vocal fold should be taken into account in multi-layered biomechanical models of phonation. PMID:21035291
In Vivo Measurement of Pediatric Vocal Fold Motion Using Structured Light Laser Projection
Patel, Rita R.; Donohue, Kevin D.; Lau, Daniel; Unnikrishnan, Harikrishnan
2013-01-01
Summary Objective The aim of the study was to present the development of a miniature structured light laser projection endoscope and to quantify vocal fold length and vibratory features related to impact stress of the pediatric glottis using high-speed imaging. Study Design The custom-developed laser projection system consists of a green laser with a 4-mm diameter optics module at the tip of the endoscope, projecting 20 vertical laser lines on the glottis. Measurements of absolute phonatory vocal fold length, membranous vocal fold length, peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity were obtained in five children (6–9 years), two adult male and three adult female participants without voice disorders, and one child (10 years) with bilateral vocal fold nodules during modal phonation. Results Independent measurements made on the glottal length of a vocal fold phantom demonstrated a 0.13 mm bias error with a standard deviation of 0.23 mm, indicating adequate precision and accuracy for measuring vocal fold structures and displacement. First, in vivo measurements of amplitude-to-length ratio, peak closing velocity, and impact velocity during phonation in pediatric population and a child with vocal fold nodules are reported. Conclusion The proposed laser projection system can be used to obtain in vivo measurements of absolute length and vibratory features in children and adults. Children have large amplitude-to-length ratio compared with typically developing adults, whereas nodules result in larger peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity compared with typically developing children. PMID:23809569
Filippi, Piera; Congdon, Jenna V; Hoang, John; Bowling, Daniel L; Reber, Stephan A; Pašukonis, Andrius; Hoeschele, Marisa; Ocklenburg, Sebastian; de Boer, Bart; Sturdy, Christopher B; Newen, Albert; Güntürkün, Onur
2017-07-26
Writing over a century ago, Darwin hypothesized that vocal expression of emotion dates back to our earliest terrestrial ancestors. If this hypothesis is true, we should expect to find cross-species acoustic universals in emotional vocalizations. Studies suggest that acoustic attributes of aroused vocalizations are shared across many mammalian species, and that humans can use these attributes to infer emotional content. But do these acoustic attributes extend to non-mammalian vertebrates? In this study, we asked human participants to judge the emotional content of vocalizations of nine vertebrate species representing three different biological classes-Amphibia, Reptilia (non-aves and aves) and Mammalia. We found that humans are able to identify higher levels of arousal in vocalizations across all species. This result was consistent across different language groups (English, German and Mandarin native speakers), suggesting that this ability is biologically rooted in humans. Our findings indicate that humans use multiple acoustic parameters to infer relative arousal in vocalizations for each species, but mainly rely on fundamental frequency and spectral centre of gravity to identify higher arousal vocalizations across species. These results suggest that fundamental mechanisms of vocal emotional expression are shared among vertebrates and could represent a homologous signalling system. © 2017 The Author(s).
Forlano, Paul M.; Kim, Spencer D.; Krzyminska, Zuzanna M.; Sisneros, Joseph A.
2014-01-01
Although the neuroanatomical distribution of catecholaminergic (CA) neurons has been well documented across all vertebrate classes, few studies have examined CA connectivity to physiologically and anatomically identified neural circuitry that controls behavior. The goal of this study was to characterize CA distribution in the brain and inner ear of the plainfin midshipman fish (Porichthys notatus) with particular emphasis on their relationship with anatomically labeled circuitry that both produces and encodes social acoustic signals in this species. Neurobiotin labeling of the main auditory endorgan, the saccule, combined with tyrosine hydroxylase immunofluorescence (TH-ir) revealed a strong CA innervation of both the peripheral and central auditory system. Diencephalic TH-ir neurons in the periventricular posterior tuberculum, known to be dopaminergic, send ascending projections to the ventral telencephalon and prominent descending projections to vocal-acoustic integration sites, notably the hindbrain octavolateralis efferent nucleus, as well as onto the base of hair cells in the saccule via nerve VIII. Neurobiotin backfills of the vocal nerve in combination with TH-ir revealed CA terminals on all components of the vocal pattern generator which appears to largely originate from local TH-ir neurons but may include diencephalic projections as well. This study provides strong evidence for catecholamines as important neuromodulators of both auditory and vocal circuitry and acoustic-driven social behavior in midshipman fish. This first demonstration of TH-ir terminals in the main endorgan of hearing in a non-mammalian vertebrate suggests a conserved and important anatomical and functional role for dopamine in normal audition. PMID:24715479
Exploring vocal recovery after cranial nerve injury in Bengalese finches.
Urbano, Catherine M; Peterson, Jennifer R; Cooper, Brenton G
2013-02-08
Songbirds and humans use auditory feedback to acquire and maintain their vocalizations. The Bengalese finch (Lonchura striata domestica) is a songbird species that rapidly modifies its vocal output to adhere to an internal song memory. In this species, the left side of the bipartite vocal organ is specialized for producing louder, higher frequencies (≥2.2kHz) and denervation of the left vocal muscles eliminates these notes. Thus, the return of higher frequency notes after cranial nerve injury can be used as a measure of vocal recovery. Either the left or right side of the syrinx was denervated by resection of the tracheosyringeal portion of the hypoglossal nerve. Histologic analyses of syringeal muscle tissue showed significant muscle atrophy in the denervated side. After left nerve resection, songs were mainly composed of lower frequency syllables, but three out of five birds recovered higher frequency syllables. Right nerve resection minimally affected phonology, but it did change song syntax; syllable sequence became abnormally stereotyped after right nerve resection. Therefore, damage to the neuromuscular control of sound production resulted in reduced motor variability, and Bengalese finches are a potential model for functional vocal recovery following cranial nerve injury. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Vocal tract characteristics in Parkinson's disease.
Gillivan-Murphy, Patricia; Carding, Paul; Miller, Nick
2016-06-01
Voice tremor is strongly linked to the Parkinson's disease speech-voice symptom complex. Little is known about the underlying anatomic source(s) of voice tremor when it occurs. We review recent literature addressing this issue. Additionally we report findings from a study we conducted employing rating of vocal tract structures viewed using nasolaryngoscopy during vocal and nonspeech tasks. In Parkinson's disease, using laryngeal electromyography, tremor has not been identified in muscles in the vocal folds even when perceived auditorily. Preliminary findings using nasolaryngoscopy suggest that Parkinson's disease voice tremor is not associated with the vocal folds and may involve the palate, the global larynx, and the arytenoids. Tremor in the vertical larynx on /a/, and tremor in the arytenoid cartilages on /s/ differentiated patients with Parkinson's disease from neurologically healthy controls. Visual reliable detection of tremor when it is absent or borderline present, is challenging. Parkinson's disease voice tremor is likely to be related to oscillatory movement in structures across the vocal tract rather than just the vocal folds. To progress clinical practice, more refined tools for the visual rating of tremor would be beneficial. How far voice tremor represents a functionally significant factor for speakers would also add to the literature.
Online contributions of auditory feedback to neural activity in avian song control circuitry
Sakata, Jon T.; Brainard, Michael S.
2008-01-01
Birdsong, like human speech, relies critically on auditory feedback to provide information about the quality of vocalizations. Although the importance of auditory feedback to vocal learning is well established, whether and how feedback signals influence vocal premotor circuitry has remained obscure. Previous studies in singing birds have not detected changes to vocal premotor activity following perturbations of auditory feedback, leading to the hypothesis that contributions of feedback to vocal plasticity might rely on ‘offline’ processing. Here, we recorded single and multi-unit activity in the premotor nucleus HVC of singing Bengalese finches in response to feedback perturbations that are known to drive plastic changes in song. We found that transient feedback perturbation caused reliable decreases in HVC activity at short latencies (20-80 ms). Similar changes to HVC activity occurred in awake, non-singing finches when the bird’s own song was played back with auditory perturbations that simulated those experienced by singing birds. These data indicate that neurons in avian vocal premotor circuitry are rapidly influenced by perturbations of auditory feedback and support the possibility that feedback information in HVC contributes online to the production and plasticity of vocalizations. PMID:18971480
Vocal tract resonances in speech, singing, and playing musical instruments
Wolfe, Joe; Garnier, Maëva; Smith, John
2009-01-01
In both the voice and musical wind instruments, a valve (vocal folds, lips, or reed) lies between an upstream and downstream duct: trachea and vocal tract for the voice; vocal tract and bore for the instrument. Examining the structural similarities and functional differences gives insight into their operation and the duct-valve interactions. In speech and singing, vocal tract resonances usually determine the spectral envelope and usually have a smaller influence on the operating frequency. The resonances are important not only for the phonemic information they produce, but also because of their contribution to voice timbre, loudness, and efficiency. The role of the tract resonances is usually different in brass and some woodwind instruments, where they modify and to some extent compete or collaborate with resonances of the instrument to control the vibration of a reed or the player’s lips, and∕or the spectrum of air flow into the instrument. We give a brief overview of oscillator mechanisms and vocal tract acoustics. We discuss recent and current research on how the acoustical resonances of the vocal tract are involved in singing and the playing of musical wind instruments. Finally, we compare techniques used in determining tract resonances and suggest some future developments. PMID:19649157
Laryngeal Aerodynamics in Healthy Older Adults and Adults With Parkinson's Disease.
Matheron, Deborah; Stathopoulos, Elaine T; Huber, Jessica E; Sussman, Joan E
2017-03-01
The present study compared laryngeal aerodynamic function of healthy older adults (HOA) to adults with Parkinson's disease (PD) while speaking at a comfortable and increased vocal intensity. Laryngeal aerodynamic measures (subglottal pressure, peak-to-peak flow, minimum flow, and open quotient [OQ]) were compared between HOAs and individuals with PD who had a diagnosis of hypophonia. Increased vocal intensity was elicited via monaurally presented multitalker background noise. At a comfortable speaking intensity, HOAs and individuals with PD produced comparable vocal intensity, rates of vocal fold closure, and minimum flow. HOAs used smaller OQs, higher subglottal pressure, and lower peak-to-peak flow than individuals with PD. Both groups increased speaking intensity when speaking in noise to the same degree. However, HOAs produced increased intensity with greater driving pressure, faster vocal fold closure rates, and smaller OQs than individuals with PD. Monaural background noise elicited equivalent vocal intensity increases in HOAs and individuals with PD. Although both groups used laryngeal mechanisms as expected to increase sound pressure level, they used these mechanisms to different degrees. The HOAs appeared to have better control of the laryngeal mechanism to make changes to their vocal intensity.
Effects of background noise on acoustic characteristics of Bengalese finch songs.
Shiba, Shintaro; Okanoya, Kazuo; Tachibana, Ryosuke O
2016-12-01
Online regulation of vocalization in response to auditory feedback is one of the essential issues for vocal communication. One such audio-vocal interaction is the Lombard effect, an involuntary increase in vocal amplitude in response to the presence of background noise. Along with vocal amplitude, other acoustic characteristics, including fundamental frequency (F0), also change in some species. Bengalese finches (Lonchura striata var. domestica) are a suitable model for comparative, ethological, and neuroscientific studies on audio-vocal interaction because they require real-time auditory feedback of their own songs to maintain normal singing. Here, the changes in amplitude and F0 with a focus on the distinct song elements (i.e., notes) of Bengalese finches under noise presentation are demonstrated. To accurately analyze these acoustic characteristics, two different bandpass-filtered noises at two levels of sound intensity were used. The results confirmed that the Lombard effect occurs at the note level of Bengalese finch song. Further, individually specific modes of changes in F0 are shown. These behavioral changes suggested the vocal control mechanisms on which the auditory feedback is based have a predictable effect on amplitude, but complex spectral effects on individual note production.
Vocal tract resonances in speech, singing, and playing musical instruments.
Wolfe, Joe; Garnier, Maëva; Smith, John
2009-01-01
IN BOTH THE VOICE AND MUSICAL WIND INSTRUMENTS, A VALVE (VOCAL FOLDS, LIPS, OR REED) LIES BETWEEN AN UPSTREAM AND DOWNSTREAM DUCT: trachea and vocal tract for the voice; vocal tract and bore for the instrument. Examining the structural similarities and functional differences gives insight into their operation and the duct-valve interactions. In speech and singing, vocal tract resonances usually determine the spectral envelope and usually have a smaller influence on the operating frequency. The resonances are important not only for the phonemic information they produce, but also because of their contribution to voice timbre, loudness, and efficiency. The role of the tract resonances is usually different in brass and some woodwind instruments, where they modify and to some extent compete or collaborate with resonances of the instrument to control the vibration of a reed or the player's lips, andor the spectrum of air flow into the instrument. We give a brief overview of oscillator mechanisms and vocal tract acoustics. We discuss recent and current research on how the acoustical resonances of the vocal tract are involved in singing and the playing of musical wind instruments. Finally, we compare techniques used in determining tract resonances and suggest some future developments.
Asymmetric spatiotemporal chaos induced by a polypoid mass in the excised larynx
Zhang, Yu; Jiang, Jack J.
2008-01-01
In this paper, asymmetric spatiotemporal chaos induced by a polypoid mass simulating the laryngeal pathology of a vocal polyp is experimentally observed using high-speed imaging in an excised larynx. Spatiotemporal analysis reveals that the normal vocal folds show spatiotemporal correlation and symmetry. Normal vocal fold vibrations are dominated mainly by the first vibratory eigenmode. However, pathological vocal folds with a polypoid mass show broken symmetry and spatiotemporal irregularity. The spatial correlation is decreased. The pathological vocal folds spread vibratory energy across a large number of eigenmodes and induce asymmetric spatiotemporal chaos. High-order eigenmodes show complicated dynamics. Spatiotemporal analysis provides a valuable biomedical application for investigating the spatiotemporal chaotic dynamics of pathological vocal fold systems with a polypoid mass and may represent a valuable clinical tool for the detection of laryngeal mass lesion using high-speed imaging. PMID:19123612
An optical flow-based state-space model of the vocal folds.
Granados, Alba; Brunskog, Jonas
2017-06-01
High-speed movies of the vocal fold vibration are valuable data to reveal vocal fold features for voice pathology diagnosis. This work presents a suitable Bayesian model and a purely theoretical discussion for further development of a framework for continuum biomechanical features estimation. A linear and Gaussian nonstationary state-space model is proposed and thoroughly discussed. The evolution model is based on a self-sustained three-dimensional finite element model of the vocal folds, and the observation model involves a dense optical flow algorithm. The results show that the method is able to capture different deformation patterns between the computed optical flow and the finite element deformation, controlled by the choice of the model tissue parameters.
Martins, Regina Helena Garcia; Tavares, Elaine Lara Mendes; Pessin, Adriana Bueno Benito
2017-05-01
Reinke's edema is a benign lesion of the vocal folds that affects chronic smokers, especially women. The voice becomes hoarse and virilized, and the treatment is microsurgery. However, even after surgery and smoking cessation, many patients remain with a deep and hoarse voice. The aim of the present study was to compare pre- and postoperative acoustic and perceptual-auditory vocal analyses of women with Reinke's edema and of women in the control group, who were non-smokers. A total of 20 women with videolaryngoscopy diagnosis of Reinke's edema who underwent laryngeal microsurgery were evaluated pre- and postoperatively (6 months) by videolaryngoscopy, acoustic voice, and perceptual-auditory analyses (General degree of dysphonia, Roughness, Breathiness, Asthenia, Strain, and Instability [GRBASI] scale), and the maximum phonation times were calculated. The pre- and postoperative parameters of the women with Reinke's edema were compared with those of the control group of women with no laryngeal lesions, smoking habit, or vocal symptoms. Acoustic vocal perceptual-auditory analyses and the maximum phonation time of women with Reinke's edema improved significantly in the postoperative evaluations; nevertheless, 6 months after surgery, their voices became worse than the voices of the women from the control group. Abnormalities caused by smoking in Reinke's edema in women are not fully reversible with surgery and smoking cessation. One explanation would be the presence of possible structural alterations in fibroblasts caused by the toxicity of cigarette components, resulting in the uncontrolled production of fibrous matrix in the lamina propria, and preventing complete vocal recovery. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Boinski, S.; Gross, T.S.; Davis, J.K.
1999-01-01
The vocal behavior of captive animals is increasingly exploited as an index of well-being. Here we show that the terrestrial predator alarm (TPA) vocalization, a robust and acoustically distinctive anti-predation vocal response present in many mammal and bird species, offers useful information on the relative well-being and stress levels of captive animals. In a 16-week experiment evaluating the effects of varying levels of physical environmental enrichment (control < toys < foraging box < foraging box and toys) in the cages of eight singly housed adult male brown capuchins, we quantified the 1) emission rate of TPAs, 2) proportions of normal and abnormal behavior sample intervals, and 3) fecal and plasma cortisol levels. Variation in TPA emission across the experimental conditions was significant. We found significant reductions in the mean TPA production rate by the group in the enriched (toys, foraging box, and foraging box and toys) compared to the control condition; pre-and post-experimental conditions, however, did not differ from the control condition. Mean TPA production by the group was also significantly positively correlated to mean group levels of fecal cortisol and proportion of abnormal behavior sample intervals, and significantly negatively correlated to the average proportion of normal behavior sample intervals in the group. Based on group means, plasma cortisol levels were positively, but not significantly, related to increasing TPA rate. At the level of the responses of an individual subject, however, the covariation between the vocal and non-vocal behavioral measures and the cortisol assays seldom attained significance. Nevertheless, the direction of the relationships among these parameters within individual subjects typically mirrored those correlations based on group means. At both the group mean and individual levels, our results are consistent with the.
Su, Chih-Ying; Chuang, Hui-Ching; Tsai, Shang-Shyue; Chiu, Jeng-Fen
2005-03-01
In treating early glottic carcinomas, the outcomes of endoscopic laser cordectomy have been proven to be valuable in local control, survival, and vocal function preservation. In some extended cases, however, laser cordectomy may leave patients with poor vocal function because of vocal fold deficit. This work assesses the vocal outcome of medialization laryngoplasty with bipedicled strap muscle transposition for vocal fold deficit resulting from laser cordectomy in early glottic cancer patients. A prospective clinical series. Thirteen early glottic cancer patients who had vocal fold deficit caused by previous laser cordectomy underwent medialization laryngoplasty with bipedicled strap muscle transposition. The thyroid lamina on the cordectomy side was paramedially separated. The inner perichondrium was circumspectly raised from the overlying thyroid cartilage. After separating the thyrohyoid and cricothyroid membranes, the lamina was retracted laterally. A bipedicled strap muscle flap was then transposed into the area between the lamina and the paraglottic soft tissue. The thyroid cartilages were carefully sutured back in position. All patients received pre- and postoperative voice assessments comprising laryngostroboscopy and vocal function studies. Vocal enhancement was present in 92% (12/13) of patients after medialization laryngoplasty with strap muscle transposition. The glottal closure and maximal phonation time were noticeably improved by surgery. No dyspnea or other significant complications were observed in any patients. The outcomes show that bipedicled strap muscle transposition is a prosthesis-free, safe, and valuable laryngoplastic technique for correcting glottal incompetence caused by endoscopic laser cordectomy in early glottic cancer patients.
Cyclic Adenosine Monophosphate Regulation of Ion Transport in Porcine Vocal Fold Mucosae
Sivasankar, Mahalakshmi; Nofziger, Charity; Blazer-Yost, Bonnie
2012-01-01
Objectives/Hypothesis Cyclic adenosine monophosphate (cAMP) is an important biological molecule that regulates ion transport and inflammatory responses in epithelial tissue. The present study examined whether the adenylyl cyclase activator, forskolin, would increase cAMP concentration in porcine vocal fold mucosa and whether the effects of increased cAMP would be manifested as a functional increase in transepithelial ion transport. Additionally, changes in cAMP concentrations following exposure to an inflammatory mediator, tumor necrosis factor-α (TNFα) were investigated. Study Design In vitro experimental design with matched treatment and control groups. Methods Porcine vocal fold mucosae (N = 30) and tracheal mucosae (N = 20) were exposed to forskolin, TNFα, or vehicle (dimethyl sulfoxide) treatment. cAMP concentrations were determined with enzyme-linked immunosorbent assay. Ion transport was measured using electrophysiological techniques. Results Thirty minute exposure to forskolin significantly increased cAMP concentration and ion transport in porcine vocal fold and tracheal mucosae. However, 30-minute and 2-hour exposure to TNFα did not significantly alter cAMP concentration. Conclusions We demonstrate that forskolin-sensitive adenylyl cyclase is present in vocal fold mucosa, and further, that the product, cAMP increases vocal fold ion transport. The results presented here contribute to our understanding of the intracellular mechanisms underlying vocal fold ion transport. As ion transport is important for maintaining superficial vocal fold hydration, data demonstrating forskolin-stimulated ion transport in vocal fold mucosa suggest opportunities for developing pharmacological treatments that increase surface hydration. PMID:18596479
Multidimensional vocal assessment after laser treatment for recurrent respiratory papillomatosis.
Kono, Takeyuki; Yabe, Haruna; Uno, Kosuke; Saito, Koichiro; Ogawa, Kaoru
2017-03-01
Recurrent respiratory papillomatosis (RRP) is a benign epithelial tumor that exhibits a high frequency of recurrence. This study assesses the vocal function after laser treatment for RRP, particularly in relation to the frequency of surgery. Retrospective study. Thirty RRP patients who underwent laser surgery that controlled the tumor were included. Preoperative and postoperative Grade, Roughness, Breathiness, Asthenia, and Strain Scale, videostroboscopic findings, aerodynamic and acoustic parameters, and self-assessment questionnaires were measured and compared with an age- and sex-matched control group. Subsequently, to evaluate the association between postoperative voice quality and the number of surgeries, the patients were divided into three groups (group 1: single surgery, group 2: 2-5 surgeries, group3: >6 surgeries), and comparative multidimensional vocal assessments were performed. The mean number of surgeries was 3.4 (range, 1-8). Although all patients exhibited poorer vocal function than the control group preoperatively, they showed improvement in postoperative subjective and objective parameters. However, four patients who underwent one surgery with relatively aggressive ablation exhibited vocal cord scarring and deteriorated objective parameters. All remaining patients showed voice quality that was on par with the control group. Subgroup analysis proved no association between post-therapeutic voice quality and the patient characteristics, including preoperative staging and the number of surgical treatments performed. RRP patients can achieve a close to normal voice with high satisfaction even after recurrent surgical treatment when ablation of a subepithelial lesion using sufficient laser energy is adequate. 3b Laryngoscope, 127:679-684, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Tamplin, Jeanette; Brazzale, Danny J; Pretto, Jeffrey J; Ruehland, Warren R; Buttifant, Mary; Brown, Douglas J; Berlowitz, David J
2011-02-01
To explore how respiratory impairment after cervical spinal cord injury affects vocal function, and to explore muscle recruitment strategies used during vocal tasks after quadriplegia. It was hypothesized that to achieve the increased respiratory support required for singing and loud speech, people with quadriplegia use different patterns of muscle recruitment and control strategies compared with control subjects without spinal cord injury. Matched, parallel-group design. Large university-affiliated public hospital. Consenting participants with motor-complete C5-7 quadriplegia (n=6) and able-bodied age-matched controls (n=6) were assessed on physiologic and voice measures during vocal tasks. Not applicable. Standard respiratory function testing, surface electromyographic activity from accessory respiratory muscles, sound pressure levels during vocal tasks, the Voice Handicap Index, and the Perceptual Voice Profile. The group with quadriplegia had a reduced lung capacity (vital capacity, 71% vs 102% of predicted; P=.028), more perceived voice problems (Voice Handicap Index score, 22.5 vs 6.5; P=.046), and greater recruitment of accessory respiratory muscles during both loud and soft volumes (P=.028) than the able-bodied controls. The group with quadriplegia also demonstrated higher accessory muscle activation in changing from soft to loud speech (P=.028). People with quadriplegia have impaired vocal ability and use different muscle recruitment strategies during speech than the able-bodied. These findings will enable us to target specific measurements of respiratory physiology for assessing functional improvements in response to formal therapeutic singing training. Copyright © 2011 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Sommer, David; Erath, Byron D.; Zanartu, Matias; Peterson, Sean D.
2011-11-01
Voiced speech is produced by dynamic fluid-structure interactions in the larynx. Traditionally, reduced order models of speech have relied upon simplified inviscid flow solvers to prescribe the fluid loadings that drive vocal fold motion, neglecting viscous flow effects that occur naturally in voiced speech. Viscous phenomena, such as skewing of the intraglottal jet, have the most pronounced effect on voiced speech in cases of vocal fold paralysis where one vocal fold loses some, or all, muscular control. The impact of asymmetric intraglottal flow in pathological speech is captured in a reduced order two-mass model of speech by coupling a boundary-layer estimation of the asymmetric pressures with asymmetric tissue parameters that are representative of recurrent laryngeal nerve paralysis. Nonlinear analysis identifies the emergence of irregular and chaotic vocal fold dynamics at values representative of pathological speech conditions.
Kobayasi, Kohta I.; Hage, Steffen R.; Berquist, Sean; Feng, Jiang; Zhang, Shuyi; Metzner, Walter
2012-01-01
Mammalian vocalizations exhibit large variations in their spectrotemporal features, although it is still largely unknown which result from intrinsic biomechanical properties of the larynx and which are under direct neuromuscular control. Here we show that mere changes in laryngeal air flow yield several non-linear effects on sound production, in an isolated larynx preparation from horseshoe bats. Most notably, there are sudden jumps between two frequency bands used for either echolocation or communication in natural vocalizations. These jumps resemble changes in “registers” as in yodelling. In contrast, simulated contractions of the main larynx muscle produce linear frequency changes, but are limited to echolocation or communication frequencies. Only by combining non-linear and linear properties can this larynx therefore produce sounds covering the entire frequency range of natural calls. This may give behavioural meaning to yodelling-like vocal behaviour and reshape our thinking about how the brain controls the multitude of spectral vocal features in mammals. PMID:23149729
Garcia-Albea, Elena; Reeve, Sharon A; Brothers, Kevin J; Reeve, Kenneth F
2014-01-01
Script-fading procedures have been shown to be effective for teaching children with autism to initiate and participate in social interactions without vocal prompts from adults. In previous script and script-fading research, however, there has been no demonstration of a generalized repertoire of vocal interactions under the control of naturally occurring relevant stimuli. In this study, 4 boys with autism were taught to initiate a conversation in the presence of toys through the use of a script and script-fading procedure. Training with multiple categories and exemplars of toys was used to increase the likelihood of generalization of vocal interactions across novel toys. A multiple-probe design across participants was used to assess the effects of these procedures. The intervention successfully brought interactions by children with autism under the control of relevant stimuli in the environment. Future research pertaining to the specific implementation of these procedures (e.g., fading, script placement, participant characteristics) is discussed. © Society for the Experimental Analysis of Behavior.
Rodenas-Cuadrado, Pedro M; Mengede, Janine; Baas, Laura; Devanna, Paolo; Schmid, Tobias A; Yartsev, Michael; Firzlaff, Uwe; Vernes, Sonja C
2018-06-01
Genes including FOXP2, FOXP1, and CNTNAP2, have been implicated in human speech and language phenotypes, pointing to a role in the development of normal language-related circuitry in the brain. Although speech and language are unique to humans a comparative approach is possible by addressing language-relevant traits in animal systems. One such trait, vocal learning, represents an essential component of human spoken language, and is shared by cetaceans, pinnipeds, elephants, some birds and bats. Given their vocal learning abilities, gregarious nature, and reliance on vocalizations for social communication and navigation, bats represent an intriguing mammalian system in which to explore language-relevant genes. We used immunohistochemistry to detail the distribution of FoxP2, FoxP1, and Cntnap2 proteins, accompanied by detailed cytoarchitectural histology in the brains of two vocal learning bat species; Phyllostomus discolor and Rousettus aegyptiacus. We show widespread expression of these genes, similar to what has been previously observed in other species, including humans. A striking difference was observed in the adult P. discolor bat, which showed low levels of FoxP2 expression in the cortex that contrasted with patterns found in rodents and nonhuman primates. We created an online, open-access database within which all data can be browsed, searched, and high resolution images viewed to single cell resolution. The data presented herein reveal regions of interest in the bat brain and provide new opportunities to address the role of these language-related genes in complex vocal-motor and vocal learning behaviors in a mammalian model system. © 2018 The Authors The Journal of Comparative Neurology Published by Wiley Periodicals, Inc.
A Verbal Guidance System for Severe Disabled People
NASA Astrophysics Data System (ADS)
Redjati, Abdelghani; Bousbia-Salah, Mounir
2008-06-01
The recent development in rehabilitation technology allows to significantly broaden the range of possible applications that support handicapped people in their daily lives. This paper presents a moral and physical support for the disabled. It consists in the development of a verbal guidance system based on a speech recognition development kit `VD364'. This aid is intended to control a wheelchair and a manipulator arm for people with severe disabilities and who can speak. The study and design, conducted in the framework of this contribution have enabled an adaptation for a possible application and maximum exploitation of words that can be generated by a vocal module. The problem addressed is to allow a manipulator arm to compensate mechanically arm movements to give the handicapped satisfaction of his needs (for instance, drinking a glass of water). The objective is then to put forward a vocal command system that allows the arm to move in a well determined area to accomplish tasks that must be given by the user in addition to the displacement of the wheelchair.
Conserved mechanisms of vocalization coding in mammalian and songbird auditory midbrain.
Woolley, Sarah M N; Portfors, Christine V
2013-11-01
The ubiquity of social vocalizations among animals provides the opportunity to identify conserved mechanisms of auditory processing that subserve communication. Identifying auditory coding properties that are shared across vocal communicators will provide insight into how human auditory processing leads to speech perception. Here, we compare auditory response properties and neural coding of social vocalizations in auditory midbrain neurons of mammalian and avian vocal communicators. The auditory midbrain is a nexus of auditory processing because it receives and integrates information from multiple parallel pathways and provides the ascending auditory input to the thalamus. The auditory midbrain is also the first region in the ascending auditory system where neurons show complex tuning properties that are correlated with the acoustics of social vocalizations. Single unit studies in mice, bats and zebra finches reveal shared principles of auditory coding including tonotopy, excitatory and inhibitory interactions that shape responses to vocal signals, nonlinear response properties that are important for auditory coding of social vocalizations and modulation tuning. Additionally, single neuron responses in the mouse and songbird midbrain are reliable, selective for specific syllables, and rely on spike timing for neural discrimination of distinct vocalizations. We propose that future research on auditory coding of vocalizations in mouse and songbird midbrain neurons adopt similar experimental and analytical approaches so that conserved principles of vocalization coding may be distinguished from those that are specialized for each species. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives". Copyright © 2013 Elsevier B.V. All rights reserved.
Determining the etiology of mild vocal fold hypomobility.
Heman-Ackah, Yolanda D; Batory, Mark
2003-12-01
The prevalence of mild vocal fold hypomobility is unknown. In a study by Heman-Ackah et al, vocal fold hypomobility in a population of singing teachers was found to be associated more frequently with vocal complaints than was the presence of vocal fold masses. The etiology of mild vocal fold hypomobility has not been previously explored. In the present study, a retrospective chart review was performed of 134 patients who presented to a tertiary laryngology referral center over a 6-month period for evaluation of vocal complaints. Of the 134 patients, 61 (46%) were found to have mild vocal referring otolaryngologist. Imaging studies and laboratory tests to evaluate for structural, metabolic, and infectious causes of the decreased mobility had been ordered. Forty-nine patients completed the work-up. Of these, 41 out of 49 (84%) were found to have imaging or laboratory findings that could explain the hypomobility. Thyroid abnormalities were found to be associated with vocal fold hypomobility in 21 out of 49 (43%) of those with a complete evaluation. Other causes of vocal fold hypomobility included idiopathic (8 of 49, 16%), viral neuritis (5 of 49, 10%), central nervous system abnormality (4 of 49, 8%), neural tumor (3 of 49, 6%), joint dysfunction (3 of 49, 6%), iatrogenic nerve injury (2 of 49, 4%), myopathy (2 of 49, 4%), and noniatrogenic traumatic nerve injury (1 of 49, 2%), This study shows that unilateral vocal fold hypomobility often is associated with a physiologic process, and a complete investigation to determine the etiology is warranted in all cases.
Meerschman, Iris; Van Lierde, Kristiane; Peeters, Karen; Meersman, Eline; Claeys, Sofie; D'haeseleer, Evelien
2017-09-18
The purpose of this study was to determine the short-term effect of 2 semi-occluded vocal tract training programs, "resonant voice training using nasal consonants" versus "straw phonation," on the vocal quality of vocally healthy future occupational voice users. A multigroup pretest-posttest randomized control group design was used. Thirty healthy speech-language pathology students with a mean age of 19 years (range: 17-22 years) were randomly assigned into a resonant voice training group (practicing resonant exercises across 6 weeks, n = 10), a straw phonation group (practicing straw phonation across 6 weeks, n = 10), or a control group (receiving no voice training, n = 10). A voice assessment protocol consisting of both subjective (questionnaire, participant's self-report, auditory-perceptual evaluation) and objective (maximum performance task, aerodynamic assessment, voice range profile, acoustic analysis, acoustic voice quality index, dysphonia severity index) measurements and determinations was used to evaluate the participants' voice pre- and posttraining. Groups were compared over time using linear mixed models and generalized linear mixed models. Within-group effects of time were determined using post hoc pairwise comparisons. No significant time × group interactions were found for any of the outcome measures, indicating no differences in evolution over time among the 3 groups. Within-group effects of time showed a significant improvement in dysphonia severity index in the resonant voice training group, and a significant improvement in the intensity range in the straw phonation group. Results suggest that the semi-occluded vocal tract training programs using resonant voice training and straw phonation may have a positive impact on the vocal quality and vocal capacities of future occupational voice users. The resonant voice training caused an improved dysphonia severity index, and the straw phonation training caused an expansion of the intensity range in this population.
Liu, Ying; Hu, Huijing; Jones, Jeffery A; Guo, Zhiqiang; Li, Weifeng; Chen, Xi; Liu, Peng; Liu, Hanjun
2015-08-01
Speakers rapidly adjust their ongoing vocal productions to compensate for errors they hear in their auditory feedback. It is currently unclear what role attention plays in these vocal compensations. This event-related potential (ERP) study examined the influence of selective and divided attention on the vocal and cortical responses to pitch errors heard in auditory feedback regarding ongoing vocalisations. During the production of a sustained vowel, participants briefly heard their vocal pitch shifted up two semitones while they actively attended to auditory or visual events (selective attention), or both auditory and visual events (divided attention), or were not told to attend to either modality (control condition). The behavioral results showed that attending to the pitch perturbations elicited larger vocal compensations than attending to the visual stimuli. Moreover, ERPs were likewise sensitive to the attentional manipulations: P2 responses to pitch perturbations were larger when participants attended to the auditory stimuli compared to when they attended to the visual stimuli, and compared to when they were not explicitly told to attend to either the visual or auditory stimuli. By contrast, dividing attention between the auditory and visual modalities caused suppressed P2 responses relative to all the other conditions and caused enhanced N1 responses relative to the control condition. These findings provide strong evidence for the influence of attention on the mechanisms underlying the auditory-vocal integration in the processing of pitch feedback errors. In addition, selective attention and divided attention appear to modulate the neurobehavioral processing of pitch feedback errors in different ways. © 2015 Federation of European Neuroscience Societies and John Wiley & Sons Ltd.
Soderstrom, Ken; Wilson, Ashley R
2013-11-01
Zebra finch song is a learned behavior dependent upon successful progress through a sensitive period of late-postnatal development. This learning is associated with maturation of distinct brain nuclei and the fiber tract interconnections between them. We have previously found remarkably distinct and dense CB1 cannabinoid receptor expression within many of these song control brain regions, implying a normal role for endocannabinoid signaling in vocal learning. Activation of CB1 receptors via daily treatments with exogenous agonist during sensorimotor stages of song learning (but not in adulthood) results in persistent alteration of song patterns. Now we are working to understand physiological changes responsible for this cannabinoid-altered vocal learning. We have found that song-altering developmental treatments are associated with changes in expression of endocannabinoid signaling elements, including CB1 receptors and the principal CNS endogenous agonist, 2-AG. Within CNS, 2-AG is produced largely through activity of the α isoform of the enzyme diacylglycerol lipase (DAGLα). To better appreciate the role of 2-AG production in normal vocal development we have determined the spatial distribution of DAGLα expression within zebra finch CNS during vocal development. Early during vocal development at 25 days, DAGLα staining is typically light and of fibroid processes. Staining peaks late in the sensorimotor stage of song learning at 75 days and is characterized by fiber, neuropil and some staining of both small and large cell somata. Results provide insight to the normal role for endocannabinoid signaling in the maturation of brain regions responsible for song learning and vocal-motor output, and suggest mechanisms by which exogenous cannabinoid exposure alters acquisition of this form of vocal communication. Copyright © 2013 Elsevier B.V. All rights reserved.
Social learning of vocal structure in a nonhuman primate?
2011-01-01
Background Non-human primate communication is thought to be fundamentally different from human speech, mainly due to vast differences in vocal control. The lack of these abilities in non-human primates is especially striking if compared to some marine mammals and bird species, which has generated somewhat of an evolutionary conundrum. What are the biological roots and underlying evolutionary pressures of the human ability to voluntarily control sound production and learn the vocal utterances of others? One hypothesis is that this capacity has evolved gradually in humans from an ancestral stage that resembled the vocal behavior of modern primates. Support for this has come from studies that have documented limited vocal flexibility and convergence in different primate species, typically in calls used during social interactions. The mechanisms underlying these patterns, however, are currently unknown. Specifically, it has been difficult to rule out explanations based on genetic relatedness, suggesting that such vocal flexibility may not be the result of social learning. Results To address this point, we compared the degree of acoustic similarity of contact calls in free-ranging Campbell's monkeys as a function of their social bonds and genetic relatedness. We calculated three different indices to compare the similarities between the calls' frequency contours, the duration of grooming interactions and the microsatellite-based genetic relatedness between partners. We found a significantly positive relation between bond strength and acoustic similarity that was independent of genetic relatedness. Conclusion Genetic factors determine the general species-specific call repertoire of a primate species, while social factors can influence the fine structure of some the call types. The finding is in line with the more general hypothesis that human speech has evolved gradually from earlier primate-like vocal communication. PMID:22177339
Valadez, Victor; Ysunza, Antonio; Ocharan-Hernandez, Esther; Garrido-Bustamante, Norma; Sanchez-Valerio, Araceli; Pamplona, Ma C
2012-09-01
Vocal Nodules (VN) are a functional voice disorder associated with voice misuse and abuse in children. There are few reports addressing vocal parameters in children with VN, especially after a period of vocal rehabilitation. The purpose of this study is to describe measurements of vocal parameters including Fundamental Frequency (FF), Shimmer (S), and Jitter (J), videonasolaryngoscopy examination and clinical perceptual assessment, before and after voice therapy in children with VN. Voice therapy was provided using visual support through Speech-Viewer software. Twenty patients with VN were studied. An acoustical analysis of voice was performed and compared with data from subjects from a control group matched by age and gender. Also, clinical perceptual assessment of voice and videonasolaryngoscopy were performed to all patients with VN. After a period of voice therapy, provided with visual support using Speech Viewer-III (SV-III-IBM) software, new acoustical analyses, perceptual assessments and videonasolaryngoscopies were performed. Before the onset of voice therapy, there was a significant difference (p<0.05) in mean FF, S and J, between the patients with VN and subjects from the control group. After the voice therapy period, a significant improvement (p<0.05) was found in all acoustic voice parameters. Moreover, perceptual voice analysis demonstrated improvement in all cases. Finally, videonasolaryngoscopy demonstrated that vocal nodules were no longer discernible on the vocal folds in any of the cases. SV-III software seems to be a safe and reliable method for providing voice therapy in children with VN. Acoustic voice parameters, perceptual data and videonasolaryngoscopy were significantly improved after the speech therapy period was completed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Bastian, Robert W; Thomas, James P
2016-09-01
Assess the correlation between self-rating scales of talkativeness and loudness with various types of voice disorders. This is a retrospective study. A total of 974 patients were analyzed. The cohort study included 430 consecutive patients presenting to the senior author with voice complaints from December 1995 to December 1998. The case-control study added 544 consecutive patients referred to the same examiner from January 1988 to December 1998 for vocal fold examination before thyroid, parathyroid, and carotid surgery. Patient responses on seven-point Likert self-rating scales of talkativeness and loudness were compared with laryngeal disease. Mucosal lesions clearly associated with vibratory trauma are strongly associated with a high self-rating of talkativeness. Laryngeal deconditioning disorders were associated with a low self-rating of talkativeness. Use of a simple self-rating scale of vocal loudness and talkativeness during history taking can reliably orient the examiner to the types of voice disorders likely to be diagnosed subsequently during vocal capability testing and visual laryngeal examination. The high degree of talkativeness and loudness seen in vocal overdoers correlates well with mucosal disorders such as nodules, polyps, capillary ectasia, epidermoid inclusion cysts, and hemorrhage. A lower degree of talkativeness correlates with muscle deconditioning disorders such as vocal fold bowing, atrophy, presbyphonia, and vocal fatigue syndrome. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Raineki, Charlis; Bodnar, Tamara S; Holman, Parker J; Baglot, Samantha L; Lan, Ni; Weinberg, Joanne
2017-11-01
The contribution of the early postnatal environment to the pervasive effects of prenatal alcohol exposure (PAE) is poorly understood. Moreover, PAE often carries increased risk of exposure to adversity/stress during early life. Dysregulation of immune function may play a role in how pre- and/or postnatal adversity/stress alters brain development. Here, we combine two animal models to examine whether PAE differentially increases vulnerability to immune dysregulation in response to early-life adversity. PAE and control litters were exposed to either limited bedding (postnatal day [PN] 8-12) to model early-life adversity or normal bedding, and maternal behavior and pup vocalizations were recorded. Peripheral (serum) and central (amygdala) immune (cytokines and C-reactive protein - CRP) responses of PAE animals to early-life adversity were evaluated at PN12. Insufficient bedding increased negative maternal behavior in both groups. Early-life adversity increased vocalization in all animals; however, PAE pups vocalized less than controls. Early-life adversity reduced serum TNF-α, KC/GRO, and IL-10 levels in control but not PAE animals. PAE increased serum CRP, and levels were even higher in pups exposed to adversity. Finally, PAE reduced KC/GRO and increased IL-10 levels in the amygdala. Our results indicate that PAE alters immune system development and both behavioral and immune responses to early-life adversity, which could have subsequent consequences for brain development and later life health. Copyright © 2017 Elsevier Inc. All rights reserved.
Vocal Changes Following Thyroid Surgery: Prospective Study of Objective and Subjective Parameters.
Delgado-Vargas, Beatriz; Lloris Romero-Salazar, Azucena; Cobeta, Ignacio
2017-10-19
Vocal changes are frequent following a surgical procedure to the thyroid gland. Even though they are a recognized morbidity, their bases are yet to be defined as well as their effect on vocal parameters. This study investigates the objective and subjective changes that occur after the surgery. This study is a prospective analysis of consecutive cases. This study was conducted in a single-center tertiary care facility. Patients programmed for any thyroid procedure in Hospital Universitario Ramón y Cajal were enrolled consecutively to perform the vocal analysis before and after the surgery from April 2014 to April 2016. Patients were divided according to the vocal fold motility, and their vocal and aerodynamic parameters were obtained by means of electroglottography and phonatory aerodynamic system. Patients filled in the 10-item Voice Handicap Index (VHI-10) questionnaire. Statistical analysis was performed comparing vocal and aerodynamic parameters and quality of life before and after the surgery. 218 patients met inclusion criteria and completed the protocol. A total of 86.6% of the sample showed no vocal motility impairment, whereas the rest of the patients showed a paresis or a paralysis. Maximum phonatory time and VHI-10 questionnaire showed a statistically significant difference (P < 0.05) between groups. No differences were assessed regarding other vocal parameters. Efforts are still needed to understand the groundings and magnitude of the vocal changes after a thyroid surgery. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Communication modality sampling for a toddler with Angelman syndrome.
Hyppa Martin, Jolene; Reichle, Joe; Dimian, Adele; Chen, Mo
2013-10-01
Vocal, gestural, and graphic communication modes were implemented concurrently with a toddler with Angelman syndrome to identify the most efficiently learned communication mode to emphasize in an initial augmentative communication system. Symbols representing preferred objects were introduced in vocal, gestural, and graphic communication modes using an alternating treatment single-subject experimental design. Conventionally accepted prompting strategies were used to teach symbols in each communication mode. Because the learner did not vocally imitate, vocal mode intervention focused on increasing vocal frequency as an initial step. When graphic and gestural mode performances were compared, the learner most accurately produced requests in graphic mode (percentage of nonoverlapping data = 96). Given the lack of success in prompting vocal productions, a comparison between vocal and the other two communication modes was not made. A growing body of evidence suggests that concurrent modality sampling is a promising low-inference, data-driven procedure that can be used to inform selection of a communication mode(s) for initial emphasis with young children. Concurrent modality sampling can guide clinical decisions regarding the allocation of treatment resources to promote success in building an initial communicative repertoire.
Biases in facial and vocal emotion recognition in chronic schizophrenia
Dondaine, Thibaut; Robert, Gabriel; Péron, Julie; Grandjean, Didier; Vérin, Marc; Drapier, Dominique; Millet, Bruno
2014-01-01
There has been extensive research on impaired emotion recognition in schizophrenia in the facial and vocal modalities. The literature points to biases toward non-relevant emotions for emotional faces but few studies have examined biases in emotional recognition across different modalities (facial and vocal). In order to test emotion recognition biases, we exposed 23 patients with stabilized chronic schizophrenia and 23 healthy controls (HCs) to emotional facial and vocal tasks asking them to rate emotional intensity on visual analog scales. We showed that patients with schizophrenia provided higher intensity ratings on the non-target scales (e.g., surprise scale for fear stimuli) than HCs for the both tasks. Furthermore, with the exception of neutral vocal stimuli, they provided the same intensity ratings on the target scales as the HCs. These findings suggest that patients with chronic schizophrenia have emotional biases when judging emotional stimuli in the visual and vocal modalities. These biases may stem from a basic sensorial deficit, a high-order cognitive dysfunction, or both. The respective roles of prefrontal-subcortical circuitry and the basal ganglia are discussed. PMID:25202287
Guillette, Lauren M; Sturdy, Christopher B
2011-11-01
Recent research in songbirds has demonstrated that male singing behavior varies systematically with personality traits such as exploration and risk taking. Here we examine whether the production of bird calls, in addition to bird songs, is repeatable and related to exploratory behavior, using the black-capped chickadee (Poecile atricapillus) as a model. We assessed the exploratory behavior of individual birds in a novel environment task. We then recorded the vocalizations and accompanying motor behavior of both male and female chickadees, over the course of several days, in two different contexts: a control condition with no playback and a stressful condition where chick-a-dee mobbing calls were played to individual birds. We found that several vocalizations and behaviors were repeatable within both a control and a stressful context, and across contexts. While there was no relationship between vocal output and exploratory behavior in the control context, production of alarm and chick-a-dee calls in the stressful condition was positively associated with exploratory behavior. These findings are important because they show that bird calls, in addition to bird song, are an aspect of personality, in that calls are consistent both within and across contexts, and covary with other personality measures (exploration).
NASA Astrophysics Data System (ADS)
Guillette, Lauren M.; Sturdy, Christopher B.
2011-11-01
Recent research in songbirds has demonstrated that male singing behavior varies systematically with personality traits such as exploration and risk taking. Here we examine whether the production of bird calls, in addition to bird songs, is repeatable and related to exploratory behavior, using the black-capped chickadee ( Poecile atricapillus) as a model. We assessed the exploratory behavior of individual birds in a novel environment task. We then recorded the vocalizations and accompanying motor behavior of both male and female chickadees, over the course of several days, in two different contexts: a control condition with no playback and a stressful condition where chick-a-dee mobbing calls were played to individual birds. We found that several vocalizations and behaviors were repeatable within both a control and a stressful context, and across contexts. While there was no relationship between vocal output and exploratory behavior in the control context, production of alarm and chick-a-dee calls in the stressful condition was positively associated with exploratory behavior. These findings are important because they show that bird calls, in addition to bird song, are an aspect of personality, in that calls are consistent both within and across contexts, and covary with other personality measures (exploration).
Vibratory regime classification of infant phonation.
Buder, Eugene H; Chorna, Lesya B; Oller, D Kimbrough; Robinson, Rebecca B
2008-09-01
Infant phonation is highly variable in many respects, including the basic vibratory patterns by which the vocal tissues create acoustic signals. Previous studies have identified the regular occurrence of nonmodal phonation types in normal infant phonation. The glottis is like many oscillating systems that, because of nonlinear relationships among the elements, may vibrate in ways representing the deterministic patterns classified theoretically within the mathematical framework of nonlinear dynamics. The infant's preverbal vocal explorations present such a variety of phonations that it may be possible to find effectively all the classes of vibration predicted by nonlinear dynamic theory. The current report defines acoustic criteria for an important subset of such vibratory regimes, and demonstrates that analysts can be trained to reliably use these criteria for a classification that includes all instances of infant phonation in the recorded corpora. The method is thus internally comprehensive in the sense that all phonations are classified, but it is not exhaustive in the sense that all vocal qualities are thereby represented. Using the methods thus developed, this study also demonstrates that the distributions of these phonation types vary significantly across sessions of recording in the first year of life, suggesting developmental changes. The method of regime classification is thus capable of tracking changes that may be indicative of maturation of the mechanism, the learning of categories of phonatory control, and the possibly varying use of vocalizations across social contexts.
A duetting perspective on avian song learning.
Rivera-Cáceres, Karla D; Templeton, Christopher N
2017-12-25
Avian song learning has a rich history of study and has become the preeminent system for understanding the ontogeny of vocal communication in animals. Song learning in birds has many parallels with human language learning, ranging from the neural mechanisms involved to the importance of social factors in shaping signal acquisition. While much has been learned about the process of song learning, virtually all of the research done to date has focused on temperate species, where often only one sex (the male) sings. Duetting species, in which both males and females learn to sing and learn to combine their songs into temporally coordinated joint displays, could provide many insights into the processes by which vocal learning takes place. Here we highlight three key features of song learning-neuroendocrine control mechanisms, timing and life history stages of song acquisition, and the role of social factors in song selection and use-that have been elucidated from species where only males sing, and compare these with duetting species. We summarize what is known about song learning in duetting species and then provide several suggestions for fruitful directions for future research. We suggest that focusing research efforts on duetting species could significantly advance our understanding of vocal learning in birds and further cement the importance of avian species as models for understanding human conversations and the processes of vocal learning more broadly. Copyright © 2017 Elsevier B.V. All rights reserved.
Paternal kin recognition in the high frequency / ultrasonic range in a solitary foraging mammal
2012-01-01
Background Kin selection is a driving force in the evolution of mammalian social complexity. Recognition of paternal kin using vocalizations occurs in taxa with cohesive, complex social groups. This is the first investigation of paternal kin recognition via vocalizations in a small-brained, solitary foraging mammal, the grey mouse lemur (Microcebus murinus), a frequent model for ancestral primates. We analyzed the high frequency/ultrasonic male advertisement (courtship) call and alarm call. Results Multi-parametric analyses of the calls’ acoustic parameters and discriminant function analyses showed that advertisement calls, but not alarm calls, contain patrilineal signatures. Playback experiments controlling for familiarity showed that females paid more attention to advertisement calls from unrelated males than from their fathers. Reactions to alarm calls from unrelated males and fathers did not differ. Conclusions 1) Findings provide the first evidence of paternal kin recognition via vocalizations in a small-brained, solitarily foraging mammal. 2) High predation, small body size, and dispersed social systems may select for acoustic paternal kin recognition in the high frequency/ultrasonic ranges, thus limiting risks of inbreeding and eavesdropping by predators or conspecific competitors. 3) Paternal kin recognition via vocalizations in mammals is not dependent upon a large brain and high social complexity, but may already have been an integral part of the dispersed social networks from which more complex, kin-based sociality emerged. PMID:23198727
2015-01-01
Abstract Vowels provide the acoustic foundation of communication through speech and song, but little is known about how the brain orchestrates their production. Positron emission tomography was used to study regional cerebral blood flow (rCBF) during sustained production of the vowel /a/. Acoustic and blood flow data from 13, normal, right-handed, native speakers of American English were analyzed to identify CBF patterns that predicted the stability of the first and second formants of this vowel. Formants are bands of resonance frequencies that provide vowel identity and contribute to voice quality. The results indicated that formant stability was directly associated with blood flow increases and decreases in both left- and right-sided brain regions. Secondary brain regions (those associated with the regions predicting formant stability) were more likely to have an indirect negative relationship with first formant variability, but an indirect positive relationship with second formant variability. These results are not definitive maps of vowel production, but they do suggest that the level of motor control necessary to produce stable vowels is reflected in the complexity of an underlying neural system. These results also extend a systems approach to functional image analysis, previously applied to normal and ataxic speech rate that is solely based on identifying patterns of brain activity associated with specific performance measures. Understanding the complex relationships between multiple brain regions and the acoustic characteristics of vocal stability may provide insight into the pathophysiology of the dysarthrias, vocal disorders, and other speech changes in neurological and psychiatric disorders. PMID:25295385
Pre-attentive auditory discrimination skill in Indian classical vocal musicians and non-musicians.
Sanju, Himanshu Kumar; Kumar, Prawin
2016-09-01
To test for pre-attentive auditory discrimination skills in Indian classical vocal musicians and non-musicians. Mismatch negativity (MMN) was recorded to test for pre-attentive auditory discrimination skills with a pair of stimuli of /1000 Hz/ and /1100 Hz/, with /1000 Hz/ as the frequent stimulus and /1100 Hz/ as the infrequent stimulus. Onset, offset and peak latencies were the considered latency parameters, whereas peak amplitude and area under the curve were considered for amplitude analysis. Exactly 50 participants, out of which the experimental group had 25 adult Indian classical vocal musicians and 25 age-matched non-musicians served as the control group, were included in the study. Experimental group participants had a minimum professional music experience in Indian classic vocal music of 10 years. However, control group participants did not have any formal training in music. Descriptive statistics showed better waveform morphology in the experimental group as compared to the control. MANOVA showed significantly better onset latency, peak amplitude and area under the curve in the experimental group but no significant difference in the offset and peak latencies between the two groups. The present study probably points towards the enhancement of pre-attentive auditory discrimination skills in Indian classical vocal musicians compared to non-musicians. It indicates that Indian classical musical training enhances pre-attentive auditory discrimination skills in musicians, leading to higher peak amplitude and a greater area under the curve compared to non-musicians.
In Vivo measurement of pediatric vocal fold motion using structured light laser projection.
Patel, Rita R; Donohue, Kevin D; Lau, Daniel; Unnikrishnan, Harikrishnan
2013-07-01
The aim of the study was to present the development of a miniature structured light laser projection endoscope and to quantify vocal fold length and vibratory features related to impact stress of the pediatric glottis using high-speed imaging. The custom-developed laser projection system consists of a green laser with a 4-mm diameter optics module at the tip of the endoscope, projecting 20 vertical laser lines on the glottis. Measurements of absolute phonatory vocal fold length, membranous vocal fold length, peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity were obtained in five children (6-9 years), two adult male and three adult female participants without voice disorders, and one child (10 years) with bilateral vocal fold nodules during modal phonation. Independent measurements made on the glottal length of a vocal fold phantom demonstrated a 0.13mm bias error with a standard deviation of 0.23mm, indicating adequate precision and accuracy for measuring vocal fold structures and displacement. First, in vivo measurements of amplitude-to-length ratio, peak closing velocity, and impact velocity during phonation in pediatric population and a child with vocal fold nodules are reported. The proposed laser projection system can be used to obtain in vivo measurements of absolute length and vibratory features in children and adults. Children have large amplitude-to-length ratio compared with typically developing adults, whereas nodules result in larger peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity compared with typically developing children. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
A rare case of non-surgical vocal cord paralysis: Vocal cord hematoma.
Arıkan, Akif Enes; Teksöz, Serkan; Bilgin, İsmail Ahmet; Tarhan, Özge; Özyeğin, Ateş
2017-01-01
Although vocal cord paralysis (VCP) following thyroidectomy is primarily associated with surgical trauma, it is not the sole etiology. Vocal cord paralysis following thyroidectomy can be caused by a vocal cord hematoma with an incidence of 1.4% due to direct injury during orotracheal intubation. In this article, we present a case of VCP caused by vocal cord hematoma. A 32-year-old male patient who has been receiving propylthiouracil treatment for toxic multinodular goiter since 10 years was admitted to our hospital to be operated because of persisting complaints. The patient was hospitalized for sutureless thyroidectomy after he became euthyroid. Preoperative fiberoptic laryngoscopy performed by the ear, nose, and throat department revealed bilaterally motile vocal folds and a completely open airway. Patient underwent sutureless total thyroidectomy with a vessel sealing device (Ligasure TM LF1212, Covidien, CO), and a minivac drainage system was placed in the thyroid lodge. On the morning of the first postoperative day, 50 mL of serosanguinous fluid was drained. The patient's voice was normal, and there was no ecchymosis. Postoperative fiberoptic laryngoscopy revealed a hematoma near the right vocal fold and paralysis of the right vocal fold; however, the airway was open. It should be kept in mind that VCP is not solely due to surgery but can also result from intubation, as observed in this case.
A rare case of non-surgical vocal cord paralysis: Vocal cord hematoma
Arıkan, Akif Enes; Teksöz, Serkan; Bilgin, İsmail Ahmet; Tarhan, Özge; Özyeğin, Ateş
2017-01-01
Although vocal cord paralysis (VCP) following thyroidectomy is primarily associated with surgical trauma, it is not the sole etiology. Vocal cord paralysis following thyroidectomy can be caused by a vocal cord hematoma with an incidence of 1.4% due to direct injury during orotracheal intubation. In this article, we present a case of VCP caused by vocal cord hematoma. A 32-year-old male patient who has been receiving propylthiouracil treatment for toxic multinodular goiter since 10 years was admitted to our hospital to be operated because of persisting complaints. The patient was hospitalized for sutureless thyroidectomy after he became euthyroid. Preoperative fiberoptic laryngoscopy performed by the ear, nose, and throat department revealed bilaterally motile vocal folds and a completely open airway. Patient underwent sutureless total thyroidectomy with a vessel sealing device (LigasureTM LF1212, Covidien, CO), and a minivac drainage system was placed in the thyroid lodge. On the morning of the first postoperative day, 50 mL of serosanguinous fluid was drained. The patient’s voice was normal, and there was no ecchymosis. Postoperative fiberoptic laryngoscopy revealed a hematoma near the right vocal fold and paralysis of the right vocal fold; however, the airway was open. It should be kept in mind that VCP is not solely due to surgery but can also result from intubation, as observed in this case. PMID:29260141
How low can you go? Physical production mechanism of elephant infrasonic vocalizations.
Herbst, Christian T; Stoeger, Angela S; Frey, Roland; Lohscheller, Jörg; Titze, Ingo R; Gumpenberger, Michaela; Fitch, W Tecumseh
2012-08-03
Elephants can communicate using sounds below the range of human hearing ("infrasounds" below 20 hertz). It is commonly speculated that these vocalizations are produced in the larynx, either by neurally controlled muscle twitching (as in cat purring) or by flow-induced self-sustained vibrations of the vocal folds (as in human speech and song). We used direct high-speed video observations of an excised elephant larynx to demonstrate flow-induced self-sustained vocal fold vibration in the absence of any neural signals, thus excluding the need for any "purring" mechanism. The observed physical principles of voice production apply to a wide variety of mammals, extending across a remarkably large range of fundamental frequencies and body sizes, spanning more than five orders of magnitude.
Self-Organization: Complex Dynamical Systems in the Evolution of Speech
NASA Astrophysics Data System (ADS)
Oudeyer, Pierre-Yves
Human vocalization systems are characterized by complex structural properties. They are combinatorial, based on the systematic reuse of phonemes, and the set of repertoires in human languages is characterized by both strong statistical regularities—universals—and a great diversity. Besides, they are conventional codes culturally shared in each community of speakers. What are the origins of the forms of speech? What are the mechanisms that permitted their evolution in the course of phylogenesis and cultural evolution? How can a shared speech code be formed in a community of individuals? This chapter focuses on the way the concept of self-organization, and its interaction with natural selection, can throw light on these three questions. In particular, a computational model is presented which shows that a basic neural equipment for adaptive holistic vocal imitation, coupling directly motor and perceptual representations in the brain, can generate spontaneously shared combinatorial systems of vocalizations in a society of babbling individuals. Furthermore, we show how morphological and physiological innate constraints can interact with these self-organized mechanisms to account for both the formation of statistical regularities and diversity in vocalization systems.
In vivo comparison of biomimetic approaches for tissue regeneration of the scarred vocal fold.
Thibeault, Susan L; Klemuk, Sarah A; Smith, Marshall E; Leugers, Cecilia; Prestwich, Glenn
2009-07-01
The objective of this study was to determine if three different biomimetic approaches could facilitate tissue regeneration and improve viscoelastic properties in the scarred vocal fold lamina propria extracellular matrix (ECM). Twenty rabbit vocal folds were biopsied bilaterally; 2 months postinjury rabbits were unilaterally treated with (i) autologous fibroblasts, (ii) a semisynthetic ECM (sECM), or (iii) autologous fibroblasts encapsulated in sECM. Saline was injected as a control into the contralateral fold. Animals were sacrificed 2 months after treatment. Outcomes measured were procollagen, collagen, and fibronectin levels in the lamina propria, and tissue viscosity and elasticity across three frequency decades. All treatment groups demonstrated accelerated proliferation of the ECM. Vocal fold lamina propria treated with autologous fibroblasts were found to have significantly improved viscosity (p = 0.0077) and elasticity (p = 0.0081) compared to saline. This treatment group had significantly elevated fibronectin levels. sECM and autologous fibroblasts/sECM groups had significantly elevated levels of procollagen, collagen, and fibronectin, indicating abundant matrix production as compared to saline with viscoelastic measures that did not differ statistically from controls. The use of autologous fibroblasts led to better restoration of the vocal fold lamina propria biomechanical properties. Optimization of cell-scaffold interactions and subsequent cell behavior is necessary for utilization of scaffold and scaffold-cell approaches.
Evaluation of the Grafted Fascia in the Vocal Fold of Dogs: A Histologic Study.
Carvalho, Eduardo G B; Pauna, Henrique F; Machado, Almiro J; Nicola, Ester M D; Altemani, Albina M A M; Crespo, Agrício N
2017-09-01
There is no consensus on the ideal graft for medialization surgery of the vocal folds in the literature. One of the most favorable proposals is the use of autologous fascia, which seems limited by the lack of information regarding the integration of grafted tissue. Our study aims to evaluate the degree of fully engrafted fascia integration in the vocal fold lamina propria of dogs. Fourteen adult mongrel dogs that underwent intravenous general anesthesia were selected and kept under spontaneous ventilation. A fascia lata fragment of 4 cm 2 was obtained from the right leg of each dog. The dogs underwent laryngoscopy; a 3 mm incision was made in the vocal process, next to the vestibular process, and the fascia was grafted into the right vocal fold. The left vocal fold was used as a control. The animals were divided into two groups: group A, evaluated after 2 months of the procedure, and group B, evaluated after 6 months of the procedure. Histologic analysis was made semiquantitatively regarding the presence of inflammatory reaction, fibrosis, and neovascularization. Our final studied group comprised 12 dogs. Microscopic examination of the larynx revealed the absence of any detectable inflammation in the incision site. The lamina propria of the grafted vocal fold showed identifiable compact, thick, and eosinophilic collagen bands. The surrounding tissue showed thin collagen bands with some organization, similar to the contralateral vocal fold. The grafted fascia integrates into the vocal fold lamina propria and seems not to cause inflammatory reaction response. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Compairing Picture Exchange and Voice Output Communication Aids in Young Children with Autism
ERIC Educational Resources Information Center
Lorah, Elizabeth R.
2012-01-01
The Center for Disease Control estimates that one in 88 births result in a diagnosis of autism (CDC, 2012). Of those individuals diagnosed with autism approximately 25-61% fail to develop vocal output capabilities (Weitxz, Dexter, & Moore, 1997). The use of Augmentative and Alternative Communication (AAC) systems, such as Picture Exchange (PE)…
A Vowel-Based Method for Vocal Tract Control in Clarinet Pedagogy
ERIC Educational Resources Information Center
González, Darleny; Payri, Blas
2017-01-01
Our review of scientific literature shows that the activity inside the clarinetist's vocal tract (VT) affects pitch and timbre, while also facilitating technical exercises. Clarinetists adapt their VT intuitively and, in some cases, may compensate an inadequate VT configuration through unnecessary pressure, resulting in technical blockage,…
ERIC Educational Resources Information Center
Hursh, Daniel E.
In Skinner's analysis of verbal behavior, three categories of environmental control over instances of verbal behavior appear to be relevant to the study of infant vocal development: the mand, the tact, and the echoic categories. Procedures used in the remediation of language deficiencies and procedures found in work in the area of language…
Difference between the vocalizations of two sister species of pigeons explained in dynamical terms.
Alonso, R Gogui; Kopuchian, Cecilia; Amador, Ana; Suarez, Maria de Los Angeles; Tubaro, Pablo L; Mindlin, Gabriel B
2016-05-01
Vocal communication is an unique example, where the nonlinear nature of the periphery can give rise to complex sounds even when driven by simple neural instructions. In this work we studied the case of two close-related bird species, Patagioenas maculosa and Patagioenas picazuro, whose vocalizations differ only in the timbre. The temporal modulation of the fundamental frequency is similar in both cases, differing only in the existence of sidebands around the fundamental frequency in the P. maculosa. We tested the hypothesis that the qualitative difference between these vocalizations lies in the nonlinear nature of the syrinx. In particular, we propose that the roughness of maculosa's vocalizations is due to an asymmetry between the right and left vibratory membranes, whose nonlinear dynamics generate the sound. To test the hypothesis, we generated a biomechanical model for vocal production with an asymmetric parameter Q with which we can control the level of asymmetry between these membranes. Using this model we generated synthetic vocalizations with the principal acoustic features of both species. In addition, we confirmed the anatomical predictions by making post mortem inspection of the syrinxes, showing that the species with tonal song (picazuro) has a more symmetrical pair of membranes compared to maculosa.
Difference between the vocalizations of two sister species of pigeons explained in dynamical terms
Alonso, R. Gogui; Kopuchian, Cecilia; Amador, Ana; de los Angeles Suarez, Maria; Tubaro, Pablo L.; Mindlin, Gabriel B.
2016-01-01
Vocal communication is a unique example where the nonlinear nature of the periphery can give rise to complex sounds even when driven by simple neural instructions. In this work we studied the case of two close-related bird species, Patagioenas maculosa and Patagioenas picazuro, whose vocalizations differ only in the timbre. The temporal modulation of the fundamental frequency is similar in both cases, differing only in the existence of sidebands around the fundamental frequency in the Patagioenas maculosa. We tested the hypothesis that the qualitative difference between these vocalizations lies in the nonlinear nature of the syrinx. In particular, we propose that the roughness of maculosa's vocalizations is due to an asymmetry between the right and left vibratory membranes, whose nonlinear dynamics generate the sound. To test the hypothesis, we generated a biomechanical model for vocal production with an asymmetric parameter Q with which we can control the level of asymmetry between these membranes. Using this model we generated synthetic vocalizations with the principal acoustic features of both species. In addition, we confirmed the anatomical predictions by making post-mortem inspection of the syrinxes, showing that the species with tonal song (picazuro) has a more symmetrical pair of membranes compared to maculosa. PMID:27033354
Understanding The Neural Mechanisms Involved In Sensory Control Of Voice Production
Parkinson, Amy L.; Flagmeier, Sabina G.; Manes, Jordan L.; Larson, Charles R.; Rogers, Bill; Robin, Donald A.
2012-01-01
Auditory feedback is important for the control of voice fundamental frequency (F0). In the present study we used neuroimaging to identify regions of the brain responsible for sensory control of the voice. We used a pitch-shift paradigm where subjects respond to an alteration, or shift, of voice pitch auditory feedback with a reflexive change in F0. To determine the neural substrates involved in these audio-vocal responses, subjects underwent fMRI scanning while vocalizing with or without pitch-shifted feedback. The comparison of shifted and unshifted vocalization revealed activation bilaterally in the superior temporal gyrus (STG) in response to the pitch shifted feedback. We hypothesize that the STG activity is related to error detection by auditory error cells located in the superior temporal cortex and efference copy mechanisms whereby this region is responsible for the coding of a mismatch between actual and predicted voice F0. PMID:22406500
NASA Astrophysics Data System (ADS)
Laje, Rodrigo; Mindlin, Gabriel B.
2002-12-01
We present a model for the activities of neural circuits in a nucleus found in the brains of songbirds: the robust nucleus of the archistriatum (RA). This is a fore brain song control nucleus responsible for the phasic and precise neural signals driving vocal and respiratory motor neurons during singing. Driving a physical model of the avian vocal organ with the signals generated by the neural model, we produce synthetic songs. This allows us to show that certain connectivity architectures in the RA give rise to a wide range of different vocalizations under simple excitatory instructions.
Paul, Nilanjan; Kumar, Suman; Chatterjee, Indranil; Mukherjee, Biswarup
2011-01-01
In-depth study on laryngeal biomechanics and vocal fold vibratory patterns reveal that a single vibratory cycle can be divided into two major phases, the closed and open phase, which is subdivided into opening and closing phases. Studies reveal that the relative time course of abduction and adduction, which in turn is dependent on the relative relaxing and tensing of the vocal fold cover and body, to be the determining factor in production of a particular vocal register like the modal (or chest), falsetto, glottal fry registers. Studies further point out Electroglottography to be particularly suitable for the study of vocal vibratory patterns during register changes. However, to date, there has been limited study on quantitative parameterization of EGG wave form in vocal fry register. Moreover, contradictory findings abound in literature regarding effects of gender and vowel types on vocal vibratory patterns, especially during phonation at different registers. The present study endeavors to find out the effects of vowel and gender differences on the vocal fold vibratory patterns in different registers and how these would be reflected in standard EGG parameters of Contact Quotient (CQ) and Contact Index (CI), taking into consideration the Indian sociolinguistic context. Electroglottographic recordings of 10 young adults (5 males and 5 females) were taken while the subjects phonated the three vowels /a/,/i/,/u/ each in two vocal registers, modal and vocal fry. Obtained raw EGG were further normalized using the Derived EGG algorithm and theCQ and CI values were derived. Obtained data were subject to statistical analysis using the 3-way ANOVA with gender, vowel and vocal register as the three variables. Post-hoc Dunnett C multiple comparison analysis were also performed. Results reveal that CQ values are significantly higher in vocal fry than modal phonation for both males and females, indicating a relatively hyperconstricted vocal system during vocal fry. The males have significantly greater CQ values than females both at modal and vocal fry phonations which indicate that the males are predisposed to greater vocal fold constriction. Females demonstrated no significant increase in CI values in vocal fry state; and in some cases actually decrease in the CI values which suggest an inherently distinct vocal fold physiological adjustment from that in males. No vowel effects were found in any conditions. Perturbation values (CQP and CIP) are significantly more in vocal fry register than in modal register, and the increase was more in case of females than males. The findings give strong evidence to certain hypotheses in literature regarding effects of vowel, gender and phonatory register on vocal fold vibratory patterns.
Liang, G T; Duan, B Y; Zhang, Y Y; Luo, S W; Lu, L; Yang, L P; Wang, B R
2017-01-20
Objective: Building a canine vocal fold scar model and analyzing the changes in morphology, histopathology and related factors of extra cellular matrix (ECM) of vocal cord healing at different time periods. Method: Five experimental dogs were randomly divided into the control group(one dog) and the experimental group (four dogs). No special treatment was done on the control group, and the experimental group was given CO₂ laser through laryngoscope with minimally invasive surgery on bilateral vocal cords. Observation of the morphological changes of injured vocal cords was made at five different time, pre-operation and 6 h, 3 w, 8 w and 12 w post-operation. HE staining, immunofluorescence, were used respectively to observe the histopathological and ultrastructural changes of each layer of vocal cord. Observation were made on the changing pattern of lysyl oxidase (LOX), heat shock proteins70 (HSP70), and the integrated optical density index (IOD) of Hyaluronic acid (HA) in vocal cord tissues. Result: ①Laryngoscope shows there were mild congestion and edema and inflammatory exudation on the wound surface of vocal cord 3w post-operation. On the wound surface of vocal cord 8 w post-operation, the congestion and edema disappear, the local contraction sink, and no adhesion and granulation form. 12 w post-operation, the surface of the vocal cord was smooth, there were local contractures, vocal fold scar form, and hoarseness in the bilateral vocal cords was obvious. ②HE staining shows 6 h post-operation there were a large number of inflammatory cell infiltration, red blood cell leakage, and cell congestion and edema on the wound surface of vocal cord. 3 w post-operation, there were fibroblast proliferation, angiogenesis, and a large number of fibrous tissues disorderly arranged on the wound surface of vocal cord. 8w post-operation, each layer of fibrous tissues were in hyperplasia and the blood vessels was thicken after on the wound surface of vocal cord, 12 w post-operation, a large number of collagens increases and were in group or fasciculation disorder. There were many irregular gaps in fibers, and blood vessels and glands become rare or disappear. ③The immunofluorescence showed LOX, HSP70 and HA were all localized in the cellular cytoplasm and nucleus. The expression levels were different at different postoperative time, and they were mainly relatively strong expressed in the inflammatory cells, vascular endothelial cells and the glands. ④The comparison of IOD values: The IOD values of LOX were different at different time periods ( P < 0.05). There were no significant differences in IOD of HSP70 between pre-operation and 12 w post-operation, but there are significant differences among other groups ( P < 0.05). There were no significant differences in IOD of HA between pre-operation and 12 w post-operation, but there were significant differences among other groups ( P < 0.01). ⑤The expression of LOX decreased 6h post-operation and increased 3-12 w post-operation. The expression of HSP70 post-operation reached the peak 6 h post-operation and decreased 3-12 w post-operation. The expression of HA decreased 6 h post-operation, increased to the peak 8 w post-operation, and decreased 8-12 w post-operation.⑥The transmission electron microscope showed 3 w to 8 w post-operation, in the intrinsic layer of the vocal cord, the fibroblasts were extremely active, the cells swelled, and the organelles were abundant.12 w post-operation, there were a small number of fibroblasts in the active state, and larger gaps between the fibers and fewer and thinner elastic fibers. Conclusion: CO₂ laser ablation of vocal cords under the Micro Post-Laryngoscope could establish reliable animal model of vocal fold scar, and 12 w reticular the vocal fold scars basically form. LOX, HSP70, HA play different roles at early, middle, and late stages in the vocal fold scar formation, and they can be used as a more sensitive index for vocal fold scar formation. Copyright© by the Editorial Department of Journal of Clinical Otorhinolaryngology Head and Neck Surgery.
A Joyful Noise: The Vocal Health of Worship Leaders and Contemporary Christian Singers.
Neto, Leon; Meyer, David
2017-03-01
Contemporary commercial music (CCM) is a term that encompasses many styles of music. A growing subset of CCM is contemporary Christian music, a genre that has outpaced other popular styles such as Latin, jazz, and classical music. Contemporary Christian singers (CCSs) and worship leaders (WLs) are a subset of CCM musicians that face unique vocal demands and risks. They typically lack professional training and often perform in acoustically disadvantageous venues with substandard sound reinforcement systems. The vocal needs and risks of these singers are not well understood, and because of this, their training and care may be suboptimal. The aim of the present study was to investigate the vocal health of this growing population and their awareness of standard vocal hygiene principles. An online questionnaire was designed and administered to participants in the Americas, Europe, Australia, and Asia. A total of 614 participants responded to the questionnaire, which is made available in English, Portuguese, and Spanish. Many participants reported vocal symptoms such as vocal fatigue (n = 213; 34.7%), tickling or choking sensation (n = 149; 24.3%), loss of upper range (n = 172; 28%), and complete loss of voice (n = 25; 4.1%). One third of the participants (n = 210; 34%) indicated that they do not warm up their voices before performances and over half of the participants (n = 319; 52%) have no formal vocal training. Results suggest that this population demonstrates low awareness of vocal hygiene principles, frequently experience difficulty with their voices, and may face elevated risk of vocal pathology. Future studies of this population may confirm the vocal risks that our preliminary findings suggest. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The vocal repertoire of Tibetan macaques (Macaca thibetana): A quantitative classification.
Bernstein, Sofia K; Sheeran, Lori K; Wagner, R Steven; Li, Jin-Hua; Koda, Hiroki
2016-09-01
Vocal repertoires are basic and essential components for describing vocal communication in animals. Studying the entire suite of vocal signals aids investigations on the variation of acoustic structure across social contexts, comparisons on the complexity of communication systems across taxa, and in exploration of the evolutionary origins of species-specific vocalizations. Here, we describe the vocal repertoire of the largest species in the macaque genus, Macaca thibetana. We extracted thirty acoustic parameters from call recordings. Post hoc validation through quantitative analyses of the a priori repertoire classified eleven call types: coo, squawk, squeal, noisy scream, growl, bark, compound squeak, leap coo, weeping, modulated tonal scream, and pant. In comparison to the rest of the genus, Tibetan macaques uttered a wider array of vocalizations in the context of copulations. Previous reports did not include modulated tonal screams and pants during harassment of copulatory dyads. Furthermore, in comparison to the rest of the genus, Tibetan macaque females emit acoustically distinct copulation calls. The vocal repertoire of Tibetan macaques contributes to the literature on the emergence of species-specific calls in the genus Macaca with potential insights from social, reproductive, and ecological comparisons across species. Am. J. Primatol. 78:937-949, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Tao, Chao; Regner, Michael F.; Zhang, Yu; Jiang, Jack J.
2014-01-01
Summary The relationship between the vocal fold elongation and the phonation threshold pressure (PTP) was experimentally and theoretically investigated. The PTP values of seventeen excised canine larynges with 0% to 15% bilateral vocal fold elongations in 5% elongation steps were measured using an excised larynx phonation system. It was found that twelve larynges exhibited a monotonic relationship between PTP and elongation; in these larynges, the 0% elongation condition had the lowest PTP. Five larynges exhibited a PTP minimum at 5% elongation. To provide a theoretical explanation of these phenomena, a two-mass model was modified to simulate vibration of the elongated vocal folds. Two pairs of longitudinal springs were used to represent the longitudinal elastin in the vocal folds. This model showed that when the vocal folds were elongated, the increased longitudinal tension would increase the PTP value and the increased vocal fold length would decrease the PTP value. The antagonistic effects contributed by these two factors were found to be able to cause either a monotonic or a non-monotonic relationship between PTP and elongation, which were consistent with experimental observations. Because PTP describes the ease of phonation, this study suggests that there may exist a nonzero optimal vocal fold elongation for the greatest ease for phonation in some larynges. PMID:25530744
The Neural Basis of Vocal Pitch Imitation in Humans.
Belyk, Michel; Pfordresher, Peter Q; Liotti, Mario; Brown, Steven
2016-04-01
Vocal imitation is a phenotype that is unique to humans among all primate species, and so an understanding of its neural basis is critical in explaining the emergence of both speech and song in human evolution. Two principal neural models of vocal imitation have emerged from a consideration of nonhuman animals. One hypothesis suggests that putative mirror neurons in the inferior frontal gyrus pars opercularis of Broca's area may be important for imitation. An alternative hypothesis derived from the study of songbirds suggests that the corticostriate motor pathway performs sensorimotor processes that are specific to vocal imitation. Using fMRI with a sparse event-related sampling design, we investigated the neural basis of vocal imitation in humans by comparing imitative vocal production of pitch sequences with both nonimitative vocal production and pitch discrimination. The strongest difference between these tasks was found in the putamen bilaterally, providing a striking parallel to the role of the analogous region in songbirds. Other areas preferentially activated during imitation included the orofacial motor cortex, Rolandic operculum, and SMA, which together outline the corticostriate motor loop. No differences were seen in the inferior frontal gyrus. The corticostriate system thus appears to be the central pathway for vocal imitation in humans, as predicted from an analogy with songbirds.
Social Vocalizations of Big Brown Bats Vary with Behavioral Context
Gadziola, Marie A.; Grimsley, Jasmine M. S.; Faure, Paul A.; Wenstrup, Jeffrey J.
2012-01-01
Bats are among the most gregarious and vocal mammals, with some species demonstrating a diverse repertoire of syllables under a variety of behavioral contexts. Despite extensive characterization of big brown bat (Eptesicus fuscus) biosonar signals, there have been no detailed studies of adult social vocalizations. We recorded and analyzed social vocalizations and associated behaviors of captive big brown bats under four behavioral contexts: low aggression, medium aggression, high aggression, and appeasement. Even limited to these contexts, big brown bats possess a rich repertoire of social vocalizations, with 18 distinct syllable types automatically classified using a spectrogram cross-correlation procedure. For each behavioral context, we describe vocalizations in terms of syllable acoustics, temporal emission patterns, and typical syllable sequences. Emotion-related acoustic cues are evident within the call structure by context-specific syllable types or variations in the temporal emission pattern. We designed a paradigm that could evoke aggressive vocalizations while monitoring heart rate as an objective measure of internal physiological state. Changes in the magnitude and duration of elevated heart rate scaled to the level of evoked aggression, confirming the behavioral state classifications assessed by vocalizations and behavioral displays. These results reveal a complex acoustic communication system among big brown bats in which acoustic cues and call structure signal the emotional state of a caller. PMID:22970247
Vocal Fry Use in Adult Female Speakers Exposed to Two Languages.
Gibson, Todd A; Summers, Connie; Walls, Sydney
2017-07-01
Several studies have identified the widespread use of vocal fry among American women. Popular explanations for this phenomenon appeal to sociolinguistic purposes that likely take significant time for second language users to learn. The objective of this study was to determine if mere exposure to this vocal register, as opposed to nuanced sociolinguistic motivations, might explain its widespread use. This study used multigroup within- and between-subjects design. Fifty-eight women from one of three language background groups (functionally monolingual in English, functionally monolingual in Spanish, and Spanish-English bilinguals) living in El Paso, Texas, repeated a list of nonwords conforming to the sound rules of English and another list of nonwords conforming to the sound rules of Spanish. Perceptual analysis identified each episode of vocal fry. There were no statistically significant differences between groups in their frequency of vocal fry use despite large differences in their amount of English-language exposure. All groups produced more vocal fry when repeating English than when repeating Spanish nonwords. Because the human perceptual system encodes for vocal qualities even after minimal language experience, the widespread use of vocal fry among female residents in the United States likely is owing to mere exposure to English rather than nuanced sociolinguistic motivations. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Mencio, Caitlin; Kuberan, Balagurunathan; Goller, Franz
2017-02-01
Neural control of complex vocal behaviors, such as birdsong and speech, requires integration of biomechanical nonlinearities through muscular output. Although control of airflow and tension of vibrating tissues are known functions of vocal muscles, it remains unclear how specific muscle characteristics contribute to specific acoustic parameters. To address this gap, we removed heparan sulfate chains using heparitinases to perturb neuromuscular transmission subtly in the syrinx of adult male zebra finches (Taeniopygia guttata). Infusion of heparitinases into ventral syringeal muscles altered their excitation threshold and reduced neuromuscular transmission changing their ability to modulate airflow. The changes in muscle activation dynamics caused a reduction in frequency modulation rates and elimination of many high-frequency syllables but did not alter the fundamental frequency of syllables. Sound amplitude was reduced and sound onset pressure was increased, suggesting a role of muscles in the induction of self-sustained oscillations under low-airflow conditions, thus enhancing vocal efficiency. These changes were reversed to preinfusion levels by 7 days after infusion. These results illustrate complex interactions between the control of airflow and tension and further define the importance of syringeal muscle in the control of a variety of acoustic song characteristics. In summary, the findings reported here show that altering neuromuscular transmission can lead to reversible changes to the acoustic structure of song. Understanding the full extent of muscle involvement in song production is critical in decoding the motor program for the production of complex vocal behavior, including our search for parallels between birdsong and human speech motor control. It is largely unknown how fine motor control of acoustic parameters is achieved in vocal organs. Subtle manipulation of syringeal muscle function was used to test how active motor control influences acoustic parameters. Slowed activation kinetics of muscles reduced frequency modulation and, unexpectedly, caused a distinct decrease in sound amplitude and increase in phonation onset pressure. These results show that active control enhances the efficiency of energy conversion in the syrinx. Copyright © 2017 the American Physiological Society.
Measurement of flow separation in a human vocal folds model
NASA Astrophysics Data System (ADS)
Šidlof, Petr; Doaré, Olivier; Cadot, Olivier; Chaigne, Antoine
2011-07-01
The paper provides experimental data on flow separation from a model of the human vocal folds. Data were measured on a four times scaled physical model, where one vocal fold was fixed and the other oscillated due to fluid-structure interaction. The vocal folds were fabricated from silicone rubber and placed on elastic support in the wall of a transparent wind tunnel. A PIV system was used to visualize the flow fields immediately downstream of the glottis and to measure the velocity fields. From the visualizations, the position of the flow separation point was evaluated using a semiautomatic procedure and plotted for different airflow velocities. The separation point position was quantified relative to the orifice width separately for the left and right vocal folds to account for flow asymmetry. The results indicate that the flow separation point remains close to the narrowest cross-section during most of the vocal fold vibration cycle, but moves significantly further downstream shortly prior to and after glottal closure.
Tashkeela: Novel corpus of Arabic vocalized texts, data for auto-diacritization systems.
Zerrouki, Taha; Balla, Amar
2017-04-01
Arabic diacritics are often missed in Arabic scripts. This feature is a handicap for new learner to read َArabic, text to speech conversion systems, reading and semantic analysis of Arabic texts. The automatic diacritization systems are the best solution to handle this issue. But such automation needs resources as diactritized texts to train and evaluate such systems. In this paper, we describe our corpus of Arabic diacritized texts. This corpus is called Tashkeela. It can be used as a linguistic resource tool for natural language processing such as automatic diacritics systems, dis-ambiguity mechanism, features and data extraction. The corpus is freely available, it contains 75 million of fully vocalized words mainly 97 books from classical and modern Arabic language. The corpus is collected from manually vocalized texts using web crawling process.
Qualification of a Quantitative Laryngeal Imaging System Using Videostroboscopy and Videokymography
Popolo, Peter S.; Titze, Ingo R.
2008-01-01
Objectives: We sought to determine whether full-cycle glottal width measurements could be obtained with a quantitative laryngeal imaging system using videostroboscopy, and whether glottal width and vocal fold length measurements were repeatable and reliable. Methods: Synthetic vocal folds were phonated on a laboratory bench, and dynamic images were obtained in repeated trials by use of videostroboscopy and videokymography (VKG) with an imaging system equipped with a 2-point laser projection device for measuring absolute dimensions. Video images were also obtained with an industrial videoscope system with a built-in laser measurement capability. Maximum glottal width and vocal fold length were compared among these 3 methods. Results: The average variation in maximum glottal width measurements between stroboscopic data and VKG data was 3.10%. The average variations in width measurements between the clinical system and the industrial system were 1.93% (stroboscopy) and 3.49% (VKG). The variations in vocal fold length were similarly small. The standard deviations across trials were 0.29 mm for width and 0.48 mm for length (stroboscopy), 0.18 mm for width (VKG), and 0.25 mm for width and 0.84 mm for length (industrial). Conclusions: For stable, periodic vibration, the full extent of the glottal width can be reliably measured with the quantitative videostroboscopy system. PMID:18646436
The Effect of Habitat Acoustics on Common Marmoset Vocal Signal Transmission
MORRILL, RYAN J.; THOMAS, A. WREN; SCHIEL, NICOLA; SOUTO, ANTONIO; MILLER, CORY T.
2013-01-01
Noisy acoustic environments present several challenges for the evolution of acoustic communication systems. Among the most significant is the need to limit degradation of spectro-temporal signal structure in order to maintain communicative efficacy. This can be achieved by selecting for several potentially complementary processes. Selection can act on behavioral mechanisms permitting signalers to control the timing and occurrence of signal production to avoid acoustic interference. Likewise, the signal itself may be the target of selection, biasing the evolution of its structure to comprise acoustic features that avoid interference from ambient noise or degrade minimally in the habitat. Here, we address the latter topic for common marmoset (Callithrix jacchus) long-distance contact vocalizations, known as phee calls. Our aim was to test whether this vocalization is specifically adapted for transmission in a species-typical forest habitat, the Atlantic forests of northeastern Brazil. We combined seasonal analyses of ambient habitat acoustics with experiments in which pure tones, clicks, and vocalizations were broadcast and rerecorded at different distances to characterize signal degradation in the habitat. Ambient sound was analyzed from intervals throughout the day and over rainy and dry seasons, showing temporal regularities across varied timescales. Broadcast experiment results indicated that the tone and click stimuli showed the typically inverse relationship between frequency and signaling efficacy. Although marmoset phee calls degraded over distance with marked predictability compared with artificial sounds, they did not otherwise appear to be specially designed for increased transmission efficacy or minimal interference in this habitat. We discuss these data in the context of other similar studies and evidence of potential behavioral mechanisms for avoiding acoustic interference in order to maintain effective vocal communication in common marmosets. PMID:23592313
The effect of habitat acoustics on common marmoset vocal signal transmission.
Morrill, Ryan J; Thomas, A Wren; Schiel, Nicola; Souto, Antonio; Miller, Cory T
2013-09-01
Noisy acoustic environments present several challenges for the evolution of acoustic communication systems. Among the most significant is the need to limit degradation of spectro-temporal signal structure in order to maintain communicative efficacy. This can be achieved by selecting for several potentially complementary processes. Selection can act on behavioral mechanisms permitting signalers to control the timing and occurrence of signal production to avoid acoustic interference. Likewise, the signal itself may be the target of selection, biasing the evolution of its structure to comprise acoustic features that avoid interference from ambient noise or degrade minimally in the habitat. Here, we address the latter topic for common marmoset (Callithrix jacchus) long-distance contact vocalizations, known as phee calls. Our aim was to test whether this vocalization is specifically adapted for transmission in a species-typical forest habitat, the Atlantic forests of northeastern Brazil. We combined seasonal analyses of ambient habitat acoustics with experiments in which pure tones, clicks, and vocalizations were broadcast and rerecorded at different distances to characterize signal degradation in the habitat. Ambient sound was analyzed from intervals throughout the day and over rainy and dry seasons, showing temporal regularities across varied timescales. Broadcast experiment results indicated that the tone and click stimuli showed the typically inverse relationship between frequency and signaling efficacy. Although marmoset phee calls degraded over distance with marked predictability compared with artificial sounds, they did not otherwise appear to be specially designed for increased transmission efficacy or minimal interference in this habitat. We discuss these data in the context of other similar studies and evidence of potential behavioral mechanisms for avoiding acoustic interference in order to maintain effective vocal communication in common marmosets. © 2013 Wiley Periodicals, Inc.
González-García, Nadia; González, Martha A; Rendón, Pablo L
2016-07-15
Relationships between musical pitches are described as either consonant, when associated with a pleasant and harmonious sensation, or dissonant, when associated with an inharmonious feeling. The accurate singing of musical intervals requires communication between auditory feedback processing and vocal motor control (i.e. audio-vocal integration) to ensure that each note is produced correctly. The objective of this study is to investigate the neural mechanisms through which trained musicians produce consonant and dissonant intervals. We utilized 4 musical intervals (specifically, an octave, a major seventh, a fifth, and a tritone) as the main stimuli for auditory discrimination testing, and we used the same interval tasks to assess vocal accuracy in a group of musicians (11 subjects, all female vocal students at conservatory level). The intervals were chosen so as to test for differences in recognition and production of consonant and dissonant intervals, as well as narrow and wide intervals. The subjects were studied using fMRI during performance of the interval tasks; the control condition consisted of passive listening. Singing dissonant intervals as opposed to singing consonant intervals led to an increase in activation in several regions, most notably the primary auditory cortex, the primary somatosensory cortex, the amygdala, the left putamen, and the right insula. Singing wide intervals as opposed to singing narrow intervals resulted in the activation of the right anterior insula. Moreover, we also observed a correlation between singing in tune and brain activity in the premotor cortex, and a positive correlation between training and activation of primary somatosensory cortex, primary motor cortex, and premotor cortex during singing. When singing dissonant intervals, a higher degree of training correlated with the right thalamus and the left putamen. Our results indicate that singing dissonant intervals requires greater involvement of neural mechanisms associated with integrating external feedback from auditory and sensorimotor systems than singing consonant intervals, and it would then seem likely that dissonant intervals are intoned by adjusting the neural mechanisms used for the production of consonant intervals. Singing wide intervals requires a greater degree of control than singing narrow intervals, as it involves neural mechanisms which again involve the integration of internal and external feedback. Copyright © 2016 Elsevier B.V. All rights reserved.
Memory-dependent adjustment of vocal response latencies in a territorial songbird.
Geberzahn, Nicole; Hultsch, Henrike; Todt, Dietmar
2013-06-01
Vocal interactions in songbirds can be used as a model system to investigate the interplay of intrinsic singing programmes (e.g. influences from vocal memories) and external variables (e.g. social factors). When characterizing vocal interactions between territorial rivals two aspects are important: (1) the timing of songs in relation to the conspecific's singing and (2) the use of a song pattern that matches the rival's song. Responses in both domains can be used to address a territorial rival. This study is the first to investigate the relation of the timing of vocal responses to (1) the vocal memory of a responding subject and (2) the selection of the song pattern that the subject uses as a response. To this end, we conducted interactive playback experiments with adult nightingales (Luscinia megarhynchos) that had been hand-reared and tutored in the laboratory. We analysed the subjects' vocal response latencies towards broadcast playback stimuli that they either had in their own vocal repertoire (songs shared with playback) or that they had not heard before (unknown songs). Likewise, we compared vocal response latencies between responses that matched the stimulus song and those that did not. Our findings showed that the latency of singing in response to the playback was shorter for shared versus unknown song stimuli when subjects overlapped the playback stimuli with their own song. Moreover birds tended to overlap faster when vocally matching the stimulus song rather than when replying with a non-matching song type. We conclude that memory of song patterns influenced response latencies and discuss possible mechanisms. Copyright © 2012 Elsevier Ltd. All rights reserved.
Delgadillo, José Alberto; Vielma, Jesús; Hernandez, Horacio; Flores, José Alfredo; Duarte, Gerardo; Fernández, Ilda Graciela; Keller, Matthieu; Gelez, Hélène
2012-09-01
We investigated whether live vocalizations emitted by bucks interacting with anestrous females stimulate secretion of LH, estrous behavior and ovulation in anestrous goats. In experiment 1, bucks rendered sexually active by exposure to long days followed by natural photoperiod were exposed in a light-proof-building to five anestrous females. Buck vocalizations were reproduced through a microphone-amplifier-loudspeaker system to an open pen where one group of goats (n=6) was exposed for 10 days to these live vocalizations. Another group of females (n=6) was isolated from males and vocalizations. The proportion of goats displaying estrous behavior was significantly higher in females exposed to buck vocalizations than in females isolated from males. The proportion of goats that ovulated did not differ between the 2 groups (exposed to males versus isolated). In experiment 2, female goats that either had previous contact with males (n=7), or no previous contact with males (n=7) were exposed to live buck vocalizations, reproduced as described in experiment 1, for 5 days. The number and amplitude of LH pulses did not differ between groups before exposition to buck vocalizations. Five days of exposure to male vocalizations significantly increased LH pulsatility only in females that had previous contact with males, while LH pulse amplitude was not modified. We concluded that live buck vocalizations can stimulate estrous behavior and LH secretion in goats if they have had previous contact with bucks. Copyright © 2012 Elsevier Inc. All rights reserved.
Kim, Eun Na; Sung, Myung Whun; Kwon, Tack-Kyun; Cho, Yong Woo; Kwon, Seong Keun
2016-01-01
Vocal fold paralysis results from various etiologies and can induce voice changes, swallowing complications, and issues with aspiration. Vocal fold paralysis is typically managed using injection laryngoplasty with fat or synthetic polymers. Injection with autologous fat has shown excellent biocompatibility. However, it has several disadvantages such as unpredictable resorption rate, morbidities associated with liposuction procedure which has to be done in operating room under general anesthesia. Human adipose-derived extracellular matrix (ECM) grafts have been reported to form new adipose tissue and have greater biostability than autologous fat graft. Here, we present an injectable hydrogel that is constructed from adipose tissue derived soluble extracellular matrix (sECM) and methylcellulose (MC) for use in vocal fold augmentation. Human sECM derived from adipose tissue was extracted using two major steps—ECM was isolated from human adipose tissue and was subsequently solubilized. Injectable sECM/MC hydrogels were prepared by blending of sECM and MC. Sustained vocal fold augmentation and symmetric vocal fold vibration were accomplished by the sECM/MC hydrogel in paralyzed vocal fold which were confirmed by laryngoscope, histology and a high-speed imaging system. There were increased number of collagen fibers and fatty granules at the injection site without significant inflammation or fibrosis. Overall, these results indicate that the sECM/MC hydrogel can enhance vocal function in paralyzed vocal folds without early resorption and has potential as a promising material for injection laryngoplasty for stable vocal fold augmentation which can overcome the shortcomings of autologous fat such as unpredictable duration and morbidity associated with the fat harvest. PMID:27768757
Oral and vocal fold diadochokinesis in dysphonic women
LOUZADA, Talita; BERALDINELLE, Roberta; BERRETIN-FELIX, Giédre; BRASOLOTTO, Alcione Ghedini
2011-01-01
The evaluation of oral and vocal fold diadochokinesis (DDK) in individuals with voice disorders may contribute to the understanding of factors that affect the balanced vocal production. Scientific studies that make use of this assessment tool support the knowledge advance of this area, reflecting the development of more appropriate therapeutic planning. Objective To compare the results of oral and vocal fold DDK in dysphonic women and in women without vocal disorders. Material and methods For this study, 28 voice recordings of women from 19 to 54 years old, diagnosed with dysphonia and submitted to a voice assessment from speech pathologist and otorhinolaryngologist, were used. The control group included 30 nondysphonic women evaluated in prior research from normal adults. The analysis parameters like number and duration of emissions, as well as the regularity of the repetition of syllables "pa", "ta", "ka" and the vowels "a" and "i," were provided by the Advanced Motor Speech Profile program (MSP) Model-5141, version-2.5.2 (KayPentax). The DDK sequence "pataka" was analyzed quantitatively through the Sound Forge 7.0 program, as well as manually with the audio-visual help of sound waves. Average values of oral and vocal fold DDK dysphonic and nondysphonic women were compared using the "t Student" test and were considered significant when p<0.05. Results The findings showed no significant differences between populations; however, the coefficient of variation of period (CvP) and jitter of period (JittP) average of the "ka," "a" and "i" emissions, respectively, were higher in dysphonic women (CvP=10.42%, 12.79%, 12.05%; JittP=2.05%, 6.05%, 3.63%) compared to the control group (CvP=8.86%; 10.95%, 11.20%; JittP=1.82%, 2.98%, 3.15%). Conclusion Although the results do not indicate any difficulties in oral and laryngeal motor control in the dysphonic group, the largest instability in vocal fold DDK in the experimental group should be considered, and studies of this ability in individuals with communication disorders must be intensified. PMID:22230989
Woynaroski, Tiffany; Oller, D Kimbrough; Keceli-Kaysili, Bahar; Xu, Dongxin; Richards, Jeffrey A; Gilkerson, Jill; Gray, Sharmistha; Yoder, Paul
2017-03-01
Theory and research suggest that vocal development predicts "useful speech" in preschoolers with autism spectrum disorder (ASD), but conventional methods for measurement of vocal development are costly and time consuming. This longitudinal correlational study examines the reliability and validity of several automated indices of vocalization development relative to an index derived from human coded, conventional communication samples in a sample of preverbal preschoolers with ASD. Automated indices of vocal development were derived using software that is presently "in development" and/or only available for research purposes and using commercially available Language ENvironment Analysis (LENA) software. Indices of vocal development that could be derived using the software available for research purposes: (a) were highly stable with a single day-long audio recording, (b) predicted future spoken vocabulary to a degree that was nonsignificantly different from the index derived from conventional communication samples, and (c) continued to predict future spoken vocabulary even after controlling for concurrent vocabulary in our sample. The score derived from standard LENA software was similarly stable, but was not significantly correlated with future spoken vocabulary. Findings suggest that automated vocal analysis is a valid and reliable alternative to time intensive and expensive conventional communication samples for measurement of vocal development of preverbal preschoolers with ASD in research and clinical practice. Autism Res 2017, 10: 508-519. © 2016 International Society for Autism Research, Wiley Periodicals, Inc. © 2016 International Society for Autism Research, Wiley Periodicals, Inc.
Effects of long-acting beta adrenergic agonists on vocal fold ion transport.
Sivasankar, Mahalakshmi; Blazer-Yost, Bonnie
2009-03-01
Inhaled medications prescribed for the hypersensitive airway typically combine corticosteroids and long-acting beta2 adrenergic agonists (LABAs). The phonatory side effects of these combination treatments are widely recognized. However, there is limited understanding of the physiological changes induced by these medications that underlie the phonatory side effects. The objective of this study was to investigate the distinct effects of corticosteroids and LABAs on vocal fold mucosal physiology. Understanding the physiological changes to the vocal folds after corticosteroid and LABA treatments is necessary to prevent the prevalent vocal decrement associated with these medications. Experimental in vitro design with treatment and control groups. Native porcine vocal fold mucosae (N = 38) were exposed to corticosteroid or LABA treatments. Ion transport was measured continuously at baseline and after treatment. To quantify the nature of ion transport, vocal folds were also treated with chloride and sodium channel inhibitors. Corticosteroid treatment did not alter ion transport. Conversely, exposure to LABAs significantly increased ion transport. This increase in ion transport was transient, observed immediately after treatment in all tissue and associated with increased chloride secretion. The distinct effects of corticosteroids and LABAs on vocal fold physiology have not been examined to date. This study demonstrates that short-term treatment with LABAs, but not corticosteroids, significantly increases ion transport. These findings suggest that one underlying physiological mechanism for phonatory changes associated with inhaled treatments may be related to acute alterations in vocal fold ion transport and surface hydration.
Infant Vocal-Motor Coordination: Precursor to the Gesture-Speech System?
ERIC Educational Resources Information Center
Iverson, Jana M.; Fagan, Mary K.
2004-01-01
This study was designed to provide a general picture of infant vocal-motor coordination and test predictions generated by Iverson and Thelen's (1999) model of the development of the gesture-speech system. Forty-seven 6- to 9-month-old infants were videotaped with a primary caregiver during rattle and toy play. Results indicated an age-related…
Meredith, Liza; Peterson, Carol B.; Frazier, Patricia A.
2015-01-01
Objectives Although an association between psychosocial distress (depression, anxiety, somatization, and perceived stress) and voice disorders has been observed, little is known about the relationship between distress and patient-reported voice handicap. Further, the psychological mechanisms underlying this relationship are poorly understood. Perceived control plays an important role in distress associated with other medical disorders. The objectives of this study were to 1) characterize the relationship between distress and patient-reported voice handicap and 2) examine the role of perceived control in this relationship. Study Design Cross-sectional study in tertiary care academic voice clinic. Methods Distress, perceived stress, voice handicap, and perceived control were measured using established assessment scales. Association was measured with Pearson’s correlation coefficient; moderation was assessed using multiple hierarchical regression. Results 533 patients enrolled. 34% met criteria for clinically significant distress (i.e., depression, anxiety, and/or somatization). A weak association (r=0.13, p=0.003) was observed between severity of psychosocial distress and vocal handicap. Present perceived control was inversely associated with distress (r=−0.41, p<0.0001), stress (r=−0.30, p<0.0001), and voice handicap (r=−0.30, p<0.0001). The relationship between voice handicap and psychosocial distress was moderated by perceived control (b for interaction term −0.15, p<0.001); greater vocal handicap was associated with greater distress in patients with low perceived control. Conclusions Severity of distress and vocal handicap were positively related, and the relation between them was moderated by perceived control. Vocal handicap was more related to distress among those with low perceived control; targeting this potential mechanism may facilitate new approaches for improved care. PMID:25795347
Howe, M S; McGowan, R S
2009-11-01
An analysis is made of the nonlinear interactions between flow in the subglottal vocal tract and glottis, sound waves in the subglottal system and a mechanical model of the vocal folds. The mean flow through the system is produced by a nominally steady contraction of the lungs, and mechanical experiments frequently involve a 'lung cavity' coupled to an experimental subglottal tube of arbitrary or ill-defined effective length L, on the basis that the actual value of L has little or no influence on excitation of the vocal folds. A simple, self-exciting single mass mathematical model of the vocal folds is used to investigate the sound generated within the subglottal domain and the unsteady volume flux from the glottis for experiments where it is required to suppress feedback of sound from the supraglottal vocal tract. In experiments where the assumed absorption of sound within the sponge-like interior of the lungs is small, the influence of changes in L can be very significant: when the subglottal tube behaves as an open-ended resonator (when L is as large as half the acoustic wavelength) there is predicted to be a mild increase in volume flux magnitude and a small change in waveform. However, the strong appearance of second harmonics of the acoustic field is predicted at intermediate lengths, when L is roughly one quarter of the acoustic wavelength. In cases of large lung damping, however, only modest changes in the volume flux are predicted to occur with variations in L.
Convergent transcriptional specializations in the brains of humans and song-learning birds
Pfenning, Andreas R.; Hara, Erina; Whitney, Osceola; Rivas, Miriam V.; Wang, Rui; Roulhac, Petra L.; Howard, Jason T.; Wirthlin, Morgan; Lovell, Peter V.; Ganapathy, Ganeshkumar; Mouncastle, Jacquelyn; Moseley, M. Arthur; Thompson, J. Will; Soderblom, Erik J.; Iriki, Atsushi; Kato, Masaki; Gilbert, M. Thomas P.; Zhang, Guojie; Bakken, Trygve; Bongaarts, Angie; Bernard, Amy; Lein, Ed; Mello, Claudio V.; Hartemink, Alexander J.; Jarvis, Erich D.
2015-01-01
Song-learning birds and humans share independently evolved similarities in brain pathways for vocal learning that are essential for song and speech and are not found in most other species. Comparisons of brain transcriptomes of song-learning birds and humans relative to vocal nonlearners identified convergent gene expression specializations in specific song and speech brain regions of avian vocal learners and humans. The strongest shared profiles relate bird motor and striatal song-learning nuclei, respectively, with human laryngeal motor cortex and parts of the striatum that control speech production and learning. Most of the associated genes function in motor control and brain connectivity. Thus, convergent behavior and neural connectivity for a complex trait are associated with convergent specialized expression of multiple genes. PMID:25504733
Error-dependent modulation of speech-induced auditory suppression for pitch-shifted voice feedback.
Behroozmand, Roozbeh; Larson, Charles R
2011-06-06
The motor-driven predictions about expected sensory feedback (efference copies) have been proposed to play an important role in recognition of sensory consequences of self-produced motor actions. In the auditory system, this effect was suggested to result in suppression of sensory neural responses to self-produced voices that are predicted by the efference copies during vocal production in comparison with passive listening to the playback of the identical self-vocalizations. In the present study, event-related potentials (ERPs) were recorded in response to upward pitch shift stimuli (PSS) with five different magnitudes (0, +50, +100, +200 and +400 cents) at voice onset during active vocal production and passive listening to the playback. Results indicated that the suppression of the N1 component during vocal production was largest for unaltered voice feedback (PSS: 0 cents), became smaller as the magnitude of PSS increased to 200 cents, and was almost completely eliminated in response to 400 cents stimuli. Findings of the present study suggest that the brain utilizes the motor predictions (efference copies) to determine the source of incoming stimuli and maximally suppresses the auditory responses to unaltered feedback of self-vocalizations. The reduction of suppression for 50, 100 and 200 cents and its elimination for 400 cents pitch-shifted voice auditory feedback support the idea that motor-driven suppression of voice feedback leads to distinctly different sensory neural processing of self vs. non-self vocalizations. This characteristic may enable the audio-vocal system to more effectively detect and correct for unexpected errors in the feedback of self-produced voice pitch compared with externally-generated sounds.
Li, Ding; Li, Meng; Xia, Siwen; Zheng, Hongliang
2011-01-01
Objective To evaluate the long-term efficacy of delayed laryngeal reinnervation using the main branch of the ansa cervicalis in treatment of unilateral vocal fold paralysis (UVFP) caused by thyroid surgery. Summary of Background Data UVFP remains a serious complication of thyroid surgery. Up to now, a completely satisfactory surgical treatment of UVFP has been elusive. Methods From Jan. 1996 to Jan. 2008, a total of 237 UVFP patients who underwent ansa cervicalis main branch-to-recurrent laryngeal nerve (RLN) anastomosis were enrolled as UVFP group; another 237 age- and gender-matched normal subjects served as control group. Videostroboscopy, vocal function assessment (acoustic analysis, perceptual evaluation and maximum phonation time), and electromyography were performed preoperatively and postoperatively. The mean follow-up period was 5.2±2.7 years, ranging from 2 to 12 years. Results Analysis of videostroboscopic findings indicated that the glottic closure, vocal fold edge, vocal fold position, phase symmetry and regularity were significantly improved in the UVFP group (P<0.001, postoperative vs. preoperative). The postoperative parameters of vocal function were also significantly improved in the UVFP group (P<0.001) and showed no statistical differences compared to the control group (P>0.05, respectively). Postoperative laryngeal electromyography confirmed successful reinnervation of laryngeal muscle. Conclusions Delayed laryngeal reinnervation with the main branch of ansa cervicalis is a feasible and effective approach for treatment of thyroid surgery-related UVFP; it can restore the physiological laryngeal phonatory function to the normal or a nearly normal voice quality. PMID:21559458
[Case-control survey on risk factors of benign vocal fold lesions].
Huang, Dong-Yan; Yang, Wei-Yan; Yu, Ping; He, Yao; Han, Dong-Yi
2008-02-01
To investigate the risk factors that may relate with benign vocal fold lesions including vocal fold nodule, vocal fold polyp, chronic laryngitis and Reinke's edema In present series, 321 cases who were performed laryngoscope were invited to participate the survey. Among them 168 cases with benign vocal fold lesions composed the case group. Another 153 cases with normal larynx composed the control group. Each case were undertook the same questionnaire. Logistic regression analysis was preformed to investigate the possible risk factors. The result demonstrated the occurring of benign vocal fold lesions positively correlated to five factors, including occupation, work or residence environment noise, alcohol-consuming, voice-using hours per day and abuse of voice. Occupations with intensive voice-use were more vulnerable to developing these disorders. Occurring risk of occupations type II with moderate voice-use was 1.934 times than that of occupations type I with lesser voice-use (OR = 1.934). And risk of occupations type III with upper voice-use was 2.633 times than that of type I. Risk raised 1.302 times with each more hour of voice use per day. OR of the following factors of voice abuse, environment noise, alcohol-consuming was 4.744, 2.115 and 2.177, respectively. The result suggested that people should abstain from alcohol, lowering the environment noise, prevent overuse and abuse of voice in order to decrease the prevalence of these disorders, which is especially important for the professional voice users, e. g. teachers or managers. The essential therapy for these disorders is to correct bad phonation habits.
Acute exposure to vibration is an apoptosis-inducing stimulus in the vocal fold epithelium.
Novaleski, Carolyn K; Kimball, Emily E; Mizuta, Masanobu; Rousseau, Bernard
2016-10-01
Clinical voice disorders pose significant communication-related challenges to patients. The purpose of this study was to quantify the rate of apoptosis and tumor necrosis factor-alpha (TNF-α) signaling in vocal fold epithelial cells in response to increasing time-doses and cycle-doses of vibration. 20 New Zealand white breeder rabbits were randomized to three groups of time-doses of vibration exposure (30, 60, 120min) or a control group (120min of vocal fold adduction and abduction). Estimated cycle-doses of vocal fold vibration were extrapolated based on mean fundamental frequency. Laryngeal tissue specimens were evaluated for apoptosis and gene transcript and protein levels of TNF-α. Results revealed that terminal deoxynucleotidyl transferase dUTP nick end labeling (TUNEL) staining was significantly higher after 120min of vibration compared to the control. Transmission electron microscopy (TEM) revealed no significant effect of time-dose on the mean area of epithelial cell nuclei. Extrapolated cycle-doses of vibration exposure were closely related to experimental time-dose conditions, although no significant correlations were observed with TUNEL staining or mean area of epithelial cell nuclei. TUNEL staining was positively correlated with TNF-α protein expression. Our findings suggest that apoptosis can be induced in the vocal fold epithelium after 120min of modal intensity phonation. In contrast, shorter durations of vibration exposure do not result in apoptosis signaling. However, morphological features of apoptosis are not observed using TEM. Future studies are necessary to examine the contribution of abnormal apoptosis to vocal fold diseases. Copyright © 2016 Elsevier Ltd. All rights reserved.
Acute Exposure to Vibration is an Apoptosis-Inducing Stimulus in the Vocal Fold Epithelium
Novaleski, Carolyn K.; Kimball, Emily E.; Mizuta, Masanobu; Rousseau, Bernard
2016-01-01
Clinical voice disorders pose significant communication-related challenges to patients. The purpose of this study was to quantify the rate of apoptosis and tumor necrosis factor-alpha (TNF-α) signaling in vocal fold epithelial cells in response to increasing time-doses and cycle-doses of vibration. 20 New Zealand white breeder rabbits were randomized to three groups of time-doses of vibration exposure (30, 60, 120 minutes) or a control group (120 minutes of vocal fold adduction and abduction). Estimated cycle-doses of vocal fold vibration were extrapolated based on mean fundamental frequency. Laryngeal tissue specimens were evaluated for apoptosis and gene transcript and protein levels of TNF-α. Results revealed that terminal deoxynucleotidyl transferase dUTP nick end labeling (TUNEL) staining was significantly higher after 120 minutes of vibration compared to the control. Transmission electron microscopy (TEM) revealed no significant effect of time-dose on the mean area of epithelial cell nuclei. Extrapolated cycle-doses of vibration exposure were closely related to experimental time-dose conditions, although no significant correlations were observed with TUNEL staining or mean area of epithelial cell nuclei. TUNEL staining was positively correlated with TNF-α protein expression. Our findings suggest that apoptosis can be induced in the vocal fold epithelium after 120 minutes of modal intensity phonation. In contrast, shorter durations of vibration exposure do not result in apoptosis signaling. However, morphological features of apoptosis are not observed using TEM. Future studies are necessary to examine the contribution of abnormal apoptosis to vocal fold diseases. PMID:27577014
The evolution of speech: a comparative review.
Fitch
2000-07-01
The evolution of speech can be studied independently of the evolution of language, with the advantage that most aspects of speech acoustics, physiology and neural control are shared with animals, and thus open to empirical investigation. At least two changes were necessary prerequisites for modern human speech abilities: (1) modification of vocal tract morphology, and (2) development of vocal imitative ability. Despite an extensive literature, attempts to pinpoint the timing of these changes using fossil data have proven inconclusive. However, recent comparative data from nonhuman primates have shed light on the ancestral use of formants (a crucial cue in human speech) to identify individuals and gauge body size. Second, comparative analysis of the diverse vertebrates that have evolved vocal imitation (humans, cetaceans, seals and birds) provides several distinct, testable hypotheses about the adaptive function of vocal mimicry. These developments suggest that, for understanding the evolution of speech, comparative analysis of living species provides a viable alternative to fossil data. However, the neural basis for vocal mimicry and for mimesis in general remains unknown.
The effects of noncontingent music and response interruption and redirection on vocal stereotypy.
Gibbs, Ashley R; Tullis, Christopher A; Thomas, Raven; Elkins, Brittany
2018-06-17
Vocal stereotypy is a commonly occurring challenging behavior in children with autism spectrum disorder (ASD) that is frequently maintained by automatic reinforcement and often interferes with skill acquisition. Matched stimulation (MS), and response interruption and redirection (RIRD) are two interventions that have been demonstrated to be effective in reducing the occurrence of vocal stereotypy with participants with ASD. The current study sought to determine if the combination of MS (noncontingent music) and RIRD was more effective at reducing vocal stereotypy than RIRD alone and if the parents of children with ASD found the combination of MS and RIRD more socially valid than RIRD alone. The results suggested that the combined intervention resulted in greater suppression of vocal stereotypy and increased occurrences of on-task behavior in both participants. Additionally, RIRD required fewer implementations and had a shorter duration when combined with MS. Results suggest that the combination of MS and RIRD may be an effective intervention outside of highly controlled settings. © 2018 Society for the Experimental Analysis of Behavior.
Vocal fold fixation caused by penetration of a high-velocity steel projectile.
Guo, Chau-Shiang; Ho, Chi-Kung; Hsu, Ruey-Fen
2014-01-01
Vocal fold fixation as a result of trauma caused by a foreign body is rare. We report a unique case of vocal fold fixation caused by traumatic penetration of a shard of steel in a 31-year-old steelworker. While the patient was at work, an airborne projectile suddenly pierced his neck and entered his larynx, causing progressive hoarseness and dyspnea. Flexible laryngoscopy detected no obvious foreign body, but it did reveal that the right vocal fold had become immobile. Computed tomography revealed that a 2.5-cm sliver of steel had become impacted in the right cricoarytenoid joint, which made the arytenoid cartilage unable to rotate. An emergency tracheostomy was performed with local anesthesia to construct a functioning airway, and then rigid laryngoscopy was performed with general anesthesia. The foreign body was removed with the assistance of a microscope and microscissors. Postoperatively, the patient immediately regained control of his right vocal fold, and he experienced no permanent injury.
Popular song and lyrics synchronization and its application to music information retrieval
NASA Astrophysics Data System (ADS)
Chen, Kai; Gao, Sheng; Zhu, Yongwei; Sun, Qibin
2006-01-01
An automatic synchronization system of the popular song and its lyrics is presented in the paper. The system includes two main components: a) automatically detecting vocal/non-vocal in the audio signal and b) automatically aligning the acoustic signal of the song with its lyric using speech recognition techniques and positioning the boundaries of the lyrics in its acoustic realization at the multiple levels simultaneously (e.g. the word / syllable level and phrase level). The GMM models and a set of HMM-based acoustic model units are carefully designed and trained for the detection and alignment. To eliminate the severe mismatch due to the diversity of musical signal and sparse training data available, the unsupervised adaptation technique such as maximum likelihood linear regression (MLLR) is exploited for tailoring the models to the real environment, which improves robustness of the synchronization system. To further reduce the effect of the missed non-vocal music on alignment, a novel grammar net is build to direct the alignment. As we know, this is the first automatic synchronization system only based on the low-level acoustic feature such as MFCC. We evaluate the system on a Chinese song dataset collecting from 3 popular singers. We obtain 76.1% for the boundary accuracy at the syllable level (BAS) and 81.5% for the boundary accuracy at the phrase level (BAP) using fully automatic vocal/non-vocal detection and alignment. The synchronization system has many applications such as multi-modality (audio and textual) content-based popular song browsing and retrieval. Through the study, we would like to open up the discussion of some challenging problems when developing a robust synchronization system for largescale database.
Cohen, Alex S; Dinzeo, Thomas J; Donovan, Neila J; Brown, Caitlin E; Morrison, Sean C
2015-03-30
Vocal expression reflects an integral component of communication that varies considerably within individuals across contexts and is disrupted in a range of neurological and psychiatric disorders. There is reason to suspect that variability in vocal expression reflects, in part, the availability of "on-line" resources (e.g., working memory, attention). Thus, understanding vocal expression is a potentially important biometric index of information processing, not only across but within individuals over time. A first step in this line of research involves establishing a link between vocal expression and information processing systems in healthy adults. The present study employed a dual attention experimental task where participants provided natural speech while simultaneously engaged in a baseline, medium or high nonverbal processing-load task. Objective, automated, and computerized analysis was employed to measure vocal expression in 226 adults. Increased processing load resulted in longer pauses, fewer utterances, greater silence overall and less variability in frequency and intensity levels. These results provide compelling evidence of a link between information processing resources and vocal expression, and provide important information for the development of an automated, inexpensive and uninvasive biometric measure of information processing. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Geographical variation of St. Lucia Parrot flight vocalizations
Kleeman, Patrick M.; Gilardi, James D.
2005-01-01
Parrots are vocal learners and many species of parrots are capable of learning new calls, even as adults. This capability gives parrots the potential to develop communication systems that can vary dramatically over space. St. Lucia Parrot (Amazona versicolor) flight vocalizations were examined for geographic variation between four different sites on the island of St. Lucia. Spectrographic cross-correlation analysis of a commonly used flight vocalization, the p-chow call, demonstrated quantitative differences between sites. Additionally, the similarity of p-chows decreased as the distance between sites increased. Flight call repertoires also differed among sites; parrots at the Des Bottes and Quilesse sites each used one flight call unique to those sites, while parrots at the Barre de L'Isle site used a flight call that Quilesse parrots gave only while perched. It is unclear whether the vocal variation changed clinally with distance, or whether there were discrete dialect boundaries as in a congener, the Yellow-naped Parrot (Amazona auropalliata, Wright 1996). The geographical scale over which the St. Lucia Parrot's vocal variation occurred was dramatically smaller than that of the Yellow-naped Parrot. Similar patterns of fine-scale vocal variation may be more widespread among other parrot species in the Caribbean than previously documented.
Cortical representations of communication sounds.
Heiser, Marc A; Cheung, Steven W
2008-10-01
This review summarizes recent research into cortical processing of vocalizations in animals and humans. There has been a resurgent interest in this topic accompanied by an increased number of studies using animal models with complex vocalizations and new methods in human brain imaging. Recent results from such studies are discussed. Experiments have begun to reveal the bilateral cortical fields involved in communication sound processing and the transformations of neural representations that occur among those fields. Advances have also been made in understanding the neuronal basis of interaction between developmental exposures and behavioral experiences with vocalization perception. Exposure to sounds during the developmental period produces large effects on brain responses, as do a variety of specific trained tasks in adults. Studies have also uncovered a neural link between the motor production of vocalizations and the representation of vocalizations in cortex. Parallel experiments in humans and animals are answering important questions about vocalization processing in the central nervous system. This dual approach promises to reveal microscopic, mesoscopic, and macroscopic principles of large-scale dynamic interactions between brain regions that underlie the complex phenomenon of vocalization perception. Such advances will yield a greater understanding of the causes, consequences, and treatment of disorders related to speech processing.
Hypnosis for Asthma and Vocal Cord Dysfunction in a Patient With Autism.
Kaslovsky, Robert; Gottsegen, David
2015-10-01
Wheezing in children often is the result of asthma, but vocal cord dysfunction (VCD) may cause stridor or sounds that sometimes are misattributed to the wheezing of asthma. The frequent comorbidity of asthma and VCD also adds to the difficulty in making a clear diagnosis. The challenges of evaluating and treating wheezing are complicated further in children with developmental disorders, such as autism, because of the difficulties of obtaining an adequate history and assessing the clinical response to treatment. This article presents a patient with multiple psychiatric problems, including autism, with severe recurrent wheezing as a result of vocal cord dysfunction and asthma. Hypnosis has previously proven efficacious for treating vocal cord dysfunction, and in this case, hypnotic techniques were major factors in successful symptom control.
McGettigan, Carolyn; Eisner, Frank; Agnew, Zarinah K; Manly, Tom; Wisbey, Duncan; Scott, Sophie K
2014-01-01
Historically, the study of human identity perception has focused on faces, but the voice is also central to our expressions and experiences of identity (P. Belin, Fecteau, & Bedard, 2004). Our voices are highly flexible and dynamic; talkers speak differently depending on their health, emotional state, and the social setting, as well as extrinsic factors such as background noise. However, to date, there have been no studies of the neural correlates of identity modulation in speech production. In the current fMRI experiment, we measured the neural activity supporting controlled voice change in adult participants performing spoken impressions. We reveal that deliberate modulation of vocal identity recruits the left anterior insula and inferior frontal gyrus, supporting the planning of novel articulations. Bilateral sites in posterior superior temporal/inferior parietal cortex and a region in right mid/anterior superior temporal sulcus showed greater responses during the emulation of specific vocal identities than for impressions of generic accents. Using functional connectivity analyses, we describe roles for these three sites in their interactions with the brain regions supporting speech planning and production. Our findings mark a significant step toward understanding the neural control of vocal identity, with wider implications for the cognitive control of voluntary motor acts. PMID:23691984
Speech recognition for embedded automatic positioner for laparoscope
NASA Astrophysics Data System (ADS)
Chen, Xiaodong; Yin, Qingyun; Wang, Yi; Yu, Daoyin
2014-07-01
In this paper a novel speech recognition methodology based on Hidden Markov Model (HMM) is proposed for embedded Automatic Positioner for Laparoscope (APL), which includes a fixed point ARM processor as the core. The APL system is designed to assist the doctor in laparoscopic surgery, by implementing the specific doctor's vocal control to the laparoscope. Real-time respond to the voice commands asks for more efficient speech recognition algorithm for the APL. In order to reduce computation cost without significant loss in recognition accuracy, both arithmetic and algorithmic optimizations are applied in the method presented. First, depending on arithmetic optimizations most, a fixed point frontend for speech feature analysis is built according to the ARM processor's character. Then the fast likelihood computation algorithm is used to reduce computational complexity of the HMM-based recognition algorithm. The experimental results show that, the method shortens the recognition time within 0.5s, while the accuracy higher than 99%, demonstrating its ability to achieve real-time vocal control to the APL.
Quantitative evaluation of the voice range profile in patients with voice disorder.
Ikeda, Y; Masuda, T; Manako, H; Yamashita, H; Yamamoto, T; Komiyama, S
1999-01-01
In 1953, Calvet first displayed the fundamental frequency (pitch) and sound pressure level (intensity) of a voice on a two-dimensional plane and created a voice range profile. This profile has been used to evaluate clinically various vocal disorders, although such evaluations to date have been subjective without quantitative assessment. In the present study, a quantitative system was developed to evaluate the voice range profile utilizing a personal computer. The area of the voice range profile was defined as the voice volume. This volume was analyzed in 137 males and 175 females who were treated for various dysphonias at Kyushu University between 1984 and 1990. Ten normal subjects served as controls. The voice volume in cases with voice disorders significantly decreased irrespective of the disease and sex. Furthermore, cases having better improvement after treatment showed a tendency for the voice volume to increase. These findings illustrated the voice volume as a useful clinical test for evaluating voice control in cases with vocal disorders.
The Impact of Vocal Hyperfunction on Relative Fundamental Frequency during Voicing Offset and Onset
ERIC Educational Resources Information Center
Stepp, Cara E.; Hillman, Robert E.; Heaton, James T.
2010-01-01
Purpose: This study tested the hypothesis that individuals with vocal hyperfunction would show decreases in relative fundamental frequency (RFF) surrounding a voiceless consonant. Method: This retrospective study of 2 clinical databases used speech samples from 15 control participants and women with hyperfunction-related voice disorders: 82 prior…
ERIC Educational Resources Information Center
Murray, Elizabeth S. Heller; Lien, Yu-An S.; Van Stan, Jarrad H.; Mehta, Daryush D.; Hillman, Robert E.; Noordzij, J. Pieter; Stepp, Cara E.
2017-01-01
Purpose: The purpose of this article is to examine the ability of an acoustic measure, relative fundamental frequency (RFF), to distinguish between two subtypes of vocal hyperfunction (VH): phonotraumatic (PVH) and non-phonotraumatic (NPVH). Method: RFF values were compared among control individuals with typical voices (N = 49), individuals with…
Co-Prevalence of Tremor with Spasmodic Dysphonia: A Case-Control Study
White, Laura; Klein, Adam; Hapner, Edie; Delgaudio, John; Hanfelt, John; Jinnah, H. A.; Johns, Michael
2011-01-01
OBJECTIVES/HYPOTHESIS The aim of this study was to define the co-prevalence of tremor with spasmodic dysphonia (SD). STUDY DESIGN A single institution prospective, case-control study was performed from May 2010 to July 2010. METHODS Consecutive patients with SD (cases) and other voice disorders (controls) were enrolled prospectively. Each participant underwent a voice evaluation and an evaluation for tremor. RESULTS 146 voice disorder controls and 128 patients with SD were enrolled. 26% of patients with SD had vocal tremor, 21% had non-vocal tremor. Patients with SD were 2.8 times more likely to have co-prevalent tremor than the control group (OR = 2.81; 95% CI, 1.55 to 5.08) and only 35% of patients with SD had been seen by a neurologist for the evaluation of dystonia and tremor. CONCLUSIONS Tremor is highly prevalent in patients with SD. It is important for each patient diagnosed with SD to undergo an evaluation for tremor, this is especially important in patients diagnosed with vocal tremor. Level of evidence 3b. PMID:21792965
Effect of growth factors on hyaluronan production by canine vocal fold fibroblasts.
Hirano, Shigeru; Bless, Diane M; Heisey, Dennis; Ford, Charles N
2003-07-01
Hyaluronan (HYA) is considered to be a crucial factor in scarless wound healing and in maintaining tissue viscosity of the vocal fold lamina propria. In this study focusing on the effects of growth factors, we examined how HYA is produced and controlled in canine cultured vocal fold fibroblasts. Fibroblasts were taken from the lamina propria of the vocal folds of 8 dogs and cultured with and without growth factors. The production of HYA in the supernatant culture was quantitatively examined by enzyme-linked immunosorbent assay. Hepatocyte growth factor, epidermal growth factor, basic fibroblast growth factor, and transforming growth factor beta1 all stimulated HYA synthesis from vocal fold fibroblasts. These effects differed with the concentration of growth factors and the incubation period. We also examined how frequently the growth factors had to be administered in order to maintain appropriate levels of HYA. A single administration was sufficient to maintain appropriate HYA levels for at least 7 days. The present studies have demonstrated positive effects of growth factors in stimulating HYA production. Further in vivo study is needed to clarify the usefulness of these growth factors in the management of vocal fold scarring.
Characteristics of phonation onset in a two-layer vocal fold model.
Zhang, Zhaoyan
2009-02-01
Characteristics of phonation onset were investigated in a two-layer body-cover continuum model of the vocal folds as a function of the biomechanical and geometric properties of the vocal folds. The analysis showed that an increase in either the body or cover stiffness generally increased the phonation threshold pressure and phonation onset frequency, although the effectiveness of varying body or cover stiffness as a pitch control mechanism varied depending on the body-cover stiffness ratio. Increasing body-cover stiffness ratio reduced the vibration amplitude of the body layer, and the vocal fold motion was gradually restricted to the medial surface, resulting in more effective flow modulation and higher sound production efficiency. The fluid-structure interaction induced synchronization of more than one group of eigenmodes so that two or more eigenmodes may be simultaneously destabilized toward phonation onset. At certain conditions, a slight change in vocal fold stiffness or geometry may cause phonation onset to occur as eigenmode synchronization due to a different pair of eigenmodes, leading to sudden changes in phonation onset frequency, vocal fold vibration pattern, and sound production efficiency. Although observed in a linear stability analysis, a similar mechanism may also play a role in register changes at finite-amplitude oscillations.
Effects of adventitious acute vocal trauma: Relative fundamental frequency and listener perception
Murray, Elizabeth Heller; Hands, Gabrielle L.; Calabrese, Carolyn R.; Stepp, Cara E.
2015-01-01
Objective High voice users (individuals who demonstrate excessive or loud vocal use) are at risk for developing voice disorders. The objective of this study was to examine, both acoustically and perceptually, vocal changes in healthy speakers following an acute period of high voice use. Methods Members of a university women’s volleyball team (N=12) were recorded a week prior (Pre) and week following (Post) the 10-week spring season; N=6 control speakers were recorded over the same time period for comparison. Speakers read four sentences, which were analyzed for relative fundamental frequency (RFF). Eight naïve listeners participated in an auditory-perceptual visual sort and rate (VSR) task, in which they rated each voice sample’s overall severity and strain. Results No significant differences were found as a function of time point in the VSR ratings for the volleyball group. Onset cycle 1 RFF values were significantly lower (p = 0.04) in the Post recordings of the volleyball participants compared to Pre recordings, but there was no significant difference (p = 0.20) in offset cycle 10 RFF values. Receiver operating characteristic analyses indicated moderate sensitivity and specificity of onset cycle 1 RFF for discrimination between the volleyball and control participants. Changes were not apparent in the control group as a function of time for either, onset cycle 1 RFF, offset cycle 10 FF, or either vocal attribute. Conclusion Onset cycle 1 RFF may be an effective marker for detecting vocal changes over an acute high voice use period of time before perceptual changes are noted. PMID:26028369
Patel, Sona; Lodhavia, Anjli; Frankford, Saul; Korzyukov, Oleg; Larson, Charles R.
2016-01-01
Objective/Hypothesis It is known that singers are able to control their voice to maintain a relatively constant vocal quality while transitioning between vocal registers; however, the neural mechanisms underlying this effect are not understood. It was hypothesized that greater attention to the acoustical feedback of the voice and increased control of the vocal musculature during register transitions compared to singing within a register would be represented as neurological differences in event-related potentials (ERPs). Study Design/Methods Nine singers sang musical notes at the high end of the modal register (the boundary between the modal and head/falsetto registers) and at the low end (the boundary between the modal and fry/pulse registers). While singing, the pitch of the voice auditory feedback was unexpectedly shifted either into the adjacent register (“toward” the register boundary) or within the modal register (“away from” the boundary). Singers were instructed to maintain a constant pitch and ignore any changes to their voice feedback. Results Vocal response latencies and magnitude of the accompanying N1 and P2 ERPs were greatest at the lower (modal-fry) boundary when the pitch shift carried the subjects’ voices into the fry register as opposed to remaining within the modal register. Conclusions These findings suggest that when a singer lowers the pitch of their voice such that it enters the fry register from the modal register, there is increased sensory-motor control of the voice, reflected as increased magnitude of the neural potentials to help minimize qualitative changes in the voice. PMID:26739860
The evolution of syntax: an exaptationist perspective.
Fitch, W Tecumseh
2011-01-01
The evolution of language required elaboration of a number of independent mechanisms in the hominin lineage, including systems involved in signaling, semantics, and syntax. Two perspectives on the evolution of syntax can be contrasted. The "continuist" perspective seeks the evolutionary roots of complex human syntax in simpler combinatory systems used in animal communication systems, such as iteration and sequencing. The "exaptationist" perspective posits evolutionary change of function, so that systems today used for linguistic communication might previously have served quite different functions in earlier hominids. I argue that abundant biological evidence supports an exaptationist perspective, in general, and that it must be taken seriously when considering language evolution. When applied to syntax, this suggests that core computational components used today in language could have originally served non-linguistic functions such as motor control, non-verbal thought, or spatial reasoning. I outline three specific exaptationist hypotheses for spoken language. These three hypotheses each posit a change of functionality in a precursor circuit, and its transformation into a neural circuit or region specifically involved in language today. Hypothesis 1 suggests that the precursor mechanism for intentional vocal control, specifically direct cortical control over the larynx, was manual motor control subserved by the cortico-spinal tract. The second is that the arcuate fasciculus, which today connects syntactic and lexical regions, had its origin in intracortical connections subserving vocal imitation. The third is that the specialized components of Broca's area, specifically BA 45, had their origins in non-linguistic motor control, and specifically hierarchical planning of action. I conclude by illustrating the importance of both homology (studied via primates) and convergence (typically analyzed in birds) for testing such evolutionary hypotheses.
The role of the medial temporal limbic system in processing emotions in voice and music.
Frühholz, Sascha; Trost, Wiebke; Grandjean, Didier
2014-12-01
Subcortical brain structures of the limbic system, such as the amygdala, are thought to decode the emotional value of sensory information. Recent neuroimaging studies, as well as lesion studies in patients, have shown that the amygdala is sensitive to emotions in voice and music. Similarly, the hippocampus, another part of the temporal limbic system (TLS), is responsive to vocal and musical emotions, but its specific roles in emotional processing from music and especially from voices have been largely neglected. Here we review recent research on vocal and musical emotions, and outline commonalities and differences in the neural processing of emotions in the TLS in terms of emotional valence, emotional intensity and arousal, as well as in terms of acoustic and structural features of voices and music. We summarize the findings in a neural framework including several subcortical and cortical functional pathways between the auditory system and the TLS. This framework proposes that some vocal expressions might already receive a fast emotional evaluation via a subcortical pathway to the amygdala, whereas cortical pathways to the TLS are thought to be equally used for vocal and musical emotions. While the amygdala might be specifically involved in a coarse decoding of the emotional value of voices and music, the hippocampus might process more complex vocal and musical emotions, and might have an important role especially for the decoding of musical emotions by providing memory-based and contextual associations. Copyright © 2014 Elsevier Ltd. All rights reserved.
Dysphonia and dysphagia after anterior cervical decompression.
Tervonen, Hanna; Niemelä, Mika; Lauri, Eija-Riitta; Back, Leif; Juvas, Anja; Räsänen, Pirjo; Roine, Risto P; Sintonen, Harri; Salmi, Tapani; Vilkman, S Erkki; Aaltonen, Leena-Maija
2007-08-01
In this paper, the authors investigate the effects of anterior cervical decompression (ACD) on swallowing and vocal function. The study comprised 114 patients who underwent ACD. The early group (50 patients) was examined immediately pre- and postoperatively, and the late group (64 patients) was examined at only 3 to 9 months postoperatively. Fifty age- and sex-matched patients from the Department of Otorhinolaryngology-Head and Neck Surgery who had not been intubated in the previous 5 years were used as a control group. All patients in the early and control groups were examined by a laryngologist; patients in the late group were examined by a laryngologist and a neurosurgeon. Videolaryngostroboscopy was performed in all members of the patient and control groups, and the function of the ninth through 12th cranial nerves were clinically evaluated. Data were collected concerning swallowing, voice quality, surgery results, and health-related quality of life. Patients with persistent dysphonia were referred for phoniatric evaluation and laryngeal electromyography (EMG). Those with persistent dysphagia underwent transoral endoscopic evaluation of swallowing function and videofluorography. Sixty percent of patients in the early group reported dysphonia and 69% reported dysphagia at the immediate postoperative visit. Unilateral vocal fold paresis occurred in 12%. The prevalence of both dysphonia and dysphagia decreased in both groups 3 to 9 months postoperatively. All six patients with vocal fold paresis in the early group recovered, and in the late group there were two cases of vocal fold paresis. The results of laryngeal EMG were abnormal in 14 of 16 patients with persistent dysphonia. Neither intraoperative factors nor age or sex had any effect on the occurrence of dysphonia, dysphagia, or vocal fold paresis. Most patients were satisfied with the surgical outcome. Dysphonia, dysphagia, and vocal fold paresis are common but usually transient complications of ACD. Recurrent laryngeal nerve damage detected by EMG is not rare. Pre-and postoperative laryngeal examination of ACD patients should be considered.
Involvement of the avian song system in reproductive behaviour
Wild, J. Martin; Botelho, João F.
2015-01-01
The song system of songbirds consists of an interconnected set of forebrain nuclei that has traditionally been regarded as dedicated to the learning and production of song. Here, however, we suggest that the song system could also influence muscles used in reproductive behaviour, such as the cloacal sphincter muscle. We show that the same medullary nucleus, retroambigualis (RAm), that projects upon spinal motoneurons innervating expiratory muscles (which provide the pressure head for vocalization) and upon vocal motoneurons for respiratory–vocal coordination also projects upon cloacal motoneurons. Furthermore, RAm neurons projecting to sacral spinal levels were shown to receive direct projections from nucleus robustus arcopallialis (RA) of the forebrain song system. Thus, by indicating a possible disynaptic relationship between RA and motoneurons innervating the reproductive organ, in both males and females, these results potentially extend the role of the song system to include consummatory as well as appetitive aspects of reproductive behaviour. PMID:26631245
Pizolato, Raquel Aparecida; Rehder, Maria Inês Beltrati Cornacchioni; Meneghim, Marcelo de Castro; Ambrosano, Glaucia Maria Bovi; Mialhe, Fábio Luiz; Pereira, Antonio Carlos
2013-02-27
Voice problems are more common in teachers due to intensive voice use during routine at work. There is evidence that occupational disphonia prevention programs are important in improving the quality voice and consequently the quality of subjects' lives. To investigate the impact of educational voice interventions for teachers on quality of life and voice. A longitudinal interventional study involving 70 teachers randomly selected from 11 public schools, 30 to receive educational intervention with vocal training exercises and vocal hygiene habits (experimental group) and 40 to receive guidance on vocal hygiene habits (control group control). Before the process of educational activities, the Voice-Related Quality of Life instrument (V-RQOL) was applied, and 3 months after conclusion of the activities, the subjects were interviewed again, using the same instrument. For data analysis, Prox MIXED were applied, with a level of significance α < 0.05. Teachers showed significantly higher domain and overall V-RQOL scores after preventive intervention, in both control and experimental groups. Nevertheless, there was no statistical difference in scores between the groups. Educational actions for vocal health had a positive impact on the quality of life of the participants, and the incorporation of permanent educational actions at institutional level is suggested.
2013-01-01
Background Voice problems are more common in teachers due to intensive voice use during routine at work. There is evidence that occupational disphonia prevention programs are important in improving the quality voice and consequently the quality of subjects’ lives. Aim To investigate the impact of educational voice interventions for teachers on quality of life and voice. Methods A longitudinal interventional study involving 70 teachers randomly selected from 11 public schools, 30 to receive educational intervention with vocal training exercises and vocal hygiene habits (experimental group) and 40 to receive guidance on vocal hygiene habits (control group control). Before the process of educational activities, the Voice-Related Quality of Life instrument (V-RQOL) was applied, and 3 months after conclusion of the activities, the subjects were interviewed again, using the same instrument. For data analysis, Prox MIXED were applied, with a level of significance α < 0.05. Results: Teachers showed significantly higher domain and overall V-RQOL scores after preventive intervention, in both control and experimental groups. Nevertheless, there was no statistical difference in scores between the groups. Conclusion Educational actions for vocal health had a positive impact on the quality of life of the participants, and the incorporation of permanent educational actions at institutional level is suggested. PMID:23445566
ERIC Educational Resources Information Center
Menard, Lucie; Schwartz, Jean-Luc; Boe, Louise-Jean
2004-01-01
The development of speech from infancy to adulthood results from the interaction of neurocognitive factors, by which phonological representations and motor control abilities are gradually acquired, and physical factors, involving the complex changes in the morphology of the articulatory system. In this article, an articulatory-to-acoustic model,…
ERIC Educational Resources Information Center
Lancioni, Giulio E.; Singh, Nirbhay N.; O'Reilly, Mark F.; Oliva, Doretta; Montironi, Gianluigi
2004-01-01
The use of microswitches has been considered a crucial strategy to help individuals with extensive multiple disabilities overcome passivity and achieve control of environmental stimulation (Crawford & Schuster, 1993; Gutowski, 1996; Ko, McConachie, & Jolleff, 1998). In recent years, considerable efforts have been made to extend the evaluation of…
Action Enhances Acoustic Cues for 3-D Target Localization by Echolocating Bats
Wohlgemuth, Melville J.
2016-01-01
Under natural conditions, animals encounter a barrage of sensory information from which they must select and interpret biologically relevant signals. Active sensing can facilitate this process by engaging motor systems in the sampling of sensory information. The echolocating bat serves as an excellent model to investigate the coupling between action and sensing because it adaptively controls both the acoustic signals used to probe the environment and movements to receive echoes at the auditory periphery. We report here that the echolocating bat controls the features of its sonar vocalizations in tandem with the positioning of the outer ears to maximize acoustic cues for target detection and localization. The bat’s adaptive control of sonar vocalizations and ear positioning occurs on a millisecond timescale to capture spatial information from arriving echoes, as well as on a longer timescale to track target movement. Our results demonstrate that purposeful control over sonar sound production and reception can serve to improve acoustic cues for localization tasks. This finding also highlights the general importance of movement to sensory processing across animal species. Finally, our discoveries point to important parallels between spatial perception by echolocation and vision. PMID:27608186
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tuomi, Lisa, E-mail: lisa.tuomi@vgregion.se; Andréll, Paulin; Finizia, Caterina
Background: Patients treated with radiation therapy for laryngeal cancer often experience voice problems. The aim of this randomized controlled trial was to assess the efficacy of voice rehabilitation for laryngeal cancer patients after having undergone radiation therapy and to investigate whether differences between different tumor localizations with regard to rehabilitation outcomes exist. Methods and Materials: Sixty-nine male patients irradiated for laryngeal cancer participated. Voice recordings and self-assessments of communicative dysfunction were performed 1 and 6 months after radiation therapy. Thirty-three patients were randomized to structured voice rehabilitation with a speech-language pathologist and 36 to a control group. Furthermore, comparisons withmore » 23 healthy control individuals were made. Acoustic analyses were performed for all patients, including the healthy control individuals. The Swedish version of the Self Evaluation of Communication Experiences after Laryngeal Cancer and self-ratings of voice function were used to assess vocal and communicative function. Results: The patients who received vocal rehabilitation experienced improved self-rated vocal function after rehabilitation. Patients with supraglottic tumors who received voice rehabilitation had statistically significant improvements in voice quality and self-rated vocal function, whereas the control group did not. Conclusion: Voice rehabilitation for male patients with laryngeal cancer is efficacious regarding patient-reported outcome measurements. The patients experienced better voice function after rehabilitation. Patients with supraglottic tumors also showed an improvement in terms of acoustic voice outcomes. Rehabilitation with a speech-language pathologist is recommended for laryngeal cancer patients after radiation therapy, particularly for patients with supraglottic tumors.« less
Gelada vocal sequences follow Menzerath's linguistic law.
Gustison, Morgan L; Semple, Stuart; Ferrer-I-Cancho, Ramon; Bergman, Thore J
2016-05-10
Identifying universal principles underpinning diverse natural systems is a key goal of the life sciences. A powerful approach in addressing this goal has been to test whether patterns consistent with linguistic laws are found in nonhuman animals. Menzerath's law is a linguistic law that states that, the larger the construct, the smaller the size of its constituents. Here, to our knowledge, we present the first evidence that Menzerath's law holds in the vocal communication of a nonhuman species. We show that, in vocal sequences of wild male geladas (Theropithecus gelada), construct size (sequence size in number of calls) is negatively correlated with constituent size (duration of calls). Call duration does not vary significantly with position in the sequence, but call sequence composition does change with sequence size and most call types are abbreviated in larger sequences. We also find that intercall intervals follow the same relationship with sequence size as do calls. Finally, we provide formal mathematical support for the idea that Menzerath's law reflects compression-the principle of minimizing the expected length of a code. Our findings suggest that a common principle underpins human and gelada vocal communication, highlighting the value of exploring the applicability of linguistic laws in vocal systems outside the realm of language.
Prespeech motor learning in a neural network using reinforcement☆
Warlaumont, Anne S.; Westermann, Gert; Buder, Eugene H.; Oller, D. Kimbrough
2012-01-01
Vocal motor development in infancy provides a crucial foundation for language development. Some significant early accomplishments include learning to control the process of phonation (the production of sound at the larynx) and learning to produce the sounds of one’s language. Previous work has shown that social reinforcement shapes the kinds of vocalizations infants produce. We present a neural network model that provides an account of how vocal learning may be guided by reinforcement. The model consists of a self-organizing map that outputs to muscles of a realistic vocalization synthesizer. Vocalizations are spontaneously produced by the network. If a vocalization meets certain acoustic criteria, it is reinforced, and the weights are updated to make similar muscle activations increasingly likely to recur. We ran simulations of the model under various reinforcement criteria and tested the types of vocalizations it produced after learning in the differ-ent conditions. When reinforcement was contingent on the production of phonated (i.e. voiced) sounds, the network’s post learning productions were almost always phonated, whereas when reinforcement was not contingent on phonation, the network’s post-learning productions were almost always not phonated. When reinforcement was contingent on both phonation and proximity to English vowels as opposed to Korean vowels, the model’s post-learning productions were more likely to resemble the English vowels and vice versa. PMID:23275137
Prespeech motor learning in a neural network using reinforcement.
Warlaumont, Anne S; Westermann, Gert; Buder, Eugene H; Oller, D Kimbrough
2013-02-01
Vocal motor development in infancy provides a crucial foundation for language development. Some significant early accomplishments include learning to control the process of phonation (the production of sound at the larynx) and learning to produce the sounds of one's language. Previous work has shown that social reinforcement shapes the kinds of vocalizations infants produce. We present a neural network model that provides an account of how vocal learning may be guided by reinforcement. The model consists of a self-organizing map that outputs to muscles of a realistic vocalization synthesizer. Vocalizations are spontaneously produced by the network. If a vocalization meets certain acoustic criteria, it is reinforced, and the weights are updated to make similar muscle activations increasingly likely to recur. We ran simulations of the model under various reinforcement criteria and tested the types of vocalizations it produced after learning in the different conditions. When reinforcement was contingent on the production of phonated (i.e. voiced) sounds, the network's post-learning productions were almost always phonated, whereas when reinforcement was not contingent on phonation, the network's post-learning productions were almost always not phonated. When reinforcement was contingent on both phonation and proximity to English vowels as opposed to Korean vowels, the model's post-learning productions were more likely to resemble the English vowels and vice versa. Copyright © 2012 Elsevier Ltd. All rights reserved.
Vocal Tract Morphology in Inhaling Singing: An MRI-Based Study.
Moerman, Mieke; Vanhecke, Françoise; Van Assche, Lieven; Vercruysse, Johan; Daemers, Kristin; Leman, Marc
2016-07-01
Inhaling singing is a recently developed singing technique explored by the soprano singer Françoise Vanhecke. It is based on an inspiratory airflow instead of an expiratory airflow. This article describes the anatomical structural differences of the vocal tract between inhaling and exhaling singing. We hypothesize that the vocal tract alters significantly in inhaling singing, especially concerning the configuration of the anatomical structures in the oral cavity and the subglottal region. This is a prospective study. A professional singer (F.V.) performed sustained tones from F5 chromatically rising up to Bb5 on the vowel /a/. Vocal tract anatomy is assessed by magnetic resonance imaging. Wilcoxon directional testing demonstrates (1) that the vocal tract volume above the glottal region does not differ statistically in contrast to the subglottal region and (2) significant changes in the configuration of the tongue, the upright position of the epiglottis, the length of the floor of mouth, and the distance between the teeth. The narrowing of the subglottis is considered to be secondary to suction forces used in the inhaling singing technique. The changes in the anatomical structures above the vocal folds possibly suggest a valve-like function controlling the air inlet together with the regulator function of the resonator capacities of the vocal tract. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vocal exploration is locally regulated during song learning
Ravbar, Primoz; Parra, Lucas C.; Lipkind, Dina; Tchernichovski, Ofer
2012-01-01
Exploratory variability is essential for sensory-motor learning, but it is not known how and at what time scales it is regulated. We manipulated song learning in zebra finches to experimentally control the requirements for vocal exploration in different parts of their song. We first trained birds to perform a one-syllable song, and once they mastered it we added a new syllable to the song model. Remarkably, when practicing the modified song, birds rapidly alternated between high and low acoustic variability to confine vocal exploration to the newly added syllable. Further, even within syllables, acoustic variability changed independently across song elements that were only milliseconds apart. Analysis of the entire vocal output during learning revealed that the variability of each song element decreased as it approached the target, correlating with momentary local distance from the target and less so with the overall distance. We conclude that vocal error is computed locally in sub-syllabic time scales and that song elements can be learned and crystalized independently. Songbirds have dedicated brain circuitry for vocal babbling in the anterior forebrain pathway (AFP), which generates exploratory song patterns that drive premotor neurons at the song nucleus RA (robust nucleus of the arcopallium). We hypothesize that either AFP adjusts the gain of vocal exploration in fine time scales, or that the sensitivity of RA premotor neurons to AFP/HVC inputs varies across song elements. PMID:22399765
[3D visualization and analysis of vocal fold dynamics].
Bohr, C; Döllinger, M; Kniesburges, S; Traxdorf, M
2016-04-01
Visual investigation methods of the larynx mainly allow for the two-dimensional presentation of the three-dimensional structures of the vocal fold dynamics. The vertical component of the vocal fold dynamics is often neglected, yielding a loss of information. The latest studies show that the vertical dynamic components are in the range of the medio-lateral dynamics and play a significant role within the phonation process. This work presents a method for future 3D reconstruction and visualization of endoscopically recorded vocal fold dynamics. The setup contains a high-speed camera (HSC) and a laser projection system (LPS). The LPS projects a regular grid on the vocal fold surfaces and in combination with the HSC allows a three-dimensional reconstruction of the vocal fold surface. Hence, quantitative information on displacements and velocities can be provided. The applicability of the method is presented for one ex-vivo human larynx, one ex-vivo porcine larynx and one synthetic silicone larynx. The setup introduced allows the reconstruction of the entire visible vocal fold surfaces for each oscillation status. This enables a detailed analysis of the three dimensional dynamics (i. e. displacements, velocities, accelerations) of the vocal folds. The next goal is the miniaturization of the LPS to allow clinical in-vivo analysis in humans. We anticipate new insight on dependencies between 3D dynamic behavior and the quality of the acoustic outcome for healthy and disordered phonation.
[Clinical study on vocal cords spontaneous rehabilitation after CO2 laser surgery].
Zhang, Qingxiang; Hu, Huiying; Sun, Guoyan; Yu, Zhenkun
2014-10-01
To study the spontaneous rehabilitation and phonation quality of vocal cords after different types of CO2 laser microsurgery. Surgical procedures based on Remacle system Type I, Type II, Type III, Type IV and Type V a respectively. Three hundred and fifteen cases with hoarseness based on strobe laryngoscopy results were prospectively assigned to different group according to vocal lesions apperence,vocal vibration and imaging of larynx CT/MRI. Each group holded 63 cases. The investigation included the vocal cords morphological features,the patients' subjective feelings and objective results of vocal cords. There are no severe complications for all patients in perioperative period. Vocal scar found in Type I ,1 case; Type II, 9 cases ;Type III, 47 cases; Type IV, 61 cases and Type Va 63 cases respectively after surgery. The difference of Vocal scar formation after surgery between surgical procedures are statistical significance (χ2 = 222.24, P < 0.05). Hoarseness improved after the surgery in 59 cases of Type I , 51 cases of Type II, 43 cases of Type III, 21 cases of Type IV and 17 cases of Type Va. There are statistically significance (χ2 = 89.46, P < 0.05) between different surgical procedures. The parameters of strobe laryngoscope: there are statistical significance on jitter between procedures (F 44.51, P < 0.05), but without difference within Type I and Type II (P > 0.05). This happened in shimmer parameter and the maximum phonation time (MPT) as jitter. There are no statistical significance between Type IV and Type Va on MPT (P > 0.05). Morphological and functional rehabilitation of vocal cord will be affected obviously when the body layer is injured. The depth and range of the CO2 laser microsurgery are the key factors affecting the vocal rehabilitation.
Dreiss, A N; Ruppli, C A; Roulin, A
2014-01-01
To compete over limited parental resources, young animals communicate with their parents and siblings by producing honest vocal signals of need. Components of begging calls that are sensitive to food deprivation may honestly signal need, whereas other components may be associated with individual-specific attributes that do not change with time such as identity, sex, absolute age and hierarchy. In a sib-sib communication system where barn owl (Tyto alba) nestlings vocally negotiate priority access to food resources, we show that calls have individual signatures that are used by nestlings to recognize which siblings are motivated to compete, even if most vocalization features vary with hunger level. Nestlings were more identifiable when food-deprived than food-satiated, suggesting that vocal identity is emphasized when the benefit of winning a vocal contest is higher. In broods where siblings interact iteratively, we speculate that individual-specific signature permits siblings to verify that the most vocal individual in the absence of parents is the one that indeed perceived the food brought by parents. Individual recognition may also allow nestlings to associate identity with individual-specific characteristics such as position in the within-brood dominance hierarchy. Calls indeed revealed age hierarchy and to a lower extent sex and absolute age. Using a cross-fostering experimental design, we show that most acoustic features were related to the nest of origin (but not the nest of rearing), suggesting a genetic or an early developmental effect on the ontogeny of vocal signatures. To conclude, our study suggests that sibling competition has promoted the evolution of vocal behaviours that signal not only hunger level but also intrinsic individual characteristics such as identity, family, sex and age. © 2013 The Authors. Journal of Evolutionary Biology © 2013 European Society For Evolutionary Biology.
Echolocating bats rely on audiovocal feedback to adapt sonar signal design.
Luo, Jinhong; Moss, Cynthia F
2017-10-10
Many species of bat emit acoustic signals and use information carried by echoes reflecting from nearby objects to navigate and forage. It is widely documented that echolocating bats adjust the features of sonar calls in response to echo feedback; however, it remains unknown whether audiovocal feedback contributes to sonar call design. Audiovocal feedback refers to the monitoring of one's own vocalizations during call production and has been intensively studied in nonecholocating animals. Audiovocal feedback not only is a necessary component of vocal learning but also guides the control of the spectro-temporal structure of vocalizations. Here, we show that audiovocal feedback is directly involved in the echolocating bat's control of sonar call features. As big brown bats tracked targets from a stationary position, we played acoustic jamming signals, simulating calls of another bat, timed to selectively perturb audiovocal feedback or echo feedback. We found that the bats exhibited the largest call-frequency adjustments when the jamming signals occurred during vocal production. By contrast, bats did not show sonar call-frequency adjustments when the jamming signals coincided with the arrival of target echoes. Furthermore, bats rapidly adapted sonar call design in the first vocalization following the jamming signal, revealing a response latency in the range of 66 to 94 ms. Thus, bats, like songbirds and humans, rely on audiovocal feedback to structure sonar signal design.
A Bayesian Account of Vocal Adaptation to Pitch-Shifted Auditory Feedback
Hahnloser, Richard H. R.
2017-01-01
Motor systems are highly adaptive. Both birds and humans compensate for synthetically induced shifts in the pitch (fundamental frequency) of auditory feedback stemming from their vocalizations. Pitch-shift compensation is partial in the sense that large shifts lead to smaller relative compensatory adjustments of vocal pitch than small shifts. Also, compensation is larger in subjects with high motor variability. To formulate a mechanistic description of these findings, we adapt a Bayesian model of error relevance. We assume that vocal-auditory feedback loops in the brain cope optimally with known sensory and motor variability. Based on measurements of motor variability, optimal compensatory responses in our model provide accurate fits to published experimental data. Optimal compensation correctly predicts sensory acuity, which has been estimated in psychophysical experiments as just-noticeable pitch differences. Our model extends the utility of Bayesian approaches to adaptive vocal behaviors. PMID:28135267
Park, Hee-Sun; Kim, Ha-Jung; Ro, Young-Jin; Yang, Hong-Seuk; Koh, Won-Uk
2017-01-01
Abstract Rationale: Recurrent laryngeal nerve block is an uncommon complication that can occur after an interscalene brachial plexus block (ISB), which may lead to vocal cord palsy or paresis. However, if the recurrent laryngeal nerve is blocked in patients with a preexisting contralateral vocal cord palsy following neck surgery, this may lead to devastating acute respiratory failure. Thus, ISB is contraindicated in patients with contralateral vocal cord lesion. To the best of our knowledge, there are no reports of bilateral vocal cord paresis, which occurred after a continuous ISB and endotracheal intubation in a patient with no history of vocal cord injury or surgery of the neck. Patient concerns: A 59 year old woman was planned for open acromioplasty and rotator cuff repair under general anesthesia. General anesthesia was induced following an ISB using 0.2% ropivacaine and catheter insertion for postoperative pain control. Diagnoses: While recovering in the postanesthesia care unit (PACU), however, the patient complained of a sore throat and hoarseness without respiratory insufficiency. On the morning of the first postoperative day, she still complained of mild dyspnea, dysphonia, and slight aspiration. She was subsequently diagnosed with bilateral vocal cord paresis following an endoscopic laryngoscopy examination. Interventions: The continuous ISB catheter was immediately removed and the dyspnea and hoarseness symptoms improved, although mild aspiration during drinking water was still present. Outcomes: On the 4th postoperative day, a laryngoscopy examination revealed that the right vocal cord movement had returned to normal but that the left vocal cord paresis still remained. Lessons: When ISB is planned, a detailed history-taking and examination of the airway are essential for patient safety and we recommend that any local anesthetics be carefully injected under ultrasound guidance. We also recommend the use of low concentration of local anesthetics to avoid possible paralysis of the vocal cord. PMID:28403100
Vibratory Regime Classification of Infant Phonation
Buder, Eugene H.; Chorna, Lesya B.; Oller, D. Kimbrough; Robinson, Rebecca B.
2008-01-01
Infant phonation is highly variable in many respects, including the basic vibratory patterns by which the vocal tissues create acoustic signals. Previous studies have identified the regular occurrence of non-modal phonation types in normal infant phonation. The glottis is like many oscillating systems that, because of non-linear relationships among the elements, may vibrate in ways representing the deterministic patterns classified theoretically within the mathematical framework of non-linear dynamics. The infant’s pre-verbal vocal explorations present such a variety of phonations that it may be possible to find effectively all the classes of vibration predicted by non-linear dynamic theory. The current report defines acoustic criteria for an important subset of such vibratory regimes, and demonstrates that analysts can be trained to reliably use these criteria for a classification that includes all instances of infant phonation in the recorded corpora. The method is thus internally comprehensive in the sense that all phonations are classified, but it is not exhaustive in the sense that all vocal qualities are thereby represented. Using the methods thus developed, this study also demonstrates that the distributions of these phonation types vary significantly across sessions of recording in the first year of life, suggesting developmental changes. The method of regime classification is thus capable of tracking changes that may be indicative of maturation of the mechanism, the learning of categories of phonatory control, and the possibly varying use of vocalizations across social contexts. PMID:17509829
Wellmann, Kristen A; George, Finney; Brnouti, Fares; Mooney, Sandra M
2015-06-01
Prenatal ethanol exposure disrupts social behavior in humans and rodents. One system particularly important for social behavior is the somatosensory system. Prenatal ethanol exposure alters the structure and function of this area. Docosahexaenoic acid (DHA), an omega 3 polyunsaturated fatty acid, is necessary for normal brain development and brains from ethanol-exposed animals are DHA deficient. Thus, we determined whether postnatal DHA supplementation ameliorated behavioral deficits induced by prenatal ethanol exposure. Timed pregnant Long-Evans rats were assigned to one of three groups: ad libitum access to an ethanol-containing liquid diet, pair fed an isocaloric isonutritive non-alcohol liquid diet, or ad libitum access to chow and water. Pups were assigned to one of two postnatal treatment groups; gavaged intragastrically once per day between postnatal day (P)11 and P20 with DHA (10 mg/kg in artificial rat milk) or artificial rat milk. A third group was left untreated. Isolation-induced ultrasonic vocalizations (iUSVs) were recorded on P14. Social behavior and play-induced USVs were tested on P28 or P42. Somatosensory performance was tested with a gap crossing test around P33 or on P42. Anxiety was tested on elevated plus maze around P35. Animals exposed to ethanol prenatally vocalized less, play fought less, and crossed a significantly shorter gap than control-treated animals. Administration of DHA ameliorated these ethanol-induced deficits such that the ethanol-exposed animals given DHA were no longer significantly different to control-treated animals. Thus, DHA administration may have therapeutic value to reverse some of ethanol's damaging effects. Copyright © 2015 Elsevier B.V. All rights reserved.
A real-time LPC-based vocal tract area display for voice development.
Rossiter, D; Howard, D M; Downes, M
1994-12-01
This article reports the design and implementation of a graphical display that presents an approximation to vocal tract area in real time for voiced vowel articulation. The acoustic signal is digitally sampled by the system. From these data a set of reflection coefficients is derived using linear predictive coding. A matrix of area coefficients is then determined that approximates the vocal tract area of the user. From this information a graphical display is then generated. The complete cycle of analysis and display is repeated at approximately 20 times/s. Synchronised audio and visual sequences can be recorded and used as dynamic targets for articulatory development. Use of the system is illustrated by diagrams of system output for spoken cardinal vowels and for vowels sung in a trained and untrained style.
Sleep, offline processing, and vocal learning
Margoliash, Daniel; Schmidt, Marc F
2009-01-01
The study of song learning and the neural song system has provided an important comparative model system for the study of speech and language acquisition. We describe some recent advances in the bird song system, focusing on the role of offline processing including sleep in processing sensory information and in guiding developmental song learning. These observations motivate a new model of the organization and role of the sensory memories in vocal learning. PMID:19906416
The power of your vocal image.
McCoy, L A
1996-03-01
Your vocal image is the impression that listeners form of you based on the sound of your voice. In a dental office, where the initial patient contact usually occurs over the phone, your vocal image is vitally important. According to social psychologists, people begin to make relatively durable first impressions within six to 12 seconds of perceiving a sensory cue. This means that patients begin to form their impressions of a telephone speaker almost immediately. Based on the qualities of the speaker's voice and how it is used, they'll form impressions related to everything from the speaker's physical and personality characteristics to his or her intellectual ability, and eventually even generalize their impressions to include the office that the speaker represents. If you want to improve your vocal image, you must first be aware of exactly what that image is. There are two factors that combine to create a vocal impression--the speaker's physical vocal tools and the sound that is created by them. The five physical tools involved are the lungs, vocal cords, throat, mouth and ears. At each stage in the sound production process, we can easily fall into negative habits and lazy patterns if we're not careful. Although we can't do much about our physical voice mechanism, we can certainly exercise a great deal of control over how our voice is used. A strong, confident voice is an essential part of effective interpersonal communication. If you want to project an image of confidence and professionalism, don't overlook the subtle benefits of effective vocal power.
Effect of pneumotach on measurement of vocal function
NASA Astrophysics Data System (ADS)
Walters, Gage; McPhail, Michael; Krane, Michael
2017-11-01
Aerodynamic and acoustic measurements of vocal function were performed in a physical model of the human airway with and without a pneumotach (Rothenberg mask), used by clinicians to measure vocal volume flow. The purpose of these experiments was to assess whether the device alters acoustic and aerodynamic conditions sufficiently to change phonation behavior. The airway model, which mimics acoustic behavior of an adult human airway from trachea to mouth, consists of a 31.5cm long straight duct with a 2.54cm square cross section. Model vocal folds comprised of molded silicone rubber were set into vibration by introducing airflow from a compressed air source. Measurements included transglottal pressure difference, mean volume flow, vocal fold vibratory motion, and sound pressure measured at the mouth. The experiments show that while the pneumotach imparted measurable aerodynamic and acoustic loads on the system, measurement of mean glottal resistance was not affected. Acoustic pressure levels were attenuated, however, suggesting clinical acoustic measurements of vocal function need correction when performed in conjunction with a pneumotach Acknowledge support from NIH DC R01005642-11.
Wavelet based detection of manatee vocalizations
NASA Astrophysics Data System (ADS)
Gur, Berke M.; Niezrecki, Christopher
2005-04-01
The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of watercraft collisions in Florida's coastal waterways. Several boater warning systems, based upon manatee vocalizations, have been proposed to reduce the number of collisions. Three detection methods based on the Fourier transform (threshold, harmonic content and autocorrelation methods) were previously suggested and tested. In the last decade, the wavelet transform has emerged as an alternative to the Fourier transform and has been successfully applied in various fields of science and engineering including the acoustic detection of dolphin vocalizations. As of yet, no prior research has been conducted in analyzing manatee vocalizations using the wavelet transform. Within this study, the wavelet transform is used as an alternative to the Fourier transform in detecting manatee vocalizations. The wavelet coefficients are analyzed and tested against a specified criterion to determine the existence of a manatee call. The performance of the method presented is tested on the same data previously used in the prior studies, and the results are compared. Preliminary results indicate that using the wavelet transform as a signal processing technique to detect manatee vocalizations shows great promise.
Localization of Label-Retaining Cells in Murine Vocal Fold Epithelium
ERIC Educational Resources Information Center
Leydon, Ciara; Bartlett, Rebecca S.; Roenneburg, Drew A.; Thibeault, Susan L.
2011-01-01
Purpose: Epithelial homeostasis is critical for vocal fold health, yet little is known about the cells that support epithelial self-renewal. As a known characteristic of stem cells is that they are slow-cycling in vivo, the purpose of this prospective controlled study was to identify and quantify slow-cycling cells or putative stem cells in murine…
Attention and Gaze Control in Picture Naming, Word Reading, and Word Categorizing
ERIC Educational Resources Information Center
Roelofs, Ardi
2007-01-01
The trigger for shifting gaze between stimuli requiring vocal and manual responses was examined. Participants were presented with picture-word stimuli and left- or right-pointing arrows. They vocally named the picture (Experiment 1), read the word (Experiment 2), or categorized the word (Experiment 3) and shifted their gaze to the arrow to…
Acoustic, respiratory kinematic and electromyographic effects of vocal training
NASA Astrophysics Data System (ADS)
Mendes, Ana Paula De Brito Garcia
The longitudinal effects of vocal training on the respiratory, phonatory and articulatory systems were investigated in this study. During four semesters, fourteen voice major students were recorded while speaking and singing. Acoustic, temporal, respiratory kinematic and electromyographic parameters were measured to determine changes in the three systems as a function of vocal training. Acoustic measures of the speaking voice included fundamental frequency, sound pressure level (SPL), percent jitter and shimmer, and harmonic-to-noise ratio. Temporal measures included duration of sentences, diphthongs and the closure durations of stop consonants. Acoustic measures of the singing voice included fundamental frequency and sound pressure level of the phonational range, vibrato pulses per second, vibrato amplitude variation and the presence of the singer's formant. Analysis of the data revealed that vocal training had a significant effect on the singing voice. Fundamental frequency and SPL of the 90% level and 90--10% of the phonational range increased significantly during four semesters of vocal training. Physiological data was collected from four subjects during three semesters of vocal training. Respiratory kinematic measures included lung volume, rib cage and abdominal excursions extracted from spoken sung samples. Descriptive statistics revealed that rib cage and abdominal excursions increased from the 1st to the 2nd semester and decrease from the 2nd to the 3rd semester of vocal training. Electromyographic measures of the pectoralis major, rectus abdominis and external obliques muscles revealed that burst duration means decreased from the 1st to the 2nd semester and increased from the 2nd to the 3rd semester. Peak amplitude means increased from the 1st to the 2nd and decreased from the 2nd to the 3rd semester of vocal training. Chest wall excursions and muscle force generation of the three muscles increased as the demanding level and the length of the phonatory tasks increased.
Acoustic characteristics used by Japanese macaques for individual discrimination.
Furuyama, Takafumi; Kobayasi, Kohta I; Riquimaroux, Hiroshi
2017-10-01
The vocalizations of primates contain information about speaker individuality. Many primates, including humans, are able to distinguish conspecifics based solely on vocalizations. The purpose of this study was to investigate the acoustic characteristics used by Japanese macaques in individual vocal discrimination. Furthermore, we tested human subjects using monkey vocalizations to evaluate species specificity with respect to such discriminations. Two monkeys and five humans were trained to discriminate the coo calls of two unfamiliar monkeys. We created a stimulus continuum between the vocalizations of the two monkeys as a set of probe stimuli (whole morph). We also created two sets of continua in which only one acoustic parameter, fundamental frequency ( f 0 ) or vocal tract characteristic (VTC), was changed from the coo call of one monkey to that of another while the other acoustic feature remained the same ( f 0 morph and VTC morph, respectively). According to the results, the reaction times both of monkeys and humans were correlated with the morph proportion under the whole morph and f 0 morph conditions. The reaction time to the VTC morph was correlated with the morph proportion in both monkeys, whereas the reaction time in humans, on average, was not correlated with morph proportion. Japanese monkeys relied more consistently on VTC than did humans for discriminating monkey vocalizations. Our results support the idea that the auditory system of primates is specialized for processing conspecific vocalizations and suggest that VTC is a significant acoustic feature used by Japanese macaques to discriminate conspecific vocalizations. © 2017. Published by The Company of Biologists Ltd.
A meta-analysis of outcomes of hydration intervention on phonation threshold pressure.
Leydon, Ciara; Wroblewski, Marcin; Eichorn, Naomi; Sivasankar, Mahalakshmi
2010-11-01
Vocal fold hydration is purported to promote optimal biomechanical characteristics of vocal fold mucosa, increase efficiency of vocal fold oscillation, and enhance voice quality. The purpose of this work was to determine the magnitude and consistency of the effect of vocal fold hydration on vocal fold function across published clinical studies. We completed a comprehensive meta-analysis of the effects of superficial and systemic vocal fold hydration on phonation threshold pressure (PTP), a measure of efficiency of voice production. We identified 34 studies that examined the effects of hydration on vocal function. Of these studies, 14 examined the effects of hydration on PTP. Nine of these articles met the criteria for inclusion in this analysis. We observed an average effect size of 0.33, indicating that, overall, hydration treatment demonstrated a tendency to reduce PTP. However, this decrease in phonatory effort did not reach significance at the 95% confidence level. The effects of hydration intervention varied considerably across studies (-0.19 to 3.96). We considered that two factors, pitch level of the task and vocal health of participants, may have contributed to this variability in findings. However, our analysis found that these factors could not account for differences in effect size. To understand the variability in outcomes across studies, the role of factors that may impact the effects of hydration, such as the amount, type, and duration of intervention, must be determined. Only then can we obtain data to guide best clinical practice for protecting and rehabilitating vocal function. Copyright © 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
A Chinese alligator in heliox: formant frequencies in a crocodilian
Reber, Stephan A.; Nishimura, Takeshi; Janisch, Judith; Robertson, Mark; Fitch, W. Tecumseh
2015-01-01
ABSTRACT Crocodilians are among the most vocal non-avian reptiles. Adults of both sexes produce loud vocalizations known as ‘bellows’ year round, with the highest rate during the mating season. Although the specific function of these vocalizations remains unclear, they may advertise the caller's body size, because relative size differences strongly affect courtship and territorial behaviour in crocodilians. In mammals and birds, a common mechanism for producing honest acoustic signals of body size is via formant frequencies (vocal tract resonances). To our knowledge, formants have to date never been documented in any non-avian reptile, and formants do not seem to play a role in the vocalizations of anurans. We tested for formants in crocodilian vocalizations by using playbacks to induce a female Chinese alligator (Alligator sinensis) to bellow in an airtight chamber. During vocalizations, the animal inhaled either normal air or a helium/oxygen mixture (heliox) in which the velocity of sound is increased. Although heliox allows normal respiration, it alters the formant distribution of the sound spectrum. An acoustic analysis of the calls showed that the source signal components remained constant under both conditions, but an upward shift of high-energy frequency bands was observed in heliox. We conclude that these frequency bands represent formants. We suggest that crocodilian vocalizations could thus provide an acoustic indication of body size via formants. Because birds and crocodilians share a common ancestor with all dinosaurs, a better understanding of their vocal production systems may also provide insight into the communication of extinct Archosaurians. PMID:26246611
Rodenas‐Cuadrado, Pedro M.; Mengede, Janine; Baas, Laura; Devanna, Paolo; Schmid, Tobias A.; Yartsev, Michael; Firzlaff, Uwe
2018-01-01
Abstract Genes including FOXP2, FOXP1, and CNTNAP2, have been implicated in human speech and language phenotypes, pointing to a role in the development of normal language‐related circuitry in the brain. Although speech and language are unique to humans a comparative approach is possible by addressing language‐relevant traits in animal systems. One such trait, vocal learning, represents an essential component of human spoken language, and is shared by cetaceans, pinnipeds, elephants, some birds and bats. Given their vocal learning abilities, gregarious nature, and reliance on vocalizations for social communication and navigation, bats represent an intriguing mammalian system in which to explore language‐relevant genes. We used immunohistochemistry to detail the distribution of FoxP2, FoxP1, and Cntnap2 proteins, accompanied by detailed cytoarchitectural histology in the brains of two vocal learning bat species; Phyllostomus discolor and Rousettus aegyptiacus. We show widespread expression of these genes, similar to what has been previously observed in other species, including humans. A striking difference was observed in the adult P. discolor bat, which showed low levels of FoxP2 expression in the cortex that contrasted with patterns found in rodents and nonhuman primates. We created an online, open‐access database within which all data can be browsed, searched, and high resolution images viewed to single cell resolution. The data presented herein reveal regions of interest in the bat brain and provide new opportunities to address the role of these language‐related genes in complex vocal‐motor and vocal learning behaviors in a mammalian model system. PMID:29297931
From imitation to meaning: circuit plasticity and the acquisition of a conventionalized semantics
García, Ricardo R.; Zamorano, Francisco; Aboitiz, Francisco
2014-01-01
The capacity for language is arguably the most remarkable innovation of the human brain. A relatively recent interpretation prescribes that part of the language-related circuits were co-opted from circuitry involved in hand control—the mirror neuron system (MNS), involved both in the perception and in the execution of voluntary grasping actions. A less radical view is that in early humans, communication was opportunistic and multimodal, using signs, vocalizations or whatever means available to transmit social information. However, one point that is not yet clear under either perspective is how learned communication acquired a semantic property thereby allowing us to name objects and eventually describe our surrounding environment. Here we suggest a scenario involving both manual gestures and learned vocalizations that led to the development of a primitive form of conventionalized reference. This proposal is based on comparative evidence gathered from other species and on neurolinguistic evidence in humans, which points to a crucial role for vocal learning in the early development of language. Firstly, the capacity to direct the attention of others to a common object may have been crucial for developing a consensual referential system. Pointing, which is a ritualized grasping gesture, may have been crucial to this end. Vocalizations also served to generate joint attention among conversants, especially when combined with gaze direction. Another contributing element was the development of pantomimic actions resembling events or animals. In conjunction with this mimicry, the development of plastic neural circuits that support complex, learned vocalizations was probably a significant factor in the evolution of conventionalized semantics in our species. Thus, vocal imitations of sounds, as in onomatopoeias (words whose sound resembles their meaning), are possibly supported by mirror system circuits, and may have been relevant in the acquisition of early meanings. PMID:25152726
Error-dependent modulation of speech-induced auditory suppression for pitch-shifted voice feedback
2011-01-01
Background The motor-driven predictions about expected sensory feedback (efference copies) have been proposed to play an important role in recognition of sensory consequences of self-produced motor actions. In the auditory system, this effect was suggested to result in suppression of sensory neural responses to self-produced voices that are predicted by the efference copies during vocal production in comparison with passive listening to the playback of the identical self-vocalizations. In the present study, event-related potentials (ERPs) were recorded in response to upward pitch shift stimuli (PSS) with five different magnitudes (0, +50, +100, +200 and +400 cents) at voice onset during active vocal production and passive listening to the playback. Results Results indicated that the suppression of the N1 component during vocal production was largest for unaltered voice feedback (PSS: 0 cents), became smaller as the magnitude of PSS increased to 200 cents, and was almost completely eliminated in response to 400 cents stimuli. Conclusions Findings of the present study suggest that the brain utilizes the motor predictions (efference copies) to determine the source of incoming stimuli and maximally suppresses the auditory responses to unaltered feedback of self-vocalizations. The reduction of suppression for 50, 100 and 200 cents and its elimination for 400 cents pitch-shifted voice auditory feedback support the idea that motor-driven suppression of voice feedback leads to distinctly different sensory neural processing of self vs. non-self vocalizations. This characteristic may enable the audio-vocal system to more effectively detect and correct for unexpected errors in the feedback of self-produced voice pitch compared with externally-generated sounds. PMID:21645406
Optimal Duration for Voice Rest After Vocal Fold Surgery: Randomized Controlled Clinical Study.
Kaneko, Mami; Shiromoto, Osamu; Fujiu-Kurachi, Masako; Kishimoto, Yo; Tateya, Ichiro; Hirano, Shigeru
2017-01-01
Voice rest is commonly recommended after phonomicrosurgery to prevent worsening of vocal fold injuries. However, the most effective duration of voice rest is unknown. Recently, early vocal stimulation was recommended as a means to improve wound healing. The purpose of this study is to examine the optimal duration of voice rest after phonomicrosurgery. Randomized controlled clinical study. Patients undergoing phonomicrosurgery for leukoplakia, carcinoma in situ, vocal fold polyp, Reinke's edema, and cyst were chosen. Participants were randomly assigned to voice rest for 3 or 7 postoperative days. Voice therapy was administered to both groups after voice rest. Grade, roughness, breathiness, asthenia, and strain (GRBAS) scale, stroboscopic examination, aerodynamic assessment, acoustic analysis, and Voice Handicap Index-10 (VHI-10) were performed pre- and postoperatively at 1, 3, and 6 months. Stroboscopic examination evaluated normalized mucosal wave amplitude (NMWA). Parameters were compared between both groups. Thirty-one patients were analyzed (3-day group, n = 16; 7-day group, n = 15). Jitter, shimmer, and VHI-10 were significantly better in the 3-day group at 1 month post operation. GRBAS was significantly better in the 3-day group at 1 and 3 months post operation, and NMWA was significantly better in the 3-day group at 1, 3, and 6 months post operation compared to the 7-day group. The data suggest that 3 days of voice rest followed by voice therapy may lead to better wound healing of the vocal fold compared to 7 days of voice rest. Appropriate mechanical stimulation during early stages of vocal fold wound healing may lead to favorable functional recovery. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Role of N-Methyl-D-Aspartate Receptors in Action-Based Predictive Coding Deficits in Schizophrenia.
Kort, Naomi S; Ford, Judith M; Roach, Brian J; Gunduz-Bruce, Handan; Krystal, John H; Jaeger, Judith; Reinhart, Robert M G; Mathalon, Daniel H
2017-03-15
Recent theoretical models of schizophrenia posit that dysfunction of the neural mechanisms subserving predictive coding contributes to symptoms and cognitive deficits, and this dysfunction is further posited to result from N-methyl-D-aspartate glutamate receptor (NMDAR) hypofunction. Previously, by examining auditory cortical responses to self-generated speech sounds, we demonstrated that predictive coding during vocalization is disrupted in schizophrenia. To test the hypothesized contribution of NMDAR hypofunction to this disruption, we examined the effects of the NMDAR antagonist, ketamine, on predictive coding during vocalization in healthy volunteers and compared them with the effects of schizophrenia. In two separate studies, the N1 component of the event-related potential elicited by speech sounds during vocalization (talk) and passive playback (listen) were compared to assess the degree of N1 suppression during vocalization, a putative measure of auditory predictive coding. In the crossover study, 31 healthy volunteers completed two randomly ordered test days, a saline day and a ketamine day. Event-related potentials during the talk/listen task were obtained before infusion and during infusion on both days, and N1 amplitudes were compared across days. In the case-control study, N1 amplitudes from 34 schizophrenia patients and 33 healthy control volunteers were compared. N1 suppression to self-produced vocalizations was significantly and similarly diminished by ketamine (Cohen's d = 1.14) and schizophrenia (Cohen's d = .85). Disruption of NMDARs causes dysfunction in predictive coding during vocalization in a manner similar to the dysfunction observed in schizophrenia patients, consistent with the theorized contribution of NMDAR hypofunction to predictive coding deficits in schizophrenia. Copyright © 2016 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Simulated Birdwatchers’ Playback Affects the Behavior of Two Tropical Birds
Harris, J. Berton C.; Haskell, David G.
2013-01-01
Although recreational birdwatchers may benefit conservation by generating interest in birds, they may also have negative effects. One such potentially negative impact is the widespread use of recorded vocalizations, or “playback,” to attract birds of interest, including range-restricted and threatened species. Although playback has been widely used to test hypotheses about the evolution of behavior, no peer-reviewed study has examined the impacts of playback in a birdwatching context on avian behavior. We studied the effects of simulated birdwatchers’ playback on the vocal behavior of Plain-tailed Wrens Thryothorus euophrys and Rufous Antpittas Grallaria rufula in Ecuador. Study species’ vocal behavior was monitored for an hour after playing either a single bout of five minutes of song or a control treatment of background noise. We also studied the effects of daily five minute playback on five groups of wrens over 20 days. In single bout experiments, antpittas made more vocalizations of all types, except for trills, after playback compared to controls. Wrens sang more duets after playback, but did not produce more contact calls. In repeated playback experiments, wren responses were strong at first, but hardly detectable by day 12. During the study, one study group built a nest, apparently unperturbed, near a playback site. The playback-induced habituation and changes in vocal behavior we observed suggest that scientists should consider birdwatching activity when selecting research sites so that results are not biased by birdwatchers’ playback. Increased vocalizations after playback could be interpreted as a negative effect of playback if birds expend energy, become stressed, or divert time from other activities. In contrast, the habituation we documented suggests that frequent, regular birdwatchers’ playback may have minor effects on wren behavior. PMID:24147094
Simulated birdwatchers' playback affects the behavior of two tropical birds.
Harris, J Berton C; Haskell, David G
2013-01-01
Although recreational birdwatchers may benefit conservation by generating interest in birds, they may also have negative effects. One such potentially negative impact is the widespread use of recorded vocalizations, or "playback," to attract birds of interest, including range-restricted and threatened species. Although playback has been widely used to test hypotheses about the evolution of behavior, no peer-reviewed study has examined the impacts of playback in a birdwatching context on avian behavior. We studied the effects of simulated birdwatchers' playback on the vocal behavior of Plain-tailed Wrens Thryothorus euophrys and Rufous Antpittas Grallaria rufula in Ecuador. Study species' vocal behavior was monitored for an hour after playing either a single bout of five minutes of song or a control treatment of background noise. We also studied the effects of daily five minute playback on five groups of wrens over 20 days. In single bout experiments, antpittas made more vocalizations of all types, except for trills, after playback compared to controls. Wrens sang more duets after playback, but did not produce more contact calls. In repeated playback experiments, wren responses were strong at first, but hardly detectable by day 12. During the study, one study group built a nest, apparently unperturbed, near a playback site. The playback-induced habituation and changes in vocal behavior we observed suggest that scientists should consider birdwatching activity when selecting research sites so that results are not biased by birdwatchers' playback. Increased vocalizations after playback could be interpreted as a negative effect of playback if birds expend energy, become stressed, or divert time from other activities. In contrast, the habituation we documented suggests that frequent, regular birdwatchers' playback may have minor effects on wren behavior.
Production and survival of projection neurons in a forebrain vocal center of adult male canaries
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kirn, J.R.; Alvarez-Buylla, A.; Nottebohm, F.
1991-06-01
Neurons are produced in the adult canary telencephalon. Many of these cells are incorporated into the high vocal center (nucleus HVC), which participates in the control of learned song. In the present work, 3H-thymidine and fluorogold were employed to follow the differentiation and survival of HVC neurons born in adulthood. We found that many HVC neurons born in September grow long axons to the robust nucleus of the archistriatum (nucleus RA) and thus become part of the efferent pathway for song control. Many of these new neurons have already established their connections with RA by 30 d after their birth.more » By 240 d, 75-80% of the September-born HVC neurons project to RA. Most of these new projection neurons survive at least 8 months. The longevity of HVC neurons born in September suggests that these cells remain part of the vocal control circuit long enough to participate in the yearly renewal of the song repertoire.« less
Animal vocal sequences: not the Markov chains we thought they were
Kershenbaum, Arik; Bowles, Ann E.; Freeberg, Todd M.; Jin, Dezhe Z.; Lameira, Adriano R.; Bohn, Kirsten
2014-01-01
Many animals produce vocal sequences that appear complex. Most researchers assume that these sequences are well characterized as Markov chains (i.e. that the probability of a particular vocal element can be calculated from the history of only a finite number of preceding elements). However, this assumption has never been explicitly tested. Furthermore, it is unclear how language could evolve in a single step from a Markovian origin, as is frequently assumed, as no intermediate forms have been found between animal communication and human language. Here, we assess whether animal taxa produce vocal sequences that are better described by Markov chains, or by non-Markovian dynamics such as the ‘renewal process’ (RP), characterized by a strong tendency to repeat elements. We examined vocal sequences of seven taxa: Bengalese finches Lonchura striata domestica, Carolina chickadees Poecile carolinensis, free-tailed bats Tadarida brasiliensis, rock hyraxes Procavia capensis, pilot whales Globicephala macrorhynchus, killer whales Orcinus orca and orangutans Pongo spp. The vocal systems of most of these species are more consistent with a non-Markovian RP than with the Markovian models traditionally assumed. Our data suggest that non-Markovian vocal sequences may be more common than Markov sequences, which must be taken into account when evaluating alternative hypotheses for the evolution of signalling complexity, and perhaps human language origins. PMID:25143037
Hernández-Morato, Ignacio; Valderrama-Canales, Francisco J; Berdugo, Gabriel; Arias, Gonzalo; McHanwell, Stephen; Sañudo, José; Vázquez, Teresa; Pascual-Font, Arán
2013-01-01
Motoneurons innervating laryngeal muscles are located in the nucleus ambiguus (Amb), but there is no general agreement on the somatotopic representation and even less is known on how an injury in the recurrent laryngeal nerve (RLN) affects this pattern. This study analyzes the normal somatotopy of those motoneurons and describes its changes over time after a crush injury to the RLN. In the control group (control group 1, n = 9 rats), the posterior cricoarytenoid (PCA) and thyroarytenoid (TA) muscles were injected with cholera toxin-B. In the experimental groups the left RLN of each animal was crushed with a fine tip forceps and, after several survival periods (1, 2, 4, 8, 12 weeks; minimum six rats per time), the PCA and TA muscles were injected as described above. After each surgery, the motility of the vocal folds was evaluated. Additional control experiments were performed; the second control experiment (control group 2, n = 6 rats) was performed labeling the TA and PCA immediately prior to the section of the superior laryngeal nerve (SLN), in order to eliminate the possibility of accidental labeling of the cricothyroid (CT) muscle by spread from the injection site. The third control group (control group 3, n = 5 rats) was included to determine if there is some sprouting from the SLN into the territories of the RLN after a crush of this last nerve. One week after the crush injury of the RLN, the PCA and TA muscles were injected immediately before the section of the SLN. The results show that a single population of neurons represents each muscle with the PCA in the most rostral position followed caudalwards by the TA. One week post-RLN injury, both the somatotopy and the number of labeled motoneurons changed, where the labeled neurons were distributed randomly; in addition, an area of topographical overlap of the two populations was observed and vocal fold mobility was lost. In the rest of the survival periods, the overlapping area is larger, but the movement of the vocal folds tends to recover. After 12 weeks of survival, the disorganization within the Amb is the largest, but the number of motoneurons is similar to control, and all animals recovered the movement of the left vocal fold. Our additional controls indicate that no tracer spread to the CT muscle occurred, and that many of the labeled motoneurons from the PCA after 1 week post-RLN injury correspond to motoneurons whose axons travel in the SLN. Therefore, it seems that after RLN injury there is a collateral sprouting and collateral innervation. Although the somatotopic organization of the Amb is lost after a crush injury of the RLN and does not recover in the times studied here, the movement of the vocal folds as well as the number of neurons that supply the TA and the PCA muscles recovered within 8 weeks, indicating that the central nervous system of the rat has a great capacity of plasticity. PMID:23444899
Communication Modality Sampling for a Toddler with Angelman Syndrome
ERIC Educational Resources Information Center
Martin, Jolene Hyppa; Reichle, Joe; Dimian, Adele; Chen, Mo
2013-01-01
Purpose: Vocal, gestural, and graphic communication modes were implemented concurrently with a toddler with Angelman syndrome to identify the most efficiently learned communication mode to emphasize in an initial augmentative communication system. Method: Symbols representing preferred objects were introduced in vocal, gestural, and graphic…
Analysis of human scream and its impact on text-independent speaker verification.
Hansen, John H L; Nandwana, Mahesh Kumar; Shokouhi, Navid
2017-04-01
Scream is defined as sustained, high-energy vocalizations that lack phonological structure. Lack of phonological structure is how scream is identified from other forms of loud vocalization, such as "yell." This study investigates the acoustic aspects of screams and addresses those that are known to prevent standard speaker identification systems from recognizing the identity of screaming speakers. It is well established that speaker variability due to changes in vocal effort and Lombard effect contribute to degraded performance in automatic speech systems (i.e., speech recognition, speaker identification, diarization, etc.). However, previous research in the general area of speaker variability has concentrated on human speech production, whereas less is known about non-speech vocalizations. The UT-NonSpeech corpus is developed here to investigate speaker verification from scream samples. This study considers a detailed analysis in terms of fundamental frequency, spectral peak shift, frame energy distribution, and spectral tilt. It is shown that traditional speaker recognition based on the Gaussian mixture models-universal background model framework is unreliable when evaluated with screams.
Recording vocalizations with Bluetooth technology.
Gaona-González, Andrés; Santillán-Doherty, Ana María; Arenas-Rosas, Rita Virginia; Muñoz-Delgado, Jairo; Aguillón-Pantaleón, Miguel Angel; Ordoñez-Gómez, José Domingo; Márquez-Arias, Alejandra
2011-06-01
We propose a method for capturing vocalizations that is designed to avoid some of the limiting factors found in traditional bioacoustical methods, such as the impossibility of obtaining continuous long-term registers or analyzing amplitude due to the continuous change of distance between the subject and the position of the recording system. Using Bluetooth technology, vocalizations are captured and transmitted wirelessly into a receiving system without affecting the quality of the signal. The recordings of the proposed system were compared to those obtained as a reference, which were based on the coding of the signal with the so-called pulse-code modulation technique in WAV audio format without any compressing process. The evaluation showed p < .05 for the measured quantitative and qualitative parameters. We also describe how the transmitting system is encapsulated and fixed on the animal and a way to video record a spider monkey's behavior simultaneously with the audio recordings.
Acoustic detection of manatee vocalizations
NASA Astrophysics Data System (ADS)
Niezrecki, Christopher; Phillips, Richard; Meyer, Michael; Beusse, Diedrich O.
2003-09-01
The West Indian manatee (trichechus manatus latirostris) has become endangered partly because of a growing number of collisions with boats. A system to warn boaters of the presence of manatees, that can signal to boaters that manatees are present in the immediate vicinity, could potentially reduce these boat collisions. In order to identify the presence of manatees, acoustic methods are employed. Within this paper, three different detection algorithms are used to detect the calls of the West Indian manatee. The detection systems are tested in the laboratory using simulated manatee vocalizations from an audio compact disk. The detection method that provides the best overall performance is able to correctly identify ~96% of the manatee vocalizations. However, the system also results in a false alarm rate of ~16%. The results of this work may ultimately lead to the development of a manatee warning system that can warn boaters of the presence of manatees.
Zarate, Jean Mary
2013-01-01
Singing provides a unique opportunity to examine music performance—the musical instrument is contained wholly within the body, thus eliminating the need for creating artificial instruments or tasks in neuroimaging experiments. Here, more than two decades of voice and singing research will be reviewed to give an overview of the sensory-motor control of the singing voice, starting from the vocal tract and leading up to the brain regions involved in singing. Additionally, to demonstrate how sensory feedback is integrated with vocal motor control, recent functional magnetic resonance imaging (fMRI) research on somatosensory and auditory feedback processing during singing will be presented. The relationship between the brain and singing behavior will be explored also by examining: (1) neuroplasticity as a function of various lengths and types of training, (2) vocal amusia due to a compromised singing network, and (3) singing performance in individuals with congenital amusia. Finally, the auditory-motor control network for singing will be considered alongside dual-stream models of auditory processing in music and speech to refine both these theoretical models and the singing network itself. PMID:23761746
NASA Astrophysics Data System (ADS)
Rogers, Dylan; Wei, Nathaniel; Ringenber, Hunter; Krane, Michael; Wei, Timothy
2017-11-01
This study builds on the parallel presentation of Ringenberg, et al. (APS-DFD 2017) involving simultaneous, temporally and spatially resolved flow and pressure measurements in a scaled-up vocal fold model. In this talk, data from experiments replicating characteristics of diseased vocal folds are presented. This begins with vocal folds that do not fully close and continues with asymmetric oscillations. Data are compared to symmetric, i.e. `healthy', oscillatory motions presented in the companion talk. Having pressure and flow data for individual as well as phase averaged oscillations for these diseased cases highlights the potential for aeroacoustic analysis in this complex system. Supported by NIH Grant No. 2R01 DC005642-11.
Effect of the losses in the vocal tract on determination of the area function.
Gülmezoğlu, M Bilginer; Barkana, Atalay
2003-01-01
In this work, the cross-sectional areas of the vocal tract are determined for the lossy and lossless cases by using the pole-zero models obtained from the electrical equivalent circuit model of the vocal tract and the system identification method. The cross-sectional areas are used to compare the lossy and lossless cases. In the lossy case, the internal losses due to wall vibration, heat conduction, air friction and viscosity are considered, that is, the complex poles and zeros obtained from the models are used directly. Whereas, in the lossless case, only the imaginary parts of these poles and zeros are used. The vocal tract shapes obtained for the lossy case are close to the actual ones.
Coppoolse, Jiska M. S.; Van Kooten, T. G.; Heris, Hossein K.; Mongeau, Luc; Li, Nicole Y. K.; Thibeault, Susan L.; Pitaro, Jacob; Akinpelu, Olubunmi; Daniel, Sam J.
2016-01-01
Purpose The objective of this study was to investigate local injection with a hierarchically microstructured hyaluronic acid–gelatin (HA-Ge) hydrogel for the treatment of acute vocal fold injury using a rat model. Method Vocal fold stripping was performed unilaterally in 108 Sprague-Dawley rats. A volume of 25 ml saline (placebo controls), HA-bulk, or HA-Ge hydrogel was injected into the lamina propria (LP) 5 days after surgery. The vocal folds were harvested at 3, 14, and 28 days after injection and analyzed using hematoxylin and eosin staining and immunohistochemistry staining for macrophages, myofibroblasts, elastin, collagen type I, and collagen type III. Results The macrophage count was statistically significantly lower in the HA-Ge group than in the saline group (p < .05) at Day 28. Results suggested that the HA-Ge injection did not induce inflammatory or rejection response. Myofibroblast counts and elastin were statistically insignificant across treatment groups at all time points. Increased elastin deposition was qualitatively observed in both HA groups from Day 3 to Day 28, and not in the saline group. Significantly more elastin was observed in the HA-bulk group than in the uninjured group at Day 28. Significantly more collagen type I was observed in the HA-bulk and HA-Ge groups than in the saline group (p < .05) at Day 28. The collagen type I concentration in the HA-Ge and saline groups was found to be comparable to that in the uninjured controls at Day 28. The concentration of collagen type III in all treatment groups was similar to that in uninjured controls at Day 28. Conclusion Local HA-Ge and HA-bulk injections for acute injured vocal folds were biocompatible and did not induce adverse response. PMID:24687141
Galindo, Gabriel E.; Peterson, Sean D.; Erath, Byron D.; Castro, Christian; Hillman, Robert E.
2017-01-01
Purpose Our goal was to test prevailing assumptions about the underlying biomechanical and aeroacoustic mechanisms associated with phonotraumatic lesions of the vocal folds using a numerical lumped-element model of voice production. Method A numerical model with a triangular glottis, posterior glottal opening, and arytenoid posturing is proposed. Normal voice is altered by introducing various prephonatory configurations. Potential compensatory mechanisms (increased subglottal pressure, muscle activation, and supraglottal constriction) are adjusted to restore an acoustic target output through a control loop that mimics a simplified version of auditory feedback. Results The degree of incomplete glottal closure in both the membranous and posterior portions of the folds consistently leads to a reduction in sound pressure level, fundamental frequency, harmonic richness, and harmonics-to-noise ratio. The compensatory mechanisms lead to significantly increased vocal-fold collision forces, maximum flow-declination rate, and amplitude of unsteady flow, without significantly altering the acoustic output. Conclusion Modeling provided potentially important insights into the pathophysiology of phonotraumatic vocal hyperfunction by demonstrating that compensatory mechanisms can counteract deterioration in the voice acoustic signal due to incomplete glottal closure, but this also leads to high vocal-fold collision forces (reflected in aerodynamic measures), which significantly increases the risk of developing phonotrauma. PMID:28837719
Meaning in the avian auditory cortex: Neural representation of communication calls
Elie, Julie E; Theunissen, Frédéric E
2014-01-01
Understanding how the brain extracts the behavioral meaning carried by specific vocalization types that can be emitted by various vocalizers and in different conditions is a central question in auditory research. This semantic categorization is a fundamental process required for acoustic communication and presupposes discriminative and invariance properties of the auditory system for conspecific vocalizations. Songbirds have been used extensively to study vocal learning, but the communicative function of all their vocalizations and their neural representation has yet to be examined. In our research, we first generated a library containing almost the entire zebra finch vocal repertoire and organized communication calls along 9 different categories based on their behavioral meaning. We then investigated the neural representations of these semantic categories in the primary and secondary auditory areas of 6 anesthetized zebra finches. To analyze how single units encode these call categories, we described neural responses in terms of their discrimination, selectivity and invariance properties. Quantitative measures for these neural properties were obtained using an optimal decoder based both on spike counts and spike patterns. Information theoretic metrics show that almost half of the single units encode semantic information. Neurons achieve higher discrimination of these semantic categories by being more selective and more invariant. These results demonstrate that computations necessary for semantic categorization of meaningful vocalizations are already present in the auditory cortex and emphasize the value of a neuro-ethological approach to understand vocal communication. PMID:25728175
Xu, Chet C; Chan, Roger W; Sun, Han; Zhan, Xiaowei
2017-11-01
A mixed-effects model approach was introduced in this study for the statistical analysis of rheological data of vocal fold tissues, in order to account for the data correlation caused by multiple measurements of each tissue sample across the test frequency range. Such data correlation had often been overlooked in previous studies in the past decades. The viscoelastic shear properties of the vocal fold lamina propria of two commonly used laryngeal research animal species (i.e. rabbit, porcine) were measured by a linear, controlled-strain simple-shear rheometer. Along with published canine and human rheological data, the vocal fold viscoelastic shear moduli of these animal species were compared to those of human over a frequency range of 1-250Hz using the mixed-effects models. Our results indicated that tissues of the rabbit, canine and porcine vocal fold lamina propria were significantly stiffer and more viscous than those of human. Mixed-effects models were shown to be able to more accurately analyze rheological data generated from repeated measurements. Copyright © 2017 Elsevier Ltd. All rights reserved.
Long-term memory of heterospecific vocalizations by African lions
NASA Astrophysics Data System (ADS)
Grinnell, Jon; van Dyk, Gus; Slotow, Rob
2005-09-01
Animals that use and evaluate long-distance signals have the potential to glean valuable information about others in their environment via eavesdropping. In those areas where they coexist, African lions (Panthera leo) are a significant eavesdropper on spotted hyenas (Crocuta crocuta), often using hyena vocalizations to locate and scavenge from hyena kills. This relationship was used to test African lions' long-term memory of the vocalizations of spotted hyenas via playback experiments. Hyena whoops and a control sound (Canis lupus howls) were played to three populations of lions in South Africa: (1) lions with past experience of spotted hyenas; (2) lions with current experience; and (3) lions with no experience. The results strongly suggest that lions have the cognitive ability to remember the vocalizations of spotted hyenas even after 10 years with no contact of any kind with them. Such long-term memory of heterospecific vocalizations may be widespread in species that gain fitness benefits from eavesdropping on others, but where such species are sympatric and often interact it may pass unrecognized as short-term memory instead.
Glottal aerodynamics in compliant, life-sized vocal fold models
NASA Astrophysics Data System (ADS)
McPhail, Michael; Dowell, Grant; Krane, Michael
2013-11-01
This talk presents high-speed PIV measurements in compliant, life-sized models of the vocal folds. A clearer understanding of the fluid-structure interaction of voiced speech, how it produces sound, and how it varies with pathology is required to improve clinical diagnosis and treatment of vocal disorders. Physical models of the vocal folds can answer questions regarding the fundamental physics of speech, as well as the ability of clinical measures to detect the presence and extent of disorder. Flow fields were recorded in the supraglottal region of the models to estimate terms in the equations of fluid motion, and their relative importance. Experiments were conducted over a range of driving pressures with flow rates, given by a ball flowmeter, and subglottal pressures, given by a micro-manometer, reported for each case. Imaging of vocal fold motion, vector fields showing glottal jet behavior, and terms estimated by control volume analysis will be presented. The use of these results for a comparison with clinical measures, and for the estimation of aeroacoustic source strengths will be discussed. Acknowledge support from NIH R01 DC005642.
Mouse Vocal Communication System: Are Ultrasounds Learned or Innate?
ERIC Educational Resources Information Center
Arriaga, Gustavo; Jarvis, Erich D.
2013-01-01
Mouse ultrasonic vocalizations (USVs) are often used as behavioral readouts of internal states, to measure effects of social and pharmacological manipulations, and for behavioral phenotyping of mouse models for neuropsychiatric and neurodegenerative disorders. However, little is known about the neurobiological mechanisms of rodent USV production.…
ERP Correlates of Language-Specific Processing of Auditory Pitch Feedback during Self-Vocalization
ERIC Educational Resources Information Center
Chen, Zhaocong; Liu, Peng; Wang, Emily Q.; Larson, Charles R.; Huang, Dongfeng; Liu, Hanjun
2012-01-01
The present study investigated whether the neural correlates for auditory feedback control of vocal pitch can be shaped by tone language experience. Event-related potentials (P2/N1) were recorded from adult native speakers of Mandarin and Cantonese who heard their voice auditory feedback shifted in pitch by -50, -100, -200, or -500 cents when they…
A Randomized Controlled Trial of Two Semi-Occluded Vocal Tract Voice Therapy Protocols
ERIC Educational Resources Information Center
Kapsner-Smith, Mara R.; Hunter, Eric J.; Kirkham, Kimberly; Cox, Karin; Titze, Ingo R.
2015-01-01
Purpose: Although there is a long history of use of semi-occluded vocal tract gestures in voice therapy, including phonation through thin tubes or straws, the efficacy of phonation through tubes has not been established. This study compares results from a therapy program on the basis of phonation through a flow-resistant tube (FRT) with Vocal…
ERIC Educational Resources Information Center
Miliotis, Adriane; Sidener, Tina M.; Reeve, Kenneth F.; Carbone, Vincent; Sidener, David W.; Rader, Lisa; Delmolino, Lara
2012-01-01
Stimulus-stimulus pairing (SSP) of vocalizations pairs the speech of others with the delivery of highly preferred items. The goal of this procedure is to produce a temporary increase in vocalizations, thus creating a larger variety of sounds that can subsequently be brought under appropriate stimulus control (Esch, Carr, & Grow, 2009). In this…
Deguchi, Shinji; Kawashima, Kazutaka; Washio, Seiichi
2008-12-01
The effect of artificially altered transglottal pressures on the voice fundamental frequency (F0) is known to be associated with vocal fold stiffness. Its measurement, though useful as a potential diagnostic tool for noncontact assessment of vocal fold stiffness, often requires manual and painstaking determination of an unstable F0 of voice. Here, we provide a computer-aided technique that enables one to carry out the determination easily and accurately. Human subjects vocalized in accordance with a series of reference sounds from a speaker controlled by a computer. Transglottal pressures were altered by means of a valve embedded in a mouthpiece. Time-varying vocal F0 was extracted, without manual procedures, from a specific range of the voice spectrum determined on the basis of the controlled reference sounds. The validity of the proposed technique was assessed for 11 healthy subjects. Fluctuating voice F0 was tracked automatically during experiments, providing the relationship between transglottal pressure change and F0 on the computer. The proposed technique overcomes the difficulty in automatic determination of the voice F0, which tends to be transient both in normal voice and in some types of pathological voice.
Case-control study of risk factors for spasmodic dysphonia: A comparison with other voice disorders.
Tanner, Kristine; Roy, Nelson; Merrill, Ray M; Sauder, Cara; Houtz, Daniel R; Smith, Marshall E
2012-05-01
This epidemiology study examined risk factors uniquely associated with spasmodic dysphonia (SD). Case-control. A questionnaire was administered to 150 patients with SD (with and without coexisting vocal tremor) and 136 patients with other structural, neurological, and functional voice disorders (excluding SD and vocal tremor). Questions included personal and family medical histories, environmental exposures, trauma, illnesses, voice use habits, and the Short Form 36. Several factors were uniquely associated with SD (α = .05), including: 1) a personal history of cervical dystonia, sinus and throat illnesses, mumps, rubella, dust exposure, and frequent volunteer voice use, 2) a family history of voice disorders, 3) an immediate family history of vocal tremor and meningitis, and 4) an extended family history of head and neck tremor, ocular disease, and meningitis. Vocal tremor coexisted with SD in 29% of cases. Measles and mumps vaccines were protective for SD. SD is likely multifactorial and associated with several endogenous and exogenous factors. Certain viral exposures, voice use patterns, and familial neurological conditions may contribute to the onset of SD later in life. Copyright © 2011 The American Laryngological, Rhinological, and Otological Society, Inc.
Bottalico, Pasquale; Graetzer, Simone; Hunter, Eric J.
2015-01-01
Speakers adjust their vocal effort when communicating in different room acoustic and noise conditions and when instructed to speak at different volumes. The present paper reports on the effects of voice style, noise level, and acoustic feedback on vocal effort, evaluated as sound pressure level, and self-reported vocal fatigue, comfort, and control. Speakers increased their level in the presence of babble and when instructed to talk in a loud style, and lowered it when acoustic feedback was increased and when talking in a soft style. Self-reported responses indicated a preference for the normal style without babble noise. PMID:26723357
Disrupting vagal feedback affects birdsong motor control.
Méndez, Jorge M; Dall'asén, Analía G; Goller, Franz
2010-12-15
Coordination of different motor systems for sound production involves the use of feedback mechanisms. Song production in oscines is a well-established animal model for studying learned vocal behavior. Whereas the online use of auditory feedback has been studied in the songbird model, very little is known about the role of other feedback mechanisms. Auditory feedback is required for the maintenance of stereotyped adult song. In addition, the use of somatosensory feedback to maintain pressure during song has been demonstrated with experimentally induced fluctuations in air sac pressure. Feedback information mediating this response is thought to be routed to the central nervous system via afferent fibers of the vagus nerve. Here, we tested the effects of unilateral vagotomy on the peripheral motor patterns of song production and the acoustic features. Unilateral vagotomy caused a variety of disruptions and alterations to the respiratory pattern of song, some of which affected the acoustic structure of vocalizations. These changes were most pronounced a few days after nerve resection and varied between individuals. In the most extreme cases, the motor gestures of respiration were so severely disrupted that individual song syllables or the song motif were atypically terminated. Acoustic changes also suggest altered use of the two sound generators and upper vocal tract filtering, indicating that the disruption of vagal feedback caused changes to the motor program of all motor systems involved in song production and modification. This evidence for the use of vagal feedback by the song system with disruption of song during the first days after nerve cut provides a contrast to the longer-term effects of auditory feedback disruption. It suggests a significant role for somatosensory feedback that differs from that of auditory feedback.
Disrupting vagal feedback affects birdsong motor control
Méndez, Jorge M.; Dall'Asén, Analía G.; Goller, Franz
2010-01-01
Coordination of different motor systems for sound production involves the use of feedback mechanisms. Song production in oscines is a well-established animal model for studying learned vocal behavior. Whereas the online use of auditory feedback has been studied in the songbird model, very little is known about the role of other feedback mechanisms. Auditory feedback is required for the maintenance of stereotyped adult song. In addition, the use of somatosensory feedback to maintain pressure during song has been demonstrated with experimentally induced fluctuations in air sac pressure. Feedback information mediating this response is thought to be routed to the central nervous system via afferent fibers of the vagus nerve. Here, we tested the effects of unilateral vagotomy on the peripheral motor patterns of song production and the acoustic features. Unilateral vagotomy caused a variety of disruptions and alterations to the respiratory pattern of song, some of which affected the acoustic structure of vocalizations. These changes were most pronounced a few days after nerve resection and varied between individuals. In the most extreme cases, the motor gestures of respiration were so severely disrupted that individual song syllables or the song motif were atypically terminated. Acoustic changes also suggest altered use of the two sound generators and upper vocal tract filtering, indicating that the disruption of vagal feedback caused changes to the motor program of all motor systems involved in song production and modification. This evidence for the use of vagal feedback by the song system with disruption of song during the first days after nerve cut provides a contrast to the longer-term effects of auditory feedback disruption. It suggests a significant role for somatosensory feedback that differs from that of auditory feedback. PMID:21113000
Gelada vocal sequences follow Menzerath’s linguistic law
Gustison, Morgan L.; Semple, Stuart; Ferrer-i-Cancho, Ramon; Bergman, Thore J.
2016-01-01
Identifying universal principles underpinning diverse natural systems is a key goal of the life sciences. A powerful approach in addressing this goal has been to test whether patterns consistent with linguistic laws are found in nonhuman animals. Menzerath’s law is a linguistic law that states that, the larger the construct, the smaller the size of its constituents. Here, to our knowledge, we present the first evidence that Menzerath’s law holds in the vocal communication of a nonhuman species. We show that, in vocal sequences of wild male geladas (Theropithecus gelada), construct size (sequence size in number of calls) is negatively correlated with constituent size (duration of calls). Call duration does not vary significantly with position in the sequence, but call sequence composition does change with sequence size and most call types are abbreviated in larger sequences. We also find that intercall intervals follow the same relationship with sequence size as do calls. Finally, we provide formal mathematical support for the idea that Menzerath’s law reflects compression—the principle of minimizing the expected length of a code. Our findings suggest that a common principle underpins human and gelada vocal communication, highlighting the value of exploring the applicability of linguistic laws in vocal systems outside the realm of language. PMID:27091968
Demopoulos, Carly; Hopkins, Joyce; Kopald, Brandon E; Paulson, Kim; Doyle, Lauren; Andrews, Whitney E; Lewine, Jeffrey David
2015-11-01
The primary aim of this study was to examine whether there is an association between magnetoencephalography-based (MEG) indices of basic cortical auditory processing and vocal affect recognition (VAR) ability in individuals with autism spectrum disorder (ASD). MEG data were collected from 25 children/adolescents with ASD and 12 control participants using a paired-tone paradigm to measure quality of auditory physiology, sensory gating, and rapid auditory processing. Group differences were examined in auditory processing and vocal affect recognition ability. The relationship between differences in auditory processing and vocal affect recognition deficits was examined in the ASD group. Replicating prior studies, participants with ASD showed longer M1n latencies and impaired rapid processing compared with control participants. These variables were significantly related to VAR, with the linear combination of auditory processing variables accounting for approximately 30% of the variability after controlling for age and language skills in participants with ASD. VAR deficits in ASD are typically interpreted as part of a core, higher order dysfunction of the "social brain"; however, these results suggest they also may reflect basic deficits in auditory processing that compromise the extraction of socially relevant cues from the auditory environment. As such, they also suggest that therapeutic targeting of sensory dysfunction in ASD may have additional positive implications for other functional deficits. (c) 2015 APA, all rights reserved).
Merullo, Devin P; Cordes, Melissa A; Susan DeVries, M; Stevenson, Sharon A; Riters, Lauren V
2015-11-01
Vocalizations coordinate social interactions in many species and often are important for behaviors such as mate attraction or territorial defense. Although the neural circuitry underlying vocal communication is well-known for some animal groups, such as songbirds, the motivational processes that regulate vocal signals are not as clearly understood. Neurotensin (NT) is a neuropeptide implicated in motivation that can modulate the activity of dopaminergic neurons. Dopaminergic projections from the ventral tegmental area (VTA) are key to mediating highly motivated, goal-directed behaviors, including sexually-motivated birdsong. However, the role of NT in modifying vocal communication or other social behaviors has not been well-studied. Here in European starlings (Sturnus vulgaris) we analyzed relationships between sexually-motivated song and NT and NT1 receptor (NTSR1) expression in VTA. Additionally, we examined NT and NTSR1 expression in four regions that receive dopaminergic projections from VTA and are involved in courtship song: the medial preoptic nucleus (POM), the lateral septum (LS), Area X, and HVC. Relationships between NT and NTSR1 expression and non-vocal courtship and agonistic behaviors were also examined. NT expression in Area X positively related to sexually-motivated song production. NT expression in POM positively correlated with non-vocal courtship behavior and agonistic behavior. NT expression in POM was greatest in males owning nesting sites, and the opposite pattern was observed for NTSR1 expression in LS. These results are the first to implicate NT in Area X in birdsong, and further highlight NT as a potential neuromodulator for the control of vocal communication and other social behaviors. Copyright © 2015 Elsevier Inc. All rights reserved.
Plotsky, K; Rendall, D; Riede, T; Chase, K
2013-09-01
Body size is an important determinant of resource and mate competition in many species. Competition is often mediated by conspicuous vocal displays, which may help to intimidate rivals and attract mates by providing honest cues to signaler size. Fitch proposed that vocal tract resonances (or formants) should provide particularly good, or honest, acoustic cues to signaler size because they are determined by the length of the vocal tract, which in turn, is hypothesized to scale reliably with overall body size. There is some empirical support for this hypothesis, but to date, many of the effects have been either mixed for males compared with females, weaker than expected in one or the other sex, or complicated by sampling issues. In this paper, we undertake a direct test of Fitch's hypothesis in two canid species using large samples that control for age- and sex-related variation. The samples involved radiographic images of 120 Portuguese water dogs Canis lupus familiaris and 121 Russian silver foxes Vulpes vulpes . Direct measurements were made of vocal tract length from X-ray images and compared against independent measures of body size. In adults of both species, and within both sexes, overall vocal tract length was strongly and significantly correlated with body size. Effects were strongest for the oral component of the vocal tract. By contrast, the length of the pharyngeal component was not as consistently related to body size. These outcomes are some of the clearest evidence to date in support of Fitch's hypothesis. At the same time, they highlight the potential for elements of both honest and deceptive body signaling to occur simultaneously via differential acoustic cues provided by the oral versus pharyngeal components of the vocal tract.
Plotsky, K.; Rendall, D.; Riede, T.; Chase, K.
2013-01-01
Body size is an important determinant of resource and mate competition in many species. Competition is often mediated by conspicuous vocal displays, which may help to intimidate rivals and attract mates by providing honest cues to signaler size. Fitch proposed that vocal tract resonances (or formants) should provide particularly good, or honest, acoustic cues to signaler size because they are determined by the length of the vocal tract, which in turn, is hypothesized to scale reliably with overall body size. There is some empirical support for this hypothesis, but to date, many of the effects have been either mixed for males compared with females, weaker than expected in one or the other sex, or complicated by sampling issues. In this paper, we undertake a direct test of Fitch’s hypothesis in two canid species using large samples that control for age- and sex-related variation. The samples involved radiographic images of 120 Portuguese water dogs Canis lupus familiaris and 121 Russian silver foxes Vulpes vulpes. Direct measurements were made of vocal tract length from X-ray images and compared against independent measures of body size. In adults of both species, and within both sexes, overall vocal tract length was strongly and significantly correlated with body size. Effects were strongest for the oral component of the vocal tract. By contrast, the length of the pharyngeal component was not as consistently related to body size. These outcomes are some of the clearest evidence to date in support of Fitch’s hypothesis. At the same time, they highlight the potential for elements of both honest and deceptive body signaling to occur simultaneously via differential acoustic cues provided by the oral versus pharyngeal components of the vocal tract. PMID:24363497
Rhythmic synchronization tapping to an audio–visual metronome in budgerigars
Hasegawa, Ai; Okanoya, Kazuo; Hasegawa, Toshikazu; Seki, Yoshimasa
2011-01-01
In all ages and countries, music and dance have constituted a central part in human culture and communication. Recently, vocal-learning animals such as parrots and elephants have been found to share rhythmic ability with humans. Thus, we investigated the rhythmic synchronization of budgerigars, a vocal-mimicking parrot species, under controlled conditions and a systematically designed experimental paradigm as a first step in understanding the evolution of musical entrainment. We trained eight budgerigars to perform isochronous tapping tasks in which they pecked a key to the rhythm of audio–visual metronome-like stimuli. The budgerigars showed evidence of entrainment to external stimuli over a wide range of tempos. They seemed to be inherently inclined to tap at fast tempos, which have a similar time scale to the rhythm of budgerigars' natural vocalizations. We suggest that vocal learning might have contributed to their performance, which resembled that of humans. PMID:22355637
Rhythmic synchronization tapping to an audio-visual metronome in budgerigars.
Hasegawa, Ai; Okanoya, Kazuo; Hasegawa, Toshikazu; Seki, Yoshimasa
2011-01-01
In all ages and countries, music and dance have constituted a central part in human culture and communication. Recently, vocal-learning animals such as parrots and elephants have been found to share rhythmic ability with humans. Thus, we investigated the rhythmic synchronization of budgerigars, a vocal-mimicking parrot species, under controlled conditions and a systematically designed experimental paradigm as a first step in understanding the evolution of musical entrainment. We trained eight budgerigars to perform isochronous tapping tasks in which they pecked a key to the rhythm of audio-visual metronome-like stimuli. The budgerigars showed evidence of entrainment to external stimuli over a wide range of tempos. They seemed to be inherently inclined to tap at fast tempos, which have a similar time scale to the rhythm of budgerigars' natural vocalizations. We suggest that vocal learning might have contributed to their performance, which resembled that of humans.
A Neural Code That Is Isometric to Vocal Output and Correlates with Its Sensory Consequences
Vyssotski, Alexei L.; Stepien, Anna E.; Keller, Georg B.; Hahnloser, Richard H. R.
2016-01-01
What cortical inputs are provided to motor control areas while they drive complex learned behaviors? We study this question in the nucleus interface of the nidopallium (NIf), which is required for normal birdsong production and provides the main source of auditory input to HVC, the driver of adult song. In juvenile and adult zebra finches, we find that spikes in NIf projection neurons precede vocalizations by several tens of milliseconds and are insensitive to distortions of auditory feedback. We identify a local isometry between NIf output and vocalizations: quasi-identical notes produced in different syllables are preceded by highly similar NIf spike patterns. NIf multiunit firing during song precedes responses in auditory cortical neurons by about 50 ms, revealing delayed congruence between NIf spiking and a neural representation of auditory feedback. Our findings suggest that NIf codes for imminent acoustic events within vocal performance. PMID:27723764
A neural circuit mechanism for regulating vocal variability during song learning in zebra finches.
Garst-Orozco, Jonathan; Babadi, Baktash; Ölveczky, Bence P
2014-12-15
Motor skill learning is characterized by improved performance and reduced motor variability. The neural mechanisms that couple skill level and variability, however, are not known. The zebra finch, a songbird, presents a unique opportunity to address this question because production of learned song and induction of vocal variability are instantiated in distinct circuits that converge on a motor cortex analogue controlling vocal output. To probe the interplay between learning and variability, we made intracellular recordings from neurons in this area, characterizing how their inputs from the functionally distinct pathways change throughout song development. We found that inputs that drive stereotyped song-patterns are strengthened and pruned, while inputs that induce variability remain unchanged. A simple network model showed that strengthening and pruning of action-specific connections reduces the sensitivity of motor control circuits to variable input and neural 'noise'. This identifies a simple and general mechanism for learning-related regulation of motor variability.
Systemic Hydration: Relating Science to Clinical Practice in Vocal Health
Hartley, Naomi A.; Thibeault, Susan L.
2014-01-01
Objectives To examine the current state of the science regarding the role of systemic hydration in vocal function and health. Study Design Literature Review Methods Literature search spanning multiple disciplines, including speech-language pathology, nutrition and dietetics, medicine, sports and exercise science, physiology and biomechanics. Results The relationship between hydration and physical function is an area of common interest amongst multiple professions. Each discipline provides valuable insight into the connection between performance and water balance, as well as complimentary methods of investigation. Existing voice literature suggests a relationship between hydration and voice production, however the underlying mechanisms are not yet defined and a treatment effect for systemic hydration remains to be demonstrated. Literature from other disciplines sheds light on methodological shortcomings and in some cases offers an alternative explanation for observed phenomena. Conclusions A growing body of literature in the field of voice science is documenting a relationship between hydration and vocal function, however greater understanding is required to guide best practice in the maintenance of vocal health and management of voice disorders. Integration of knowledge and technical expertise from multiple disciplines facilitates analysis of existing literature and provides guidance as to future research. PMID:24880674
Pre-Lingual Communication and Attachment Behavior.
ERIC Educational Resources Information Center
Modarressi, Taghi; McCulloch, Duncan
Infant's crying may have an important mediating role in the formation of attachment behavior. The earliest vocalizations are discussed in terms of an acoustic communications model in which the baby's vocal repertoire becomes incorporated into a closed-loop, feedback system with his mother. Certain pre-lingual "signals" may be associated with those…
Le Roux, Aliza; Cherry, Michael I; Manser, Marta B
2009-05-01
We describe the vocal repertoire of a facultatively social carnivore, the yellow mongoose, Cynictis penicillata. Using a combination of close-range observations, recordings and experiments with simulated predators, we were able to obtain clear descriptions of call structure and function for a wide range of calls used by this herpestid. The vocal repertoire of the yellow mongooses comprised ten call types, half of which were used in appeasing or fearful contexts and half in aggressive interactions. Data from this study suggest that the yellow mongoose uses an urgency-based alarm calling system, indicating high and low urgency through two distinct call types. Compared to solitary mongooses, the yellow mongoose has a large proportion of 'friendly' vocalisations that enhance group cohesion, but its vocal repertoire is smaller and less context-specific than those of obligate social species. This study of the vocal repertoire of the yellow mongoose is, to our knowledge, the most complete to have been conducted on a facultatively social species in its natural habitat.
NASA Astrophysics Data System (ADS)
Le Roux, Aliza; Cherry, Michael I.; Manser, Marta B.
2009-05-01
We describe the vocal repertoire of a facultatively social carnivore, the yellow mongoose, Cynictis penicillata. Using a combination of close-range observations, recordings and experiments with simulated predators, we were able to obtain clear descriptions of call structure and function for a wide range of calls used by this herpestid. The vocal repertoire of the yellow mongooses comprised ten call types, half of which were used in appeasing or fearful contexts and half in aggressive interactions. Data from this study suggest that the yellow mongoose uses an urgency-based alarm calling system, indicating high and low urgency through two distinct call types. Compared to solitary mongooses, the yellow mongoose has a large proportion of ‘friendly’ vocalisations that enhance group cohesion, but its vocal repertoire is smaller and less context-specific than those of obligate social species. This study of the vocal repertoire of the yellow mongoose is, to our knowledge, the most complete to have been conducted on a facultatively social species in its natural habitat.
Misono, Stephanie; Meredith, Liza; Peterson, Carol B; Frazier, Patricia A
2016-03-01
Although an association between psychosocial distress (depression, anxiety, somatization, and perceived stress) and voice disorders has been observed, little is known about the relationship between distress and patient-reported voice handicap. Furthermore, the psychological mechanisms underlying this relationship are poorly understood. Perceived control plays an important role in distress associated with other medical disorders. The objectives of this study were to (1) characterize the relationship between distress and patient-reported voice handicap and (2) examine the role of perceived control in this relationship. This is a cross-sectional study in a tertiary care academic voice clinic. Distress, perceived stress, voice handicap, and perceived control were measured using established assessment scales. Association was measured with Pearson correlation coefficients; moderation was assessed using multiple hierarchical regression. A total of 533 patients enrolled. Thirty-four percent of the patients met criteria for clinically significant distress (ie, depression, anxiety, and/or somatization). A weak association (r = 0.13; P = 0.003) was observed between severity of psychosocial distress and vocal handicap. Present perceived control was inversely associated with distress (r = -0.41; P < 0.0001), stress (r = -0.30; P < 0.0001), and voice handicap (r = -0.30; P < 0.0001). The relationship between voice handicap and psychosocial distress was moderated by perceived control (b for interaction term, -0.15; P < 0.001); greater vocal handicap was associated with greater distress in patients with low perceived control. Severity of distress and vocal handicap were positively related, and the relation between them was moderated by perceived control. Vocal handicap was more related to distress among those with low perceived control; targeting this potential mechanism may facilitate new approaches for improved care. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Yang, Anxiong; Stingl, Michael; Berry, David A.; Lohscheller, Jörg; Voigt, Daniel; Eysholdt, Ulrich; Döllinger, Michael
2011-01-01
With the use of an endoscopic, high-speed camera, vocal fold dynamics may be observed clinically during phonation. However, observation and subjective judgment alone may be insufficient for clinical diagnosis and documentation of improved vocal function, especially when the laryngeal disease lacks any clear morphological presentation. In this study, biomechanical parameters of the vocal folds are computed by adjusting the corresponding parameters of a three-dimensional model until the dynamics of both systems are similar. First, a mathematical optimization method is presented. Next, model parameters (such as pressure, tension and masses) are adjusted to reproduce vocal fold dynamics, and the deduced parameters are physiologically interpreted. Various combinations of global and local optimization techniques are attempted. Evaluation of the optimization procedure is performed using 50 synthetically generated data sets. The results show sufficient reliability, including 0.07 normalized error, 96% correlation, and 91% accuracy. The technique is also demonstrated on data from human hemilarynx experiments, in which a low normalized error (0.16) and high correlation (84%) values were achieved. In the future, this technique may be applied to clinical high-speed images, yielding objective measures with which to document improved vocal function of patients with voice disorders. PMID:21877808
Performance of a reduced-order FSI model for flow-induced vocal fold vibration
NASA Astrophysics Data System (ADS)
Chang, Siyuan; Luo, Haoxiang; Luo's lab Team
2016-11-01
Vocal fold vibration during speech production involves a three-dimensional unsteady glottal jet flow and three-dimensional nonlinear tissue mechanics. A full 3D fluid-structure interaction (FSI) model is computationally expensive even though it provides most accurate information about the system. On the other hand, an efficient reduced-order FSI model is useful for fast simulation and analysis of the vocal fold dynamics, which is often needed in procedures such as optimization and parameter estimation. In this work, we study the performance of a reduced-order model as compared with the corresponding full 3D model in terms of its accuracy in predicting the vibration frequency and deformation mode. In the reduced-order model, we use a 1D flow model coupled with a 3D tissue model. Two different hyperelastic tissue behaviors are assumed. In addition, the vocal fold thickness and subglottal pressure are varied for systematic comparison. The result shows that the reduced-order model provides consistent predictions as the full 3D model across different tissue material assumptions and subglottal pressures. However, the vocal fold thickness has most effect on the model accuracy, especially when the vocal fold is thin. Supported by the NSF.
Niebudek-Bogusz, Ewa; Sliwińska-Kowalska, Mariola
2006-01-01
An assessment of the vocal system, as a part of the medical certification of occupational diseases, should be objective and reliable. Therefore, interest in the method of acoustic voice analysis enabling objective assessment of voice parameters is still growing. The aim of the present study was to evaluate the applicability of acoustic analysis with vocal loading test to the diagnostics of occupational voice disorders. The results of acoustic voice analysis were compared using IRIS software for phoniatrics, before and after a 30-min vocal loading test in 35 female teachers with diagnosed occupational voice disorders (group I) and in 31 female teachers with functional dysphonia (group II). In group I, vocal effort produced significant abnormalities in voice acoustic parameters, compared to group II. These included significantly increased mean fundamental frequency (Fo) value (by 11 Hz) and worsened jitter, shimmer and NHR parameters. Also, the percentage of subjects showing abnormalities in voice acoustic analysis was higher in this group. Conducting voice acoustic analysis before and after the vocal loading test makes it possible to objectively confirm irreversible voice impairments in persons with work-related pathologies of the larynx, which is essential for medical certification of occupational voice diseases.
Social calls provide novel insights into the evolution of vocal learning
Sewall, Kendra B.; Young, Anna M.; Wright, Timothy F.
2016-01-01
Learned song is among the best-studied models of animal communication. In oscine songbirds, where learned song is most prevalent, it is used primarily for intrasexual selection and mate attraction. Learning of a different class of vocal signals, known as contact calls, is found in a diverse array of species, where they are used to mediate social interactions among individuals. We argue that call learning provides a taxonomically rich system for studying testable hypotheses for the evolutionary origins of vocal learning. We describe and critically evaluate four nonmutually exclusive hypotheses for the origin and current function of vocal learning of calls, which propose that call learning (1) improves auditory detection and recognition, (2) signals local knowledge, (3) signals group membership, or (4) allows for the encoding of more complex social information. We propose approaches to testing these four hypotheses but emphasize that all of them share the idea that social living, not sexual selection, is a central driver of vocal learning. Finally, we identify future areas for research on call learning that could provide new perspectives on the origins and mechanisms of vocal learning in both animals and humans. PMID:28163325
Nishigori, Hideo; Kagami, Keisuke; Nishigori, Hidekazu
2014-03-15
The effects of glucocorticoid receptor dysfunction during embryogenesis on the imprinting abilities and social behaviors of hatchlings were examined using "fertile hen's egg-embryo-chick" system. Of embryos treated with mifepristone (0.4μmol/egg) on day 14, over 75% hatched a day later than the controls (day 22) without external anomalies. The mifepristone-treated hatchlings were assayed for imprinting ability on post-hatching day 2 and for social behaviors on day 3. The findings were as follows: imprinting ability (expressed as preference score) was significantly lower in mifepristone-treated hatchlings than in controls (0.65±0.06 vs. 0.92±0.02, P<0.005). Aggregation tests to evaluate the speed (seconds) required for four chicks, individually isolated with cardboard dividers in a box, to form a group after removal of the barriers showed that aggregation was significantly slower in mifepristone-treated hatchlings than in controls (8.7±1.1 vs. 2.6±0.3, P<0.001). In belongingness tests to evaluate the speed (seconds) for a chick isolated at a corner to join a group of three chicks placed at the opposite corner, mifepristone-treated hatchlings took significantly longer than controls (4.5±0.4/40 cm vs. 2.4±0.08/40 cm, P<0.001). In vocalization tests, using a decibel meter to measure average decibel level/30s (chick vocalization), mifepristone-treated hatchlings had significantly weaker vocalizations than controls (14.2±1.9/30s vs. 26.4±1.3/30s P<0.001). In conclusion, glucocorticoid receptor dysfunction during the last week embryogenesis altered the programming of brain development, resulting in impaired behavioral activities in late life. Copyright © 2013 Elsevier B.V. All rights reserved.
Maciej, Peter; Ndao, Ibrahima; Hammerschmidt, Kurt; Fischer, Julia
2013-09-23
To understand the evolution of acoustic communication in animals, it is important to distinguish between the structure and the usage of vocal signals, since both aspects are subject to different constraints. In terrestrial mammals, the structure of calls is largely innate, while individuals have a greater ability to actively initiate or withhold calls. In closely related taxa, one would therefore predict a higher flexibility in call usage compared to call structure. In the present study, we investigated the vocal repertoire of free living Guinea baboons (Papio papio) and examined the structure and usage of the animals' vocal signals. Guinea baboons live in a complex multi-level social organization and exhibit a largely tolerant and affiliative social style, contrary to most other baboon taxa. To classify the vocal repertoire of male and female Guinea baboons, cluster analyses were used and focal observations were conducted to assess the usage of vocal signals in the particular contexts. In general, the vocal repertoire of Guinea baboons largely corresponded to the vocal repertoire other baboon taxa. The usage of calls, however, differed considerably from other baboon taxa and corresponded with the specific characteristics of the Guinea baboons' social behaviour. While Guinea baboons showed a diminished usage of contest and display vocalizations (a common pattern observed in chacma baboons), they frequently used vocal signals during affiliative and greeting interactions. Our study shows that the call structure of primates is largely unaffected by the species' social system (including grouping patterns and social interactions), while the usage of calls can be more flexibly adjusted, reflecting the quality of social interactions of the individuals. Our results support the view that the primary function of social signals is to regulate social interactions, and therefore the degree of competition and cooperation may be more important to explain variation in call usage than grouping patterns or group size.
2013-01-01
Background To understand the evolution of acoustic communication in animals, it is important to distinguish between the structure and the usage of vocal signals, since both aspects are subject to different constraints. In terrestrial mammals, the structure of calls is largely innate, while individuals have a greater ability to actively initiate or withhold calls. In closely related taxa, one would therefore predict a higher flexibility in call usage compared to call structure. In the present study, we investigated the vocal repertoire of free living Guinea baboons (Papio papio) and examined the structure and usage of the animals’ vocal signals. Guinea baboons live in a complex multi-level social organization and exhibit a largely tolerant and affiliative social style, contrary to most other baboon taxa. To classify the vocal repertoire of male and female Guinea baboons, cluster analyses were used and focal observations were conducted to assess the usage of vocal signals in the particular contexts. Results In general, the vocal repertoire of Guinea baboons largely corresponded to the vocal repertoire other baboon taxa. The usage of calls, however, differed considerably from other baboon taxa and corresponded with the specific characteristics of the Guinea baboons’ social behaviour. While Guinea baboons showed a diminished usage of contest and display vocalizations (a common pattern observed in chacma baboons), they frequently used vocal signals during affiliative and greeting interactions. Conclusions Our study shows that the call structure of primates is largely unaffected by the species’ social system (including grouping patterns and social interactions), while the usage of calls can be more flexibly adjusted, reflecting the quality of social interactions of the individuals. Our results support the view that the primary function of social signals is to regulate social interactions, and therefore the degree of competition and cooperation may be more important to explain variation in call usage than grouping patterns or group size. PMID:24059742
Animal vocal sequences: not the Markov chains we thought they were.
Kershenbaum, Arik; Bowles, Ann E; Freeberg, Todd M; Jin, Dezhe Z; Lameira, Adriano R; Bohn, Kirsten
2014-10-07
Many animals produce vocal sequences that appear complex. Most researchers assume that these sequences are well characterized as Markov chains (i.e. that the probability of a particular vocal element can be calculated from the history of only a finite number of preceding elements). However, this assumption has never been explicitly tested. Furthermore, it is unclear how language could evolve in a single step from a Markovian origin, as is frequently assumed, as no intermediate forms have been found between animal communication and human language. Here, we assess whether animal taxa produce vocal sequences that are better described by Markov chains, or by non-Markovian dynamics such as the 'renewal process' (RP), characterized by a strong tendency to repeat elements. We examined vocal sequences of seven taxa: Bengalese finches Lonchura striata domestica, Carolina chickadees Poecile carolinensis, free-tailed bats Tadarida brasiliensis, rock hyraxes Procavia capensis, pilot whales Globicephala macrorhynchus, killer whales Orcinus orca and orangutans Pongo spp. The vocal systems of most of these species are more consistent with a non-Markovian RP than with the Markovian models traditionally assumed. Our data suggest that non-Markovian vocal sequences may be more common than Markov sequences, which must be taken into account when evaluating alternative hypotheses for the evolution of signalling complexity, and perhaps human language origins. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Poor neuro-motor tuning of the human larynx: a comparison of sung and whistled pitch imitation
Johnson, Joseph F.; Kotz, Sonja A.
2018-01-01
Vocal imitation is a hallmark of human communication that underlies the capacity to learn to speak and sing. Even so, poor vocal imitation abilities are surprisingly common in the general population and even expert vocalists cannot match the precision of a musical instrument. Although humans have evolved a greater degree of control over the laryngeal muscles that govern voice production, this ability may be underdeveloped compared with control over the articulatory muscles, such as the tongue and lips, volitional control of which emerged earlier in primate evolution. Human participants imitated simple melodies by either singing (i.e. producing pitch with the larynx) or whistling (i.e. producing pitch with the lips and tongue). Sung notes were systematically biased towards each individual's habitual pitch, which we hypothesize may act to conserve muscular effort. Furthermore, while participants who sung more precisely also whistled more precisely, sung imitations were less precise than whistled imitations. The laryngeal muscles that control voice production are under less precise control than the oral muscles that are involved in whistling. This imprecision may be due to the relatively recent evolution of volitional laryngeal-motor control in humans, which may be tuned just well enough for the coarse modulation of vocal-pitch in speech. PMID:29765635
Stelzel, Christine; Schauenburg, Gesche; Rapp, Michael A.; Heinzel, Stephan; Granacher, Urs
2017-01-01
Age-related decline in executive functions and postural control due to degenerative processes in the central nervous system have been related to increased fall-risk in old age. Many studies have shown cognitive-postural dual-task interference in old adults, but research on the role of specific executive functions in this context has just begun. In this study, we addressed the question whether postural control is impaired depending on the coordination of concurrent response-selection processes related to the compatibility of input and output modality mappings as compared to impairments related to working-memory load in the comparison of cognitive dual and single tasks. Specifically, we measured total center of pressure (CoP) displacements in healthy female participants aged 19–30 and 66–84 years while they performed different versions of a spatial one-back working memory task during semi-tandem stance on an unstable surface (i.e., balance pad) while standing on a force plate. The specific working-memory tasks comprised: (i) modality compatible single tasks (i.e., visual-manual or auditory-vocal tasks), (ii) modality compatible dual tasks (i.e., visual-manual and auditory-vocal tasks), (iii) modality incompatible single tasks (i.e., visual-vocal or auditory-manual tasks), and (iv) modality incompatible dual tasks (i.e., visual-vocal and auditory-manual tasks). In addition, participants performed the same tasks while sitting. As expected from previous research, old adults showed generally impaired performance under high working-memory load (i.e., dual vs. single one-back task). In addition, modality compatibility affected one-back performance in dual-task but not in single-task conditions with strikingly pronounced impairments in old adults. Notably, the modality incompatible dual task also resulted in a selective increase in total CoP displacements compared to the modality compatible dual task in the old but not in the young participants. These results suggest that in addition to effects of working-memory load, processes related to simultaneously overcoming special linkages between input- and output modalities interfere with postural control in old but not in young female adults. Our preliminary data provide further evidence for the involvement of cognitive control processes in postural tasks. PMID:28484411
Auditory responses in the amygdala to social vocalizations
NASA Astrophysics Data System (ADS)
Gadziola, Marie A.
The underlying goal of this dissertation is to understand how the amygdala, a brain region involved in establishing the emotional significance of sensory input, contributes to the processing of complex sounds. The general hypothesis is that communication calls of big brown bats (Eptesicus fuscus) transmit relevant information about social context that is reflected in the activity of amygdalar neurons. The first specific aim analyzed social vocalizations emitted under a variety of behavioral contexts, and related vocalizations to an objective measure of internal physiological state by monitoring the heart rate of vocalizing bats. These experiments revealed a complex acoustic communication system among big brown bats in which acoustic cues and call structure signal the emotional state of a sender. The second specific aim characterized the responsiveness of single neurons in the basolateral amygdala to a range of social syllables. Neurons typically respond to the majority of tested syllables, but effectively discriminate among vocalizations by varying the response duration. This novel coding strategy underscores the importance of persistent firing in the general functioning of the amygdala. The third specific aim examined the influence of acoustic context by characterizing both the behavioral and neurophysiological responses to natural vocal sequences. Vocal sequences differentially modify the internal affective state of a listening bat, with lower aggression vocalizations evoking the greatest change in heart rate. Amygdalar neurons employ two different coding strategies: low background neurons respond selectively to very few stimuli, whereas high background neurons respond broadly to stimuli but demonstrate variation in response magnitude and timing. Neurons appear to discriminate the valence of stimuli, with aggression sequences evoking robust population-level responses across all sound levels. Further, vocal sequences show improved discrimination among stimuli compared to isolated syllables, and this improved discrimination is expressed in part by the timing of action potentials. Taken together, these data support the hypothesis that big brown bat social vocalizations transmit relevant information about the social context that is encoded within the discharge pattern of amygdalar neurons ultimately responsible for coordinating appropriate social behaviors. I further propose that vocalization-evoked amygdalar activity will have significant impact on subsequent sensory processing and plasticity.